Data-driven and interpretable machine learning for performance-determining interactions governing C–C coupling and C2+ selectivity in Cu-catalyzed CO2 electroreduction
Abstract
Efficient electrochemical conversion of CO2 into multi-carbon C2+ products remains limited by the complex interplay between catalyst morphology, electrochemical environment, and reaction conditions. Here, we present an interpretable machine-learning framework to analyze C2+ selectivity trends on Cu-based electrocatalysts using a literature-curated dataset of 380 experimental entries covering applied potential, morphology, particle size, support, electrolyte composition, and membrane type. Multiple regression models were benchmarked, with LightGBM giving the best performance for the C2 : Product target, reaching R2 ≈ 0.78. SHAP-based feature attribution and interaction analysis identified applied potential, particle size, and morphology as the dominant descriptors associated with C2+ selectivity, while electrolyte and membrane-related variables provided secondary but non-negligible contributions. ANN-based response-surface projections and Pareto analysis further suggested a model-inferred C2+-selective region near moderately negative potentials and sub-100 nm catalyst dimensions. Because the dataset is heterogeneous and literature-derived, these regions should be interpreted as data-supported trends rather than experimentally validated universal optima. Overall, this work converts fragmented CO2RR literature data into an interpretable trend-analysis framework, providing hypothesis-guiding descriptor relationships for future catalyst and operating-condition design.

Please wait while we load your content...