Open Access Article
Yuhang Wang
,
Yong Wang and
Hao Li
*
Advanced Institute for Materials Research (WPI-AIMR), Tohoku University, Sendai, 980-8577, Japan. E-mail: li.hao.b8@tohoku.ac.jp
First published on 14th May 2026
Over decades of catalytic research, a vast amount of knowledge has been documented in the literature, yet its potential scientific value has not been fully explored. Catalytic performance is determined by multiple factors such as electronic structure and reaction conditions, exhibiting complex nonlinear structure–performance relationships. Simultaneously, differences in experimental conditions and inconsistent data dimensions among different studies make it difficult to directly use existing data for rule extraction and catalyst design. This perspective systematically summarizes three strategies for discovering new catalytic knowledge from literature data: first, leveraging “human intelligence” and statistical analysis to discover new catalytic knowledge through literature integration and mechanistic insights; second, constructing interpretable descriptors using symbolic regression and machine learning to achieve quantitative prediction of catalytic performance; and third, combining large language models and AI agents for multi-source data integration, knowledge extraction, and intelligent catalyst recommendation. Overall, these data-driven methods can transform scattered experience into computable design criteria, enabling a new paradigm for the rational design and efficient screening of catalytic materials, and accelerating the development of catalysis research towards a digital materials ecosystem and closed-loop research model that integrates AI, theoretical calculations, and autonomous experiments.
Broader contextCatalysis plays a central role in sustainable energy conversion, clean fuel production, carbon recycling, and environmental remediation. Over decades of research, enormous amounts of catalytic knowledge have been accumulated in the scientific literature, covering experimental performance, reaction conditions, material structures, and theoretical calculations. However, much of this knowledge remains scattered across individual studies and is difficult to reuse for rational catalyst design. This perspective discusses how hidden catalytic knowledge can be rediscovered from existing literature data through three complementary strategies: human-intelligence-guided data analysis, regression-model-based descriptor development, and AI-agent-assisted knowledge extraction. By summarizing representative examples across key electrocatalytic reactions, we show how historical data can be transformed into transferable design principles and computable structure–performance relationships. More broadly, this perspective highlights the importance of high-quality databases, expert validation, and AI–theory integration for building a digital materials ecosystem and accelerating the transition from empirical catalyst discovery to predictable, closed-loop catalyst design. |
Despite the rapid accumulation of catalytic data, uncovering their full potential remains particularly challenging.9 On the one hand, experimental conditions (electrolytes, pH, potential ranges, support environments, etc.) vary significantly across studies, limiting the comparability of data within small sample ranges, which makes it difficult to directly extract cross-system patterns. On the other hand, literature data often suffers from inconsistent dimensions, missing key parameters, and varying characterization standards, causing potential structure–performance correlations to be obscured by noise. Therefore, there is an urgent need to develop systematic methods to reorganize, reanalyze, and remodel historical data to extract physically meaningful catalytic patterns from complex and heterogeneous data, providing a reliable basis for predictable catalyst design.
In recent years, several key strategies have emerged for extracting new catalytic knowledge from existing data.10,11 First, through knowledge induction driven by “human intelligence”, combined with mechanistic understanding and cross-literature statistical analysis, potential commonalities and anomalous correlations can be identified, further proposing new physical insights or reaction mechanisms. Second, regression models and descriptor engineering methods based on large-scale data can construct physically meaningful structure–performance relationships from historical data, enabling quantitative prediction of catalytic performance and providing an interpretable theoretical basis for catalyst design. Furthermore, with the development of large language models (LLMs) and automated tools, AI agents are emerging as powerful tools for data mining. They can integrate literature, databases, and computational results to achieve high-throughput knowledge extraction and candidate material screening, thereby significantly improving the exploration efficiency of complex materials space.
Despite the broad applicability of data-driven approaches, such as their successful application in data-driven materials design,12 this perspective does not attempt to cover all aspects of the field. Instead, we focus on a key but underdiscussed question: what new catalytic insights can be gained by revisiting existing literature data? Based on the above background, this perspective systematically summarizes recent progress along three representative research paths: human intelligence-driven knowledge induction, regression model-based descriptor development, and AI agent-assisted knowledge discovery, illustrated through typical reactions such as carbon dioxide reduction reaction (CO2RR),13–18 oxygen reduction reaction (ORR),19–24 oxygen evolution reaction (OER),25–28 hydrogen evolution reaction (HER),29–31 electrocatalytic ammonia synthesis,32 and chlorine evolution reaction (CER).33 By summarizing the applicability and advantages of different strategies, this perspective aims to construct an overall framework for data-driven catalysis research, clarify the fundamental role of high-quality databases in the digital catalysis ecosystem, and highlight emerging opportunities brought about by the deep integration of AI and theoretical computation, thereby providing guidance for achieving efficient and predictable rational catalyst design.
Electrocatalytic CO2RR has attracted widespread attention due to its potential applications in carbon resource recycling and renewable energy conversion,15,34 resulting in a wealth of experimental and theoretical data. Formic acid (HCOOH) has become an important target product due to its high economic feasibility and ease of storage and transportation.14 However, it remains challenging to summarize universal structure–performance rules from single experiments or a small number of independent studies. Notably, when examining CO2RR-to-HCOOH catalysts from a higher-dimensional, systematic perspective, some potential commonalities gradually emerge. For example, Guo et al.,35 through systematic mining of Sn-based catalyst data, discovered that various materials, including Sn-based metals, oxides, sulfides, and alloys, exhibit high intrinsic HCOOH selectivity, revealing the crucial role of the Sn center in regulating the reaction pathway (Fig. 1a). This result indicates that systematic analysis of existing data reveals consistent HCOOH selectivity across Sn-based materials while capturing the general trend of reported literature, providing important guidance for the rational design of highly selective CO2RR catalysts.
![]() | ||
| Fig. 1 (a) Systematic statistical results of the selectivity of the main product in CO2RR by Sn-based catalysts. Reproduced with permission from ref. 35 copyright 2024, Wiley-VCH. (b) pH dependence of Faraday efficiency (FE) for Sn-based catalysts via large-scale data mining. Reproduced with permission from ref. 36 licensed under a Creative Commons License CC BY 4.0. Comparison of experimental Faraday efficiencies of Cu-based single-atom alloys (SAAs) in CO2RR for C2+ product formation (c) and HER (d), compiled from the DigCat database. Reproduced with permission from ref. 37 licensed under a Creative Commons License CC BY 4.0. (e) Comparison of CO faradaic efficiency for various DACs reported in the past three years. Reproduced with permission from ref. 38 copyright 2023, American Chemical Society. | ||
With the continuous development of the DigCat database, more and more scattered experimental data can be systematically integrated and quantitatively analyzed, thus providing new opportunities to reveal universal rules. Based on large-scale statistical data, Wang et al.36 constructed a relationship between pH and the FE of HCOOH, finding that Sn-based catalysts exhibited a good correlation in reported experimental data (Fig. 1b): as the electrolyte pH increased, the FE for CO2RR to HCOOH formation by Sn-based catalysts generally showed a significant upward trend. This finding further contributes to the establishment and development of theoretical understanding of pH-dependent volcano relations. Overall, this study demonstrates that systematic analysis based on experimental databases and the extraction of patterns using human intelligence can effectively promote the construction of new theoretical models, reflecting the important role of data-driven research in the development of catalytic mechanisms.
Although Cu-based single-atom alloys (SAAs) have been extensively studied in CO2RR, their overall performance patterns and further optimization directions still lack systematic understanding. To overcome this limitation, Wang et al.37 conducted a systematic statistical analysis of approximately 50 articles and over 80 data entries in the DigCat database, extracting potential common patterns from scattered historical data. As shown in Fig. 1c, doping Cu-based SAA with 29 different elements significantly improved C2+ product selectivity (average FE exceeding 50%). More importantly, despite significant differences in C2+ selectivity among different doping systems, the FE of the competitive HER remained relatively stable (Fig. 1d). This counterintuitive phenomenon indicates that the main role of single-atom doping is not to suppress HER, but rather to directionally promote C–C coupling kinetics, i.e., to improve C2+ selectivity by accelerating the target reaction pathway. Overall, this finding highlights that that human-driven re-mining of old data can extract catalytic laws across research scales, providing a more targeted theoretical basis for the rational design of Cu-based alloy catalysts.
To date, most dual-atom catalysts (DACs) have been employed in the electrocatalytic CO2RR with the initial hope of generating substantial multicarbon products (e.g., ethane and ethanol). Given that DACs offer two adjacent metal sites capable of enhancing surface *CO coverage, C–C coupling would appear feasible. By systematically reviewing the experimental CO2RR data from 11 typical M–N–C DACs and combining it with DFT calculations and surface Pourbaix analysis, Yang et al.38 found that although DACs possess adjacent bimetal sites, CO is the dominant product in nearly all experimental reports, and even Cu-containing DACs fail to effectively generate C–C coupling products (Fig. 1e). Theoretical simulations further reveal that under reaction conditions, the stable surface state of DACs is pre-covered by *CO at the metal–metal bridge sites, and the hydrogenation of *CO to *COH or *CHO is thermodynamically unfavorable, rendering C–C coupling a high-barrier process. In summary, both statistical analysis and computational results indicate that the strong *CO adsorption leading to site poisoning and the difficulty of *CO hydrogenation are the fundamental reasons why M–N–C DACs struggle to achieve C–C coupling. Overall, these CO2RR examples collectively show that human-guided reanalysis of literature data can reveal both general selectivity trends and counterintuitive deviations that are difficult to recognize from individual studies.
Electrochemical nitrate reduction reaction (NO3RR) offers a potential low-carbon ammonia synthesis route as an alternative to the traditional Haber–Bosch process, but the rational design of efficient catalysts still faces challenges such as unclear structure–performance relationships. As shown in Fig. 2a and b, Jiang et al.39 systematically analyzed experimental data from over 60 reported M–N–C catalysts, extracting two key patterns. First, most studies focus on alkaline and neutral conditions, while acidic systems remain relatively underexplored due to strong competition from the hydrogen evolution reaction. Second, although experimental characterization of coordination environments is limited, statistical results indicate that pyrrole-coordinated M–N–C catalysts generally exhibit high faradaic efficiency under both alkaline and neutral conditions. Overall, this study demonstrates that even with scattered and incomplete historical data, systematic reanalysis can still reveal catalytic patterns across different studies, providing a crucial data-driven basis for the rational design of NO3RR catalysts.
![]() | ||
| Fig. 2 Data mining analysis based on the experimental performance of 60 M–N–C catalysts in electrocatalytic NO3RR: (a) alkaline conditions; (b) neutral conditions. Reproduced with permission from ref. 39 licensed under a Creative Commons License CC BY 4.0. (c) statistical analysis of literature data on the enhanced electrocatalytic effect of magnetic field; (d) changes in the adsorption energy of pyrrole-type N-coordinated single atom Ru (illustrated structure) under different magnetic moments for *NH2. Reproduced with permission from ref. 40 licensed under a Creative Commons License CC BY 4.0. (e) Data-mined summary of previously reported experimental activities for ammonia synthesis. Reproduced with permission from ref. 41 licensed under a Creative Commons License CC BY 4.0. | ||
In the fields of electrocatalysis and thermocatalysis, the regulatory effects of external fields and unconventional promoters on reaction performance have attracted increasing attention.40 However, their underlying mechanisms are often scattered across different studies, lacking a unified understanding. Through systematic mining of existing literature data, researchers have begun to reveal potential commonalities from a higher perspective. As shown in Fig. 2c, You et al.,40 based on large-scale data statistics, found that research reports on the magnetic field (MF) enhancement effect in electrocatalysis show a continuous upward trend. Further DFT calculations show that in the constructed RuN4r model, the spin state of single-atom Ru significantly affects its adsorption capacity for *NH2 intermediates. Among them, the low-spin ground state of Ru (0.30 µB) exhibits the strongest adsorption, while the applied MF can induce higher spin states, thereby weakening the *NH2 adsorption strength (Fig. 2d). On the other hand, Cao et al.41 summarized experimental data on the promotion of thermocatalytic ammonia synthesis by various non-traditional promoters (e.g., Ba, Ca, Li, electride, amide, and rare earth nitrides), and found that these promoters can still significantly enhance activity on weakly N-adsorbed metals (e.g., Co and Ni) (Fig. 2e). Although individual studies usually explain this from the perspective of electronic or structural effects,42 the comprehensive analysis shows that various promoters can lower the N2 dissociation-related energy barrier, which is particularly evident in magnetic transition metal systems. Overall, these studies, through human-driven data reanalysis, propose a more unified physical picture: some promoters not only regulate electronic structure, but may also lower the spin-related energy barrier by adjusting the spin polarization of active sites, generating an additional spin promotion effect, providing a new theoretical perspective for overcoming traditional catalytic activity limitations.
The ORR is a crucial process in new energy devices such as hydrogen fuel cells and metal–air batteries.20,43–45 However, its slow kinetics have long relied on expensive Pt-group metal catalysts, severely restricting the large-scale application of clean energy technologies. Therefore, a data-driven approach to deeply understand the structure–performance relationship of non-Pt catalysts and identify their intrinsic activity limitations has become an important direction in current catalysis research. Through systematic data mining, Zhang et al.46 summarized experimental data from 1018 M–N–C catalysts. Although traditional volcano plots predict low activity for weakly adsorbed systems, weakly adsorbing catalysts such as Ni–N–C, Cu–N–C, and Zn–N–C still exhibit significant ORR activity and a clear pH dependence under alkaline conditions, revealing an anomalous correlation between adsorption strength and catalytic performance and highlighting limitations of conventional scaling relationships (Fig. 3a and b). Further theoretical studies show that there is a systematic shift in the scaling relationship between weakly adsorbed systems and moderately adsorbed systems, and their activity is significantly regulated by electric field response and solvation effects, thus exhibiting a reaction pathway different from the typical Fe/Co–N–C system. On the other hand, large-scale data analysis by Li et al.47 on transition metal oxides (TMOs), a potential non-noble metal system, shows that even after optimization, the intrinsic ORR activities of Mn, Fe, and Ni-based TMOs are generally lower than those of Pt, exhibiting a clear performance upper limit (Fig. 3c). Mechanistic analysis indicates that the weak adsorption of oxygen intermediates makes O–O bond breaking the rate-determining step, while the interfacial electric field effect further increases the energy barrier, explaining the activity bottleneck of the TMO system at the atomic scale.
![]() | ||
| Fig. 3 (a and b) Half-wave potentials of M–N–C catalysts under alkaline and acidic conditions, respectively. Reproduced with permission from ref. 46 licensed under a Creative Commons License CC BY 4.0. (c) Current densities of transition metal oxides (TMOs) at 0.6 V/RHE by anion classification (blue: geometric; purple: specific from high-throughput data). Reproduced with permission from ref. 47 copyright 2021, Springer Nature. (d) Comparison of experimental ORR performance between SACs and DACs. Reproduced with permission from ref. 48 licensed under a Creative Commons License CC BY 4.0. (e) Overpotential summary at 10 mA cm−2 for various catalysts in CER. Reproduced with permission from ref. 49 licensed under a Creative Commons License CC BY 4.0. | ||
In addition, Liu et al. performed large-scale statistical mining of ORR experimental data for SACs and DACs using the DigCat database and discovered an anomaly in which the activity volcano plot of DACs exhibits a bimodal distribution inconsistent with classical theory (Fig. 3d). By further combining theoretical calculations with interpretable machine learning, they revealed that this anomaly originates from a dissociative mechanism and a dynamic switch in the rate-determining step, thereby proposing a new dual-Sabatier optima principle that revises the conventional design paradigm for single-site catalysts. This work vividly demonstrates how large-scale data mining can uncover deviations between theory and experiment, and how human intelligence (mechanistic analysis and advanced modeling) can transform such deviations into new scientific discoveries, thus providing a general paradigm for the design of multi-atom catalysts in multi-step catalytic reactions.
CER is a key reaction in the chlor-alkali industry and electrochemical oxidation processes.33 However, the development of efficient catalysts still heavily relies on noble metal systems, limiting their cost advantage and potential for large-scale application. Yang et al.49 systematically analyzed nearly 15 years of CER reaction reports, as shown in Fig. 3e. Based on systematic data mining of CER catalyst literature since 2010 and combined with mechanistic analysis, it was found that current mainstream catalysts still focus on noble metal materials such as Ru and Ir. Most mixed metal oxide systems still require approximately 5–30% noble metal content to maintain high activity. In contrast, SACs developed in recent years can achieve lower overpotentials with extremely low noble metal loading, exhibiting higher atom utilization efficiency and catalytic potential. However, further summarizing existing data shows that non-noble metal single-atom catalysts are still almost non-existent in the CER field, especially carbon-supported non-noble metal systems, which lack systematic research. Overall, data mining reveals the continued dependence of CER catalysts on noble metals and potential research gaps, while human-powered intelligent analysis based on mechanistic understanding further points to feasible design directions for developing low-cost, high-efficiency CER catalysts.
In another fundamental electrocatalytic reaction (i.e., HER), GH* (adsorption free energy of H*) is usually used as an effective activity descriptor.50 However, Ye et al.51 directly discovered a significant anomaly through data mining (Fig. 4a): GH* alone is insufficient to describe HER activity. For example, Ni–N–C still requires a large overpotential at higher GH* (∼1.6 eV), while V–N–C exhibits poor activity even with a GH* close to 0 eV, demonstrating the limitations of weak adsorption descriptors in predicting actual catalytic performance. Fig. 4b further reveals that V single-atom catalysts tend to be poisoned by OH under acidic conditions with URHE = 0. Furthermore, a 2D volcano (Fig. 4c) as a function of GHO* (adsorption free energy of HO*) and GH* successfully corrects the shortcomings of GH* as the descriptor, compensating for the lack of catalytic knowledge discovery driven by data mining.
![]() | ||
| Fig. 4 (a) Comparison of HER activities of M–N–C SACs reported in the literature; (b) surface Pourbaix phase diagram of V-pyrrole-N4 sites; (c) two-dimensional HER microkinetic volcano plot constructed with GH* and GOH* as variables under −0.5 V/RHE (pH = 0). Reproduced with permission from ref. 51 copyright 2025, Wiley-VCH. Closed-loop design for discovering non-noble MO electrocatalysts for water splitting, including data mining (d), catalyst synthesis (e), and advanced characterizations (f). Reproduced with permission from ref. 52 licensed under a Creative Commons License CC BY 4.0. | ||
Achieving efficient water splitting relies heavily on discovering highly stable and active electrocatalysts capable of accelerating both OER and HER.53–56 Although certain non-noble stoichiometric metal oxides (MOs) have demonstrated OER stability under alkaline conditions, their behavior in acidic environments remains largely unexplored. To address this challenge, Jia et al.52 proposed a data-driven, human-assisted research strategy. The overall research workflow is illustrated in Fig. 4d–f, involving data mining from the reported literature, synthesis and electrochemical testing, and characterizations using advanced techniques. By leveraging large-scale data mining on the DigCat platform (which contains 413
189 experimental datasets and 429
473 computational data sets), combining data-driven methods such as surface state analysis and microkinetic modeling, and integrating human-intelligence-guided mechanistic understanding, the researchers successfully screened massive datasets and discovered for the first time a novel non-noble metal oxide bifunctional catalyst, RbSbWO6. The selected catalyst exhibits excellent activity and stability for both OER and HER under acidic water splitting conditions. Notably, OER and HER occur on different facets of the stable surface, highlighting its unique dual-terminal catalytic feature. This achievement effectively integrates data-driven mining with human-intelligence-guided theoretical modeling, thereby effectively narrowing the catalyst search space and compensating for the limitations of conventional bulk stability analysis. Moreover, the closed-loop strategy of “data mining → theoretical screening → experimental validation → data feedback” provides a generalizable paradigm for the efficient and rational discovery of novel electrocatalysts, accelerating the transformation of catalysis research toward digitalization and intelligentization.
Overall, the above examples indicate that human-guided statistical analysis can contribute to catalytic knowledge discovery in different ways. Specifically, Sn-based CO2RR catalysts and M–N–C catalysts for NO3RR exemplify common pattern extraction, revealing general trends from dispersed experimental data. Cu-based SAAs, ORR DACs, and HER descriptor studies represent anomaly-driven discovery, where unexpected deviations from established descriptors or mechanisms lead to new theoretical insights. In comparison, the CER and acidic water-splitting studies mainly demonstrate research gap identification and design-direction mining, helping to locate underexplored catalyst systems and propose rational optimization strategies. Therefore, this section is not a simple accumulation of literature examples, but rather shows how existing data, when reanalyzed with human intelligence, can generate different levels of catalytic knowledge.
![]() | ||
| Fig. 5 (a) Schematic diagram of symbolic regression based on genetic programming. Reproduced with permission from ref. 57 licensed under a Creative Commons License CC BY 4.0. (b) Explainable machine learning framework for the discovery of catalyst materials for solid oxide fuel cells. Reproduced with permission from ref. 58 copyright 2022, Springer Nature. (c) Overview of the overall catalyst design process constructed for electrochemical hydrogen peroxide synthesis via 2e− water oxidation (2e− WOR). Reproduced with permission from ref. 59 licensed under a Creative Commons License CC BY 4.0. | ||
Electrochemical synthesis of H2O2 via 2e− water oxidation (2e− WOR) is considered a low-cost, environmentally friendly, and sustainable pathway for converting electrical energy into chemical energy using water as the single source, showing significant application potential in renewable energy utilization and energy storage.60,61 Against this backdrop, Liu et al.59 systematically mined adsorption energy data from existing literature and databases, extracted the geometric and chemical characteristics of atomic centers, developed a weighted atom-centered symmetry function (wACSF) descriptor (Fig. 5c), and further established a high-precision XGBoost regression model, achieving rapid prediction of GHO* and GO*. Notably, this wACSF method can, for the first time, incorporate different types of catalysts (e.g., metal alloys, metal oxides, SACs, and perovskites) into the same training set, which helps address the transferability challenge in AI for catalyst design. Based on this, combined with a microkinetic volcano model, they were able to efficiently screen catalysts for the 2e− WOR, exhibiting both high activity and selectivity, thus achieving rapid transformation from old data to new material design. Overall, experimental verification using LiScO2 as an example demonstrated ∼90% H2O2 Faradaic efficiency and excellent stability under mildly alkaline conditions, highlighting the reliability and practical value of regression models derived from literature data for efficient catalyst design.
In summary, regression models based on big data can extract key features from historical data and construct interpretable descriptors, establishing reliable structure–performance relationships. This strategy significantly improves catalyst screening efficiency, enabling rapid transformation from old data to new material design, and provides an important method for data-driven rational catalyst development. However, the reliability of such descriptors still strongly depends on data quality and feature selection, which may limit their transferability across different catalytic systems.
![]() | ||
| Fig. 6 A smart design process for Cu-based SAAs CO2RR electrocatalysts for multi-carbon product generation: (a) experimental data-driven theoretical simulation; (b) descriptor construction assisted by Catalysis AI Agent; (c) application and verification of general design principles. Reproduced with permission from ref. 37 licensed under a Creative Commons License CC BY 4.0. | ||
MOs have long been considered important electrocatalytic materials owing to their abundant resources, diverse structures, and tunable chemical properties. However, traditional experimental screening is inherently inefficient and struggles to support the rapid exploration of complex materials spaces. Databases such as the Materials Project and DigCat have accumulated extensive data on material structures, electrochemical stability, and experimental performance metrics. However, accessing and analyzing these datasets typically require expertise in programming and computational methods, which limits their accessibility to the broader research community. In recent years, AI agents combining LLMs and automated workflows have offered a new solution to this problem. For example, as shown in Fig. 7a, the StableOx-Cat AI agent automates the material screening process through natural language interaction, integrating functions such as thermodynamic and electrochemical stability analysis, elemental filtering, and custom evaluation, transforming complex calculation processes into an intuitive and easy-to-use research interface. Overall, the AI agent significantly lowers the barrier to data utilization, improves the efficiency of MOs stability assessment, and provides a reproducible, systematic data-driven workflow for mining potential novel catalysts based on existing data.
![]() | ||
| Fig. 7 (a) Schematic diagram of the overall architecture of the StableOx-Cat AI agent, where MOs represents metal oxides and MCP represents the Model Context Protocol. Reproduced with permission from ref. 64 licensed under a Creative Commons License CC BY 4.0. (b) Schematic diagram of the CRESt system, which is based on a large visual language model (LVLM)-driven agent framework. Reproduced with permission from ref. 65 licensed under a Creative Commons License CC BY 4.0. (c) Schematic diagram of the process of predicting and analyzing the structure–performance relationship of electrochemical nitrogen reduction reaction (eNRR) literature based on LLMs enhanced by ChemPrompt. Reproduced with permission from ref. 66 licensed under a Creative Commons License CC BY 4.0. | ||
The design of multimetallic catalysts faces an explosive increase in the combinations of composition, structure, and process parameters, making traditional Bayesian optimization strategies, which rely on single data streams, ineffective in handling complex experimental spaces. The Copilot for Real-world Experimental Scientists (CRESt) system (Fig. 7b), developed by Zhang et al.,67 integrates a multimodal large visual language model (LVLM), knowledge-assisted Bayesian optimization, and a robotic automated experimental platform to enable closed-loop high-throughput exploration from natural language interaction to catalyst synthesis, characterization, and electrochemical testing. In a direct formate fuel cell system, CRESt screened over 900 formulations and completed approximately 3500 tests within several months, successfully discovering an octagonal multimetallic catalyst with excellent performance and significantly reduced noble metal loading. This system achieves a dynamic balance between exploration and utilization by fusing literature, images, and experimental data embedding, thereby significantly improving material discovery efficiency and result reproducibility. Overall, multimodal and agent-driven self-powered laboratories provide an efficient and scalable technical path for mining new catalytic knowledge from existing data, offering a new research paradigm for the accelerated design of complex catalytic systems.
In catalysis research, the large volume and structural complexity of literature data make it difficult for traditional trial-and-error approaches and rule-based text mining methods to effectively uncover the intrinsic relationships among catalyst composition, structure, reaction conditions, and performance. With the development of LLMs and AI agents, researchers can now achieve high-throughput data parsing and automated extraction of catalytic knowledge. For example, as shown in Fig. 7c, the eNRRCrew framework,66 by integrating ChemPrompt-enhanced LLMs, knowledge graphs, and machine learning algorithms, extracts catalyst characteristics and performance information from 2321 literature abstracts and utilizes an agent to conduct structured analysis and performance prediction. Furthermore, such agents can not only parse existing data but also proactively recommend potentially efficient catalyst combinations based on learned structure–performance relationships, thereby improving catalyst screening efficiency. Overall, AI-driven data mining and agent analysis provide an efficient path for transforming scattered historical knowledge into actionable design rules, significantly accelerating the discovery and optimization of novel electrocatalytic materials.
In summary, AI agents can integrate literature, databases, and computational data to achieve automated catalytic-knowledge extraction and design optimization, significantly improving the efficiency of exploring complex materials spaces. This strategy promotes the transformation of catalysis research from an experience-driven to data-driven and self-driven modes, accelerating the discovery and design of novel electrocatalytic materials. Despite these advantages, the reliability of AI-agent-based discovery remains dependent on data quality and may be affected by model hallucination and bias.
However, the increasing use of literature-derived data also requires a more critical perspective. First, published catalytic data are not equivalent to the complete materials space because they are inevitably affected by publication bias. Catalysts with attractive activity, selectivity, or stability are more likely to be reported, whereas failed experiments, poorly performing materials, and negative results are rarely included in the literature. Therefore, statistical trends extracted from published studies should be interpreted as patterns within the reported knowledge space rather than universal laws of the entire chemical space. Second, catalytic performances collected from different publications are often measured under non-uniform conditions, including differences in electrolyte composition, pH, reactant concentration, catalyst loading, electrode configuration, potential calibration, product quantification, and stability testing protocols. Such heterogeneity may introduce hidden variables into data mining and machine learning models, making it necessary to combine statistical correlations with mechanistic analysis, uncertainty evaluation, and experimental validation.
Looking ahead, the deep integration of AI and theoretical computation will be an important direction for catalysis research. Several specific future directions deserve attention. First, universal machine learning interatomic potentials (MLIPs) should be systematically developed and benchmarked across realistic catalytic environments, such as solid–liquid interfaces, dynamic adsorbate coverages, surface defects, and catalyst reconstruction under operating conditions. Their combination with multi-scale simulations can help connect atomic-level mechanisms with macroscopic catalytic performance. Second, multimodal LLMs73,74 may integrate heterogeneous catalytic information, including literature, spectra, microscopy images, computational data, and experimental protocols, thereby assisting automated data extraction, mechanism hypothesis generation, and cross-modal consistency checking. Third, closed-loop platforms combining digital databases, autonomous synthesis, high-throughput characterization, and catalytic testing should be established to realize a self-iterative “prediction–verification–feedback” workflow. Active learning can further guide experiment selection and continuously refine predictive models. Fourth, a cross-scale and cross-system digital materials ecosystem70,75,76 should emphasize standardized metadata, shared benchmark datasets, and unified evaluation protocols, enabling knowledge transfer across different reactions and material families. Overall, these efforts will promote catalysis research toward a more reliable, interpretable, and closed-loop paradigm for efficient and predictable catalyst design.
| This journal is © The Royal Society of Chemistry 2026 |