Open Access Article
Janghoon
Ock
ab,
Radheesh Sharma
Meda
c,
Tirtha
Vinchurkar
a,
Yayati
Jadhav
c and
Amir
Barati Farimani
*c
aDepartment of Chemical Engineering, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA. E-mail: barati@cmu.edu
bDepartment of Chemical and Biomolecular Engineering, University of Nebraska–Lincoln, Lincoln, NE 68588, USA
cDepartment of Mechanical Engineering, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA
First published on 19th December 2025
Adsorption energy is a key reactivity descriptor in catalysis. Determining adsorption energy requires evaluating numerous adsorbate–catalyst configurations, making it computationally intensive. Current methods rely on exhaustive sampling, which must navigate a large search space without guaranteeing the identification of the global minimum energy. To address this, we introduce Adsorb-Agent, a Large Language Model (LLM) agent designed to efficiently identify stable adsorption configurations corresponding to the global minimum energy. Adsorb-Agent leverages its built-in knowledge and reasoning to strategically explore configurations, significantly reducing the number of initial configurations required while improving the energy prediction accuracy. In this study, we also evaluated the performance of different LLMs—GPT-4o, GPT-4o-mini, Claude-3.7-Sonnet, and DeepSeek-Chat—as the reasoning engine for Adsorb-Agent, with GPT-4o showing the strongest overall performance. Tested on twenty diverse systems, Adsorb-Agent identifies comparable adsorption energies for 84% of cases and achieves lower energies for 35%, particularly excelling in complex systems. It identifies lower energies in 47% of intermetallic systems and 67% of systems with large adsorbates. These findings demonstrate Adsorb-Agent's potential to accelerate catalyst discovery by reducing computational costs and enhancing prediction reliability compared to exhaustive search methods.
Adsorption energy, defined as the change in energy upon the binding of a molecule to a catalytic surface, is one of the most widely used descriptors in computational catalysis due to its direct correlation with catalytic reactivity.4,5 The adsorption energy, corresponding to the most stable adsorption configuration, serves as a key descriptor of catalyst performance and plays a crucial role in estimating the reactivity of various catalysts.4,6–8 Furthermore, adsorption energy is a foundational parameter in constructing free energy diagrams, which are used to identify the energetically preferred reaction pathways on catalyst surfaces.
The adsorption energy, ΔEads, is mathematically defined as the global minimum energy among all possible adsorption configurations.9,10 It is calculated as the difference between the total energy of the adsorbate–catalyst system (Esys,i), the energy of the clean surface (slab) (Eslab), and the energy of the gas-phase adsorbate or reference species (Egas):
| ΔEi = Esys,i − Eslab − Egas | (1a) |
![]() | (1b) |
Accurately determining adsorption energy presents significant challenges. The complex electron-level interactions that govern chemical bonding make it impractical to predict the most stable configuration based solely on atomic-level information. As a result, determining the global minimum adsorption energy typically requires enumerating and evaluating a vast number of possible configurations.10–12 This process becomes computationally prohibitive when using quantum chemistry methods such as density functional theory (DFT).10,13 The difficulty is further compounded by the combinatorial explosion of potential binding sites, variations in surface geometries, and diverse orientations the adsorbate can adopt. Despite exhaustive configuration searches, there is no guarantee of reliably identifying the true global minimum energy configuration. These challenges underscore the need for more efficient and accurate approaches to streamline adsorption energy determination, paving the way for faster and more reliable catalyst design.
Recent advances in machine learning (ML) have introduced promising alternatives to conventional quantum chemistry methods, significantly improving the efficiency of adsorption energy prediction tasks. In particular, Graph Neural Networks (GNNs) have demonstrated exceptional performance in predicting energy and forces for atomic systems. For adsorbate–catalyst systems,14 GNNs achieve a high level of precision, predicting adsorption energies with an error of approximately 0.2 eV and forces with an error of 0.013 eV Å−1.15,16 These capabilities make GNNs effective surrogates for DFT calculations in tasks such as geometry optimization and energy prediction. Building on this foundation, Lan et al. introduced the AdsorbML method, which achieved a 2000× speedup in adsorption energy determination while retaining 87.36% of the accuracy of full DFT calculations by integrating GNNs with DFT.10 In their approach, GNNs are used to relax structures from initial adsorption configurations, after which DFT is employed for further relaxation or single-point energy calculations to obtain DFT-validated adsorption energies. Despite this progress, the placement of adsorbates on the surface and the sampling of adsorption sites remain reliant on exhaustive enumeration, posing a significant challenge in reducing the initial search space and improving the computational efficiency.
Moving forward, large language models (LLMs) are emerging as transformative tools in scientific problem-solving by leveraging their pre-trained knowledge and planning capabilities, as well as their seamless human–machine interaction.17 LLM agents, powered by LLMs, have demonstrated remarkable potential in reshaping scientific workflows. For example, Boiko et al. proposed Coscientist, an LLM agent that automates experimental design and execution, significantly enhancing productivity while reducing manual effort.18 Similarly, Szymanski et al. demonstrated the application of LLMs in an autonomous laboratory framework for proposing synthesis recipes.19 In the field of additive manufacturing, Jadhav et al. introduced the LLM-3D Print framework, which streamlines the design-to-manufacturing process by autonomously generating, validating, and optimizing 3D printing instructions with an LLM agent at its core.20 These breakthroughs demonstrate how LLMs are reshaping the landscape of scientific discovery and innovation.
In this study, we introduce Adsorb-Agent, an LLM-based agent designed to determine adsorption energy efficiently. Adsorb-Agent predicts initial adsorption configurations that are likely to be close to the most stable configuration and relaxes them to identify the minimum energy state. While human researchers can propose plausible stable adsorption configurations for specific systems based on domain knowledge—such as chemical bonding and surface science—it remains exceedingly difficult to derive a universal theorem for predicting the most stable configuration across diverse adsorbate–catalyst systems. Moreover, in high-throughput screening, where millions of candidate systems must be evaluated, it is infeasible to manually propose stable configurations for each individual system.21 Adsorb-Agent addresses these challenges by autonomously deriving stable configurations, relying solely on the LLM's built-in knowledge and emergent reasoning capabilities. Because it operates purely through inference from pre-trained models, it is readily applicable to large-scale tasks.
This study has two primary objectives: first, to reduce the computational cost of adsorption energy identification by minimizing the number of initial configurations required; second, to enhance the accuracy of adsorption energy predictions by generating refined initial configurations that are closer to the global minimum while maintaining human interpretability. By bridging state-of-the-art LLM capabilities with catalytic configuration challenges, Adsorb-Agent represents a significant step toward broader adoption of AI-driven methods in materials science and catalysis, accelerating the discovery and design of optimal catalysts. Furthermore, when integrated with other LLM models and tools for catalyst design,13,22,23 Adsorb-Agent can be extended to a wider range of applications in optimal catalyst development.
To reduce hallucination and improve reliability, we adopt two safeguards. First, instead of using the full free-form LLM output, we extract only four essential descriptors, such as site type, surface binding atoms, adsorbate orientation, and adsorbate binding atoms. They are the minimal elements needed to define a configuration. This limits sensitivity to wording or irrelevant reasoning. Second, a Critic module checks the internal consistency of the proposed configuration and filters out contradictory outputs. These measures ensure that only coherent and well-defined configurations proceed to the relaxation stage.
To ensure the logical coherence of the solution output, the Critic module evaluates the initial solution generated by the Solution Planner. It takes the solution as input and uses a knowledge prompt that clarifies the terms within the solution to guide its review (see Fig. 1b). Details of the knowledge prompt are provided in SI Fig. S1. The module focuses on two aspects: (1) the coherence between the adsorption site type and the binding atoms on the surface and (2) the alignment between the adsorbate's binding atoms and its orientation. For instance, if the solution specifies a bridge site, it must involve two binding surface atoms. Similarly, for adsorbate orientation, if the adsorbate is described as end-on, it should have only one binding adsorbate atom. If any incoherence is identified, the Critic rebuts the solution, and the Solution Planner is re-initialized to produce a revised solution. This iterative interaction ensures the final adsorption configuration is logically coherent and self-consistent.
Once a coherent solution is generated, the Binding Indexer module assigns indices to the adsorbate's binding atoms based on the solution. This step translates the human-readable configuration into a numerical format suitable for computational processing. As illustrated in Fig. 1d, the Binding Indexer takes the identified binding atoms and orientation from the solution, along with the adsorbate's atomic number array, to generate a binding index array. This array specifies which atoms in the adsorbate are involved in binding to the surface. Using this array, the adsorbate can be positioned on the catalytic surface, reflecting both the surface orientation and binding atom information. This automation removes the need for manually pre-defining binding atoms—commonly required in datasets like the Open Catalyst Project, which explicitly mark binding atoms with asterisks (e.g., NNH). In addition, we introduce a new placement strategy capable of handling side-on adsorbates (see the Methods section).
These initial structures are subsequently relaxed to determine the minimum energy configuration, as relaxed energies are necessary for meaningful comparison. In this study, we employ a GNN model, specifically EquiformerV2 trained on the Open Catalyst 2020 dataset,14,16 although other machine learning models or quantum chemistry simulations could also be used. Details of the GNN-based relaxation process are provided in the Methods section. During relaxation, even initially similar configurations can evolve into distinct final structures with different energies. Structures exhibiting anomalies such as extensive surface reconstruction, adsorbate dissociation, or desorption may occur, and any structures exhibiting these anomalies are filtered out.10,24,25 Among the remaining valid configurations, the one with the lowest energy is identified as the most stable configuration. This energy is recognized as the adsorption energy, which serves as a critical reactivity descriptor for the adsorbate–catalyst combination.
and
algorithms, which sample numerous configurations to approximate the global minimum adsorption energy. Details of these enumeration procedures are provided in the Methods section.
Adsorb-Agent's ability to identify the most stable configuration is evaluated against exhaustive enumeration algorithms. Performance is assessed based on three key criteria: (i) efficiency in reducing the configuration search space, (ii) accuracy in identifying adsorption configurations with energies comparable to those found by enumeration algorithms, and (iii) consistency of results across independent trials.
The evaluation is conducted on 20 adsorbate–catalyst systems selected for their practical importance, particularly in nitrogen production and fuel cell applications.26–28 For example, the electrocatalytic nitrogen reduction reaction (NRR) offers a potential route for sustainable nitrogen fixation. However, its performance is often limited by the high activation energy required to cleave the inert N
N bond and by strong competition from the hydrogen evolution reaction (HER).29 Likewise, the oxygen reduction reaction (ORR) is a vital reaction process in fuel cell operation and holds a central position in the broader field of electrocatalysis.28
Eight of the selected systems are associated with the NRR and HER. These include four catalysts proposed by Zhou et al.,29 each interacting with two key adsorbates: NNH for the NRR and H for the HER, resulting in a total of eight systems. An additional six systems are associated with the ORR, with OH as the key adsorbate, selected from the experimentally verified sets reported by Kulkarni et al.28,30,31 To broaden the evaluation beyond simple adsorbates, six more systems featuring larger molecules—such as CH2CH2OH, OCHCH3, and ONN(CH3)2—and intermetallic catalysts were included. These complex systems were randomly selected from the Open Catalyst 2020-Dense (OC20-Dense) dataset.10 A complete list of adsorbate–catalyst systems is provided in Table 1, with detailed slab information available in SI Table S1.
| No. | Adsorbate | Catalyst | Adsorption energy [eV] | Number of initial sets | ||
|---|---|---|---|---|---|---|
| Adsorb-Agent (↓) | Algorithm | Adsorb-Agent (↓) | Algorithm | |||
| 1 | H | Mo3Pd (111) | −0.764 ± 0.113 | −0.941 ± 0.002 | 6.7 ± 2.1 | 59 |
| 2 | NNH | Mo3Pd (111) | −1.265 ± 0.158 | −0.903 ± 0.117 | 9.3 ± 3.7 | 51 |
| 3 | H | CuPd3 (111) | −0.380 ± 0.003 | −0.398 ± 0.017 | 16.7 ± 1.2 | 98 |
| 4 | NNH | CuPd3 (111) | 0.745 ± 0.006 | 0.867 ± 0.072 | 17.3 ± 3.1 | 78 |
| 5 | H | Cu3Ag (111) | −0.019 ± 0.041 | −0.072 ± 0.002 | 21.3 ± 4.1 | 98 |
| 6 | NNH | Cu3Ag (111) | 1.504 ± 0.057 | 1.500 ± 0.002 | 16.7 ± 2.6 | 56 |
| 7 | H | Ru3Mo (111) | −0.587 ± 0.002 | −0.586 ± 0.050 | 17.0 ± 2.2 | 94 |
| 8 | NNH | Ru3Mo (111) | −0.498 ± 0.013 | −0.276 ± 0.003 | 18.7 ± 0.5 | 81 |
| 9 | OH | Pt (111) | 0.990 ± 0.001 | 0.990 ± 0.071 | 7.0 ± 1.6 | 54 |
| 10 | OH | Pt (100) | 0.991 ± 0.001 | 0.991 ± 0.001 | 10.3 ± 4.2 | 54 |
| 11 | OH | Pd (111) | 0.814 ± 0.000 | 0.814 ± 0.001 | 20.0 ± 5.7 | 54 |
| 12 | OH | Au (111) | 1.408 ± 0.002 | 1.409 ± 0.002 | 23.3 ± 5.2 | 54 |
| 13 | OH | Ag (100) | 0.440 ± 0.001 | 0.463 ± 0.009 | 23.7 ± 4.5 | 53 |
| 14 | OH | CoPt (111) | −0.208 ± 0.015 | −0.166 ± 0.046 | 41.3 ± 1.2 | 120 |
| 15 | CH2CH2OH | Cu6Ga2 (100) | −2.338 ± 0.833 | −3.077 ± 0.062 | 28.7 ± 15.5 | 66 |
| 16 | CH2CH2OH | Au2Hf (102) | −2.761 ± 0.592 | −3.761 ± 0.129 | 28.0 ± 4.5 | 78 |
| 17 | OCHCH3 | Rh2Ti2 (111) | −4.561 ± 0.007 | −4.275 ± 0.086 | 29.0 ± 4.3 | 62 |
| 18 | OCHCH3 | Al3Zr (101) | −4.616 ± 0.014 | −4.325 ± 0.052 | 22.0 ± 2.4 | 68 |
| 19 | OCHCH3 | Hf2Zn6 (110) | −5.922 ± 0.209 | −5.443 ± 0.037 | 18.0 ± 2.2 | 67 |
| 20 | ONN(CH3)2 | Bi2Ti6 (211) | −3.454 ± 0.337 | −2.441 ± 0.103 | 33.0 ± 3.6 | 139 |
Adsorb-Agent significantly reduces the number of initial configurations by focusing on likely adsorption sites, compared to the exhaustive enumeration algorithmic approaches. For example, Adsorb-Agent predicts the bridge site at the intersection of Cu and Pd atoms as the primary adsorption site for NNH–CuPd3 (111). Similarly, it identifies the hollow site of the Pt (111) surface as the optimal adsorption site for OH and the bridge site between Zn and Hf atoms for the OCHCH3–HfZn3 (110) system. Occasional deviations from the predicted solutions arise due to the distance margin used to define targeted adsorption sites. By focusing on specific sites, Adsorb-Agent effectively reduces the search space.
For the NNH–CuPd3 (111) and OCHCH3–HfZn3 (110) systems, the relaxed adsorption sites of the most stable configurations precisely match the solutions proposed by Adsorb-Agent. In both cases, Adsorb-Agent achieves adsorption energies lower than those obtained via algorithmic approaches. However, for the OH–Pt (111) system, the adsorption site of the most stable relaxed configuration identified by both the algorithmic methods and Adsorb-Agent is an ontop site, differing from the initial Adsorb-Agent prediction. Despite this discrepancy, the initial site suggested by Adsorb-Agent successfully guided the system to the most stable configuration during the relaxation process.
Additionally, the solution generated by Solution Planner includes reasoning prompts beyond the four essential pieces of information required for determining the initial configuration (see Fig. 1c). Although the adsorption site and surface-binding details suggested are not explicitly reported in prior literature, the general concept of side-on orientation preference is reasonable. The idea that a “side-on orientation allows both nitrogen atoms to interact effectively with the surface, providing optimal orbital overlap and electronic interactions” is plausible in general surface science contexts but lacks direct support in CuPd nitrate reduction. Orientation effects (tilt/side-on vs. end-on) have been studied for NO and NHx species on metal (111) surfaces, where orbital overlap and back-donation govern adsorption.33 The proposal that the mixed-metal Cu–Pd environment “offers balanced interactions, optimizing charge transfer and adsorption energy”, is supported by multiple studies of Cu–Pd alloys and intermetallic catalysts. For example, ordered B2 CuPd nanocubes can break classical adsorption-energy scaling by stabilizing early oxy-nitrogen intermediates (e.g., *NO3) while weakening late nitrogen-containing fragments (e.g., *N) via subsurface Pd electronic effects.34,35 Similarly, Cu–M alloys (M = Pd, Zn, etc.) have been reported to tune intermediate binding energies and charge transfer in nitrate reduction reactions.36
and
), as summarized in Table 1. To quantify the effectiveness of Adsorb-Agent in identifying adsorption energies, three key metrics are defined, and their mathematical formulations are provided in the Methods section. Success Ratio (SR) assesses the ability of Adsorb-Agent to identify adsorption energies comparable to those found by the algorithmic approaches. Lower Energy Discovery Ratio (LEDR) measures the capability of Adsorb-Agent to discover adsorption energies lower than those identified by the algorithmic approaches. Reduced Search Space Ratio (RSR) quantifies the reduction in the number of initial configurations required by Adsorb-Agent compared to the algorithmic approaches.
These metrics provide a comprehensive framework for evaluating Adsorb-Agent's effectiveness in identifying the most stable configurations relative to conventional methods. Specifically, the RSR reflects the efficiency of the search process, while the SR and LEDR reflect the accuracy of energy identification. An increase in the number of initial configurations, as reflected by a higher RSR, often corresponds to improvements in SR and LEDR values. Therefore, all three metrics should be considered collectively to thoroughly assess Adsorb-Agent's performance in comparison to algorithmic approaches.
As shown in Fig. 3a, Adsorb-Agent successfully identifies adsorption energies comparable to those found by the algorithmic approach in 83.7% of cases and discovers lower energies in 35.0% of cases. Remarkably, it achieved these results while using only 6.8–63.6% of the initial configurations required by the algorithmic methods (see Fig. 3b). As discussed earlier, increasing the number of initial configurations is likely to improve both the SR and LEDR. To ensure a fair comparison in this study, the number of initial configurations used by Adsorb-Agent is scaled relative to the algorithmic approach, resulting in a reduction of 6.8–63.6% of the original, with an average of 26.9% across three independent runs. For practical applications, the number of initial configurations can be further increased, particularly for systems with lower RSRs, to improve the performance of adsorption energy determination.
An analysis of specific system categories reveals that the results vary with system complexity, as shown in Fig. 3a. The systems are categorized based on the composition of the catalytic surface (monometallic and intermetallic) and the size of the adsorbate molecule (small and large molecules). Large adsorbates are defined as those with more than three atoms, such as CH2CH2OH, OCHCH3, and ONN(CH3)2.
For systems with monometallic catalysts, Adsorb-Agent consistently identifies adsorption energies comparable to those found by the algorithmic approach across all three trials. However, it does not achieve lower adsorption energies, indicating that the algorithmic approach successfully identifies the adsorption configuration with the global minimum energy. This outcome is likely due to the relatively homogeneous nature of monometallic surfaces, which consist of a single atom type (see Fig. 2b). These findings suggest that extensive site enumeration is unnecessary for monotonous surfaces.
In contrast, for systems with intermetallic catalysts, Adsorb-Agent demonstrates a distinct advantage. While the SR slightly decreases to 82.2% compared to monometallic systems, the LEDR significantly improves to 46.7%. This highlights the ability of Adsorb-Agent to uncover new global minima through targeted searches, which the algorithmic approach cannot achieve through simple enumeration because of the increased complexity and heterogeneity of intermetallic surfaces.
A similar trend is observed when analyzing systems based on adsorbate complexity. Systems with large adsorbates exhibit a lower SR but a significantly higher LEDR compared to those with small adsorbates. Notably, systems with large adsorbates achieve the highest LEDR of 66.7%, underscoring the effectiveness of the targeted search approach employed by Adsorb-Agent. These results suggest that simple enumeration is relatively less effective at identifying the global minimum in systems containing complex adsorbates. Furthermore, this finding reinforces the effectiveness of Adsorb-Agent in addressing these challenges.
Furthermore, as shown in Table 1, the standard deviations of adsorption energies across multiple implementations of Adsorb-Agent remain within an acceptable range for most systems. The only exceptions are two systems involving CH2CH2OH, where higher deviations are observed. This variability highlights the need for further refinement in handling complex adsorbates while affirming the overall robustness of Adsorb-Agent.
To quantitatively evaluate consistency across independent trials, we introduce the consistency ratio, which measures the proportion of systems that yield consistent solutions across three independent trials. The consistency ratio is defined as:
![]() | (2) |
Consistency is evaluated separately for surface-related and adsorbate-related information. A solution is deemed consistent if the binding atom arrays across the three trials either match exactly or differ by no more than one atom, with the shorter array being a subset of the longer array. This criterion is applied to both surface binding atoms and adsorbate binding atoms using the algorithm detailed in Algorithm 1. A solution is considered fully consistent only if both surface-related and adsorbate-related information meet these criteria.
Adsorb-Agent demonstrates reasonable reliability, producing consistent solutions for 17 out of 20 systems when the Critic module is applied. Specifically, only one system fails to achieve consistency in adsorbate-related solutions, while two systems fail in surface-related solutions. This highlights Adsorb-Agent's strong performance in generating reliable adsorbate-related solutions. Without the Critic module, one additional system fails to achieve consistency in surface-related solutions, indicating the Critic module's potential role in enhancing solution reliability. Although the limited size of the test set makes it challenging to generalize the effectiveness of the Critic module, these results suggest that the Critic module may help improve consistency by filtering out incoherent solutions within the tested systems. Further discussion of the Critic module performance is provided in SI Section S7.
A key aspect of our approach is that the agent is built on a standard LLM trained on general language corpora, rather than domain-specific data. By leveraging its built-in understanding of chemical bonding, surface chemistry, and emergent reasoning capabilities, Adsorb-Agent autonomously proposes plausible adsorption configurations tailored to specific systems. Adsorb-Agent demonstrates a strong ability to identify configurations with energies closer to the global minimum, particularly in complex systems such as intermetallic surfaces and large adsorbate molecules. This capability highlights a critical advantage of our approach in addressing computationally intensive and chemically complex systems.
site enumeration algorithm, which leverages surface symmetry.11,12 Using Pymatgen's
, this algorithm identifies the most energetically favorable sites, such as ontop, bridge, or hollow sites. The adsorbate is then placed at the selected sites with a random rotation about the z-axis and minor adjustments along the x- and y-axes, ensuring the binding atom is positioned at the site.14
To expand the configuration space beyond the
algorithm, Lan et al. proposed a
algorithm.10 This method uniformly samples surface sites at random. After performing a Delaunay triangulation of the surface atoms, random sites are selected within each triangle. At each randomly selected site, the adsorbate is placed with random rotations about the x-, y-, and z-axes, ensuring alignment of the center of mass with the target site.14
In this paper, we introduce a new placement strategy specifically designed to accommodate side-on oriented adsorbates, offering a key differentiation from the above methods. Our approach determines the placement center as the weighted center of the binding atoms and orients the adsorbate to maximize the exposure of its binding atoms to the surface. This method differs from the above placement strategies, which position a single binding atom at the site and apply stochastic rotations. Although our strategy operates similarly to existing methods for adsorbates with a single binding atom, it specifically enables side-on placement for bidentate adsorbates.
In GNN-based relaxation, the atomic system is represented as a graph, where atoms are treated as nodes and interatomic bonds or interactions as edges. The GNN iteratively performs message passing, where neighboring atoms exchange information, allowing the network to capture local chemical environments. Through these learned representations, the GNN predicts both the system's energy and the atomic forces. The atomic positions are then updated using optimization algorithms based on the predicted forces, and this process is repeated until the system converges to a stable, low-energy configuration.
In this study, we use EquiformerV2 as the core GNN model due to its demonstrated high accuracy (∼0.2 eV MAE). We specifically use the checkpoint “EquiformerV2-31M-S2EF-OC20-All+MD”, available from the Equiformer repository.16 This checkpoint corresponds to a model trained on the OC20 S2EF (Structure-to-Energy-and-Forces) dataset, including both the full OC20 training set and additional MD trajectories.
After the GNN relaxation, the energy of the adsorbate–catalyst system can be predicted. Since a single adsorbate–surface system may have multiple possible adsorption configurations, it is necessary to evaluate the energies of all candidate configurations to identify the most stable one. The adsorption energy is determined by selecting the configuration with the lowest energy, offering insights into the energetically favorable configuration. The calculation is defined as eqn (1).
427 catalyst materials and 86 adsorbates, with the catalyst crystal structures sourced from the Materials Project44 and the Open Quantum Materials Database.45 The OC-demo helps identify the adsorption energy of the selected adsorbate and catalytic surface by generating multiple initial configurations and evaluating them using state-of-the-art graph neural networks, like GemNet-OC and EquiformerV2. In our workflow, we use the OC-demo to place adsorbates on catalyst surfaces using enumeration algorithms, perform structure relaxation with EquiformerV2, and obtain the corresponding adsorption energies. These results serve as the baseline for the conventional algorithmic approach, against which we compare our LLM-agent-based method.
![]() | (3) |
is defined to be 1 if the condition inside is satisfied and 0 otherwise.
The Lower Energy Discovery Ratio (LEDR) assesses Adsorb-Agent's ability to identify adsorption energies that are lower than those found by the algorithmic approaches. It is defined as:
![]() | (4) |
The Reduced Search Space Ratio (RSR) quantifies the reduction in the number of initial configurations required by Adsorb-Agent (Ninit,agent) compared to the algorithmic approaches (Ninit,algorithm). A lower RSR indicates a greater reduction in the search space. It is defined as:
![]() | (5) |
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d5dd00298b.
| This journal is © The Royal Society of Chemistry 2026 |