Open Access Article
Hongyu
Li
a,
Jia
Zhang
*b,
Zihao
Yao
c and
P.
Hu
*ade
aCentre for Computational Chemistry and Research Institute of Industrial Catalysis, East China University of Science and Technology, Shanghai 200237, China
bInstitute of High Performance Computing (IHPC), Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, #16–16 Connexis, Singapore 138632, Republic of Singapore. E-mail: zhangj@ihpc.a-star.edu.sg
cInstitute of Industrial Catalysis, College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310032, China
dSchool of Physical Science and Technology, ShanghaiTech University, Shanghai, 201210, China. E-mail: hupj@shanghaitech.edu.cn
eSchool of Chemistry and Chemical Engineering, Queen's University Belfast, Belfast, BT9 5AZ, UK
First published on 17th March 2025
Identifying the rate-controlling steps in an unknown reaction network can be time-consuming due to its inherent complexity. Here we present a strategy to simplify this process by focusing expensive barrier calculations on significant elementary steps. The strategy is constructed by determining significant elementary steps using the degree of rate control data, which is derived from microkinetic modeling calculations performed on surrogate networks, which are a series of networks generated by assigning fictitious values to unknown barriers while all the reaction energies are computed using density functional theory. The barriers for significant elementary steps are then calculated iteratively to refine the network. We demonstrate this strategy for the reaction of Fischer–Tropsch synthesis, which has already been extensively studied in our previous work. Applying the strategy, we identified the most rate-controlling step, achieving a 77% reduction in the number of transition state calculations compared to traditional methods. Additionally, a detailed analysis of the strategy reveals the correlation between the parameters in the strategy and its performance. We validate the practicability of the strategy by applying it onto testing networks and the potential limitations of the strategy are also discussed.
This process can be significantly simplified by recognizing that only a small portion of reactions controls most of the properties of the entire network. The majority of the network properties, except for the kinetic parameters of the rate-controlling steps, have small impacts on the overall reaction network. The rate-controlling step can be simply interpreted as the slowest step of the favored pathway and it can strongly influence the overall reaction rate of the network. In order to identify rate-controlling steps, one popular method is the degree of rate control (DRC). This approach for analyzing multistep reaction mechanisms was introduced by Campbell,1–4 in which the “degree of rate control for elementary step i”, DRCi, is defined as
![]() | (1) |
To address this, researchers have proposed various solutions,5–14 primarily relying on approximations like the BEP relation15–20 or machine learning models to minimize computational resources and time. As a representative example, by using the prediction of machine-learning and approximations, such as scaling relations, Ulissi et al. developed an approach to determine the significant reactions on the favored pathway: by focusing only on these reactions with full density functional theory (DFT) calculations at each iteration, they achieved remarkable reduction of time and resource cost on studying the network of syngas.9 Despite all the progress in this field, two main improvements could still be made. One is to achieve a more optimal trade-off between accuracy and efficiency. We need to determine when to use an approximate method that saves time but is less accurate, and when to choose a high-fidelity DFT calculation, which is more precise but also more time-consuming. The second improvement is to develop a better strategy to identify significant elementary steps.
In this work, we aim to address these issues. We developed a workflow to identify the rate-controlling steps of a reaction network, significantly reducing computational effort by performing a limited number of TS calculations. The strategy used DFT calculations to obtain reaction energies, and the MKM was conducted using CATKINAS, a software developed by our group and widely used.21–30 We saved time and computational resources by reducing barrier calculations, as shown in Fig. 1. Our workflow uses DRC data as an indicator to find significant elementary steps and calculate the corresponding TS energy at each iteration. The characteristics of the reaction network are refined at each iteration, and the rate-controlling steps of the network are determined at the end of the refinement. We demonstrated this workflow for the Fischer–Tropsch (FT) reaction,31–46 a process that utilizes synthesis gas to convert into hydrocarbons and oxygenated hydrocarbons. Using the strategy, we identified the most rate-controlling step with reducing the number of TS calculations by 77%.
In constructing the surrogate networks, the first step is to define all possible chemical species and elementary steps to be considered in the network. Then the adsorption energies of all intermediates are calculated to obtain the reaction energy for each elementary step. In this way, a raw reaction network with only the thermodynamic information is built. Next, a surrogate network is generated by assigning fictitious values to energy barriers of all the elementary steps in the network. To construct a series of surrogate networks, as shown in Fig. 2b, we began with assigning values to Xmax (the biggest fictitious barrier value), Xmin (the smallest fictitious barrier value) and A (the interval of fictitious barrier between surrogate networks) in our strategy. As the fictitious barrier x changes from Xmax to Xmin, a series of surrogate networks were constructed. For clarity, when we built the surrogate networks, the fictitious energy barrier x is only added to the exothermic forward/backward reactions in each step. For example, assuming the energies of the initial state and the final state for a specific elementary step are E1 and E2, respectively, the energy of TS will be E1 + x when E1 < E2; otherwise, it will be E2 + x if E2 < E1.
In the MKM phase of surrogate networks, we conducted MKM on all the surrogate networks to obtain their kinetic information, which contains DRC data. Since our goal in this work was to identify the key elementary steps within a highly complex reaction network, coverage dependence was not explicitly considered to streamline the calculation process. In the evaluation phase, we introduced an indicator, DRC(sum), to assess the significance of each elementary step. For a specific elementary step i, DRCi(sum) is defined as the sum of the absolute value of the corresponding DRC in each surrogate network as follows:
![]() | (2) |
In the refinement phase of surrogate networks, we used DFT calculations to obtain the barrier of the chosen elementary step and updated all the surrogate networks by replacing the fictitious barrier of the selected step with its actual barrier. This process is repeated until the convergence criterion is met.
The refinement procedure is illustrated in Fig. 3a, which shows the entire reaction network with nodes as reaction species and lines as elementary steps. For visual simplicity, hydrogen is omitted. Refinements were performed under various settings of parameter N. As N increased, a greater number of TSs were required to be calculated for the iteration to terminate. The additional elementary steps that needed to be calculated by DFT, as N was increased to N + 1, are indicated by colored lines in Fig. 3a. Since parameter N defines the convergence criterion in the evaluation phase, increasing N requires more rate-controlling steps to converge before the iteration stops. This would result in more TS calculations and potentially more accurate outcomes. The accuracy of the final result and the number of iterations before convergence are both plotted against the value of N in Fig. 3b. In this work, ‘accuracy’ is defined as the ratio of the sum of DRC values of the calculated M elementary steps to the total DRC sum across all elementary steps, with all DRC values derived from a reference set based on the full reaction network from prior work,47 which conducted calculations of all the TS energies through DFT calculations. This metric is designed specifically for internal assessment of the strategy's predictive accuracy within this study and is not intended for direct application in real-world determinations of rate-controlling steps.
![]() | (3) |
![]() | ||
| Fig. 3 Rate-controlling step determination under different convergence criteria. (a) The FT reaction network on Co(0001) calculated in our previous work.47 In this work, the reaction network with the barriers we previously calculated is used as the reference set. The nodes represent reaction intermediates and each line represents an elementary step between the two nodes. (b) Illustration of accuracy of rate-controlling step determination (solid line) and iterations before convergence (dashed line) as functions of the criterion of convergency N. As N increases, the accuracy of the final result and number of iterations increase. There are three significant increases on the accuracy curve when N is set to 1, 2 and 5. (c) The top 3 most rate-controlling steps and the corresponding pathways determined by traditional calculations and our strategy. To identify the favored pathways using our strategy, the surrogate network in which a specific rate-controlling step reaches its highest DRC was determined firstly, and then the favored pathway for the corresponding rate-controlling step was identified based on MKM data of this surrogate network. | ||
The first accuracy significant increase occurs when N was set to 1; with 11 iterations in total, the calculation converged. The most rate-controlling step was determined as CH3O
CH3 + O, which is the third most rate-controlling step according to the reference set. The second significant accuracy increase occurs when N was set to 2, with the convergence achieved after 16 iterations. In this case, the elementary step of CHO
CH + O was identified as the most rate-controlling step, which aligns with the result from the reference set. This means that using the method developed in this work, we only conducted 15 TS calculations to correctly predict the most rate-controlling step for the FT reaction network on Co(0001), reducing 77% of TS calculations. The third accuracy significant increase occurs when N was set to 5, with the convergence achieved after 29 iterations. This time we correctly identified the top 3 most rate-controlling steps and reached a value of accuracy of 0.97.
To investigate how the choice of surrogate networks influences the efficiency and accuracy of our strategy, we applied our workflow on the FT reaction network on Co(0001) with different range of fictitious barriers. We constructed 5 calculation groups, which sets Xmax to 0.0 eV, 0.5 eV, 1.0 eV, 1.5 eV and 2.0 eV, respectively, while Xmin, A and N were set to 0.0 eV, 0.1 eV and 5 uniformly. As shown in Fig. 2b, as Xmax increases, the range of fictitious barriers become wider and more surrogate networks with higher fictitious barriers will be generated for the corresponding group, leading to different results of refining processes. The accuracy of the final result and the number of iterations needed before convergence for each calculation are shown in Fig. 4b. It can be seen that as Xmax increases, the number of iterations increases. The calculation group with Xmax set to 1.5 eV reached the highest accuracy. The reason for this is that most of the barriers in the reaction network we studied lies in the range of 0.0–1.5 eV. Hence, the calculation group with Xmin and Xmax set to 0.0 eV and 1.5 eV is suitable for probing the rate-controlling steps of the network because its range of fictitious barriers matches the range of real barriers, ensuring that there are enough surrogate networks to investigate potentially significant elementary steps. In contrast, the groups with the range of fictitious barriers too narrow will not have enough surrogate networks to detect some rate-controlling steps and might miss some significant elementary steps. In addition, for the groups where the range of fictitious barriers is too wide, such as the calculation group with Xmax set to 2.0 eV, the surrogate networks with unusually high fictitious barriers will disproportionately influence the identification of the rate-controlling steps, thereby reducing the accuracy of the results.
Four testing networks were generated in total as the testing reference sets, and our strategy successfully predicted the most rate-controlling step for 3 of them as shown in Table 1. After analyzing the failed case, we find that there are mainly two types of scenarios where our strategy may encounter difficulties on the task of identifying the rate-controlling steps. As shown in Fig. 5b, the first scenario is that the rate-controlling step exists on a thermodynamically unfavorable but kinetic favorable pathway, which means that the energies of intermediates are relatively high but energies of TSs are relatively low on this pathway. There is a possibility that our strategy would identify this pathway as an unfavored pathway and start searching for a possible rate-controlling step elsewhere. In the second scenario, if there is a pathway that is thermodynamically favorable but kinetically unfavorable and does not include the rate-controlling step, our strategy might mistakenly identify this pathway as favored and converge on an incorrect rate-controlling step. Thus, our strategy may encounter difficulties when certain types of elementary steps have unexpected high or low barriers.
| Testing networks | Amount of iterations | Top 3 reactions with biggest DRC(sum) | |
|---|---|---|---|
| Our strategy (reaction #) | Reference set (reaction #) | ||
| T1 | 21 | 59, 3, 58 | 59, 3, 58 |
| T2 | 15 | 43, 4, 6 | 3, 25, 58 |
| T3 | 20 | 59, 58, 6 | 59, 6, 30 |
| T4 | 20 | 3, 58, 59 | 3, 58, 59 |
In the failed case of testing network T2, when the iteration stopped, our strategy identified the most rate-controlling step as C + C
CC as shown in Fig. 5c. However, the actual most rate-controlling step, according to the testing reference set, is C + O
CO. After thorough inspection, we found that the reason network T2 converged to an incorrect elementary step is that the actual favored pathway, according to the reference set, is a thermodynamically unfavorable pathway. This means that the energies of the species involved in this pathway are relatively high. However, the barrier of an elementary step (CH2CH3
CH2 + CH3) on this pathway, highlighted in blue in Fig. 5c, is unrealistically low (0.037 eV), which unexpectedly makes this pathway favorable. To identify the most rate-controlling step for the testing network T2, we increased the value of N to calculate more potential significant steps. After we increased N to 22, our strategy converged at the correct rate-controlling steps after 47 iterations.
For the reaction network tested in this work, which includes 26 species and 67 reactions, the good parameters Xmax, Xmin, A, and N were found to be 1.5 eV, 0.0 eV, 0.1 eV, and 5, respectively. This set choice has accurately predicted the top 3 rate-controlling steps while reducing the number of TS calculations by 55%, as shown in Fig. 4b. Compared to traditional calculations, our strategy focuses on identifying the rate-controlling steps as quickly as possible and performing barrier calculations only on those steps identified as significant. This approach saves substantial time and computational resources. Additionally, unlike methods that rely on machine learning or other approximations, the strategy developed in this work has greater generality. However, it is important to note that our strategy does not guarantee 100% accuracy in predicting rate-controlling steps if the parameters are not properly selected.
The advancements of our method compared to previous approaches are summarized as follows:
1. Efficient rate-control predictions: by conducting MKM calculations on surrogate networks, our method enables the prediction of rate-controlling steps using DRC without the extensive barrier calculations.
2. Balanced accuracy and efficiency: by reducing the number of TS calculations while using high accuracy methods to conduct the calculations of reaction energies and MKM, our strategy reached excellent balance between accuracy and efficiency.
3. General applicability: unlike methods that rely on approximations such as BEP relations, our strategy does not depend on specific approximations, making it applicable to the study of any reaction network.
Finally, our method could be further advanced by integrating experimental data or machine learning models to predict transition state energies or adsorption energies. Combining these approaches has the potential to reduce computational costs and improve accuracy when exploring new reaction networks. Future work may focus on refining strategies to efficiently identify critical elementary steps and incorporating additional catalytic effects, such as coverage effects, to enhance the practicality and effectiveness of the current method.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cy01336k |
| This journal is © The Royal Society of Chemistry 2025 |