Open Access Article
Veronique Van Speybroeck
*,
Massimo Bocus
,
Wei Chen
and
Pieter Cnudde
Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052 Zwijnaarde, Belgium. E-mail: veronique.vanspeybroeck@ugent.be
First published on 17th March 2026
This paper demonstrates the need for a consistent dynamics view of all processes from the nano- to mesoscale with quantum accuracy to design industrial catalysts. This stems from the observation that when a feed of molecules passes through a crystalline nanoporous framework, the following events occur: molecules enter the crystal, diffuse to active sites where they adsorb, desorb, or react, and finally products exit. These events span vastly different time and length scales—from picoseconds to hours and nanometers to micrometers. Length and time phenomena are entangled—the dynamics is altered by modifications at the length scale—leading to spatiotemporal phenomena. Current modeling practices treat reactions and diffusion inconsistently, often combining quantum methods for reactions with classical force fields for diffusion, and frequently neglect true dynamics under operating conditions. Industrial catalysts are highly sensitive to process conditions; some active sites form only at elevated temperatures, and reactive intermediates can change nature under these conditions. First-principles molecular dynamics captures these effects but becomes prohibitively expensive when relying on quantum methods like Density Functional Theory for force evaluations. In this paper, we demonstrate—through the study of key intermediates in CO2-to-hydrocarbon conversion within small-pore zeolites—that diffusion must also be treated at the quantum level, as molecules can interact with active sites and even react while diffusing. Diffusion may become strongly hindered and compete with reactive events. In later case studies, we present proof-of-concept results for deriving diffusion coefficients in small-pore zeolites using machine learning potentials (MLPs) trained on quantum data. For benzene methylation in large-pore zeolites, we develop reactive MLPs to construct multidimensional free-energy surfaces, revealing competing pathways. Overall, the various cases underscore the need for a consistent dynamic description with quantum accuracy across scales. MLPs offer a promising route to obtain kinetic and transport properties from the nano- to mesoscale at reduced cost, however their integration in overall multiscale models and reaction–diffusion frameworks will be necessary to reach industrially relevant scales.
To illustrate the complexity and the events taking place during a catalytic process, consider the various steps taking place when a feed of molecules is sent over a crystalline catalytic particle. In an experiment, a feed of molecules flows over a nanoporous material, initiating a cascade of steps as schematically shown in Fig. 1a.9 The molecules first enter the crystal, sometimes after interacting with external active sites,10 and then diffuse through the lattice in search of active sites where adsorption and reaction can occur. Intracrystalline diffusion may be hindered by structural constraints such as narrow windows or by the presence of bulky co-adsorbed species.11 Once a reaction takes place, the products desorb from the active site and migrate toward the exit before leaving the material. These events span an enormous range of time scales—from picoseconds for molecular vibrations to seconds or even hours for activated processes. In principle every event is also characterized by its own characteristic time scales, indicated with τ in the figure.
![]() | ||
| Fig. 1 (a) Schematic representation of various steps taking place during a typical catalytic trajectory, where molecules are sent over a heterogeneous catalyst. Molecules enter the crystal and diffuse through the pores to arrive at the active site to adsorb/react. Diffusion in/out the crystal can be hindered due to external active sites, other molecules or narrow windows in the crystal. Every event is also characterized by its own characteristic time scales, indicated with τ. (b) Brief illustration of all individual events and the various kinetic and transport properties that may be derived from first principles simulations. (a) is adapted from ref. 9, copyright 2026, the Annual Review of Physical Chemistry. | ||
In today’s modelling practice, each of the events is often treated in an isolated way and with tailored methodologies, i.e. reactions obviously need to be treated with quantum mechanical (QM) methods, however this can be done using static or dynamic methodologies.1,12 Diffusion is most commonly treated with classical force fields as these processes take place on longer length and time scales.13 There is a growing evidence that reaction, diffusion, and adsorption are dynamically entangled under realistic, time-dependent operating conditions.1,2 Modelling the full catalytic progression via first-principles QM based approaches that fully account for the dynamics is to date not feasible. Also from the experimental point of view, it is still not possible to make a “molecular movie” with the most advanced spectroscopic or imaging techniques and to follow on the fly the breaking and formation of chemical bonds with atomic scale resolution.14–18 To evolve towards predictive and rational design, it is essential to devise methods that capture the dynamics of catalysts in action across length and timescales. New methods are needed that enable the connection between nanoscale chemical rearrangements dictated by electronic restructuring of bonds to mesoscale transport phenomena. With the rise of machine learning methods,19–22 new avenues are opened to integrate effects from the nano- to the mesoscale. In this Faraday Discussions contribution, we argue the need for obtaining a consistent dynamics view from the nano- to the mesoscale with quantum accuracy to design industrial catalysts.23 Machine learning methods are central to this vision but are on their own not sufficient to reach this ambition. More in particular, one can now derive Machine Learning Potentials (MLPs) to obtain the energies and forces with so-called quantum accuracy, i.e. the same accuracy as the underlying QM data on which they are trained, however at a much lower computational cost compared to the brute force QM calculations.23,24 This allows simulation of timescales and length scales of at least three orders of magnitude larger than when pristine density functional theory methods are used. Yet today, MLPs are not fully part of a current catalysis workflow with a high-dimensional free energy surface where many competitive reactions taking place simultaneously. Major methodological developments are necessary to couple MLPs with kinetic models to describe complex dynamical phenomena in the broad range of length and time scales relevant for heterogeneous catalysts.
Within this Faraday Discussions article, we describe first results on how MLPs can be instrumental to obtain kinetics for all events from the local to the crystal particle level with quantum accuracy. Fig. 1b shows the individual events taking place during a successful catalytic trajectory and the kinetic and transport properties needed to set up integrated reaction diffusion models based on underlying kinetics and characteristic time scales obtained with QM accuracy. Within this figure one observes reactions, which are local events taking place on the (sub)nanometer level and are typically described by QM methods. For reactions one needs to be able to calculate the reaction rates. Still today, reaction kinetics of activated processes are most often obtained using a static approach, based on a few points on the Potential Energy Surface (PES) and application of Transition State Theory (TST).25,26 Such an approach accounting only for local information around the (meta)stable states gives a too approximate view of complex transformations at operating conditions and does not capture broad transition state regions, competitive pathways, and configurational mobility at higher temperatures.27–31 Enhanced sampling MD (ESMD) simulations are now extensively used to account for operating conditions, however in combination with QM evaluation of energies these are very expensive.1 Additionally they rely mostly on collective variables (CVs) able to distinguish between various minima and their definition is far from trivial.32–36 Most importantly, current ESMD simulations determine the kinetics primarily within the limits of TST neglecting barrier recrossings.1,25,37 With the rise of MLP based approaches, it is expected that dynamically correct reaction rates will be possible to be determined in a more routine way, also for complex catalytic transformations. Within this Faraday Discussions article we show first proof of concepts results where we derived a MLP for zeolite catalyzed reactions, which enables sampling of competitive pathways in higher dimensionalities at a feasible computational cost.
Intracrystalline diffusion is an essential element of the catalytic trajectory but takes place on longer length scales, which is one of the reasons why it is usually described by classical force fields. One major drawback of such description is that specific host–guest interactions are not accounted for and also that potential reactive events taking place during a transport process are not captured.38–40 Diffusion within nanoporous catalysts can be hindered or non-hindered, i.e. diffusion itself can become an activated event and becomes competitive with reactive events. To characterize diffusion one typically needs to calculate diffusion coefficients, either collective or self-diffusion coefficients, that quantify how fast molecules spread out in a medium due to random thermal motion or a concentration gradient.41 Diffusion coefficients can be calculated by various approaches. Self-diffusion coefficients are typically calculated from the mean square displacement (MSD) of adsorbate molecules from very long MD runs.13,42 This approach is feasible when the molecules are not severely hindered during transport. For hindered diffusion, the diffusion itself becomes activated and it is necessary to rely on methods for activated processes.38,39 The interested reader is referred to dedicated reviews for more information on the topic.1,13,41 Comparison between experimental and theoretical diffusion coefficients needs to be done with extreme care as was pointed out nicely by Kärger et al.43,44 Apart from direct comparison of diffusion coefficients with experiment, one can also obtain information on residence times of probe molecules in zeolite catalysts using pulse-response temporal analysis of products (TAP) methodology.45,46 However various effects contribute to the measured TAP results, such as surface barriers, intracrystalline diffusion, adsorption at the active site and finally kinetics.47–49 From an experimental point of view, it is very difficult to disentangle all these contributions.38,50 With the development of consistent dynamical models for all events from the nano- to the mesoscale – as discussed in this paper – it might become possible in the future to build bottom-up models starting from the atomic scale and determine the various components contributing to the TAP data. Herein, we show first proof of concept results on how MLPs can be instrumental to simulate on longer length and time scales diffusion and how self-diffusion coefficients can be estimated from long molecular dynamics approaches with QM accuracy.
Fig. 1b also highlights events related to adsorption and kinetics at the external surface of zeolite catalysts. In principle it is necessary to also obtain kinetics of surface permeation at QM level of theory on realistic external surfaces. Experimental studies clearly indicate the presence of surface barriers, often attributed to pore blocking or amorphous outer layers, yet their precise molecular origin remains unknown.51 Obtaining spatially resolved information on the structure and nature of the external surface under operating conditions is currently impossible, leaving the location of active sites and their interaction with key intermediates unresolved.51 Also within this area machine learning potentials could be instrumental to simulate surface barrier phenomena, however further progress will also depend on construction of realistic surface models, however this discussion is beyond the scope of this paper.
The three case studies presented in this paper, give evidence for the claim that we need a consistent dynamics view from the nano- to the mesoscale with quantum accuracy to design industrial catalysts and show how MLPs can open new possibilities to model complex catalytic events with QM accuracy.
The first case study examines the diffusion of key intermediates—CO, formaldehyde, and ketene—through the small-pore zeolite H-SSZ-13, which is a crucial step in the CO2-to-hydrocarbons process. In this material, diffusion is strongly hindered, and in some cases, species like ketene do not simply diffuse but can also react while passing through the eight-ring windows.40 This demonstrates that diffusion must be described using quantum mechanical methods capable of capturing explicit interactions with active sites and possible transformations. However, such simulations require long timescales and extended length scales, making standard QM approaches impractical for complex catalytic systems.
The second case study provides a proof of concept for overcoming these limitations by training a machine learning potential (MLP) to model the diffusion of C2 olefins and paraffins in H-SSZ-13. When properly trained, MLPs enable the calculation of converged free-energy profiles between cages and allow the derivation of diffusion coefficients.
The third case study demonstrates the potential of machine learning potentials (MLPs) for exploring rare events in zeolite catalysis under realistic operating conditions. We focus on the methanol-to-olefins process and revisit the methylation of benzene by methanol in a large-pore H-SSZ-24 zeolite.30,31 This reaction can proceed via two competing mechanisms—a direct, concerted methyl transfer and a stepwise route through a surface-bound methoxide—both of which may coexist depending on operating conditions.52 In earlier work based on first-principles molecular dynamics, we showed that these pathways are accessible and that their relative preference is strongly influenced by the local environment, including methanol clustering and framework flexibility, resulting in a complex free-energy landscape.31 This seminal study highlighted the importance of entropic contributions and the complexity of the free-energy surface at elevated temperatures, revealing mechanistic competition that static models cannot capture. While these insights were groundbreaking, the simulations were computationally demanding and not broadly applicable to zeolite catalysis. MLPs hold the promise to make these insights much more accessible, provided the MLPs are trained in a correct way to capture the essential chemistry. Within this paper, we revisit the benzene methylation in H-SSZ-24 by deriving an MLP using active learning. We show how it becomes possible to simulate extensively longer within reasonable computational resources and can explore the free energy surface in a higher dimensional space.
These case studies illustrate how machine learning potentials (MLPs) enable dynamic information with quantum-level accuracy for individual steps of a catalytic trajectory. However, achieving a fully consistent dynamic view of an entire catalytic process—including all relevant time scales—requires more than MLPs alone. With MLP based methods, it is not possible to reach the micrometer length scales or the seconds-long time scales needed to mimic real catalytic experiments. To bridge this gap, consistent kinetic and transport data must be integrated into reaction–diffusion or overall kinetic models at the level of the entire catalyst particle. In the final conclusion and perspective section, we reflect on strategies to connect detailed insights from individual events within a catalytic cycle to a holistic description of the catalyst particle and its behavior under operating conditions. Ultimately, the long-term vision is to model the time evolution of real catalytic systems with complex feedstocks under realistic conditions, and we outline future avenues to move towards this goal.
![]() | (1) |
![]() | ||
| Fig. 2 Collective variables of three typical unsaturated oxygenates’ diffusion and reaction in SSZ-13. (a) CV1 (ξ) to describe the ketene diffusion, (b) CV2 to describe the protonation of CO with BAS, (c) CV3 to describe the protonation of formaldehyde with BAS, (d) CV4 to describe the protonation of ketene with BAS. (a) and (d) are reproduced under the terms of the CC BY 4.0 license from ref. 40. Copyright 2026 Elsevier. | ||
In the presence of BAS, CO, ketene, and formaldehyde may spontaneously react to new species along the diffusion pathway, i.e., by protonation into either a cationic intermediate or surface-bound species (Fig. 2b–d). Clearly, this interconversion between adsorbed intermediates presents an orthogonal degree of freedom and cannot be described solely by the diffusion coordinate (CV1). To accurately distinguish between neutral, cationic intermediates, and surface-bound species along the diffusion coordinate, a second collective variable is required to track these species. By considering both a diffusion coordinate as the first CV and a reaction coordinate as the second CV, the entangled effects of diffusion and reaction can be clearly revealed. Notably, enhanced sampling was not performed along the second CV, and the resultant 2D free energy surface can simply be regarded as a two-dimensional extension of the classical 1D free energy profile, unfolded along a second CV. Therefore, the obtained 1D free energy profile along the diffusion coordinate (CV1) is de-projected into a 2D free energy surface along a second coordinate describing the reaction using the deprojection module implemented in the ThermoLIB software.59
In detail, CO may be protonated to a formal cation (HCO+) and further form the surface formate (Zeo–HCO) described by CV2 in Fig. 2b; HCHO is able to be protonated to a hydroxymethyl cation (CH2OH+) and further form a surface hydroxymethyl (Zeo–CH2OH)60 described by CV3 in Fig. 2c; ketene can be easily protonated to an acylium ion and further form surface acetate described by CV4 in Fig. 2d. To ensure each of these states are properly visited, thorough sampling is required, resulting in the addition of the following umbrella windows: (i) 25 windows (between −6 Å ≤ ξ ≤ 6 Å) to sample the diffusion of neutral CO, formaldehyde, and ketene; (ii) 13 windows in proximity to the acid site (between −3 Å ≤ ξ ≤ 3 Å) in which the cationic forms are visited, and (iii) 7 windows in close proximity to the acid site (between −1.5 Å ≤ ξ ≤ 1.5 Å) in which surface-bound species are sampled.
(1) A concerted methylation, in which a methyl group is transferred from one of the methanol molecules directly to benzene (I → III in Fig. 3a).
(2) A step-wise methylation, where a methyl group of one of the methanol molecules is first transferred to the framework (I → II in Fig. 3a), forming a surface-methoxide-species (SMS). The SMS acts subsequently as a methylating agent for benzene to form a toluenium ion (II → III in Fig. 3a).
(3) The two methanol molecules can condense forming dimethyl ether (I or II to IV in Fig. 3a). The latter can then act as a methylating agent either directly with benzene or with the SMS.
To be able to describe (at least qualitatively) all these processes, we selected as collective variables (CVs) 2 linear combinations of 3 coordination numbers (CNs). The general formula for the CNs is given below:
| State | max[CN(Cme,i, Car)] | min[CN(Cme,i, Ome,i)] | max[CN(Cme,i, Ozeo)] | |
|---|---|---|---|---|
| I | Reactant: 2 methanol molecules + benzene | 0 | 1 | 0 |
| II | Surface methoxide species, water, methanol, benzene | 0 | 0 | 1 |
| III | Toluenium ion, methanol | 1 | 0 | 0 |
| IV | Benzene, dimethyl ether, water | 0 | 0 | 0 |
With this definition of collective variables, the three intermediates I, II, and III can be seen as three vertices of an equilateral triangle in the space defined by these three CNs, located at 1 on the x, y, or z axis, as schematically shown in Fig. 3b. Intermediate IV, involving dimethyl ether, will be at the origin as all CNs will be about 0. Performing enhanced sampling simulations in three dimensions is normally associated with major convergence issues and, therefore, we opted in the first instance to reduce the dimensionality of the problem by defining two new collective variables CV1 and CV2 defined by the equations shown in Fig. 3b. The choice can be understood as a projection of the 3-dimensional space on the plane passing through (1, 0, 0), (0, 1, 0), and (0, 0, 1), in which the origin is projected on the origin of the new system (blue axes in Fig. 3b). The 2-dimensional CV space defined in this way was used both for the active learning procedure and for subsequent production runs. However once we derived a converged MLP, we also performed enhanced simulations in terms of the three collective variables, to obtain a full picture of all possible intermediates and transition state regions. We also note that a set of half-quadratic walls was placed starting from 1.15 Å (with a force constant of 1000 kJ mol−1 Å−2) on all the C–H bonds of benzene and the CH3 groups of methanol, to prevent unwanted proton transfer reactions.
We initially attempted to perform well-tempered metadynamics based on the MACE-mp0 foundational model to explore the whole reaction phase space.68 However, we observed that proton-transfer reactions were excessively favored (e.g., leading to methane formation), resulting in chemically irrelevant pathways. For instance, methyl cations extracted a proton from two other methanol molecules to form methane and formaldehyde, or from benzene, producing compounds that are not chemically feasible. Therefore, we decided to train a MACE model from scratch via active learning within Psiflow. The generation of new structures required four separated loops (steps 1–4 in Fig. 4) to achieve a satisfactory phase space coverage.
First, we began by performing 25 ps of well-tempered metadynamics with MACE-mp0 (ref. 68) at 800 K with 240 independent walkers (hills height: 5 kJ mol−1; hills sigma: 0.05 along both CVs; bias factor: 15; deposition pace: 250 steps) to generate an initial dataset of 1194 structures. Unless differently specified, the timestep for this and all subsequent simulations was set to 0.5 fs and temperature control was achieved by Langevin dynamics, with a time constant of 100 fs. The bias was applied by PLUMED67 and i-PI63,64 was used to drive the dynamics. A structure was saved every 5 ps. The initial dataset generated in this way was evaluated with our reference level of theory. All first-principles evaluations were performed with CP2K 2024.1 (ref. 66) at a PBE-D3(BJ)/MOLOPT-TZV2P level of theory (note that the double polarization functions were not used for Si and Al being electron-deficient).55,56,65 The cutoff for the secondary plane-wave basis set used by CP2K was set to 900 Ry to achieve well-converged forces. GTH pseudopotentials58 were used to smooth the electron density in proximity of the atomic nuclei.
The dataset and model obtained from this first set of simulations with MACE-mp0 were used as input for the actual active learning, which was organized in four steps (Fig. 4):
• Step 1: the well-tempered metadynamics simulations for the 240 independent walkers were continued for 5 active learning loops, propagating each time the walkers for 10 ps and collecting the final structures from the trajectories for the subsequent evaluation, dataset expansion and new model training. A uniform temperature ramp between 300 and 1000 K was applied to the simulations to expand the explored phase space boundaries.
• Step 2: since many walkers from Step 1 entered uninteresting regions of phase space (e.g., methane formation), we restarted all walkers from the reasonable, manually built initial structure of the reactants and added 3 extra active learning loops at 800 K, propagating each walker for 25 ps for each loop. The well-tempered metadynamics bias was also reinitialized.
• Step 3: the transition state region is subjected to limited sampling in metadynamics. Therefore, we constructed some additional structures representing the transition states and minima of interest. We defined a grid of 2-dimensional quadratic umbrella biases covering the CV-regions of interest (that is, the triangle defined by the three minima in Fig. 3). Each umbrella was initialized with the structure closest to its center in CV space. In the first online learning loop, we propagated the walkers for 2500 steps at 10 K, with a timestep of 0.05 fs. This mimics a geometry optimization, and allows the walkers to relax if they start far from an umbrella center. Then, the temperature was raised to 300 K, the timestep set back to its default value of 0.5 fs, and each walker propagated for 5 ps per loop. Four additional loops were performed in this way, simultaneously raising the temperature to 1000 K.
• Step 4: most of the simulations tend to sample TSs involving dimethyl ether. To augment the sampling of TSs involving methanol, we hand-picked a set of umbrella centers sampling the “direct” paths between the minima (i.e., not passing by the center of the triangle in Fig. 3), performing 3 additional on-line learning loops at 800 K with a propagation time of 10 ps per loop.
The final model from this procedure, trained on the 6169 collected structures (randomly split 90
:
10 into training and validation sets), achieved a root mean squared error of 1.2 meV per atom and 1.1 meV per atom on the energies of the training and validation sets, respectively, and a root mean squared error of 39.8 and 42.0 meV Å−1 on the forces of the training and validation sets, respectively. This final model was then used for all subsequent production runs. Because of the qualitative nature of the presented results, no additional checks on the model accuracy were performed, i.e. generating independent test sets to further validate the MLP. Evidently, we visually inspected the generated trajectories to ensure (i) that they were stable and (ii) that no obviously new regions of phase space were explored.
To understand the different degrees of diffusion–reaction entanglements, we consider the diffusion of these intermediates between two adjacent cages of H-SSZ-13 with one BAS on the connecting 8R window, and the resulting free energy profiles are displayed in Fig. 5. First, CO, a stable molecule with a weak affinity for protons (PA: 594.0 kJ mol−1), is difficult to protonate to a cation, and therefore crosses the 8R window in the form of a neutral species (Fig. 5a) by overcoming a barrier of 36.4 kJ mol−1 (Fig. 5d). Secondly, for formaldehyde (HCHO), with a higher PA value of 712.9 kJ mol−1, a weak entanglement between diffusion and reaction is observed, as indicated by the extension of the sampling along the reaction coordinate on the 2D free energy surface (Fig. 5b). However, the protonation of HCHO to CH2OH+ did not produce any local minima. Therefore, the diffusion of HCHO in H-SSZ-13 will occur predominantly its neutral form with a small barrier of 16.1 kJ mol−1 (Fig. 5e), although its cationic form (CH2OH+) may occasionally occur. Finally, ketene (CH2CO), with the highest PA value of 825.3 kJ mol−1, has a well-known high activity to react with the BAS, and the low protonation barrier of ketene (27.3 kJ mol−1 in Fig. 5f) into an acylium ion or surface-bound acetate leads to a strong diffusion–reaction entanglement as displayed in Fig. 5c. Based on the analysis of different diffusion–reaction pathways in Fig. 5f,40 ketene preferably diffuses from cage A to cage B in its neutral form (barrier of 20.4 kJ mol−1). However, diffusion as an acylium ion (CH3CO+) was also found to be kinetically feasible (with a barrier of 25.9 kJ mol−1) and strongly competitive with the pathway of the neutral ketene. Note that for those species that can form a hydrogen bond with the Brønsted acidic proton, like HCHO and ketene, the free energies of the adsorbed species in cages A and B are not the same. The Brønsted acid site (BAS) at the 8-ring window preferentially faces cage B, which leads to stronger hydrogen-bonding interactions with HCHO or ketene in cage B than in cage A. As a result, the free energy in cage B is slightly lower than that in cage A. Some snapshots along the minimum free energy coordinate are shown in Fig. 5d–f. Finally the variation in pathways from CO to HCHO and ketene suggests that this reaction–diffusion entanglement is primarily determined by the difference between the diffusion and reaction barriers. Overall, the increase in competition between diffusion and reaction from CO to ketene reflects the complexity of the diffusion behavior of highly active intermediates and reaffirms the necessity of applying a highly accurate QM accuracy methodology to comprehensively investigate both reaction and diffusion under operating conditions.
![]() | ||
| Fig. 5 The increasing degree of diffusion–reaction entanglement for unsaturated oxygenate mobility and reaction in H-SSZ-13. The 2D free energy profiles along the diffusion and reaction coordinates for (a) CO, (b) HCHO, and (c) CH2CO in H-SSZ-13 zeolite with one BAS on the 8R window at 673 K. The diffusion coordinate is a collective variable that describes the diffusion of guest molecules from cage A to cage B in H-SSZ-13, while the reaction coordinate is a collective variable that describes the reaction of guest molecules with the BAS of H-SSZ-13. The lines on 2D free energy surfaces are the minimum free energy paths of CO, HCHO, and ketene diffusion, and the free energy of these paths is shown in (d) CO, (e) HCHO, and (f) ketene. The reactions are described on the left side of each panel in the figure and Fig. 2b–d. Fig. 5c and f are reproduced under the terms of the CC BY 4.0 license from ref. 40. Copyright 2026 Elsevier. | ||
The influence of the presence of Brønsted acid sites (BAS) on the diffusivities of ethene and ethane in H-SSZ-13 is investigated by performing MLP-US simulations at 600 K to quantify diffusion energy profiles for hopping between two neighboring cages. These simulations can be performed on a small simulation cell and with the MLP simulation lengths up to 1 ns per window can be achieved. Three different zeolite models are considered, containing either 0, 1 or 2 acid sites on the 8-ring window through which the guest molecule jumps. The resulting free energy profiles are depicted in Fig. 6a. Note that by performing MLP simulations, smooth and properly converged free energy profiles can be obtained with significantly reduced error bars compared to analogous simulations at DFT level of theory. The barrier for hopping through the narrow 8-ring window is mainly determined by entropy for ethane and ethene, which are both much smaller than the ring surface area, as was also shown in our earlier work.39 In the absence of acid sites, ethene and ethane exhibit a similar diffusion barrier. However, the presence of BAS in the framework has a considerable impact on the diffusivities. For a single acid site on the 8-ring, the barrier for ethene diffusion is reduced by more than 50% which can be attributed to the stabilizing π–H interactions between the olefinic double bond and the acid proton. These favorable interactions remain intact while the olefin travels through the 8-ring, thus reducing the overall diffusion barrier. This effect is further magnified when two acid sites are located oppositely on the 8-ring window as this orientation allows the formation of a double π–H interaction (see Fig. 6b). In contrast, for ethane, no clear beneficial effect of acid sites is observed. Weak physisorption interactions of the paraffin with the acid proton result only in a minor reduction of the diffusion barrier. In conclusion, the presence of acid sites in the zeolite framework may have a considerable influence on the relative diffusion rates of light olefins and paraffins.
Diffusion constants Ds can be derived from the free energy profiles by applying transition state theory to obtain rate constants for diffusion, kdiff. These rate constants can be used to solve the so-called kinetic master equation or applied in a random walk model, with hopping distance λ, as developed by Smit and coworkers.72,73
| MSD(t) = 2dDst + b |
We used both strategies to derive the diffusion constant at 600 K for ethene in SSZ-13 without acid sites. According to transition state theory and the random walk model, a diffusion constant of 1.92 × 10−11 m2 s−1 is obtained while the MSD method predicts a value of 6.88 × 10−11 m2 s−1, within the same order of magnitude. To track the MSD of the ethene guest molecule, extended simulation lengths on a large simulation cell are required. Such simulations become feasible thanks to the MLP which was derived on DFT calculations on a small unit cell model (Fig. 7a). Regular MD simulations of 10 ns are carried out on a large 2 × 2 × 2 simulation cell. To improve the statistics, the final MSD curves are obtained as averages of 20 independent MD simulations in a lag time range of 5 ns. The results are plotted in Fig. 7b, wherein 2 cases are shown, one with no BAS in the simulation cell and one with 50 BAS randomly distributed in the simulation cell. The computed diffusion constants confirm the earlier observed trend that in the presence of acid sites in the framework, ethene diffusion is subject to a significant speed-up compared to a framework without acid sites. The applied procedure with the MLPs has great potential to extend to higher loadings, a variety of acid site densities. If the MLP is trained to cover complex chemistries, it becomes feasible to derive self-diffusion coefficients but also collective diffusion coefficients for industrial catalysts.
Two collective variable choices are possible (Fig. 3b): the first is a set of three coordination numbers (CNs), similar to what was done in the original study. For the second, one can imagine to project this 3-dimensional space on a 2-dimensional plane by means of linear combinations of the original CNs. While this significantly reduces the complexity of the problem at hand, it will also “merge” different states to similar regions of the 2-dimensional CV space. For an exhaustive explanation of the CVs definition, we refer back to the methods section.
Using the trained model, we began by performing eight independent well-tempered metadynamics simulations67,74 in the 3-dimensional CV space. Each simulation was propagated for 1 ns. The resulting free-energy surfaces are reported in Fig. 8b in terms of the three CVs defined in Section 2.3.1. It is interesting to note that all four minima can be explored by the walkers, however convergence in 3-dimensions is extremely challenging. This shows very clearly when looking at the free energy surfaces of the single simulations, where none managed to visit all four minima and all only visited 2 or 3 of them. The major issue in this system lies in its conformational freedom, where long timescales are needed for the reacting partners to “find each other”. To illustrate qualitatively the expected 3D free energy surface we took the mean of the 8 independent well-tempered metadynamics simulations. The resulting 3D-surface, shown in Fig. 8b, reveals—as anticipated from the collective variable definition—three minima positioned at the vertices of an equilateral triangle in the space defined by the three CNs. Each minimum lies around 1 along the x-, y-, or z-axis, as was already schematically depicted in Fig. 3b.
![]() | ||
Fig. 8 (a) Reaction scheme of the benzene methylation reaction by two methanol molecules in H-SSZ-24. The reactants (I) can undergo concerted methylation towards the products (III). Alternatively, a surface methoxide species can be formed first (II) or the two methanol molecules can condense to dimethyl ether (IV). To illustrate the wide variety of transition paths connecting the various minima, various snapshots of the direct methylation, methoxide and dimethyl ether formation are shown. (b) Mean 3-dimensional free energy surface obtained from a set of 8 independent well-tempered metadynamics simulations. The separate profiles are reported on the right, showcasing how none of the simulations has visited the four minima even after 1 ns of simulation time. (c) 2-Dimensional free energy surface obtained from umbrella sampling simulations in terms of CV1 and CV2, defined as CV1 = and CV2 = , where x, y, z were defined in Section 2.3.1 and schematically in Fig. 3b. | ||
To illustrate the wide variety of transition paths connecting the various minima we show some snapshots of the direct methylation, methoxide and dimethyl ether formation in Fig. 8a. It is clear that various possible transition paths exist due to the large configurational freedom and the participation of one or either two methanol molecules in the transition paths. Note that it is not possible anymore to define one transition state as is often done in static simulation methods, but that a region of transition paths exists. As mentioned in the original publication of De Wispelaere et al., this is a very challenging reaction given the large conformational freedom in the large pore zeolite.31 The high degree of conformational flexibility and the various competing paths, makes a faster pace in the metadynamics bias deposition pace also non-ideal, as one risks forcing bond breaking before the reactive partners have the correct spatial disposition creating unphysical transition states.
To improve the convergence of the final free-energy surface, we constructed a fine grid of 2-dimensional umbrellas based on the following collective variables: CV1 =
and CV2 =
which were defined in Section 2.3.1 and performed umbrella-sampling simulations.75,76 We used an initial force constant of 1500 kJ mol−1, propagating each umbrella for 250 ps, and subsequently added a second set of umbrellas with a tighter force constant (2500 kJ mol−1) propagated for 125 ps to fill residual gaps in the two-dimensional CV space. The thus obtained 2D free-energy surface is shown in Fig. 8c. The four expected minima are clearly distinguishable, although we emphasize that these results are purely qualitative: the strong “compression” of phase space into only two dimensions prevents a clean separation of the different reactive channels (for example, the I → II pathway cannot be distinguished from the I → IV → II route). Obtaining reliable, quantitative results would require a more suitable CV selection or restricting the analysis to specific reaction steps. Nevertheless, this example demonstrates how accessible it has become to investigate complex mechanistic landscapes with MLPs. The production runs amount to almost 175 ns of total simulation time, but required only a few days of actual user time. These numbers were unthinkable at the time of the original study and highlight how MLPs enable more extensive exploration and improved convergence in simulations of zeolite catalysis.
Finally it is interesting to comment on how far it would be possible to also deduce kinetic constants from the simulations which can afterwards be fed into an overall kinetics model. Rate constants are mostly derived from enhanced sampling simulations using transition state theory where barrier recrossings are neglected. The underlying reason for this assumption can be traced back to the high computational expense, especially when the energies and forces are calculated with expensive Density Functional Theory calculations or other first principles methods. With the use of MLPs, it becomes possible to obtain dynamically corrected rates explicitly accounting for barrier recrossings using the reactive flux formalism.25,77–79 Thousands of MD trajectories atop of the trajectories need to be determined to evaluate whether they end in the reactant or product basin.37 With the MLP based methodologies it will also become possible in the future to evaluate reaction rates using path based methods that also necessitate the generation of a lot of trajectories. One example is the usage of Replica Exchange Transition Interface Sampling (RETIS) where an ensemble of reactive paths is generated with a Monte Carlo procedure.80,81 These methods sample the transition path without prior knowledge of the transition state and without using CVs.82–85 So far the method has not been applied to complex reactions under investigation here, due to the computational expense, but new studies are underway also from some of the present authors to showcase the first successful application of these methods to calculate dynamically accurate reaction rates for complex zeolite catalysed reactions.
In the second and third case studies we illustrated how machine learning potentials (MLPs) enable a path forward to obtain dynamic information with quantum-level accuracy for individual steps of a catalytic trajectory, namely for reactive and diffusion events.
For reactions in industrial catalysts, which often occur under extreme temperatures and pressures, it is essential to account for operating conditions.1,8,29,87 The most natural way to do so is to use first principles molecular dynamics simulations, where the catalytic reaction and their intermediates are followed on the fly at the actual reaction conditions.1 With the rise of MLPs, such simulations become much more accessible as shown in this paper. However as reactions are rare events, one needs to rely on enhanced sampling molecular dynamics simulations.74,88–91 As has become clear from case study 3 in this paper, proper choice of CVs can become rather cumbersome.27 An alternative would be to rely on path sampling methods, but their straightforward application to industrially relevant catalysts was to date out of reach due to the large computational cost to generate many paths at the DFT level of theory.81–85,92,93 However with the rise of MLPs, we foresee a more frequent usage of such methods with the rise of reliable MLPs that are able to capture the chemistry of the complex catalyst.
Clearly describing all events of a catalytic trajectory consistently from the nanoscale to the catalyst particle level, poses enormous challenges. However we argue that it is necessary to obtain such a consistent dynamics view across length and time scales. This is especially true as catalysts evolve continuously across length and time scales.23 During a catalytic cycle, new active sites can be formed dynamically, a process that is highly dependent on the operating conditions. Furthermore the dynamics at the crystal-particle level are strongly influenced by spatial gradients, crystal size, and morphology.17 A typical example of such a highly dynamical complex system is an MTO catalyst. A recent review paper by Liu et al. also clearly emphasized how diffusion, reaction network evolution, coke buildup/regeneration and catalyst evolution interact continuously, with each process influencing the others dynamically.2 Within this paper, we focused on the spatial heterogeneities observed within the catalyst particle level, ranging from Å to µm and argue that future modelling studies should focus on obtaining a consistent dynamics view of all events within the crystal particle level with QM accuracy. Catalytic solids used in industrial applications are further embedded within binders or other supports and within the industrial reactor, however these heterogeneities fall beyond the scope of this paper.94 Even at the level of the catalyst particle, where various events are intertwined, namely molecular diffusion, adsorption–desorption, chemical reactions and also surface barriers, substantial gradients in the concentration of reactants and products. Every spatial modification in the catalyst leads to temporal gradients and change of dynamics. Given this complexity, future modeling studies should fully embrace the spatial heterogeneities present in the crystal from the nanoscale local level to the crystal particle level.
However, achieving a fully consistent dynamic view of an entire catalytic process—including all relevant time scales—requires more than MLPs alone. With MLPs one can extend accessible length and time scales with roughly three orders of magnitude with current computational power, compared to accessible length and time scales of hundreds of picoseconds and a few nm accessible with DFT. Therefore, information on individual reactions and diffusion events must be integrated into multiscale models that bridge length and time scales in order to reach the scales of the catalyst particle level. Various models exist to tackle the multi-length/time scale problem, such as microkinetic modeling, kinetic Monte Carlo (kMC) models and heat and mass transport models to account for the inhomogeneities in temperatures and reactant concentrations in the reactor, but none of them account for the kinetics of all events taking place at the crystal particle level with quantum accuracy.95–99 Furthermore within the set of existing multiscale models, major advances are still necessary to make them applicable for complex industrial catalysts and to fully account for the dynamic evolution of the catalyst, spatiotemporal gradients and the interplay between diffusion and reaction. There exist also other phenomenological models to describe the combined effects of reaction and diffusion, however in this case one does not (or to a limited extent) use atomistic scale input.100–102
Overall, the results presented in this paper highlight the need for a consistent dynamics description with quantum accuracy across length and time scales to enable rational catalyst design. Machine learning potentials (MLPs) offer a promising pathway toward this goal by providing kinetic and transport properties for events from the nano- to mesoscale with quantum-level accuracy at significantly reduced computational cost. In the future these data can possibly be integrated into multiscale models such as kMC or reaction–diffusion frameworks, which are essential to reach the length and time scales relevant for industrial catalysts.
| This journal is © The Royal Society of Chemistry 2026 |