Open Access Article
Xin Jin
,
Yutao Li
,
Kelian Gaedecke
,
Xiaoqi Zhang
and
Berend Smit
*
Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland. E-mail: berend.smit@epfl.ch
First published on 2nd April 2026
Metal–organic frameworks (MOFs) are porous materials with the potential for gas adsorption and separation technologies due to their tunable structural and chemical characteristics. However, simulating gas adsorption isotherms in MOFs with DFT-level accuracy remains a key challenge. In this work, we use MIL-120 as a case study and propose a comprehensive computational workflow to fine-tune a pre-trained MACE potential, enabling the creation of accurate machine-learning interatomic potentials tailored for frameworks with dynamic structural behavior. Using this ML potential to accelerate sampling, we uncover a strong coupling between CO2 adsorption and the complex dynamic hydrogen-bond network on the MIL-120 pore surface. The presence of CO2 induces local configurational rearrangements that reshape the pore environment. This insight provides a generalizable strategy for simulating adsorption in other flexible MOF systems.
Although classical force field MC and MD simulations have already been widely employed to explore the adsorption and diffusion of guest molecules in MOFs,6 inaccuracies resulting from these generic force field can lead to both qualitative and quantitative inconsistencies in the prediction of properties, especially in the case of complex physical and chemical adsorptions involving open-metal sites, parallel aromatic rings, and dynamic hydrogen bonds networks.8,9 Ab initio methods, such as Density Functional Theory (DFT), offer a successful approach for precisely determining complex interaction forces. However, considering the time and computational cost, it is impractical to perform ab initio Monte Carlo (AIMC) or Molecular Dynamics (AIMD) calculations for large-scale metal–organic frameworks (MOFs) due to the high computational demands.
Atomistic simulation with machine learning potentials has emerged as a promising approach, offering, at the expense of a modest decrease in computational efficiency compared with force field method, an accuracy comparable to density functional theory. Among these, DeepMD10 and NequIP11 are mature and promising options for training the machine learning potential of MOFs from scratch. Many works have been reported using machine learning potentials trained from scratch for specific MOFs, including simulations of their gas adsorption properties. For example, Achar et al.12 developed an ML potential for CO2 adsorption in UiO-66, achieving DFT-level binding energies. Similarly, Zheng et al.13 employed the DeepMD approach to train an ML potential to describe the CO2 adsorption in MOF-74-Mg. Goeminne et al.14 employed the NequIP method for the same system, further improving the accuracy to a level suitable for grand canonical Monte Carlo simulations. In addition to these system-specific models, there are also generalized MOF ML potentials, such as that reported by Yue et al.,15 who trained a NequIP potential for 8000 Zn-based empty MOFs, which can be used for geometric optimization. These studies generate the dataset for training the potential through time-consuming ab initio molecular dynamics simulations. And, to ensure a stable and reliable potential, introduce subsequent active learning cycles. These cycles iteratively refine the potential energy surface. One of the challenges with this approach is that it has limited extrapolation capacity.
Among those machine learning potential methods, MACE, a message-passing model based on Atomic Cluster Expansion (ACE),16 uses a different approach. Unlike traditional local potential models, the MACE potential can capture information beyond the strictly local atomic environment, owing to message passing across multiple predefined cutoff ranges within the model, ensuring that MACE is not limited to structural information within a fixed local neighborhood. Instead, it enhances the representation of high-order many-body interactions, thereby improving the accuracy of descriptions in complex atomic environments. More importantly, MACE provides a set of general pre-trained models derived from a wide range of solid-state material systems.17 These models allow users to efficiently generate representative machine learning potentials by fine-tuning with a small number of key configurations. Lim et al.18 performed continuous sampling of CO2 at different positions inside MOF pores, and fine-tuned a pretrained MACE potential to obtain a generalized potential for CO2 adsorption in rigid MOFs. This approach is expected to yield accurate results for MOFs with well-defined, known binding sites. However, It is still far from providing accurate adsorption energies in all MOFs, let alone isotherms.
Moreover, there is an even more challenging task for simulating the properties of flexible frameworks. In most cases, MOF structures are treated as rigid frameworks, with their structures assumed to remain unchanged during Monte Carlo or Molecular Dynamics simulations. However, this assumption does not hold for all MOFs. In fact, an increasing number of flexible MOF structures have been reported, characterized by pore sizes or surface morphologies that change in response to variations in temperature or pressure.19,20 Since gas adsorption occurs at the pore surfaces, the dynamic structural changes of flexible MOFs can have a significant impact on adsorption behavior and therefore cannot be neglected in simulation studies.21 It is important to build a potential energy surface (PES) to describe how atoms interact for MC or MD simulations at the DFT level with flexible frameworks.
MIL-120, a robust microporous aluminum 1,2,4,5-benzene tetracarboxylate framework, has recently been reported by Chen et al.22 as a CO2 capture material candidate for its relatively moderate isosteric enthalpy of adsorption (≈−40 kJ mol−1), good CO2/N2 selectivity, and long-term stability under real conditions (see Fig. 1). Classical force field simulations of gas adsorption in Al-based MOFs often show significant discrepancies from experimental measurements,23–25 necessitating the use of empirical scaling factors. MIL-120 is no exception in this regard. For instance, Chen et al.22 set the energy parameters of Al to zero in their classical GCMC simulations to fit the experimental data. Naturally, simulations of MIL-120 are particularly challenging due to the structural complexity of the framework, considering the high density of µ2-OH groups, as well as the accessible aromatic rings decorating the channel surface, which play an essential role in the CO2 interaction with the atoms of the MOF. This work was inspired by the recent work of Li et al.,26 who developed an empirical force field to correct the UFF force field for Al-containing MOF. This force field showed a remarkable improvement for all Al-containing MOFs for which experimental isotherms were available, except for MIL-120. For this MOF, Li et al.26 could not capture the adsorption of CO2 correctly. Therefore, this represents an interesting MOF to study using a machine learning force field.
![]() | ||
| Fig. 1 The atomistic structure of MOF MIL-120, where the red balls represent O atoms, blue ones represent Al atoms, white ones represent H atoms, and brown ones represent C atoms. | ||
In this work, we present a general computational workflow, using CO2 adsorption in MIL-120 as a case study, to establish a machine-learning potential for predicting adsorption isotherms using grand-canonical Monte Carlo simulations. Our workflow assumes a flexible MOF and aims to predict CO2 adsorption isotherms with DFT-level accuracy. The final training set contains only around 3000 DFT single-point calculations, with training errors below 0.1 meV per atom for energies and below 20 meV Å−1 for intermolecular forces. The results illustrate the importance of dynamic changes in the chemical environment at the MOF pore surface on the gas adsorption behavior in MIL-120.
Specifically, hydrogen-bond networks are established between adjacent MOF rod chains through the orientations of these µ2-OH groups. Each µ2-OH group on a rod chain forms a hydrogen bond with a µ2-OH group on a neighboring chain, where one acts as a donor (D) and the other as an acceptor (A). The µ2-OH groups functioning as acceptors typically possess a free OH bond pointing into the MIL-120 pore, creating potential CO2 adsorption sites.
Once the orientation type of the µ2-OH groups on a single rod chain is determined, the orientations on adjacent rod chains can be inferred accordingly, enabling the construction of the complete hydrogen-bonding network across the MIL-120 unit cell.
However, due to the limitations of X-ray diffraction (XRD) experiments in accurately resolving hydrogen atom positions, the orientation of OH bonds in the µ-(OH) groups of MIL-120 cannot be directly determined. This uncertainty gives rise to multiple possible structural configurations. For example, in the work by Chen et al.,22 the OH orientation sequence in the rod chain repeat unit for the str1 structure is AAAADDDD, whereas for str2 it is ADDAADDA. In fact, beyond these two configurations, many other configurations are possible. For instance, as illustrated below, str3 exhibits DAADDAAD OH sequence, while str4 adopts a uniform AAAAAAAA pattern.
Performing CO2 adsorption energy calculations for each of these hydrogen-bonding variants, or treating them as rigid frameworks for GCMC simulations to obtain adsorption isotherms, would be computationally inefficient and practically infeasible. Therefore, it is necessary to develop more efficient strategies to address the configurational diversity arising from structural uncertainty in a reasonable manner.
(1) Ab initio molecular dynamics simulations of the MIL-120 empty framework to sample stable dynamic configurations of the framework structure (optional);
(2) Long-time molecular dynamics simulations of the MIL-120 empty framework based on the pre-trained MACE potential at different temperatures, to collect diverse dynamic framework configurations (MACE–MD);
(3) Grand Canonical Monte Carlo simulations of CO2 adsorption with a fixed framework using the UFF potential, to sample configurations of CO2 adsorption sites and configurations;
(4) Long-time molecular dynamics simulations of the MIL-120 with CO2 based on the pre-trained MACE potential at different temperatures, to collect diverse dynamic adsorption configurations.
Changes in the unit cell shape are one manifestation of MOF flexibility. In the present work, however, for MIL-120, no breathing behavior has been observed experimentally, and the unit cell does not undergo significant deformation under ambient conditions. Therefore, we only used NVT simulations in both AIMD and MACE–MD. For MOFs that do show changes in the unit cell shape, sampling using the Rahman–Parrinello MD scheme can be straightforwardly incorporated into our workflow by including stress in the MACE training parameters, without altering the overall procedure presented here.
Fig. 3 demonstrates the two-dimensional projection of the SOAP structural descriptor space for empty MIL-120 configurations. We used the t-distributed stochastic neighbor embedding (t-SNE) method, which preserves local similarities within the data, ensuring that similar structures are mapped close to each other in the 2D space. The plot clearly reveals that the configurational space is divided into two distinct regions, indicating a significant difference between the configurations sampled by AIMD simulations and those from simulations based on the pre-trained MACE potential. This highlights a key limitation of AIMD-based sampling. Due to its high computational cost, simulations are typically limited to short time scales, making it difficult to capture the full range of structural evolution and configurational diversity of the MOF framework starting from a single initial structure at finite temperature. From this, we selected the 800 most distant (and thus maximally diverse) points as the initial training set for the empty framework.
As a first screening criterion, we employed the variance in atomic forces predicted by an ensemble of machine learning potentials to identify regions of high uncertainty. In each training iteration, we generate four sets of machine learning potentials from the same training set but with different random seeds. The interatomic force deviation across multiple random seed models can be expressed as follows:
![]() | (1) |
i represents the average forces over all the MACE potential models, and the deviation serves as a criterion for structural selection. Those structures with force deviations between 0.06 and 0.24 eV Å−1 were selected as candidate configurations, depending on the users' desired level of accuracy for the final target MLP. Candidate configurations were then evaluated using SOAP descriptors to quantify structural diversity, and representative structures were selectively added to the training dataset.
Our results give a significantly better accuracy with an order of magnitude less training data compared to previous results.12,21 These studies also employ active learning, but they differ in the way configurations are selected for which additional DFT labeling is performed. Typically, these candidate structures were randomly sampled from the set of configurations that have too high errors in the forces or selected as the final frames of short MD simulations starting from a large number of randomly perturbed initial structures. We employ a different approach to avoid random sampling. We use a SOAP structural descriptor to quantify the similarity of the set of inaccurate configurations. We then perform our DFT labeling for a set of 200 most diverse configurations for each system that optimally span this set.
Fig. 4 shows the sampling of our MOF loaded with one CO2 molecule during the first round of active learning. The red dots in Fig. 4A are the first set of configurations for which we fine-tuned the Mace force field, and the grey dots are the configurations from a long MD trajectory we generated from the first round of fine-tuning. Fig. 4B is the same figure but now color-coded with the estimated accuracy of this force field. In our first round of active learning, we selected the set of configurations with poor accuracy. We determined the set of configurations that optimally cover this set, which are the yellow dots in Fig. 4A. In Fig. 4C, we compare our sampling with a random sampling from the same set of configurations, i.e., how many random samples do we need to select to cover the same diverse configuration space? These results show that even with 2000 configurations, we do not cover the entire configuration space.
Fig. 4D illustrates the reason for this. The yellow line is a sketch of the energy landscape of a part of our MD trajectory. If we carry out a random sampling, we will mainly select the low-energy configurations. For an accurate MD trajectory, not only do the low-energy configurations matter, but the barriers are equally important. Our diversity algorithm ensures that representative configurations of these barriers are included in the training set. However, with random sampling, the probability of selecting such a high-energy configuration is (exponentially) small, and at the same time, it selects many configurations that add very little new information to our active learning. Hence, one needs an order of magnitude more configurations to have such a conformation selected during any of the learning cycles. In our example, even with 10 times more samples, we did not sample all relevant barriers. This is the main reason why we can get significantly better accuracy with far fewer DFT single-point calculations.
These results also highlight that AIMD simulations with only 10
000 DFT steps explore only a small fraction of the conformational space of such complex, dynamic structures. As shown in Fig. 4A, we can even omit all AIMD conformations from the training set, because sampling based on the pretrained MACE model already covers the conformational space explored by AIMD. For the hydrogen-bond network in MIL-120, fewer than 3000 single-point DFT calculations are sufficient to achieve an accurate description of the system. This approach offers substantial savings in computational resources compared to most MLP training strategies that rely heavily on conformations extracted directly from AIMD trajectories.
Fig. 6 illustrates the distribution of the training set within the entire sampling space. This space includes not only all structural frames obtained during the initial data sampling phase for training but also all frames extracted from long-time molecular dynamics and GCMC simulation trajectories generated using the trained models. By comparing these distributions, one can evaluate the extent to which the training set covers the overall configurational space and assess the model's generalization capability in realistic simulation tasks. These simulation tasks includes not only simple single-energy calculations and NVT MD simulations but also hybrid MD–GCMC simulations.
Fig. 7 presents a comparison between the fine-tuned MACE potential and DFT calculations in terms of energy and interatomic force predictions, demonstrating excellent agreement. For the empty MOF framework system, the prediction errors MAE of the fine-tuned MACE model for energy and atomic forces are 0.07 meV per atom and 14.58 meV Å−1, respectively. In the system with one adsorbed CO2 molecule, the corresponding errors for energy and forces are 0.07 meV per atom and 13.32 meV Å−1, respectively. Combined with the previously shown broad and well-distributed coverage of the configurational space, this further confirms that the constructed MACE potential effectively captures the dynamic structural variations of the MIL-120 framework. These error bars are an order of magnitude smaller than typically reported for MLP describing MOFs.12,13,15,18 Therefore, this model serves as an efficient and reliable alternative, enabling feasible simulations of experimentally observable macroscopic properties, such as adsorption isotherms and heats of adsorption, while maintaining accuracy close to the DFT level.
To quantitatively characterize the directionality of each OH bond, we computed the angle α between the projection vector of the OH bond onto the xz-plane and the x-axis. As shown in the Fig. 8, we obtained the statistical distribution of these angles over a 0.1 ns timescale.
The angle range was divided into three intervals: 0° to 45°, 45° to 135°, and 135° to 180°. OH bonds falling within the first and third intervals were defined as “free OH” pointing toward the pore center, functioning as hydrogen bond acceptors. OH bonds within the middle interval were oriented toward neighboring rod chains, serving as hydrogen bond donors. The distribution of OH orientations among the three categories remained approximately in a 1
:
2
:
1 ratio. This observation is consistent with our earlier structural analysis, wherein two adjacent µ2-(OH) groups form a stable hydrogen bond, while the remaining OH bond points into the pore, acting as a potential CO2 adsorption site.
In MIL-120, the number of OH groups pointing toward the pore center is constant. Changes in their spatial distribution within the pore have a large impact on the interactions with CO2 molecules. Certain adsorption sites allow a single CO2 molecule to interact simultaneously with multiple OH groups, thereby significantly affecting its adsorption capacity. Fig. 9 shows the CO2 adsorption sites and Table 1 the corresponding DFT-calculated binding energies for the three main structural units (str1, str2, and str3 in Fig. 8) in MIL-120. It is evident that the introduction of CO2 induces a configurational response on the pore surface, altering the local morphology of the pore environment.
| Structures | Binding energy (kJ mol−1) |
|---|---|
| Str1_parallel | −41.07 |
| Str1_vertical | −36.17 |
| Str2_parallel | −46.95 |
| Str2_vertical | −44.56 |
| Str3_parallel | −34.77 |
| Str3_vertical | −33.56 |
Based on this analysis, the MIL-120 framework was divided into these three structural units. Molecular dynamics simulations of CO2-adsorbed MOF structures at various temperatures were conducted to statistically track changes in the proportions of each structural unit. As shown in the Fig. 10, str1 dominates in the empty MIL-120 framework; however, upon CO2 adsorption, the proportion of str3 gradually decreases while that of str2 notably increases. This shift is primarily attributed to the interactions between CO2 molecules and the OH groups on the pore surface, which perturb the equilibrium of the dynamic hydrogen-bond network. Due to the variation in CO2 binding energies across different structures, the adsorption process tends to induce a structural transition favoring the formation of str1 and str2, which offer stronger adsorption sites. To investigate the effect of temperature on the intrinsic framework behavior, we performed angle-based structural analysis on the empty MIL-120 framework at various temperatures. The results show that as the temperature increases, the proportion of str1 gradually decreases, str2 declines rapidly, while str3 increases significantly. This indicates a temperature-driven reorganization of the hydrogen-bond network toward more disordered structures. This phenomenon is a key reason why traditional rigid-framework models fail to accurately simulate CO2 adsorption in MIL-120, highlighting the critical importance of accurately modeling structural flexibility and the dynamic hydrogen-bond network for reliable adsorption performance descriptions.
By performing molecular dynamics simulations on both the empty MOF framework and the MOF structure with adsorbed CO2 molecules, we calculated the heat of adsorption of CO2 in MIL-120 using the following formula:
| ΔHads = 〈EMOF+CO2〉 − 〈EMOF〉 − 〈ECO2〉 − RT | (2) |
The calculated heat of adsorption is −41.9 kJ mol−1, which is in excellent agreement with experimental value (−40.2 kJ mol−1). If we compare this with the value obtained by UFF (−52.3 kJ mol−1) or the modified UFF (−50.2 kJ mol−1), we see that this represents a significant improvement.
To more accurately capture the dynamic structural response of the MOF framework during gas adsorption, we incorporated a molecular dynamics-coupled strategy into the GCMC simulations. Specifically, prior to each CO2 insertion or deletion attempt, a short molecular dynamics simulation was performed on the entire system. This approach updates the framework configuration, allowing it to more realistically reflect the flexibility changes that occur during the adsorption process.
Fig. 11 compares the predicted CO2 adsorption isotherms in MIL-120 at 303 K using different simulation methods. The results show that the MD–GCMC approach, which accounts for framework dynamics, provides adsorption predictions that align more closely with experimental data. This highlights the significant role of structural flexibility in governing the adsorption performance of this material.
![]() | ||
| Fig. 11 Comparison of the simulated and experimental CO2 adsorption isotherms at 303 K. The green curve represents the experimental CO2 adsorption isotherm. The red curve shows the result from this work using GCMC simulations with a fine-tuned MACE potential on a flexible framework. The yellow curve corresponds to GCMC simulations with a fixed framework using the UFF force field, while the blue curve shows results from a fixed-framework simulation using a refined force field for aluminum centers.26 | ||
It is instructive to compare our results with the simulation results of Chen et al.22 It is well known that the UFF force field overestimates the results for Al-containing MOFs. For example, Boyd et al.6 used a scaling parameter to reduce the van der Waals interactions. Chen et al.22 could obtain excellent agreement with the experimental isotherms of CO2 in MIL-120 by setting the van der Waals interaction of CO2 with the Al atoms to zero. Li et al.,26 however, argued that it is not the Al atoms that have the incorrect parameters in UFF, but the O and C atoms next to Al. However, all these calculations assume that the MOF is rigid. This work shows that for MIL-120, the explanation is much more subtle. It is not only a matter of changing the UFF parameters, but it is intrinsically related to the dynamics of the hydrogen bond network that cannot be captured if we assume a MOF is rigid.
Also, it is important to realize that the effect of the hydrogen is amplified because four hydrogens collectively interact with CO2, in most MOFs, it will only be one. In such a case, movements of the hydrogen will have little, if any, impact on the adsorption behavior. Therefore, we do not expect similar failures to describe the CO2 adsorption in most other MOFs in classical force fields with a rigid framework.
In this work, we developed a comprehensive and efficient computational workflow to fine-tune an MACE potential for modeling gas adsorption in flexible MOFs, which, in the case of MIL-120, can accurately capture the dynamic hydrogen-bond network critical to CO2 adsorption. By integrating multiple sampling strategies, we constructed a diverse and representative training dataset, further refined through an active learning loop. This active learning process aimed to ensure that we carry out an optimal number of DFT calculations, where we define optimal as a compromise between available CPU time, accuracy, and the diversity of configurations that may need to be sampled. In this case, we carried out about 3000 single-point DFT calculations.
For carbon capture applications, CO2 is an important component. An equally important component in carbon capture applications in H2O. Our results for MIL-120 show that the MACE potential reliably captures that CO2 adsorption induces a reorganization of the hydrogen-bonding network, altering the pore environment in ways that rigid-framework models cannot capture. These observations are fascinating, but unique to MIL-120. However, for the adsorption of H2O, we expect this to be the norm. An optimal orientation of a single hydrogen of the MOF can facilitate the formation of a hydrogen-bond network for H2O.
The configuration number of the training set is two orders of magnitude larger than that used by Lim et al.18 to describe the adsorption of CO2 in MOFs with open metal sites. There are a few reasons why these numbers are so different. First, we computed the complete isotherm, while Lim et al.18 focused on a single CO2 molecule. A more important difference is that Lim et al.18 assumed that the MOF was rigid and focused on fine-tuning the MACE potential at the binding site. This is a very efficient strategy for those MOFs for which the binding site is known, but it would not work in the case of MIL-120. This study highlights the necessity of accounting for framework flexibility and dynamic chemical environments in realistic MOF adsorption simulations. The proposed machine learning-based workflow offers a transferable strategy for studying other flexible MOF systems and provides new insights for the design of MOF materials for direct air capture and related applications.
All our DFT results can be found on Zenodo. If more studies make their DFT calculations available, these force field models can be systematically improved for these metal–organic frameworks. This will eventually reduce the number of DFT calculations needed to fine-tune these models.
The initial training set of MOF with CO2 configurations was also generated from UFF GCMC simulations at subsequent pressure points. They were performed by the Isotherm work chain in the aiida-lsmo plugin32 with RASPA33 molecular simulation software. The optimized framework geometries were kept rigid in classical simulations. We considered van der Waals and electrostatic interactions to describe the energy surface, represented respectively by the Lennard-Jones (LJ) potential and Coulomb interactions. The density-derived electrostatic and chemical (DDEC) method34 is used to compute the partial charges on the atoms of the MOF frameworks. The Ewald summation technique was used to model the Coulomb interaction. The TraPPE force field35 was selected to model gas–gas interactions. The Lennard-Jones parameters were taken from the Universal Force Field (UFF)36 to model the interactions of CO2 with the framework atoms.
The hybrid MD-GCMC simulations were performed with the LAMMPS package. At a temperature of 300 K, employing a Nose–Hoover thermostat with a time step of 0.5 fs, the ideal gas reservoir is defined by specifying different pressures. Throughout the simulation, NVT MD simulations are performed, with one GCMC insertion/deletion attempt every 5 steps. The positions of CO2 molecules are updated in real time during the MD simulation; there are no translation or rotation Monte Carlo move attempts. The chemical potential, as defined in Lammps, is calculated by pressure according to the following equation:
![]() | (3) |
![]() | (4) |
Ab initio molecular dynamics (AIMD) sampling was performed in the canonical ensemble employing a Nose–Hoover thermostat at 300 K with a time step of 0.5 fs over a total duration of 5 ps.
:
100 for energy and force, respectively. In the second stage, the weights were adjusted to 10
:
1 to focus on accurate energy prediction. Optimization was performed with a batch size of 8 using an initial learning rate of 1 × 10−3 with a weight decay of 5 × 10−7. In the second stage, the learning rate was reduced to 1 × 10−4. Early stopping was employed with a patience parameter of 10 epochs to prevent overfitting.
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d5sc09058j.
| This journal is © The Royal Society of Chemistry 2026 |