Open Access Article
Joana Avelar
*a,
Marcos Rivera-Almazo
b,
Rubicelia Vargas
a and
Jorge Garza
*a
aDepartamento de Química, División de Ciencias Básicas e Ingeniería, Universidad Autónoma Metropolitana-Iztapalapa, San Rafael Atlixco 186, Col. Vicentina, Iztapalapa, C.P. 09340, Ciudad de México, Mexico. E-mail: joar@xanum.uam.mx; jgo@xanum.uam.mx
bInstituto de Física, Universidad Nacional Autónoma de México, Coyoacán, C.P. 01000, Ciudad de México, Mexico
First published on 2nd June 2026
We present a computational framework for the systematic exploration of high-dimensional potential energy surfaces associated with host–guest interactions in porous materials. As case studies, the adsorption of benzyl alcohol, benzene, benzaldehyde, CO, CO2, H2S, and methanol in the metal–organic frameworks MFM-300(Sc) and MFM-300(In) was investigated. A stochastic sampling strategy generated thousands of candidate configurations, which were first screened at the semiempirical PM7 level and subsequently refined using dispersion-corrected density functional theory. To identify representative low-energy structures while preserving configurational diversity, a data-driven selection procedure based on outlier detection and unsupervised clustering of energetic and geometric descriptors was implemented. The exploration of approximately 39
000 configurations reveals that the adsorption landscapes are characterized by multiple low-lying minima within narrow energy windows, particularly for weakly interacting guest molecules. This behavior reflects the complex and highly corrugated potential energy surfaces associated with confinement in MOF pores and highlights the limitations of approaches based on chemically intuitive initial guesses or single optimized structures. The multilevel protocol provides a statistically robust and computationally efficient strategy for identifying adsorption sites and representative configurations in porous materials, and offers a transferable framework for exploring configurational landscapes in host–guest systems, heterogeneous interfaces, and other complex condensed-phase environments.
Metal–organic frameworks are a class of hybrid crystalline porous materials composed of inorganic metal ions or clusters interconnected by rigid organic linkers bearing multiple coordination sites. The resulting frameworks are stabilized through coordination bonds between the metal nodes and organic ligands, often reinforced by Coulombic interactions.8,11–14 Owing to their high crystallinity, exceptional porosity, and structural tunability, MOFs enable precise control over both physical and chemical properties. This remarkable versatility has driven extensive research into their use across a broad range of applications, including catalysis, gas adsorption and separation, energy harvesting and storage, luminescence, sensing, drug delivery, biomedicine, magnetism, and light-emitting diodes (LEDs).15,16
Due to their astonishing surface area, consequence of their porous nature, the role of MOFs in gas and molecular capture and storage has been extensively studied. Furthermore, the strategic design and functionalization of MOFs allow for the selective and efficient adsorption of hazardous and industrially relevant gases, thereby contributing to environmental sustainability and process efficiency. For example, CO2 capture is critical for mitigating greenhouse gas emissions, and frameworks such as MUF-16 have demonstrated exceptional CO2 selectivity over hydrocarbons, making them suitable for natural-gas purification and feedstock refinement.17 Likewise, the removal of toxic gases such as H2S is essential to prevent corrosion and catalyst poisoning in industrial settings. MOFs including Mg-MOF-74 and MIL-101(Cr) have shown notable selectivity for H2S over CO2, exhibiting reversible adsorption behavior and excellent structural stability.18–21 The π–π stacking interactions and hydrophobic pore environments within MOFs promote the selective uptake of aromatic pollutants such as benzene and benzaldehyde, compounds associated with carcinogenicity and respiratory health risks.22 In parallel, the presence of polar functional groups in certain MOFs facilitates the adsorption of polar volatile organic compounds (VOCs) such as methanol and benzyl alcohol, thereby mitigating occupational exposure in industrial environments.15 In this context, research on the capture of these molecules is essential for advancing environmental protection, ensuring industrial safety, and optimizing chemical processing.
MFM-300(Sc) and MFM-300(In) were selected as model systems because they are isostructural frameworks that provide closely related pore environments while differing in the nature of the metal center. Both materials contain μ2-OH groups lining the pore walls, which act as well-defined interaction sites capable of establishing hydrogen-bonding and electrostatic contacts with guest molecules. In addition, the MFM-300 family has been reported to exhibit dynamic metal–ligand behavior, in which hemilabile coordination between the metal centers and carboxylate ligands may promote the reversible formation of semi-open metal sites upon guest adsorption.23,24 These structural features make MFM-300(Sc) and MFM-300(In) suitable platforms for studying adsorption modes governed by different types of host–guest interactions, ranging from weak dispersive contacts in small gases such as CO and CO2, to hydrogen-bonding and electrostatic interactions in polar molecules such as methanol and benzyl alcohol, and π-related interactions in aromatic guests such as benzene and benzaldehyde. Moreover, the experimental characterization available for these frameworks and related MFM-300 materials provides a useful reference for benchmarking computational methodologies. For these reasons, MFM-300(Sc) and MFM-300(In) constitute appropriate model systems for assessing a systematic protocol aimed at exploring complex MOF–guest potential energy surfaces.
The efficiency of gas capture in MOFs is primarily determined by the strength and nature of host–guest interactions within their porous frameworks.25,26 Computational modeling plays a central role in elucidating the structural features of these complexes and quantifying interaction energies with sufficient accuracy to rationalize experimental observations.27,28
A central challenge in the computational investigation of MOF–guest systems is the reliable identification of low-energy adsorption configurations on highly multidimensional potential energy surfaces. Chemically intuitive initial guesses are frequently employed29–31; however, such approaches do not guarantee identification of the global minimum or a representative ensemble of metastable states. Consequently, systematic exploration of the configurational landscape is required to obtain a statistically meaningful description of host–guest interactions.
Given the structural diversity and size of MOFs, simulations must address multiple length and time scales. For large systems, force-field-based atomistic simulations provide computational efficiency,32 albeit with limited accuracy in describing detailed chemical interactions. Conversely, electronic-structure methods offer greater fidelity in locating adsorption sites and characterizing host–guest interactions, though at significantly higher computational cost and typically under low guest loadings.33 These approaches can be combined within multiscale frameworks. In parallel, the integration of machine-learning techniques with high-throughput data generation has enabled rapid screening of experimental and hypothetical MOFs, using descriptors such as adsorption energies, geometric parameters, electrostatic potentials, polarizability, and electronic features to predict adsorption performance and reactivity.27,34–37
In this work, we introduce a computational framework for the unbiased exploration of complex potential energy surfaces and the systematic identification of low-energy configurations in high-dimensional configurational spaces. Rather than relying on chemically intuitive initial guesses, the protocol combines stochastic configurational sampling, descriptor-driven machine-learning selection, and hierarchical density functional theory (DFT) refinement to efficiently navigate large structural ensembles while preserving both energetic and geometric diversity.
Although the framework is demonstrated here through its application to MFM-300(Sc) and MFM-300(In) interacting with representative guest molecules—including alcohols, benzene, benzaldehyde, CO, CO2, and H2S—the conceptual foundation of the methodology extends well beyond MOF–guest systems. Its design is inherently general and can be readily adapted to heterogeneous interfaces, catalytic active sites, supramolecular assemblies, molecular clusters, and condensed-phase environments where a statistically robust and unbiased exploration of the configurational landscape is essential.
More broadly, the present work establishes a reproducible, descriptor-driven, and statistically informed strategy for systematic potential energy surface exploration. By combining extensive sampling with hierarchical electronic-structure refinement, the protocol provides a transferable framework capable of supporting data-driven materials discovery, adsorption studies, and mechanistic investigations across a wide spectrum of problems in computational chemistry and materials science.
To accomplish this, we explored the potential energy surface (PES) of each MOF–guest system using a stochastic sampling strategy. In particular, the Restricted Isomers Searching by Simulated Annealing (RISSA) code38,39 was employed to efficiently sample the configurational space and generate thousands of candidate MOF–guest configurations. Within RISSA, the relative energy of each generated configuration is evaluated using the self-consistent field (SCF) energy obtained at the semiempirical PM7 level of theory, providing a computationally efficient estimate of the energetics during the sampling process.40
Although RISSA has been previously used for molecular and cluster-like systems, the code also includes different spatial constraints for configurational sampling, namely confinement within a sphere, three-dimensional periodic boundary conditions (PBC), and two-dimensional periodic boundary conditions. In the present work, the MOF–guest configurations were generated using the three-dimensional PBC option, with the guest molecule sampled inside the periodic unit cell of the corresponding MFM-300 framework. Therefore, the stochastic exploration was carried out directly on the periodic MOF–guest systems, and no finite cluster models were constructed or used in the sampling stage.
For each system, RISSA generated a variable number of distinct structures, ranging from 2000 to 3470, depending on the configurations identified during the stochastic sampling process. Previous studies have demonstrated the effectiveness of RISSA in identifying reliable candidates for global minima on complex potential energy surfaces.41–45
In the next stage of the protocol, single-point energy calculations were carried out at the DFT level to improve accuracy and obtain reliable electronic properties. The Perdew–Burke–Ernzerhof (PBE) exchange–correlation functional46 was employed together with Grimme's D2 dispersion correction scheme.47 Calculations were performed using plane waves and pseudopotentials with an energy cutoff of 400 eV to achieve a balanced compromise between accuracy and computational efficiency. This stage DFT computations were conducted with the Vienna Ab initio Simulation Package (VASP).48
In a subsequent step, a machine learning–based selection strategy was employed to systematically identify a reduced yet representative set of configurations for further high-level calculations. This step aims to mitigate the large structural redundancy inherent to stochastic sampling methods while preserving both energetic and geometric diversity across the explored configurational space. Prior to model construction, total energies were normalized to the [0, 1] interval, and anomalous configurations were removed using the Isolation Forest algorithm. Three complementary selection models were then developed, each designed to capture different aspects of the MOF–guest interaction landscape: (a) energy-based clustering, in which configurations were grouped solely according to their total energy, enabling the identification of energetically representative structures without bias toward a single low-energy region; (b) multivariable clustering, combining total energy with four distances between the most electronegative atom of the guest and selected metal centers in the MOF, thus accounting simultaneously for energetic and geometric similarity and allowing distinct adsorption motifs with comparable energies to be retained; and (c) minimum-distance selection, based exclusively on the shortest distance between the most electronegative atom of the guest and the nearest MOF metal center, targeting configurations with potentially strong host–guest interactions. For models (a) and (b), unsupervised clustering was performed using the K-means algorithm.49,50 The optimal number of clusters was determined to be three using the elbow method, and the configuration closest to each cluster centroid was selected as the representative structure. For model (c), the four configurations exhibiting the smallest guest–metal distances were retained.
It is important to emphasize that, at this stage, all configurations correspond to unrelaxed geometries generated directly from the stochastic sampling procedure and evaluated only through single-point semiempirical or low-cost DFT calculations. Therefore, the corresponding energies should be regarded as screening descriptors rather than final adsorption energies. Consequently, the configurations exhibiting the lowest initial energies are not necessarily those that converge to the lowest-energy structures after full geometry optimization at a higher level of theory. This is particularly relevant for MOF–guest systems, where geometrically distinct adsorption motifs may have similar preliminary energies but relax toward substantially different local minima due to changes in guest orientation, host–guest contacts, and framework relaxation. Since full periodic DFT optimization of all sampled configurations would be computationally prohibitive, selecting structures exclusively according to their initial energy ranking could introduce a strong bias toward a narrow region of the configurational space and potentially overlook relevant adsorption modes. To address this issue, the present protocol incorporates a descriptor-driven unsupervised selection strategy that considers both energetic and geometric variability prior to the final DFT optimizations. This approach increases the likelihood of retaining representative configurations associated with distinct host–guest interaction mechanisms and enables a broader and more statistically robust exploration of the potential energy surface.
In the final stage of the protocol, both the cell parameters and atomic positions of all selected structures were relaxed using DFT with high-accuracy settings. These calculations were performed with VASP employing the PBE-D2 method in a similar fashion to the second stage. For this optimization section, in contrast, the interaction between core and valence electrons via the Projector Augmented-Wave (PAW) method,51,52 an energy cutoff of 520 eV, as well as spin polarization were considered. Geometry optimizations were conducted with a plane-wave energy cutoff of 520 eV and Brillouin-zone sampling using a 1 × 1 × 1 k-point mesh. The electronic energy and force convergence thresholds were set to 1 × 10−6 eV and 0.01 eV Å−1, respectively.
Additionally, complementary DFT calculations employing Gaussian-type basis sets were carried out with the CRYSTAL23 software package.53 These calculations used the hybrid B3LYP exchange–correlation functional54 with Grimme's D2 dispersion correction and the POB-TZVP basis set55 for all atoms, except for the In(III) ion, where a pseudopotential compatible with the POB-TZVP basis was employed. A k-points sampling of 2 × 2 was set together with the electronic energy and force convergence thresholds of 1 × 10−7 a.u. and 0.0003 a.u. Å−1, respectively. Although more recent dispersion schemes such as D3 are available in both VASP and CRYSTAL23, the D2 correction was deliberately retained in this work to ensure direct methodological consistency with the previous study by Rivera-Almazo et al. on MFM-300(Sc) and MFM-300(In).30 This choice allows the differences observed in the adsorption configurations and relative energies to be attributed primarily to the configurational-search protocol rather than to changes in the dispersion model. Therefore, D2 was adopted as a common reference level for both the large-scale screening and the subsequent comparison with previously reported structures, rather than as an assertion of its superiority over more recent dispersion corrections. The overall computational workflow is summarized in Fig. 1. All data preprocessing and machine-learning analyses were implemented in Python using the scikit-learn library.49,50
| Parameter | B3LYP-D2 | PBE-D2 | Exp56,57 | MAPE (B3LYP-D2) | MAPE (PBE-D2) |
|---|---|---|---|---|---|
| MFM-300(Sc) | |||||
| a | 12.56 | 12.61 | 12.49 | 0.54 | 0.93 |
| b | 12.56 | 12.61 | 12.49 | 0.54 | 0.93 |
| c | 12.56 | 12.61 | 12.49 | 0.54 | 0.93 |
| α | 104.28 | 104.26 | 104.22 | 0.06 | 0.03 |
| β | 104.28 | 104.26 | 104.22 | 0.06 | 0.03 |
| γ | 120.45 | 120.50 | 120.58 | 0.11 | 0.07 |
| Volume | 1483.23 | 1500.30 | 1458.46 | 1.70 | 2.87 |
| dM-Oeq | 2.11 | 2.11 | 2.12 | 0.48 | 0.34 |
| dM-Oax | 2.12 | 2.12 | 2.11 | 0.41 | 0.52 |
| dM-OH | 2.09 | 2.09 | 2.05 | 2.21 | 1.96 |
| MFM-300(In) | |||||
| a | 12.64 | 12.77 | 12.55 | 0.7 | 1.74 |
| b | 12.64 | 12.77 | 12.55 | 0.7 | 1.74 |
| c | 12.64 | 12.77 | 12.55 | 0.7 | 1.74 |
| α | 103.99 | 103.91 | 103.97 | 0.02 | 0.06 |
| β | 103.99 | 103.91 | 103.97 | 0.02 | 0.06 |
| γ | 121.10 | 121.27 | 121.14 | 0.04 | 0.11 |
| Volume | 1506.35 | 1552.03 | 1474.82 | 2.14 | 5.24 |
| dM-Oeq | 2.14 | 2.18 | 2.13 | 0.59 | 2.48 |
| dM-Oax | 2.19 | 2.24 | 2.16 | 1.55 | 3.53 |
| dM-OH | 2.12 | 2.15 | 2.11 | 0.65 | 1.82 |
Each MOF unit cell comprises 72 atoms, including four metal centers that exhibit an octahedral coordination environment formed by the carboxylate groups of the biphenyl-3,3′,5,5′-tetracarboxylate ligand and the bridging μ2-OH group. In addition, Table 1 reports the optimized unit-cell parameters together with selected metal–oxygen bond lengths of the framework. The metal centers exhibit a coordination environment defined by equatorial and axial oxygen atoms, including contributions from hydroxyl (OH) groups (Fig. 2). Given their relevance to the structural robustness and chemical stability of the materials, representative metal–oxygen distances were analyzed and are summarized in Table 1. In this context, d denotes the distance between the metal center and the equatorial oxygen atom associated with the hydroxyl group, d corresponds to the metal–oxygen separation along the axial direction, perpendicular to the hydroxyl plane, as shown in Fig. 2, and d refers to the bond length between the metal center and the oxygen atom directly bonded to the OH ligand.
![]() | ||
| Fig. 2 Unit cells of the MOFs (a) MFM-300(Sc) and (b) MFM-300(In) obtained using the PBE-D2 exchange–correlation functional. | ||
The porosity of these materials is consistent with that reported for the isostructural MFM-300 family. Although the framework forms a fully connected three-dimensional crystalline network, the accessible pore space is best described as a system of one-dimensional (1D) channels running along the crystallographic c-axis, as illustrated in Fig. 3. These channels display an approximately cylindrical shape, with a slightly corrugated internal surface arising from the periodic arrangement of μ2-OH groups lining the pore walls. The effective channel diameters are similar in both materials, with values of approximately 13.6 Å for MFM-300(Sc) and 13.8 Å for MFM-300(In). The slightly larger pore diameter in MFM-300(In) is consistent with the expansion of the framework associated with the larger ionic radius of In compared with Sc. Overall, both materials provide well-defined 1D adsorption environments embedded within a robust three-dimensional framework, making them suitable platforms for assessing how guest position and orientation affect the host–guest potential energy surface.
![]() | ||
| Fig. 3 Pore network of MFM-300(In) constructed through unit cell replication, highlighting the formation of extended channels. | ||
For all these metal–ligand distances, both methods slightly overestimate the experimental values for MFM-300(Sc) and MFM-300(In), although the deviations remain within acceptable limits. Both computational methods accurately reproduced the experimental lattice parameters (a, b, and c) and cell angles (α, β, and γ) for MFM-300(Sc), with mean absolute percentage errors (MAPEs) below 1% for PBE-D2/plane-wave calculations. Slightly larger deviations were observed for MFM-300(In), approaching 2%. The B3LYP-D2/POB-TZVP method yielded approximately half the error of PBE-D2/plane-wave for these parameters. Regarding the unit-cell volume, both methods slightly overestimated the experimental values, with PBE-D2/plane-wave showing a larger deviation. The highest MAPEs were 3% for MFM-300(Sc) and 5.2% for MFM-300(In) using PBE-D2/plane-wave. In contrast, B3LYP-D2/POB-TZVP provided smaller deviations of 1.7% and 2.1%, respectively, demonstrating its superior accuracy in reproducing the experimental structural features. Although slight discrepancies are observed for certain properties, both computational approaches show good overall agreement with the experimental data, with all deviations remaining below 6%. These results confirm that both the B3LYP-D2/POB-TZVP and PBE-D2/plane-wave methods are suitable for accurately reproducing the structural parameters of the studied MOFs, providing reliable results within acceptable error margins.
000 configurations were generated across all MOF–guest systems, providing extensive sampling of the host–guest interaction landscape. For MFM-300(Sc), the number of configurations per adsorbate ranged from 2082 to 3250, while for MFM-300(In) it ranged from 2 183 to 3 471. Each configuration produced by RISSA was evaluated through single-point PBE-D2 plane-wave calculations. Although this stage required significant computational resources, the use of a 400 eV plane-wave cutoff enabled efficient large-scale screening within practical computational times.
Fig. 4 summarizes the distributions of relative energies (ΔE = Econf − Emin) for the configurational ensembles of the different adsorbates confined in MFM-300(Sc) and MFM-300(In). In all cases, the lowest-energy configuration within each ensemble was taken as the zero-energy reference, allowing a direct comparison of the shape and width of the energy distributions independently of the absolute interaction energies. The number of configurations retained after outlier removal is indicated in the figure, and the corresponding statistical descriptors are summarized in Table 2. The statistical descriptors support the trends observed in the violin plots. For both frameworks, larger organic adsorbates such as benzyl alcohol and benzaldehyde exhibit broader ΔE distributions, larger standard deviations, and higher median energies than the smaller gaseous molecules. This behavior indicates a more corrugated configurational landscape, consistent with the larger number of possible orientations, adsorption motifs, and host–guest contacts available to these molecules inside the pore. Benzyl alcohol displays the broadest distribution in both materials, which can be associated with its conformational flexibility and with the coexistence of dispersive and hydrogen-bond-type interactions. Benzaldehyde shows a similar, although slightly less pronounced, energetic dispersion. In contrast, CO and CO2 exhibit lower median ΔE values and lower energetic dispersion, as reflected by their statistical descriptors. These quantitative descriptors make the differences among CO, CO2, and H2S clearer than visual inspection of the violin plots alone. The skewness coefficients reported in Table 2 provide a quantitative measure of the asymmetry of the distributions. Positive skewness values indicate that most configurations are concentrated in a relatively low-energy region, while a smaller number of configurations extend toward high relative energies. This behavior is consistent with the extended high-energy tails observed in Fig. 4. In particular, the smaller gaseous adsorbates show pronounced positive skewness, suggesting that although many sampled configurations remain close to the low-energy region, a limited number of orientations lead to strongly destabilized host–guest arrangements. Overall, the analysis indicates that molecular size, adsorption-site diversity, and interaction complexity determine the ruggedness of the configurational energy landscape in both frameworks. The similar statistical trends observed for MFM-300(Sc) and MFM-300(In) suggest that pore topology primarily governs the qualitative features of the potential energy surface, whereas the identity of the metal center modulates the energetic dispersion more subtly. These results support the need for a selection protocol that preserves both energetic and geometric diversity before final DFT optimization.
![]() | ||
| Fig. 4 Distribution of relative energies (ΔE = Econf − Emin) for the configurational ensembles of different guest molecules adsorbed in MFM-300(Sc) and MFM-300(In). The violin plots represent the probability density of the sampled configurations, while the embedded boxplots indicate the median and interquartile range. Individual sampled points were omitted to improve readability. The number of configurations retained after data cleaning with the Isolation Forest algorithm is indicated for each adsorbate. The x-axis corresponds to the adsorbate molecule and the y-axis reports relative energies in kJ mol−1. Quantitative statistical descriptors of the distributions are reported in Table 2. | ||
| Molecule | Mean | Median | Std deviation | Skewness |
|---|---|---|---|---|
| MFM-300(Sc) | ||||
| Benzyl alcohol | 360.96 | 206.44 | 396.36 | 1.69 |
| Benzene | 283.13 | 116.07 | 373.95 | 1.94 |
| Benzaldehyde | 337.03 | 177.15 | 399.98 | 1.86 |
| CO | 183.50 | 32.69 | 328.43 | 2.80 |
| CO2 | 202.46 | 48.02 | 344.35 | 2.52 |
| H2S | 156.61 | 29.86 | 278.45 | 2.97 |
| Methanol | 199.59 | 47.38 | 335.34 | 2.59 |
| MFM-300(In) | ||||
| Benzyl alcohol | 372.03 | 235.15 | 390.99 | 1.61 |
| Benzene | 271.70 | 119.46 | 373.54 | 2.09 |
| Benzaldehyde | 340.91 | 177.33 | 393.01 | 1.67 |
| CO | 174.37 | 36.62 | 301.85 | 2.73 |
| CO2 | 191.66 | 31.39 | 342.57 | 2.56 |
| H2S | 171.75 | 34.44 | 300.25 | 2.85 |
| Methanol | 169.90 | 38.33 | 304.83 | 2.95 |
000 configurations, full periodic DFT optimization of the complete ensemble would be computationally prohibitive, particularly because the final relaxation includes both guest and framework degrees of freedom. Table 3 should be interpreted by identifying which selection criterion leads to the lowest-energy optimized configurations. If total energy alone were sufficient at the screening stage, the lowest-energy structures would be expected to arise systematically from the EC column. Instead, several minima are obtained from the GP or MLD criteria, indicating that geometrical and chemically motivated descriptors retain adsorption motifs that may be missed by energy ranking alone. Therefore, combining energy-, geometry-, and distance-based criteria provides a more reliable strategy for identifying representative low-energy structures on the MOF–guest PES. The coexistence of several low-lying structures within narrow energy windows reflects the corrugated nature of the PES in confined MOF environments. The purpose of the present protocol is therefore not only to locate a single minimum, but also to identify representative adsorption motifs and their relative energies. Direct comparison with experiment is limited by the availability of complete guest positions; nevertheless, for benzene@MFM-300, comparison with previously proposed structures derived from partial experimental information shows that the present protocol identifies lower-energy adsorption geometries, as discussed below. Fully optimized structures are provided in the supplementary information (SI). The SI contains detailed structural data that may be of interest to researchers seeking to perform further analyses or comparative studies on these MOF–guest systems.
| MFM-300(Sc) | MFM-300(In) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PBE-D2 | B3LYP-D2 | PBE-D2 | B3LYP-D2 | ||||||||
| EC | GP | MLD | EC | GP | MLD | EC | GP | MLD | EC | GP | MLD |
| Benzyl alcohol | |||||||||||
| 1.97 | 3.52 | 0.00 | — | 0.00 | 6.68 | 7.10 | 0.00 | — | 0.00 | — | 3.16 |
| 6.24 | 4.74 | 2.33 | — | 2.53 | — | — | 1.15 | — | — | — | — |
| — | 4.89 | 4.75 | — | 7.39 | — | — | 1.47 | — | — | — | — |
| Benzene | |||||||||||
| 0.21 | 0.00 | — | 0.01 | 0.02 | 0.00 | 0.55 | 0.00 | 5.81 | 0.00 | 3.47 | 3.50 |
| 0.22 | 0.05 | — | 3.80 | 3.37 | — | 3.69 | 0.47 | — | — | 4.13 | 4.14 |
| — | 0.41 | — | 5.12 | 4.57 | — | — | 0.50 | — | — | 4.76 | 5.08 |
| Benzaldehyde | |||||||||||
| 4.11 | 1.21 | 0.00 | 0.32 | 0.90 | 0.00 | 1.89 | 6.20 | 0.00 | 0.25 | 8.08 | 0.00 |
| 4.90 | 5.08 | 0.17 | — | 6.98 | 0.36 | 5.11 | 6.76 | — | 0.82 | 9.15 | — |
| — | 5.55 | 1.21 | — | 7.27 | 0.51 | 5.89 | 8.17 | — | 1.29 | — | — |
| — | — | 5.41 | — | — | 9.73 | — | — | — | — | — | — |
| CO | |||||||||||
| 0.11 | 0.00 | — | 1.18 | 0.00 | — | 0.00 | 0.09 | — | 0.00 | 0.04 | 0.03 |
| 0.43 | 0.16 | — | 1.21 | 0.01 | — | 0.41 | 0.41 | — | 2.32 | 2.37 | — |
| 0.17 | 9.97 | — | 1.33 | 1.14 | — | 0.65 | 2.99 | — | 2.61 | 4.33 | — |
| CO2 | |||||||||||
| 0.26 | 0.00 | 0.51 | 0.53 | 0.03 | 0.00 | 0.66 | 0.00 | — | 0.00 | 0.05 | — |
| 0.51 | 0.31 | — | 0.71 | 0.03 | — | 3.02 | 2.25 | — | 2.37 | 2.07 | — |
| 0.73 | 0.57 | — | 3.45 | 0.43 | — | — | 2.78 | — | 4.44 | 2.23 | — |
| H2S | |||||||||||
| 0.72 | 0.00 | — | 0.05 | 0.00 | — | 0.50 | 0.80 | 0.00 | 0.00 | 0.82 | 0.78 |
| 1.16 | 0.44 | — | 2.51 | 0.07 | — | 0.67 | 0.95 | — | 0.74 | 1.15 | — |
| — | 1.09 | — | — | 0.61 | — | 0.88 | 2.98 | — | 4.17 | 1.83 | — |
| Methanol | |||||||||||
| 0.00 | 5.64 | 4.83 | 0.00 | 0.00 | 8.42 | 4.48 | 3.02 | 0.00 | 5.22 | 0.00 | 0.01 |
| 6.33 | 6.01 | 5.01 | — | 0.08 | 9.21 | 7.03 | 7.00 | — | — | 0.18 | — |
| — | 6.03 | — | — | 0.11 | — | 9.89 | 7.55 | — | — | — | — |
The energies and selected geometrical parameters of the MOF–benzene systems obtained using our protocol were compared with those reported previously by Rivera-Almazo et al.30 To ensure a direct and consistent comparison, and to avoid discrepancies arising from software version differences, single-point energy calculations were performed with CRYSTAL23 on the structures reported by Rivera-Almazo et al.30
A comparison of the empty MOF cells, i.e., in the absence of benzene, revealed only negligible differences between both studies, with the largest energy deviation being 0.003 Hartree. This indicates that the isolated frameworks are essentially equivalent at the level of theory used for the comparison. In contrast, when benzene adsorption is considered, the structures reported by Rivera-Almazo et al. are substantially higher in energy than those identified in the present work, by 106.68 kcal mol−1 for MFM-300(Sc) and 130.44 kcal mol−1 for MFM-300(In). Therefore, these differences arise mainly from the MOF–benzene arrangement and the subsequent host–guest relaxation, rather than from intrinsic differences in the empty MOF frameworks.
The optimized cell parameters and selected metal–oxygen distances remain very similar in both sets of structures. In particular, the parameters a, b, c, γ, dM-Oeq, dM-Oax, and dM-OH show only minor variations. Nevertheless, the superposition of the structures, shown in Fig. 5, reveals subtle but relevant atomic rearrangements of the framework upon benzene adsorption. These displacements are more pronounced for MFM-300(Sc), with deviations in the range of 0.50–0.80 Å, than for MFM-300(In), where deviations of 0.10–0.40 Å are observed. The most important differences, however, are associated with the adsorption site and orientation of the benzene molecule inside the pore.
These rearrangements are also reflected in the optimized unit-cell volumes. Relative to the structures reported by Rivera-Almazo et al., the configurations obtained in this work exhibit volume reductions of −2.30 Å3 for MFM-300(Sc) and −9.62 Å3 for MFM-300(In). These changes suggest a more compact host–guest arrangement, which may enhance cooperative framework--guest contacts and contribute to the additional stabilization. However, the energetic stabilization cannot be rationalized solely in terms of cell-volume changes or the shortest metal–guest distance. For MFM-300(Sc), the shortest C–M distance is 4.41 Å in the structure identified in this work, compared with 4.61 Å in the previously reported configuration. For MFM-300(In), the corresponding distances are 4.54 and 4.50 Å, respectively. Thus, the lower energies obtained here result from the combined effect of benzene reorientation, changes in the adsorption site, and cooperative relaxation of the framework. This analysis demonstrates that the systematic exploration of the PES is able to identify adsorption motifs that are not evident from chemically intuitive initial guesses or partially defined experimental structural information.
It is important to note that the structures reported by Rivera-Almazo et al. were derived from partial experimental information, whereas the configurations presented here result from a systematic and statistically guided exploration of the potential energy surface. Although the distance between the metal center and the nearest carbon atom of benzene is similar in both approaches, the corresponding geometries and energies differ significantly. The stochastic sampling performed with RISSA generates a broad set of initial MOF–guest arrangements, which are subsequently relaxed through full DFT optimization. This finding highlights the robustness of the protocol proposed in this study, which provides a systematic and unbiased framework for exploring MOF–guest configurations when the atomic arrangement of the guest molecule is not experimentally well defined. Overall, our protocol constitutes a useful tool for identifying MOF–guest binding sites and, more generally, for investigating pore--guest interactions.
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d6cp00919k.
| This journal is © the Owner Societies 2026 |