Ruihan Wang,
Shiqi Wang and
Qianji Han*
Chemical Engineering College, Hebei Normal University of Science & Technology, Qinhuangdao, Hebei 066600, PR China. E-mail: hanqianji4193@hevttc.edu.cn
First published on 3rd October 2025
Achieving high SF6 uptake and SF6/N2 selectivity is a key challenge in gas separation. High-throughput computational screening is an efficient strategy to identify high-performing adsorbents. However, these candidates may be overlooked because most studies rely on empirical partial charge assignments. In this study, we present a data-driven workflow that integrates accurate density-derived electrostatic and chemical (DDEC) partial atomic charges into grand canonical Monte Carlo (GCMC) simulations to accelerate the discovery of high-performance MOFs for SF6/N2 separation. By screening the quantum-chemical metal–organic framework (MOF) database, several top-performing candidates with high SF6 uptake and selectivity were identified. The key features for efficient separation were open metal sites, parallel aromatic surfaces, uncoordinated nitrogen atoms, and metal–oxygen–metal bridges. A machine learning model trained on the DDEC-based GCMC results achieved excellent predictive performance (coefficient of determination = 0.968, mean absolute error = 0.281 mmol g−1) and enabled rapid screening of 154144 MOFs within 50 min. Zn-TCPP was selected for validation via density functional theory calculations, confirming the reliability of the proposed workflow. This study illustrates how quantum-chemical datasets facilitate high-throughput material discovery for challenging separations.
Metal–organic frameworks (MOFs) are excellent candidates for high-performance gas separation applications because of their wide range of pore sizes, large surface areas, exceptionally high porosities, and tunable structures.6–8 Using MOFs for SF6/N2 separation has been extensively studied.9–11 Recent advances have demonstrated remarkable performance under industrially relevant conditions (SF6:
N2 = 10
:
90, 1.0 bar). For example, BUT-53 exhibits a record selectivity of 2485 with an SF6 uptake of 3.55 mmol g−1,12 whereas Cu–MOF–NH2 delivers a higher uptake of 7.88 mmol g−1 with a selectivity of 266.2.5 Although the SF6/N2 selectivity of MOFs has reached satisfactory levels, SF6 uptake should be further improved. However, there is an intrinsic trade-off relationship between capacity and selectivity, which is a key challenge in the design of porous materials. Achieving simultaneous optimization of both properties is the key for MOF application in industrial-scale SF6/N2 separation.
Currently, there are over 100000 synthesized MOFs,13 among which high-performance MOFs suitable for industrial-scale SF6/N2 separation should be identified. Nevertheless, experimental testing of this enormous number of MOFs is technically challenging and economically infeasible. To overcome this challenge, high-throughput computational screening can be used to efficiently identify high-performance MOFs from large material databases.14–16 However, there are only few computational screening studies on SF6/N2 separation.5,17,18 For example, Ren et al. employed grand canonical Monte Carlo (GCMC) simulations to evaluate 2513 experimentally synthesized MOFs and identified Cu–MOF–NH2 as a promising candidate.5 In another case, an integrated strategy combining high-throughput simulation with machine learning (ML) was used to screen more than 25
000 MOFs for SF6/N2 adsorption and separation.18 These studies highlight the efficiency of computational and data-driven approaches in identifying high-performing MOFs. Moreover, the predictive accuracy of these computational screening methods help determine the success of subsequent experimental validations, particularly when experimental resources are limited.
To improve the reliability of the simulation-based screening, we previously optimized the force field parameters for CH4 adsorption,19 resulting in considerably improved computational accuracy. In another study, we applied data mining techniques to construct a high-quality experimental dataset for C2H6/C2H4 separation,20 which facilitated the development of ML models with enhanced predictive performance. However, for small hydrocarbons, the commonly used transferable potentials for phase equilibria (TraPPE) force field21 omits partial atomic charges because electrostatic interactions have a limited effect on their adsorption behavior. In contrast, in the cases of SO2, CO2, and SF6, accurate treatment of electrostatic interactions is essential for reliable predictions.22 This distinction underscores the critical need for accurate charge assignment schemes in the modeling of MOF–guest interactions involving these molecules.
Recent developments for overcoming this challenge introduced first-principles-based methods for the partial charge assignment in MOFs. Among these methods, the density-derived electrostatic and chemical (DDEC)23–26 approach is the most accurate method because it can generate chemically consistent and transferable charges for both metal nodes and organic linkers.27 Despite its recognized accuracy, DDEC remains underutilized in large-scale computational screenings. Most high-throughput computational screenings for SF6/N2 separation use empirical charge assignment methods such as charge equilibration28 and its extended variant (EQeq).29 These methods are computationally efficient and capable of producing reliable screening results; however, their empirical nature often limits their ability to accurately reproduce reference DDEC charges, especially in cases involving transition metals in MOFs.30 Nazarian et al.31 demonstrated that EQeq overestimates the charges of alkali and rare earth metals and underestimates those of alkaline earth metals, indicating the inherent bias introduced by parameterization.
To overcome these limitations, large-scale quantum chemical screenings have emerged for generating reliable and reproducible reference data. In particular, recent breakthroughs in this area have focused on constructing quantum chemical MOF databases that incorporate electronic structure information obtained using density functional theory (DFT) calculations. Major examples of these datasets include the Open DAC 2023 (ODAC23)32 and Quantum MOF (QMOF)33 datasets, which provide comprehensive annotations for properties such as DDEC partial atomic charges and band gaps. These quantum chemically derived resources offer a strong foundation for improving the reliability of electrostatic modeling in MOFs. Using these databases, accurate electrostatic interactions can be incorporated into high-throughput screenings which enhances the predictive accuracy of theorical methods for adsorption and separation performance of SF6 on MOFs.
In this work, we propose a data-driven workflow for identifying high-performance MOFs for SF6/N2 separation, making use of the enhanced electrostatic accuracy provided by DDEC partial atomic charges (Fig. 1). GCMC simulations were performed on the QMOF dataset, leading to the identification of a subset of top-performing MOFs that exhibits high SF6 uptake and selectivity. Subsequent analysis revealed that the top-performing MOFs exhibit several main structural features, including open metal sites (OMSs), parallel aromatic surfaces, uncoordinated nitrogen atoms, and metal–oxygen–metal bridges (MOMBs). Furthermore, we developed an ML model that can directly predict SF6 uptake based on the MOF structure. The ML model showed strong predictive accuracy on the test set, with a coefficient of determination (R2) of 0.968 and a mean absolute error (MAE) of 0.281 mmol g−1. It was then applied to a database of 154144 MOFs, predicting additional candidates with favorable properties. DFT calculations further demonstrated its practical potential for SF6/N2 separation.
![]() | ||
Fig. 1 Data-driven workflow of high-throughput GCMC simulations, structure–performance analysis, ML and virtual screening employed in this work. |
RASPA-2.0 (ref. 40) was used to perform all GCMC simulations. In the simulations, 20000 initialization cycles and 50
000 production cycles were conducted. During the simulations, four types of the Monte Carlo moves were defined: translation, rotation, insertion, and swap. The LJ interactions were determined using spherical cutoff of 14 Å with a long-range correction, whereas long-range electrostatic interactions were treated based on the Ewald summation41 method. Lorentz–Berthelot mixing rules were employed to evaluate the interactions between different atom types. The simulation cell of each MOF was constructed by replicating the unit cell in the three dimensions so that its linear sizes were at least twice the spherical cutoff. The SF6 uptake was obtained using single-component GCMC simulations conducted at 298 K and 1.0 bar. The SF6/N2 selectivity was calculated based on binary mixture adsorption simulations with a SF6
:
N2 molar composition of 10
:
90 under the same conditions, according eqn (1).
![]() | (1) |
ΔEMOF-gas = EMOF-gas − EMOF − Egas | (2) |
To further assess the applicability of the method to different MOF topologies and chemistries, we performed additional GCMC simulations under the same conditions for two well-characterized benchmark materials: UiO-66 (ref. 47) and HKUST-1.48 The computed SF6 uptakes were 1.79 and 5.22 mmol g−1, respectively, which are close to the corresponding experimental values (1.67 and 4.98 mmol g−1, respectively). The previously reported selectivities of UiO-66 and HKUST-1, which were obtained via IAST predictions, are 127.8 and 70.4, respectively, whereas our GCMC-derived values were 248.8 and 80.9, respectively. Although the GCMC-derived selectivity values are not fully consistent with the quantitative predictions from IAST, the computed SF6/N2 selectivity trend, in which Cu–MOF–NH2 exhibits higher selectivity than UiO-66 and HKUST-1, agrees well with experimental observations. The observed discrepancy can be ascribed to the intrinsic limitations of IAST, which relies on the assumptions of homogeneous pore filling and uniform adsorbate accessibility. Collectively, these results confirm the robustness of the employed force field (Dellis–Samios model)39 and the accuracy of the DDEC partial atomic charges in describing the MOF–SF6 interactions. This benchmarking analysis confirms that the computational workflow provides a reliable foundation for large-scale computational screening of MOF candidates for SF6/N2 separation applications.
To ensure consistency in the performance benchmarking, Cu–MOF–NH2 was selected as the reference material. Its computed SF6 uptake and SF6/N2 selectivity were 8.03 mmol g−1 and 271.9, respectively. These benchmark values are indicated by red dashed lines in Fig. 2 and were used to facilitate the identification of high-performance candidates. Cu–MOF–NH2 achieves the objective of this study, which is to identify MOFs with high SF6 uptake and competitive separation performance. To identify MOFs with ultra-high selectivities, we also explored an alternative threshold region defined by a SF6/N2 selectivity higher than 1000 and uptake greater than 3.60 mmol g−1. For example, the DOYBEA MOF52 exhibits SF6 uptake and SF6/N2 selectivity of 3.79 mmol g−1 and 1943.9, respectively. Fig. S1 shows the positions of all MOFs meeting this ultra-selective criterion, and their details are listed in Table S4.
![]() | ||
Fig. 2 (a) High-throughput GCMC screening of the QMOF dataset based on SF6/N2 selectivity and SF6 uptake at 298 K and 1.0 bar. |
The qmof-2634ae7 MOF, which was obtained from the GMOF subset in the QMOF dataset, exhibited the highest SF6 uptake (13.65 mmol g−1) in this database with a selectivity of 96.7. In comparison, the LEZPAJ MOF exhibited an exceptionally high selectivity of 79019.1, with a limited uptake (0.88 mmol g−1). This inverse relationship between the uptake and selectivity reflects a common trade-off relationship observed across the dataset and highlights the challenge in simultaneously optimizing both performance metrics. From the 19
560 MOFs evaluated in the QMOF dataset, 12 candidates fell within the upper-right quadrant of the uptake-selectivity plot (Fig. 2). For clarity, the structures are labeled with their CoRE MOF codes. HADLOQ01 (ref. 53) corresponds to Co(bdp), which is a flexible MOF that exhibits an expanded structure only under high-pressure conditions (∼20 bar). However, under the pressure adopted for the SF6 adsorption in this study (1 bar), the structure remains in a collapsed conformation with negligible accessible pore volume (AV), making it effectively inaccessible to SF6 molecules. Therefore, HADLOQ01 represents a false positive in the screening process due to the failure to capture the pressure-dependent flexibility in the static structural model. Other flexible MOFs, such as QUPZIM02 (ref. 54) and QUQBEL,54 were similarly flagged as top candidates but exhibited collapsed structures under the studied SF6 adsorption conditions, leading to overestimated performances. BUVYIB and HECQUB were obtained from the experimental CoRE MOF database, while the others were drawn from the hypothetical subset. It should be emphasized that the use of rigid-framework GCMC can lead to false positives when dealing with pressure-responsive MOFs. For example, HADLOQ01, QUPZIM02, and QUQBEL were flagged as top candidates in our initial screening but were later identified as collapsed structures under the studied SF6 adsorption conditions, resulting in overestimated performances. This underscores the importance of carefully validating the structural stability of promising candidates after the high-throughput stage. For any individual candidate flagged as potentially flexible, we recommend performing one or more of the following validation steps: (i) literature or experimental checks for known open/closed phases; (ii) geometric screening of accessible pore volume and limiting pore diameter at the target conditions.
To further validate the screening results, we examined the top candidate MOFs in detail. None of these candidates have been experimentally reported to date. Interestingly, one candidate, GAFROX,46 exhibits the same topological framework as the experimentally studied Cu–MOF–NH2, with zinc replacing copper as the metal center. Given this close structural similarity, GAFROX is very likely to represent a high-performance material for SF6/N2 separation. A complete list of the top candidates identified via QMOF screenings is provided in Table 1.
MOF name | SF6 uptake (mmol g−1) | SF6/N2 selectivity | Structural features |
---|---|---|---|
qmof-1cc6d2b | 9.28 | 512.4 | PAR 10.3 Å |
qmof-7c9951b | 8.77 | 578.7 | PAR 10.3 Å |
qmof-77a0b3f | 8.83 | 287.6 | PAR 10.1 Å |
QUPZIM02 | 9.28 | 414.8 | PAR 10.4 Å |
HADLOQ01 | 8.95 | 359.3 | PAR 10.1 Å |
QUQBEL | 8.89 | 312.1 | PAR 9.8 Å |
GAFROX | 8.38 | 308.6 | OMSs |
BUVYIB | 8.42 | 498.2 | UN |
HECQUB | 8.83 | 272.3 | UN |
qmof-b890eba | 8.26 | 286.5 | MOMBs |
qmof-94124d1 | 8.05 | 280.0 | MOMBs |
qmof-6f5fabe | 9.64 | 284.5 | MOMBs |
To study the structural features affecting the performance of top-performing MOFs, we systematically analyzed the 12 candidates listed in Table 1. As summarized in Fig. 3, these MOFs consistently exhibit a set of structural features that likely affect their separation performance: (i) OMSs, (ii) parallel aromatic rings, (iii) uncoordinated nitrogen atoms, and (iv) MOMBs.
![]() | ||
Fig. 3 Typical structural features commonly found in high-performing MOFs: (a) open metal sites; (b) parallel aromatic rings; (c) uncoordinated nitrogen atoms; and (d) metal–oxygen–metal bridges. |
Among these structural features, OMSs is critical binding structural motifs that can enhance the gas–MOF interactions (Fig. 3a). OMSs facilitate the adsorption of gas molecules, such as CO2 and C2H4, owing to their ability to engage in Lewis acid–base interactions and π-back bonding mechanisms.55 The framework of GAFROX exhibits readily accessible OMSs, which can serve as strong SF6 adsorption sites. HADLOQ01, QUPZIM02, and QUQBEL exhibit expanded structureonly under high pressure and highly collapsed pore structures with limited accessible porosity under ambient pressure. However, important structural characteristics can be retrieved from their high-pressure structures. These MOFs exhibit an arrangement of nearly parallel aromatic rings, with LCDs close to 10 Å (Fig. 3b). Comparable structural motifs with similar aromatic spacing are observed in qmof-1cc6d2b, qmof-7c9951b, and qmof-77a0b3f. The recurrence of this geometry suggests that as eparation of approximately 10 Å between aromatic surfaces may serve as a reliable structural descriptor for efficient SF6 accommodation and molecular packing in porous materials.
The presence of uncoordinated nitrogen atoms (Fig. 3c), such as in BUVYHE and HECQUB, is another recurring structural feature. Introduction of electron-deficient heteroaromatic rings reduces π-electron density of the ring plane and produces regions of positive local electrostatic potential distribution above the ring. These positively biased surface regions interact favorably with the negatively polarized fluorine shell of SF6, thereby increasing host–guest binding.56
In addition, MOMBs (Fig. 3d) are frequently observed in structures including qmof-6f5fabe, qmof-b890eba, and qmof-94124d1. These MOMBs have previously been identified as performance-enhancing features in large-scale computational screenings for CO2 capture49 and may similarly contribute to SF6 adsorption. Therefore, the simultaneous occurrence of OMSs, parallel aromatic surfaces with optimal spacing, uncoordinated nitrogen sites, and MOMBs can define the structural frameworks of high-performance SF6/N2 adsorbents.
In contrast, SF6/N2 selectivity reaches its maximum within a considerably lower GSA range (1190–1690 m2 g−1) (Fig. 4a). Within this lower range, the compact pore structure enhances the molecular discrimination and reduces nonspecific N2 adsorption. A similar contradiction was observed in the relationship between the performance and the AV. As shown in Fig. 4b, SF6 uptake increased with AV, reaching an optimal range at 1.21–1.51 cm3 g−1. Within this range, the pores provide sufficient space to accommodate numerous SF6 molecules while maintaining favorable interactions. However, SF6/N2 selectivity was maximized at considerably lower AV values (approximately 0.19 cm3 g−1, Fig. 4b). These tighter AV promote size exclusion and reduce N2 adsorption, thereby enhancing selectivity. The effect of the LCD further confirms this inherent trade-off relationship. High SF6 uptake is observed at LCD of 9.6–11.6 Å (Fig. 4c). In contrast, the highest selectivity is achieved at LCD of approximately 6.1 Å, where the pore size is only sufficient to admit SF6 while restricting N2 (Fig. 4c). A similar trade-off relationship was observed with the void fraction. As shown in Fig. 4d, MOFs with void fractions of approximately 0.75 exhibit high SF6 uptake due to the availability of large internal free volumes. In contrast, selectivity is maximized in materials with void fractions between 0.37 and 0.47 because the tighter pore space enhances molecular sieving and restricts nitrogen transport. These results highlight an important challenge in the rational design of MOFs for SF6/N2 separation. Structural features that facilitate high adsorption capacity often decrease selectivity, whereas configurations that enhance selectivity tend to reduce uptake.
To assess whether high-performing MOFs exhibit distinct linker types or topological frameworks, we employed the MOFid package37 to decompose each structure into its fundamental building units. Owing to the limited number of MOFs (only 12) that outperform Cu–MOF–NH2 in the QMOF databases, statistical analysis at this level was not feasible. Therefore, we relaxed the selection criteria to include MOFs with an SF6 uptake and SF6/N2 selectivity greater than 5.22 mmol g−1 and 80.9, respectively. These values are comparable to those of HKUST-1, a well-established benchmark material. Based on these revised criteria, 677 high-performing MOFs from the QMOF database were identified. Fig. 4e and f illustrate the most frequently occurring linkers and topologies among these high-performing materials in the QMOF dataset. Several linkers were functionalized with groups such as –NH2, –F, and –Cl, which are commonly introduced to tune the chemical environment of the framework. These functional groups can affect factors such as polarity, electrostatic interactions, and adsorption site specificity. For example, the introduction of an amino group in YTU-29-NH2 leads to a marked improvement in SF6/N2 separation efficiency compared to the YTU-29 parent framework.57 The most common topologies among the high-performing MOFs were rna, pcu, acs, and sql. These results suggest that certain linkers and topologies are strongly associated with superior SF6/N2 separation performance, and hence, they can serve as fundamental structural motifs in inverse design frameworks.
Building upon this strategy, we employed ML to develop predictive models that directly estimate SF6/N2 separation performance based on structural information. Our prior results indicate that the predictive accuracy and transferability of ML models strongly depend on the quality and reliability of the training set.19 In this study, ML models were developed based on the DDEC-based GCMC results. The first model predicts pure-component SF6 uptake, and the second estimates SF6/N2 selectivity for a 1:
9-gas mixture. These models are hereafter referred to as the SF6 uptake model and the SF6/N2 selectivity model, respectively. To ensure balanced coverage of adsorption values, the QMOF dataset (excluding structures with LCD < 4.7 Å) was split into training, validation, and test sets (80
:
10
:
10) using a label-based binning algorithm based on SF6 adsorption uptake, rather than a purely random split. This strategy guarantees that all adsorption ranges are represented in each subset. The distribution after binning is shown in Fig. S2.
We employed MOF-CGCNN,19 which has been shown to effectively capture local atomic environments. The predictive performance of each model was evaluated using standard regression metrics on the test set, including R2, MAE, and root MSE (RMSE). As illustrated in Fig. 5, the SF6 uptake model demonstrated excellent predictive performance, achieving an R2 of 0.968, with MAE and RMSE of 0.281 and 0.443 mmol g−1, respectively.
![]() | ||
Fig. 5 Accuracy representations for the MOF-CGCNN models and a schematic of the hierarchical screening workflow. |
In contrast, the SF6/N2 selectivity model exhibited considerably lower accuracy, showing an R2 of 0.474 and considerably larger errors (MAE = 63.37 and RMSE = 86.26). Extensive hyperparameter optimization failed to improve the selectivity model to a level suitable for practical deployment. This may be attributed to the fact that selectivity is a ratio of two uptake values, which amplifies noise and compound errors from both predictions, rendering its accurate modeling inherently more difficult.
To demonstrate the practical applicability of the ML model, we performed ML-based virtual screening on a large-scale dataset comprising 154144 MOFs from the ODAC23,32 hMOF,62 and ToBaCCo51 databases. Owing to the limited predictive accuracy of the SF6/N2 selectivity model, we adopted a hierarchical screening workflow to efficiently identify high-performing candidates (Fig. 5). In the first stage, the SF6 uptake model was applied to the full set of 154
144 MOFs. This large-scale prediction task was completed in approximately 50 min using a 40-core server and resulted in the identification of 2525 structures with predicted SF6 uptakes exceeding 8.03 mmol g−1. These 2525 candidates were then subjected to GCMC simulations to obtain more reliable estimates of their SF6 uptake and SF6/N2 selectivity as shown in Fig. 5.
By filtering out over 98% of the structures before the first stage, this approach drastically reduced the computational cost. The candidates were then ranked according to their calculated SF6 uptake and selectivity, identifying three hypothetical MOFs and one experimentally reported structure (CoRE MOF code: NAWXER, hereafter referred to as Zn-TCPP)63 that exhibited SF6 uptake greater than 8.03 mmol g−1 and selectivity higher than 271.9. Zn-TCPP is built from paddlewheel-type Zn2(COO)4 clusters, each coordinated equatorially by four 2,3,5,6-tetrakis(4-carboxyphenyl) pyrazine (TCPP4−) linkers. Axial DMF ligands can be removed to generate OMSs. GCMC calculations for Zn-TCPP yielded SF6 uptake of 9.67 mmol g−1 and SF6/N2 selectivity of 335.77, placing it among the top-performing materials in the 154144 MOF dataset. Detailed SF6 uptake and selectivity values for the three hypothetical MOFs are provided in Table S5. The corresponding GCMC calculated adsorption isotherms are displayed in Fig. 6. The integration of predictive modeling with large-scale screening offers a powerful route for exploring the chemical space. The full datasets and corresponding source code are available on GitHub at https://github.com/ruihwang/SF6_MOFCGCNN/.
![]() | ||
Fig. 6 GCMC calculated adsorption isotherms of the five top-performing MOFs identified from the ML-based virtual screening. |
Among the top-performing candidates, hMOF-5035619 and hMOF-5060764 feature electron-donating functional groups, such as –NH2 and –OH, grafted onto their aromatic linkers. Both structures also exhibit parallel aromatic rings with linker center distances of 10.3 and 9.7 Å, respectively. These structural characteristics confirm the established structure–performance relationship, in which an optimal spacing of ∼10 Å between aromatic surfaces correlates with enhanced SF6/N2 separation efficiency. The framework of Zn-TCPP prominently shows OMSs and uncoordinated nitrogen atoms, which are two structural elements previously identified as key contributors to SF6/N2 selectivity. Given that the synthesis of hypothetical MOFs remains to be further explored, we primarily focused on Zn-TCPP that has already been experimentally reported.63
To gain further molecular-level insights into the high selectivity of Zn-TCPP for SF6 over N2, periodic dispersion-corrected DFT calculations were conducted on representative configurations based on GCMC simulations (Fig. 7). These calculations revealed that the most favorable SF6 adsorption site was located at the OMS, with a binding energy of −39.59 kJ mol−1. At this site, the fluorine atom of SF6 and the Zn center are 2.704 Å apart, indicating strong electrostatic attraction. A secondary binding site was identified within a confined pocket formed by four aromatic rings. In this environment, the distances from the fluorine atoms in SF6 to the aromatic ring centroids are 3.057–3.688 Å. Moreover, an F⋯N contact of 3.004 Å further confirms the presence of noncovalent interactions. The total binding energy at this site was calculated to be −21.91 kJ mol−1. In contrast, the binding energy of N2 is substantially lower (−10.47 kJ mol−1), underscoring the preferential affinity of the MOF for SF6. Therefore, these DFT results validate the adsorption performance predicted by the GCMC simulation and provide a detailed energetic framework for understanding the origin of the selective SF6 uptake of Zn-TCPP, revealing its potential for efficient SF6/N2 separation.
![]() | ||
Fig. 7 Probability density distributions of SF6 and N2 molecules based on GCMC simulations. Binding sites of SF6 and N2 molecules in the Zn-TCPP pores based on the DFT optimization calculations. |
A quantitative structure–performance relationship analysis confirmed the fundamental trade-off relationship between the uptake capacity and selectivity. The maximum SF6 uptakes were observed in MOFs with GSA of 3700–4300 m2 g−1, AV of 1.21–1.51 cm3 g−1, void fractions close to 0.75, and LCD of 9.6–11.6 Å. In contrast, the highest SF6/N2 selectivities were associated with MOFs with considerably lower GSA values (1190–1690 m2 g−1), low AV (∼0.19 cm3 g−1), void fractions of 0.37–0.47, and narrow LCDs centered around 6.1 Å. In addtion, the most common topologies among the high-performing MOFs were rna, pcu, acs, and sql.
Furthermore, the ML model for SF6 uptake, which was trained on DDEC-based GCMC simulation data, achieved an R2 of 0.968 and an MAE of 0.281 mmol g−1. Leveraging this model, we efficiently screened 154144 MOFs, among which Zn-TCPP was chosen as a representative candidate for proof-of-concept validation using DFT calculations. Overall, this integrated workflow provides a scalable and transferable model that can accelerate the development of advanced porous materials for challenging molecular separations.
The Supplementary information: includes the Lennard–Jones parameters employed in this work, the list of top-performing MOFs and hypothetical MOFs under specified conditions, and the distribution of SF6 adsorption uptakes in the training, validation, and test sets after label-based binning. See DOI: https://doi.org/10.1039/d5ra06266g.
This journal is © The Royal Society of Chemistry 2025 |