Open Access Article
Junhui Kou
abc,
Tianle Liu
*ab,
Guosheng Jiangab,
Guokun Yangab,
Zerang Liab,
Xiaoyang Nib and
Hao Liub
aUnconventional Cementing and Special Reinforcement Laboratory, China University of Geosciences, Wuhan 430074, China. E-mail: liutianle@cug.edu.cn
bFaculty of Engineering, China University of Geosciences, Wuhan, 430074, China
cDepartment of Applied Science and Technology, Politecnico di Torino, Duca degli Abruzzi 24, Torino 10129, Italy
First published on 2nd March 2026
The screening and rational design of metal–organic frameworks (MOFs) with optimal methane capture performance remain a critical challenge for environmental and energy applications. Existing research often emphasizes single-point adsorption capacity, overlooking the dynamic mechanisms relevant to pressure swing adsorption (PSA) required for practical use. To address these challenges, this study developed a multi-task learning (MTL) framework for high-throughput screening of methane adsorption in MOFs. The proposed model integrates attention and screening mechanisms to predict the methane working capacity using a dataset of 252
352 MOFs characterized by geometric and chemical descriptors. Shared parameters were leveraged across adsorption prediction tasks under six pressure conditions, achieving high predictive accuracy, with R2 values of 0.992 and 0.962 for gravimetric and volumetric working capacities, respectively. SHAP analysis within the MTL context identified shared underlying mechanisms governing adsorption at both low and high pressures. Key descriptors influencing adsorption capacity included accessible pore volume and specific surface area, alongside chemically relevant atomic identity, covalent radius, and nuclear charge. Variations in feature importance from low to high pressures reflected the shifts in methane adsorption mechanisms. This MTL model provides a novel approach to accelerate the discovery of MOFs with high working capacities, offering new insights into methane adsorption mechanisms in MOFs under varied pressure regimes.
Enhancing the structural design, functional groups, and adsorption sites of MOFs to develop materials with increased methane uptake has become a central focus of current research.18,19 Numerous studies20,21 have aimed to push the limits of methane adsorption in MOFs. Gándara et al. employed a strategy of dangling BTB ligands to tailor the pore structure of MOFs, reducing the pore size to 7.6 Å, and synthesized MOF-519, which exhibited a record methane uptake of 279 cm3 cm−3 at 80 bar.22 This work demonstrated the critical role of optimizing pore structure for enhanced methane adsorption. These successful studies highlight that understanding the key material properties and their synergistic effects during adsorption is essential to advancing MOFs' methane uptake performance.23 However, experimentally investigating the individual and combined effects of various factors on methane adsorption remains a complex process that requires substantial time and resource.24 Compared to traditional experimental methods based on experience and trial-and-error, machine learning (ML) provides a more efficient way to accelerate research progress.19,25,26 Fernandez et al. applied classical ML models to identify high-performance MOFs for carbon capture and separation.27,28 The application of ML relies on material descriptors converted from chemical or physical information as input features, which are the key to interpret the resulting predictions,29 to train predictive models. As research progressed, the variety of MOF descriptors has expanded to hundreds or even thousands, and this large number of feature types may complicate model training and optimization.30 Selecting appropriate descriptors is key to reducing bias, ensuring generalization, and improving research efficiency in ML models.31 The development of interpretable ML techniques holds promise to address this challenge.32 Numerous studies have employed Shapley additive explanations (SHAP)33 to conduct interpretable analyses of MOFs' structure–property relationships.30,34,35 Wei et al. utilized permutation feature importance and transfer learning to investigate methane adsorption in MOFs, highlighting the significance of methane-related descriptors under high-pressure conditions.36 Interpretable ML bridges hypothetical MOF data with experimental, reducing the experimental time and costs.25,37 In summary, advances in ML currently provide efficient assistance for MOF design and performance prediction, but challenges remain, including training data quality and model optimization, which require further research.
Although extensive research has focused on maximizing the methane adsorption capacity, pressure swing adsorption (PSA) represents the key technique used in industrial applications.38–40 Materials with effective working capacity are characterized by a larger difference in methane loading between high-pressure adsorption and low-pressure desorption.41 However, current high-throughput screening and predictive models predominantly focus on single-point adsorption capacities,42,43 with few studies addressing the dynamic characteristic of the PSA process. Since PSA involves adsorption at high pressure and desorption at low pressure, understanding the mechanisms and characteristics of MOFs under these conditions is essential for designing materials with high working capacities. In fact, conventional molecular simulations and ML or neural network models cannot effectively predict this dynamic process. This limitation impedes the identification of MOFs suitable for practical methane storage applications. To address this issue, several researchers have attempted to predict methane adsorption isotherms of MOFs. For example, existing studies used multilayer perceptron models and deep feed-forward neural networks to train methane uptake data at six different pressures and thereby fit isotherms.44,45 However, these approaches treat adsorption predictions at each pressure as separate tasks, overlooking the inherent correlations among adsorption behaviors across different pressures. This task separation not only results in inefficient use of computational resources but also restricts the model's ability to capture the underlying shared mechanisms between varying pressure adsorption, thereby diminishing the prediction accuracy and generalization performance.
In this study, the multi-task learning (MTL) approach is intended to address the challenges described above. As an effective deep learning paradigm, MTL enables simultaneous learning of multiple related tasks within a single model, facilitating information sharing and collaborative optimization.46 For example, some researchers developed an MTL model to selectively adsorb oils and phenolic pollutants from coal chemical wastewater, simultaneously predicting adsorption rates and capacities, and revealing co-adsorption mechanisms in complex systems.47 Other researchers designed an MTL model based on soft parameter sharing to address the dual challenge of cadmium contamination and methane emissions in soil remediation using biochar, achieving superior prediction of their synergistic effects compared to traditional single-task models.48 In the context of methane adsorption prediction, MTL treats adsorption capacities at different pressures as interconnected output tasks. By leveraging intrinsic relationships among these tasks, it captures both shared features and distinct mechanisms between high- and low-pressure adsorption processes. This approach enables a more comprehensive understanding of MOFs' adsorption behavior across varying pressures and improves the prediction accuracy.
To address the challenges in designing MOFs and to explore efficient methods for screening materials with practical methane working capacity, this study introduces MTL into the development of adsorption prediction models for the first time. The approach integrates the attention mechanism and working capacity screening mechanism to establish a novel predictive framework. Adsorption and working capacity prediction models were successfully developed, demonstrating superior performance under multiple pressure conditions compared to existing studies. The intrinsic relationships learned between low- and high-pressure adsorption tasks by the MTL model were thoroughly investigated and interpreted through SHAP analysis. The interactive contributions of individual and combined descriptors to MOF adsorption performance were systematically examined. By exploiting inherent correlations among adsorption properties, a more comprehensive and practical evaluation of MOF performance was achieved using this approach. This study enhanced the accuracy of methane working capacity predictions and introduced a novel method to accelerate the discovery of MOFs with practical application potential.
352 structures.
To construct the input feature space, both geometric and chemical descriptors were incorporated into the predictive models. Fourteen geometry-related parameters were extracted for each MOF using the Zeo++ software package,53 employing a methane-sized probe radius of 1.86 Å.51 These descriptors, which have been extensively validated in prior studies,42,54 encompass metrics describing the pore size, accessible volume, surface area, and framework density. A complete list and definitions of the geometric descriptors are provided in Table S1.
Chemical descriptors were characterized using revised autocorrelation function (RAC) descriptors. The RAC methodology was originally introduced by Janet et al.55 and subsequently extended by Moosavi et al.56 to enable its application to periodic MOF systems. By encoding correlations among heuristic atomic properties within graph-based representations of MOFs, RAC descriptors provide an effective means to probe structure–property relationships.57 In this work, RAC features were derived from MOF graphs using atomic identity (I), coordination environment and topology (T), Pauling electronegativity (χ), covalent radius (S), and nuclear charge (Z) as primary attributes, while polarizability (α) was additionally incorporated to describe linker-mediated interactions. The RAC formulation is given in eqn (1), and a total of 176 chemical descriptors were generated.
![]() | (1) |
denotes the aggregated difference in a given atomic property between atoms belonging to the start list and those in the corresponding scope list at a bond distance d. The atomic terms Pi and Pj correspond to the property values of atoms i (selected from the start atom list) and j (selected from the scope atom list), respectively, and di,j defines their bond-based distance within the MOF graph. Molecular graph generation and the classification of start and scope atom sets were carried out according to the methodology reported by Moosavi et al.56
After feature extraction, 14 geometric descriptors and 176 chemical descriptors were obtained.
| Task | Pressure | Temperature | Application |
|---|---|---|---|
| T1 | 0.01 bar | 338 K | Landfill gas treatment |
| T2 | 0.9 bar | 298 K | Methane purification |
| T3 | 4.4 bar | 338 K | Landfill gas treatment |
| T4 | 5.8 bar | 298 K | Methane separation and purification |
| T5 | 9 bar | 298 K | Methane purification |
| T6 | 65 bar | 298 K | Methane storage |
![]() | ||
| Fig. 1 Schematic diagram of the MTL model incorporating the attention mechanism and the PSA screening mechanism developed in this study. | ||
| Cw = Qads−Qdes | (2) |
:
2. Model robustness was evaluated using five-fold cross-validation on the free training set, with each fold optimized over 100 training epochs.
A series of optimization strategies were implemented to enhance the performance of the MTL model. The adaptive learning rate optimizer Adam was employed to adjust the learning rate, with the mean squared error (MSE) selected as the loss function. To address varying task importance in MTL, a dynamic task weighting mechanism was applied, in which task weights were updated inversely proportional to their losses. Tasks exhibiting higher losses received increased weights, whereas those with lower losses were assigned reduced weights. A total of 200 training epochs were set to ensure sufficient iteration and improve model generalization. Adjustable batch sizes of 32, 64, and 128 were tested to control the number of samples used per parameter update. Dropout layers were incorporated into the shared layers to prevent overfitting by randomly deactivating a proportion of neurons, thereby enhancing model robustness. Dropout rates ranging from 0.2 to 0.7 were determined via hyperparameter optimization.
Model performance was evaluated using the coefficient of determination (R2) and root mean squared error (RMSE). R2 quantifies the proportion of variance in the data explained by the model, with higher values indicating a better fit to the observed data. The RMSE reflects the precision of model predictions and places greater emphasis on larger deviations, providing insight into the model's performance on extreme values. The formulae of R2 and RMSE are provided in eqn (3) and (4), respectively.
![]() | (3) |
![]() | (4) |
The resulting model enables the prediction of methane uptake and working capacity, guiding the identification of MOFs with superior performance.
![]() | (5) |
Descriptor selection in the initial model development often included redundant features. After calculating descriptor importance, the contribution rankings for the six training tasks were established. A comprehensive importance ranking was obtained by averaging the absolute importance values of each descriptor across the six tasks. Based on the integrated importance ranking, the top twelve descriptors were retained and used to rebuild the prediction model. Model retraining was performed on the original dataset with the same hyperparameter optimization scheme, aiming to streamline the feature space, decrease computational demand, and improve generalization performance.
976, respectively, while their means concentrate at lower values of 47.83 and 483.42. Although most samples exhibit coordination environments in the lower range, the long tails indicate a minority with highly complex coordination networks corresponding to functionally unique MOFs. The nuclear charge descriptor f-lig-Z-1 spans from 78.86 to 33
248, with a mean of 2190.62, reflecting significant diversity in ligand elemental composition across a wide range of low to high nuclear charges. Pauling electronegativity descriptors f-lig-chi-3 and f-lig-chi-0 show mean values of 945.68 and 232.41, with maxima reaching 15
631.39 and 3332.93, respectively. Covalent radius descriptors F-lig-S-2 and F-lig-S-3 present mean values of 53.58 and 64.42, with maximum values of 920.44 and 1121.64, respectively. Overall, the numerical distributions of these descriptors reflect the high diversity of MOF ligands in atomic composition, topology, and nuclear properties. This diversity may offer valuable reference points for further understanding structure–performance relationships and for guiding the screening of high-performance MOFs.
Statistical analysis of UCv and related volumetric descriptors for MOF samples revealed wide distribution ranges for UCv, AVA, and POAVA, spanning from 0.4407 Å3to 198
068 Å3, as illustrated in Fig. 2D. The means of these descriptors concentrate in the lower range, measured at 6310.63 Å3, 3007.67 Å3, and 4507.69 Å3, respectively, with a small number of MOF samples significantly exceeding these averages. The UCv and related volumetric descriptors display clear right-skewed distributions, indicating that most MOF samples cluster within smaller UCv, while a minority with exceptionally large volumes elevate the maximum values substantially. Similar distribution patterns appear in pore size descriptors LCD, PLD, and DFSP, as shown in Fig. 2E, with values ranging from approximately 0.50 to 68.87 Å. This wide span reflects the diversity of pore structures within the sample set, covering very small micropores to larger channels. The means of these descriptors are 11.05 Å, 8.33 Å, and 10.98 Å, respectively, with distributions also right-skewed. The majority of pore sizes cluster at smaller scales, while a few samples exhibit notably larger pores. Distributions of volumetric descriptors Vg and POVg, shown in Fig. 2F, range from approximately 0.00025 to 15.70 cm3 g−1, demonstrating significant variability in pore volumes among the samples. Mean values register at 0.68 cm3 g−1 and 1.19 cm3 g−1, with overall right-skewed distributions. Fig. 2G and H present distributions for other six descriptors. SAv ranges from 42.93 to 3474.6 m2 cm−3, and SAg ranges from 23.33 to 8748.9 m2 g−1. Both exhibit wide distribution ranges with means of 1745.01 m2 cm−3 and 3153.68 m2 g−1, respectively, reflecting considerable diversity in pore surface areas across the dataset.
Fig. 2I illustrates the distribution of methane adsorption capacities for MOFs under six pressure conditions. Overall, the methane adsorption capacity shows a clear increasing trend with rising pressure, as both the mean and median values increase progressively. The median adsorption capacity increases from approximately 0.013 mmol g−1 under low pressure to about 15.14 mmol g−1 at high pressure, indicating a significant enhancing effect of pressure on adsorption performance. The distribution under low pressures (0.01 bar to 9 bar) is relatively concentrated, as evidenced by narrow and sharply peaked violin plots. In contrast, at 65 bar, the adsorption capacity distribution becomes more dispersed, reflecting greater diversity among MOF samples and a broader range of adsorption capability distribution under high-pressure conditions.64 At the highest pressure of 65 bar, a pronounced long-tail distribution appears, where several samples exhibit adsorption capacities far exceeding the majority. These outliers correspond to MOFs with unique structures or exceptional pore characteristics, which play a critical role in elevating overall performance. The distributions of methane adsorption capacity across varying pressures not only demonstrate the promoting effect of pressure but also reveal performance differences and diversity among samples. These insights may provide a strong foundation for a deeper understanding of MOF adsorption behavior and for optimizing screening strategies.
470 samples excluded from model training was employed to evaluate the predictive performance of the MTL model. The training loss curve of the model is shown in Fig. S1A. Table 2 summarizes the test results of the full-feature model and the optimized model with simplified features across six adsorption prediction tasks. The model achieved R2 values exceeding 0.900 on all six tasks, indicating strong data fitting across all prediction targets. The highest R2 of 0.989 was obtained for T6, with R2 values generally increasing as the pressure associated with the prediction tasks increased. RMSE values should be interpreted in the context of each task's label range. Overall, RMSE results across all six tasks remain within acceptable limits, reflecting low prediction errors. Fig. 3 compares the true values and predicted methane adsorption capacities across the six tasks of the MTL model. Data points for T6 predominantly align along the diagonal, demonstrating strong predictive consistency for this high-pressure task. Other tasks exhibit varying degrees of outliers, notably under low-pressure adsorption conditions (e.g., T1, T2, T3, and T4), indicating reduced predictive consistency compared to high-pressure tasks. Fig. S2 presents the R2 training curves, which rise rapidly during early epochs, reflecting rapid feature learning by the model. The R2 curves stabilize with increasing epochs, exhibiting minimal fluctuations. Consistent trends between training and test sets suggest high fitting stability without signs of overfitting.
| Task | Full-feature model | Feature-optimized model | Feature values | |||
|---|---|---|---|---|---|---|
| R2 | RMSE | R2 | RMSE | Min value | Max value | |
| T1 | 0.902 | 0.002 | 0.816 | 0.003 | 1.0 × 10−6 | 0.085 |
| T2 | 0.924 | 0.157 | 0.856 | 0.216 | 3.0 × 10−6 | 4.197 |
| T3 | 0.920 | 0.127 | 0.866 | 0.164 | 2.0 × 10−5 | 3.204 |
| T4 | 0.924 | 0.398 | 0.852 | 0.554 | 9.4 × 10−5 | 11.302 |
| T5 | 0.941 | 0.349 | 0.895 | 0.467 | 7.9 × 10−5 | 10.384 |
| T6 | 0.989 | 0.716 | 0.984 | 0.848 | 4.0 × 10−4 | 63.146 |
![]() | ||
| Fig. 3 Distribution of predicted and true values for the full-feature model: (A) T1; (B) T2; (C) T3; (D) T4; (E) T5; (F) T6. | ||
![]() | ||
| Fig. 4 Distribution of predicted and true values for the feature-optimized model: (A) T1; (B) T2; (C) T3; (D) T4; (E) T5; (F) T6. | ||
![]() | ||
| Fig. 5 Comprehensive global interpretation (average feature importance) and comprehensive local interpretation (SHAP value distribution) for the six prediction tasks. | ||
470 MOF samples, which were not included in the model training. Working capacities were calculated at two PSA pressure ranges: PSA1 (65 bar to 5.8 bar) and PSA 2 (9 bar to 0.9 bar). The performance of models are summarized in Table 3. Both the full-feature and feature-optimized models achieved R2 values exceeding 0.9. The highest R2 of 0.992 was obtained by the full-feature model for PSA1 screening.
| Working capacity | Models | PSA1 | PSA2 | ||
|---|---|---|---|---|---|
| R2 | RMSE | R2 | RMSE | ||
| Gravimetric working capacity | Full-feature model | 0.992 | 0.622 | 0.953 | 0.320 |
| Feature-optimized model | 0.991 | 0.693 | 0.924 | 0.407 | |
| Volumetric working capacity | Full-feature model | 0.962 | 8.724 | 0.931 | 5.448 |
| Feature-optimized model | 0.951 | 9.891 | 0.894 | 6.725 | |
The descriptor f-lig-I-1 has the highest rank in overall feature importance. It characterizes the chemical environment of the first coordination shell surrounding the core atoms of MOF functional groups.55 The atomic composition of this shell directly reflects the local changes at adsorption sites, thereby influencing their activity. Higher f-lig-I-1 values indicate a greater number of bonds within the ligand and a larger ligand size, which impacts the pore size and shape near adsorption sites. Local importance plots reveal that f-lig-I-1 exerts both positive and negative effects, with a generally positive influence on predicted adsorption capacity in T4. f-lig-S-3 and f-lig-S-2 ranked second and fourth in overall importance, respectively. These represent statistical measures of covalent radii for atoms located in the third coordination shell and beyond, and the second coordination shell, around the functional group's core atoms. They capture the volume and spatial environment of atoms at middle and distal distances, affecting MOFs' pore shape and size, which subsequently influence methane accessibility and diffusion. These findings suggest that f-lig-I-1 influences the electronic properties of adsorption sites in MOF samples for methane adsorption, while f-lig-S-3 and f-lig-S-2 represent covalent radii reflecting the spatial distribution of ligand atom sizes within the molecular topology. These descriptors affect the spatial framework of MOFs, determining the molecular accessibility and binding efficiency. The high importance of these three chemical descriptors indicates that capturing key factors governing adsorption performance requires consideration of both the local electronic environment near functional groups and the mid-to-long-range spatial structure.
Among geometric descriptors, POVf and SAv ranked third and fifth, respectively. POVf denotes the fraction of pore-accessible volume, representing the effective pore space available for methane molecules.66 SAv corresponds to the surface area per unit volume, which directly relates to the number of available adsorption sites.67 Other geometric descriptors among the top twelve include SAg, density, and Vf. Although the model was trained using a substantially larger number of chemical descriptors than geometric ones (176 and 14), nearly half of the top twelve features were geometric descriptors (5 in total). This highlights the significant role of geometric features in prediction performance. The prominence of these geometric descriptors underscores the importance of effective pore volume, surface area, and density related metrics for methane adsorption in MOFs, consistent with conclusions drawn from extensive prior studies.45,68 From the perspective of integrated multi-task descriptor importance, factors influencing methane adsorption performance range from the local chemical environment of functional groups at the microscopic scale, through mid- and long-distance spatial constraints, to macroscopic effective pore volume and surface area. These insights should guide the design of MOFs optimized for high-performance methane adsorption.
![]() | ||
| Fig. 6 Global interpretation and local interpretation for the six predictive tasks: (A) T1; (B) T2; (C) T3; (D) T4; (E) T5; (F) T6. | ||
As pressure increased to intermediate levels (4.4 bar, Fig. 6C; 5.8 bar, Fig. 6D; 9 bar, Fig. 6E), the MTL model revealed consistent patterns in feature importance rankings for predicting the methane adsorption capacity. Topological and chemical differential descriptors associated with various atom types, metal-centered, functional group, and linker-centered, such as D_mc-T-3-all, D_func-l-3-all, and D_lc-Z-3-all, along with descriptors capturing medium- and long-range topological and chemical environments (f-T-3-all, f-lig-I-1, and func-chi-3-all), dominated the rankings. These differential features indicate that methane adsorption performance at intermediate pressures strongly depends on local chemical heterogeneity and the spatial distribution of electronic structures within the material.
Notably, the relative importance of geometric descriptors declined with increasing pressure, replaced by multilayered chemical descriptors grounded in topological differentials and electronic properties. This finding underscores the importance of chemical modification of functional groups and multiscale optimization of linker electronic structures for enhancing methane adsorption at elevated pressures. The top-ranked descriptors were f-chi-1-all, DFSP, and f-chi-0-all in T6 (Fig. 6F), with normalized mean absolute SHAP values of 0.33, 0.23, and 0.21, respectively. This result highlights the dominant role of functional group electronegativity under high pressure adsorption, reflecting the modulation of local electronic environments that influence the polarization ability of adsorption sites and thus affect the interaction strength with methane molecules.69 Under high pressure adsorption conditions, the geometric descriptor DFSP also maintained high importance, indicating that pore connectivity and methane accessibility remain critical factors even at near-saturation adsorption stages. Effective enhancement of methane adsorption performance under high pressure requires concurrent optimization of functional group electronic environments and reasonable pore structure design to ensure smooth methane ingress and efficient filling.70
Further investigation focused on SHAP dependence relationships for descriptors suspected to exhibit feature interactions based on prior importance analyses. Fig. 7 illustrates the SHAP dependence relationships capturing the interaction between the descriptor f-lig-I-1 and geometric descriptors related to pore size and shape in T3. Similar interaction patterns were observed between f-lig-I-1 and the three pore-related geometric descriptors. When geometric descriptor values were small, the positive contribution of f-lig-I-1 to methane adsorption gradually diminished as its value increased. After f-lig-I-1 exceeds 32, SHAP values shifted to negative, and the negative contribution to the adsorption performance gradually increases. When geometric descriptor values were high, f-lig-I-1 consistently exhibited negative contributions, with the magnitude of negative impact increasing more moderately as the descriptor value increased.
Fig. 8C and D depict the synergistic effects of Vf, SAg, and density on methane adsorption in MOFs for the T3 (4.4 bar) and T6 (65 bar) tasks, respectively. The results show that, in the T3 task, high adsorption samples were mainly concentrated at low density (<1 g cm−3) and within specific ranges of SAg and Vf. At high density (>1 g cm−3), the adsorption capacity increased with increasing POVf. At low density, adsorption initially decreased and then increased as POVf increased. For the T6 task, samples with high adsorption were primarily found under conditions of low density and high SAg and Vf, with adsorption capacity increasing as the density decreased and SAg and Vf increased. Fig. S10 and S11 illustrate the interactive effects of four descriptors on methane adsorption for other tasks. The results indicate that, under high-pressure conditions, methane adsorption in MOFs increased significantly with decreasing density and increasing POVf and Vf, exhibiting a clear linear correlation. In contrast, during low-pressure adsorption, the relationship between adsorption capacity and density, POVf, and Vf did not follow a simple monotonic pattern, but rather clustered within specific ranges of geometric descriptors, exhibiting nonlinear response characteristics. These findings indicate significant differences in the mechanisms by which geometric descriptors influence methane adsorption performance under different pressure conditions.
Consistent trends were observed between performance and the atomic identity descriptor (f-lig-I-1), covalent radius descriptor (f-lig-S-3), and nuclear charge descriptor (f-lig-Z-1) in both working capacity and methane adsorption. Although high adsorption MOFs exhibited dispersed distributions across these descriptor attributes, clustering of high-performance MOFs was still detected within specific attribute ranges, primarily located at lower attribute values.
Unlike automotive fuel tanks that focus on the gravimetric performance of adsorbents, industrial applications and methane storage prioritize the volumetric efficiency. Therefore, the volumetric working capacity serves as a crucial criterion for selecting high-performance materials. Tables S4 and S5 present the top-performing materials by volumetric working capacity for PSA1 and PSA2, along with their representative descriptors and topology features. Crystal structures of these high volumetric capacity MOFs are shown in Fig. S13 and S14. Comparison of volumetric working capacities indicates that the top value under PSA1 (213.45 cm3 cm−3) significantly exceeds that under PSA2 (113.07 cm3 cm−3). Descriptor analysis reveals significant differences between the optimal ranges of descriptors for top volumetric working capacity MOFs under PSA1 and those for top gravimetric working capacity MOFs. The most notable geometric descriptor ranges for volumetric capacity include a density of 0.48 ± 0.01 g cm−3, SAg of 5193.60 ± 178.87 m2 g−1, POVf of 0.75 ± 0.01, and SAv of 2487.99 ± 113.93 m2. This distinction highlights how gravimetric and volumetric criteria reflect different geometric structural features of high-capacity MOFs, where the former corresponds to properties of reduced mass per unit and the latter to properties of reduced volume per unit. Top-performing MOFs for volumetric working capacity exhibit similar characteristics, especially in geometric descriptors. The coefficients of variation for density, SAg, POVf, and SAv under PSA1 are 0.02 g cm−3, 0.03 m2 g−1, 0.01, and 0.05 m2, respectively, whereas those under PSA2 are 0.10 g cm−3, 0.15 m2 g−1, 0.02, and 0.06 m2, respectively.
Methane working capacities (desorption at 5 bar) of top-performing MOFs identified in this study were compared to those of experimentally synthesized nanoporous materials reported in the literature71,72 under conditions of 65 bar and 298 K.73,74 The results are presented in Fig. 11. Fig. 11A shows that among the surveyed experimental MOFs, VNU-22 synthesized by Tu et al.75 surpassed 0.5 g g−1 in gravimetric working capacity, exceeding the methane storage target established by the U.S. Department of Energy (DOE). The top-performing MOFs screened in this study demonstrated gravimetric working capacities exceeding that of VNU-22. Notably, m2_o33_o24_xaq exhibited the highest gravimetric working capacity at 0.929 g g−1, with its crystal structure shown in Fig. 10A. Fig. 11B illustrates that UTSA-76a, ZJU-5a, and HKUST-1 all exceed 188 cm3 cm−3 in volumetric working capacity, while hypotheticalMOF_5082031_1_0_2_29_13_0, identified in this study, achieved the highest volumetric working capacity of 213 cm3 cm−3. Its crystal structure is shown in Fig. S13A. These results demonstrate that the MOFs identified in this study possess the potential to surpass current synthesized materials in methane working capacity performance. Moreover, it should be noted that practical engineering applications also require consideration beyond adsorption capacity, including thermal and structural stability of materials.
Investigation of descriptors for materials with superior working capacity revealed that top-performing MOFs exhibit similar distributions in geometric descriptors such as density, SAg, POVf, and SAv. Based on the experimental findings, the following design guidelines are proposed to provide a valuable direction for future efforts aimed at developing MOFs with enhanced methane adsorption capacity and working capacity. (1) Target materials should possess SAv values in the range of 400 to 1500 m2 cm−3 for optimal high-pressure adsorption performance. (2) Optimal performance is associated with SAg values between 5500 and 8700 cm2 g−1 under high-pressure adsorption and between 1950 and 5700 cm2 g−1 under low-pressure adsorption. Pursuit of maximum adsorption capacity necessitates minimizing density while maximizing Vf and POVf. (3) For gravimetric working capacity, MOFs with density around 0.074 g cm−3, SAg near 6700 m2 g−1, POVf approximately 0.007, and SAv close to 500 m2 demonstrate superior performance. MOFs optimized for volumetric working capacity exhibit values near density of 0.5 g cm−3, SAg of 4700 m2 g−1, POVf of 0.75, and SAv of 0.40 m2.
352 chemically reasonable MOFs screened from eight publicly available databases. These MOFs cover a broad range of pore geometries, topologies, and chemical environments. However, predictions for materials that deviate substantially from this domain should be interpreted with caution, such as those possessing structural or chemical features not captured by the selected descriptors.
In addition to the geometric and chemical descriptor domain, the applicability of the model is constrained by the thermodynamic conditions under which the adsorption data were generated. All methane adsorption and working capacity data used for model training and validation were obtained at the temperatures considered in this study under six pressure conditions corresponding to three representative PSA application scenarios. Specifically, these conditions include: landfill gas treatment (4.4–0.01 bar, 338 K), methane purification (9–0.9 bar, 298 K), and methane storage, separation, and purification (65–5.8 bar, 298 K). Model predictions are therefore strictly applicable only to these defined pressure-temperature conditions.
It is necessary to acknowledge the limitations of the present study. Although several strategies were employed to mitigate potential overfitting, including five-fold cross-validation and performance evaluation using an independent test set obtained through random sampling that was not involved in model training (but originated from the same source as the training data), it should be noted that independent external validation using datasets outside the training database has not yet been conducted. Consequently, the current model is primarily validated within the chemical and structural space represented by the available MOF dataset.
Validation using independently sourced external datasets is essential for fully substantiating the capability of the proposed model to identify genuinely novel high-performance MOFs beyond its training domain. In future work, external validation will focus on newly reported or newly released experimental MOFs with available methane adsorption or working capacity data. First, compiling methane working capacity data for MOFs that are newly discovered or experimentally synthesized in subsequent studies to further validate and refine the proposed multi-task learning model. Second, the applicability of the model will be extended by incorporating working capacity prediction tasks under a broader range of pressure–temperature conditions, thereby enabling a more comprehensive assessment of model applicability and providing further insights into its generalization behaviour under extended operating conditions.
| This journal is © The Royal Society of Chemistry 2026 |