Open Access Article
Yifei Yue ab, Athulya S. Palakkal a, Saad Aldin Mohamed a and Jianwen Jiang *ab
a Department of Chemical and Biomolecular Engineering, National University of Singapore, Singapore, 117576, Singapore. E-mail: chejj@nus.edu.sg
b Integrative Sciences and Engineering Programme, National University of Singapore, Singapore, 119077, Singapore
First published on 16th September 2025
Metal–organic frameworks (MOFs) are intriguing nanoporous materials with a wide variety of potential applications. Recent efforts in extending the functionalities of MOFs toward biological applications have inspired the development of Bio-MOFs comprising biological building blocks. Yet, while numerous experimental studies have attempted to synthesize different Bio-MOFs, computational screening of Bio-MOFs is impeded by the limited number of Bio-MOFs currently available. Here, we design a Bio-hMOF database containing 17 681 hypothetical structures, assembled from the fragments of 309 experimental Bio-MOFs, with rigorous geometry optimization and structural checks. Subsequently, a possible biological application of the Bio-hMOFs is demonstrated for the selective adsorption of signaling gases NO and CO. The effects of different inorganic and organic fragments on the mechanical properties of Bio-hMOFs are also examined. Finally, we identify mechanically stable Bio-hMOFs promising for selective NO/CO adsorption and holistically analyze the trade-off between adsorption capacity and mechanical strength. The digital Bio-hMOF database is publicly available and can be leveraged in future studies to discover top candidates and unveil new structure–property insights for the further design of Bio-MOFs for targeted biological applications.
While there are growing experimental studies on the design and applications of Bio-MOFs, computational effort to identify promising Bio-MOFs is scarce due to the limited number of Bio-MOFs available. Except for a few studies that computationally screened Bio-MOFs for indoxyl sulfate adsorption14 and O2/N2 separation,15 a common database of Bio-MOFs with biomolecular building blocks is lacking and the existing studies did not consider the mechanical properties of Bio-MOFs. Thus, addressing these limitations would accelerate the design of new Bio-MOFs for emerging biological applications.
In this work, a database of hypothetical Bio-MOFs (Bio-hMOFs) is developed. We first decomposed 309 experimentally available Bio-MOFs into their unique inorganic and organic fragments, and then generated Bio-hMOF structures based on two configurations: (1) one inorganic node with one organic node; (2) one inorganic node with one organic edge. The generated structures were optimized, followed by structural checks to eliminate disordered structures. Eventually, 17 681 structures were curated to constitute the Bio-hMOF database. Subsequently, from an application point of view, we identified promising Bio-hMOFs for the selective adsorption of signaling gases,16 namely NO17,18 and CO,19 as the selective adsorption and release of signaling gases is crucial in biological applications involving the in vivo delivery of signaling molecules.20,21 Thus, understanding the attributes of Bio-MOFs that govern signaling gas adsorption would aid the development of new functional Bio-MOFs for selective capture of the desired biological signaling gases. Finally, we conducted molecular dynamics (MD) simulations to evaluate the mechanical properties of Bio-hMOFs, which are important metrics for assessing the mechanical stability of hypothetical structures. Furthermore, the relationships between chemical compositions and mechanical properties, as well as the trade-off between porosity and stability, were discussed.
The remainder of this work is organized as follows. First, we provide a brief overview of our research workflow for generating Bio-hMOFs. Next, we analyze the geometric and chemical features of Bio-hMOFs, in comparison to existing hypothetical MOF databases.22–26 Thereafter, we examine the selective adsorption of a NO/CO mixture in Bio-hMOFs to elucidate structure–property relationships and identify top Bio-hMOFs with large adsorption capacity (N) and high selectivity (S). Finally, we integrate bulk (K), shear (G), and Young's (E) moduli of Bio-hMOFs in relation to their chemical compositions. More broadly, we provide a publicly available database and analysis tools to facilitate future computational studies in identifying and understanding the attributes of Bio-MOFs with promising functionality for biological applications.
4 and CSD.27 Then, these Bio-MOFs were decomposed (i.e., broken down) into unique inorganic and organic fragments. In total, 141 inorganic fragments (Fig. 1b) were identified, including common metal nodes like Zr8 secondary building units (SBUs) in UiO-66 and rare fragments like metalloporphyrin. Meanwhile, the 66 organic fragments contained a wide range of functional groups (Fig. 1c) such as bioactive acetanilide (fragment 8), pyrene derivatives (fragment 59), multiple acetophenone (fragment 9) and cyclodextrin (fragment 41). Next, the fragments were assembled to generate Bio-hMOFs by using the tinker-toy topological assembly algorithm28 provided in PORMAKE29 (version 0.2.1) with two configurations: (1) one inorganic node and one organic edge (or linker) and (2) one inorganic node and one organic node. More details about the generation of Bio-hMOFs are provided in S1 of the SI.
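The two assembly configurations can be illustrated with a small enumeration sketch. This is not the PORMAKE workflow itself: the fragment identifiers, the connection-point counts, and the rule that a net with one vertex type takes a ditopic edge while a net with two vertex types takes a polytopic organic node are all simplifying assumptions made for illustration.

```python
from itertools import product


def enumerate_candidates(topologies, inorganic, organic):
    """Yield (topology, inorganic_id, organic_id, organic_role) combinations.

    topologies: net name -> tuple of vertex coordination numbers
    inorganic / organic: fragment id -> number of connection points
    All inputs are hypothetical descriptors, not PORMAKE objects.
    """
    for topo, coordination in topologies.items():
        for node, linker in product(inorganic, organic):
            n_pts, l_pts = inorganic[node], organic[linker]
            if len(coordination) == 1:
                # One inorganic node bridged by a ditopic organic edge.
                if l_pts == 2 and n_pts == coordination[0]:
                    yield topo, node, linker, "edge"
            elif len(coordination) == 2:
                # One inorganic node paired with a polytopic organic node.
                if l_pts > 2 and sorted((n_pts, l_pts)) == sorted(coordination):
                    yield topo, node, linker, "node"


# Hypothetical fragments: ids and connectivities are placeholders.
topologies = {"pcu": (6,), "tbo": (3, 4)}
inorganic = {"N_a": 6, "N_b": 4}
organic = {"E_a": 2, "E_b": 3}
candidates = list(enumerate_candidates(topologies, inorganic, organic))
```

Each resulting tuple would then be passed to an assembly tool to build and optimize an actual structure; the sketch only shows how connectivity matching narrows the combinatorial space.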
The generated structures underwent geometry optimization using the LAMMPS simulation package (version August 2023)30 with the universal force field (UFF).31 Briefly, the optimization procedure comprised steepest descent (SD) minimization with frozen cell boundaries, SD with cell relaxation, and then three cycles of FIRE32 minimization with a frozen cell, followed by SD minimization with cell relaxation. All minimization steps were performed with a stopping tolerance of 10⁻⁸ and a maximum of 50 000 iterations for force and energy evaluations. Thereafter, we conducted structural checks using MOFID33,34 (version 1.1.0) to eliminate structures with under-coordinated or hyper-coordinated atoms. Furthermore, the lammps-interface35 (version 0.2.2) was used to detect and filter out disordered structures containing free molecules. Robust structural checks are necessary, as highlighted by recent studies,36,37 and structural check algorithms developed in the future can be applied to eliminate problematic structures (e.g., metal centers with erroneous oxidation states). Consequently, a database with 17 681 Bio-hMOFs was curated (see Table S1 for statistics). Next, the structural and feature diversity of Bio-hMOFs was analyzed and compared with popular hypothetical databases (BW-DB, ARC, hMOF, ToBaCCo and ultrastable)22–26 in terms of geometric and chemical features. The geometric features, consisting of pore size, surface area and pore volume (see Table S2), were computed using Zeo++ (version 0.3).38 The revised autocorrelation functions (RACs) were adopted as the chemical features (see Table S3) and computed with molSimplify (version 1.7.3),39 following the methodology of Moosavi et al.39,40 Five categories of RACs were evaluated based on the atom centers over neighboring atoms (within a maximum distance of three bonds): (1) metal centers (metal-centered RACs), (2) linker-connecting atoms (linker-connecting RACs), (3) full linkers (full-linker RACs), (4) functional group atoms (functional group RACs) and (5) the entire structure (full-scope RACs). The feature space of RACs was analyzed using unsupervised learning methods (i.e., t-SNE). Additional details regarding geometric features and RACs are provided in S2.
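The staged geometry optimization described above can be sketched as a helper that writes out LAMMPS minimization commands. This is an illustrative sketch, not the authors' input script: it assumes that each of the three cycles consists of a FIRE minimization with a frozen cell followed by an SD minimization with cell relaxation, that the cell is relaxed with `fix box/relax tri 0.0` (which requires a triclinic box), and the function name and fix ID are hypothetical.

```python
def minimization_schedule(tol=1e-8, max_steps=50_000, fire_cycles=3):
    """Return LAMMPS input lines for a staged SD/FIRE minimization.

    The stage order is one reading of the procedure in the text; the
    box/relax settings are assumptions, not taken from the paper.
    """
    run = f"minimize {tol:g} {tol:g} {max_steps} {max_steps}"
    relax_on = "fix relax all box/relax tri 0.0"  # allow cell lengths/angles to change
    relax_off = "unfix relax"

    lines = []
    # Stage 1: steepest descent with a frozen cell.
    lines += ["min_style sd", run]
    # Stage 2: steepest descent with cell relaxation.
    lines += [relax_on, "min_style sd", run, relax_off]
    # Stage 3: repeated FIRE (frozen cell) + SD (relaxed cell) cycles.
    for _ in range(fire_cycles):
        lines += ["min_style fire", run]
        lines += [relax_on, "min_style sd", run, relax_off]
    return lines


script = "\n".join(minimization_schedule())
```

Cell relaxation is attached only to the SD stages because the LAMMPS FIRE minimizer does not support `fix box/relax`, which is consistent with the frozen-cell FIRE steps described in the text.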
To demonstrate a potential application of Bio-hMOFs for the signaling gases NO and CO,16 we conducted grand-canonical Monte Carlo (GCMC) simulations to evaluate the selective adsorption of an equimolar NO/CO mixture in Bio-hMOFs at 298 K, under pressures of 1 bar and 10 bar, respectively. The force field parameters were adopted from previous studies as detailed in S3. Because the endogenous gases NO and CO compete in binding toward the active sites of biological receptors (e.g., soluble guanylate cyclase), identifying materials with selective NO or CO adsorption will be useful for biological applications (e.g., in vivo delivery of signaling molecules).20,21 The selective adsorption performance of Bio-hMOFs was quantified by using two metrics: adsorption capacity $N_x$ and selectivity $S_{x/y} = (N_x/N_y)/(p_x/p_y)$, where $p$ is the partial pressure. Because NO is a more essential signaling molecule than CO, our analysis was primarily based on the selectivity of NO with respect to CO (i.e., $S_{\mathrm{NO/CO}}$).
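As a worked illustration of the selectivity metric, the sketch below assumes the conventional definition of adsorption selectivity, S(x/y) = (N_x/N_y)/(p_x/p_y). The function name and the example uptakes are hypothetical; only the equimolar condition comes from the text.

```python
def selectivity(n_x, n_y, p_x, p_y):
    """Adsorption selectivity of x over y: (N_x / N_y) / (p_x / p_y).

    n_x, n_y: adsorbed amounts in the same units
    p_x, p_y: partial pressures in the same units
    """
    if n_y <= 0 or p_x <= 0 or p_y <= 0:
        raise ValueError("n_y, p_x and p_y must be positive")
    return (n_x / n_y) / (p_x / p_y)


# Equimolar mixture at 1 bar total pressure: both partial pressures are 0.5 bar,
# so the selectivity reduces to the uptake ratio. Uptake values are placeholders.
s_no_co = selectivity(n_x=1.17, n_y=1.0, p_x=0.5, p_y=0.5)
```

For an equimolar feed the pressure ratio is unity, so a selectivity above 1 simply means more NO than CO is adsorbed.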
Furthermore, as mentioned above, it is crucial to consider the mechanical stability of hypothetical structures like Bio-hMOFs for a holistic assessment. The mechanical properties of Bio-hMOFs, including bulk (K), shear (G) and Young's (E) moduli, were evaluated from MD simulations as described in more detail in S4. Regarding the choice of force field to compute mechanical properties, it is important to note that UFF4MOF41 overestimates the mechanical and thermal expansion properties of certain MOFs,35,42 which may lead to outliers when employed in computational screening. While recent studies have demonstrated the accuracy of machine-learned potentials (MLPs) in predicting mechanical properties of MOFs,43,44 scaling MLP-MD simulations to a large number of Bio-hMOFs is computationally prohibitive. Thus, we employed the UFF31 instead of UFF4MOF41 and MLPs43,44 as it offers a good balance between accuracy and scalability in the evaluation of mechanical properties. Finally, the structure–property relationships and the trade-off between adsorption capacity and mechanical stability were evaluated and discussed.
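To make the three moduli concrete, the sketch below computes bulk, shear and Young's moduli from a 6×6 elastic stiffness matrix in Voigt notation using the Voigt–Reuss–Hill averaging scheme. This is an assumption for illustration: this section does not state how the moduli were averaged (the actual procedure is in S4), and the function names and the isotropic test matrix are hypothetical.

```python
def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    aug = [list(map(float, row)) + [float(i == j) for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("stiffness matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def _sums(m):
    """Return (normal diagonal, normal off-diagonal, shear diagonal) sums."""
    diag = m[0][0] + m[1][1] + m[2][2]
    off = m[0][1] + m[0][2] + m[1][2]
    shear = m[3][3] + m[4][4] + m[5][5]
    return diag, off, shear


def hill_moduli(stiffness):
    """Return (K, G, E) as Voigt-Reuss-Hill averages, in the units of the input."""
    c_diag, c_off, c_shear = _sums(stiffness)
    s_diag, s_off, s_shear = _sums(invert(stiffness))  # compliance matrix
    k_voigt = (c_diag + 2.0 * c_off) / 9.0
    g_voigt = (c_diag - c_off + 3.0 * c_shear) / 15.0
    k_reuss = 1.0 / (s_diag + 2.0 * s_off)
    g_reuss = 15.0 / (4.0 * s_diag - 4.0 * s_off + 3.0 * s_shear)
    k = 0.5 * (k_voigt + k_reuss)
    g = 0.5 * (g_voigt + g_reuss)
    e = 9.0 * k * g / (3.0 * k + g)
    return k, g, e


# Hypothetical isotropic solid (GPa): C11 = 4, C12 = 2, C44 = 1.
c_iso = [[4, 2, 2, 0, 0, 0],
         [2, 4, 2, 0, 0, 0],
         [2, 2, 4, 0, 0, 0],
         [0, 0, 0, 1, 0, 0],
         [0, 0, 0, 0, 1, 0],
         [0, 0, 0, 0, 0, 1]]
k_h, g_h, e_h = hill_moduli(c_iso)
```

For an isotropic matrix the Voigt and Reuss bounds coincide, which makes it a convenient sanity check before applying the routine to anisotropic frameworks.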
Next, we compare the geometric properties of Bio-hMOFs with those of other hypothetical databases.22–26 As shown in Fig. 2 and S3, most geometric properties of Bio-hMOFs, such as pore limiting diameter (PLD), largest cavity diameter (LCD), gravimetric or volumetric surface area (GSA/VSA), and gravimetric pore accessible volume (GPOV), exhibit distributions similar to other databases. For the void fraction (VF), Bio-hMOFs possess a more uniform kernel density estimation (KDE) distribution in comparison with other databases (Fig. S3). This is possibly attributed to the composition of relatively shorter linkers (e.g., derivatives of oxalate and BDC linkers) in most Bio-hMOFs, which lead to a slightly lower porosity. Moreover, we note that hypothetical structures constructed using topological nets with multiple nodes and edges (e.g., one metal node and two types of organic edges) were not included in Bio-hMOF due to their exceedingly high porosity. Nevertheless, the balanced distribution of VF in the Bio-hMOF database is advantageous for improving transferability from data-driven analysis of Bio-hMOFs to experimental Bio-MOFs, because most hypothetical databases tend to possess geometric properties that are skewed toward a higher VF as compared to experimental databases.22
Fig. 2 (a) Geometric features in Bio-hMOF and other hypothetical databases, with the kernel density estimation (KDE) distributions plotted on the marginal axes. From left to right: GSA against LCD, GSA against GPOV, and VSA against PLD. (b) t-SNE maps of chemical features, with the KDE distributions on the marginal axes. The KDE was normalized to the total number of MOFs in each database: 269 391 ARC-MOF,22 323 789 BW-DB,23 54 139 ultrastable,24 11 577 ToBaCCo,25 114 658 hMOF,26 and 17 681 Bio-hMOF (this work).
The Bio-hMOF database is further compared with other hypothetical databases in terms of chemical features. Fig. 2b and S4 show the t-distributed stochastic neighbor embedding (t-SNE) maps of the RACs in different databases. ARC-MOFs are evenly distributed in the t-SNE space, as ARC-MOFs22 were selectively composed to attain a good trade-off between balance and variety of chemical features.47 The RACs of Bio-hMOFs are well-distributed across the entire t-SNE space despite their low quantity (ca. 2.2% of the entire space), which is corroborated by the distribution of Bio-hMOFs in the t-SNE maps of RACs for full-linker, metal-centered, linker-connecting and functional groups, respectively (Fig. S5). This indicates that Bio-hMOFs comprise diverse metal nodes and organic linkers in the MOF ecosystem. While the agglomeration of clusters is observed in the t-SNE maps of RACs describing various chemical types (Fig. S5), this is a result of the (inevitable) imbalance of element compositions of building blocks involved in the construction of hypothetical databases.22 Despite a small skew due to the restriction on building blocks, Bio-hMOFs are relatively well-dispersed in the t-SNE space in comparison to other larger databases. Overall, our analysis indicates that the Bio-hMOF database exhibits diverse and balanced structural and chemical features, which renders it well suited for data-driven screening and machine learning tasks.
51). While the force field parameters for NO and CO adsorption were validated in certain MOFs,17,19 we caution that extensive validation in bio-MOFs is not yet possible. Thus, we provide the simulated adsorption isotherms of pure NO, CO and NO/CO mixture in two pertinent experimental bio-MOFs (bio-MOF-1 and bio-MOF-12) in Fig. S6 for future reference.
Fig. 3 shows the relationships between NO (NNO) and CO (NCO) adsorption capacities in an equimolar NO/CO mixture predicted from GCMC simulations. There is a greater capacity at a higher pressure, whereas the adsorption selectivity is not significantly affected by pressure (Fig. S7). NNO and NCO are not directly correlated with the inorganic fragment type (Fig. S8 and S9), although they are influenced by the organic fragment type (Fig. 3c and S10). This is due to the greater impact of pore geometry that governs the predominant physisorption process for NO/CO as compared to chemisorption. Although NO may undergo chemisorption at the open metal sites (OMS), the presence of large pores is equally important for high NO uptake.17 Conversely, previous simulations show that interaction between OMS and CO is less significant, and CO preferentially undergoes physisorption.19 As a result, Bio-hMOFs with longer and sparser organic fragments such as multi-phenyl chains (e.g., E60 and E62) exhibit stronger adsorption for both NO and CO, as compared to their counterparts with shorter fragments (e.g., E0, E54, and E55).
As discussed above, the metal node type has a significant impact on NO adsorption. As shown in Fig. S11, Bio-hMOFs containing Zn- and Cu-nodes, particularly with accessible OMS (e.g., N102 and N21), favor higher NNO. Regarding selectivity SNO/CO, most Bio-hMOFs exhibit a selectivity close to unity (due to the similar kinetic diameter of NO and CO), despite a slight preference for NO (median SNO/CO of 1.17 at 1 bar). Bio-hMOFs exhibiting high SNO/CO are attributed to the presence of OMS, such as the pillared Zn-node (N102), which favors interaction with NO. Hence, these Bio-hMOFs are potentially promising for selective NO sensing and loading. Conversely, Bio-hMOFs containing coordinatively saturated Zn- and Cd-nodes (N6 and N118) possess higher CO selectivity, indicating that metal node type has an insignificant impact on CO adsorption. Bio-hMOFs with high NO and CO selectivity are listed in Tables S8 and S9, respectively. As elucidated in ZIFs, certain functional groups (e.g., nitro-imidazolate in ZIF-68) instead of OMS serve as binding sites for CO. Thus, tuning of pore geometry and functional groups is more effective to increase CO adsorption.19 Taken together, the above analysis of structure-adsorption relationships indicates that selective signaling gas adsorption can be tailored by tuning metal nodes and functional groups in Bio-hMOFs.
681) possess high bulk moduli (KH > 20 GPa), which is possibly due to the relatively low VF possessed by Bio-hMOFs. Meanwhile, approximately 919 (∼5%) Bio-hMOFs exhibit exceptional KH and EH (shaded region in Fig. 4a). The majority of these 919 structures contain bulkier organic fragments (E54: 24% and E55: 15.7%) that correspond to lower signaling gas adsorption (Tables S10 and S11). As porosity increases with the length of organic linkers, it is expected that organic linkers containing multiple phenyl chains (e.g., E6 and E62) lead to lower KH. In contrast, shorter linkers (e.g., E54 and E0) lead to higher KH. The same conclusions can be drawn from the chemical compositions of 909 Bio-hMOFs with exceptional KH and GH (in Fig. 4b). While specific inorganic fragments on average promote slightly higher KH (Fig. 4c), the effect of organic fragments on KH is much more significant (Fig. 4d). This can be attributed to the impact of organic linker type on porosity.52 However, the mechanical properties are also affected by the rigidity of inorganic fragments.24,52 For instance, Bio-hMOFs containing Cd-nodes (e.g., N118) possess a slightly higher KH (median of 4.53 GPa) compared to more common Zn-nodes (e.g., N6 with median KH = 3.12 GPa). This is a result of the greater maximum coordination number (MCN) of Cd metal (MCN = 8) as compared to Zn (MCN = 4). Aside from the well-established influence of porosity on KH,52,53 metals with greater MCN, such as 2nd-row transition metals (Cd and Zr) and lanthanides (Eu and Tb), reinforce the rigidity of the metal nodes and contribute to higher mechanical strength.24,52
Fig. 5 shows KH versus NNO at 1 bar. Notably, there exists a trade-off between adsorption and mechanical strength. On the Pareto frontier, five Pareto-optimal Bio-hMOFs with inorganic fragments, OMS (e.g., Mn-based N128 and Fe-based N41) and short linkers (e.g., E54) exhibit high NNO and good mechanical strength (Tables S12 and S13). This underscores the importance of using appropriate building blocks, combining NO-selective metal nodes (or functionalized linkers) and topological nets (e.g., zec) to facilitate the generation of porous regions (in Fig. 5b) that best optimize the trade-off. In terms of selectivity, a Bio-hMOF containing a Mn-porphyrin node (N128) achieves the highest SNO/CO > 4, due to the availability of Mn OMS to facilitate strong binding with NO (in Fig. 5b).17 Furthermore, the Mn-porphyrin complex is also highly rigid, thus facilitating high mechanical strength. Similarly, a Bio-hMOF with the Fe-porphyrin node (N41) and topology net zec demonstrates high NNO and KH (in Fig. 5c).
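The Pareto frontier mentioned above can be extracted with a simple sort-and-scan pass when both objectives (NO uptake and bulk modulus) are to be maximized. The sketch below is a generic illustration; the function name, the record layout and the numerical values are hypothetical and are not taken from the database.

```python
def pareto_front(points):
    """Return the non-dominated points when both objectives are maximized.

    points: iterable of (uptake, modulus, label) tuples.
    A point is dropped if another point is at least as good in both
    objectives and strictly better in at least one.
    """
    front = []
    best_modulus = float("-inf")
    # Scan from highest uptake to lowest; a point survives only if it
    # improves on the best modulus seen among higher-uptake points.
    for uptake, modulus, label in sorted(points, key=lambda p: (-p[0], -p[1])):
        if modulus > best_modulus:
            front.append((uptake, modulus, label))
            best_modulus = modulus
    return front


# Placeholder data: (N_NO, K_H, structure label).
structures = [
    (5.0, 2.0, "A"),
    (3.0, 6.0, "B"),
    (2.0, 4.0, "C"),
    (1.0, 9.0, "D"),
    (4.0, 1.0, "E"),
]
front_labels = [label for _, _, label in pareto_front(structures)]
```

Here structures C and E are dominated (B beats C and A beats E in both objectives), so only A, B and D remain on the frontier.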
Additionally, we find that Bio-hMOFs lying on the Pareto frontier also exhibit good NO selectivity (SNO/CO > 2; Table S13). This can be attributed to the presence of accessible OMS in these Bio-hMOFs, which facilitate high NO uptake and selectivity. On the other hand, Bio-hMOFs with high mechanical stability and SNO/CO may not exhibit the desired high NNO due to their low porosity (see Table S8 and examples in Fig. S11a). Thus, Bio-hMOFs possessing high mechanical strength and selective NO adsorption can be designed with the presence of accessible OMS, synergized with appropriate linkers and topological nets. Conversely, Bio-hMOFs containing suitable functional groups (e.g., imidazolate19) exhibit highly selective CO adsorption (see examples in Fig. S11b). These Bio-hMOFs are potential carriers for targeted delivery of CO, which is required for anti-inflammatory treatment.45
Finally, it is worthwhile to note that metal toxicity analysis14 indicated that common metals in MOFs (e.g., Ni, Cd, and Cu)54 are toxic when evaluated on the basis of median lethal dose (LD50).14 Therefore, Bio-hMOFs containing safe metals (e.g., Zn, Co, Fe, Mn, Mg, and In) should be mainly considered when screening for biological applications, although all Bio-hMOFs were examined in this work for selective NO/CO adsorption. In practical biological applications,7,24 tuning the organic fragments of Bio-MOFs (i.e., biological ligands and functional groups) remains an important avenue toward their functionalization, especially when only a few non-toxic metal types14 are feasible for safe in vivo biological applications.21
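The metal-based pre-screening suggested above amounts to a set-membership filter. The sketch below uses the list of safe metals quoted in the text; the candidate names and their metal compositions are hypothetical placeholders.

```python
# Metals listed as safe in the text above.
SAFE_METALS = frozenset({"Zn", "Co", "Fe", "Mn", "Mg", "In"})


def has_only_safe_metals(metals, safe=SAFE_METALS):
    """True if the structure has at least one metal and all are in `safe`."""
    metals = set(metals)
    return bool(metals) and metals <= safe


def filter_safe(candidates, safe=SAFE_METALS):
    """Keep candidate names whose metals are all considered safe."""
    return [name for name, metals in candidates.items()
            if has_only_safe_metals(metals, safe)]


# Hypothetical candidates mapped to the metals in their inorganic fragments.
candidates = {
    "hmof_a": ["Zn"],
    "hmof_b": ["Cd"],
    "hmof_c": ["Fe", "Cu"],
    "hmof_d": ["Mn", "Mg"],
}
safe_names = filter_safe(candidates)
```

Requiring every metal in a structure to be on the safe list is a conservative design choice; a dose-based criterion such as LD50 thresholds would need additional data.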
681 structures. Subsequently, we demonstrate the application of Bio-hMOFs for the delivery of biological signaling gases. In addition to identifying top Bio-hMOFs exhibiting high NCO and NNO, our analysis elucidates that Bio-hMOFs with open metal sites are preferred for selective NO adsorption. By evaluating the mechanical properties of Bio-hMOFs, we show that a few Bio-hMOFs possess high mechanical strength. Finally, we analyze the effects of building block compositions (i.e., inorganic/organic fragments) on the trade-off between adsorption capacity and mechanical strength. Overall, we elucidate that stable Bio-hMOFs containing metal-porphyrin based inorganic fragments facilitate selective NO adsorption, whereas functional groups and pore geometry play a more significant role in CO adsorption. Computational screening can further discover new Bio-MOFs for emerging biological applications. Future extension of the Bio-hMOF database may leverage data-driven approaches to include important properties relevant to biological compatibility, such as structural stability and biodegradability.55
The data supporting this article, including the construction and featurization of Bio-hMOFs, GCMC simulations, computation of mechanical properties, feature analysis, and the statistics of adsorption and mechanical properties, have been included as part of the SI. See DOI: https://doi.org/10.1039/d5dd00213c.
This journal is © The Royal Society of Chemistry 2025