Open Access Article
Bo
Han
a,
Marcus
de Carvalho
b,
Jie
Zhang
*b,
Hui
Mao
*c and
Qingyu
Yan
*ad
aEnergy Research Institute@NTU (ERI@N), Nanyang Technological University, Singapore 637141, Singapore. E-mail: alexyan@ntu.edu.sg
bCollege of Computing and Data Science, Nanyang Technological University, 50 Nanyang Ave, Singapore 639798, Singapore. E-mail: zhangj@ntu.edu.sg
cCollege of Chemical Engineering, Sichuan University, Chengdu 610065, P. R. China. E-mail: rejoice222@163.com
dSchool of Materials Science and Engineering, Nanyang Technological University, Singapore 639798, Singapore
First published on 23rd January 2026
Metal–organic frameworks (MOFs) are highly versatile materials known for their exceptional properties, including high specific surface area, tunable pore structures, and multifunctionality, making them invaluable in fields of electrochemical ammonia synthesis. Over the past decade, MOF-based systems have evolved from initial N2-reduction prototypes into advanced catalysts that can convert nitrates and nitrites into ammonia. This offers a broader perspective on the electrochemical nitrogen cycle. However, their structural complexity poses significant challenges to traditional design and optimization approaches. This review first critically summarizes the recent advances in MOF development for electrochemical ammonia synthesis. Next, this review systematically explores the transformative role of machine learning (ML) in advancing MOF research. It also addresses the 8 major challenges and limitations currently facing this intersection, including data scarcity, model interpretability, and inverse design. To address these challenges and guide future progress, the review proposes 4 clear and practical future directions: leveraging explicable AI to improve model transparency, integrating active learning with automated platforms to optimize experimental workflows, exploring the synergy between ML and quantum computing to simulate complex structures, and fostering multidisciplinary collaboration for holistic innovation. By bridging computational intelligence and materials discovery, this work underscores the central role of ML in shaping the next generation of MOF-based technologies for sustainable energy and environmental applications.
The electrochemical nitrogen reduction reaction offers the potential to produce ammonia under mild conditions. However, the immense bond energy of N2 (941 kJ mol−1) and its limited solubility result in sluggish kinetics and low Faradaic efficiencies.4 Meanwhile, electrocatalytic nitrate/nitrite reduction offers a more thermodynamically favorable route while addressing nitrate pollution from fertilizers and industrial waste.5 Together, these approaches form the basis of a new electrochemical nitrogen economy.6
Metal–organic frameworks (MOFs) are highly versatile materials with exceptional properties such as chemical multifunctionality, tunable pore structure, and high specific surface area.7 The modular architecture of MOFs, combining metal nodes with organic linkers, allows for customized functionality tailored to specific applications.8–10 These properties have positioned MOFs as critical candidates for electrochemical ammonia synthesis.11
Despite their enormous potential, MOFs exhibit structural complexity due to several factors that make their design and optimization challenging for traditional approaches: (i) diversity of building blocks.12 MOFs are composed of many metal nodes and organic linkers, allowing for an almost infinite number of structural combinations. This structural diversity results in a high-dimensional design space. (ii) Tunable properties.13 MOFs can be tailored at multiple levels, such as pore size, surface area, topology, and functionalization, greatly increasing the number of potential variants. (iii) High crystallographic symmetry.14 Many MOFs have intricate geometries with high symmetry and complex connectivity patterns, making their characterization and computational modeling difficult. (iv) Dynamic behavior.15 MOFs can exhibit flexibility, breathing, and other dynamic properties that add a layer of complexity to understanding and predicting their behavior under different conditions. (v) Defects and disorder.16 The presence of defects, partial occupancies, and framework disorder in many MOFs can affect their performance but are difficult to quantify and model accurately. (vi) Interplay of chemical interactions.17 The performance of MOFs is often influenced by a combination of chemical interactions, such as van der Waals forces, hydrogen bonding, and π–π interactions, which require advanced computational tools to model effectively. (vii) Multifunctionality.18 Many MOFs are designed for multifunctional applications that require simultaneous optimization of multiple, often competing, properties such as stability, adsorption capacity, and selectivity. (viii) Scalability and reproducibility.19 Scaling up MOF synthesis while maintaining precise structural and property control adds practical complexity to their design and application. The structural complexity of MOFs is shown schematically in Fig. 1, which raises the motivation to create advanced techniques for precise design and development of MOFs.
The rapid growth of data generation in materials science, coupled with advances in machine learning (ML) algorithms, has made ML a transformative approach to solving complex scientific challenges.20 ML excels at unraveling non-linear relationships in large data sets, enabling prediction of properties and optimization of materials with unprecedented accuracy and speed. This capability is particularly relevant for MOFs, whose diverse structural and chemical compositions require sophisticated tools to uncover and exploit structure–property relationships.21 By integrating ML, researchers have significantly accelerated the processes of screening, developing, and refining MOFs for cutting-edge applications, bridging the gap between computational efficiency and experimental feasibility.22
Machine learning (ML) provides a data-driven framework that can identify complex, nonlinear relationships between MOFs and their catalytic performance. These relationships are often inaccessible through conventional, intuition-based design. In the context of electrochemical ammonia synthesis, ML can be integrated into MOF research by learning from existing datasets that correlate metal nodes, ligand chemistry, defect structures and electronic descriptors with activity towards N2 activation and nitrate or nitrite reduction. High-dimensional optimization enables ML to rapidly predict key properties such as adsorption energies, charge transfer characteristics, conductivity pathways and intermediate binding energetics. These insights can then be used to guide the rational design of MOF architectures with active sites that are tailored to specific applications, coordination environments that are optimized, and electronic structures that can be tuned as required. Furthermore, ML can support the optimization of synthesis by identifying the optimal solvothermal parameters, linker–metal combinations and defect densities that maximize catalytic efficiency. By integrating ML into the research process, the discovery and refinement of MOF electrocatalysts is accelerated, reducing reliance on trial-and-error experimentation and enhancing the precision and scalability of catalyst development.
This review explores the intersection of ML and MOF research, highlighting the transformative role of ML in revolutionizing the design and development of MOFs. The review identifies and critically examines 8 key challenges that currently impede the broader adoption of ML in MOF research, such as the scarcity of high-quality data, challenges in interpreting complex ML models, and the inverse design and experimental validation. To address these barriers, this work proposes 4 actionable strategies: increasing model transparency through explicable AI, integrating active learning with automated experimentation, leveraging quantum computing to simulate complex material behavior, and fostering interdisciplinary collaboration to drive innovation. By addressing these issues, this review aims to highlight pathways for harnessing the full potential of ML to advance MOF-based technologies for energy and environmental sustainability.
Yan et al.30 recently reported one representative example of this phenomenon. Their work describes a bimetallic Cu–Co MOF (CuxCoy-DHTA) that acts as a catalyst for the highly efficient electrochemical reduction of nitrates to ammonia. Fig. 2(a) illustrates the hydrothermal synthesis of CuxCoy-DHTA and its electrochemical reconstruction during nitrate reduction. When a cathodic potential is applied, some of the Cu species in the bimetallic MOF are reduced to form finely dispersed Cu(0) nanoparticles. Meanwhile, the Co-MOF framework remains largely intact, resulting in a composite catalyst with embedded Cu sites. Fig. 2(b) presents the proposed dual-site mechanism. The Cu(0) nanoparticles catalyze the reduction of NO3− to NO2− preferentially, while the adjacent Co sites in the MOF promote the water dissociation and hydrogen donation steps required for the subsequent hydrogenation of NO2− to NH3. The MOF pores enable the efficient transfer of intermediates between these sites, thereby supporting a tandem Cu–Co relay pathway for the selective conversion of nitrates to ammonia. Consequently, the optimized Cu1/2Co1/2-DHTA achieves Faradaic efficiencies for NH3 of over 95% within a broad potential range of −0.8 to −1.0 V vs. RHE, with a maximum NH3 production rate of 20.02 mg h−1 cm−2, surpassing the performance of most previously reported MOF-based eNO3RR catalysts. For the mechanism study, Fig. 2(c) further illustrates the Cu–Co dual-site tandem mechanism, in which Cu(0) drives the initial NO3−-to-*NO2 step, while the Co sites govern the subsequent hydrogenation steps towards NH3. Furthermore, Fig. 2(d–f) summarize DFT results comparing monometallic Cu-DHTA with bimetallic Cu1/2Co1/2-DHTA. Although the initial reduction of NO3− to *NO2 is thermodynamically favorable on both catalysts, hydrogenation of *NO2 is energetically hindered on Cu-DHTA, resulting in desorption as NO2−. In contrast, the Co sites in the bimetallic system exhibit significantly lower free energy barriers for *NO2 hydrogenation, facilitating the retention and continued reduction of intermediates to NH3. These calculations provide mechanistic support for the experimentally observed tandem catalysis. Overall, this study provides convincing evidence that bimetallic MOF precursors that form dual-site architectures offer a powerful structural approach to efficient nitrate-to-ammonia electrocatalysis. However, it also highlights the need for a deeper understanding of the reconstruction dynamics, durability and scalability required for real-world applications.
![]() | ||
| Fig. 2 (a) Schematic of CuxCoy-DHTA preparation and electro-reduction. (b) Mechanism diagram of CuCo dual-site electrocatalytic nitrate reduction. (c) Schematic of the CuCo bimetallic Dual-Site tandem catalysis mechanism. (d) Gibbs free energy diagrams of NO3RR over Cu-DHTA and Cu1/2Co1/2-DHTA. (e) Gibbs free energy diagrams of *NO2 to NO2− over Cu-DHTA and Cu1/2Co1/2-DHTA. (f) Gibbs free energy diagrams of *NO2 to *NO over Cu-DHTA and Cu1/2Co1/2-DHTA. Reproduced with permission.30 Copyright 2025, Wiley. | ||
In another study, Sun et al.31 reported a bimetallic structural approach involving the immobilization of cobalt clusters within the cavities of Cu-based MOF-818 to form a Cu–Co dual-site catalyst that promotes the electrocatalytic reduction of nitrate to ammonia. Mechanistically, the Cu sites preferentially adsorb and reduce NO3− to NO2−, while the encapsulated Co clusters catalyze the kinetically demanding NO2−-to-NH3 steps. This enables a synergistic tandem pathway that markedly lowers the free energy barrier of the rate-determining *NO2 → *NO transformation. The MOF-818 (Cu)–Co catalyst benefits from this dual-site cooperation, achieving NH3 Faradaic efficiencies above 90% across −1.3 to −1.8 V (vs. Ag/AgCl), with a peak FE of ∼99.5% at −1.5 V and an NH3 yield rate of 35.0 mol h−1 gCu,Co−1. This significantly outperforms both pristine MOF-818 (Cu) and Co nanoparticles. This work critically demonstrates that embedding secondary metal clusters within MOF cages effectively couples adsorption and reduction functions within confined, nanoscale reactors. However, this success relies heavily on controlled metal dispersion and stable MOF architectures, which could limit the strategy's scalability and applicability to other MOF systems.
Recently, Wang et al.32 presented on a phosphate-assisted iron-modified MOF-808 (Fe@MOF-808-P) catalyst. This catalyst uses metal-center engineering, specifically the incorporation of Fe(III) into the Zr-based MOF-808 framework, combined with phosphate ligand modulation, to boost nitrate electroreduction to ammonia. The iron nodes act as the primary active sites for nitrate adsorption and stepwise hydrogenation, while the coordinated hydrogen phosphate species accelerate proton transfer and stabilize reaction intermediates, collectively lowering the energy barriers for nitrate to ammonia conversion. Mechanistic analysis shows that the Fe(III) centers promote NO3− chemisorption and facilitate electron-proton coupling, while the phosphate groups create a more favorable microenvironment for the hydrogenation steps, thereby enhancing selectivity. Electrochemical measurements demonstrate a Faradaic efficiency of 90.8% ± 0.5% at −0.9 V vs. RHE, with an NH3 yield rate of 420.4 µmol h−1 cm−2 and a specific activity of 1035.9 mg NH3 h−1 mg Fe. The system exhibits excellent durability over repeated cycles. The structural strategy of dual incorporation of Fe(III) and phosphate proves effective because it couples precise active-site design with microenvironment engineering, enabling high intrinsic activity of the Fe catalytic center and accelerated proton supply. However, the reduced surface area after P/Fe loading may limit scalability and the system still relies on relatively high NO3− concentrations for peak performance. This indicates room for optimization in terms of mass transport and active-site exposure.
Hu et al.33 reported a bimetallic oxide composite derived from a MOF, in which Cu-doped Co3O4 nanosheets are grown on three-dimensional nickel foam (Cu–Co3O4/NF). This composite was constructed using Cu-ZIF-67 as a precursor, followed by calcination, to create an electrocatalyst that can reduce nitrates to ammonia at ultra-low concentrations. This structural approach combines MOF-derived spinel Co3O4 with the incorporation of copper and a conductive nickel foam (NF) scaffold. This enhances the density of active sites, improves charge transport and suppresses the hydrogen evolution reaction (HER). Here, Fig. 3 systematically compares the electrocatalytic performance of Cu–Co3O4/NF and Co3O4/NF towards nitrate reduction. Fig. 3(a) shows that upon the addition of NO3−, both electrodes exhibit enhanced cathodic currents from approximately −0.2 V vs. RHE, indicating the onset of nitrate reduction. However, Cu–Co3O4/NF consistently displays higher current densities than Co3O4/NF, reflecting superior conductivity and catalytic activity. Fig. 3(b) and (c) show the potential-dependent NH3 yield rate and Faradaic efficiency, respectively, in 2 mM NO3−. The Cu-doped catalyst achieves a maximum NH3 yield of 0.137 mmol h−1 cm−2 at −0.83 V vs. RHE and a peak FE of 92.4% at −0.53 V vs. RHE. These values are significantly higher than those of the Co3O4/NF catalyst, confirming beneficial Cu–Co synergy. Fig. 3(d) shows the effect of nitrate concentration on the NH3 yield rate. It demonstrates that the yield rate increases with increasing NO3− concentration up to ∼20 mM. Notably, the FE remains high and still reaches 92.4% at only 2 mM NO3−, highlighting the catalyst's suitability for low-concentration conditions. Fig. 3(e) shows good durability, with both the NH3 yield rate and the FE fluctuating only slightly over six consecutive electrolysis cycles at −0.53 V vs. RHE. Finally, the Nyquist plots in Fig. 3(f) show that Cu–Co3O4/NF has much smaller charge-transfer resistance than Co3O4/NF. This indicates faster interfacial electron transfer, which corroborates the enhanced kinetic performance of the MOF-derived Cu-doped composite. Mechanistic studies based on in situ FTIR and DFT calculations suggest that the reduction of NO3− proceeds via an H-assisted surface pathway. This pathway involves the following sequence of steps: NO3− → *NO3 → *NO2 → *NO2H → *NO → *NOH → *N → *NH → *NH2 → *NH3. Meanwhile, Cu doping modulates the electronic structure of the Co sites, strengthens NO3− adsorption and lowers the free-energy barrier of the key deoxygenation steps. This significantly disfavors the HER. In addition, the authors also emphasized that preparing MOF-derived Cu-doped Cu-Co3O4/NF is simple and inexpensive.
![]() | ||
| Fig. 3 Electrocatalytic performances. (a) LSV curves of Cu–Co3O4/NF and Co3O4/NF in Ar-saturated 0.2 M K2SO4 with and without 2 mM NO3−. The comparisons of the (b) NH3 yield rate and (c) FE between Cu–Co3O4/NF and Co3O4/NF in contained 2 mM NO3− electrolyte under a range of potentials for 30 min. (d) The NH3 yield rate and FE of Cu–Co3O4/NF at −0.53 V vs. RHE in electrolytes with various concentrations of NO3− for 30 min. (e) The NH3 yield rate and FE of cycling tests for Cu–Co3O4/NF at −0.53 V vs. RHE for 30 min. (f) The Cu–Co3O4/NF and Co3O4/NF Nyquist plots. Reproduced with permission.33 Copyright 2024, Elsevier. | ||
In another study, Chen et al.34 developed an MOF-derived composite electrocatalyst in which an adenine-based cobalt-bio-MOF undergoes pyrolysis to form cobalt/nitrogen-doped carbon nanospheres (Co@NC). This process enables the uniform dispersion of metallic cobalt nanoparticles within an N-rich carbon matrix, facilitating the efficient conversion of nitrates to ammonia. This structural approach uses the nitrogen-containing organic ligand to create an abundance of pyridinic, pyrrolic and graphitic nitrogen sites during carbonization. These sites coordinate with the cobalt and modify the electronic structure, facilitating selective NOx hydrogenation. Mechanistically, the Co nanoparticles act as primary active centers for NO3− adsorption and multi-electron reduction via *NO3, *NO2, *NO and *NHx intermediates. Meanwhile, the N-doped carbon shell enhances electron transport and stabilizes reactive species, collectively suppressing competing HERs. Consequently, Co@NC exhibits a Faradaic efficiency of 96.5% and an exceptionally high NH3 yield rate of 758.0 µmol h−1 mgcat−1 in an alkaline electrolyte, alongside excellent cycling durability. The proposed MOF-derived approach is effective because it yields atomically coupled Co–N–C motifs and high conductivity within a robust carbon matrix. However, its structural precision is limited by uncontrollable reconstruction during pyrolysis and the catalyst's performance still relies on relatively negative potentials. This suggests that further optimization of active-site exposure and electronic structure is required for practical advancement.
For example, Han et al.38 investigated the use of a main-group-element-based aluminum fumarate (Al-Fum) MOF as an electrocatalyst for the reduction of nitrogen to ammonia. They demonstrated that defect engineering via solvent-assisted linker exchange (SALE) is an effective structural strategy for activating otherwise inert aluminum sites. In this approach, pristine Al-Fum, constructed from Al–O chains and fumarate linkers, is partially subjected to linker extension. This generates coordinatively unsaturated and electronically modified Al defect sites, while preserving the parent framework topology largely intact. It is found that only the defective Al-Fum exhibits significant N2RR activity. It achieves a maximum NH3 production rate of 53.9 µg(NH3) h−1 mgcat−1 in 0.4 M K2SO4, with a Faradaic efficiency of 73.8% in 0.1 M K2SO4 at −0.15 V vs. RHE under ambient conditions. From a theoretical study, Fig. 4 shows how Al-site defects affect the electronic structure and N2RR pathway. The projected density of states (DOS) for Al in pristine Al-Fum (Fig. 4(a)) reveals no notable states at the Fermi level. This suggests that the Al centers are electronically 'silent' and unable to engage effectively in N2 adsorption or activation. By contrast, the DOS of Al in defective Al-Fum (Fig. 4(b)) exhibits a pronounced peak at the Fermi level, indicating increased electronic activity and the formation of defective Al sites that can interact with N2 and N2RR intermediates. Fig. 4(c) shows the free-energy diagram mapping the N2RR pathway on a defective Al site at 0 V vs. SHE. This reveals that the initial N2 → *NNH conversion is the potential-determining step, with a free-energy increase of 1.815 eV. Energetically accessible protonation steps then occur along an alternating associative mechanism. In addition, Fig. 4(d) shows side and top views of the adsorbed intermediates. N-containing species, such as *NNH, *HNNH, and *HNNH2, bridge two adjacent defective Al centers, while *NH and *NH2 bind to a bridging position between the same pair of Al atoms. These configurations demonstrate how pairs of defective aluminum sites, coordinated by surrounding oxygen atoms, can stabilize key intermediates cooperatively and support the continuous formation of N–H bonds along the N2RR pathway. Overall, this study provides compelling evidence that linker-extension-induced defect engineering is an effective method of activating main-group MOFs for the N2RR. However, it also emphasizes the necessity of enhancing conductivity and achieving greater control over defect chemistry to maximize the potential of this approach.
![]() | ||
| Fig. 4 DFT calculations. DOS of the Al atom for Al-Fum MOF (a) without and (b) with defects. (c) Free-energy diagrams for N2 reduction on the defect site of Al-Fum at 0 V versus SHE. (d) Side and top views of atomic configurations for N2 reduction reaction intermediates adsorbed on the defect site of Al-Fum. Reproduced with permission.38 Copyright 2023, Wiley. | ||
In terms of the conductivity enhancement, Zhu et al.39 presented a structural strategy focused on conductivity, in which ultrafine metallic Cu clusters (approximately 1.5 nm in size) are generated in situ and tightly confined within the mesoporous channels of the conductive CuHHTP MOF. This forms a Cu@CuHHTP hybrid that maximizes electronic transport while preventing cluster aggregation. The catalytic mechanism is governed by metallic Cu sites, where a higher d-band center and an ‘accept–donate’ charge-transfer pathway on Cu(111) surfaces enhance the adsorption and activation of NO3− intermediates. This lowers the overall reaction barriers relative to pristine MOF sites. Consequently, the catalyst exhibits impressive ammonia-synthesis metrics from nitrate reduction, including 85.81% nitrate conversion, 96.84% NH3 selectivity, an NH3 yield of 1.84 mg h−1 cm−2 and a Faradaic efficiency of 67.55%. This work proved the pore-filling approach successfully couples MOF conductivity with highly active Cu sites.
For example, Zhu et al.40 investigated the use of a 2D Co-MOF nanosheet as a scaffold for embedding p-mercaptobenzoic acid-protected iridium nanoclusters (Ir NCs@Co-MOF) in order to facilitate the electrocatalytic reduction of nitrate to ammonia. Fig. 5(a) illustrates the solvothermal synthesis of Ir NCs@Co-MOF on Ni foam. Co2+, terephthalic acid (TPA) and p-MBA-protected iridium nanoclusters assemble, whereby the –SH group binds to the iridium and the terminal –COOH groups coordinate with the cobalt, enabling the iridium nanoclusters to be chemically integrated into the growing Co-MOF nanosheets. Fig. 1(b) shows an SEM image of the pure Co-MOF, which displays dense, layered 2D nanosheets that cover the Ni foam uniformly. After the incorporation of the Ir NCs (Fig. 1(c)), the electrode retains the same nanosheet morphology. This indicates that the embedding of the Ir NCs does not disrupt the 2D Co-MOF structure. TEM imaging of the Ir NCs@Co-MOF composite (Fig. 1(d)) confirms that the Ir nanoclusters, which have an average size of approximately 1.8 nm, are finely and uniformly dispersed on the Co-MOF. The lattice spacing is 0.225 nm, which corresponds to the Ir(111) plane. Elemental mapping (Fig. 1(e)) further demonstrates the homogeneous distribution of C, O, Co, and Ir throughout the nanosheets, verifying the successful embedding of the low loading of Ir (0.39 at%) into the 2D MOF framework. Electrochemically, Ir NCs@Co-MOF exhibits strong NO3−-to-NH3 performance in mildly saline electrolyte (0.1 M Na2SO4, 50 mg L−1 NO3−–N). At −0.8 V vs. RHE, the catalyst achieves a nitrate conversion rate of 92.5%, an NH4+ selectivity of 81.4%, and an NH4+ yield of 230.1 µg h−1 cm−2, with Faradaic efficiency peaking at around 31.4%. It is reported that this design maximizes exposure of active sites, enhances electron transfer via Ir, and stabilizes the nanoclusters against aggregation, thereby improving NO3−-to-NH3 conversion under ambient conditions.
![]() | ||
| Fig. 5 Synthesis process and morphology characterization. (a) Schematic illustration of the synthesis of Ir NCs@Co-MOF in situ grown on Ni foam by a solvothermal method. (b) SEM image of Co-MOF. (c) SEM image of Ir NCs@Co-MOF. (d) TEM of Ir NCs@Co-MOF. (e) Element mapping images of Ir NCs@Co-MOF. Reproduced with permission.40 Copyright 2025, American Chemical Society. | ||
In another study, Pan et al.41 employed a 2D, ultrathin Ni-MOF nanosheet strategy to enhance the electrochemical production of ammonia from nitrate. This strategy exploits the inherently high surface exposure, the abundance of coordinatively unsaturated nickel sites, and the improved electron transport properties of 2D MOFs. Combining experimental analysis and DFT calculations, mechanistic studies reveal that hydroxyl-ligated Ni sites possess stronger nitrate adsorption and lower activation barriers than carboxyl-ligated ones. This enables a more favorable NO3 → NO2 → NO → NHx hydrogenation pathway. Under −1.4 V, the Ni-MOF nanosheets achieve 96.4% nitrate conversion, ∼80% NH3 selectivity and an NH3 yield of 110.13 µg h−1cm−2, outperforming pristine Ni foam. This nanosheet-based structural design is effective because thinning the MOF to a few nanometers maximizes the exposure of catalytically active Ni centers, shortens the diffusion lengths of electrons and ions, and modulates the local microenvironment to stabilize key intermediates.
Topology engineering, on the other hand, involves altering the global connectivity and pore network of MOFs, including framework dimensionality, channel orientation, and connectivity density.47,48 For example, Jiang et al.49 reported the topology functionalization of a robust chiral Zr-MOF for enantioselective hydrogenation. In this study, reticular and modulated synthesis strategies were employed to generate frameworks with either FLU or ITH network topologies from identical tetratopic biphenol-derived linkers. Despite being constructed from the same building blocks, the two topologies exhibited strikingly different catalytic performance, which highlights the decisive role of framework architecture in determining reactivity and selectivity. In detail, the flu topology was found to feature large octahedral cages interconnected by open one-dimensional channels, with the dihydroxy groups of the biphenol ligands oriented towards spacious cavities. This configuration enables the successful post-synthetic grafting of monophosphoramidite ligands and the subsequent immobilization of iridium (Ir) complexes. This yields highly active, recyclable heterogeneous catalysts for asymmetric hydrogenation with up to 98% enantiomeric excess. In contrast, the ith topology features triple-helicate cages with significantly smaller diameters and more confined triangular channels. These findings demonstrate that topology engineering is a crucial control parameter for catalytic performance, not just a structural design parameter. Together, these structural strategies show that MOFs can be custom-designed reactors where atomic structure and electrochemical function are seamlessly integrated. Although these structural strategies offer an essential foundation for understanding how MOF architectures influence catalytic performance, the rapid integration of ML provides a powerful means of accelerating their design. This is achieved by predicting structure–property relationships and guiding the discovery of next-generation MOFs for efficient electrocatalysis.
The integration of ML with HTCS further enhances this process by enabling predictive modeling and pattern recognition, thereby reducing the reliance on exhaustive simulations.54 Recent advances, particularly in Graph Neural Networks (GNNs), have revolutionized the screening of MOF databases by capturing complex structural and chemical relationships.55 This approach allows rapid identification of MOFs with tailored properties for specific applications.
In addition, the synergy between ML and automated experimental platforms has transformed high-throughput MOF research. These platforms seamlessly integrate ML-guided synthesis with experimental validation, closing the loop between computational predictions and laboratory results.56 This iterative cycle accelerates the development of advanced materials and bridges the gap between theory and practice.
Supervised learning is one of the most widely used ML paradigms in MOF research. It involves training models on labeled data sets to predict specific outcomes, such as gas adsorption capacities, catalytic efficiencies, or electronic properties. This paradigm is particularly useful for performance prediction and optimization tasks.57 Common models used in supervised learning include Support Vector Machines (SVM),58 Random Forests (RF),59 Decision Trees (DT),60 Gradient Boosting Machines (GBM),61 Artificial Neural Networks (ANN),62 k-Nearest Neighbors (k-NN)63 and Bayesian algorithms such as Bayesian Networks (BN).64 These algorithms excel at identifying structure–property relationships and optimizing synthesis conditions to improve material performance.
Unsupervised learning, on the other hand, focuses on discovering hidden patterns in data without requiring labeled outputs.65 This approach is ideal for exploratory tasks, such as clustering MOFs based on structural similarities or identifying trends in performance data sets. Popular unsupervised learning models include K-means,66 hierarchical clustering,67 Gaussian mixture models (GMM),68 and density-based clustering (DBSCAN).69 In addition, dimensionality reduction algorithms such as principal component analysis (PCA),70 t-distributed stochastic neighbor embedding (t-SNE),71 and autoencoders72 can be used to summarize the data by describing it using less information, which is helpful in both supervised and unsupervised learning. These techniques are instrumental in understanding the underlying relationships within complex MOF datasets and generating new hypotheses.
In the field of MOF research for sustainable ammonia production, both supervised and unsupervised learning play complementary yet fundamentally different roles. Supervised learning is primarily used for performance prediction. Regression models such as Random Forest or gradient boosting, for example, are trained using descriptors including metal-node electronic properties, defect density and pore geometry, to predict Faradaic efficiency, NH3 yield rate and selectivity over the HER for EN2RR or ENO3RR systems. These models allow for the rapid virtual screening of thousands of hypothetical MOFs, enabling the identification of promising candidates prior to experimental validation.
In contrast, unsupervised learning is mainly used for structural discovery and pattern recognition. Clustering algorithms and dimensionality reduction techniques, such as k-means or t-SNE, can group MOFs based on latent structural similarities. This reveals hidden correlations between framework topology, functional group distribution and catalytic behaviour, without the need for labelled performance data. Such analyses help to uncover previously overlooked structure–activity trends and guide the exploration of new MOF families with desirable catalytic microenvironments.
Semi-supervised learning bridges the gap between supervised and unsupervised methods by using both labeled and unlabeled data.73 This technique is particularly advantageous in MOF research, where labeled data (e.g., experimental adsorption capacities) can be scarce and expensive to obtain. Models such as self-training74 and co-training75 are commonly used. In addition, transfer learning enables the use of knowledge gained from one prediction task (e.g., MOFs containing metals A, B, and C) to improve model performance on a related but different task (e.g., MOFs containing metals D, E, and F), thereby enhancing generalization and reducing the need for extensive new training data.76 Multi-task learning, on the other hand, facilitates the simultaneous optimization of multiple interrelated goals by leveraging shared representations.77 For example, in MOF-based electrocatalysis, a model could be trained to predict both high Faradaic efficiency (FE) and high NH3 yield (task 1), while simultaneously enforcing constraints related to structural topology (task 2), ultimately leading to a more holistic and efficient design strategy. The integration of these techniques could significantly improve the predictive capabilities of ML models in MOF research, especially for applications that require trade-offs between multiple performance criteria. These methods improve the predictive power of machine learning models while reducing the dependence on large, labeled datasets.
Reinforcement learning (RL) represents a dynamic approach where models learn optimal actions through iterative interactions with an environment.78 In MOF research, RL is applied to guide the discovery of synthesis pathways, optimize material designs, and improve decision-making in experimental setups. Key algorithms in reinforcement learning include Q-Learning,79 Deep Q-Networks (DQN),80 Policy Gradient Methods,81 Actor-Critic Methods,82 and Monte Carlo Tree Search (MCTS).83 These algorithms enable the systematic exploration of vast design spaces and facilitate the identification of optimal solutions for complex problems.
Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Coefficient of determination (R2), and Pearson entropy. MAE measures the average size of errors in predictions and provides a simple interpretation of prediction accuracy.84 It is calculated as the average of the absolute differences between predicted and observed values (
, where yi and ŷi are the true and predicted values, respectively, and n is the number of samples). MSE quantifies the mean squared difference between predicted and actual values
. It serves as the primary metric for assessing prediction accuracy, with smaller MSE values indicating higher accuracy. By penalizing larger errors more severely, MSE provides a robust measure for error analysis, making it particularly effective in identifying models with significant outliers.85 On the other hand, RMSE provides insight into the standard deviation of prediction errors, penalizing larger errors more than smaller ones
.86 This metric is particularly useful for evaluating models where large errors are critical. In addition, R2, also known as the coefficient of determination
(where ȳ is the mean of the observed values), measures the proportion of variance in the dependent variable that is explained by the model.87 Ranging from 0 to 1, R2 values closer to 1 indicate that the model captures a substantial amount of variance, reflecting its predictive power. This metric is essential for understanding the explanatory power of a model and determining its fitness to the data. Furthermore, the Pearson correlation coefficient evaluates the linear relationship between two variables and ranges from −1 to 1. A coefficient closer to 1 or −1 indicates a strong positive or negative correlation, respectively, while a value closer to 0 indicates a weak or no linear correlation.88 This metric is critical in determining the strength and direction of relationships within data, providing insight into underlying patterns. In summary, MAE, MSE and RMSE follow different principles in error evaluation, and their selection depends on the characteristics of the data set, especially the presence of outliers. MAE is less sensitive to extreme values, making it more suitable for datasets with many outliers, while also being easier to interpret. In contrast, MSE and RMSE are preferable when the dataset contains few or no outliers, as they penalize larger errors more severely, thereby emphasizing substantial deviations. Among the two, RMSE is often preferred for interpretability due to its consistency with the original data units. In addition, Mean Absolute Percentage Error (MAPE), although less commonly used in the field, is usually a good metric for summarizing regression predictions as a percentage, which can be useful for explaining the performance of the model to non-specialists (thanks to its high interpretability value). These error metrics provide different insights, with lower error values indicating higher predictive accuracy. When used together, these metrics provide a comprehensive framework for evaluating and improving predictive models, ensuring their robustness, accuracy, and applicability in MOF research contexts.
953 hypothetical MOFs generated from a library of building blocks, with data on properties such as surface area, pore size distribution, and methane storage capacity. (ii) CoRE MOF.91 Contains 14
142 computationally ready, experimentally derived MOFs designed for simulations. This database is widely used for various computational studies. (iii) Cambridge Structural Database (CSD) MOF Subset.92 Provides 69
666 MOFs from the CCDC with calculated properties such as pore volume, surface area, and density. (iv) Boyd Materials Cloud.93 Includes 324
426 hypothetical MOFs screened for CO2/N2 selectivity and work capacity, ensuring accurate electrostatic potential representation. (v) Quantum MOF (QMOF).94 Provides calculated quantum chemical properties for 15
713 experimentally synthesized MOFs, including band gaps, energies, and density of states. (vi) Open DAC Dataset (ODAC).95 Includes extensive DFT calculations for over 8412 MOF materials targeted for direct air capture applications. (vii) ARC-MOF.96 A diverse database of 279, 610 MOFs with detailed geometric, physical, and chemical properties derived from multiple sources. (viii) DigiMOF.97 Generated using a chemistry-aware natural language processing tool that captures over 43
000 MOF-related journal articles and 52
680 associated properties such as synthesis methods and topologies.
In summary, MOF data could be obtained from four main sources: materials databases, scientific literature, theoretical calculations, and experimental results. These databases aggregate data from various sources, including experimental measurements, computational simulations, and scientific literature, providing a comprehensive foundation for ML applications in MOF research. The quality of the data from these repositories is paramount. Experimental data, such as adsorption capacities and catalytic performances, provide empirical insights but can vary due to differences in measurement conditions and techniques.98 Computational simulation data, including DFT calculations, provide theoretical predictions that are consistent within defined parameters, but may not always capture real-world complexities. Therefore, it is essential to ensure data quality through careful curation. This process includes data cleaning to remove inaccuracies, standardization to ensure consistency across datasets, and validation against known benchmarks. Such rigorous curation increases the reliability of ML models trained on these datasets, resulting in more accurate and generalizable predictions.
In practice, the construction of descriptors for MOFs should follow a hierarchical strategy that starts with physically meaningful building blocks and progressively incorporates data-driven screening. At the atomic level, descriptors can be derived from elemental properties, such as electronegativity, oxidation state, d-band filling and coordination number, to capture the intrinsic activity of metal nodes and ligand environments. At the pore level, geometric descriptors such as pore-limiting diameter, accessible surface area, void fraction and pore connectivity encode the effects of mass transport and confinement that govern adsorption and catalytic kinetics. At the topology level, graph-based descriptors reflecting node degree, ring statistics and framework connectivity provide a compact representation of long-range structural organization.
These descriptors, driven by domain knowledge, should be then complemented by data-driven feature engineering. Model-agnostic feature importance analysis (e.g. RF importance or SHAP values) and dimensionality reduction methods (e.g. PCA or autoencoders) can be employed to identify redundant or weakly informative variables. Additionally, graph neural networks facilitate the direct learning of latent structural descriptors from atomic graphs, effectively capturing intricate interactions that are challenging to encode manually. By combining physical intuition with statistical screening iteratively, a reduced set of interpretable and predictive descriptors can be constructed. This improves model generalizability and enables rational, performance-oriented MOF design.
Feature importance analysis, performed using models such as RF or GB, identifies the most influential descriptors for predicting MOF properties.99 These two models are particularly well-suited to tabular data. However, when working with high-dimensional data, such as structured data, more advanced models are required, such as GNNs. Another useful procedure in feature engineering is dimensionality reduction (PCA, t-SNE and autoencoders), as this removes duplicated or non-informative features. These analyses often reveal that geometric descriptors such as pore size and void fraction dominate adsorption predictions, while chemical and energy descriptors play complementary roles, especially under variable conditions.102 Understanding these relationships guides rational MOF design by prioritizing structural features that directly influence desired properties.
Feature engineering plays a vital role in bridging the gap between raw MOF data and high-performance ML models. It can be divided into three sequential stages: feature construction,103 feature transformation,104 and feature selection.103 The first stage, feature construction,105 involves generating new descriptors from raw inputs. For example, elemental properties of metal nodes can be combined with coordination numbers to derive effective catalytic site descriptors. Alternatively, pore size distribution, void fraction, and surface chemistry can be aggregated to describe mass-transport behavior relevant to electrocatalysis. The aim of feature transformation is to normalize or rescale heterogeneous descriptors (e.g. z-score normalization of electronic properties106) to reduce bias during model training. Dimensionality reduction techniques, such as principal component analysis or autoencoders, further compress correlated features into a compact latent space while preserving key structure–property relationships.
Feature selection then identifies the most informative subset of descriptors for model construction. This can be achieved using filter methods (e.g. correlation screening107), wrapper methods (e.g. recursive feature elimination108) or embedded methods based on model-specific importance scores, such as Random Forest feature importance or SHAP values. In MOF-based electrocatalysis, this process enables the isolation of the most relevant descriptors for ammonia production metrics (e.g. metal-node electronic configuration, defect density, and local coordination environment) while discarding redundant geometric variables. This structured feature engineering pipeline enables ML models to achieve improved accuracy, interpretability and generalization, thereby facilitating more reliable prediction and rational design of MOF electrocatalysts.
In addition, graph model-based feature representation further enhances the understanding of structure–property relationships in MOFs. By representing MOF structures as graphs, where atoms serve as nodes and chemical bonds as edges, advanced techniques such as GNNs capture intricate structural and chemical relationships.101 This representation facilitates the extraction of meaningful features, such as connectivity patterns and node-ligand interactions, that are difficult to identify using traditional methods.109 Through this graph-based approach, ML models can uncover correlations between structure and performance, enabling the design of MOFs optimized for specific applications. This integration of descriptor selection and graph modeling represents a powerful strategy for advancing MOF research.
ML serves as a transformative tool to bridge fundamental research with practical applications in MOF design. Using techniques such as classification, regression, clustering, and optimization, ML enables a deeper understanding of structure–property relationships and drives the rational design of MOFs for diverse applications.118 For example, ML-guided property predictions enable researchers to optimize MOFs for gas storage and separation, including membrane technique,119–122 CO2 capture,123 adsorption heat transformation,124–126 CH4 adsorption,127 hydrogen storage,128 water harvesting129,130 and gas separation.131 Similarly, ML facilitates catalyst development by predicting reactivity and stability, while also supporting the design of MOFs for water treatment,132 sensing,133,134 adsorption heat transformation,135,136 and drug delivery.110 The theoretical foundation provided by ML methods integrates seamlessly with application-driven research, accelerating material discovery cycles and providing targeted solutions to real-world challenges. Techniques such as GNNs and deep learning have enabled accurate predictions and insights into key properties such as thermal conductivity, mechanical stability, and adsorption capacities, providing actionable data for performance optimization. Here, ML-assistance in MOF design and synthesis for these applications is illustrated in Fig. 7. Some representative studies are elaborated in Section 3.6 (Linking ML fundamentals to applications) for comprehensive understanding. The intersection between fundamental ML techniques and application-oriented research sets the stage for Section 4 (Challenges and suggested directions), where we explore specific practical implementations of ML in MOF design.
Recently, these ML-guided MOFs showed remarkable potential in a wide range of emerging fields. For example, Wang et al.137 applied ML to design 2D MOFs with high magnetic anisotropy energy (MAE) for magnetic storage. Using the Transition-Metal Interlink Neural Network (TMINN) model, it screened 1440 MOFs and predicted MAE for 2583 additional candidates, identifying 28 high-MAE structures, including 11 novel ferromagnetic MOFs validated by DFT. ML enabled efficient high-throughput screening and achieved accurate predictions with an RMSE of 11.36 meV per atom, demonstrating its potential to accelerate MOF discovery despite reduced accuracy for low-MAE structures. In another study, Wang et al.138 focused on the use of ML to predict the drug loading capacity and cytotoxicity of MOFs, specifically UiO-66-NH2 and MIL-125-NH2, for biomedical drug delivery applications. In this study, ML plays a critical role in streamlining candidate selection, improving performance prediction, and reducing reliance on trial-and-error experimental methods. Furthermore, Anadebe et al.139 investigated the application of Co-MOF (ZIF-67) as a corrosion inhibitor for X65 steel in CO2-saturated environments, addressing critical industrial challenges in preventing steel degradation. The study achieves an impressive corrosion inhibition efficiency of up to 97%, which is attributed to the formation of a dense protective film on the steel surface. ML, specifically ANFIS, played a critical role in predicting inhibition performance, streamlining the analysis of experimental electrochemical data, and optimizing the protective properties of the material.
Specifically, ML-guided MOFs have been widely used in electrochemical catalysis (e.g., CO2 reduction140 and nitrogen reduction reactions), photocatalysis (e.g., hydrogen evolution and pollutant degradation), and biocatalysis (e.g., enzyme immobilization and drug synthesis).38 These applications address critical global challenges such as clean energy production, greenhouse gas mitigation, and sustainable chemical manufacturing.141 ML is accelerating advances in MOF-based catalysis by enabling the prediction of active sites, reaction pathways, and catalytic performance. ML models can identify structure–activity relationships, optimize reaction conditions, and design novel MOFs with improved catalytic efficiencies. In addition, ML facilitates the integration of MOFs into hybrid systems, such as mixed-matrix membranes and composite catalysts, enabling scalable and efficient solutions for industrial catalysis. By leveraging ML, researchers can unlock the full potential of MOFs for catalytic applications, driving advances in sustainable and innovative chemical technologies.
For example, McCullough et al.142 used NU-1000 MOFs as catalytic supports and employed high-throughput experimentation (HTE) and data-driven algorithms to optimize catalytic performance for propyne dimerization. Fig. 9(a) shows an integrated experimental and ML framework for hexadiene yield optimization of over 23 catalytically active metals, totaling 652 experiments. The pie chart categorizes the distribution of metals tested, providing a clear overview of the experimental diversity. The methodology follows a dual model scheme: Principal Component Analysis (PCA) and k-nearest neighbors (KNN) are first used to optimize conditions for the Cu catalyst. XGBoost Random Forest Regression is then applied to predict catalytic performance across different metals, leveraging the insights gained from the Cu optimization. In addition, Fig. 8(a) also highlights the highest hexadiene yields achieved for each metal (represented by black circles) and their corresponding product selectivity. Cu shows the most significant improvement in hexadiene yield under optimized conditions, validating the ML-guided approach. In summary, Bayesian optimization guided experiments for different metals, with Cu-deposited NU-1000 showing significant yield improvements from 0.4% to 24.4% hexadiene. The comprehensive evaluation provides a robust pipeline for rational catalyst design using ML.
![]() | ||
Fig. 8 (a) Summary of the second campaign to optimize hexadiene yield over 23 different catalytically active metals. Reproduced with permission.142 Copyright 2023, American Chemical Society. (b) Evaluation of 16 ML models (75% quantile) with training score, test score, precision score, recall score and f1 score. (c) Structures of representative highly active MOFs predicted by HTS of 12 515 MOFs. Reproduced with permission.143 Copyright 2025, KeAi. | ||
Another representative example was recently reported by Bai et al.143 They investigated the HTS of MOFs as catalysts for CO2 cycloaddition reactions using an explicable ML model. The MOFs include 12
415 hypothetical structures and 100 reported MOFs, with experimental validation of MOF-76(Y). Here, Fig. 8(b) evaluates the performance of 16 ML models for predicting CO2 cycloaddition activity, focusing on metrics such as training score, test score, precision, recall, and F1 score calculated at the 75% quantile. Among these models, RF emerges as the most effective, achieving the highest scores across all metrics. The consistency between training and test scores highlights the model's ability to balance bias and variance, reducing overfitting while ensuring accurate predictions. Fig. 8(c) shows the structures of highly active MOFs identified by high-throughput screening, where Fig. 8(c.1)–(c.8) represent hypothetical MOFs generated by ToBaCCo calculations and Fig. 8(c.9) and (c.10) represent experimentally reported MOFs from the CoRE MOF 2019 database. The selected MOFs exhibit diverse structural and electronic features, including different pore sizes, surface areas, and metal–ligand combinations, which contribute to their catalytic efficiency. This work highlights the potential of combining data-driven methodologies with computational tools for the rapid and accurate discovery of efficient catalysts for CO2 cycloaddition reactions. Further experimental validation of hypothetical MOFs could expand their applicability and address potential limitations of computational assumptions.
To overcome these limitations and improve the reliability of ML models, several strategies can be implemented: (i) data augmentation. Techniques such as mix-up,144 which generate new training samples by interpolating between existing data points, can improve model generalization and robustness. This method introduces controlled variability while preserving meaningful relationships in the data. (ii) Data synthesis. Generative models such as Generative Adversarial Networks (GANs),145 Autoencoders (AEs),145 Variational Autoencoders (VAEs),146 and Diffusion Models147 can be used to generate synthetic MOF data by introducing controlled noise. These models enable the creation of diverse yet realistic MOF structures and properties, filling gaps in underrepresented data distributions. Although generative models are commonly introduced as tools for data augmentation, they have a more transformative capability in inverse materials design. In the context of MOF-based ammonia electrocatalysts, variational autoencoders, generative adversarial networks and graph-based generative models can be trained using existing MOF databases that are annotated with NH3 production metrics, to learn the latent structure–property manifold. Once trained, these models can generate new MOF candidates directly, based on desired performance targets such as high Faradaic efficiency, strong suppression of the HER or enhanced structural stability. In practical workflows, the generated hypothetical MOF structures are first filtered using rapid ML predictors for key descriptors, followed by high-throughput DFT screening to validate the adsorption energetics of N2, NO3− and reaction intermediates. This closed-loop generative–predictive framework transforms ML from a passive prediction tool into an active discovery engine, enabling the rational exploration of vast chemical spaces that would be inaccessible through trial-and-error experimentation alone. (iii) Collaborative databases. Establishing open-access, standardized repositories that integrate experimental and computational data from multiple research groups will improve data accessibility and interoperability. Transfer learning techniques, in particular domain adaptation, can be used to enhance the utility of these databases. Domain adaptation allows models trained in data-rich domains to be adapted for use in data-poor domains, provided there is a meaningful relationship between the two. This approach allows insights from well-characterized MOF properties (e.g., adsorption data) to inform predictions for under-represented properties (e.g., electronic conductivity or mechanical stability). Incorporation of the FAIR (Findable, Accessible, Interoperable, and Reusable) data principles will further facilitate seamless data integration, improve ML model performance and accelerate materials discovery. (iv) Quality control measures. Rigorous data validation protocols, including systematic comparisons between experimental and computational data, can ensure consistency and reliability. The use of anomaly detection algorithms can help identify outliers and reduce noise in training data sets, resulting in more robust ML models. (v) Standardization initiatives. The development of community-driven standards for data collection, reporting, and sharing is essential for seamless dataset integration. Establishing consistent descriptor definitions, measurement protocols, and data formats will facilitate greater interoperability and improve model generalizability.
Several solutions have been proposed to address these challenges. One promising approach is to incorporate graph attention mechanisms into GNNs. These mechanisms assign varying degrees of importance to different nodes (atoms) and edges (chemical bonds) in the graph representation of MOFs, providing a clearer picture of the key structural features that influence model predictions. By highlighting critical interactions within the MOF framework, graph attention mechanisms improve interpretability and guide researchers in identifying design rules for targeted applications. Another promising approach to improve interpretability is knowledge distillation,149 a transfer learning technique in which a complex deep learning model (e.g., DNN) serves as a teacher to train a more interpretable model, such as a DT. While DT are limited in their ability to fully capture the complex knowledge embedded in a DNN, they can extract simplified representations that provide valuable insights into key decision patterns.150 Although this method does not provide a complete explanation of the model's reasoning, it increases transparency by identifying influential structural features and trends, thereby aiding in the rational design of MOFs. In addition, techniques such as SHAP and Local Interpretable Model-agnostic Explanations (LIME) can be applied to post hoc analysis of ML models. These methods decompose predictions into the contributions of individual descriptors, providing insight into the underlying decision process of the model. Visualization tools, such as feature importance plots and attention heatmaps, further aid in understanding the relationship between MOF structures and their properties.
To improve the generalization capability of ML models, four strategies are proposed for future improvement: (i) transfer learning. Transfer learning encompasses several techniques designed to improve model performance by leveraging knowledge from related tasks. One approach is domain adaptation, where a model is initially trained on a dataset with many samples (source domain) and simultaneously adapted to a dataset with fewer samples (target domain) to improve generalization in data-scarce scenarios. Another widely used method is fine-tuning, in which a pre-trained model is not developed from scratch but refined by adjusting specific parameters to adapt it to a new but related task. These transfer learning strategies enable more efficient knowledge reuse, reducing the need for extensive new training data while improving model robustness and applicability to diverse MOF systems. (ii) Available data. Expanding datasets to include a broader range of MOF structures and properties to better represent the chemical and structural diversity in MOF systems. Synthetic data generation through high throughput simulations can also complement experimental data sets. (iii) Model regularization. Incorporation of techniques such as dropout, L1 (Lasso regularization) and L2 (Ridge regularization), and ensemble learning to prevent overfitting and improve robustness across different data sets. (iv) Benchmarking and validation. Regularly testing models on independent datasets or using cross-validation techniques to ensure their applicability to new MOF systems. This includes benchmarking against experimental results or datasets from different sources.
To address these integration challenges, several strategies can be implemented to bridge the gap between ML predictions and experimental workflows. Developing robust ML models that account for experimental variability, such as incorporating uncertainty quantification and error correction mechanisms, can improve prediction reliability (also known as “robust against noise”). Establishing standardized protocols for data collection, reporting, and validation can reduce variability and improve consistency, allowing for better agreement between experimental results and ML predictions. High-throughput experimental setups integrated with ML-guided predictions can accelerate the validation process by testing multiple MOFs under controlled conditions. Iterative learning loops, where experimental results are fed back into the ML model for refinement, can ensure continuous improvement in prediction accuracy. Improving the interpretability of ML models using tools such as SHAP scores can provide actionable insights into predictions, reducing trial-and-error efforts in experimental design. By implementing these strategies, the synergy between ML and experimental studies can be strengthened, leading to faster and more reliable progress in MOF research.
To address these challenges, possible solutions include: (i) knowledge distillation. Simplifying complex models into smaller, more efficient versions without significant loss of accuracy. This approach reduces computational complexity while maintaining predictive performance, making ML models more accessible to resource-constrained researchers. (ii) Model compression. Techniques such as pruning, quantization, and low-rank factorization can reduce the size and complexity of ML models. These methods improve the efficiency of both training and inference processes, reducing hardware requirements and power consumption. It is also noted that model compression and knowledge distillation are usually in hand-to-hand mode. (iii) Transfer learning. Reusing pre-trained models for related tasks minimizes the need for extensive retraining, saving computational resources and time. (iv) Few-shot and zero-shot learning. Few-shot learning allows ML models to learn from a minimal number of data points, reducing the dependence on large data sets and extensive computing resources. Zero-shot learning, on the other hand, allows models to make predictions on unseen data by leveraging cross-domain knowledge, such as textual descriptions or structural similarities. These techniques improve model adaptability while reducing training time and energy consumption. (v) Efficient algorithms. Developing lightweight algorithms optimized for MOF datasets, such as sparse matrix operations or low-complexity neural architecture, can further reduce computational requirements.
In addition to inverse design, experimental validation remains a critical step to ensure the reliability and practicality of inverse design results. This involves synthesizing the proposed MOFs, characterizing their properties, and comparing them to predicted values. However, the iterative nature of this process can be resource intensive, especially for complex MOF designs.152 High-throughput experimental techniques, automated synthesis platforms, and advanced characterization methods are being developed to mitigate these challenges and enable faster and more efficient validation cycles.
To address these limitations, future work could focus on (i) reinforcement learning for synthesis routes. Incorporating reinforcement learning into ML models can help identify feasible synthesis routes, making experimental realization of designed MOFs more practical. (ii) Active learning approaches. Iterative integration of experimental results into ML models can refine predictions and reduce the number of synthesis trials required. (iii) Hybrid models. Combining data-driven ML approaches with physics-based simulations can improve the accuracy of predictions by incorporating fundamental material properties and interactions. (iv) Collaborative validation platforms. Establishing shared resources for synthesis and characterization, supported by a network of research laboratories, can streamline the experimental validation process.
One of the most critical applications of ML in MOF research is the prediction of material stability under various conditions, such as thermal, chemical, and mechanical stress.153 Stability predictions often require models to analyze structural features such as metal–ligand coordination strength, pore size, and framework flexibility. However, these predictions are challenging due to the lack of comprehensive experimental stability data and the need to account for dynamic interactions within the MOF framework. Another focus is on predicting the synthesizability of MOFs – whether a theoretically designed structure can be practically synthesized. This involves evaluating factors such as reaction conditions, precursor availability, and the likelihood of achieving the desired topology. ML models trained in synthesis data can provide valuable guidance but can be inaccurate due to missing or inconsistent experimental data.
Here are some practical strategies for improving long-term accuracy: (i) continuous model updating. Ensuring that models remain accurate over time requires continuous updating with new and diverse data. This can be achieved through lifelong learning, a subfield of ML that focuses on incrementally adding knowledge to an existing model without forgetting previously learned patterns. In addition, machine learning operations (MLOps) provide a structured approach to deploying and maintaining ML models, enabling automated retraining and evaluation as new experimental and computational data become available. Integrating high-throughput experimental and computational pipelines into MLOp frameworks can further enhance model robustness and adaptability. (ii) Transfer learning. This broad category of techniques leverages existing data and models to improve performance for specific MOF-related tasks. One commonly used method is fine-tuning, where a pre-trained model is adapted to a new application by adjusting a subset of its parameters. More generally, transfer learning enables knowledge transfer from well-studied domains to understudy MOF applications, reducing the need for extensive training data while improving model accuracy and generalization. (iii) Synthetic data. Generating synthetic data, such as through simulations or generative models, can expand training data sets and fill gaps in experimental data, especially for rare or complex MOF properties. (iv) Active learning. Active learning improves model efficiency by interactively selecting the most informative data points for human or expert annotation. Instead of passively training on static data sets, the model “asks questions” by identifying uncertain predictions and requesting validation from an oracle (e.g., a researcher or an automated high-fidelity simulation). This targeted approach prioritizes data collection for uncertain regions, significantly improving model accuracy with minimal additional data. (v) Validation with experimental feedback. Integrating iterative feedback loops between ML predictions and experimental results helps identify and correct inaccuracies, ensuring that models remain aligned with real-world performance.
Specifically, there are a few key elements missing from current workflows. First, there are the data pipeline issues. Many studies lack a clear and standardized process for collecting, cleaning, and curating data sets. Issues such as inconsistent formats, missing metadata, and unverified data quality can undermine the reliability of ML models. Second, researchers often use different algorithms and hyperparameter tuning methods for model selection and training without a consistent standard, leading to inconsistent results for similar problems. Third, for validation and reporting, few workflows emphasize rigorous validation using independent data sets or cross-validation techniques. In addition, there is a lack of transparency in reporting key metrics and model limitations. Next, integration with experimental studies is still not fully explored in the reported literature. The lack of a feedback loop between ML predictions and experimental results limits the ability to refine models and validate their applicability in the real world. Finally, there is no standard approach to presenting ML workflows and results in publications, leading to gaps in reproducibility and knowledge dissemination.
To address the above issues, a comprehensive and universal ML workflow for MOF research should begin with standardized protocols for data collection and curation, ensuring that datasets are well documented, consistently formatted, and annotated with metadata to enhance reproducibility. Data preprocessing should include automated tools for cleaning, normalization, and augmentation to address missing values, outliers, and inconsistencies. Feature engineering is critical and should include guidelines for selecting and generating descriptors, such as structural, chemical, and functional properties of MOFs, to improve model performance. The model development process should rely on standardized frameworks for algorithm selection, model training, and hyperparameter optimization, with transparent criteria to guide decisions. Rigorous validation techniques, including cross-validation, independent dataset testing, and uncertainty quantification, are essential, along with publication of validation metrics and benchmark results for reproducibility. Integration with experimental studies should include iterative feedback loops in which experimental results refine model predictions to ensure practical relevance. In addition, interpretability and explainability must be prioritized using tools such as SHAP scores or attention maps to increase confidence in ML models. Finally, the workflow should emphasize a standardized format for reporting methods, datasets, and results in publications, with open access repositories facilitating collaboration, reproducibility, and further innovation.
| φ | Porosity |
| ρ | Density |
| ADCC | Attention-densely connected conventional network |
| ANFIS | Adaptive neuro-fuzzy inference systems |
| APS | Adsorbent performance score |
| ASA | Accessible surface area |
| AUC | Area under the curve |
| AV | Accessible volume |
| BPNN | Back propagation neural network |
| CFFNN | Cascade feedforward neural networks |
| CFIDs | Force-filed-inspired descriptors |
| CGCNN | Crystal graph convolutional neural network |
| CMIS | Committee machine intelligence system |
| CoRE | Computation-ready, experimental |
| ELM | Extreme learning machine |
| ERT | Extremely randomized trees |
| ET | Extra trees |
| ETR | Extra trees regressor |
| DCNN | Deep convolutional neural network |
| DFT | Density functional theory |
| DNNs | Deep, feed-forward neural networks |
| DOE | Department of energy |
| DT | Decision tree |
| FAIR | Findable, accessible, interoperable, and reusable |
| FNN | Feedforward neural networks |
| GANs | Generative adversarial networks |
| GBDT | Gradient boosting decision trees |
| GBRT | Gradient-boost regression tress |
| GCMC | Grand canonical Monte Carlo |
| GM | Volumetric uptake using geometric data |
| GNN | Graph neural networks |
| GRNN | General regression neural networks |
| GPR | Gaussian process regression |
| GSA | Gravimetric surface area |
| GUM | Predicts gravimetric uptake |
| HTCS | High-throughput computational screening |
| HTE | High-throughput experimentation |
| KNN | k-Nearest neighbors |
| LAMOFS | Long-arm metal–organic frameworks |
| LCD | Largest cavity diameter |
| LDA | Linear discriminant analysis |
| LDH | Layered double hydroxide |
| LGBM | Light gradient boosting machines |
| LIME | Local interpretable model-agnostic explanations |
| LSSVM | Least squares support vector machines |
| MAE | Mean absolute error |
| MCGCNN | Modified crystal graph convolutional neural network |
| MCTS | Monte Carlo tree search |
| MD | Molecular dynamics |
| ML | Machine learning |
| MLOps | Machine learning operations |
| MLP | Multilayer perceptron |
| MLPNN | Multilayer perceptron neural networks |
| MOF | Metal–organic framework |
| MSE | Mean squared error |
| NCA | Neighbor component analysis |
| NGM | Volumetric uptake without geometric data |
| PCA | Principal component analysis |
| PCC | Pearson correlation coefficient |
| PLD | Pore-limiting diameter |
| PSO | Particle swarm optimization |
| Q st | Heat of adsorption |
| RAE | Relative absolute error |
| RDF | Radial distribution function |
| RF | Random forest |
| RFC | Random forest classifiers |
| RL | Reinforcement learning |
| RMSE | Root mean squared error |
| RNN | Recurrent neural network |
| RR | Ridge regression |
| RRSE | Root relative squared error |
| SBUs | Secondary building units |
| SHAP | Shapley additive explanation |
| SVM | Support vector machine |
| TMINN | Transition-metal interlink neural network |
| TPOT | Tree-based pipeline optimization tool |
| UMAP | Unified manifold approximation and projection |
| VCD | Volumetric deliverable capacity |
| VF | Void fraction |
| VOC | Volatile organic compounds |
| VSA | Volumetric surface area |
| XAI | Explainable artificial intelligence |
| XGB | Extreme gradient boost |
| This journal is © The Royal Society of Chemistry 2026 |