Open Access Article
Marco
Anselmi
a,
Gregory
Slabaugh
*b,
Rachel
Crespo-Otero
*c and
Devis
Di Tommaso
*a
aDepartment of Chemistry, Queen Mary University of London, Mile End Rd, Bethnal Green, London, E1 4NS, UK. E-mail: d.ditommaso@qmul.ac.uk
bDigital Environment Research Institute, Queen Mary University of London, Empire House, 67–75 New Road, London, E1 1HH, UK. E-mail: g.slabaugh@qmul.ac.uk
cDepartment of Chemistry, University of College London, 20 Gordon Street, London, WC1H 0AJ, UK. E-mail: r.crespo-otero@qmul.ac.uk
First published on 2nd January 2026
The discovery of Metal–Organic Framework (MOF) photocatalysts for CO2 reduction is hindered by the computational cost of quantum chemical screenings. To overcome this barrier, we introduce a Machine Learning (ML)-accelerated workflow that integrates the speed of ML with the accuracy of Density Functional Theory (DFT). While a DFT-based screening of over 20
000 MOFs identified 105 promising candidates in nearly a month, a ML-driven approach using the Molecular Graph Transformer (MGT) required only 4.5 hours. Here, we present a quantitative assessment of ML performance compared with hybrid DFT for MOF electronic screening, showing that prediction errors are related to the chemistry of the MOFs. We therefore derive an error-aware ML candidate selection strategy that raises DFT candidate recovery from 20% to 70% while keeping a sensible selection set. Building on this, we propose a practical ML to DFT screening workflow in which ML serves as a fast pre-filter to define a small subset for hybrid DFT evaluation, enabling efficient discovery of promising MOFs.
A photocatalytic reaction such as PCO2R generally occurs in three steps.14–16 (i) Photon absorption: in the first step, photons with energy equal to or greater than the optical gap energy are absorbed by the photocatalyst. This absorption excites electrons from the occupied orbitals to the unoccupied orbitals, creating holes in the highest occupied molecular orbital. (ii) Electron–hole dynamics: in the second step, the excited electrons and the created holes can either recombine within the catalyst or migrate to the surface. (iii) Surface reactions: in the third step, the electrons and holes that reach the surface participate in oxidation and reduction reactions.
The electronic structure of a photocatalyst is a key factor in determining its suitability for PCO2R, particularly the positions of the Highest Occupied Molecular Orbital (HOMO) and Lowest Unoccupied Molecular Orbital (LUMO), as well as the bandgap between them. To enable direct comparison with the redox potentials of the Oxygen Evolution Reaction (OER) and the CO2 Reduction Reaction (CO2RR) for a specific product, the HOMO and LUMO energy levels are converted to Ionization Potential (IP) and Electron Affinity (EA), respectively, by alignment with the vacuum energy level. This comparison assesses whether the photocatalyst has the necessary energetic properties for the CO2RR to take place.17,18
Metal–organic frameworks (MOFs) are a class of crystalline porous materials constructed from organic linkers and metal ions/clusters, known as secondary building units (SBUs). The various combinations of these components can lead to an almost infinite number of structures with diverse functionalities. Through fine-tuning, it is possible to create and screen materials with easily accessible catalytic sites and desirable electronic structures.19–21 Due to their unique structural properties, such as high surface area, high porosity, tuneable morphology, and adjustable chemical composition, MOFs are highly suitable for various CCU applications,22 including photocatalytic CO2 reduction.5–7,23,24
Nevertheless, the practical deployment of MOF photocatalysts is currently hindered by significant experimental challenges. Foremost among these are the poor stability of many frameworks in the aqueous environments required for CO2 reduction and the complexity of synthesis, which limits scalability.25 From an electronic perspective, efficiency is often stifled by rapid electron–hole recombination and light absorption spectra that are typically confined to the ultraviolet range, a limitation shared with traditional photocatalysts such as TiO2.26 To address these limitations, this study utilizes a computational screening approach centred on band-edge engineering. Unlike previous screenings that utilize broad energetic windows, we use a visible-light based bandgap window of 1.9–2.5 eV. By prioritizing this range, we aim to identify candidates that maximize solar energy utilization efficiency while maintaining the energy requirements to drive the thermodynamically challenging CO2 reduction.
Quantum chemical methods, typically within the framework of Density Functional Theory (DFT), can be used to determine the electronic structure of MOFs, including the energy positions of the HOMO, LUMO, and band gap. This information is crucial for assessing their potential as PCO2R catalysts. However, DFT calculations of band gaps generally require the use of hybrid DFT functionals such as HSE06 (ref. 27) and B3LYP3,28 rather than standard GGA functionals such as PBE,29 which tend to underestimate band gaps.30 The large unit cells required to model MOFs mean that a significant number of atoms must be considered, making the application of DFT, especially hybrid DFT methods, computationally very demanding. These calculations can take weeks to months to complete when applied to large datasets with tens or hundreds of thousands of structures, and because of its complexity, the explicit calculation of the excited states in MOFs hasn't been extensively explored in the literature.31
Recently, Machine Learning (ML) models have been gaining popularity as a methodology to accelerate the discovery of new materials. ML models can be used to predict properties of structures, such as formation energy, potential energy surfaces and electronic properties, like the bandgap, of various materials.32–38 Nevertheless, MOFs have seen less attention in the literature due to the lack of substantial electronic structure–property datasets, with only three contributions found for this study.39–41
In a major contribution to the field of ML prediction of MOF bandgaps, Rosen et al.40 introduced a new dataset called the Quantum MOF (QMOF) database and an analysis of the performance of five different ML representations on this dataset, namely the Sine Coulomb matrix,42 the stoichiometric, the orbital field matrix, the Smooth Overlap of Atomic Positions43 (SOAP) and the Crystal Graph Convolutional Neural Network33 (CGCNN) representations. The QMOF database introduced by Rosen et al.40 contains the computed properties for 15
713 MOFs after structure relaxation via DFT, which they collected from the Cambridge Structural Database MOF subset44 and the 2019 Computationally Ready MOF database,45 with the aim of creating a database for the development of novel ML algorithms for MOFs. The properties reported included, but were not limited to, relaxed geometries, bandgaps, HOMOs, LUMOs, charge densities, density of states, partial charges, spin densities and bond orders. Since its inception, the database has been expanded to contain the properties for 20
375 MOFs.
Unlike earlier MOF ML models such as CGCNN,33 which primarily capture local atomic connectivity, our approach employs the Molecular Graph Transformer46 (MGT) architecture to address the unique challenges posed by MOFs. MGT uses a multi-graph representation that encodes local bonded interactions, many-body effects, and long-range electrostatic interactions through a global graph derived from Coulomb matrix features. This design is particularly suited to MOFs with large unit cells and extended pore networks, where non-bonded interactions strongly influence band edges and band gaps. By combining local attention with message passing across these graphs, MGT achieves improved accuracy (average error 0.34 eV for bandgap, EA, and IP) and enables an error-aware ML to DFT workflow that accelerates screening while maintaining chemical interpretability.
This study involves a high-throughput screening of the QMOF database to identify potential photocatalysts for CO2 reduction using vacuum-aligned HOMOs and LUMOs (band edges), together with the bandgap, and employs both hybrid DFT and ML techniques throughout. Initially, 105 promising photocatalyst candidates were identified via hybrid DFT calculations. The electronic-structure data resulting from these calculations were then used to construct a training dataset for the ML model. A subsequent ML based screening was then performed to predict vacuum-aligned band edges, yielding a set of 45 candidates. The outcomes of DFT and ML high-throughput screening were then compared to assess the feasibility of ML as a stand-alone approach. In doing so, the work provides a systematic evaluation of ML screening versus DFT for MOF electronic structure prediction, including an analysis of prediction error with respect to the chemistry of MOFs. Using these insights, we propose an error-informed ML-to-DFT workflow, in which ML serves as a fast pre-filter for the selection of a small subset of candidates for hybrid-DFT evaluation, thereby substantially accelerating the screening speeds of MOF photocatalyst candidates.
The VASP calculations were run on a mixture of GPU and CPU nodes on the High Performance Computing (HPC) Hub in Materials and Molecular Modelling Young (MMM Hub Young), on Queen Mary's Apocrita HPC facilities supported by QMUL Research-IT, and on the Sulis Tier 2 HPC platform. The alignment calculations, on the other hand, were run using 8 CPU cores and 16 GB memory on the Apocrita HPC nodes.
In this step, the collection of DFT computed bandgaps is essential. To accelerate the process, utilizing the generalized gradient approximation (GGA) PBE functional is recommended. Although the PBE functional is less accurate than hybrid functionals like HSE, its computational efficiency makes it preferable for large datasets. Notably, since the only property needed for this step is the bandgap, and a linear relationship exists between the bandgaps obtained using the GGA PBE functional and those obtained using the HSE06 Hybrid functional,49 this relationship can be expressed as:
| Eg,HSE = 1.09 Eg,PBE + 1.04 eV | (1) |
Similar linear equations can also be derived for other hybrid functionals, reinforcing the preference for using the PBE functional in this step.
In this regard, a techno-economic evaluation of PCO2R revealed that the production of C1 products is more economically competitive compared to the production of C2 products, which entails higher energy requirements, longer reaction times, and more complex processes that contribute to increased production costs.50 Consequently, this study focuses on three key C1 products: methane (CH4), methanol (CH3OH) and formic acid (HCOOH). Among these, the smallest energy gap between the redox potentials for the OER and the reduction to a specific product is 1.88 eV, while the largest gap is 2.25 eV. However, practical considerations necessitate adjusting the required bandgap values. Overpotentials and losses in current efficiencies must be accounted for, leading to higher bandgap requirements for catalyst selection.17
Furthermore, the ability of a material to absorb light depends on the energy of incident photons, which should match or be higher than the bandgap of the material. For instance, TiO2, one of the most studied photocatalysts, has a bandgap of 3.2 eV, limiting its light absorption to the ultraviolet range, which constitutes less than 5% of the solar spectrum.51 Thus, to maximize light absorption, it is essential to tune the bandgap.
Considering all these criteria, an ideal bandgap for efficient photocatalysis would fall within the range of 1.9 eV and 2.5 eV. To account for bandgap underestimation errors caused by the PBE functional used in the calculation of bandgaps in the QMOF database, we adjust the bandgap selection criteria to 0.8 eV and 1.34 eV using eqn (1).
To enhance accuracy, a hybrid functional can be created by incorporating a percentage of exact exchange computed using the Hartree–Fock (HF) exchange functional.52 In the study, single-point calculations were performed for bandgaps, band edges, and local potential using the HSE06 functional. The input structures were obtained from the QMOF database, and the HSE06 setup involved a 25% addition of HF exchange and an inter-electronic range of 0.2 Å−1 for applying the HF exchange.
The methodology outlined in this study addresses this issue by adjusting the bandgap and band edges to the energy level of the vacuum. Traditionally, band alignment involves referencing the energy of the vacuum above the surface of the material. However, for MOFs, direct calculations of the surface electronic structure can be computationally demanding due to their large unit cells containing hundreds of atoms. To overcome this challenge, the approach proposed by Butler et al.53 uses the electrostatic potential at the centre of MOF pores as an approximation for the vacuum level. By using this reference point, electronic energy levels of MOFs can be placed on a common energy scale.
To determine the pore centre required for the vacuum energy approximation, the Pore Size Distribution (PSD) method introduced by Trepte and Schwalbe54 is employed. This approach utilizes a Monte Carlo procedure, generating random points within the MOF unit cell. At each step, these points are adjusted to maximize their distance from the atoms in the MOF, ultimately identifying the pore center and diameter.
The combination of these two techniques allows consistent alignment of electronic energy levels across MOF structures, facilitating informed material selection for efficient CO2RR.
| H2O → 2H+ + 1/2O2 + 2e−E = −5.67 eV |
| 2H+ + 2e− → H2E = −4.44 eV |
In the case of water splitting photocatalytic CO2RR, as the redox potentials for the conversion of CO2 into added value chemicals are higher than those of both the OER and HER, the required bandgap of the MOF will need to be larger than that required for water splitting, but not too wide, in order to allow for the absorption of photons. The bandgap edges will then need to straddle the OER and the redox potential for the CO2RR to the desired product, with the valence band below the OER and the conduction band above the CO2RR redox potential.
| CO2 + 8H+ + 8e− → CH4 + 2H2O E = −3.79 eV |
| CO2 + 6H+ + 6e− → CH3OH + H2O E = −3.65 eV |
| CO2 + 2H+ + 2e− → HCOOH E = −3.42 eV |
375 data points in the QMOF database. Hybrid DFT calculations were then performed on these structures to obtain more accurate bandgaps and band-edges (Table S3). Subsequently, the band alignment step was carried out. The DFT calculations for all 2169 structures required a total of 24 days of CPU time, as determined from the VASP output files.
From these 2169 MOFs, 1132 structures exhibited an EA higher than the CO2R redox potential (for the CO2RR with the highest gap requirement) and an IP lower than the water OER. However, by applying the ideal bandgap range of 1.9 eV to 2.5 eV (refer to Section 2.1.2), the screening was further refined to identify 105 structures that meet the criteria to be suitable as catalysts for at least one of the CO2R reactions, producing either CH4, CH3OH or HCOOH.
Out of these 105 structures, 61 have been synthesised, with their structures available on the CSD dataset. Among these, qmof-e100550 (ref. 55) presented a experimentally calculated bandgap of 2.23 eV, qmof-d6d8373 (ref. 56) has a reported bandgap of 3.1 eV, lastly for qmof-306fb3e,57 its bandgap was measured to be 2.49 eV. Meanwhile, qmof-3eec122 (ref. 58) doesn't have a reported bandgap; it has, however, been tested for the electrocatalytic oxygen reduction reaction and for CO2 trapping. The remaining 44 structures are computationally theorized compounds sourced from various databases.59,60
Fig. 3 shows the bandgaps and band edges positions for the structures that are suitable for all three CO2R reactions. Among these MOFs, qmof-e581c0e exhibits the lowest bandgap of approximately 1.97 eV, allowing for greater light absorption. Among these, 13 are suitable for the reduction to formic acids, 42 structures are suitable for the reduction to methanol, while the remaining 43 are suitable for the reduction to methane.
By further analysing the chemical nature of the structures that exhibit high-error predictions, it is revealed that the cases in which the model fails are not random but related to chemical complexity. This analysis identifies three specific structural features in MOFs that consistently lead to large prediction errors for key electronic properties including IP, EA, and bandgap: (i) the presence of multi-metal centres (e.g. qmof-6e89a67 with CuNa, qmof-cfee5a8 with Cu2Yb2, and qmof-8281b85 with Cd2Ni2); (ii) the inclusion of heavy d-block and f-block elements; and (iii) the incorporation of halides in the ligands.
A detailed breakdown of error distribution by metal type (Fig. 5 and Table S2) further demonstrates the correlation between the inaccuracies of the model and the chemical environment. Alkali metals show a trend where the average error on unseen data (0.33 eV) is lower than on seen data (0.41 eV). This is likely due to the fact that these metals are very similar to each other and all show stable oxidation states of either 0 or +1. Meanwhile, transition metals, which comprise the majority of the dataset, consistently show a mediocre performance (0.42 eV) for both seen and unseen data, suggesting that the model struggles to capture the variable oxidation states and diverse coordination geometries of d-block metals.
Post-transition metals are similar in performance to transition metals, but they also show a clear trend of being more reliable for lighter metals (e.g. Al with 0.28 eV error) than for heavy ones (e.g. Ga with 0.74 eV error). Lastly, lanthanides display the largest disparity: they have the lowest error of any group during training (0.18 eV) but the highest error when unseen (1.09 eV). This is likely due to the fact that lanthanide complexes have a wide range of coordination geometries, which indicates that the model is likely overfitting to the specific coordination environments of seen lanthanide clusters. Thus, the model is effectively memorizing the geometry rather than learning the underlying physics of f-orbitals, leading to significant failures when encountering new elements such as ytterbium. Therefore, the significant outliers in the parity plots (Fig. 4) are primarily associated with these complex electronic phenomena, ranging from the challenge of d-orbital directionality to the poor generalisation of f-electron correlation.
From the results obtained using the MGT model on the test set for this study, 43 MOFs were identified as suitable catalysts for the CO2R reactions producing either CH4, CH3OH, or HCOOH. Out of the 43 selected structures, 20 have been synthesized and studied in experimental papers. The remaining 23 are computationally theorized compounds. For all 43 structures, their bandgaps and band edges have been plotted in relation to the redox potentials for the OER and CO2R reactions (Fig. 6). Among these MOFs, qmof-8b03127 exhibits the lowest bandgap of approximately 2.06 eV, allowing for greater light absorption. Among all the structures, 5 are suitable for the reduction to formic acids, 22 are suitable for the reduction to methanol, while the remaining 16 are suitable for the reduction to methane.
min ≤ Eg ≤ Eg,
max, and the ML screening would accept candidates satisfying:| Eg,pred > Eg,min − δ and Eg,pred < Eg,max + δ | (2) |
Similarly, for the band edges, where the IP must be lower than the water OER (VOER) and the EA higher than the CO2R redox potential (VCO2R), the ML screening criteria can be expanded to accept candidates satisfying:
| IPpred < VOER + δ and EApred > VCO2R − δ | (3) |
By accounting for the model's uncertainty in this manner, the screening becomes more in line with the DFT-based screening. As shown in Table 1, adjusting the selection criteria by the error of the model (δ = 0.34 eV), successfully recovers 69% of the DFT-selected structures. This highlights a clear trade-off between computational cost and scientific accuracy. Regarding the types of DFT hits missed by ML, we find that DFT selected candidates are most often missed by ML in two regimes: (i) near threshold band edge cases, where vacuum aligned IP/EA lie within the model’s MAE of the redox or band gap limits, leading to classification flips unless thresholds are widened by δ ≈ 0.34 eV (see eqn (2) and (3) and Table 1); and (ii) chemically complex secondary building units (SBUs), notably transition metals with partially filled d shells, lanthanides/actinides, multi metal centres, and halide bearing ligands, where pDOS indicates metal dominated edges and error diagnostics show poor generalisation. Accordingly, ML only screening is generally reliable for alkali/light post transition SBU, single metal frameworks and for candidates whose predicted edges sit well inside the selection window, whereas hybrid DFT validation is advisable for f block/actinide systems, multi metal nodes, TM dominated edges, and borderline alignment cases.
| Screening strategy | Selection criteria | Candidates selected (% of test set) | DFT candidate recovery (“hit percentage”) |
|---|---|---|---|
| Raw ML predictions | Use unchanged selection criteria | 43 (7%) | 21 (20%) |
| ML error margin | Widen window by the error of the model | 204 (31%) | 73 (69%) |
| Conservative margin | Widen window by 0.5 eV | 293 (45%) | 94 (89%) |
| High-recall margin | Widen window by 1.0 eV | 470 (72%) | 103 (98%) |
Beyond the selection of structures, a more fundamental methodological difference lies in the accuracy and internal consistency of the electronic property values obtained from DFT versus ML. In DFT calculations, for instance, the bandgap is derived directly from the computed HOMO and LUMO levels, and the alignment to the vacuum level also utilizes these HOMO and LUMO values. This interconnectedness ensures that all calculated electronic properties are consistent; a shift in the HOMO or LUMO energy propagates through to the bandgap, IP, and EA. In contrast, the machine learning approach predicts each property (HOMO, LUMO, bandgap, IP, and EA) independently. As a result, each prediction has its own distinct error relative to the reference DFT values. This independence can lead to discrepancies: the bandgap predicted directly by the ML model may differ from a bandgap calculated as the difference between the predicted HOMO and LUMO levels. Similarly, these values might also diverge from a bandgap inferred from the predicted EA and IP. Due to the nature of machine learning algorithms, these inconsistencies cannot be entirely eliminated, although they can be reduced by developing more accurate models.
These inconsistencies, however, lead to another problem. When evaluating candidate photocatalysts, it is often necessary to consider more than just the bandgap, HOMO, and LUMO levels. For instance, our current study reveals that the Density of States (DOS) at the HOMO and LUMO levels is predominantly localized on the organic linkers when the metals are alkali (Fig. 7a), post-transition metals (Fig. 7b), lanthanides (Fig. 7c), or Group 12 metals (Fig. 7d). Conversely, in MOFs with transition metals featuring partially filled d-shells (Fig. 7e), the DOS at the HOMO can shift significantly onto the metal center (Fig. 7). MOFs with Actinide metal centers (Fig. 7f) appear to localize the LUMO DOS onto the metal. Notably, when multiple transition metals with partially filled d-shells are present (Fig. 7g), the DOS at both the HOMO and LUMO levels shifts onto the metal centers.
This observation highlights the critical role of the organic linker in defining the foundational electronic structure of a MOF, a concept explored by Grau-Crespo et al. for zeolitic imidazolate frameworks (ZIFs).61 Our findings align with their conclusion that the positions of the band edges are largely determined by the linkers. However, while Grau-Crespo et al. also reported a direct correlation between the bandgaps of the isolated linkers and the energy difference between the Highest Occupied Crystal Orbital (HOCO) and the Lowest Unoccupied Crystal Orbital (LUCO) of the corresponding ZIFs, this specific relationship is not apparent in our broader dataset (Fig. 8). Nevertheless, the principle that metals modify these linker-defined band edges to achieve a desired electronic structure holds true. The comparison plots (Fig. 8) clearly show that transition metals and multi-metal systems cause the largest deviations from the linker-only properties, affecting the bandgap, HOMO, and LUMO. Actinides and lanthanides also show a strong influence, particularly on the LUMO energy, while having less effect on the HOMO. In contrast, MOFs with alkali and post-transition metals stick closest to the electronic properties of their linkers, generally showing the smallest influence. As also suggested by Grau-Crespo et al., transition metals can play a crucial role by providing alternative sites for excited electrons or holes, thereby promoting electron–hole separation and increasing the lifetime of the excited state by reducing recombination rates. Grau-Crespo et al. also discussed that charge separation, and thus potentially slower recombination, can also be achieved in ZIFs by using multiple linker types, where one type primarily contributes to the HOMO and another to the LUMO.
Performing a comparable detailed analysis of linker contributions and metal–ligand interactions using machine learning presents significant challenges. It would require the prediction of additional properties, each introducing its own error margin. Furthermore, obtaining detailed electronic data for isolated ligands would likely require a separate ML model trained specifically for that purpose. The accumulation of errors from these multiple steps could render the overall comparison too uncertain to yield any meaningful conclusions.
Design trends across the 105 DFT-selected and 43 ML-selected MOFs reveal qualitatively a linker first, metal tune paradigm: organic linkers broadly set the baseline band edge positions, while the choice of metal centres (especially d and f block elements and multi metal nodes) introduces controllable deviations that tune the HOMO/LUMO alignment and the band gap within the visible light window. Alkali and light post transition metals preserve linker dominated edges and simplify alignment to OER/CO2RR potentials, whereas transition metals and actinides shift edge densities onto the metal, enabling band edge tuning and charge separation at the cost of greater variability that requires hybrid DFT validation. Halide-bearing ligands and multi-metal centres correlate with higher ML uncertainty, so error aware selection windows are essential when ML is used as a pre filter.
Therefore, directly replacing DFT with ML is not yet feasible. Instead, an integrated workflow is essential. ML should serve as a high-speed engine for initial screening, the effectiveness of which hinges on the screening strategy. We demonstrate that applying a margin based on the error of the model to the selection criteria is crucial for creating a candidate pool that is both computationally feasible and can include significant percentage of candidates that would be selected with a DFT screening. To build more confidence into the candidate screening process, ML models could also be enhanced with uncertainty quantification. Methods such as Monte Carlo dropout and model ensembling can show how certain a prediction is, allowing the selection of potential candidates with greater certainty. DFT calculation can then be performed on these ML-identified candidates for validation, accurate electronic structure determination, and detailed analysis. This workflow can leverage speed of the MGT and accuracy of hybrid DFT calculations.
Looking forward, the full potential of ML in materials science will be unlocked by developing models capable of preserving physical interdependencies between properties. For instance, using a consistency loss that enforces a relationship between the predictions could potentially reduce inconsistencies and provide better bandgap predictions. Until then, an integrated workflow, which leverages the speed of ML and the accuracy of DFT, offers the most powerful and pragmatic strategy to accelerate the design and discovery of next-generation photocatalysts for a sustainable future.
| This journal is © The Royal Society of Chemistry 2026 |