Open Access Article
Maitreyo
Biswas
,
Rushik
Desai
and
Arun
Mannodi-Kanakkithodi
*
School of Materials Engineering, Purdue University, West Lafayette, IN 47907, USA. E-mail: biswasm@purdue.edu; amannodi@purdue.edu
First published on 13th August 2024
Photocatalytic water splitting is an efficient and sustainable technology to produce high-purity hydrogen gas for clean energy using solar energy. Despite the tremendous success of halide perovskites as absorbers in solar cells, their utility for water splitting applications has not been systematically explored. A band gap greater than 1.23 eV, high solar absorption coefficients, efficient separation of charge carriers, and adequate overpotentials for water redox reaction are crucial for a high solar to hydrogen (STH) efficiency. In this work, we present a data-driven approach to identify novel lead-free halide perovskites with high STH efficiency (ηSTH > 20%), building upon our recently published computational data and machine learning (ML) models. Our multi-fidelity density functional theory (DFT) dataset comprises decomposition energies and band gaps of nearly 1000 pure and alloyed perovskite halides using both the GGA-PBE and HSE06 functionals. Using rigorously optimized composition-based ML regression models, we performed screening across a chemical space of 150
000+ halide perovskites to yield hundreds of stable compounds with suitable band gaps and edges for photocatalytic water splitting. A handful of the best candidates were investigated with in-depth DFT computations to validate their properties. This work presents a framework for accelerating the navigation of a massive chemical space of halide perovskite alloys and understanding their potential utility for water splitting and motivates future efforts towards the synthesis and characterization of the most promising materials.
Photoelectrochemical (PEC) water splitting is one of the most efficient approaches to extract high-purity hydrogen. PEC water splitting aims to split the H2O molecule into H2 and O2via two half-reactions:
(i) Hydrogen evolution reaction (HER):
| 2H+ + 2e− → 2H2 |
(ii) Oxygen evolution reaction (OER):
| 2H2O → O2 + 4H+ + 4e− |
Materials suitable for water splitting should have a band gap greater than 1.23 eV to overcome the thermodynamic barrier of the endothermic water splitting reaction.1 They should also have a straddling band alignment, i.e., the conduction band minimum (CBM) should be above the reduction potential of H+/H2 and the valence band maximum (VBM) should be below the oxidation potential of O2/H2O to allow the HER and OER respectively to take place.2 Incident photons excite electrons to the CB leaving behind holes in the VB, forming electron–hole pairs. The electrons in the CB facilitate the HER whereas holes take part in the OER.
TiO2 is the most extensively studied photocatalyst because of its photo-chemical stability, corrosion resistance, abundance, and non-toxic nature.3–8 But anatase and rutile TiO2 have band gaps of 3.2 eV and 3.0 eV9 respectively, limiting its photoactivity to the UV-range which is around 5% of the total irradiated solar energy. To narrow the band gap and to enhance the efficiency, numerous methods have been explored, such as the introduction of metals and non-metals as co-catalysts or dopants,10–13 creating heterostructures,14–16 and Z-scheme system construction.17,18 In spite of these experimental and theoretical investigations, including large-scale synthesis of Z-scheme systems and heterostructures,19 several challenges remain in TiO2-based photocatalysis, such as the degradation of PEC efficiency due to the presence of point defects, dopants, and surface additives,20 and heavy electron effective mass due to localized d-orbitals.21
Halide perovskites (HaPs) have been extensively studied for their high photovoltaic (PV) efficiency22,23 and exciting optoelectronic applications.24–26 Perovskites prove to be promising materials for photocatalytic water splitting because of their high solar absorption coefficient,22,23 long electron and hole diffusion lengths,27,28 long charge carrier lifetimes28 and easily tunable band gaps for efficient absorption in the visible range of the solar spectrum.28,29 Liu et al. reported a hydrogen evolution rate of 242.5 μmol g−1 h−1 by splitting H2O using CsPbI3 combined with graphitic carbon nitride (g-C3N4).30 Fehr et al. reported a peak STH efficiency of 20.8% for integrated halide perovskite PEC cells using Cs0.05FA0.85MA0.10Pb(I0.95Br0.05)3 and FA0.97MA0.03PbI3 as the photocathode and photoanode respectively.31 Karuturi et al.32 reported an STH efficiency of over 17% for perovskite/Si dual-absorber tandem cells where a Si photocathode was paired with Cs0.10Rb0.05FA0.75MA0.15PbI1.8Br1.2 in tandem. Wang et al. implemented a data-driven approach to estimate the photocatalytic performance of lead-free A3B2X9 perovskites and reported an STH efficiency of ∼17% for
compounds.33 Thus, it can be well understood that composition engineering at cation or anion sites is an effective way to tune the band gap and enhance the photocatalytic efficiency of HaPs.
Despite these efforts, there are limitations in the detailed understanding of the effects of alloying on the photocatalytic performance of ABX3 halide perovskites. The chemical space of ABX3 perovskites comprises millions of possible alloying combinations at A, B, and X sites that would take decades to be screened experimentally. High-throughput DFT (HT-DFT) is one of the most effective ways to explore such combinatorial chemical spaces. HT-DFT combined with state-of-the-art ML models can be used for accelerated screening and discovery of novel stable perovskites with suitable band gaps and photocatalytic efficiencies. This kind of data-driven approach has been used previously by Pilania et al.,34 Jin et al.,35 and Wang et al.33 to screen and identify suitable AA′BB′O6 double perovskite oxides and A3B2X9 halide perovskites for PEC water splitting.
In this work, we utilized our recently published multi-phase, multi-fidelity HaP alloy dataset,36–38 containing 985 individual computations using the GGA-PBE and HSE06 functionals, on pure and alloyed inorganic and hybrid compounds, to screen promising candidates for photocatalytic water splitting. Each perovskite is represented by a 56-dimensional vector, used as the input to train ML predictive models for bulk stability and electronic band gaps and edges. Based on rigorously optimized regularized greedy forest (RGF)39 models for the decomposition energy (ΔH) and band gap (Eg), prediction and screening were performed across a dataset of 150
000+ enumerated perovskite alloy compositions. For photocatalysis, the band gap and the position of band edges are crucial properties for determining the feasibility of HER and OER. Though the semi-local GGA-PBE40 functional used for geometry optimization reproduces the lattice parameters and thermodynamic stability quite well, it severely underestimates the band gap.37,41 Thus our RGF model was trained on a multi-fidelity dataset containing ΔH and Eg from both the GGA-PBE and the hybrid HSE06 functional (HSE),42 such that the model learns the complex relationship between PBE and HSE band gaps for different perovskite systems. As reported in our recent work, learning from the PBE and HSE data together helps improve chemical space generalizability and prediction accuracy at the HSE-level.36
Screening is first performed based on predicted bulk stability and band edges empirically estimated using predicted band gaps and Mulliken electronegativities, following which the ηSTH is calculated to determine the suitability for water splitting. We further examined the relationship between ηSTH and material properties such as the band gap and electronegativity. It is found that alloying at the B-site plays a major role in enhancing the photocatalytic performance. Through this work, we present a list of promising HaP compositions for high-efficiency water splitting, including a few stable Pb-free perovskites to mitigate Pb toxicity issues. It is hoped that the insights and results from this computational screening effort will pave the way for future experimental synthesis of efficient halide perovskite-based photocatalysts. Fig. 1 shows an outline of this work, including perovskite descriptors, ML training, and screening across a massive space of possible compositions.
![]() | ||
| Fig. 1 DFT+ML workflow for multi-fidelity predictions of perovskite properties and screening for suitable photocatalysts. | ||
Further detailed analysis of this dataset can be found in our past publications (Table 1).36–38
| Property | Functional | Test RMSE (eV) | Test MAE (eV) |
|---|---|---|---|
| Decomposition energy | PBE | 0.03 | 0.02 |
| HSE | 0.03 | 0.02 | |
| Band gap | PBE | 0.10 | 0.07 |
| HSE | 0.12 | 0.08 | |
785 unique A-site, B-site, and X-site mixed compositions based on the set of 5 unique A-site cations, 6 B-site cations, and 3 X-site anions shown in Fig. 1. Since each compound could exist in one of four prototype phases, this adds up to 151
140 total compounds. We extracted the 56-dimensional feature vectors for each of these compounds and ultimately fed them into the RGF models for predicting the ΔH and Eg, using averages over the 4000 individual runs as described above, also yielding the prediction uncertainty in terms of the standard deviation.
![]() | (1) |
140 compounds in the enumerated dataset. We employed a hierarchical screening procedure on the enumerated dataset as shown in Fig. 2(b) to identify stable perovskites with suitable band gaps and band edges for water-splitting. To validate the formability of the ABX3 perovskites, we first performed screening based on the well-known tolerance and octahedral factors which consider the ionic radii of the A, B, and X-site species. In addition to the Goldschmidt tolerance and octahedral factors, we also used a new tolerance factor proposed by Bartel et al.57 The three stability factors are defined as follows:
Octahedral factor:
![]() | (2) |
Tolerance factor:
![]() | (3) |
Bartel tolerance factor:57
![]() | (4) |
The accepted upper and lower bounds for the perovskite formability factors are as follows:36–38o ∈ (0.442 − 0.895), t ∈ (0.813 − 1.107), and tB < 4.18; these conditions are satisfied by 67
916 of the 151
140 compounds. To assess the thermodynamic stability, we used a criterion where perovskites with decomposition energy ΔHHSE < 0.1 eV were accepted as likely being stable. This threshold accounts for potential errors in the machine learning (ML) predicted decomposition energies and includes more candidates. This step left us with 59
273 compounds. Next, to ensure that any compound is able to effectively absorb photons within the visible solar spectrum and to meet the threshold for minimum water electrolysis potential, we applied the condition of 1.23 ≤ EHSEg ≤ 3 eV, reducing the number of compounds to 23
201. In Fig. 3, we visualize the ML-predicted EHSEg plotted against ΔHHSE for the formable compounds; the shaded region shows where the 23
201 compounds lie.
![]() | ||
Fig. 3 Visualization of the ML-predicted HSE decomposition energies vs. band gaps for 23 201 compounds with desirable octahedral and tolerance factors. | ||
Next, we must align the electronic band edges of the HaPs with respect to vacuum to determine whether they straddle the redox potentials of water. To do this, we adopted an empirical approach based on the Mulliken electronegativity and (ML-predicted HSE) band gap of the perovskites. The VBM and CBM are calculated as:
![]() | (5) |
![]() | (6) |
It should be noted that the electronegativities of all the A/B/X species used in this work are already tabulated and even used as part of the ML descriptors. This empirical approach has been successfully implemented previously,33,34,58–61 and the estimated band edges have shown good agreement with experimentally measured VBMs and CBMs.62 The band edges should have a straddling alignment to allow the HER and OER at the VBM and CBM respectively. Under the normal hydrogen electrode (NHE) standard, ECBM < 0 and EVBM ≥ 1.23 should be satisfied for the necessary alignment. After this final round of band edge screening, 3043 perovskites were identified as suitable water splitting photocatalysts, which is only about 2% of the total number of enumerated compounds. In the next section, we provide further analysis of the screened compounds and DFT validation of a few selected perovskites.
| ηSTH = ηabsηcu | (7) |
![]() | (8) |
![]() | (9) |
![]() | (10) |
χ(H2) denotes the HER overpotential, i.e., the potential difference between the CBM and the H+/H2 potential, and χ(O2) denotes the OER overpotential which is the potential difference between the VBM and the O2/H2O potential.
Fig. 4 shows a visualization of the ηSTH values (calculated in %) of the 3043 compounds post-screening, in terms of a plot between the Mulliken electronegativity and the HSE band gap. The truncated region represents perovskites with ηSTH > 12% to the left. It can be seen that HaPs with band gaps in the range 1.6 eV ≤ EHSEg ≤ 2.5 eV show high ηSTH, clearly attributed to higher solar absorption in the visible spectrum which elevates ηabs and thus ηSTH.
![]() | ||
| Fig. 4 Dependence of ηSTH on the band gap and electronegativity of the perovskites. The dotted line shows ηSTH > 12%. | ||
In general, ηSTH seems to decrease as the electronegativity increases. Fig. 5(a) further shows a plot between ηSTH and EHSEg, revealing something interesting: among these stable and formable HaPs with suitable band edges, the highest STH efficiencies are shown by purely inorganic compounds, and hybrid organic–inorganic perovskites (HOIPs) where the A-site contains some mix of MA and FA cations show lower efficiencies. This arises from the fact that in this band gap range, Cs-based inorganic perovskites are the most stable and lie on the lower EHSEg range thus showing ηSTH ≈ 24%, whereas MA/FA-based compounds, which are largely far more stable than Cs/Rb/K-based compounds across the dataset,36,37 lie in the larger EHSEg range and thus show ηSTH < 20% for the majority of HOIPs. Decreasing the band gap in HOIPs below 2 eV also shifts the CBM downwards in energy below the H+/H2 redox potential, making it unfavorable for the HER.
![]() | ||
| Fig. 5 (a) ηSTH plotted as a function of band gap for the screened inorganic and hybrid organic–inorganic perovskites. (b) Different kinds of mixing present in the 3043 screened perovskites. | ||
Next, we discuss the general trends observed in tuning perovskite properties via composition engineering. As we go from Cs to Rb to K at the A-site, the cation size decreases, thereby strengthening p–p hybridization and consequently reducing the band gap.64 The band gap decreases monotonically from Cl to Br to I at the X-site due to the decreasing electronegativity (Cl > Br > I).65 It is known that B-site and/or X-site substitution are the most common ways to tune the band gap and band edge positions of HaPs, owing to the fact that the CBM and VBM majorly comprise the B-site s, p or d-orbitals and X-site p-orbitals, respectively.65–67
Fig. 5(b) shows the distribution of different types of mixing present in the 3043 screened perovskite list. These compounds predominantly involve B-site mixing (85%) in both inorganic HaPs and HOIPs, followed by scarce traces of X-site mixing (9%) and A-site mixing (6%), which corroborates the general trends as discussed. Fig. 6(a) shows that Cs is the A-site cation in a majority of the compounds followed by MA and FA, with only ∼2% of the compounds containing Rb or K. The scarcity of A-site mixing in the screened list signifies that the stable perovskites tend to preserve pure compositions at the A-site. The lack of pure K-based or Rb-based perovskites can be attributed to their inherent instability and tendency to decompose.68,69 Thus, K and Rb are only found as constituents in A-site mixed perovskites.
Fig. 6(b) further shows the prevalence of different mixing fractions of the B-site cations, revealing that mixing of several cations at once (thus forming high-entropy perovskite alloys) is indeed quite favorable, and each of the 6 cations is more likely to appear in smaller mixing fractions than larger quantities. At the X-site (Fig. 6(c)), about three-quarters of the compounds are iodides with the remaining compounds being nearly equally divided between bromides and chlorides. Interestingly, all the X-site mixed perovskites had pure Cs and Ge at the A and B sites respectively in different phases. No chlorides were identified in combination with FA or MA. The incorporation of Cl in HOIPs either resulted in band gaps exceeding 3 eV or led to higher instability.
Since Pb-free perovskites are much sought after for mitigating Pb-toxicity issues, we performed a visualization of Pb-free vs. Pb-containing compounds in Fig. 6(d). We find that 1173 compounds out of 3043 do not contain any Pb at the B-site, constituting about 39% of the space, highlighting a significant exploration into alternative, environment friendly materials for water splitting. Fig. 6(e) shows that the HOIP space in the screened list comprises a majority of MA-I (879) and FA-I (667) compounds, and only four FA–Br compounds, whereas all the purely inorganic compounds are mostly Cs-based bromides (468) and chlorides (512) followed by Rb–Cl (48) compounds.
We find that the most suitable HOIPs with high ηSTH are substitutional alloys of FAPbI3, FASnI3, and MAPbI3 with alkaline earth metals Ca, Sr, or Ba at the B-site. The lower work function of the alkaline earth metals shifts the CBM which leads to band gap widening.70 The most promising inorganic compounds are primarily alloys of CsGeBr3 and CsGeCl3 followed by alloys of CsPbBr3 and CsSnBr3. Similar to their hybrid counterparts, substitution with alkaline earth metals in inorganic perovskites widens the band gap and tunes the band alignment to be suitable for photocatalysis. The best STH efficiencies reported in the literature for perovskites lie in the ∼20% range;31,32 the best candidates identified here from the DFT-ML screening approach show efficiencies exceeding 24%, which represents a significant potential improvement in photocatalytic water splitting efficiency.
Table 2 summarizes the DFT computed decomposition energies and band gaps and compares them against the ML predictions for the five selected compounds, in cubic and non-cubic phases (selected based on the ML-predicted lowest energy phase). ML predictions for ΔHHSE and EHSEg are both in good agreement with the DFT values, validating the generalizability and reliability of our surrogate models for novel compositions. The negative (or close to zero) values for ΔHHSE prove the stability of these novel compositions against decomposition into their respective binary AX and BX2 phases. The band edge positions of these five perovskites relative to the redox potential of water, calculated using the HSE band gaps and eqn (8), are plotted in Fig. 8. Our DFT calculations verify the straddling band alignment of the chosen perovskites which is essential to facilitate the HER and OER processes. Direct band gap photocatalysts typically show higher solar absorption efficiency as compared to indirect band gap compounds because the interband transition of electrons from the VBM to the CBM does not require phonon transport.35,71 All five perovskites reported in this work showed a direct band gap, which coupled with their high absorption coefficients bode well for efficient light-harvesting within the visible spectrum and subsequent OER and HER productivity. We also note that three of these compounds are Pb-free perovskites and are thus of particular promise.
| Compound | Phase | DFT calculations | ML predictions | |||||
|---|---|---|---|---|---|---|---|---|
| E HSEg | Gap-type | ΔHHSE | ΔHPBE | E HSEg | ΔHHSE | ΔHPBE | ||
| CsCa0.25Ge0.75Br3 | Cubic | 1.90 | Direct | −0.20 | −0.23 | 2.22 | −0.19 | −0.23 |
| FACa0.375Sn0.625I3 | Cubic (pseudo) | 2.16 | Direct | 0.05 | −0.05 | 2.40 | 0.05 | 0.07 |
| CsCa0.25Ge0.50Pb0.25Br3 | Cubic | 1.86 | Direct | −0.24 | −0.19 | 2.14 | −0.19 | −0.22 |
| CsCa0.25Ge0.25Pb0.50Br3 | Tetragonal | 2.12 | Direct | −0.35 | −0.22 | 2.12 | −0.25 | −0.21 |
| CsGe0.875Sr0.125Br3 | Orthorhombic | 2.33 | Direct | −0.30 | −0.29 | 2.17 | −0.25 | −0.26 |
![]() | ||
| Fig. 8 Relative positions of band edges for 5 selected compounds, estimated empirically from HSE-computed band gaps. | ||
Another important aspect of efficient photocatalysis is a low electron effective mass
so as to achieve high charge carrier mobility,28,35,71 long carrier lifetime,28,35,71 and efficient electron transfer to facilitate the HER. We calculated
as well as the hole effective mass
by fitting a parabolic function to the dispersion relation at the CBM and VBM:
![]() | (11) |
,
and ηSTH of the five compounds are listed in Table 3, alongside the optimized lattice parameters. The effective masses are primarily determined by the extent of orbital overlap between the B-site and X-site ions.72 The abnormally high
and
of CsCa0.25Ge0.25Pb0.50Br3 in the tetragonal phase can be attributed to the increased disordering and octahedral tilting due to the mixing of three types of cations at the B-site. In general, in the tetragonal and orthorhombic phases, the orbital overlap between B and X ions is reduced as compared to the cubic phase, which in turn increases
and
. The increased disorder in CsCa0.25Ge0.25Pb0.50Br3 due to triple mixing at the B-site distorts the linearity of the B–X–B bonds, reducing the orbital overlap and increasing
and
. For the remaining compounds, our computed effective masses are in good general agreement with previously reported values for cubic HaPs.64,72,73 Among the DFT-validated perovskites, CsCa0.25Ge0.75Br3 and CsCa0.25Ge0.25Pb0.50Br3 show the highest ηSTH > 24%, which is substantially higher than the previously experimentally observed ηSTH = 20.8%31 for the Cs–FA–MA–Pb–I HOIP.
| Compound | Phase | a (Å) | b (Å) | c (Å) | α (°) | β (°) | γ (°) | η STH (%) | ||
|---|---|---|---|---|---|---|---|---|---|---|
| CsCa0.25Ge0.75Br3 | Cubic | 11.32 | 11.32 | 11.32 | 90.00 | 90.00 | 90.00 | 0.209 | 0.528 | 24.18 |
| FACa0.375Sn0.625I3 | Cubic (pseudo) | 12.86 | 12.76 | 12.87 | 87.41 | 95.15 | 90.61 | 0.335 | 0.439 | 17.82 |
| CsCa0.25Ge0.50Pb0.25Br3 | Cubic | 11.51 | 11.53 | 11.51 | 90.00 | 90.00 | 90.00 | 0.223 | 0.492 | 24.18 |
| CsCa0.25Ge0.25Pb0.50Br3 | Tetragonal | 16.37 | 16.38 | 11.81 | 90.00 | 90.00 | 90.04 | 1.815 | 0.971 | 20.31 |
| CsGe0.875Sr0.125Br3 | Orthorhombic | 16.46 | 16.19 | 11.59 | 90.07 | 91.43 | 89.72 | 0.247 | 0.316 | 16.14 |
000+ materials and discovered novel compounds for photocatalytic water splitting. This work is built upon a previously published high-throughput multi-fidelity halide perovskite DFT dataset and regularized greedy forest regression models trained on the data. We investigated the generalizability of our DFT-ML surrogate models and successfully validated the best predictions with DFT calculations. This work provides an analysis of the effects of alloying at the A/B/X sites on the thermodynamic landscape and optoelectronic properties of ABX3 halide perovskites. For identifying suitable perovskites for water-splitting, we employed a hierarchical down-screening approach that filters out compositions based on their tolerance factors, decomposition energy, HSE band gaps, and empirically estimated electronic band edges. Through this approach, we identified 3043 promising materials, most of which are FA-based iodides or Cs-based bromides and contain multiple group II or group IV divalent cations mixed at the B-site.
We find that B-site alloying is the most ideal way to tune perovskite band gaps. Combined with low electron and hole effective masses and a high optical absorption coefficient (>105 cm−1), these compounds show great promise as efficient photocatalysts. Among the screened perovskites, our DFT computations revealed CsCa0.25Ge0.75Br3 and CsCa0.25Ge0.25Pb0.50Br3 to have a solar-to-hydrogen efficiency >24%, which is notably higher than the previously reported ηSTH for perovskites both experimentally31,32 and computationally.33 The ML-predicted decomposition energies, band gaps and edges, and efficiencies are all made available. Our results also help identify several Pb-free perovskites that may be suitable for water splitting. We hope that this ML-accelerated hierarchical down-screening approach will inspire experimental efforts for validation in the near future. Our predictions and surrogate models are poised to enhance the exploration of this massive perovskite alloy space, enabling more informed and strategic research on perovskite based photocatalysts. As part of future work, the DFT dataset will be extended to more perovskite compositions and alternative ML algorithms will be explored for further improvement.
140 perovskites and the band edges and ηSTH derived from the band gaps of all the 3043 screened perovskites can be found on Github: https://github.com/maitreyo18/Multi-fidelity-screening-of-perovskite-photocatalysts
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cp02330g |
| This journal is © the Owner Societies 2024 |