Carmen
Martínez-Alonso
abc,
Valentin
Vassilev-Galindo
a,
Benjamin M.
Comer
c,
Frank
Abild-Pedersen
c,
Kirsten T.
Winther
c and
Javier
LLorca
*ad
aIMDEA Materials Institute, C/Eric Kandel 2, 28906 – Getafe, Madrid, Spain. E-mail: javier.llorca@upm.es; javier.llorca@imdea.org
bDepartment of Inorganic Chemistry, Complutense University of Madrid, 28040 Madrid, Spain
cSUNCAT Center for Interface Science and Catalysis, Department of Chemical Engineering, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, Stanford, California 94305, USA
dDepartment of Materials Science, Polytechnic University of Madrid, E. T. S. de Ingenieros de Caminos, 28040 Madrid, Spain
First published on 4th June 2024
The adsorption energies for hydrogen, oxygen, and hydroxyl were calculated by means of density functional theory on the lowest energy surface of 24 pure metals and 332 binary intermetallic compounds with stoichiometries AB, A2B, and A3B taking into account the effect of biaxial elastic strains. This information was used to train two random forest regression models, one for the hydrogen adsorption and another for the oxygen and hydroxyl adsorption, based on 9 descriptors that characterized the geometrical and chemical features of the adsorption site as well as the applied strain. All the descriptors for each compound in the models could be obtained from physico-chemical databases. The random forest models were used to predict the adsorption energy for hydrogen, oxygen, and hydroxyl of ≈2700 binary intermetallic compounds with stoichiometries AB, A2B, and A3B made of metallic elements, excluding those that were environmentally hazardous, radioactive, or toxic. This information was used to search for potential good catalysts for the HER and ORR from the criteria that their adsorption energy for H and O/OH, respectively, should be close to that of Pt. This investigation shows that the suitably trained machine learning models can predict adsorption energies with an accuracy not far away from density functional theory calculations with minimum computational cost from descriptors that are readily available in physico-chemical databases for any compound. Moreover, the strategy presented in this paper can be easily extended to other compounds and catalytic reactions, and is expected to foster the use of ML methods in catalysis.
Obviously, the search for efficient and affordable catalysts for the HER and the ORR is a critical priority to ensure the success of the hydrogen economy, and the research effort has been very large (see, for instance, ref. 11). The final goal is to modify the catalyst's electronic structure to achieve an optimum performance and this can be achieved by adding other elements to form an alloy or compound,12–19 introducing defects in the crystal lattice or changing the surface facets20,21 and also through the application of elastic strains.22–25 The combination of all these variables gives rise, however, to endless possibilities that cannot be explored experimentally without computational tools to guide research towards more promising materials and structures.
The extraordinary progress of ab initio calculation methods based on density functional theory (DFT) to calculate the electronic structure has opened the door for the virtual design of catalysts from computer simulations.26 There are examples in the scientific literature in which the entire kinetics of a catalytic reaction (including the determination of the activation states as well as the activation and reaction energies and the corresponding entropic contributions) have been determined from DFT calculations and Monte Carlo simulations,27,28 but it is necessary to recognize that the evaluation by first principles of the kinetics of all the reactions in a catalytic process is –in most cases– an impossible task. Therefore, the discovery of new catalysts through simulations is usually carried out based on the rational definition of different descriptors related to various stages of the catalytic process that allow determining the stages that control the kinetics of said reaction. The identification of these descriptors can be simplified because the activation energies of the elemental reactions on the catalyst surface are proportional to the adsorption energies according to the Brønsted–Evans–Polanyi principle.29 In this regard, the accuracy associated with DFT in the calculation of adsorption energies is in the range of 0.1 eV30 and this strategy has been used in the HER and the ORR reactions with good agreement between theoretical and experimental results.26,31 Moreover, DFT calculations have been used recently by us to analyze the effect of elastic strains on the catalytic activity of transition metals for the HER and ORR, providing good agreement with the experimental evidence available in the literature.24,32,33
However, the computational cost of DFT-scales as O(n3), with n being the number of electrons in the system34 – prevents a detailed exploration of the chemical compound space spanned by all possible intermetallic materials.35 This limitation has encouraged the use of strategies based on artificial intelligence, namely machine learning (ML), in the prediction of new materials in different fields, including catalysis.36–40 Nevertheless, the application of ML tools to assess catalytic processes involving hydrogen is scarcely explored41,42 due to three main limitations: the first one is the lack of large and robust databases that can be used to train ML algorithms that can deal with different geometries, stoichiometries, and surfaces.43,44 The second limitation is the lack of accuracy of the ML model predictions, which provide mean absolute errors (MAEs) equal to or higher than 0.16 eV for H,45 and 0.23 eV for O and OH adsorption energies,46 too large to predict the catalytic activity. Finally, information about the influence of elastic strains on the catalytic activity is missing.
The objective of this investigation is to overcome these limitations and develop an accurate ML model to describe the adsorption processes for the HER and the ORR in intermetallic compounds. To this end, a large database of the adsorption energies of H, O, and OH was computed by DFT in 24 pure metals and 332 binary intermetallic compounds with different lattice structures, stoichiometries, and surfaces, including the effect of biaxial elastic strains. This information was used to train a random forest model to predict the adsorption energy of ≈2700 binary intermetallic compounds including the effect of elastic strains using descriptors that can be obtained for each compound and surface from physico-chemical databases. A short list of candidates presented similar adsorption energies to Pt, indicating their potential to replace Pt-group metals as catalysts in these reactions. Overall, this investigation demonstrates a strategy based on the synergistic application of high-throughput DFT calculations and artificial intelligence tools to discover new catalysts for the HER and ORR that can be easily employed for other catalytic processes.
![]() | ||
Fig. 1 Flowchart for the database generation: strategy to select the binary intermetallic compounds for the database of adsorption energies. |
For each stoichiometry, only the most stable lattice and the lowest energy surface were considered for the calculations, namely 117 AB bcc(101), 9 AB fcc(101), 24 A2B hcp(0001), 118 A3B fcc(111), and 20 A3B hcp(0001) compounds were found, leading to a total of 332 compounds. From those, we randomly selected 106 compounds to compute by DFT the adsorption energy of H, O, and OH with and without the application of biaxial strains. Adsorption is a very localized process since the adsorption energy depends on the local atomic environment of the adsorption site. Therefore, we explored all possible adsorption sites for both the pure metals and binary intermetallic compounds (see Fig. 2). The main adsorption sites for pure metals (Fig. 2a–e) are FCC, HCP, ONTOP, and BRIDGE. At FCC and HCP positions, the adsorbate is placed on a hole between three atoms. At FCC, there is no atom in the second layer below, while at HCP, a metal atom is present right under the adsorbate. ONTOP refers to the adsorption above an atom of the first layer, and BRIDGE describes the adsorption at the bond between two atoms. For bcc(100) surfaces we must discriminate between SHORTBRIDGE and LONGBRIDGE, as there are two different bond distances on the surface. HOLLOW refers to the adsorption site in the hole between four atoms. For most pure metals, the most favorable adsorption sites (lowest adsorption energies) were FCC and HCP. In contrast, BRIDGE and ONTOP positions were higher in energy and, in some cases, they were unstable (i.e., the adsorbate moved to another position during the geometry optimization). Those cases where the adsorbate moved were not included in the database since they corresponded to repeated entries. For the binary intermetallic compounds (Fig. 2f–j), the number of possible adsorption sites was bigger, as we had to discriminate between A and B atoms on the surface. Here, we studied different FCC and HCP possibilities depending on the three atoms around the hole –AAA, AAB, or ABB–. Similarly, the ONTOP positions could occur above an atom A or an atom B, and BRIDGE sites between two atoms AA, BB, or AB. In AB bcc(101) samples, an extra position appeared: THREEFOLD. It stands for a hole between three atoms where the atom in the second layer below is at an intermediate position between FCC and HCP. The most favorable binding site for binary intermetallics was highly dependent on the affinity of the adsorbate towards A or B.
The adsorption energy varies significantly with the adsorption site. For instance, we found that the adsorption energy can change in more than 1 eV between a hole between three atoms (FCC, HCP, THREEFOLD) to a bond between two atoms (BRIDGE, LONGBRIDGE, SHORTBRDIGE) or ONTOP a single atom. Additionally, the specific atoms around the adsorption site are also very relevant. For instance, in the adsorption process on a THREEFOLD position in a binary compound AnB, the three atoms around the adsorbate can be AAA, AAB, or ABB and the adsorption energy can change in more than 0.5 eV by changing one single atom.
The results of the DFT calculations are included in two different databases: one with EHads and the other with EOads and EOHads. Given that EOads and EOHads are linearly correlated49 and that OH binds to the surface by the O atom, it was expected that one single model could be able to predict the adsorption energy of both adsorbates. A total of 130 different pure metals and binary intermetallic compounds were included: 24 pure metals, 58 AB bcc(101), 8 AB fcc(101), 6 A2B hcp(0001), 26 A3B fcc(111), and 8 A3B hcp(0001). The total number of entries in the two databases was 942 and 1699, respectively, considering the different adsorption sites and strain states. The distribution and the number of entries of the database are shown in Fig. 3. A tabulated version can be found in Table S2 of the ESI.† The inputs and the outputs of the DFT calculations for the adsorption energies can be found in Catalisys-Hub.org43via the link https://www.catalysishub.org/publications/AlonsoStrain2023.
Following these previous references, 9 descriptors from 3 different types were chosen as features in our ML model. Two of them are characteristic of each adsorption site on the surface of the intermetallic compound and describe the geometry configuration of the adsorption site and the chemical environment around the site. The third descriptor is extrinsic and indicates the biaxial elastic strain applied to the surface. The nine geometrical, electronic, and strain descriptors are depicted in Table 1. One important condition for the intrinsic descriptors is that they can be calculated for any compound, surface, and adsorption site from data available in structural and electronic databases. In fact, the values for the atomic radii, the electronegativity, and the first ionization energy were taken from the Mendeleev Python package,55 whereas the unit cell volume was taken from the Materials Project database.47 Although additional descriptors (such as the d-band center and the d-band width) can provide a better description of the local chemical environment,56–58 they require expensive DFT calculations that would hinder an efficient exploration of thousands of intermetallic compounds. Finally, an extra descriptor was needed to describe the prediction of either O or OH in the database for the adsorption energy of O and OH. It was a categorical and binary descriptor with a value of 0 for O and 1 for OH adsorption. The reason for employing a Boolean descriptor over more elaborate physical or categorical features to encode the information of the adsorbate is simple. The adsorption of OH always happens with the O atom close to the surface and the H atom above it, while O is a single atom interacting with the surface. Thus, there are no other relevant degrees of freedom that characterize adsorbate-slab interactions apart from the position of the adsorbates above the surface (already included in the information of other descriptors that will be described later, such as the generalized coordination number and Ψ). A Boolean descriptor (0/1), which in this scenario is formally equivalent to a categorical feature with values O and OH, serves to encode the only missing information: the identity of the adsorbate. Before feeding the descriptors to train the RF models, all non-categorical descriptors were scaled to a [0,1] range with a MinMax scaler.
Geometric descriptors | Unit cell volume |
Weighted atomic radius (WAR) | |
Generalized coordination number (GCN) | |
Electronic descriptors | Weighted electronegativity (WEN) |
Weighted first ionization energy (WIE) | |
Outer electrons of A (SA) | |
Outer electrons of B (SB) | |
Ψ | |
Strain descriptor | Biaxial strain |
![]() | (1) |
The generalized coordination number (GCN) is the feature employed to describe the adsorption plane as well as the binding site where the adsorption takes place. Each adsorption site at a specific surface plane has a unique GCN, which allows our model to discriminate between geometries and facets. It can be computed as:59
![]() | (2) |
Other geometrical features, such as the lattice parameters, the cell size, and the area of the adsorption hole, were explored but they provided information equivalent to that of the unit cell volume. The molecular mass was also included but it did not correlate with the adsorption phenomena.
![]() | (3) |
The weighted first ionization energy (WIE) can similarly be expressed as:
![]() | (4) |
The outer electrons of A and B range from 1 to 12 and reflect the electronic properties of the valence layer for element A and element B, respectively. These are the electrons that interact with the adsorbate and, therefore, influence the adsorption energy.
The last descriptor of this set is Ψ that was proposed by Gao et al.60 according to
![]() | (5) |
The magnetic moment of the intermetallic compounds was also explored as an additional electronic feature but it had very low importance in the description of the adsorption process and the errors of the model experimented no variation when this feature was included. Thus, it was not included among the descriptors.
The Pearson correlation indexes between all the features (in the complete H and O/OH database) in Table 1 were calculated to make sure that they provide unique information to the model and avoid features that were strongly correlated. They are presented in Fig. 4. The results are similar for both the hydrogen and the oxygen and hydroxyl descriptors: the most correlated features are Ψ and the outer electrons of A (SA), which is coherent with the description of Ψ. As shown in eqn (6), SA is included in the numerator, which explains a positive value of the correlation. Additionally, the WIE is relatively correlated with SA and Ψ due to the definition of ionization energy (energy that an isolated, gaseous atom in the ground electronic state must absorb to discharge an electron). Conversely, WAR and WEN are correlated with a negative index of −0.6, which is in accordance with the basis of chemistry: the larger the atomic radius of an element, the lower the tendency to attract electrons to itself. Finally, the categorical feature adsorbate in the model for oxygen and hydroxyl (b) is not related to any other descriptor because it does not represent any chemical property of the binary intermetallic compounds. All in all, we can conclude the final set of 9 descriptors were not correlated with each other.
![]() | ||
Fig. 4 Pearson correlation matrixes of the features: a) hydrogen adsorption and b) oxygen and hydroxyl adsorption. |
The adsorption energies in the dataset range from −2 to 2 eV in the case of hydrogen and from −8 to 7 eV in the case of oxygen and hydroxyl (Fig. 5). Overall, the accuracy of the predictions of the RF is excellent according to various statistical parameters, such as R2, MAE, RMSE, and MAD of the testing set, that are indicated in Fig. 5a) and b). The MAE is 0.07 eV for hydrogen, and MAE = 0.18 eV for oxygen and hydroxyl. Considering that the error of the DFT calculations in the adsorption energy is assumed to be 0.1 eV,30 these models are able to predict the adsorption energy with a similar degree of accuracy. It should be noted that the MAE in the model for O/OH was higher than for the H model due to the wider range of its predictions (−8 to 7 eV vs. −2 to 2 eV). For approximately the same amount of data in the training, the O/OH model is required to predict a larger scope of energies, thus, the amount of entries at each energy is lower, leading to higher MAE. Hence, more entries in the oxygen and hydroxyl database should be included to achieve a similar performance as that obtained from hydrogen adsorption.
Moreover, the standard deviations from the mean of the testing data are plotted in Fig. 5c) and d) for the sake of clarity. These error bars have been calculated using the Jackknife or “leave one out” procedure. This is an iterative cross-validation technique that gives an estimate of the sampling variance of the random forest. Firstly, each point is estimated from all the data available in the dataset. Then it is recalculated, dropping out one by one other entries in the training. The difference between the whole sample estimate and the partial estimate is calculated and its standard deviation is used to estimate the error and the confidence intervals. In other words, the error bars are a measurement of how much the RF predictions would change if we trained it on a different training set. The results in Fig. 5c) and d) give a reference of two uncertainty quantifications that are crucial in machine learning: calibration, and dispersion. Calibration (or honesty) compares the difference among error bars. A “well-calibrated” model has entries whose standard deviations are close to the model's predicted standard deviations. This means that for those points that are further away from the diagonal in the parity plots, the confidence intervals should be broader. In contrast, the error bars are very small for those entries in which the DFT calculated and the ML predicted adsorption energies are very similar. Thus, it can be concluded from this data that the calibration of our model is correct. Secondly, the models present reasonable error bars along all the range of energies (limited dispersion), especially in the middle, because the datasets are well distributed. In the case of the hydrogen model, it can be observed that the error bars are significantly larger at larger energies (between 0.5 and 1 eV) because of the lower number of data in this range. All in all, our model has consistent uncertainty metrics.
As expected, the surfaces and geometries with more entries in the dataset –fcc(111) and bcc(101)– are the ones that are more accurately described by our models. Among the total data for intermetallics, A3B fcc(111) represents the 32.5% and AB bcc(101) the 56.2%. Yet AB fcc(101), A2B hcp(0001), and A3B hcp(0001), only represent the 1.4%, 1.9%, and 8.0%, respectively. The entries in the test set with the highest errors were A2B hcp(0001), and A3B hcp(0001).
![]() | ||
Fig. 6 Features importances for the RF models: prediction of a) hydrogen and b) oxygen and hydroxyl. |
The features with the highest weight for the hydrogen adsorption are GCN and PSI (Fig. 6a). These descriptors contain information about the local geometrical and chemical environment of the adsorption site and this result demonstrates that the adsorption energy of the small hydrogen atoms onto a surface is highly dependent on the specific adsorption site. Regarding the oxygen and hydroxyl adsorption (Fig. 6b), the most relevant descriptor is the type of adsorbate (either O or OH). The second most important feature is the WEN. This was expected because the adsorption of oxygen will be disfavored on a surface with high WEN because oxygen is very electronegative, whereas electropositive surfaces will favor the adsorption. The influence of GCN and PSI is smaller –compared to the hydrogen adsorption– because, larger adsorbates such as O and OH are not as dependent on the local geometric and chemical features of the binding site as H. Regarding the outer electrons, it can be observed that in both cases SA has a higher weight than SB. This reflects that in most cases the proportion of A is larger than B (A3B, A2B), and consequently, the feature describing A is somehow more important than the feature for B. Finally, WAR, WIE, and the volume of the cell have a smaller influence on the adsorption energy, with the exception of WIE in the case of hydrogen, which has a significant weight. As the H atom only has one proton and one electron, the quantity of energy that is absorbed to discharge an electron, which corresponds to the definition of IE, is more relevant.
The importance of the strain descriptor is the smallest in both models. It indicates that biaxial strains in the range applied have a limited effect on the adsorption energy as compared with other features, such as GCN, PSI, or WEN. This is expected since the variation in adsorption energy is larger between different chemical compositions compared to the variation due to strain within one given composition. However, these changes in the adsorption energies (either positive or negative) of 0.1–0.2 eV in the case of hydrogen and of 0.4–0.8 eV in the case of oxygen and hydroxyl61 can be introduced to any intermetallic surface. Thus, the capability of the RF models to account for elastic strains can be very useful to study their effect on the catalytic behavior of the particular compounds.25 It indicates that biaxial strains in the range applied have a limited effect on the adsorption energy as compared with other features, such as GCN, PSI, or WEN. This is expected since the variation in adsorption energy is larger between different chemical compositions compared to the variation due to strain within one given composition. However, these changes in the adsorption energies (either positive or negative) of 0.1–0.2 eV in the case of hydrogen and of 0.4–0.8 eV in the case of oxygen and hydroxyl61 can be introduced to any intermetallic surface. Thus, the capability of the RF models to account for elastic strains can be very useful to study their effect on the catalytic behavior of the particular compounds.25
System | Adsorbate | Strain | Binding site | E ads (ML) | E ads (DFT) | Error |
---|---|---|---|---|---|---|
Ce3Sn | H | 0 | fccAAA | −0.47 | −0.71 | 0.24 |
Ce3Sn | H | 0 | hcpAAA | −0.50 | −0.71 | 0.21 |
Ce3Sn | OH | 0 | fccAAA | −2.09 | −2.05 | −0.04 |
Ir3Sc | OH | −1 | fccAAA | 1.19 | 1.42 | 0.23 |
Nd3In | OH | 0 | fccAAB | −2.00 | −2.13 | 0.14 |
Ni3Pt | H | −3 | fccAAB | −0.41 | −0.44 | −0.04 |
Ni3Pt | O | −5 | fccAAB | −1.94 | −2.05 | −0.11 |
Pd3Ce | H | 1 | fccAAA | −0.44 | −0.54 | −0.10 |
Pd3Sc | H | −5 | fccAAA | −0.47 | −0.34 | 0.13 |
Pd3Sc | H | −5 | hcpAAA | −0.51 | −0.70 | 0.20 |
Pd3Sc | O | −5 | hcpAAA | −1.68 | −1.58 | −0.10 |
Pd3Sc | OH | −5 | fccAAA | 1.00 | 1.08 | −0.08 |
Pt3Dy | H | 0 | hcpAAA | −0.68 | −0.49 | 0.19 |
Pt3Dy | OH | 0 | hcpAAA | 1.01 | 1.23 | −0.22 |
Pt3Mn | OH | 1 | ontopA | 0.99 | 0.74 | 0.25 |
Pt3Mn | OH | 1 | hcpAAA | 1.17 | 1.15 | 0.02 |
Pt3Sn | O | 5 | fccAAA | −1.64 | −1.39 | −0.25 |
Pt3Y | H | 1 | hcpAAA | −0.50 | −0.70 | −0.20 |
Ir3Sc | O | −3 | ontopB | −1.94 | −0.79 | 1.15 |
Nd3In | H | 0 | hcpAAA | −0.46 | −0.83 | 0.38 |
Ni3Ga | O | −3 | ontopB | −1.60 | 0.08 | −1.68 |
Pt3Mn | O | 1 | hcpAAA | −1.69 | −1.43 | −0.26 |
Pt3Mn | OH | 1 | ontopB | 1.09 | 0.59 | 0.50 |
Pr3In | H | 3 | hcpAAA | −0.46 | −0.79 | 0.34 |
Pr3In | H | 3 | fccAAA | −0.46 | −0.78 | 0.32 |
Pr3In | OH | 3 | fccAAB | −2.07 | −2.07 | 0.00 |
Pr3In | OH | 3 | fccAAA | −1.97 | −2.30 | 0.32 |
SmRh | H | 0 | longbridgeA | −0.46 | 0.02 | 0.48 |
Sn3Pr | H | 5 | fccAAB | −0.45 | 0.32 | −0.77 |
Sn3Pr | O | 5 | ontopB | −1.60 | −2.41 | 0.81 |
In contrast, the second part of the table shows candidates who were expected to give inaccurate predictions and, thus, highlights the limitations of the RF models. These candidates contain Al, Dy, Sc, Nd, In, Ga, Mn, Pr, Sm, and Sn, which were not included in the database of 24 pure elements for which the adsorption energy of H, O, and OH was computed. The RF models cannot predict accurately the adsorption energies in fccAAA, hcpAAA, longbridgeA, or ontopA locations if the adsorption energy of pure element A is not included in the dataset because the chemical environment in these sites (which is determined by element A) has not been included enough in the training set. The differences between the adsorption energies calculated by RF and DFT were much higher in these sites and reached >1 eV in some cases, although the differences were below 0.34 eV for Pr3In, even though pure Pr and In were not included in the database. The parity plot representing the 30 verification candidates is shown in the ESI† Fig. S4.
Material | Adsorption site | E Hads (eV) | Material | Adsorption site | E Hads (eV) |
---|---|---|---|---|---|
Pt | FCC | −0.49 | Pt | FCC | −0.49 |
Ce3Ga | hcpAAA | −0.45 | Pd3Nd | hcpAAA | −0.53 |
fccAAA | −0.45 | Pd3Sm | hcpAAA | −0.52 | |
Ce3In | hcpAAA | −0.46 | fccAAA | −0.52 | |
fccAAA | −0.45 | fccAAB | −0.51 | ||
Ce3Sn | hcpAAA | −0.50 | Pd3Y | fccAAA | −0.52 |
fccAAB | −0.48 | Pr3In | hcpAAA | −0.45 | |
Nd3In | hcpAAA | −0.46 | Pt3Co | hcpAAA | −0.49 |
Ni3Fe | fccAAB | −0.53 | fccAAB | −0.49 | |
hcpAAB | −0.49 | fccAAA | −0.49 | ||
Ni3Pt | fccAAA | −0.48 | Pt3Dy | hcpAAA | −0.49 |
hcpAAA | −0.48 | fccAAA | −0.47 | ||
fccAAB | −0.47 | Pt3Sc | hcpAAA | −0.53 | |
Pd3Ce | hcpAAA | −0.53 | fccAAA | −0.51 | |
fccAAA | −0.53 | fccAAB | −0.51 | ||
fccAAB | −0.50 | Pt3Sn | fccAAA | −0.53 | |
Pd3Dy | hcpAAA | −0.47 | Pt3Y | fccAAA | −0.51 |
fccAAA | −0.46 | hcpAAA | −0.48 | ||
Pd3Fe | fccAAA | −0.53 | fccAAB | −0.45 | |
Pd3Pr | fccAAA | −0.53 | Rh3Mo | fccAAA | −0.46 |
hcpAAA | −0.53 | Sm3In | hcpAAA | −0.46 | |
fccAAB | −0.52 | Sn3Nd | fccAAB | −0.45 | |
Pd3Sc | fccAAA | −0.52 | Sn3Sm | fccAAB | −0.49 |
hcpAAA | −0.51 | Zn3Ti | fccAAB | −0.48 | |
fccAAB | −0.48 | hcpAAB | −0.44 |
From the experimental viewpoint, Baibars et al.65 proved that a NiFeOxHy/Ni3Fe interface designed via electropassivation could provide superior catalysis of the HER. Additionally, Cu3Pt is one of the candidates that has been studied experimentally as a potential candidate to replace Pt, although its application has been hindered by dealloying and passivation phenomena.66–69
The application of elastic strains introduces two more candidates with adsorption energies ±0.05 eV that of Pt(111): Pt3Sm and Ir3W. The difference in the adsorption energy of H for both candidates is reduced to 0.01 eV and 0.03 eV in the presence of −3% and −5% compression strains, respectively. Moreover, the candidates whose adsorption energies were close to Pt could get even closer (≈0.03 eV) under the application of biaxial strains. This is the case of Ir3W, Pd3Ce, Pd3Fe, Pd3Nd, Pd3Pr, Pd3Sc, Pd3Sm, Pd3Y, Pt3Sc, Pt3Sm, Pt3Sn, and Zn3Ti that become closer to Pt in the presence of compressive strains, while Ce3Ga, Ce3In, Ce3Sn, Nd3In, Ni3Pt, Pd3Dy, Pr3In, Sm3In, Sn3Nd, and Sn3Sm get closer to Pt under tensile strains. Ni3Fe, Pt3Co, Pt3Dy, Pt3Y, and Rh3Mo are the only candidates whose adsorption energies for hydrogen are closer to Pt at zero strain. The optimum applied strain state and adsorption site for each candidate, and a representation of the variation of EHads with strain are shown in Table S5, and Fig. S5 of the ESI,† respectively. This fact supports the potential of elastic strains to tune the catalytic properties of materials, because changes of a few tenths of eV may lead to important changes in the catalytic activity.24–26
Material | Adsorption site | E Oads (eV) | Adsorption site | E OHads (eV) |
---|---|---|---|---|
Pt | FCC | −1.79 | FCC | 1.19 |
Cu3Pt | fccAAB | −1.82 | ontopA | 1.3 |
hcpAAA | −1.92 | ontopB | 1.38 | |
Ir3Sc | ontopB | −1.95 | fccAAA | 1.21 |
Ni3Pt | hcpAAA | −1.99 | ontopB | 1.37 |
Pd3Dy | fccAAA | −1.88 | fccAAA | 1.19 |
fccAAB | −1.83 | hcpAAA | 1.10 | |
ontopA | 1.37 | |||
Pd3Sc | hcpAAA | −1.73 | fccAAA | 1.00 |
Pd3Sn | fccAAB | −1.94 | fccAAA | 1.30 |
hcpAAA | 1.06 | |||
Pt3Co | fccAAA | −1.61 | fccAAB | 1.37 |
fccAAB | −1.64 | hcpAAA | 1.24 | |
Pt3Dy | fccAAA | −1.76 | fccAAA | 1.14 |
fccAAB | −1.78 | hcpAAA | 1.01 | |
hcpAAA | −1.82 | ontopA | 1.28 | |
ontopB | 1.36 | |||
Pt3Mn | fccAAA | −1.67 | fccAAA | 1.36 |
fccAAB | −1.71 | fccAAB | 1.06 | |
hcpAAA | −1.69 | hcpAAA | 1.22 | |
ontopA | 1.05 | |||
ontopB | 1.15 | |||
Pt3Sc | fccAAB | −1.96 | fccAAA | 1.21 |
Pt3Sn | fccAAB | −1.99 | fccAAA | 1.23 |
Pt3Y | hcpAAA | −1.63 | fccAAA | 1.06 |
ZnAu | threefoldAAB | −1.82 | longbridgeB | 1.36 |
Another suitable candidate, Ag3In fcc(111), appears for the ORR if the effect of elastic strains is considered. In this case, the EOads is within the ranges of energies close to Pt when 5% of tension strain is applied, while EOHads needs a −5% compression to be similar to Pt. Among the rest of the candidates, strain drives the adsorption energy up to 0.15 eV closer to Pt. The optimum applied strain level and the specific adsorption sites for each compound are indicated in Table S6 of the ESI.† Compressive strains bring the EOads closer to Pt in Cu3Pt, Ir3Sc, Ni3Pt, Pd3Dy, Pd3Sn, Pt3Dy, Pt3Sc, and Pt3Sn, while tensile strains have the same result for Ag3In, Pd3Sc, Pt3Co, Pt3Mn, Pt3Y, and ZnAu. Compressive strains are favorable for EOHads in Ag3In, Ir3Sc, Pd3Sc, Pt3Co, and Pt3Sn, whereas tensile strains bring Cu3Pt, Ni3Pt, Pd3Sn, Pt3Dy, Pt3Mn, and Pt3Sn. Only Pd3Dy, Pt3Y, and ZnAu are closer to the EOHads of Pt at zero strains. A representation of the variation of EOads and EOHads for the candidates with strain, can be found in the ESI† Fig. S6 and S7, respectively. These results indicate again the potential of the elastic strains to improve the catalytic performance of good catalysts.
Experimentally, core–shell nanoparticles of Ni3Pt have demonstrated good catalytic performance for the ORR by Godínez-Salomón et al.,70 but they observed a strong segregation of Ni on the topmost layer. Similarly, Pd3Fe, Pd3Y, Pt3Sc, and Pt3Y have been proven to be an efficient catalyst for ORR.71–76 Stamenkovic et al.33 also measured that Pt3Co exhibits a similar catalytic activity to Pt, in the ORR. Although there is still little experimental evidence of the effect of elastic strains on the catalytic activity of the ORR, Escudero-Escribano et al.24 proved experimentally that the catalytic activity of Pt5Dy, Pt5Ce, and Pt5Sm compounds was enhanced by a factor of 3 to 6 over Pt. These studies match our DFT calculations and ML predictions and therefore represent a starting point for the use of binary intermetallic compounds and elastic strain engineering in the catalysis of the HER and the ORR.
The ML models were used to predict the adsorption energy for hydrogen, oxygen, and hydroxyl of ≈2700 binary intermetallic compounds with stoichiometries AB, A2B, and A3B made of metallic elements, excluding those that were environmentally hazardous, radioactive, or toxic. This information was used to search for potential good catalysts for the HER and ORR from the criteria that their adsorption energy for H and O/OH, respectively, should be close to that of Pt, which is the best-known catalyst for both reactions. Including the effect of elastic strains, 27 potential candidates were reported for the HER and 14 for the ORR. The few of them that have been explored experimentally have shown good catalytic performance, validating our methodology.
The results of this investigation provide a large database of adsorption energies of H, O, and OH in pure metals and binary intermetallic compounds (including the effect of elastic strains) that can be very useful for training other data-driven strategies to search for new catalysts. Moreover, it demonstrates that the ML models –suitably trained– can predict adsorption energies with an accuracy not far away from density functional theory calculations with minimum computational cost from descriptors that are readily available in physico-chemical databases for any compound. Finally, the expansion of the strategy presented in this paper to ternary intermetallic compounds, oxides, etc. for other catalytic reactions is straightforward and is expected to foster the use of these methods in catalysis.
The adsorption of H, O, and OH was modeled on four-layer slabs (2 × 2) generated with the atomic simulation environment (ASE)81 from the equilibrium lattice parameters obtained in the Materials Project database.47 The equilibrium lattice parameters were calculated by DFT for a number of random candidates with all the stoichiometries. The differences between the calculated and the database values of the lattice parameters were negligible in all cases and can be found in the ESI† Table S4. Thus, the lattice parameters from the Materials Project database were used to avoid an unnecessary waste of computational time. The periodic slabs were separated by 10 Å of vacuum in the direction perpendicular to the surface and adsorption energies were calculated for each chemical species assuming a coverage of 1/4 monolayers. All metal atoms in the half-top of the slab and all adsorbed H, O, and OH were fully relaxed, while the positions of metal atoms in the half-bottom layers were fixed. The adsorption energy of H, O, and OH was calculated following the methodology in ref. 61. The expression used for the H and the O adsorption was:
![]() | (6) |
Similarly, the adsorption energy of OH was calculated as:
![]() | (7) |
The effect of elastic strains was assessed by imposing biaxial strains on the surface plane. The deformation gradient F applied to the supercell was
![]() | (8) |
Among all, the state-of-the-art MAE of ML model predictions for the adsorption energy is in the range of 0.16–0.3 eV.45,46 In our case, random forest (RF) regressor models were built separately for the datasets of EHads and of EOads/EOHads. RF is a part of a family of methods known as ensemble methods. Ensemble methods combine the predictions of different base estimators. In RF models, the ensemble consists of a collection of decision trees that are trained independently using random subsets of both features and training data. Each decision tree predicts the target variable by learning simple decision rules inferred from the features. In regression tasks, the final prediction of a RF model is the average of the predictions made by each tree in the forest. RF models are generally robust to overfitting, can handle outliers, and provide information about the importance of each feature in the descriptor, which can be useful for feature selection and for elucidating the model's behavior. RFs are known to be versatile to different datasets without requiring an expensive hyperparameter tuning. We chose a RF regressor as the algorithm for this work due to the size of our database (useful with small- –hundred– or medium-sized –thousand– databases), the quick training, and the good accuracy obtained. Under the same conditions, we trained our databases with a Gaussian process regressor algorithm, however, the MAE was 0.2 eV higher on average. We did not consider graph neural networks due to the diversity and size of our databases. In our RF model, the training set was 85% of the data and the remaining 15% conformed the test set. The training set was used to optimize the hyperparameters with a k-fold grid-search cross-validation (K = 5) and they can be found in Table S3 in the ESI.† For that, we constructed a parameters grid and optimize it with the GridSearchCV routine as implemented in the Scikit-learn library.82
The accuracy of the model was analyzed with the coefficient of determination R2, the MAE, the root mean squared error (RMSE), and the median absolute error (MAD). The algorithm selected was ExtraTreesRegressor and was constructed and analyzed using the Scikit-learn library.82
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cy00491d |
This journal is © The Royal Society of Chemistry 2024 |