Moumita
Mukherjee
a,
Sayan
Dutta
a,
Madhusudan
Ghosh
b,
Partha
Basuchowdhuri
*b and
Ayan
Datta
*a
aSchool of Chemical Sciences, Indian Association for the Cultivation of Science, Kolkata, West Bengal, India. E-mail: spad@iacs.res.in
bSchool of Mathematical and Computational Science, Indian Association for the Cultivation of Science, Kolkata, West Bengal, India. E-mail: partha.basuchowdhuri@iacs.res.in
First published on 27th June 2022
Developing a cost-effective and environmentally benign substitute for the energy-intensive Haber–Bosch process for the production of ammonia is a global challenge. The electrocatalytic nitrogen reduction reaction (NRR) under ambient conditions through the six proton–electron process has attracted significant interest. Herein, a series of transition-metal (TM) based single atom catalysts (SAC) embedded on carbon nitride (C6N6) have been chosen to explore the NRR activity. The promising metals have been primarily screened through density functional theory (DFT) by calculating their adsorption energies on C6N6 – energies for dinitrogen binding and the barriers at the rate determining step. Based on these criteria, amongst the 18 metal centers, Ta based C6N6 emerges as a good candidate for the reduction of nitrogen to NH3. On the other hand, for the Machine Learning (ML) regression models, the covalent radius and the d-band center of the TM have been identified as the most correlated descriptors for predicting the adsorption energy of nitrogen on the active metal center. Besides, probabilistic modeling using the soft voting technique in the classification model allows us to predict the most efficient single atom catalyst. Despite the realistic bottleneck of having only a limited number of TMs to choose from, this technique effectively predicts the best catalyst from a modest dataset. With the highest probabilistic score, Ta based C6N6 dominates over the other catalysts in a good agreement with DFT findings. This letter manifests the effectiveness of the soft voting technique in an ensemble-based classification model.
The electrocatalytic nitrogen reduction reaction (NRR) proceeds via activation of dinitrogen followed by breaking of NN towards the production of NH3 under normal atmospheric pressure and temperature. From a thermodynamic viewpoint two competing reactions, viz., the NRR and the HER (hydrogen evolution reaction) might occur at the same potential. Hence, a suitable catalyst must be such that it helps in lowering the binding free energy for N2-adsorption in comparison to H-adsorption and thereby, enhancing the NH3 selectivity. Since the discovery of transition metal (TM) – N2 complex in 1965, many studies have been reported on metal doped two-dimensional (2D) materials as a suitable catalyst for the NRR where the metal center acts as an active catalyst.8–11 Qiao et al. have explored the advantages of single atom catalysts (SACs).12 SACs serve the purpose by maximizing the metal usage efficiency, especially for noble metals. Moreover, the low-coordinated metal active center with high surface free energy dispersed on a large surface support inhibits the possibility of diffusion and agglomeration and, thus, makes it a suitable candidate for the NRR. On the other hand, backdonation of the d-electrons from the active metal center to the antibonding π* orbital of the NN bond facilitates the dissociation of the N2 bond. In order to search for a suitable catalyst towards the NRR, density functional theory (DFT) has been used for studying systematically a series of TMs supported on different suitable substrates for strong interactions. Wang et al. have reported that amongst a series of single metal embedded on graphitic carbon nitride (g-C3N4), tungsten (W) showed the highest catalytic activity towards the NRR.13 In another study by Zhao et al., molybdenum (Mo) and chromium (Cr) were found to be the most suitable metals supported on nitrogen doped graphene towards the production of ammonia.14 Liu and his group studied conversion of N2 to ammonia via several TMs anchored on boron sheets and reported that ruthenium (Ru) showed superior activity.15
In addition to DFT calculations, Machine Learning (ML) has emerged as a potent tool in the search for new catalysts for important processes namely, CO2 reduction,16 N2 reduction,17,18 and O2 reduction reactions.19 Typically, ML algorithms can be modeled in two ways – either that demonstrating a ML model where the input set is experimental data or that available from electronic calculations or utilizing previously established ML models for specific surfaces or metals for training the required catalytic application. The output is a collection of a few descriptors based on which ML can predict the target class at a significantly reduced computational cost. The descriptors may be the intrinsic parameters of the constituent atoms and/or that obtained from electronic or structural calculations. Instead of computing the rigorous, time consuming, expensive computational calculations, one can predict the target properties by screening through a complete dataset using the ML algorithm. Bagherzadeh et al. have compared the decision tree algorithm viz. Random Forest (RF) and Gradient Boosting Machine (GBM), using the neural network algorithm (ANN) to predict the amount of nitrogen present in waste water and showed the accuracy of the algorithm for this prediction.20 In a combined study of ML predictions and DFT verifications, Kim and his group found the most stable catalyst for NRR from a large database (∼3040 surface) based on the elemental properties of the adsorbates.21 For the N2 reduction, Zafari et al. have suggested a deep neural network (DNN) to predict efficient catalysts in B-doped graphene SAC. By employing the most important features as the input in the light gradient boosting machine (LGBM) model they have successfully predicted the adsorption free energies of the adsorbates.22 Ensemble learning models are being used nowadays for its efficient predictions based on the results generated from different prediction models that have been already trained by the available datasets.23
Herein, a series of transition metals (3d, 4d, and 5d) embedded carbon nitride (C6N6) have been used as catalysts for the NRR mechanism. Based on the DFT computations, Mo, Nb, W, Re, Hf and Ta supported on C6N6 are found to be suitable candidates. In the machine learning algorithm, the regression model has been adopted for calculating the adsorption energy of nitrogen, with the aid of different possible descriptors including the structural, electronic and properties associated with the host metal centers. Using statistically significant and simple features, an ensemble-based classification model has been established. It has shown a high correlation with the DFT computations for a realistically modest dataset of 18 TMs. Ta@C6N6 has been found to be an excellent electrocatalyst for the NRR reaction.
Eads = Etotal − Esubstrate − Eadsorbate | (1) |
The Gibbs free energy change for each reaction step considering the computational hydrogen electrode model (CHE), has been calculated using the following expression:
ΔG = ΔE + ΔZPE − TΔS + ΔGpH + eU | (2) |
In this work, the scikit-learn package has been employed for all the ML based calculations.28,29 Here, the machine learning algorithms have been categorized into two models – regression and classification. We have mainly applied ensemble-based techniques, such as gradient boost regression (GBR), and random forest regression (RFR) for the regression problem.29,30 These two methods are discussed in detail in the ESI.† The performance of the regression model has been quantified by the R2 value and root-mean squared error (RMSE) value using the following equations:
(3) |
(4) |
The classification model, on the other hand, calculated a probabilistic score for each data-point present in the full dataset. We have applied the majority voting principle to ensemble the algorithms such as logistic regression, support vector machine and random forest classifier used for the classification problem.30–32 The hyperparameters used here for the regression and classification model are listed in Table S1 (ESI†).
Fig. 1 (a) Possible binding sites of metals on C6N6. (b) Adsorption energy of different metals on C6N6 at position 2. (c) Charge transfer from the active metal-center to C6N6. |
The adsorption of N2 on the TM doped catalyst surface is the initial step of electrocatalytic reduction of nitrogen, which plays an important role in complete pathway. N2 can be adsorbed in two different modes viz, end-on and side-on. The end-on mode is further classified into two pathways, namely, distal and alternating while side-on mode follows the enzymatic pathway.9 The end-on mode starts with the binding of one nitrogen with the TM and forms NN before going through the further steps towards reduction. In the distal mechanism, the hydrogenation occurs consecutively on the distal-N leading to the desorption of first NH3 from the surface followed by re-hydrogenation of the other N attached to the TM. On the other hand, in the alternating pathway, hydrogenation occurs alternatingly over the two N atoms. The side-on mode or enzymatic pathway proceeds through the binding of both the nitrogen with the TM and eventually hydrogenating them alternatively which liberates two NH3 consecutively. We have calculated adsorption energies of N2 in both the modes for 18 different SACs which indicates that the end-on mode is preferable over the side-on mode for binding of N2, as shown in Table S3 (ESI†). The potential determining step for most metal surfaces, as shown by Nørskov et al., would be either the first reduction step (formation of *NNH), or the final protonation step (*NH2 to *NH3).33 Also, Ling et al. proposed three key steps that include Gibbs free energy of N2 adsorption , first and final hydrogenation for finding a suitable electrocatalyst for the NRR.34 A screening through these essential steps can help to find a suitable electrocatalyst for the NRR. The transition metal atoms including Sc, Ti, Cr, Mn, Co, Ni, Cu, Nb, Mo, Tc, Ru, Hf, Ta, W, Re, Ir, Pt and Au doped on C6N6 have been considered in this study and the screening has been carried out through these three above mentioned steps. Fig. 2(a) shows Gibbs free energy changes (ΔG) for N2 adsorption which are found to be negative for Sc, Ti, Co, Ni, Cu, Nb, Mo, Tc, Ru, Hf, Ta, W, Re, Ir, and Pt doped substrates indicating that these TM doped catalyst surfaces are suitable for N2 activation, i.e., N2 to *N2 where * denotes the substrate. In the subsequent step for first hydrogenation, except Nb, Mo, Hf, Ta, W, Re doped C6N6, all other elements are ruled out because of the large endoergocity in the reduction of *N2 to *NNH, as shown in Fig. 2(b). In order to get more insight, the d-band center of each metal on TM@C6N6 has been computed and a relationship constructed with the barrier potential of the *N2 to *NNH step, as shown in Fig. 2(c). The catalytic activity shows a volcano type dependence which suggests that the six metals namely, Mo, Nb, W, Hf, Re, and Ta based SACs locate at the top of the diagram possessing lowest barrier potential and thereby demonstrating good NRR activity. Previous studies also showed a similar trend towards N2 reduction for the metals present in the middle of each period.35,36 Here, the catalysts having a positive d-band center (∼0.00–0.80 eV) around the Fermi level exhibits superiority. This can also be manifested in Fig. S1 (ESI†), where a linear relationship is observed between the d-band center and the binding energy of N2. Though the R2 value is not good, it roughly estimates that the d-band center plays an important role in evaluating the catalytic activity. According to the d-band model as predicted by Norskov et al., with the shifting of the d-band center above the Fermi level, the antibonding states begin to appear. These antibonding states are empty and as the number increases, the bond between the adsorbate and the metal becomes stronger. It can be quantitatively demonstrated from the d-band center values, as shown in Table S4 (ESI†) that Ta has a moderately positive d-band center value indicating moderate binding with N2.37 Therefore the d-band center correlates with the adsorption energy of the intermediate, that connects with the catalytic activity of SACs. Furthermore, NRR selectivity may be hindered by the competing hydrogen evolution reaction (HER) under acidic conditions. To find out the appropriate catalyst for nitrogen reduction, the adsorption energies of *H and *N2 are compared, as shown in Fig. 2(d). The catalyst is considered to be the best catalyst for HER in which tends to its ideal value leading to the desorption of hydrogen easily from the catalyst surface either through the Tafel or the Heyrovsky step, which in turn, leaves the catalyst surface prepared for the successive adsorption of hydrogen for further reaction to proceed.38 Here, the values of , as shown in the Fig. 2(d), vary from −0.2 eV to −0.79 eV which indicates that the H binds strongly with the catalyst surface that hinders the HER to proceed further. On the other hand, nitrogen in end-on configuration binds stronger than hydrogen that makes surface concentration of the *N2 higher, facilitating NRR to carry out further. Eventually, the positive value of for each of the six catalysts suggest their selectivity of the NRR. Moreover, among these suitable six catalysts, the Ta based C6N6 is found to be the best having a small potential barrier ∼0.39 eV for the *N2 to the *NNH step. The same for Mo, W, Nb, Hf and Re based SACs are remarkably higher for this first coupled proton and electron transfer step. The corresponding values are 0.62, 0.60, 0.54, 0.76, and 0.61 eV respectively. Therefore, only Ta based C6N6 is studied further for the final hydrogenation step, i.e., *NH2 to *NH3, which has again a reasonable potential barrier of ∼0.30 eV. The alternating pathway has also been evaluated and the potential barrier for the rate determining step becomes 0.91 eV, and therefore neglected in this study. Having the lowest potential barrier, among the 18 different metal centers, Ta@C6N6 is found to be suitable for NRR. In order to determine the thermal stability of the Ta based catalyst, ab initio molecular dynamics (AIMD) simulations compiled using the VASP simulation package (VASP) are carried out at 300 K for a time period of 15 ps with a time step of 0.5 fs. The root-mean-square (RMSD) plot is presented in Fig. 3(a) exhibits that the fluctuations equilibrate within a few picoseconds which confirms the stability of Ta@C6N6. The free energy diagram for NRR on Ta@C6N6 in the end-on configuration is shown in Fig. 3(b). After the first hydrogenation step, the next two consecutive (H+ + e−) transfers occur on the same distal N atom with an energy release of 0.28 eV and 0.42 eV respectively, shown as downhill in the energy profile. Consequently, *NNH becomes *NNH2 and *N with the release of the first NH3. This remaining *N gets hydrogenated consecutively thrice before the release of the second NH3 molecule. The corresponding ΔG values for these steps are −0.46 eV, +0.15 eV, and +0.30 eV respectively. The optimized structures of different intermediates adsorbed on the metal center of Ta@C6N6 is presented in Fig. S2 (ESI†). The free energy changes of each step in the N2 reduction reaction have been assessed considering the solvent effect which is plotted in Fig. 3(b). It is evident that ΔG of each step changes under this condition and more negative values of ΔG imply more stabilization under aqueous conditions. The potential barrier of *N2 to *NNH is found to be ∼0.38 eV. This small change in the limiting potential has no effect on the NRR pathway and, therefore, the solvation effect is not considered further for this reduction, which is in accordance with the previous report.39
Fig. 2 Free energy change of (a) N2 adsorption, and (b) first hydrogenation. (c) Relationship between the d-band center and the barrier potential. (d) Free energy change of the HER and NRR. |
In order to elucidate a correlation between the intrinsic properties of active metal center with its catalytic activity, ML has been adopted in assistance with DFT calculations. A set of features/descriptors based on the intrinsic properties of metal atoms, viz., the ionization energy (IE), electron affinity (EA), electronegativity (EN), covalent radius of the metal atom (rcov), Zunger radius (dz),40 and d-electron count (θd) have been considered as features for building the ML model to predict . A few additional descriptors have chosen based on the DFT calculations. As the chemical environment is important for the catalytic activity of the active site, we have considered the distance between the active metal center and nearest nitrogen (dM–N) as a salient parameter for demonstrating the ML model.41 Additionally, since the positive charge of the metal center plays a crucial role towards the selectivity of the NRR over the other competing reaction, HER, it is reasonable to consider the amount of charge transfer (q) as obtained through Bader charge analyses (in Fig. 1(c)) as an important descriptor. Moreover, Hammer et al. have suggested that the metal d-band centre (εd) acts as the key parameter to gauge the strength of the bonding of the adsorbate and therefore, we consider the d-band centre of the metal atoms to be a prime descriptor.42 These nine features for the 18 different metal atoms based C6N6 have been taken into account for training and testing in ML to predict . The dataset of the features taken for this study is illustrated in Table S4 (ESI†). To analyze the strength of the correlation between each of the features and eliminate further the redundant ones, the Pearson correlation coefficient (p) has been calculated using the following relation:
(5) |
The heat map, as presented in Fig. 4, demonstrates a significant correlation between rcov and εd with θd. A previous study by Zhao et al. has shown that elimination of the redundant features can enhance the performance in ML.43 Hence, θd has been omitted here from the regression models. Initially, different regression models have been counted on for cross validation over 50 different distributions of training and test split, and their RMSE values are compared in Fig. S3 (ESI†). Having low bias and low variance, the ensemble learning algorithm namely, GBR and RFR are found to be good among all of them. Henceforth, these two models are put forward in estimating the target. These 18 metal doped C6N6 are further divided into 14:4 as training and testing dataset. The values predicted via the ML algorithm and obtained from DFT have been compared. The GBR model performs comparatively well with high R-square (R2) value (0.87) and low RMSE (0.005) value respectively, as shown in Fig. 5(a). The contribution of the features towards the prediction of is shown in Fig. 5(b). It shows that the d-band center (εd) has the maximum feature importance ∼56.8%. Also, the covalent radius of the atom has a significant importance (17.7%). The contribution of other features is relatively low towards the target properties. By using the mutual information method as shown in Fig. 5(c), we have found that the d-band center become one of the most determining factor.
In addition to , the first hydrogenation step, plays a significant role towards prediction of efficient electrocatalysts. The ensemble-based classification model using the soft voting technique has been considered to predict the best catalyst with such a small dataset. The main goal of our classification model is to produce a probabilistic score in search of the efficient catalysts based on important key steps. Additionally, we have compared the results predicted via ML with that via DFT. Based on the approach proposed by Ling. et al., we adopted the two key step method, i.e, Gibbs free energy of nitrogen binding and the first protonation step to find out the active electrocatalyst.32 The cut-off barrier for these steps have been set as
The catalysts fulfilling the above criteria have been considered as an efficient catalyst and labeled as class 1; otherwise, it is considered as an inefficient catalyst and designated as class 0. From the DFT computation, out of 18 SACs, 6 catalysts (Mo, Nb, W, Re, Hf, and Ta) are labeled as class 1 and rest of them belonged to the class 0 category. Thus, we have framed our problem statement as a simple binary classification problem thereby enabling us to apply ML based classification methods. Here, we proposed ensemble models using logistic regression, support vector machine and random forest classifier and also apply the cross validation strategy with a splitting ratio of 14:4 on the total dataset. The best outcome has been obtained by bagging three classification models using the soft voting principle. The schematic flowchart of the ML process in the classification model is shown in Fig. S4 (ESI†). The atomic descriptors such as ionization energy, electronegativity, electron affinity, covalent radius of the metal atom, and Zunger radius are used to get a high probability score for the best catalyst. In the soft voting scenario, every individual classifier provides the probability score for the catalysts which are then taken as input and voting is calculated based on this probability to ensure the class in which the specific catalyst belongs. We have applied grid search cross validation strategy on each individual classification model to tune the required hyper-parameters. In order to train our classification model using cross-validation, 10 different pairs of training and test sets using random sampling without replacement have been taken from the complete dataset (shown in Table S4, ESI†). Each time, we compare the probability score (calculated using eqn (6)) of the predicted class based on our model with that of the result obtained from DFT.
(6) |
Footnote |
† Electronic supplementary information (ESI) available: List of hyperparameters of the regression model, adsorption energy of nitrogen in the end-on mode and side-on mode, feature statistics of the dataset considered for machine learning method, schematic diagram of work-flow for the classification model, cross validation result for different classification models. See DOI: https://doi.org/10.1039/d2cp01901a |
This journal is © the Owner Societies 2022 |