Omar Allam‡
ab,
Byung Woo Cho‡ab,
Ki Chul Kim*ac and
Seung Soon Jang*adef
aComputational NanoBio Technology Laboratory, School of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0245, USA. E-mail: SeungSoon.Jang@mse.gatech.edu
bThe George W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0405, USA
cDepartment of Chemical Engineering, Konkuk University, Seoul 05029, Republic of Korea. E-mail: kich2018@konkuk.ac.kr
dInstitute for Electronics and Nanotechnology, Georgia Institute of Technology, Atlanta, GA 30332, USA
eParker H. Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
fStrategic Energy Institute, Georgia Institute of Technology, Atlanta, GA 30332, USA
First published on 26th November 2018
In this study, we utilize a density functional theory-machine learning framework to develop a high-throughput screening method for designing new molecular electrode materials. For this purpose, a density functional theory modeling approach is employed to predict basic quantum mechanical quantities such as redox potentials, and electronic properties such as electron affinity, highest occupied molecular orbital (HOMO) and lowest unoccupied molecular orbital (LUMO), for a selected set of organic materials. Both the electronic properties and structural information, such as the numbers of oxygen atoms, lithium atoms, boron atoms, carbon atoms, hydrogen atoms, and aromatic rings, are considered as input variables for the machine learning-based prediction of redox potentials. The large-set of input variables are further downsized using a linear correlation analysis to have six core input variables, namely electron affinity, HOMO, LUMO, HOMO–LUMO gap, the number of oxygen atoms and the number of lithium atoms. The artificial neural network trained using the quasi-Newton method demonstrates a capability for accurately estimating the redox potentials. From the contribution analysis, in which the influence of each input on the target are accessed, we highlight that the electron affinity has the highest contribution to redox potential, followed by the number of oxygen atoms, HOMO–LUMO gap, the number of lithium atoms, LUMO, and HOMO, in order.
However, it should be noted that the current lithium ion batteries suffer from several issues such as difficulty in the diffusion of lithium ions through the bulk phase of conventional transition metal oxides of positive electrodes; which results in poor power capacities.3,6 In order to resolve this problem, it is crucial to design novel cathode materials with enhanced ionic diffusivities without compromising the charge and energy densities.3–6 Therefore, it is very desirable to accomplish the following requirements in new cathode materials: (1) high cell voltages, (2) enhanced power capacities, (3) excellent charge and energy capacities.
In this context, the efforts to identify promising positive electrode materials with high performance have not been limited to only conventional transition metal oxides, but have also been extended to metal-free organic materials such as redox-active quinone molecules.15 Organic electrode materials are attractive because the cost of storing electrochemical energy can be potentially lowered by utilizing abundant available carbon-based materials. In addition, the light weight and robust mechanical properties possessed by carbon-based materials open a door towards new flexible battery for flexible electronic devices. Thus, graphene derivatives with high conductivity and vast accessible inner spaces8,9,16,17 and quinone derivatives with electrochemically active oxygens9–11,18–26 have been studied intensively as promising organic electrode materials.
It should be noted that quantitative molecular structure–electrochemical property relationships have not been thoroughly established yet; which means that there is no practical molecular structure design guideline to obtain desired specific levels of certain properties and commensurate performances. To avoid the time consuming “trial and error” approach, a more systematic procedure is desired to tune and design materials for target properties and performances.
In this study, we attempted to establish a machine learning based high-throughput screening method that can take a role as a predictive design guideline of molecular structures. It has two components: one is quantum mechanical density functional theory (DFT) modeling and another is the machine learning method. We have used DFT modeling method for carbon-based molecular electrode materials for battery8–13 and confirmed its accurate predictability for redox potentials with small uncertainty (<0.3 V). Specifically, in this contribution we aim to train a learning model for various carbon based electrode materials.8–13
Although DFT can produce high-fidelity predictions of redox potential, it is impractical to establish a predictive high-throughput screening method only based on DFT modeling. The main reasons are that (1) the effects of various molecular architectures are not linearly proportional to the target properties and (2) multistep quantum mechanical computations for predicting accurate redox potentials are computationally very expensive. Thus, we planned to use relatively light DFT computations to obtain basic quantum mechanical characteristics such as electron affinity (EA), highest occupied molecular orbital (HOMO), lowest unoccupied molecular orbital (LUMO), and so on, which can be used as input parameters for our learning model.
Machine learning (ML), a precursor to artificial intelligence, offers an algorithm that can achieve its predictive capability by learning from an input dataset.27 Through recent reports, it has been demonstrated that ML can be used in materials science in order to predict complex behaviours of materials and thereby to help explore vast chemical space for new materials discovery.27–29 For instance, once a machine learning algorithm is trained, it can provide an immediate estimate of complex electronic and electrochemical properties of materials, which otherwise would have taken a significantly longer time to obtain experimentally or computationally through DFT. Although the application of ML in materials science is still in an early stage, it should be of great interest to validate its capability as a high-throughput screening method.
Therefore, in this study we combine DFT and ML to build a high-throughput screening method for predicting redox potential (in V). We expect ML can be expanded to utilize experimental input parameters in principle; thus this study will be a small step forward in developing more complicated ML models as high-throughput screening methods for alternative battery electrode materials.
Fig. 1 Molecules used to train an artificial neural network: (a) quinone derivatives; (b) functionalized graphene fragments; (c) boron-doped graphene fragments. |
The reduction free energy in solution phase (ΔGred(R, sol)) can be computed through the thermodynamic cycle, as suggested by Truhlar et al. (Fig. 2).33,34 The redox potentials in solution phase were calculated from the reduction free energy in solution phase (ΔGred(R, sol)) which can be written as
ΔGred(R, sol) = ΔGred(R, gas) + ΔGsolv(R−) − ΔGsolv(R) | (1) |
Fig. 2 Thermodynamic cycle used to calculate the equilibrium redox potential in the condensed phase.33,34 |
Then, the redox potential in solution phase with respect to Li/Li+ was calculated based on the free energy change for reduction in solution phase using the following equation:
(2) |
Even though one of the main disadvantages of ANN is that it can be computationally burdensome, this was not a major issue in this study due to the relatively small size of the data set. Please note that ANN was primarily used in this study for its regressive or interpolative capability to investigate the characteristics that influence the redox potential.
A preliminary analysis was performed to downsize the original 10 input variables (for those with the highest linear correlation with the target): electron affinity (EA), highest occupied molecular orbital (HOMO), lowest unoccupied molecular orbital (LUMO), the HOMO–LUMO gap, the number of aromatic rings, and the number of oxygen, carbon, boron, hydrogen, and lithium atoms. A linear correlation analysis was performed by calculating the Pearson correlation coefficients (PCC) whose absolute values are presented in Fig. 4. In this case, a PCC of 1 corresponds to complete linear correlation, whereas 0 corresponds to no linear correlation. The electron affinity, with an absolute PCC of 0.92, has the most strong linear correlation with the redox potential while the number of aromatic rings has nearly no correlation. Such high correlation between EA and RP is anticipated since EA is a measure of material's capability to attain an electron, which is a core process for electrochemical reduction. However. Please note that the EA cannot be used solely in predicting RP, which is why we need other input parameters.
Fig. 4 Pearson linear correlation factor of each input variable to the redox potential (target property). |
After training the ANN, a linear regression analysis was performed in order to evaluate the accuracy of our learning model in predicting redox potentials of organic molecular electrode materials. It is demonstrated from the linear regression shown in Fig. 5 that the correlation, slope, and, intercept are very close to 1, 1, and 0, respectively, indicating that the machine learning model with six input variables has a good predictability. Nevertheless, it is observed that ANN predictions show relatively significant errors for 11 cases at low normalized redox potential (<0.6 V). To resolve these discrepancies, we are investigating how to improve the accuracy of our current ANN further, specifically by searching for other responsible input variables.
Fig. 5 Comparison of the redox potentials predicted from machine learning with those predicted from DFT. |
Next, the individual contributions of the input variables are also analyzed to evaluate their importance in affecting the redox potential. Please note that the contribution analysis in Fig. 6 is performed by investigating the error of the neural network when the corresponding input variable is removed in predicting the redox potential. If the contribution factor of an input variable is less than 1, the error of the neural network is decreased by removing the corresponding input variable, and vice versa. In other words, if an input has a contribution greater than 1, the ANN will be less accurate without the corresponding input variable. Fig. 6 shows that all the inputs are larger than 1, indicating that all the input variables are significantly important in determining the redox potential. The most noteworthy point in Fig. 6 is that the electron affinity has the highest contribution in the estimating the redox potential; this is followed by the number of oxygen atoms, the HOMO–LUMO gap, the number of lithium atoms, LUMO, and HOMO, respectively. It should be pointed out that although LUMO, HOMO, and HOMO–LUMO gap are not independent, a removal of any of these three variables led to an increase in the error of the neural network. Keeping these three variables reduces the error without incurring significant computational cost. Furthermore, despite not being completely independent, the contribution of HOMO–LUMO gap is greater than HOMO and LUMO alone. This is interesting since it would have been presumed that the contribution of the HOMO–LUMO gap would lie between the contributions of HOMO and LUMO given its dependency with both variables. Nevertheless, it is definitely worth further investigating to determine how to better optimize the model with only two out of the three aforementioned variables for more efficient and accurate prediction.
Fig. 6 Contributions of each input variable to the redox potential (target property) in the machine learning. |
Fig. 7 The correlation of input variables to redox potential while other input variables are kept constant: (a) the electron affinity; (b) the number of oxygen atoms. |
Variable | Average values |
---|---|
HOMO (eV) | −5.458 |
LUMO (eV) | −2.944 |
EA (eV) | −2.250 |
#O | 0.8796 |
HOMO–LUMO gap (eV) | 2.514 |
#Li | 0.2129 |
Fig. 7a shows that the redox potential is monotonously decreased as a function of the EA, which indicates that the electron affinity is a governing variable in control redox potential. Therefore, it is shown that EA is a property that can be utilized to tune the redox potential of molecular electrode materials. On the other hand, in Fig. 7b, it is demonstrated that the redox potential is fluctuated as the number of oxygen atoms is increased. This is an unexpected result since it has been thought that the redox potential would be proportional to the number of oxygen atoms because the oxygen atom is known to have electron-withdrawing effects due to its high electronegativity, which would increase the electron affinity, as published previously.39 However, our model indicates that solely increasing the number of oxygen atoms (or any electronegative atoms) does not necessarily result in the increase of redox potential. Nevertheless, various aspects such as the distribution of oxygen atoms and the type of bond connectivity of oxygen atoms should be considered to obtain the complete understanding about the behavior of redox potential in Fig. 7b.
Lastly, the trained ANN was validated by comparing the redox potentials estimated by the learning model with those from DFT computations for the cases which were not included in the training data sets; namely the 6 cases obtained from other publications.11,39 Table 2 shows the error in predicting the redox potential. The averaged error is approximately 3.54%. In this study, given the relatively limited size of the data set for training the ANN, the predictive capability of the trained ANN model is demonstrated from this relatively small error.
Redox potential (V) from literatures11,39 | Redox potential (V) predicted by ANN | Error (%) |
---|---|---|
2.30 | 2.28 | 0.670 |
2.10 | 2.11 | 0.491 |
−1.80 | −1.81 | 0.722 |
3.48 | 3.50 | 0.419 |
3.12 | 3.54 | 13.3 |
3.51 | 3.71 | 5.68 |
Average error | 3.54 |
Out of the various machine learning methods available, we employed artificial neural network (ANN) since it has been known to capture complicated nonlinear relationships among variables. First, a correlation analysis was performed as a preliminary screening method to downselect irrelevant input variables with low correlation; such as the numbers of hydrogen atoms, boron atoms, carbon atoms, and aromatic rings. Then, ANN was trained using quasi-Newton method, and its accuracy was validated by comparing the redox potentials predicted from ANN and those calculated from DFT modeling.
Next, the contribution analysis for each input variable to the redox potential was performed through investigating the error caused by the corresponding input variable when it was removed. From this analysis, it was found that the electron affinity has the highest contribution to redox potential, followed by the number of oxygen atoms, the HOMO–LUMO gap, the number of lithium atoms, LUMO, and HOMO in order respectively.
The ANN was further validated by estimating the redox potentials for 6 cases which were not included in the training dataset. The redox potentials predicted by ANN were in good agreement with those calculated from DFT modeling, whose averaged error was 3.54%.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c8ra07112h |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2018 |