Geun Ho
Gu
,
Juhwan
Noh
,
Inkyung
Kim
and
Yousung
Jung
*
Graduate School of EEWS, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro 34141, Daejeon, 305-335, South Korea. E-mail: ysjn@kaist.ac.kr
First published on 30th April 2019
Achieving the 2016 Paris agreement goal of limiting global warming below 2 °C and securing a sustainable energy future require materials innovations in renewable energy technologies. While the window of opportunity is closing, meeting these goals necessitates deploying new research concepts and strategies to accelerate materials discovery by an order of magnitude. Recent advancements in machine learning have provided the science and engineering community with a flexible and rapid prediction framework, showing a tremendous potential impact. Here we summarize the recent progress in machine learning approaches for developing renewable energy materials. We demonstrate applications of machine learning methods for theoretical approaches in key renewable energy technologies including catalysis, batteries, solar cells, and crystal discovery. We also analyze notable applications resulting in significant real discoveries and discuss critical gaps to further accelerate materials discovery.
![]() | ||
| Fig. 1 Global greenhouse gas (GHG) emissions as implied by the Intended Nationally Determined Contributions (INDCs) compared to the no-policy baseline, current-policy and 2 °C scenarios reproduced with permission from ref. 2 Copyright 2016 Springer, Nature. White lines show the median, and 20th–80th-percentile ranges are shown for no-policy and least-energy-economy-cost 2 °C scenarios, and the 10th–90th percentile across all studies is shown. | ||
Machine learning-based data-driven approaches have demonstrated tremendous impact in a number of aspects. The advancement in computational power and the emergence of big data have led to the success of machine learning methods in retail,7 medical diagnoses,8 image recognition,9 and speech understanding.10 These successes accompanying the advancement in machine learning have attracted interest in its application to science and engineering led initially by the U.S. Materials Genome Initiative.11
Conventional trial-and-error theoretical and experimental research studies involve an in-depth understanding of interesting, useful phenomena followed by the exploitation of new knowledge to test better materials. Such intuition-based approaches provide insightful knowledge but are not the most efficient approach for discovering new materials. Experimental approaches involve synthesis, characterization and property analysis by manual labor, resulting in a slow turnover. On the other hand, computational simulations can allow easier integration of high-throughput screening through automation. However, finding suitable descriptors or a model for the material's activity of interest is difficult and the ability to bridge the time-and-length scale to obtain macroscale properties is limited due to the large computational cost. In addition, computational simulations alone are not enough to discover materials as the simulations may not be realistic. For this, machine learning and robotics offer a systematic solution to speed up discovery.
Machine learning offers a flexible and accessible framework that correlates the materials' a priori knowledge with the properties of interest with respectable accuracy and speed given a large data set. For the experimental approach, interpreting and designing of the experiment can be delegated to the machine learning algorithms while robotics is used to automate the aforementioned manual labor.12 Notably, the Autonomous Research System, ARES, has used this approach successfully to optimize carbon nanotube synthesis.13 For the computational approach, machine-learning models offer a practical solution for high-throughput screening as theoretically developing a highly flexible model that correlates a priori knowledge with the properties of interest is often difficult. We do note that developing theoretical models and understanding is important for developing machine learning models as it provides insights into effective descriptors and designs.
Machine learning is also useful in coupling the small and large time-and-length scales. For example, molecular dynamics simulations are often performed with a surrogate model such as a force-field, but the accuracy of the energy and force is typically lower than that of the ab initio methods. Machine learning models can provide high accuracy while maintaining the large speed up from ab initio calculations.14 Another exciting and promising area of machine learning is inverse design where the desired properties of the material are given to the model, and materials with those properties are outputted.15 Its application to functional materials discovery has enormous potential in academia and industry, but the inverse design model development for materials is still in its infancy.
The U.S. Department of Energy has identified several major technical advancement goals for realizing a secure and sustainable energy future: (1) production of fuels from sunlight, (2) carbon-neutral electricity generation, and (3) revolutionizing energy efficiency and use.16 Among renewable sources of energy including wind, geothermal heat, and hydropower, solar energy is the most available and practical resource to harness.17 However, with current solar cell technology and its growth rate, solar cells are predicted to supply only 10 percent of the carbon-free energy demand in 2050.18 While the solar cell has grown to be the least expensive technology to produce energy in many countries,12 continued improvements in the cost, solar conversion efficiency, and energy storage are needed to meet the global energy demand sustainably.18
Currently, the transportation sector is almost completely dependent on fossil fuels. Replacing them with renewable and clean-burning fuel is thus essential, and discovering catalysts that convert water, biomass, and CO2 to hydrogen fuels and small hydrocarbons such as methanol and ethanol is critical.16 A large emphasis has been placed on discovering water-splitting catalysts as hydrogen is the cleanest fuel.17 Hydrogen fuel production, delivery and dispensing have become nearly cost-competitive with gasoline, and continued innovation and deployment is key.19 In addition to fuels, developing catalysts for renewable production of chemicals and plastics from biomass and CO2 is widely investigated as well. While renewable fuels can power transportation either by direct burning or generating electricity through fuel-cells, battery-operated cars are another solution to renewable transportation. In this regard, developing better batteries is necessary to increase range, reduce weight and improve efficiency.16 Also, renewable energy sources such as solar and wind energy suffer from inconsistent power output due to the day and night cycles and weather. Thus, developing battery technologies that provide large amounts of electricity over a long period of time is important.16 To address this, improvement of redox flow fuel-cells, which can be thought of as a hybrid of a battery and a fuel cell, to exhibit the potential to store electricity in large quantities, was carried out. In addition, improving electric grid reliability can reduce economic overhead expenses. The Lawrence Berkeley National Laboratory estimated that power outages cost the US about $80 billion annually.20 The materials discovery of high power energy storages to address micro-outages is a critical issue to improve electric grid reliance.16 As briefly discussed above, the trend of replacing the energy supply infrastructure with renewable energy sources is currently not encouraging, and CO2 emission from combustion is likely to continue.17 In this regard, developing CO2 capture materials can negate the CO2 emissions from using fossil fuels.17Fig. 2 summarizes and categorizes these key renewable energy technologies.
These challenges in realizing a renewable society can be addressed to some extent at the manufacturing level and by (non-)state actions, but, in essence, discovering game-changing materials with superior efficiency and low cost offers a fundamental solution. In this review, we aim to provide a broad overview of machine learning research in innovating renewable energy technology with a focus on the computational approaches. We refer to other in-depth reviews of machine learning methods for the physical system,6,21–26 and here we focus on applications of various machine learning techniques towards renewable energy materials development. Section 2 introduces key studies on critical renewable energy technologies discussed above such as solar cells, catalysts, CO2 capture, and batteries. We also summarize the emerging inverse design machine learning and others with large implications in accelerating materials design.27 To move forward, we report the success stories that have led to the real discovery of practical materials in Section 3. Finally, we close this review with the remaining gaps and prospects.
![]() | ||
| Fig. 4 (a) Demonstration of the Sabatier principle for predicting optimal catalyst activity, and (b) workflow chart of catalyst screening via machine learning. | ||
While electronegativity and coordination number descriptors have shown good performance, the reactivity difference among alloys is often derived from d-band characteristics.54 However, the quantifying d-band characteristics for an alloy surface require DFT calculations, and are thus not ideal for high-throughput screening. In order to exploit d-band information without DFT, Noh et al. have introduced the d-band width of the mean-field muffin-tin orbital theory and used it together with electronegativity to compute the CO binding energy. Using the active-learning algorithm to sample the most informative data, the kernel ridge regression model demonstrated the current state-of-the-art performance of 0.05 eV RMSE for CO binding energy on the (100) surface.55 This study also has suggested a number of promising catalyst candidates for the CO2 reduction reaction.
More recently, machine learning is used to guide DFT calculations to find optimal electrocatalysts for CO2 reduction and H2 evolution through binding energy prediction. Here, the TPOT machine learning package is employed to automatically select and run DFT calculations where 80% and 20% of the DFT calculations were dedicated to optimizing the model and finding optimal materials, respectively.56 The automated calculation framework performed for little over a year and resulted in the identification of 131 promising candidate surfaces from various alloys made from a pool of 31 elements. As opposed to the studies above, the binding energy is computed with DFT accuracy as opposed to ML accuracy, thus adding credibility to the suggested candidates.
In a similar vein, Takigawa and co-workers have investigated models predicting the d-band center.57 Ordinary least squares regression, partial least squares regression, Gaussian process regression, and gradient boosting regression are used with nine readily accessible physical descriptors of metal atoms that do not require ab initio calculations. Bimetallic alloys with impure and overlay surfaces are considered. Gradient boosting regression showed the highest accuracy at 0.17 eV and 0.19 eV RMSE for impurity and overlay surfaces, respectively. In the interest of designing catalysts for hydrocarbon selectivity from the conversion of renewably obtained methane, Takigawa and co-workers have employed accessible descriptors to predict the binding energy of CH3, CH2, CH, C and H on Cu-based alloys.58 Out of nine regression methods, extra tree regression demonstrated a RMSE near 0.3 eV. Besides predicting the active components, controlling the structure of the catalyst can improve activity and binding energy, differing based on the facets of the surface. In this regard, Yıldırım and co-workers implemented a neural network to predict the CO binding energy on Au clusters for CO oxidation.59 CO oxidation is an important process for reducing CO emission from transportation and industries. Au catalysts have demonstrated good activity for CO oxidation, but the active site is not well known. Yildirim and co-workers have used the cluster size, overall charge, unpaired-electrons and coordination number to predict binding energies via a neural network. Asahi and co-workers have predicted N, NO and O binding energies and the formation energy of RhAu octahedral nanoparticles with various surface compositions using SOAP kernel regression.60,61 The binding energies and formation energies are predicted at ∼0.1 and 0.02 eV RMSE, respectively. Then, the binding energy and formation energy are used to estimate catalyst activity and predict nanoparticle stability, respectively. This work demonstrates the one-shot approach to directly predict catalyst activity given the composition of the nanoparticle.
500 MOF structures is considered, where grand canonical Monte Carlo combined with the Lennard-Jones potential and DFT-derived electrostatic potential is used to model electronic interactions. The model was able to capture most of the known active MOFs. Froudakis and co-workers used the presence or absence of substructure patterns in MOFs as descriptors and “Just Add Data”, an automated machine learning analysis tool, to predict the CO2 and H2 uptake capacity.66 Compared to the earlier study by Woo and co-workers, this model is able to predict the continuous uptake capacity value and is trained using experimental data of 100 MOFs. Similar to Woo and co-workers, Gómez-Gualdrón and co-workers investigated CO2 capture prediction of MOFs using DFT, grand canonical Monte Carlo, and machine learning.67 Here, various storage properties of 400 MOFs are determined which are then used to train six different machine learning models via 13 different electronic and geometric descriptors. Instead of CO2, Woo and co-workers investigated storage capacity prediction for methane using the geometric features, such as pore size and void fraction, of 130
000 MOFs with multilinear regression, decision trees, and nonlinear support vector machines.65 Analysis of the model revealed the desired geometric properties of the MOF that could lead to high methane storage.
Liquid electrolytes are often based on organic solvents and suffer from flammability. In this regard, solid electrolytes are promising as they are generally less flammable and have a larger electrochemical window than liquid electrolytes. However, solid electrolytes suffer from low Li ion conductivity and thus research on them focuses on finding highly conductive materials. In computational approaches, the Li-ion migration barrier is widely computed as a surrogate measurement of ionic conductivity. Jalem et al. applied the partial least squares algorithm to predict the migration barrier of olivine-type LiMXO4.73 Analyzing the model coefficient plot and variable importance in projection plot revealed that descriptors such as the ionic size of M and local lattice distortion are important for migration barriers and new promising olivine-type solid electrolytes were suggested. Furthermore, Jalem et al. employed the neural network framework for olivine-type LiMXO4 as solid electrolyte candidates to predict the migration barrier and cohesive energy.74 The neural network migration barrier prediction model reported by Jalem et al. was applied to tavorite type LiMTO4F materials as a solid electrolyte.75 The Li migration trajectory has also been widely studied to understand the Li migration mechanism and elucidate the rate determining step which can be improved for ionic conductivity. Chen et al. developed a density-based clustering method to elucidate Li migration trajectories of garnet-type solid electrolytes.76 In addition, Li ion conductivity was directly evaluated using experimental measurements where the support vector regression algorithm with descriptors of diffusivity at 1600 K, average volume of the disordered structure, ordered/disordered phase transition temperature and temperature at which ionic conductivity was measured, were employed to classify the Li ion conductivity of LiSiCON type solid electrolytes77 and garnet-type solid electrolytes.78 Jalem et al. applied the Gaussian process with Bayesian sampling to predict the migration barrier of tavorite type solid electrolytes.79
For electrodes, Li ions intercalate and deintercalate into and from the electrodes to store and release energy. Intercalation characteristics of the electrode are critical for energy and power densities. To achieve high energy density, researchers have focused on understanding structure–performance relationships by controlling the elements and structure of electrode materials. In this regard, Shandiz et al. classified three crystal structures (monoclinic, orthorhombic, and triclinic) from Li–Si–(Mn, Fe, Co)–O compositions using five algorithms.80 Among them, random forests and extremely randomized trees showed the highest accuracy. Wang et al. used partial least squares regression to distinguish the descriptors which described the volume change during delithiation of cathode candidates well.81 Eremin et al. applied ridge regression to predict the energy of LiNiO2 and LiNi0.8Co0.15Al0.05O.82 From the results, common features that (de)stabilize the structure and stabilization effect of doping were discovered. The investigation revealed important descriptors for evaluating the migration barrier of LiMTO4F structures and the model was extended to predict the values of a database containing two structure types, both having the 1D Li path in common. Okamoto employed Kernel ridge regression with Bayesian optimization sampling to evaluate the change in Gibbs free energy before and after Li intercalation into a graphite anode.84
One of the high power energy storage techonology is called, Superconducting Magenetic Energy Storage (SMES). Majority of the SMES cost is in the superconductor materials. Discovering economic superconductors may aid in the SMES distribution. However, machine learning-based research on superconductive materials is preliminary, as the mechanisms behind high-temperature superconductivity are not clear. Stanev et al. applied the random forest method to classify critical temperatures to identify superconductive materials. 35 promising materials and their common characteristics were determined.84 For the redox flow battery, Kim et al. have developed a multiple descriptor multiple kernel method that predicts the solubility of active molecule derivatives of the molecular candidates in an aqueous electrolyte depending on the pH.85
000 perovskites with further analysis of the stability using the DFT calculations. Dey et al. investigated a machine learning approach for predicting the bandgap of chalcopyrite type materials using OLS, SPLS, elastic net, and LASSO coupled with rough set and principal component analysis methods.92 A total of 15 accessible elemental properties are used as descriptors to predict the bandgap, where 28 data points are used for training. The bandgaps of 227 chalcopyrite materials are predicted. Though this study greatly expanded the knowledge of the solar cell performance of chalcopyrite materials, the lack of training data limited the testing of model accuracy. Beyond specific types of materials, Ward et al. developed a flexible framework that can be used for crystalline or amorphous materials.93 Here, the OQMD database is used where properties such as the volume, formation energy, and bandgap were the target properties and 145 attributes were used as descriptors. The algorithms such as partial least squares regression, LASSO, decision tree, kernel ridge regression, Gaussian process regression, and neural networks are used together with an effective data partitioning strategy for high accuracy. On the other hand, the organic solar cell has been gaining popularity due to its accessible electronic property engineering as well as its cheap cost. Aspuru-Guzik and co-workers have used the Harvard Organic Photovoltaic Dataset where molecular fingerprint methods were used together with the Gaussian process to predict properties such as frontier orbital energies, optical gaps, current density, open circuit voltage, and power conversion efficiency. The same strategy is applied to discover non-fullerene acceptors where the molecular fingerprint based Gaussian process is used to leverage the difference between the DFT calculated and experimental frontier molecular orbital energies.94 Schmidt and co-workers have developed the variational autoencoder method where context-free grammar representations of molecules are implemented to predict the LUMO and optical transition energy. The variational autoencoder enables inverse modeling, which is used to identify a number of molecules with desired properties.
In addition to the bandgap, the stability of the materials is critical for their longevity. The stability of the materials can be analyzed by calculating the energy above the convex hull (in eV per atom) within the phase diagram constructed from the given elements in a theoretical or experimental manner. We note that the energy above the convex hull is defined as the difference between the enthalpy of the formation of a target material and the most stable enthalpy of formation evaluated from the phase diagram. In this regard, Li et al.95 proposed classification and regression models for predicting the energy of the convex hull of a perovskite oxide material using the elemental property data (the best F1 score for classification is 0.881 and the best RMSE for regression is 28.5 meV per atom). Furthermore, the model was used to discover 15 new promising perovskite materials. In addition, Schmidt et al.96 tested various types of machine learning models including ridge regression, random forests, extremely randomized trees (including adaptive boosting), and neural networks to predict the energy above the hull for 250
000 cubic perovskite materials (ABX3), where extremely randomized trees outperformed all the other models with a mean absolute error of 121 meV per atom.
In order to search for sunlight absorbing molecules, Xin and co-workers tested LASSO, kernel ridge regression, support vector machines, and neural networks to predict energy gaps of porphyrin molecules using molecular fingerprints, Coulomb matrices, chemoinformatics, and electro topological-state indices as descriptors.99 The model captures the energy gaps within 0.06 eV RMSE, and its analysis suggests structural motifs that influence energy gaps via sensitivity analysis. Besides utilizing CO2 and splitting water, biomass utilization is another attractive pathway to produce chemicals and fuels. Lignocellulosic monomers are already functionalized as opposed to petrochemicals which need functionalization, and aromatic structures, widely synthesized in the petrochemical industry, are already present. However, theoretical studies of biomass conversion are difficult due to their large reaction network.100 In this regard, Vlachos and co-workers have applied LASSO to predict the thermochemistry of biomass monomer adsorbates on metal surfaces using subgraph descriptors.101 The model demonstrated a RMSE of 0.09 eV for the heat of formation of lignin monomers on the Pt(111) surface. Such a model reinforces the drawbacks of a popular semi-empirical method for predicting energy called group additivity, where manually identifying graph descriptors is difficult.102–104 Metal heterogeneous catalysts are often synthesized by supporting metal particles on oxides, but theoretical metal catalyst studies often use ideal surfaces as a model catalyst. In this regard, Hammer and co-workers introduced a genetic algorithm to globally optimize the metal nanoparticle structure on supports, where the pairing of candidates involved cutting of two candidate particles in half and splicing.105
While a large number of machine learning studies on catalysts focused on metal alloys, several studies addressed homogeneous catalysts. Rothenberg and co-workers investigated cross-coupling reactions via a neural network where steric and electronic descriptors of ligands, substrates, catalyst precursors and 412 Heck reactions are correlated with experimentally computed catalyst activity.106 Also, Kulik and co-workers have been leading the efforts in developing machine learning frameworks for transition metal complexes, which are used often in homogeneous catalysts.107 Kulik and co-workers introduced molSimplify, an automated toolkit for screening and discovery of inorganic and intermolecular complexes.108 The neural network has been implemented to predict important properties such as energetics, metal–ligand bond lengths, spin-states, and oxidation states and the model accounts for DFT functional sensitivities. An example is the redox potential design of octahedral Fe(II/III) redox couples with nitrogen ligands. Furthermore, they have introduced revised autocorrelation functions that encode atomic properties to molecular graphs demonstrating great accuracy (0.26 eV mean absolute error for atomization energies) for various properties of metal complexes.109 Finally, the genetic algorithm and neural network are employed to discover spin-crossover complexes, which have applications as spin-based switches.110 This work demonstrates the accelerated discovery of machine learning augmented screening.
Beyond the specific applications discussed above, machine learning methods have been extensively applied for diverse materials applications due to their flexibility.111 Faber et al.112 proposed the formation energy prediction machine learning model for Elpasolite materials by utilizing kernel ridge regression. The developed model was used to screen all possible candidates suggesting that 90 out of 212 new structures are predicted to be on the convex hull. Legrain et al.113 utilized a random forest machine learning model to predict the force constant of the 121 metastable phase of KZnF3 with a mean absolute error of 0.17 eV Å−2, and the predicted force constant was used to estimate phonon spectral features, heat capacities, vibrational entropies, and vibrational free energies, which were in good agreement with the ab initio calculations. Furthermore, Pilania et al.114 proposed a machine learning model to estimate diverse physical properties (formation energy, bandgap, elastic constants and so on) for more than 1200 binary wurtzite superlattices. Dixon and co-workers have developed a linear model predicting thermochemistry using fragments of ZnO nanoparticles, predicting various phase transitions and providing insights on particle growth for ZnO.115 All of the studies discussed here lead to a concrete conclusion that combining the ab initio calculations with the novel machine learning model can help to accelerate the understanding of novel materials.
![]() | ||
| Fig. 6 Machine learning potential enables microscale simulations to understand other key properties of interest. | ||
037 bulk structures, and 5347 slab geometries is used to achieve a 2 meV per atom RMSE. Such modeling has been applied to copper bulk and surfaces as well.123 The MLP framework of Behler and co-workers has been developed into the Aenet software package that can systematically generate data sets to develop a model.124 Software evaluation demonstrates independence of the CPU time from the number of atoms, attractive for the multi-scale approach. Similarly, Kitchin and co-workers have employed a MLP framework for zirconia to test its ability to predict diverse bulk properties.125 A total of 2178 DFT calculations are used to train the model which demonstrated high accuracy for formation energy, the equation of states, oxygen vacancy formation energies, and diffusion barrier prediction. MLP has also shown great predictive ability for predicting surface energy, palladium vacancy formation, diffusion barriers, and adatom diffusion barriers for palladium particles.126
Artrith and co-workers have investigated solvents and Au–Cu alloys using MLP. Au–Cu alloys have shown promising overpotential and stability for CO2 reduction electrochemistry.129 However, identifying the active site for electrochemical reactions is difficult due to the solvent and adsorbate. The MLP model is trained using 24
995 DFT calculations consisting of the Au–Cu alloy bulk, slab, and clusters in a vacuum and in water. The developed model was combined with molecular dynamics and Monte Carlo simulations and predicted a Cu–Au core–shell structure in agreement with experimental results. The temperature dependence of the core–shell structure is observed and a potential strategy for nanoparticle structure control during experimental synthesis is suggested. A computation-based synthesis suggestion is atypical due to the complexity of the synthesis simulation, demonstrating the MLP's ability to couple the time–length-scale. In addition, Artrith and co-workers have shown that the Cu–Au alloy structure changes from the core–shell structure in a vacuum to a mixed surface in an aqueous solvent.130 This demonstrates that MLP can be used to understand the surface structure under reaction conditions in order to perform DFT investigation more in-line with experimental conditions. The MLP predicted nanoparticle structure agreed well with the Wulff-construction predicted structure, validating the neural network potential-based particle prediction. While Artrith and co-workers used a simple frozen water shell model to account for the solvent effect, Behler and co-workers performed water–copper interface dynamics simulations for various surfaces.131 Here, the water–copper interaction strength has been shown to depend on the facets, and structures of the interface hydration layer have been analyzed.
MLP has also been applied for the gas phase adsorbate surface system as well. Kroes and co-workers investigated the N2/Ru(0001) system where the phonons, wave-like vibrations of surface atoms, are used to describe dissociative chemisorption of N2 more in-line with the experimental conditions.132 Combining MLP with molecular dynamics allows the computation of a sticking coefficient lower than previously possible using ab initio molecular dynamics, which also shows good agreement with the experiment. Kitchin and co-workers have implemented MLP to predict dynamic interactions between oxygen atoms on the Pd(111) surface which enables molecular dynamics simulation of adsorbates on catalytic surfaces.133
On the other hand, Nørskov and coworkers implemented MLP for classic binding energy calculation problems.134 In the interest of predicting CO2 reduction activity, MLP is used to learn the CO binding energy for various catalytic site environments of the Ni–Ga alloy. The geometries of CO at 583 binding sites were relaxed using neural network potential, and if the neural network potential error rose above 0.2 eV, DFT calculation is performed which is subsequently used as a training set. This modeling revealed the active sites for the Ni–Ga alloy, providing a rationalization for its high activity for CO2 reduction. Compared to full explicit DFT calculations, only 10% of DFT calculations are used, demonstrating computational time efficiency of the machine learning approach.
000 theoretically calculated diffraction patterns from the 3D atomic arrangement are used to train the deep learning model. Although this model is limited to only 8 crystal systems, the proposed model outperforms other packages significantly regardless of defects in the crystal structures. In the case of an experimental investigation, the extraction of symmetry group information from the spectroscopy data can enhance experimental characterization capability. In this regard, Park et al.141 trained a deep learning model to classify X-ray diffraction patterns into 230 space groups, 101 extinction groups, and 7 crystal systems simultaneously. The model demonstrates a reliable accuracy of 81.14, 83.83 and 94.99% for the space group, extinction group, and crystal system, respectively. On the other hand, a couple of models have been developed to predict the crystal structure type given the fixed chemical formula type such as equiatomic binary (AB)142 and ternary (ABC)143 compounds. Here, the support vector machine was used for the high-throughput classification model. Notably, the model identified a new experimentally validated material, an RhCd compound with the CsCl-type structure.142 Furthermore, Oliynyk et al.143 experimentally confirmed 19-polymorphs between TiNiSi- and ZrNiAl-type structures in agreement with experimental results.
Fischer et al.144 approached materials discovery using statistics and proposed data mining structure prediction (DMSP), a probability-theory-based model, for binary alloys, and the model was used to predict novel nitrogen-rich nitride materials.145 Furthermore, this concept was extended by Hautier et al.146 to ternary materials where 209 new compounds were discovered with a minimal computational budget. The model predicted two new compounds in the Mg–Mn–O system, MgMnO3 and Mg2Mn3O8, and for MgMnO3, the diffraction pattern matched the experimental diffraction pattern. On the other hand, Ryan et al.147 proposed a neural network based model using the normalized atomic fingerprints to predict crystal structures given alloy compositions.
Besides the supervised-learning models introduced above, DFT-based evolutionary algorithms have been widely employed to predict crystal structures and generate materials with target properties. The software packages Crystal structure analysis by particle swarm optimization148 and XTALOPT149,150 are well known. Zhou et al.151 used XTALOPT to predict host–guest Na–Fe intermetallics at high pressures and Na3Fe and Na4Fe were predicted to be stable at pressures above 120 and 155 GPa, respectively. All the predicted materials have formed a host–guest-like Na sublattice structure. These structures are similar to the host framework of the self-hosting incommensurate phases observed in group I and II elements. In addition, the model is further used to find 2D B2S materials and discover new anisotropic 2D-Dirac cone materials.152 Furthermore, Wang et al.153 used the evolutionary algorithm to predict new metastable allotropes of Li2MnO3 as cathode materials under a high pressure of 20 GPa. Similarly, Shamp et al.154 predicted the most stable hydrides of phosphorus (PHn, n = 1–6) at 100, 150, and 200 GPa, pressure of which the phosphorus hydrides decomposes to elemental phases such as PH2 and H2. Interestingly, three metallic PH2 phases have been found that are dynamically stable and superconducting between 100 and 200 GPa providing new insights on high-pressure-driven materials with properties that cannot be observed at 1 atm.
178 candidates. Similarly, Balachandran et al.,157 investigated into perovskite structure classification using the two-step machine learning models: one for classifying perovskite and the other for classifying the cubic perovskite structure. The proposed models were trained with the experimentally known ABO3 compounds. High-throughput screening was performed and revealed 625 ABO3 compounds which were further analyzed using DFT calculations, suggesting 87 highly promising cubic perovskite materials. All the listed results suggest that the data-driven approach can be effectively used to determine the class of the crystal structure.
Machine learning can also be applied to perform optimization for various materials and device designs. For example, Ma et al.158 proposed a deep learning framework for design parameter prediction for on-demand design of the chiral metamaterials where the developed model enables not only the prediction of light–matter interaction properties of devices but also the proposal of design parameters for nano-photonic devices suggesting that the deep-learning-based model effectively used real world device design. A similar approach was also proposed by Peurifoy et al.159 where the author proposed a neural network model for the inverse design of nanophotonic particle simulation with the analytical gradient method. Furthermore, Liu et al., by combining a forward neural network (property prediction network) and an inverse network (input feature prediction network), overcame non-uniqueness in all inverse scattering problems. Interestingly, the structure of the proposed model is quite similar to that of the novel autoencoder widely used for generative models, but the authors used each part of the autoencoder (i.e. encoder and decoder) as independent regression models to handle the fundamental non-uniqueness of the inverse scattering problem effectively. One interesting study on machine learning for real materials synthesis was conducted by Yuan et al.160 to predict electrostrain of Pb-free BaTiO3 (BTO)-based materials. Here, the author used both exploration (using uncertainty) and exploitation (using only model prediction) to find out the optimal criterion for new novel BTO-based materials, and (Ba0.84Ca0.16) (Ti0.90Zr0.07Sn0.03)O3 was confirmed to be a novel piezoelectric material with large electrostrain both experimentally and theoretically.
Aspuru-Guzik and co-workers have been leading a one-shot approach where high-throughput computational screening is performed followed by an experimental demonstration of the discovered materials (Fig. 8). While organic light-emitting diodes (OLEDs) have many industrial applications due to their high efficiency and color properties, blue OLED development has been particularly difficult due to the higher energy needed for excitation. In this regard, Aspuru-Guzik and co-workers have presented a highly integrated design process involving theoretical insight, quantum mechanics, machine learning, industrial expertise, and experiments to discover new highly efficient blue OLEDs.161 Here, the chemical space is defined using a combinatorial enumeration of defined fragments. These fragments are selected using theoretical intuition. Molecules with unstable substructures known from experiments are filtered as well. Then, a neural network is employed to find the best OLED candidate, which is analyzed by time-dependent DFT calculations, resulting in a total of 400
000 calculations. From the 400
000 candidates, four candidates were experimentally validated after 2500 human experts voted for property novelty of candidates and synthetic accessibility. The study demonstrated one validated candidate showing 22% external quantum efficiency as well as about one thousand potential candidates with equal or better performance. This study demonstrates that an end-to-end highly integrated approach directly leads to the discovery of new materials.
![]() | ||
| Fig. 8 Collaborative discovery approach adopted by Aspuru-Guzik and co-workers to discover blue OLED materials (adapted with permission from ref. 161 Copyright 2016 Springer, Nat. Mater.). The screening stages integrating theoretical and computational approaches and experimental input and testing were the key to successful discovery. | ||
One of the methods to produce white light using LEDs involves phosphor coating on a light emitting diode (LED), where part of the LED emission is absorbed and re-emitted as photons at different wavelengths. The combination of all the photons results in white light. Such engineering simplifies the design and improves the efficiency of the white light LED, but only a handful of phosphor materials have been reported. In this regard, Brgoch and co-workers employed DFT, the support vector machine regression model, and experimental validation to discover NaBaB9O15 which is highly efficient and stable.162 Here, support vector machine regression is trained with 2610 DFT-based Debye temperature from the Materials Project database that correlates with the quantum efficiency of the materials. Then, the model is used to screen 2071 materials (1) that are available in Pearson's crystal database, (2) for which the bandgap is available in the Materials Project database, (3) that are ternary, and (4) that are non-metals. Out of these, NaBaB9O15 shows the most ideal Debye temperature and bandgaps, which are further validated using experiments (see Fig. 9). The key in this study was the screening of materials from the database which contains experimentally observed materials.
![]() | ||
| Fig. 9 Machine learning predicted Debye temperature against the calculated bandgap. Machine learning predicted Debye temperatures (ΘD,SVR) against the density functional theory calculated bandgaps (Eg,DFT) for 2071 compounds predicted (adapted with permission from ref. 162 Copyright 2018 Springer, Nat. Commun.). | ||
Degradation of battery performance caused by electrolyte decomposition can be improved by adding electrolyte additives as discussed above. Anode additives are reduced prior to electrolyte solvents and cathode additives are oxidized prior to electrolyte solvents to form a stable solid electrolyte interphase layer to reduce the irreversible capacity.163 In this regard, calculating the reduction and oxidation potential can help find promising electrolyte additives. Park et al. used a neural network model to predict the oxidation and reduction potentials for organic additives and solvents using 86 descriptors, such as bonding types, functional groups and so on.164 The relationship between the redox potential and the functional group was proposed as in Fig. 10(a). From the results, it can be seen that organic compounds containing double bonds are prone to reduction and unsusceptible to oxidation, i.e., the compounds can be used as anode additives. Among various candidates that meet these conditions, quinoxaline was tested for full cell applications and validated to improve cycle life as shown in Fig. 10(b).
![]() | ||
| Fig. 10 (a) Schematic of the distribution of functional groups on the potential plane, (b) cyclic performance of a Li(Ni0.88Co0.11Al0.01)O2/graphite full cell with and without a quinoxaline additive (adapted with permission from ref. 164 Copyright 2016 Royal Society of Chemistry, Phys. Chem. Chem. Phys.). | ||
Saeki and co-workers applied a similar strategy, where molecular fingerprinting techniques are combined with neural networks and random forests to predict the bandgap, molecular weight and power conversion efficiency for fullerene polymer using approximately a thousand experimentally calculated properties of polymer-fullerene. To demonstrate materials design (see Fig. 11), 2.3 million molecules from the Harvard Clean Energy Project database were screened. A total of 1000 molecules were selected from the database based on the first-principles calculated properties, 149 molecules of which were selected after the screening using a random forest model. One molecule was manually chosen based on its possibility of synthesis. The study identified a new polymer with a power conversion efficiency of ∼5.4%.165 The study shows stage by stage screening starting from the existing large first-principles database, followed by a machine learning model trained using experimental values to narrow the gap between the theory and experiments. Finally, manual consideration was used to decide the synthesis accessibility of the screened material.
![]() | ||
| Fig. 11 Polymer design scheme combining first-principles, machine learning, and manual consideration to discover a new polymer for organic photovoltaics (adapted with permission from ref. 165 Copyright 2018 ACS publications, J. Phys. Chem. Lett.). | ||
Sun et al.166 used the DMSP scheme, already discussed in the previous section, to expand the chemical space of the various nitride systems since it can be used for various applications such as solid-state lighting, ammonia-synthesis catalysts, superconductors, superinsulators, electrodes and so on. In spite of the aforementioned high potential, the nitride systems (<400) are relatively under-explored in the ICSD compared to the ternary metal oxides (>4000) suggesting that it is important to find new (meta-) stable metal nitrides. Because the DMSP scheme can easily be applied to predict the crystal structure from the given composition of the target metal nitride system, they first constructed a map of the metal nitrides after doing DFT calculations to identify the stability of the predicted materials as shown in Fig. 12. One interesting point is that although there are many previously known metal nitrides in the nitride map, there are still plausible new ternary metal nitrides indicating that machine learning can be effectively used to discover a large materials space compared to the conventional combinatorial explorations.
![]() | ||
| Fig. 12 Map of the constructed metal nitrides using the DMSP scheme and DFT calculations to identify stability (adopted with permission from the corresponding author of ref. 166). | ||
The other interesting point is that from the theoretical predictions the authors experimentally identified 7 new phases of Zn- and Mg-based ternary metal nitrides (Zn–Mo–N, Zn–W–N, Zn–Sb–N, Mg–Ti–N, Mg–Zr–N, Mg–Hf–N, and Mg–Nb–N) of which the latter new materials can be classified into the two unique crystal structures (the wurtzite and rocksalt structure; see Fig. 13). Although there is still a need for human intuition in experimental synthesis from the newly discovered materials, one can reduce unnecessary trial and error for exploring un-plausible chemical space by utilizing machine learning models.
![]() | ||
| Fig. 13 (a) 7 new phases of the ternary metal nitrides with the corresponding space group and formation energies, (b) detailed structures for the newly discovered nitrides, (c) synchrotron measured XRD patterns of new Zn- and Mg-based ternary nitrides and (d) discovery histogram for new ternary nitride spaces, based on entries as cataloged in the ICSD (adopted with permission from the ref. 166). | ||
Reed and co-workers have leveraged machine-learning with experiments to predict Li ion conductivity in order to discover a solid electrolyte for Li-ion batteries.167 Here, 12
831 Li-containing crystal structures from the Materials Project were extracted, and performance-related properties such as electronic conductivity and electrochemical stability were computed using DFT and theory (see Fig. 14). Another critical performance measure is the Li-ion conductivity, but DFT is difficult to use to compute this metric as it is a larger scale phenomenon. However, ionic conductivity for 40 crystal structures was available; thus the authors have implemented logistic regression to classify high and low conductivity via 20 features extracted from the crystal's elemental and structural properties. The screening narrowed the search space down to 21 structures, the performances of which were confirmed by experiments. This work highlights the difficulty of theoretical approaches to simulate larger scale phenomena as well as highlighting the importance of integrating experiments to screen materials.
![]() | ||
| Fig. 14 Flowchart of the discovery of a new Li solid electrolyte by integrating DFT, machine learning, and experiments. Machine learning is used to predict Li ion conductivity which is difficult to compute using DFT due to its multi-scale nature (adopted with permission from ref. 167 Copyright 2017 Royal Society of Chemistry, Energy Environ. Sci.). | ||
Machine learning potentials (MLPs) have demonstrated their potential to couple the DFT time–length scale to a larger scale. In particular, much attention has been devoted to understanding the surface and nanocluster dynamics in the interest of catalysis. Notably, MLP has been effectively applied to the multi-scaling phenomenon of the nanocluster structure change under reaction conditions, demonstrating its ability to reveal new catalytic phenomena. Furthermore, MLP shows promise for identifying active sites of an alloy by learning the binding energy activity descriptor.
Discovering new stable crystals is critical to expanding our knowledge of viable materials. In this respect, several studies focused on predicting crystal structures given the materials composition constraint. In addition, a couple of machine learning augmented DFT based materials discovery methods are introduced and suggested as a standard strategy for discovering materials. Another approach has involved stability screening within a defined chemical space. Many of these introduced approaches have successfully identified previously unknown materials, signifying that the community has a good idea in leveraging machine learning to discover new materials.
Several possible strategies can be suggested. For example, a number of studies leverage the experimental experts to measure the synthesizability. The blue OLED discovery study developed a web interface for experimentalists to vote on the synthesizability of molecules screened using a machine.161 Similarly, manual screening of the synthetic aspect is considered for the discovery of polymers for organic photovoltaics.165 Another popular strategy is to avoid hypothetical materials entirely by defining the screening scope as the experimentally known materials. The discovery of white LED materials introduced above is an example.162 This approach has been one of the most successful strategies for screening studies not involving machine learning. In addition, theoretical screening criteria are often limited to properties that are easily computable due to practical consideration (tractable time–length scale), instead of properties that are more directly relevant to experiments, and for these cases surrogate models are helpful to predict experimentally determined properties. All this shows that close collaboration between the computational and experimental investigators is key. Also, it would be helpful if the theoretical and experimental researchers closely communicate coherently at the beginning of collaboration to improve the success rate of machine prediction followed by experimental validation, instead of performing separate roles of “design” and “validation” by theory and experiments, respectively.
Although, here, we have mainly focused on the application of machine learning in terms of computational prediction of novel functional materials mainly using computational data, utilizing actual experimental data to predict materials properties of unknown compounds or even suggest new materials can be highly impactful. The critical aspect here is to collect a large set of data that have been obtained consistently using the same experimental setup under controlled conditions. Most of the existing experimental data in the literature are sparse and inhomogeneous for use in machine learning. For this reason, the number of quality experimental machine learning studies is limited. Recently, Gregoire and co-workers have demonstrated the potential of high-throughput consistent experiments where 178
994 data samples are used to map the visual image of samples and their adsorption spectra.171 Furthermore, compositions and Raman signal data of 1379 BiVO4 alloys have been correlated to their photoelectrochemical activity.172 These promising results demonstrate that the consistent experiments enable end-to-end data science for materials science.
Augmenting machine learning with robotics, or the so-called self-driving laboratory, has been emerging as a significant new direction.4,12,124 Developing the self-driving laboratory requires non-human-interrupted closed-loop flow work, where a machine learning model designs the experiments, followed by using robotics to perform the experiments and characterize the sample. Then, the new knowledge is learned by machine learning which can design the next experiment to repeat the cycle. Maruyama and co-workers are the pioneers in this regard via the Autonomous Research System (ARES) where the carbon nanotube growth rate is learned by machine learning model to grow the carbon nanotube at target rates, showing its potential,13 but such an effort is still in its infancy.173,174 The self-driving lab enables robust end-to-end materials search and is expected to revolutionize materials discovery in the future, which can be adopted in industry.
| This journal is © The Royal Society of Chemistry 2019 |