Joy
Datta
a,
Nikhil
Koratkar
b and
Dibakar
Datta
*a
aDepartment of Mechanical and Industrial Engineering, New Jersey Institute of Technology (NJIT), Newark, NJ 07052, USA. E-mail: dibakar.datta@njit.edu; Tel: +1 973 596 3647
bDepartment of Mechanical, Aerospace, and Nuclear Engineering, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
First published on 25th March 2024
Lithium–ion batteries (LIBs) are ubiquitous in everyday applications. However, lithium (Li) is a limited resource on the planet and, therefore, not sustainable. As an alternative to lithium, earth-abundant and cheaper multivalent metals such as aluminum (Al) and calcium (Ca) have been actively researched in battery systems. However, finding suitable intercalation hosts for multivalent-ion batteries is urgently needed. Open-tunneled oxides represent a specific category of microparticles distinguished by the presence of integrated one-dimensional channels or nanopores. This work focuses on two promising open-tunnel oxides: niobium tungsten oxide (NTO) and molybdenum vanadium oxide (MoVO). The MoVO structure can accommodate a larger number of multivalent ions than NTO due to its larger surface area and different shapes. Specifically, the MoVO structure can adsorb Ca, Li, and Al ions with adsorption potentials ranging from around 4 to 5 eV. However, the adsorption potential for hexagonal channels of Al ions drops to 1.73 eV due to the limited channel area. The NTO structure exhibits an insertion/adsorption potential of 4.4 eV, 3.4 eV, and 0.9 eV for one Li, Ca, and Al, respectively. Generally, Ca ions are more readily adsorbed than Al ions in both MoVO and NTO structures. Bader charge analysis and charge density plots reveal the role of charge transfer and ion size in the insertion of multivalent ions such as Ca and Al into MoVO and NTO systems. Exploring open-tunnel oxide materials for battery applications is hindered by vast compositional possibilities. The execution of experimental trials and quantum-based simulations is not viable for addressing the challenge of locating a specific item within a large and complex set of possibilities. Therefore, it is imperative to conduct structural stability testing to identify viable combinations with sufficient pore topologies. Data mining and machine learning techniques are employed to discover innovative transition metal oxide materials. This study compares two machine learning algorithms, one utilizing descriptors and the other employing graphs to predict the synthesizability of new materials inside a laboratory setting. The outcomes of this study offer valuable insights into the exploration of alternative naturally occurring multiscale particles that exhibit promising potential for the utilization of multivalent ions in battery-related contexts.
The selection of the active material's particle size used in the anode and cathode is crucial in affecting battery performance.10,11 Traditionally, microparticles have been used in the industry due to their higher volumetric energy density, high mass loading, better scalability, and lower cost.12 Nevertheless, microparticles encounter mechanics, thermodynamics, and kinetics challenges, all of which nanoparticles can solve.13–15 Nanoparticles exhibit enhanced cycle stability over microparticles. Their smaller size allows for more uniform intercalation than microparticles, improving fracture toughness and fatigue life for electrode materials.16–20 Notably, nanoparticles also exhibit superior fast-charging capabilities. Their small size drastically reduces diffusion length, enabling faster charging and discharging ability.21,22 Furthermore, nanoparticles' limited compositional range makes them less conducive to the co-existence of distinct phases within nanostructures. Consequently, phase transitions occur more swiftly in nanostructures, releasing the excess free energy generated from lattice mismatch and high surface area.23–25
Nanostructures, however, come with their distinct set of drawbacks. For instance, the large surface area of nanostructures leads to extensive electrolyte decomposition. Consequently, the solid electrolyte interphase (SEI) layer forms early in the battery cycle, resulting in low first-cycle coulombic efficiency.26–30 Additionally, achieving an industrial standard (20–30 mg cm−2) high mass loading (20–30 mg cm−2) proves problematic with nanostructures.31,32 Nanostructure-based batteries also suffer from low volumetric capacity, significantly limiting their utility in stationary and grid storage applications.19,33,34 Moreover, the synthesis cost of nanostructures is higher, and the manufacturing process generates substantial chemical waste.35–38
Multiscale particles (MP) embody nanoscale attributes within microscale particles, effectively combining characteristics of both micro and nanostructures,13 potentially yielding the best of both worlds. These particles can be synthesized as engineered multiscale particles (E-MP) or multiscale particles with natural nano-porosity (N-MP). However, manufacturing cost and scalability pose significant challenges for E-MPs.3,32 On the other hand, micro-particles based on open-tunnel oxides, such as niobium tungsten oxide (NTO) and molybdenum vanadium oxide (MoVO), embody naturally formed nanoscale channels, endowing them with exceptionally fast ion diffusion abilities.39,40 N-MP family members display unique characteristics, exemplified by niobium tungsten oxide structures like Nb12WO33, Nb16W5O55, and Nb18W16O93.41,42 For example, Nb12WO33 is a wadsley-roth type crystallographic shear structure composed of (3 × 4) size of MO6 (M = Nb, W) octahedral block.42 These octahedral blocks share edges around the corner, forming open channel-like structures, as shown in Fig. 1a. Kocer et al.41 investigated density functional theory (DFT) to identify Li insertion sites and scrutinized lattice contraction in the Nb12WO33 structure during high Li concentration, enhancing cycle stability. MoVO structures (Fig. 1b–d) can have different polymorphs – orthorhombic (MoV2O8), trigonal (MoV3O6), and tetragonal (MoVO5). Furthermore, these structures feature additional tunnels like hexagonal, heptagonal, pentagonal, and rectangular varieties.39
A thorough analysis of diverse materials is required to improve our understanding of open-tunnel oxide materials. The goal of this research is to uncover previously unknown chemicals that have the potential to surpass current ones. Pursuing an optimal solution through experimental trial and error is improbable, given the extensive range of possible material compositions and stoichiometries. This difficulty is compounded by complex and not fully understood reaction mechanisms.43–45 Hence, selecting the most appropriate candidate from the myriad options becomes a challenge akin to finding a ‘needle in a haystack’.9 Given the limited availability of known open-tunnel oxides with suitable pore sizes for accommodating multiply charged ions, a comprehensive exploration of new compounds becomes imperative.
Transition metal oxides (TMOs), especially at the nanoscale, possess distinctive structural characteristics essential for a wide range of applications.46 Nanotechnology enables precise control of TMO nanostructures, resulting in customized sizes and shapes with excellent surface area-to-volume ratios.47 Utilizing machine learning models trained on transition metal oxide (TMO) characteristics can aid in investigating various open tunnel oxides found in nature, drawing from knowledge and design concepts obtained from TMO nanostructures. Tunnel-structured transition metal oxides (TMOs) constitute a crucial subgroup of TMOs with open channels enabling fast ion transport, including open tunnel oxides.48 Thus, a thorough understanding, and utilization of TMO-based oxide material training data provides critical insights for uncovering and investigating novel open tunnel oxide materials. This undertaking may encompass the study of binary, ternary, quaternary, quinary, and senary-based families of transition metal oxides (TMOs), a task laden with challenges due to the vast array of potential structures stemming from diverse combinations of elements and stoichiometries.49 The number of attainable materials reaches millions by varying the elements and stoichiometric ratios, rendering experimental exploration infeasible. Before selecting stable configurations with acceptable pore architectures for intercalating multiply charged ions, it is critical to analyze the structural stability of potential TMOs to assure their synthesizability.49
To forecast the structural stability of TMOs, a DFT-based energy hull diagram can be constructed by evaluating the formation energy across all conceivable stoichiometric ratios.9 However, the computationally intensive nature of this procedure is exacerbated by the intricate crystal structures and substantial unit cells of TMOs. Furthermore, addressing the “Needle in a Haystack” conundrum using DFT poses challenges due to the multitude of potential multivalent ion insertion sites and the potential for cation disorder.50–52 Molecular modeling employing the MD method presents a possible approach for exploring TMO crystal structures. Nonetheless, this approach hinges on the availability of a suitable interatomic potential, a requirement frequently unmet for oxide materials, constraining the MD method's capacity to explore all feasible combinations of TMO crystal structures. Furthermore, the reliability of the interatomic potential rests on data derived from DFT, which is not without limitations.53
In recent years, deep learning algorithms such as graph neural networks (GNNs) have surfaced as a cost-effective avenue for predicting formation energy and band gap while establishing correlations between structure and properties.54–57 Compared to traditional methods, machine learning (ML) boasts two distinct advantages: compactness and the ability to predict the properties of numerous compounds within seconds without requiring substantial processing power. ML relies exclusively on information garnered from DFT and experimental techniques. Several databases, such as the materials project database (MPD),58 open quantum materials database (OQMD),59 automatic flow for materials discovery (AFLOW),60 and inorganic crystal structure database (ICSD),61 provide the necessary crystal structure data for training and testing, as well as the corresponding property data that is utilized for the application of ML models.
Descriptor-based and GNN-based approaches have demonstrated strong predictive capabilities for material property forecasts.56,62,63 A “descriptor” comprises a numerical vector, employed as input for ML models in materials science, and encompasses diverse choices like coulomb matrix,64 sine coulomb matrix,65 atom-centered symmetry functions,66 smooth overlap of atomic orbitals,67 and orbital field matrix.68 In contrast, GNNs represent a category of deep learning techniques tailored to operate on graph structures. The GNNs are extensively utilized in graph analysis because of their exceptional performance and interpretability. Multiple GNNs such as materials graph network (MEGNet),56 atomistic line graph neural network (ALIGNN),69 continuous graph neural networks (CGNN),70 global attention based graph convolutional neural networks (GATGNN),55 have exhibited strong prediction capabilities about material structures.
In contrast to the research mentioned above conducted with lithium (Li), there currently needs to be more available literature about the incorporation of multivalent ions, such as aluminum (Al) and calcium (Ca), into NTO or MoVO systems. In addition to that, we are extending our study from DFT-based investigations into a ML-focused approach, explicitly aiming to predict the formation energy of naturally occurring TMO. Our whole contribution to this work is organized in two interconnected sections –
1. We investigated the insertion of Ca, Al, and Li into trigonal structures (MoV3O6) that feature a variety of hexagonal, heptagonal, and triangular channels, thereby assessing their adsorption potential. Expanding our analysis, we also explored NTO (Nb12WO33) systems, enabling a comparison of the adsorption capabilities between NTO and MoVO structures. Moreover, we examined the influence of multivalent ion size and charge density on insertion potentials.
2. The preceding section utilized DFT to investigate certain TMOs recognized as stable and experimentally synthesizable. However, the potential number of stable TMOs is infinite, rendering it impossible to conduct DFT calculations for all possible combinations. In this section, we adopt a machine learning approach to identify potential stable TMOs by predicting their formation energy. To accomplish this, we used the MPD to compile an inorganic crystal structure dataset for TMOs. Using this dataset, we trained CNNs and GNNs to predict these materials' formation energy. The test dataset compared both models' performance. Once trained, these ML models allow experimentalists to estimate the synthesizability of unknown crystal structures. Furthermore, akin to the previous section, the predicted stable TMOs can be considered for further exploration via DFT studies in future investigations.
The adsorption potential (V) is calculated as:
(1) |
ΔG = ΔEf + PΔVf − TΔSf | (2) |
ΔEf = ΔEXnG − (nEx + Eg) | (3) |
Fig. 2 Data distribution of TMO based oxide material: (a) number of data present concerning the number of TMOs material compositions, (b) overview of binary TMOs, (c) overview of ternary-based TMOs, (d) training and testing data distribution for the ML model. In (b) and (c), the green color represents the considered material, and red color depicts the ruled-out material from our list. Areas without color in (c) represent the third element in ternary-based TMOs. (b) Shows the binary-based TMOs, where all transitional metals should be present, except for oxygen (O). (c) Represents ternary-based TMOs. In (c), multicomponent materials on TMO consist of elements from any element group except group 1 alkali metals, aligning with our focus on multivalent ions. For example, quaternary-based TMOs follow the format WsArBmOn. Here, W represents any metal from the periodic table, excluding monovalent metals. A represents any metal from the periodic table. B represents the transition metal. The s, r, m, and n, denote the stoichiometry ratio.9 In our ML model, we allocated 80% of our randomly selected data to the training dataset, while the remaining 20% constitutes the testing dataset. |
The dataset was retrieved from MP database utilizing the application programming interface (API) with the Matminer77 and python materials genomics (pymatgen)78 libraries in Python. Each data entry is associated with an ICSD ID, ensuring that these crystal structures are also present in the MP and ICSD databases. It is worth noting that the MP database includes a subset of naturally synthesizable ICSD materials.78,79
In this work, we have selected the magpie descriptor and X-ray diffraction pattern for our CNN input data. The descriptor simplifies feature extraction for supplementary applications by transforming these properties into a one-dimensional vector representation. The robust magpie descriptor technique82,83 allows for the calculation of various material properties, including physical, chemical, electrical, ionic, and fundamental characteristics.
The descriptor leverages a statistical analysis of the material's electrical structure, elemental composition, stoichiometric and elemental properties. Also, as part of its electronic structure properties, the descriptor contains information about electronic configuration, such as the typical proportion of electrons from each element's s, p, d, and f valences. The magpie descriptor has widespread applicability across numerous material science fields and can considerably improve the effectiveness and efficiency of materials research. The magpie features on pymatgen78 library python with 132 dimensions represent a crystal structure's material properties.
Another descriptor is X-ray diffraction pattern, which plays a pivotal role in the experimental technique for determining the crystal structure of synthesized materials. It provides valuable insights into the atomic arrangement throughout the lattice. In the present dataset, we readily access XRD data from the MP Database58 and incorporate it as one of the structure-based descriptors in this study. Previous work by Aguiar et al.84 demonstrated using XRD descriptors for classifying the space group of crystal structures.
To incorporate XRD descriptors into our ML model for this study, we utilize the featurization tools provided by the Matminer library.77 The XRD featurizer transforms atomic positions into a one-dimensional (1D) array, effectively capturing essential crystal structure and phase information. The entire CNN architecture is depicted in Fig. 3. We applied ReLU activation function after the convolutional layer and dropout after average pooling operation and dense operation to avoid overfitting.
To select the optimal hyperparameters, we employed Bayesian optimization using the keras tuner library available in keras.85–87 Bayesian optimization has demonstrated remarkable effectiveness in determining the optimal hyperparameters for predicting material properties.88,89 Our technique involved dividing our datapoints into several subsets for the purposes of training and validation. More precisely, we employed an 80–20 division, allocating 80% of the data for training the model and reserving 20% of this training data for validation purposes during the hyperparameter tuning stage. The utilization of this validation set was crucial in fine-tuning the model's hyperparameters using Bayesian optimization. The best hyperparameters were determined by evaluating the root mean square error (RMSE) on the validation dataset.
The MEGNet framework has been highly accurate in predicting molecular and crystalline properties using GNNs that can learn from the underlying structure of materials, including atoms, bonds, and crystal lattices. One of the main advantages of using GNNs is their ability to capture complex relationships among various atoms and their spatial arrangements.
ALIGNN derives the atomistic line graph from the atomistic graph to incorporate angular information. Every node in the line graph corresponds to an edge in the original graph, representing interatomic bonds. The initial edge characteristics in the line graph are obtained through a radial basis function (RBF) expansion of the bond angle cosines. ALIGNN collects more details of atomic structure by incorporating angle information, improving its capacity to predict material properties.
The primary goal of ALIGNN is to develop a model that captures the complex interactions between atoms and bonds in crystal structures. By leveraging edge-gated graph convolution and incorporating angular information through the atomistic line graph, ALIGNN improves the accuracy of material property predictions. It provides a comprehensive representation of the atomic system, facilitating efficient information propagation between the atomistic bond graph and its line graph. This results in the effective capture of crucial features that influence material properties, particularly in crystals.69
Given the limited space within the triangular tunnel, it can only accommodate a single Ca ion. To facilitate a clearer evaluation of the difference in adsorption potential, we inserted 2 Ca ions into both the heptagonal and hexagonal tunnels. For a single Ca ion, the heptagonal and hexagonal tunnels exhibited nearly equivalent adsorption favorability of 4 eV (Fig. S1b, c and g, ESI†). Subsequently, we explored the insertion of 2 Ca ions into the heptagonal and hexagonal tunnels. In this case, the adsorption potential for the heptagonal and hexagonal channels is 3.06 eV and 2.07 eV, respectively. Consequently, the heptagonal channel emerged as the more favorable choice for Ca insertion compared to the other channels.
Given that Li has smaller charge density (Table 1) and size compared to Ca, we conducted a similar study using Li to investigate the influence of charge density and particle size. In this investigation, we explored the insertion of Li into both heptagonal and hexagonal channels, allowing Li to reposition itself to achieve the most optimized structure.
Ion name | Heptagonal 1 ion | Hexagonal 1 ion | Trigonal 1 ion | Heptagonal 2 ion | Hexagonal 2 ion | |
---|---|---|---|---|---|---|
Charge transfer (e per atom) | Ca | 1.08 | 1.57 | 1.42 | 1.22 | 1.25 |
Li | 0.55 | 0.69 | 0.63 | 0.38 | 0.46 | |
Al | 1.28 | 1.04 | 1.65 | 1.06 | 1.26 |
For the insertion of a single Li atom, the highest adsorption potential was observed within the heptagonal tunnel (4.73 eV, Fig. S2a and f, ESI†). Conversely, the lowest adsorption potential was found in the triangular channel (3.53 eV, Fig. S2c and f, ESI†). Fig. S2f (ESI†) illustrates that even after introducing 2 Li atoms into both the heptagonal and hexagonal channels, the adsorption potential remains relatively high. The adsorption potential for the heptagonal and hexagonal channels is 4.22 eV and 3.46 eV, respectively. Consequently, the atomic size plays a significant role in enhancing the adsorption potential for 2 Li atoms (Fig. S2f, ESI†).
Al was introduced using the same intercalation process to assess the channels’ comparative capabilities. However, when considering triangular channels (Fig. S3c, ESI†), Al could not be successfully inserted due to an unfavorable adsorption potential of −1.95 eV (Fig. S3f, ESI†). The highest adsorption favorability for a single Al atom is observed within the heptagonal channel (4.39 eV, Fig. S3f, ESI†). In contrast, the adsorption potential within the hexagonal tunnel is less than 50% compared to the heptagonal channel (Fig. S3f, ESI†). A larger channel size favors Al to achieve this high adsorption potential for the heptagonal channel (Fig. S3f, ESI†), which has 1.5 times more area in comparison with hexagonal channel. In the case of hexagonal channels, the ability to accommodate 2 Al atoms proved challenging because of low adsorption favorability (0.66 eV, Fig. S3f, ESI†).
Fig. 4 presents a performance comparison of the adsorption capabilities of Ca, Li, and Al. Notably, Li exhibits the highest adsorption ability across various channel shapes. Ca demonstrates a nearly equivalent intercalation ability to Li, despite having a larger particle size compared to both Li and Al. However, as the channel dimensions decrease from heptagonal to triangular, Al exhibits a significantly poorer adsorption potential compared to Li and Ca (Fig. 4(d)). This discrepancy can be primarily attributed to Al's notably high charge density (Table 1).
Another multiscale particle with naturally occurring nanochannels is the NTO-based structure, with one of its family members being Nb12WO33.41,42 Similar to MoVO structures, we computed the adsorption potential for the insertion of Li, Ca, and Al insertion into NTO using DFT. In Nb12WO33, there is a single rectangular-shaped channel available for insertion. We inserted Li, Ca, and Al into those channels at various concentrations (Fig. 5a–f). For the insertion of 1 and 2 Li atoms, the adsorption potential (Fig. 5g) is 4.4 eV and 3.44 eV, respectively. In contrast, 2 Ca and 2 Al atoms insertion into the channel resulted in the adsorption potential of −0.82 eV and −0.26 eV, respectively (Fig. 5g). However, insertion favorability of 2 Ca and 2 Al is positive for the MoVO structure because heptagonal and hexagonal channels have areas of 41.67 Å2 and 27.19 Å2, respectively. However, for NTO, the area is only 15.36 Å2. These results indicate that channel area and charge density of cation (Table 2) play an important factor for adsorption favorability. Our results indicate that inserting multivalent ions into the MoVO structure is more advantageous as compared to NTO.
Ion name | 1 Ion | 2 Ions | |
---|---|---|---|
Charge transfer (e per atom) | Ca | 1.36 | 1.13 |
Li | 0.22 | 0.26 | |
Al | 1.53 | 3.01 |
To gain a comprehensive understanding of the charge distribution within the MoVO and NTO structures, we conducted Bader charge analysis and computed the charge density difference (Δρ).92
Δρ = ρA+B − (ρA + ρB) | (4) |
ρ A indicates the charge density of the MoVO/NTO crystal structures, while ρB represents the density of the cation. The Bader charge analysis has determined the number of charge transfers between cations and MoVO structures. Table 1 displays the results, showing that when 1 Al ion is inserted into a triangular channel, there is a charge transfer of 1.65 e per atom, which is higher than other cations. The charge density plot in Fig. 6 illustrates the relationship between charge accumulation and depletion. In Fig. 6m, Al ions exhibit the weakest interactions with all cations at these positions. Consequently, the negative adsorption potential observed in Fig. S3 and S4 (ESI†) can be attributed to the lowest ionization of Al ion and MoVO structure at the triangular channel.
When contemplating the transfer of two ions, the charge transfer exhibits distinct characteristics owing to dipole–dipole interactions. These interactions can disturb the electronic structure of the atoms engaged in the exchange, causing modifications in electron density distribution. The redistribution of charges significantly impacts the overall transfer of charge, as alterations in electron density directly influence the kinetics of charge transfer. Consequently, the charge transfer efficiency is lower for transferring two ions in the MoVO structure compared to sharing a single ion.
Charge transfer for NTO structures is highest for 1 Al ion (Table 2). Consequently, the adsorption potential is lowest for a single ion insertion (Fig. 5g). Concerning two ions insertion, Ca exhibits the highest charge transfer relative to other cations (Table 2). Introducing two ions can elicit cooperative phenomena and facilitate a more extensive redistribution of charges inside the NTO structure. This can lead to more robust interactions with the host material and higher charge transfer compared to a scenario involving only a single ion. Consequently, the insertion of two Ca ions has the lowest potential for adsorption. In a broader context, our charge investigations reveal a negative correlation between the potential for adsorption and the extent of charge transfer within the NTO structure. An increase in charge transfer is inversely proportional to the degree of adsorption (Fig. 7) in the NTO structure.
We evaluated the validation dataset results for all two input data types to choose the best hyperparameters. In our research, we employ the commonly used RMSE metric for assessing prediction accuracy in predictive modeling. RMSE provides a numerical representation of the typical “distance” between model-predicted values and actual values, with lower RMSE values indicating a closer match to the data.
The RMSE values for the 2026 test dataset for 1-dimensional magpie descriptors and 1-dimensional XRD pattern are 0.25 eV per atom (see Fig. 8b) and 0.56 eV per atom (see Fig. 8a), respectively. Fig. 8 displays the trendline between the actual and predicted values for the two descriptors. The x-axis in this scattering plot represents the actual formation energies, while the y-axis represents the predicted formation energies. The scatter plot in Fig. 8 demonstrates the low variability or dispersion of predicted development forces compared to the calculated values. The dark data points in the graph represent the TMO materials with actual formation energies falling within the silicon range of −4.5 to 1.5 eV. Consequently, based on the 1-dimensional CNN model using these descriptors, magpie descriptors prove to be the more effective input data featurization approach for predicting the TMO-based formation energies (see Fig. 8).
Fig. 8 Visualization of CNN-based model predictions, showcasing the relationship between actual and predicted values using two distinct featurization methods: (a) XRD patterns, and (b) magpie descriptors. As evident from Fig. 8(b), the magpie descriptors resulted in predictions that exhibited sparser and less pronounced deviations from the actual values, in contrast to the other featurization method. |
In our pursuit of more accurate material property predictions, we explored the use of crystal-based graph conversion as our input data. For this purpose, we leveraged the MEGNet model, equipped with carefully selected parameters: a batch size of 32, a learning rate of 0.00075, and an L2 regularization coefficient of 0.001. We conducted training for 300 epochs, keeping these parameters56 consistent. Initially, we employed a cutoff radius of 5 Å to capture crystal structure interactions. However, recognizing the potential significance of long-range interactions, we expanded the cutoff radius to 6 Å to investigate their effects.
Interestingly, this modification did not yield a significant improvement in the model's performance. The system's predicted properties may be primarily influenced by short-range interactions, with long-range interactions playing a negligible role in this context. Furthermore, we employed an elemental embedding of 16 and selected three blocks to optimize the model's generalization capabilities. Although we experienced with different block numbers, ranging from 3 to 5, there is no substantial improvement in performance.
Fig. 9 illustrates the MEGNet model's performance, using the same test dataset as the descriptor data, achieving an RMSE value of 0.13 eV per atom. This represents a significant improvement compared to the Magpie descriptors, where the RMSE value was 0.25 eV per atom. The performance of the MEGNeT model is visualized in Fig. 9(a).
As we considered the number of graph convolutional layer, it's important to note that an excessive increase in layers can lead to over-smoothing issues. DeeperGATGNN applies DGN and skip connections for training the data. Our implementation utilized a learning rate of 0.005 and batch size of 100. The model architecture consists of multiple layers, including a single pre-fully connected layer, several graph convolution layers, and no post fully connected layer. The model employs a “global_add_pool” pooling mechanism for effective global information aggregation and follows an “early” pooling order. The DeeperGATGNN model employs a graph representation with a maximum atomic interaction radius of 12 Å and a maximum of 12 neighboring atoms per atom, effectively capturing local atomic connectivity.
All training data underwent 300 epochs of training. To determine the optimal number of GNN layers, we studied with 10, 15 and 20 layers. Other parameters adhered to the default settings of DeeperGATGNN.90 Surprisingly, the best test RMSE value of 0.11 eV per atom was obtained with 10 GNN layers, beyond which the performance began to deteriorate, resulting in an RMSE value of 0.13 eV per atom for 15 and 20 GNN layers. This decline in performance can be attributed to the excessive increase in learning parameters when utilizing a larger number of GNN layers, which can hinder model performance, particularly with a limited training dataset. Fig. 9b presents the performance of the DeeperGATGNN model.
In this study, we introduced the ALIGNN model, which leverages cutting-edge technology and complex algorithms to extract significant characteristics and properties of individual atoms within the molecular structure. This model comprises four alignment layers and four graph convolutional layers, with input features spanning 92-dimensional atomic attributes, 80 interatomic bond features, and 40 triplet input features.69 As a result of the inconclusive outcomes observed when experimenting with different cutoff radius values, we decided to utilize the default value of 8 Å in the ALIGNN model, which permits the inclusion of up to 12 adjacent atoms per atom.69
By encoding the molecular structure into a lower-dimensional space using an embedding feature size of 64, the ALIGNN model efficiently extracts essential structural information and aligns molecular graphs. The inclusion of hidden features, set at 256, empowers the model to learn intricate patterns and representations effectively. For the ALIGNN model, the test RMSE is 0.11 eV per atom, underlining its exceptional predictive capabilities. Fig. 9c represents the exemplary performance of the ALIGNN model.
Fig. 10a compares RMSE values between human-crafted descriptors and the GNN model. Notably, human-crafted descriptors appear to encompass critical structure–property relationships. Magpie descriptors stand out as the best performing, with an RMSE value of 0.25 eV per atom. However, the best RMSE value is obtained from ALIGNN and DeeperGATGNN models, both exhibiting an RMSE value of 0.11 eV per atom. This represents a remarkable 56% improvement in human-crafted descriptors. ALIGNN, in particular, emerges as the top-performing model, achieving this exceptional RMSE value with fewer convolutional layers.
In contrast, the DeeperGATGNN model necessitates a more intricate hyperparameter optimization process, consuming computational resources, and time. ALIGNN distinguishes itself as the sole three-GNN model investigated in this study, preserving atom representations (one-body), bond representations (two-body), and bond angle representations (three-body). This distinguishing feature holds immense promise for capturing and portraying the periodic correlations inherent in crystal formations, thereby making a significant contribution to the field.
Fig. 10b illustrates the distribution of mean absolute error (MAE) using ALIGNN. The analysis reveals that nearly 95% of the test samples (1933 instances) exhibit an error below 0.2 eV per atom, and half of the data showcases an error below this threshold. The overall average error, represented as the MAE, stands at approximately 0.05 eV per atom. Fig. 10b provides a quantitative assessment of our model's performance, indicating that in 98.52% of cases, the model accurately predicts the formation energy/atom with an error below 0.40 eV per atom.
However, the ALIGNN model exhibits limitations in predicting the formation energy per atom for three specific structures (mp-690844, mp-1191949, mp-27873), as indicated by their higher MAE values exceeding 1. Upon detailed analysis, it becomes apparent that the graph representation employed by the ALIGNN model may not fully encapsulate the intricacies of interactions within these structures, leading to more pronounced prediction errors. One noteworthy observation from these cases is that ALIGNN primarily focuses on explicitly encoding three-body interactions. However, including higher-order n-body interactions (where n is greater than or equal to 4) could enhance the predictive capability of the ALIGNN model. Various studies in GNN have explored the incorporation of n-body interactions for diverse applications.93–96
The results from Bader charge analysis and charge density plots have unveiled that Al ions exhibit the lowest insertion favorability for both MoVO and NTO structures, primarily due to Al's substantial charge transfer to these materials. Meanwhile, Ca ions demonstrate adsorption behavior akin to monovalent Li ions but encounter challenges when inserted into the smaller channels of NTO structures, particularly when accommodating 2 Ca ions. Hence, it becomes evident that channel dimension, ion size, and ion charge density collectively play pivotal roles in the insertion of ions into MoVO and NTO structures. This deep understanding of ion insertion processes is critical for exploring materials with improved ion storage.
To lay the groundwork for future material screening methodologies, we gathered an extensive dataset of TMOs from the ICSD. The dataset has diverse structural compositions with empty spaces and nanochannels resembling those seen in NTO and MoVO structures, such open-tunnel oxides. In our research employing GNNs, we explored various methods for representing graphs. The ALIGNN and DeeperGATGNN models outperformed conventional descriptors, highlighting the effectiveness of graph-based representations in predicting formation energy. Based on these results, we aim to employ ML algorithms to forecast the stability of additional promising open-tunneled oxide materials for more efficient multivalent-ion batteries.
In the future, our focus will be on inverse machine learning (IML), a method that enables the creation of new crystal structures from TMOs.9,97 IML combined with our ML model holds great promise for predicting materials with specific characteristics including porosity, tunnel structure, formation energy, and conductivity (Fig. S3, ESI†).9 The generative adversarial network (GAN) represents a potential approach IML, having successfully generated novel crystal structures with specific properties, expanding the scope of materials that can be developed beyond current databases.97
Structures generated by IML can undergo further screening using our ML models to forecast stability, electronic and ionic conductivity, and pore sizes. By integrating ML with IML, we can effectively explore a wide variety of material parameters, leading to the identification of robust, conductive, and structurally strong intercalation hosts for multivalent-ion batteries. This approach simplifies the practical process of material screening, thereby advancing state-of-the-art materials for energy storage solutions.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4ya00014e |
This journal is © The Royal Society of Chemistry 2024 |