Open Access Article
Kento Murakamia,
Yudai Yamaguchia,
Yo Katoa,
Kazuki Ishikawab,
Naoto Tanibata
a,
Hayami Takeda
*a,
Masanobu Nakayama
*a and
Masayuki Karasuyama
b
aDepartment of Advanced Ceramics, Nagoya Institute of Technology, Gokiso, Showa, Nagoya, Aichi 466-8555, Japan. E-mail: takeda.hayami@nitech.ac.jp; masanobu@nitech.ac.jp
bDepartment of Computer Science, Nagoya Institute of Technology, Gokiso, Showa, Nagoya, Aichi 466-8555, Japan
First published on 15th December 2025
Lithium-ion-conductive oxide materials have attracted considerable attention as solid electrolytes for all-solid-state batteries. In particular, LiZr2(PO4)3-related compounds are promising for high-energy-density devices using metallic lithium anodes, but further enhancement of their ionic conductivity is requested. In general, Li-ion conductivity is influenced by mechanisms operating on two distinct length scales. At the atomic scale, point defects and the associated migration barriers within the crystal lattice are critical, whereas at the micrometre scale, porosity and grain-boundary characteristics that develop during sintering become the dominant factors. These coupled effects make systematic optimization of conductivity difficult. In paticular, microstructural analysis has often relied on researchers' intuitive interpretation of scanning electron microscopy (SEM) images. Here, we apply a convolutional neural network (CNN), a deep-learning approach that has seen rapid advances in image analysis, to SEM images of LiZr2(PO4)3-based electrolytes. By combining image-derived features with conventional vector descriptors (composition, sintering parameters, etc.), our regression model achieved an R2 of 0.871. Furthermore, visual-interpretability analysis of the trained CNN revealed that grain-boundary regions were highlighted as low-conductivity areas. These findings demonstrate that deep-learning-based SEM analysis enables automated, quantitative evaluation of ionic conductivity and offers a powerful tool for accelerating the development of solid electrolyte materials.
Oxide-based lithium-ion conductors are considered advantageous due to their nonflammability, chemical stability, and mechanical strength.
Representative oxide-based lithium-ion conductors with high ionic conductivity include perovskite-type Li0.5La0.5TiO3,7 garnet-type Li7La3Zr2O12 (LLZ),8 and NASICON-type compounds such as Li1.5Al0.5Ge1.5P2O12,9 Li1.3Al0.3Ti1.7(PO4)3 (LATP),10 and LiZr2(PO4)3.11 Among these, LLZ and LiZr2(PO4)3 are particularly notable because they do not react with metallic lithium, making them promising candidates for high-energy-density all-solid-state batteries with lithium metal anodes. However, LLZ is known to be sensitive to experimental conditions, such as its high reactivity with moisture and CO2 in air, and variations in conductivity depending on the incorporation of Al from crucibles during synthesis.12
Among them, NASICON-type LiZr2(PO4)3 materials have attracted attention as they combine high lithium-ion conductivity with durability against metallic lithium. In fact, Li et al. successfully fabricated a high-capacity all-solid-state battery using lithium metal, composed of Li/LiZr2(PO4)3/LiFePO4, and reported stable cycle performance.13
To date, many attempts have been made to improve the conductivity of LiZr2(PO4)3 by controlling its composition through doping with different elements.14–25 In addition, when using oxide materials as solid electrolytes, it is essential to obtain dense sintered bodies in terms of mechanical durability and the reduction of grain boundary resistance. The control of oxide sintering depends on heating temperature, atmosphere, time, and raw powder characteristics. Generally, increasing the heating temperature enhances the sintering density. However, in the case of LiZr2(PO4)3, components such as Li and P often volatilize at high temperatures, making it necessary to optimize the heating conditions.26,27
We have attempted to maximize lithium-ion conductivity in Li1+2x+yCaxZr2−xSiyP3−yO12 (hereinafter referred to as LCZSP) materials, in which Ca and Si are substituted, by controlling the composition of Ca and Si28 or the heating conditions.26,29 For example, in composition control, the amount of Ca doping was found to influence microstructure of sintered body, while Si doping was associated with the formation of the α-phase (high ionic conductivity) and β-phase (low ionic conductivity), significantly affecting lithium-ion conductivity (from 2.7 × 10−5 to 2.3 × 10−8 S cm−1 at 30 °C).27,28
Furthermore, even with the same composition, we found that systematically varying the first and second heating temperatures resulted in significant changes in lithium-ion conductivity over two orders of magnitude (from 3.3 × 10−5 to 6.4 × 10−7 S cm−1 at 30 °C).26 It was also revealed that the optimal heating temperature lies in the middle of the specified temperature range, suggesting a trade-off relationship likely due to sintering density and component volatilization as mentioned above. These findings clearly indicate that optimizing the composition and processing conditions is crucial in the development of solid electrolyte materials.
Optimization of composition and processing has traditionally relied on a trial-and-error approach based on the knowledge and experience of researchers and engineers. However, in recent years, materials development utilizing Materials Informatics (MI) has been explored. MI aims to predict physical properties such as activation energy using machine learning, and by applying information science to materials development, it is expected to accelerate the discovery of new functional materials.30–32 We have demonstrated efficient determination of the optimized temperature and composition by applying Bayesian optimization to LCZSP materials.26–28 However, one challenge with above mentioned Bayesian optimization is that it employs black-box functions, such as Gaussian process regression, which often fails to provide systematic knowledge on factors affecting Li ion conductivity. Traditionally, various analytical techniques have been used to understand the mechanisms underlying material functionality. For inorganic crystalline compounds, commonly used methods include X-ray Diffraction (XRD) for analyzing crystal phases, Scanning Electron Microscope (SEM) for observing microstructure, and X-ray Photoelectron Spectroscopy (XPS) for elemental analysis and chemical bonding states of surfaces. Recently, we evaluated the relationship between XRD profiles obtained from LCZSP materials sintered at various temperatures and the solid electrolyte properties (activation energy for lithium-ion conduction) using deep learning with an attention mechanism.33 As a result, it was suggested that the activation energy can be predicted from XRD profiles and is mainly influenced by the resulting crystal phases (α-phase, β-phase, and impurity phases). On the other hand, while XRD is sensitive to crystal structures derived from composition and atomic arrangements, it is not suitable for analyzing microscale information such as particle morphology and sintering microstructures. SEM images are effective tools for analyzing micrometer-scale information, and indeed, studies have been reported in which such images are directly analyzed using deep learning techniques. For example, Kondo et al.34 developed a model that predicts oxide ionic conductivity in yttria-stabilized zirconia by training a modified VGG16 convolutional neural network (CNN)35 on microscopic images of ceramic microstructures. Furthermore, by visualizing the intermediate features in the CNN architecture, they specified microstructural features in SEM images affecting measured ionic conductivity positively or negatively. More recently, deep learning has also been used to elucidate the relationships between microstructures and properties such as mechanical strength and thermal conductivity in sintered silicon nitride ceramics (a heat-resistant structural material),36 and between sintered structures and ionic conductivity in lithium-ion conductive materials.
In this study, we acquired 130 SEM image data for a total of 52 sintered LCZSP specimens, each with different compositions and sintering conditions, for which ionic conductivity had been measured previously, and then used a CNN to predict their conductivities. We placed particular emphasis on how microstructure affects ion transport, and sought to deepen our understanding of LCZSP ionic conduction by visualizing which regions of the SEM images exert positive or negative influence on conductivity.37
For training the CNN model, we employed leave-one-out cross-validation, which is suitable for small datasets. Since each material had three images, all three were used as the test set for that material, while the remaining 153 images were used for training and validation (Fig. 1). Four-fold cross-validation (k = 4) was performed on the training data, and the hyperparameters that yielded the minimum loss were adopted. Details of these hyperparameters are listed in Table S2. The four models obtained through k-fold cross-validation were used to predict the test data, and their average was taken as the final prediction. Mean Squared Error (MSE) was used as the loss function for training and validation, while Root Mean Squared Error (RMSE) and the R2 score were used as evaluation metrics for the test data. For training the CNN model, we employed leave-one-out cross-validation at the sample level, which is suitable for small datasets. Each material was associated with 1–3 SEM images, and all of the images belonging to the held-out sample were used together as the test set. The remaining images (from the other 51 samples, totaling 127–155 images depending on the test case) were used for training and validation (Fig. 1). This procedure ensures that no data leakage occurs between training and test sets, as different images from the same sample were never split across them.
Four-fold cross-validation (k = 4) was then performed on the training portion, and the hyperparameters that yielded the minimum loss were adopted. Details of these hyperparameters are listed in Table S2. The four models obtained through k-fold cross-validation were used to predict the test data, and their average was taken as the final prediction. Mean Squared Error (MSE) was used as the loss function for training and validation, while Root Mean Squared Error (RMSE) and the R2 score were used as evaluation metrics for the test data.
| Model name | Mode | Figure |
|---|---|---|
| Efficient Net-B3 | Transfer | — |
| Baseline | Original | Fig. S1 |
| Baseline with descriptors | Original | Fig. S2 |
| Baseline with Attention | Original | Fig. S3 |
Among the remaining architectures, the “Baseline” model represents the simplest structure, and the other two are modifications of this “Baseline” model. The detailed structure of the “Baseline” model is shown in Table 2. In the “Baseline with descriptors” model, we focused on the Global Average Pooling (GAP) layer and observed changes in regression accuracy by concatenating additional material-derived descriptors to the one-dimensional feature vector. The “Baseline with Attention” model introduces an attention mask into the intermediate convolutional layers.33,42–44 Specifically, a single image is generated from the feature maps using a 1 × 1 convolution,45–48 and a Sigmoid function is applied to this image to transform it into an attention mask (attention score). This mask is then element-wise multiplied (Hadamard product) with each original feature map to produce a set of weighted feature maps, which are passed on to the next layer. The attention mask highlights the regions that should be focused on, thus enhancing the effect of feature extraction through convolution.
| Type | Input | Kernel | Stride | Pad | Output |
|---|---|---|---|---|---|
| Input | 512 × 512 × 1 | N/A | N/A | N/A | 512 × 512 × 1 |
| Convolution | 512 × 512 × 1 | 3 × 3 | 1 | 1 | 512 × 512 × 16 |
| Convolution | 512 × 512 × 16 | 3 × 3 | 1 | 1 | 512 × 512 × 16 |
| Max pooling | 512 × 512 × 16 | 2 × 2 | 2 | 0 | 256 × 256 × 16 |
| Convolution | 256 × 256 × 16 | 3 × 3 | 1 | 1 | 256 × 256 × 32 |
| Convolution | 256 × 256 × 32 | 3 × 3 | 1 | 1 | 256 × 256 × 32 |
| Max pooling | 256 × 256 × 32 | 2 × 2 | 2 | 0 | 128 × 128 × 32 |
| Convolution | 128 × 128 × 32 | 3 × 3 | 1 | 1 | 128 × 128 × 64 |
| Convolution | 128 × 128 × 64 | 3 × 3 | 1 | 1 | 128 × 128 × 64 |
| Global average pooling | 128 × 128 × 64 | 128 × 128 | 1 | 0 | 1 × 1 × 64 |
| Fully connected | 1 × 1 × 64 | 1 × 1 | 1 | 0 | 1 × 1 × 1 |
The schematic figure of the “Baseline,” “Baseline with descriptors,” and “Baseline with Attention” architectures are shown in Fig. S1–S3, respectively. Also codes are available in the associated repository.49 In addition, to address the concern that the network design might be overly simplistic, we conducted supplementary experiments by increasing the depth of the “Baseline” model ((1) doubling convolutional layers, (2) doubling fully connected layers, and (3) doubling both). The results, summarized in SI Table S3, show that these deeper variants did not yield any significant improvement in RMSE compared to the original Baseline model. This confirms that the adopted network depth is already appropriate for the dataset size, and deeper architectures only increased the risk of overfitting without enhancing predictive accuracy.
To improve the prediction accuracy for images of low ionic conductivity materials—which previously showed lower accuracy—we conducted regression analysis using the “Baseline with descriptors” model, which incorporates additional numerical vector descriptors such as sintering temperatures, composition, and so forth (the specific information of added descriptors is summarized in Table 3). As previously described, material-derived features were newly appended at the GAP layer. The ionic conductivity characteristics of oxide solid electrolytes are determined by the material composition and the sintering temperature necessary to form ion-conducting pathways. Therefore, in this study, we created 33 descriptors: seven basic descriptors consisting of the molar ratios of [Li, Ca, Zr, Si, P] representing the composition of LCZSP and the first and second sintering temperatures, along with their interaction terms (products). To ensure stable learning within the neural network, these 33 descriptors were standardized to align their scales. Fig. 3(d) shows the diagnostic plots of the “Baseline with descriptors” model. It achieved relatively high prediction accuracy even in the previously challenging low ionic conductivity region, attaining a significant improvement of an R2 score = 0.871. This indicates that while features such as sintered structure and particle morphology captured in SEM images do contain some information relevant to ionic conductivity, it is essential to also include information not directly visible in SEM images, such as material composition, for accurate prediction of material properties. Although the dataset size (52 samples, 130 images) may appear relatively small compared to typical machine learning benchmarks, it should be noted that in the context of experimental solid electrolyte research this represents a substantial collection effort. Each sample requires careful synthesis, sintering, and characterization, and thus assembling a dataset of this scale is non-trivial. Our results demonstrate that even with such a dataset size, the CNN models are able to achieve reliable predictive performance (R2 = 0.871), highlighting the practical utility of deep learning approaches for small-data regimes that are common in materials science. Consequently, the “Baseline + descriptors” model was found to be the most accurate among all evaluated models. In addition, we trained a model using only composition- and sintering-related descriptors, without SEM images (SI Fig. S4(a and b)). The resulting R2 score of 0.38 indicates that such descriptors alone can explain the overall trend but are insufficient for precise predictions. Comparing the three models—(i) descriptors only, (ii) SEM images only, and (iii) descriptors + SEM images—clearly demonstrates that SEM image features provide complementary information to composition and sintering descriptors, leading to the best performance in the combined model. Similar trends were reported by Zhang et al.,37 who used traditional machine learning to correlate microstructural features (grain size, porosity, and grain-boundary fraction) with ionic conductivity in oxide solid electrolytes. Our CNN-based approach builds upon this concept by automatically learning such microstructural features from SEM images while integrating compositional and processing descriptors. To further evaluate the contribution of each descriptor, a feature importance analysis using SHapley Additive exPlanations (SHAP)50 was conducted. Fig. S4(c) shows the SHAP summary plot for the “descriptors only” model. The results indicate that the first heating temperature (T1) and Si content are the most influential features, followed by their interaction terms (e.g., Si × T1, Zr × T1). These factors are physically meaningful, as excessive Si is known to promote the formation of low-conductivity secondary phases, while a lower first heating temperature reduces sintering density and connectivity of the ion-conducting network. Therefore, the addition of composition- and process-related descriptors to the CNN framework effectively complements the SEM image features and enhances the model's ability to capture the influence of processing and composition on ionic conductivity.
| [Ca] | [Si] | [Li] | [Zr] | [P] | T1 | T2 |
|---|---|---|---|---|---|---|
| [Ca]2 | [Si]2 | [Li]2 | [Zr]2 | [P]2 | T12 | T22 |
| [Ca] × [Si] | [Ca] × [Li] | [Ca] × [Zr] | [Ca] × [P] | [Li] × [Zr] | [Ca] × T1 | [Ca] × T2 |
| [Si] × [Li] | [Si] × [Zr] | [Si] × [P] | [Li] × [P] | [Zr] × [P] | [Si] × T1 | [Si] × T2 |
| [Li] × T1 | [Li] × T2 | [Zr] × T1 | [Zr] × T2 | T1 × T2 |
The GAP layer represents a comprehensive value obtained by learning various features through multiple convolution and pooling layers. Therefore, it can be assumed that the one-dimensional values obtained after GAP have either positive or negative correlations with the ionic conductivity of the input images. By averaging the information in each correlation group, we can visualize their contributions. The visualization process from CNN training is outlined below:
(i) Using the trained CNN model, divide the feature channels into groups that show positive and negative correlations with ionic conductivity.
(ii) For each group, average the feature maps obtained after the final convolution layer (GAP). This means aggregating the information into a single feature map by averaging the pixel values across all channels in the group.
(iii) Since the feature maps generated in (ii) are downsampled compared to the original input image, resize them back to the original image size.
(iv) In the resulting resized feature map, the pixel locations with values higher than the overall median are identified as important regions contributing to ionic conductivity. A masking operation is then applied to the original SEM image, displaying only these regions.
This series of operations, from (i) to (iv), is referred to as segmentation. In this study, segmentation was performed using the trained “Baseline with descriptors” model, which had achieved the highest regression accuracy, to visualize the relationship between SEM images and ionic conductivity.
Fig. 4(b) shows a histogram of the correlation coefficients between feature channels and ionic conductivity. The model used had 64 channels in its GAP layer. Fig. 4(b) presents a histogram of the correlation coefficients between each channel and ionic conductivity. In our model, the global average pooling (GAP) layer processes 64 channels. As shown in Fig. 4(b), the channel-wise correlation coefficients are relatively low ranging from −0.5 to +0.3. Although the feature maps extracted by the CNN at the GAP layer do not exhibit strong correlations individually, weak inter-channel correlations are present. By aggregating these weak signals, a high overall correlation as shown in Fig. 3 can be derived. Fig. 5 presents segmentation results for four representative materials with high and low ionic conductivity. In the positive examples, it was observed that regions with larger particles were recognized as contributing to high conductivity. Conversely, in the negative examples, grain boundaries were identified as low-conductivity regions. Since grain boundaries often disrupt crystal structures, leading to reduced conductivity,25,53,54 this negative judgment aligns with previous findings. Large, isotropic grains likely represent the bulk of the high-conductivity α-phase and were thus evaluated positively. Notably, channels exhibiting relatively strong negative correlations (i.e., <−0.25) are more prevalent than those with positive correlations (i.e., >+0.25), as shown in Fig. 4(b). This trend suggests increased sensitivity to microstructural features such as grain boundaries and voids observed in the SEM images.
These segmentation results are consistent with established knowledge. However, prior observations have indicated that columnar-like small particles in LZP materials correspond to the low-conductivity β-phase.25,39 Interestingly, in Fig. 5(c and d), even the bulk of columnar-like crystals—presumed to be β-phase—were evaluated positively, yielding results that differ from known expectations. These results suggest that accurate prediction of ionic conductivity requires the inclusion of composition and structure information. In our “Baseline + descriptors” model, explicitly providing composition and sintering-temperature data enabled us to incorporate details about the resulting α- and β-phases.
Nonetheless, this method demonstrates the ability to extract valuable insights regarding material microstructures related to ionic conductivity directly from SEM images. Typically, interpreting SEM images requires expert knowledge and experience in materials research, but segmentation can help reduce the effort required for such analyses and improve efficiency.
Furthermore, transfer learning represents a promising future direction. Our preliminary tests with EfficientNet-B3 pretrained on ImageNet already improved baseline accuracy, suggesting that the availability of larger SEM datasets will further enhance generalizability and emphasize the importance of open data initiatives. This methodology is not only applicable to the LCZSP materials studied here but can also be extended to other systems, enabling the effective utilization of routinely acquired SEM images.
Supplementary information: conditions of neural network settings. See DOI: https://doi.org/10.1039/d5dd00232j.
| This journal is © The Royal Society of Chemistry 2026 |