Interpretable and uncertainty-informed machine learning to accelerate the design and discovery of lead-free piezoceramics with large piezoelectric constant

Heng Hu; Bin Wang; Didi Zhang; Kang Yan; Tao Tan; Dawei Wu

doi:10.1039/D5TC00865D

View PDF VersionPrevious ArticleNext Article

DOI: 10.1039/D5TC00865D (Paper) J. Mater. Chem. C, 2025, 13, 8958-8968

Interpretable and uncertainty-informed machine learning to accelerate the design and discovery of lead-free piezoceramics with large piezoelectric constant†

Heng Hu ^a, Bin Wang ^a, Didi Zhang ^a, Kang Yan *^a, Tao Tan *^b and Dawei Wu *^a
^aState Key Laboratory of Mechanics and Control of Mechanical Structures, College of Aerospace Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China. E-mail: yankang@nuaa.edu.cn; dwu@nuaa.edu.cn
^bFaculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China. E-mail: taotan@mpu.edu.mo

Received 27th February 2025 , Accepted 15th April 2025

First published on 16th April 2025

Abstract

Potassium sodium niobate (KNN)-based ceramics are promising alternatives to lead-containing piezoelectric materials. However, the vast design space, characterized by multiple dopant choices and variable content, presents a considerable challenge in the chemical modification of KNN compositions to improve their piezoelectric performance. In recent years, the rapid advance of machine learning (ML) techniques has facilitated expedited materials design and discovery with deeply sought insights into the materials. In this study, we constructed an interpretable and uncertainty-informed ML framework to optimize the piezoelectric coefficient d₃₃ of a KNN-based lead-free system. We identified and analyzed the influential features for the d₃₃ prediction and conducted three experimental iterations based on the uncertainty-informed predictions obtained from the Monte Carlo dropout (MCDropout). Promising KNN compositions exhibiting large d₃₃ values over 300 pC N⁻¹ were located and synthesized. Furthermore, the MCDropout markedly reduced the computational cost by 33% compared to the commonly used bootstrap method for uncertainty assessment. This study exhibits an ML framework with enhanced interpretability and search efficiency for optimizing the crucial piezoelectric properties of piezoceramics. The application scope of the utilized methods can be extended to various materials with tailored properties.

1. Introduction

Chemical modification is one of the most frequently used approaches to enhance the piezoelectricity of lead-free materials, which makes them promising alternatives to the Pb(Zr,Ti)O₃-based materials.¹ However, the vast chemical design space significantly limits the exploration efficiency for high-performance compositions. In recent years, the machine learning (ML) framework that combines feature engineering, ML modeling, and iterative experimental observations has emerged as a promising tool to accelerate the design and discovery of novel materials.^2–6 Previously unobserved piezoelectric compositions with enhanced properties have been effectively located. Typically, Yuan et al.^6–8 used ML methods to design the BaTiO₃ (BT)-based piezoceramics with large electrostrain and piezoelectric coefficient. Gu et al.⁹ and Sun et al.¹⁰ constructed ML models to accelerate the design of high-performance (K,Na)NbO₃ (KNN)-based ceramics. The ML model exhibits excellent ability to establish the hidden patterns between input features and output properties, making it an efficient tool for materials design and discovery.

Recently, a critical concern of ML about its interpretability or explainability has received rising attention.^11,12 The lack of transparency poses challenges in understanding the rationale behind the ML predictions, thus necessitating the development of an interpretable ML framework that shows exact physical meanings and closely engages with observations from experimentation.^13–15 Besides, training robust ML models implicitly assumes the availability of a sufficiently large and diverse database.¹⁶ Unfortunately, this rarely occurs in the investigation of piezoelectric materials due to the laborious and lengthy experimental procedures. ML models are prone to generating less reliable predictions when exploring the materials far from the observed domain than when exploiting those near the observed domain in the design space.¹⁷ Therefore, it is crucial to seek out rationality and evaluate the reliability of ML predictions when using ML methods to accelerate the search for materials with tailored properties.

This study aims to establish an interpretable and uncertainty-informed ML framework for designing and discovering lead-free piezoceramics. This framework is centered around shapley additive explanations (SHAP),¹⁸ Monte Carlo dropout (MCDropout),¹⁹ and Bayesian optimization (BO),²⁰ which are integrated to search for promising KNN-based lead-free piezoelectric ceramics with enhanced piezoelectric coefficient d₃₃. Extensive research has been conducted on enhancing the piezoelectric properties of KNN-based ceramics using multicomponent co-dopants, such as Li, Sb, BaZrO₃, etc.^21–25 The selection of dopants and their stoichiometry is mainly based on expertise-driven empirical observations due to the complex and obscure mechanisms underlying piezoelectricity enhancement in KNN-based ceramics. Therefore, the application of the SHAP method here aims to interpret the influence of material features on d₃₃ prediction in the data-driven materials design, which might give insight into these materials from a statistical learning perspective. Uncertainty quantification (UQ) is essential for studies with limited and unbalanced experimental databases as it provides a rigorous evaluation of the ML predictions.²⁶ We emphasized the role of UQ, which is realized by the MCDropout. This technique extends the traditional dropout employed to prevent overfitting during the training phase of ML and generate uncertainty-informed predictions during the inference phase.²⁷ It has been explored for various applications, such as digital histopathology,²⁸ soil spectroscopy,²⁹ and object detection.³⁰ For the materials research, the potential of MCDropout is valuable to be further validated and developed. In addition, BO has emerged at the forefront of expensive “black box” optimization due to its data efficiency.³¹ It can guide global optimal search with a small number of experimental observations, which has attracted intense interest in materials research.^32,33 Our experimental design strategies based on BO functions, including the exploitation (ET) and expected improvement (EI), leverage predictions and predictive uncertainty captured by the MCDropout to determine the compositions for the experimental observations.

The influential features interpreted by the SHAP show consistency with the conventional understanding of KNN-based ceramics. Simultaneously, additional insights into the previously unfocused features affecting the direction of d₃₃ predictions were revealed. The model implemented with MCDropout effectively generated multiple predictions for compositions in the vast design space. The shape of the predictive distribution can be approximated as the bell curve of normal distribution, which is ideal for the UQ. Furthermore, the computational results reveal that the applied MCDropout made more efficient inferences than the commonly used bootstrap method.^7–9,34,35 This benefit can be strengthened when the iteration and model complexity increase. Ultimately, three iterations of the experiment guided by the BO led to the discovery of KNN compositions with enhanced d₃₃ exceeding 300 pC N⁻¹, validating the effectiveness of the utilized method.

2. Methods

The iterative ML workflow of this study is shown in Fig. 1, which consists of five parts for each iteration: data preparation, feature engineering, machine learning, experimental design, and synthesis and characterization.


	Fig. 1 Schematic workflow for the machine learning-driven design of KNN-based ceramics.

2.1 Computational details

The design space was defined according to the chemical formulae (1 − x)(K_y₁Na_y₂Li_{1−y₁−y₂})(Nb_zSb_1−z)O₃-xMZrO₃ (xM = aBa, bCa, cBi_0.5K_0.5, dBi_0.5Na_0.5, or eBi_0.5Li_0.5). The mole fraction of each variable was constrained by 0 ≤ x ≤ 0.06, 0.4 ≤ y₁ ≤ 0.5, 0.5 ≤ y₂ ≤ 0.6, 0.9 ≤ z ≤ 1, 0 ≤ a, b, c, d, e ≤ 0.05 to avoid the possible appearance of relaxor phase, which can be harmful to d₃₃ enhancement. It leads to approximately 331 [thin space (1/6-em)]

000 possible compositions if the mole fraction is controlled to 0.01. The database (shown in Table S1, ESI†) is included in the design space. All ceramic specimens in the database were prepared using the conventional solid-state reaction method and sintered at a sintering temperature around 1120 °C, which ensures data comparability. We took the nominal compositions in literature as actual compositions, excluding the impact of processing variance.³⁶

Feature engineering involves a three-step process: feature construction, feature screening, and feature interpretation. Firstly, the KNN compositions were transformed into features that had been defined according to the knowledge of ferroelectrics.^6,37 The feature pool covers the size factors (e.g., atomic volume), electrochemical factors (e.g., electronegativity), atomic number factors (e.g., atomic number), etc. of A and B-site dopants (shown in Tables S2 and S3, ESI†). The prepared features are representative, related to target properties, and easily accessible.³⁸ Subsequently, Pearson correlation analysis was used to screen out highly correlated features. The Pearson correlation coefficient r between two features is defined by


	(1)

where n is the number of features, X_i and Y_i are two different features, [X with combining macron]

and Ȳ are the mean of X_i and Y_i. One of the highly correlated features with r > 0.95 was retained, while others were removed.³⁹ The retained features served as the inputs of the ML model. Lastly, gradient boosting and SHAP methods were used to improve the interpretability of ML models. The gradient boosting method provided relative importance of features to the model prediction, which is calculated based on the contribution of features to the performance of the gradient boosting regressor. The SHAP method explained the direction of the top-ranked features that contributed to the prediction of d₃₃. The important features and their impact on predictions are illustrated by combining these two methods.

We used the artificial neural network (ANN) to learn the patterns between input features and output d₃₃. Compared to traditional ML models, it can be more advantageous when dealing with high-dimensional features and non-linear patterns.⁴⁰ The ANN architecture consists of an input layer, one or more hidden layers, and an output layer that can be flexibly adjusted depending on specific problems. The shape of the input layer corresponds to the dimension of features. The rectified linear unit served as the activation function for the hidden layers, whereas the linear function was employed for the output layer. The grid search method was used to determine hidden layer configurations, including the number of hidden layers and the number of neurons within each hidden layer. The search range for the hidden layer number is defined from 1 to 2. The hidden nodes in each hidden layer range from 2 to 30 with an increment of 2. Leave-one-out cross-validation (LOOCV) method was used to calculate the performance of ANN models with different hidden layer configurations. This method works by training all but one sample and testing the left-out sample. The above procedure was repeated until every sample had been the test sample. The metric to evaluate model performance is the root mean square error (RMSE) across all test samples (RMSE_LOOCV), which is defined by


	(2)

where n is the size of the original dataset, y_i is the measured value and ŷ_i is the predicted value based on the model trained by the rest n − 1 samples. We chose and fixed the hidden layer configurations that achieved the lowest RMSE_LOOCV for the subsequent task. The training and optimization processes of the ANN model took 500 epochs. The optimizer is adaptive moment estimation, and the loss function is RMSE. The learning rate of the optimizer is 0.001. The weights and biases in the neural network were initialized before each model training process, eliminating the influence of cumulative learning.

The MCDropout was used to assess model uncertainty. This method works by randomly activating and switching off different subsets of neurons in the hidden layers of ANN with a dropout rate of p. The model architecture has 2ⁿ variations during each inference if there are n neurons in the hidden layers as a result of p. Different architectures lead to different predictions for the same input. Consequently, we can uncover uncertainty by inspecting the predictive distributions provided by the MCDropout. In this study, the uncertainty is evaluated through two key parameters: the mean μ and standard deviation σ of the predictions. We implemented MCDropout on all the neurons in the hidden layers with a dropout rate of 0.2. That means 20% of the neurons, together with connected synapses, are randomly set inactive at each epoch during the inference phase. The forward pass frequency was set to 500, resulting in 500 predictions for each composition in the search space.

We used the normal quantile–quantile (Q–Q) plot to test the normality of the predictive distributions. The Q–Q plot illustrates discrepancies between a sample distribution (sample quantiles) and what is expected from a normal distribution of a similar number of values with the same μ and σ (theoretical quantiles). A strong agreement between the theoretical and sample quantiles indicates a satisfactory approximation to normality in the predictive distribution. We used R² to numerically measure the agreement, which is defined by


	(3)

where n is the number of samples, q_i is the sample quantile, [q with combining circumflex]

_i is the theoretical quantile and [q with combining macron]

is the μ of n sample quantiles. At last, we compared the inference time using the MCDropout method with the widely used bootstrap method. The principle of bootstrapping is to randomly resample with replacement from the initial database to construct multiple datasets. Based on 500 bootstrapped datasets, we trained 500 ANN models with the same layer configurations to infer the design space.

The search for high-performance compositions is inspired by the BO and surrogate-based optimization.^41,42 We used ANN as the surrogate model to approximate the features-d₃₃ relationship. Two typical acquisition functions, ET and EI, based on the predictions obtained from ANN models, were employed to guide the experimental design. The ET function selects the composition with the highest predicted μ at point x, which is defined by


ET(x) = argmax[μ(x)].	(4)

The EI function takes into account both the predicted μ and σ, which is defined by


EI(x) = argmax[(μ − f(x))Φ((μ − f(x))/σ) + σφ((μ − f(x*))/σ)],	(5)

where the Φ(z) and φ(z) represent the cumulative distribution function (CDF) and probability density function (PDF) of the standard normal distribution, respectively. The f(x*) equals the largest d₃₃ value in the database. The compositions that maximize these two acquisition functions were selected for experimental observation.

2.2 Experimental procedures

The KNN-based piezoceramics were prepared using the conventional solid-state method. K₂CO₃(99.99%), Na₂CO₃(99.8%), LiCO₃(99.9%), Nb₂O₅(99.9%), Sb₂O₃(99.9%), ZrO₂(99.9%), Bi₂O₃(99.9%), CaCO₃(99.9%), and BaCO₃(99.9%) were used as raw materials. These materials were mixed in stoichiometric ratios by ball milling using alcohol as a solvent for 12 hours. The powders were calculated at 950 °C for 6 hours after drying. The calcined powders were milled and dried again before being pressed into pellets using polyvinyl alcohol (PVA) as a binder. The pellets were fired at 600 °C to remove the binder and subsequently sintered at 1070–1170 °C for 6 hours in an ambient atmosphere. Different sintering temperatures were exhaustively tested for the pellets with the same composition to find out the optimal sintering temperature for d₃₃. The sintered ceramic specimens were polished and coated with silver electrodes on the top and bottom surfaces for electrical testing.

The ceramic surfaces were polished and thermally etched to expose the microstructure and grain morphologies, which were observed by scanning electron microscopy (SEM) (Vega 3, TESCAN, Czech). The ferroelectric and electromechanical properties were characterized at 10 Hz and room temperature using a precision ferroelectric workstation (Radiant Technologies, Inc., USA) equipped with a photonic sensor (MTI-2100, MEMS Technology, USA). The d₃₃ of the ceramic specimens poled in silicone oil at 4 kV mm⁻¹ was measured by a quasi-static piezo-d₃₃ meter (ZJ-3A, Institute of Acoustics Academia Sinica, China).

3. Results and discussion

3.1 Interpretable feature engineering

The r is widely used to measure the degree and direction of the linear dependence between two features. We grouped highly correlated features (r > 0.95) and retained one of them as they essentially provided identical information to the model, which led to a reduction of the feature numbers to 20. The groups of highly correlated features and their respective correlations are shown in Table S4 and Fig. S1, ESI.†Fig. 2(a) illustrates the r matrix between the remaining features for model training. The value in the grids indicates the r value between each feature. The red grid indicates a positive correlation, whereas the blue grid indicates a negative correlation. The color intensity is proportional to the absolute value of r.


	Fig. 2 (a) Heatmap of the Pearson correlation coefficient matrix for material features with low correlation (r ≤ 0.95). (b) Top-ranking features based on the feature importance calculated from the gradient boosting model. (c) SHAP summary plot illustrating the contribution of the top-ranking features to the d₃₃ prediction.

To improve interpretability, we identified the most contributive features to the d₃₃ prediction using the gradient boosting feature importance, as shown in Fig. 2(b). The atomic mass (W), Pauling electronegativity (EN-P), and the effect on the cubic to tetragonal (C–T) ferroelectric transition temperature of the doping cations (T_c) were recognized as the three primary contributory features. Essentially, the W is a crucial atomic number factor to be considered when selecting doping elements. An increase in the EN-P of the metal element results in a higher covalent interaction and hybridization. The addition of high electronegativity elements, e.g., Bi³⁺, into KNN-based materials would increase the local covalency, leading to a high polarization region or local octahedral titling, which can be important for d₃₃ enhancement.¹ Besides, the effect of dopants on Curie temperature acts as a key indicator when constructing a polymorphic phase boundary (PPB) for achieving high piezoelectric performance of KNN-based materials.⁴³

Fig. 2(c) displays an informative SHAP overview of the effects of features on the direction of d₃₃ prediction. For each feature, every sample in the dataset is distributed horizontally along the x-axis according to their SHAP values. The color bar corresponds to the original values of the features for each sample. Examining the color distribution horizontally of all the samples in each feature row provides insights into the directional influence of the features on the model's predictions. For instance, higher W values have positive SHAP values as the points extending towards the right are increasingly red, indicating that increasing the A/B-site ratio can lead to higher predicted d₃₃. This may explain why adding Bi³⁺ can enhance d₃₃ from the statistical perspective.⁴⁴ The T_O–T and T_R–O represent the influence of dopants on the tetragonal to orthorhombic (O–T) and rhombohedral to orthorhombic (R–O) phase transition temperatures, respectively. The T_c and T_O–T exhibit negative values, but T_R–O has positive values as the dopants generally decrease C–T, O–T and increase the R–O phase transition temperatures.⁴⁵ Lower T_c values lead to a more positive impact on d₃₃ prediction, reflecting the inverse relationship between T_c and d₃₃. Increasing T_R–O and decreasing T_O–T to construct PPB is an effective way to improve d₃₃.^23,46 The SHAP values of T_R–O and T_O–T indicate that limited additives capable of shifting R–O and O–T phase transition temperature points lead to the d₃₃ improvement, while an excess of these additives can negatively impact d₃₃. The SHAP analyses show agreement with the reported phase and d₃₃ evolution tendency.^47,48 Overall, the demonstrated interpretable ML not only corroborates established knowledge but hopefully provides unexploited insights into the materials of interest.

3.2 Machine learning model configuration

This study used the exhaustive grid search method to calculate the average of cross-validated RMSE on the left-out test points. It is suitable for this small-sized dataset problem, providing highly accurate and consistent results of performance assessment as every sample has been the test point. Fig. 3(a) shows the evolution of RMSE_LOOCV as the number of neurons in the one-hidden-layer architecture models increases. The global performance tended to improve with increased neurons. However, some fluctuations were captured. The error can increase in the overfitted models due to increased neurons in the hidden layer, eventually reflected in the RMSE_LOOCV. Adding one more hidden layer can stabilize the feedback in the nonlinear system and lead to better generalization performance.^49,50


	Fig. 3 (a) Performance of one-hidden-layer neural networks with varying numbers of hidden layer nodes. (b) Performance of two-hidden-layer neural networks with varying numbers of hidden layer nodes. (c) Global performance on test sets for the optimal neural network for the first experimental iteration. (d) Global performance metrics as a function of the number of experimental iterations.

Fig. 3(b) shows the evolution of RMSE_LOOCV with increasing neurons in the two-hidden-layer architecture models. The optimal performance was achieved by the model with 22 neurons in the first hidden layer and 30 neurons in the second hidden layer. The RMSE and mean absolute error (MAE) were further reduced to 60.34 pC N⁻¹ and 42.42 pC N⁻¹, respectively. The global performance of this model architecture for the first experimental iteration is shown in Fig. 3(c). Approximately 85% of the test points were constrained in the error lines ± 60 pC N⁻¹, indicating a moderate predictive capability of the selected model architecture. The global performance of the model with the same architecture for the second and third experimental iterations is shown in Fig. S2, ESI.†

Fig. 3(d) shows the evolution of global performance with experimental iterations. The predictive errors remain stable after three iterations. Despite the efforts that have been made to improve and maintain model performance, the common challenges in constructing ML models for data-driven materials studies, including the noise in samples, the small sample size, the imperfect features, etc.,^6,36 can induce uncertainty and prevent the model from achieving reduced errors. Thus, it is necessary to carefully evaluate the predictive uncertainty for the subsequent deployment of the ML model in the search space.

3.3 Uncertainty-guided experimental design

To obtain uncertainty-informed predictions, we applied the ANN with MCDropout to the entire search space. Fig. 4(a)–(c) show the predictive distributions of the candidates that maximized ET and EI throughout three experimental iterations. Eqn (4) indicates that the ET function is a pure exploitation strategy without considering assessed uncertainty. As seen from the red columns and approximated bell curve, the ET function selects the composition with the highest μ of predictions, illustrated as a narrow distribution centered at a high predicted d₃₃ value. On the contrary, the EI function balances the trade-off between exploitation and exploration. As indicated by eqn (5), the first component of EI is dominated by predicted μ, which stands for exploitation. The second component of EI is dominated by predicted σ, which stands for exploration. The distribution of the composition selected by ET exhibits a broader form, indicating more significant uncertainty. The distribution shapes reflect the exploitative tendency of ET and the explorative tendency of EI.


	Fig. 4 Monte Carlo dropout predictions and their normality tests as a function of the number of experimental iterations. (a)–(c) Predictive distributions of the compositions selected by exploitation (ET) and expected improvement (EI) strategies for three iterations, respectively. (d)–(f) Quantile–quantile plots showing the normality of the predictive distributions across three iterations, respectively.

A normal distribution possesses consistent properties, such as its symmetry around μ and asymptote determined by σ. These properties provide a consistent, rigorous and accurate uncertainty estimation based on the predictive distribution, which is ideal for the use of acquisition functions. We used the Q–Q plot to graphically check whether the predictions conformed to a normal distribution. As shown in Fig. 4(d)–(f), the majority of points cluster around the 45-degree reference line, indicating that the predictions calculated by MCDropout show good normality. In summary, the MCDropout provides comprehensive and well-formed uncertainty estimates for compositions in the materials search space.

3.4 MCDropout versus bootstrapping

This study employed MCDropout as a substitute for the commonly used bootstrap method for UQ. Fig. 5 compares the computational time of individual predictions across the entire design space using MCDropout and bootstrap methods. The predictions were executed 500 times. Compared to the average time of 9.98 seconds using the bootstrap method, the average time to make predictions on the design space is 6.71 seconds using MCDropout under the same computation environment. This result reveals that the used MCDropout outperformed the bootstrap method in computational efficiency.


	Fig. 5 Computational time using Monte Carlo dropout and bootstrap methods.

To explain the computational advantage of MCDropout over bootstrapping, Fig. 6(a) and (b) demonstrate the schematic workflow of these two methods. The computational time to obtain N predictions using bootstrapping can be represented as N × (t_{t_b} + t_{i_b}), where t_t and t_i represent training and inference time, respectively. By contrast, it costs t_{t_m} + N × t_{i_m} using the MCDropout method because it does not require resampling data or retraining models. The t_{t_m} becomes insignificant when N is sufficiently large. Besides, as indicated by the red points in Fig. 5, the computational time using the bootstrap method tends to increase with an increase in inference times. The efficiency benefit of MCDropout can be more obvious when the iteration and model complexity increase. The computational results reveal that the MCDropout is a straightforward and time-saving alternative to the commonly used bootstrap method for UQ.


	Fig. 6 (a) Schematic of Monte Carlo dropout workflow. (b) Schematic of bootstrapping workflow.

3.5 Experimental validation

The preparation of all ceramic specimens was undertaken at the optimized sintering temperature in order to achieve the respective optimal intrinsic d₃₃.⁵¹ Table 1 shows their compositions and measured d₃₃. The compositions identified by the pure exploitation ET function exhibit a higher d₃₃ than those identified by the EI function that balances the trade-off between exploration and exploitation. A similar result was reported in the search for high d₃₃ values of BT-based piezoceramics.⁸ One possible explanation is that the observed points in this study are extremely sparse compared to the unobserved points of the design space. The EI function is inclined to make more risky decisions than the ET function as it extra considers the predictions with larger deviations. As a result, the risk did not result in higher d₃₃ for the initial three iterations. However, from the global optimization perspective, the utilization of EI can mitigate the risk of converging on local optima.⁵² Incorporating EI samples into the database for the subsequent iterations updates the information on the areas with high uncertainty in the design space.

Table 1 Newly synthesized compounds with measured d₃₃

#	Iteration	Strategy	Composition	Measured d₃₃ (pC N⁻¹)
1	1st	ET	(K_0.49Na_0.51Nb_0.96Sb_0.04O₃)_0.96(Bi_0.5Na_0.5ZrO₃)_0.02(Bi_0.5K_0.5ZrO₃)_0.02	330
2	1st	EI	(K_0.5Na_0.5Nb_0.96Sb_0.04O₃)_0.94(BaZrO₃)_0.02(Bi_0.5K_0.5ZrO₃)_0.04	150
3	2nd	ET	(K_0.45Na_0.55Nb_0.97Sb_0.03O₃)_0.96(Bi_0.5K_0.5ZrO₃)_0.04	348
4	2nd	EI	(K_0.4Na_0.6Nb_0.98Sb_0.02O₃)_0.96(Bi_0.5Na_0.5ZrO₃)_0.03(BaZrO₃)_0.01	260
5	3rd	ET	(K_0.47Na_0.53Nb_0.96Sb_0.04O₃)_0.96(Bi_0.5Na_0.5ZrO₃)_0.01(Bi_0.5K_0.5ZrO₃)_0.03	335
6	3rd	EI	(K_0.45Na_0.55Nb_0.98Sb_0.02O₃)_0.94(Bi_0.5Na_0.5ZrO₃)_0.05(BaZrO₃)_0.01	222

Fig. 7 shows the surface morphologies of selected samples. All samples presented relatively dense microstructures and heterogeneous grains. Although alterations in grain size that have occurred due to compositional variation in the ceramics, the SEM results suggest that the ceramics selected by ET and EI functions exhibit no significant differences in microstructure, hence excluding the influence of processing on the different ceramics.


	Fig. 7 Surface morphologies of the newly synthesized ceramic samples. (a)–(f) Represent the samples synthesized in the first, second, and third iterations, respectively. (a), (c), and (e) Represent the samples selected by the exploitation strategy. (b), (d), and (f) Represent the samples selected by the expected improvement strategy.

Fig. 8(a)–(c) show the polarization hysteresis loops (P–E), bipolar, and unipolar strain curves (S_b–E and S_u–E) of the samples under the external electric field, respectively. All samples have well-defined P–E loops except for composition #5, which appears to have a relatively high leakage current. It is shown that the ET-selected compositions have considerably high P_r exceeding 26 μC cm⁻², indicating strong ferroelectricity. The observed enhanced piezoelectric properties in these ceramics are related to their high P_r. In addition, as shown in Fig. 8(b), the multiple ions doping gives rise to defect dipoles in KNN-based ceramics, thereby inducing an asymmetry in bipolar S–E loops.^53,54 All samples show slim S_u–E loops and the maximum S_u was obtained for the ET selected composition #3. It reaches a large electric field – induced strain over 0.18% at 30 kV cm⁻¹, corresponding to large high-field piezoelectric strain coefficients of d₃₃* over 600 pm V⁻¹. This composition shows the maximum d₃₃ of 348 pC N⁻¹, surpassing most compositions in the database. The experimental results confirmed the accelerated search efficiency of the methods utilized for materials design and discovery.


	Fig. 8 Ferroelectric test of the newly synthesized ceramic samples. (a) Polarization hysteresis loops. (b) Bipolar strain loops. (c) Unipolar strain loops.

4. Conclusion

This study presents an active learning framework incorporating SHAP feature engineering, MCDropout UQ, and Bayesian-based global optimization to accelerate the design and discovery of high-performance KNN compositions. The crucial features for the d₃₃ prediction were statistically identified and found to show good consistency with the knowledge of d₃₃ enhancement in the KNN-based ceramics based on experimental observations. Our results suggest that interpretable feature engineering can be utilized as a promising tool for deepening and enlightening the complex physics behind the properties of materials. We investigated and demonstrated how to leverage quantified uncertainty to guide experimental observation. The potential and effectiveness of MCDropout for UQ in materials search were illustrated from computational and experimental aspects. The comparative study between the two UQ methods indicates that MCDropout is a compelling alternative to the bootstrap method, enabling a 33% reduction in computational cost. Finally, the experimentally synthesized candidates selected by the ET strategy reached d₃₃ values exceeding 300 pC N⁻¹, demonstrating the efficient and stable performance of the utilized methods. This study is expected to provide insight into the ML-driven investigation of piezoelectric materials and other various materials.

Author contributions

Heng Hu: writing – original draft, investigation, formal analysis, data curation. Bin Wang: methodology, investigation. Didi Zhang: investigation, data curation. Tao Tan: writing – review & editing, funding acquisition. Kang Yan: writing – review & editing, supervision, project administration, funding acquisition, formal analysis, conceptualization. Dawei Wu: writing – review & editing, supervision, project administration, funding acquisition.

Data availability

The data that support the findings of this study are available within the article and its additional ESI.†

Conflicts of interest

The authors declare that they have no known competing financial interests or personal relationships that could have influenced the work reported in this manuscript.

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant No. 12227808 and 12227802), the National Key Research and Development Program of China (Grant No. 2024YFB4205600), the Natural Science Foundation of Jiangsu Province, China (Grant No. BK20221483) and the Macao Polytechnic University Grant (RP/FCA-15/2022).

References

Y. Zhang and J.-F. Li, J. Mater. Chem. C, 2019, 7, 4284–4303 RSC .
M. O. Buzzy, D. Montes de Oca Zapiain, A. P. Generale, S. R. Kalidindi and H. Lim, Acta Mater., 2025, 284, 120537 CrossRef CAS .
A. K. Srinithi, A. Bolyachkin, X. Tang, H. Sepehri-Amin, S. Dieb, A. T. Saito, T. Ohkubo and K. Hono, Scr. Mater., 2025, 258, 116486 CrossRef CAS .
M. Zhang, C. Luo, G. Zhang, H. Xu and G. Li, J. Alloys Compd., 2024, 1006, 176354 CrossRef CAS .
M. Wang and J. Jiang, Adv. Funct. Mater., 2024, 34, 2314683 CrossRef CAS .
R. Yuan, D. Xue, Y. Xu, D. Xue and J. Li, J. Alloys Compd., 2022, 908, 164468 CrossRef CAS .
R. Yuan, Z. Liu, P. V. Balachandran, D. Xue, Y. Zhou, X. Ding, J. Sun, D. Xue and T. Lookman, Adv. Mater., 2018, 30, 1702884 CrossRef PubMed .
R. Yuan, D. Xue, D. Xue, Y. Zhou, X. Ding, J. Sun and T. Lookman, IEEE Trans. Ultrason., Ferroelectr., Freq. Control, 2019, 66, 394–401 Search PubMed .
W. Gu, B. Yang, D. Li, X. Shang, Z. Zhou and J. Guo, J. Adv. Ceram., 2023, 12(7), 1389–1405 CrossRef CAS .
Y. Sun, B. Hu, Y. Zhang, X. Song, J. Feng, Y. Xu, H. Tao and D. Ergu, J. Alloys Compd., 2024, 1003, 175598 CrossRef CAS .
L. Scorzato, Minds Mach., 2024, 34, 27 CrossRef .
E. Barbierato and A. Gatti, Electronics, 2024, 13, 416 CrossRef .
X. Zhong, B. Gallagher, S. Liu, B. Kailkhura, A. Hiszpanski and T. Y.-J. Han, npj Comput. Mater., 2022, 8, 1–19 CrossRef .
J. Dean, M. Scheffler, T. A. R. Purcell, S. V. Barabash, R. Bhowmik and T. Bazhirov, J. Mater. Res., 2023, 38, 4477–4496 CrossRef CAS .
F. Oviedo, J. L. Ferres, T. Buonassisi and K. T. Butler, Acc. Mater. Res., 2022, 3(6), 597–607 CrossRef CAS .
K. Choudhary, B. DeCost, C. Chen, A. Jain, F. Tavazza, R. Cohn, C. W. Park, A. Choudhary, A. Agrawal, S. J. L. Billinge, E. Holm, S. P. Ong and C. Wolverton, npj Comput. Mater., 2022, 8, 59 CrossRef .
L. Alzubaidi, J. Bai, A. Al-Sabaawi, J. Santamaría, A. S. Albahri, B. S. N. Al-dabbagh, M. A. Fadhel, M. Manoufali, J. Zhang, A. H. Al-Timemy, Y. Duan, A. Abdullah, L. Farhan, Y. Lu, A. Gupta, F. Albu, A. Abbosh and Y. Gu, J. Big Data, 2023, 10, 46 CrossRef .
H. Wang, Q. Liang, J. T. Hancock and T. M. Khoshgoftaar, J. Big Data, 2024, 11, 44 CrossRef .
Y. Gal and Z. Ghahramani, Proceedings of The 33rd International Conference on Machine Learning, PMLR, 2016, vol. 48, pp. 1050–1059.
E. Brochu, V. M. Cora and N. de Freitas, arXiv, 2010, preprint, arXiv:1012.2599 DOI:10.48550/arXiv.1012.2599.
C. Zhou, J. Zhang, W. Yao, X. Wang, D. Liu and X. Sun, J. Appl. Phys., 2018, 124, 164101 CrossRef .
B. Zhang, J. Wu, X. Cheng, X. Wang, D. Xiao, J. Zhu, X. Wang and X. Lou, ACS Appl. Mater. Interfaces, 2013, 5, 7718–7725 CrossRef CAS PubMed .
L. Tan, Q. Sun and Y. Wang, J. Alloys Compd., 2020, 836, 155419 CrossRef CAS .
D. Pan, Y. Guo, X. Fu, R. Guo, H. Duan, Y. Chen, H. Li and H. Liu, Solid State Commun., 2017, 259, 29–33 CrossRef CAS .
T. Mei, T. Chen, Y. Liu, J. Zhang, T. Zhang, G. Wang and J. Zhou, J. Mater. Sci.: Mater. Electron., 2017, 28, 4879–4884 CrossRef CAS .
T. Lookman, P. V. Balachandran, D. Xue and R. Yuan, npj Comput. Mater., 2019, 5, 21 CrossRef .
N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever and R. Salakhutdinov, J. Mach. Learn. Res., 2014, 15, 1929–1958 Search PubMed .
J. M. Dolezal, A. Srisuwananukorn, D. Karpeyev, S. Ramesh, S. Kochanny, B. Cody, A. S. Mansfield, S. Rakshit, R. Bansal, M. C. Bois, A. O. Bungum, J. J. Schulte, E. E. Vokes, M. C. Garassino, A. N. Husain and A. T. Pearson, Nat. Commun., 2022, 13, 6572 CrossRef CAS PubMed .
J. Padarian, B. Minasny and A. B. McBratney, Geoderma, 2022, 425, 116063 CrossRef .
S. H. Yelleni, D. Kumari, P. K. Srijith and C. K. Mohan, Pattern Recognit., 2024, 146, 110003 CrossRef .
X. Wang, Y. Jin, S. Schmitt and M. Olhofer, ACM Comput. Surv., 2023, 55, 287:1–287:36 Search PubMed .
A. Ishii, S. Kikuchi, A. Yamanaka and A. Yamamoto, J. Alloys Compd., 2023, 966, 171613 CrossRef CAS .
R. Yuan, Y. Tian, D. Xue, D. Xue, Y. Zhou, X. Ding, J. Sun and T. Lookman, Adv. Sci., 2019, 6, 1901395 CrossRef CAS PubMed .
J. Li, Y. Zhang, X. Cao, Q. Zeng, Y. Zhuang, X. Qian and H. Chen, Commun. Mater., 2020, 1, 1–10 CrossRef .
C. Kim, A. Chandrasekaran, A. Jha and R. Ramprasad, MRS Commun., 2019, 9, 860–866 CrossRef CAS .
B. Ma, X. Wu, C. Zhao, C. Lin, M. Gao, B. Sa and Z. Sun, npj Comput. Mater., 2023, 9, 1–11 CrossRef .
J. He, J. Li, C. Liu, C. Wang, Y. Zhang, C. Wen, D. Xue, J. Cao, Y. Su, L. Qiao and Y. Bai, Acta Mater., 2021, 209, 116815 CrossRef CAS .
Q. Tao, P. Xu, M. Li and W. Lu, npj Comput. Mater., 2021, 7, 23 CrossRef .
H. Akoglu, Turk. J. Emerg. Med., 2018, 18, 91–93 CrossRef PubMed .
S. F. Ahmed, Md. S. B. Alam, M. Hassan, M. R. Rozbu, T. Ishtiak, N. Rafa, M. Mofijur, A. B. M. Shawkat Ali and A. H. Gandomi, Artif. Intell. Rev., 2023, 56, 13521–13617 CrossRef .
B. Lei, T. Q. Kirk, A. Bhattacharya, D. Pati, X. Qian, R. Arroyave and B. K. Mallick, npj Comput. Mater., 2021, 7, 194 CrossRef .
A. I. J. Forrester and A. J. Keane, Prog. Aerosp. Sci., 2009, 45, 50–79 CrossRef .
H. Tao, H. Wu, Y. Liu, Y. Zhang, J. Wu, F. Li, X. Lyu, C. Zhao, D. Xiao, J. Zhu and S. J. Pennycook, J. Am. Chem. Soc., 2019, 141, 13987–13994 CrossRef CAS PubMed .
W. Yang, Y. Wang, P. Li, S. Wu, F. Wang, B. Shen and J. Zhai, J. Mater. Chem. C, 2020, 8, 6149–6158 RSC .
J.-F. Li, Lead-free piezoelectric materials, Wiley-VCH, Weinheim, 2021 Search PubMed .
X. Lv, J. Zhu, D. Xiao, X. Zhang and J. Wu, Chem. Soc. Rev., 2020, 49, 671–707 RSC .
T. Zheng, J. Wu, D. Xiao, J. Zhu, X. Wang and X. Lou, ACS Appl. Mater. Interfaces, 2015, 7, 20332–20341 CrossRef CAS PubMed .
L. Tan, Q. Sun and Y. Wang, J. Alloys Compd., 2020, 836, 155363 CrossRef CAS .
E. D. Sontag, IEEE Trans. Neural Networks, 1992, 3, 981–990 CrossRef CAS PubMed .
A. J. Thomas, M. Petridis, S. D. Walters, S. M. Gheytassi and R. E. Morgan, in Engineering Applications of Neural Networks, ed. G. Boracchi, L. Iliadis, C. Jayne and A. Likas, Springer International Publishing, Cham, 2017, pp. 279–290 Search PubMed .
H. Hu, M. Huang, B. Wang, D. Zhang, T. Tan, K. Yan and D. Wu, Ceram. Int., 2024, 50, 54536–54546 CrossRef CAS .
D. Zhan and H. Xing, J. Global Optim., 2020, 78, 507–544 CrossRef .
S. Tian, J. Xin, Y. Cheng, L. Lai, B. Li and Y. Dai, Acta Mater., 2024, 280, 120344 CrossRef CAS .
G. Huangfu, K. Zeng, B. Wang, J. Wang, Z. Fu, F. Xu, S. Zhang, H. Luo, D. Viehland and Y. Guo, Science, 2022, 378, 1125–1130 CrossRef PubMed .

Footnote

† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5tc00865d

Click here to see how this site uses Cookies. View our privacy policy here.