Open Access Article
Xi
Zhao
*ab,
Shu-guang
Cheng
a,
Sen
Yu
b,
Jiming
Zheng
a,
Rui-Zhi
Zhang
c and
Meng
Guo
c
aSchool of Physics, Northwest University, Xi'an 710069, China. E-mail: zhao_xii@stumail.nwu.edu.cn
bNorthwest Institute for Nonferrous Metal Research, Xi'an 710016, China
cJinan Key Laboratory of High-Performance Industrial Software, Jinan Institute of Supercomputing Technology, Jinan, Shandong 250103, China
First published on 12th December 2024
High-entropy carbides (HECs) have garnered significant attention due to their unique mechanical properties. However, the design of novel HECs has been limited by extensive trial-and-error strategies, along with insufficient knowledge and computational capabilities. In this work, the intrinsic correlations between elements in the high-dimensional compositional space of HECs are investigated using high-throughput density functional theory calculations and two machine learning models, which enable us to predict the Young's modulus, hardness and wear resistance with only a chemical formula provided. Our models demonstrate a low root mean square error (11.5 GPa) and mean absolute error (9.0 GPa) in predicting the elastic modulus of HECs with arbitrary non-equimolar compositions. We further established a database of 566
370 HECs and identified 15 novel HECs with the best mechanical properties. Our models can rapidly explore the mechanical properties of HECs with descriptor–property correlation analysis, and hence provide an efficient method for accelerating the design of non-equimolar high-entropy materials with desired performance.
Recently, Machine Learning (ML) has achieved significant success in predicting complex high-entropy materials.14–18 By training models on existing data with specified properties and compositions, novel materials can be efficiently predicted prior to their physical synthesis. Zhang et al.14 used artificial neural network(ANN) and support vector machine (SVM) models to identify single-phase HECs and evaluated the single-phase probabilities of 90 HECs that have not yet been experimentally reported, with a prediction accuracy as high as 98.2%. Meng et al.19 used high-throughput synthesis and calculations combined with ML methods to identify 22 phase-forming ability descriptors for novel HECs, achieving a verification accuracy of at least 25.3% higher than previously reported, which provides theoretical guidance for discovering HECs. Tang et al.20 proposed a ML strategy based on bond parameters (bond order, bond ionicity, and bond length) to explore new HECs with excellent mechanical properties, and the mean absolute error (MAE) and R2 of their model were 32.2 GPa and 0.84. Zhou et al.21 developed three ML models (RF, SVR and ANN) to predict the Young's modulus and hardness of various HECs, with MAE of only 15.3 GPa and 1.1 GPa, showing high prediction accuracy. Although ML algorithms show promising predictive potential in exploring the compositional space of HECs, most research still concentrates on the prediction of single-phase formation capabilities, with few studies on mechanical property prediction. Moreover, training ML models for prediction of mechanical properties usually requires many complicated structure-based descriptors, which are inaccessible for unknown new HECs, and it is also important to enhance the generalization ability of ML models to adapt for the prediction of non-equimolar HEC systems.
The goal of this study is to leverage the power of ML methods to explore the compositional space of HECs, understand the relationships and patterns within the elements, and predict the mechanical properties of unexplored HECs with extraordinary mechanical properties to enable composition optimization screening. Designing systems capable of comprehending and mapping the vast chemical space of HECs is an ongoing challenge.22 The key to performance lies in the interaction between elements. In traditional materials, correlations among elements can be illustrated through phase diagrams, where each point in the phase space represents a unique combination of elemental composition and specific properties. For the domain of high-dimensional space corresponding to multi-component HECs, traditional methods struggle to capture all points within the entire space, while in ML, the complex relationships inherent in high-dimensional data present challenges to the generalization ability of models, as the distribution characteristics of training data may differ from unknown data. A key innovation of our approach is the ability to predict the mechanical properties of arbitrary non-equimolar HECs from binary carbides, ternary carbides and quaternary equimolar HECs in the absence of complex structural information and density functional theory (DFT) calculation results, with only a chemical formula provided, demonstrating the great potential of machine learning in complex materials design.
In this work, we employed two algorithms, deep learning and random forest (RF), to predict the mechanical properties such as Young's modulus (E), bulk modulus (B), and hardness (H) of HECs containing nine types of transition metal elements (Ta, Zr, Hf, V, Nb, Ti, Mo, W, and Cr). With the trained ML models, we have established a database containing the mechanical properties of 566
370 HECs, including E, H, etc., and identified 15 compositions with superior mechanical properties. Our results demonstrate the feasibility of advanced ML techniques in learning potential correlations and patterns among elements in high-dimensional space, providing a convenient approach for discovering novel equimolar and non-equimolar HECs with desirable mechanical properties.
All these structures used for HT-DFT calculations maintained a single-phase rock salt structure with transition metal atoms randomly occupying cationic sites and carbon atoms occupying anionic sites, which were generated using the Python Materials Genomics (Pymatgen) package24(a schematic diagram of the crystal structure is shown in ESI S1†). The use of small unit cells can considerably improve the computational performance. Additionally, we also compared our results with published experimental data25,26 to verify the accuracy of DFT calculations. Koval et al.27 and Liu et al.28 also confirm the reliability of elastic property predictions using small unit cells.
| E = 9BG/(3B + G) |
| B = (Bv + BR)/2 |
| G = (Gv + GR)/2 |
| Bv = BR = (C11 + 2C12)/3 |
| Gv = (C11 − C12 + 3C44)/5 |
| GR = 5C44(C11 − C12)/4C44 + (C11 − C12) |
In the context of focusing solely on elastic responses, without considering plastic deformation and defects, the Vickers hardness was approximated from the elastic modulus as follows:30HV = 2(k2G)0.585 − 3; k = G/B.
To avoid sampling bias caused by randomness and ensure consistency in the sample distribution between the training and test sets, a random sampling method was employed to split the data for both the RF and CrabNet models, with 80% of the data allocated to the training set and the remaining 20% to the test set. The RF model was optimized with the following parameters: n_estimators = 300, random_state = 1, min_samples_split = 5, min_samples_leaf = 1, and the number of features considered for each split was set to the square root of the total number of features. A 10-fold cross validation was employed. The mean absolute error (MAE), mean squared error (MSE), and R2 scores were used to evaluate the performance of the RF model and CrabNet model.
![]() | ||
| Fig. 1 (a) Comparison of ROM results and HT-DFT calculations of Young's modulus for 495 carbides (contains 9 monocarbides, 108 ternary carbides, 252 quaternary carbides and 126 equimolar quaternary high entropy carbides). The results of ROM are based on the atomic ratio summation of the Young's modulus of 9 monocarbides (TaC, ZrC, HfC, VC, NbC, TiC, MoC, WC, and CrC); (b) Influence of 9 transition metal elements in 495 carbides on Young's modulus calculated by HT-DFT (only the elemental species are counted, detailed statistics are shown in Fig. 2). | ||
The influence of nine different transition metal elements (Ta, Zr, Hf, V, Nb, Ti, Mo, W, and Cr) on the Young's modulus of 495 carbides is shown in Fig. 1b. Each violin plot encompasses data for binary, ternary, and quaternary carbides; the median values of calculated Young's modulus reveal that Young's modulus is significantly impacted by elemental compositions. It is worth noting that carbides containing Ta and W exhibit the highest Young's modulus, indicating superior stiffness and resistance to deformation under applied stress. The following are carbides that incorporate Nb, Ti and V, which exhibit comparatively higher modulus but slightly lower than those of carbides with Ta and W. Carbides containing Hf and Mo demonstrate moderate values of Young's modulus, suggesting a decrease in stiffness compared to those mentioned previously, consistent with the results of Xia et al.39 The relatively lowest elastic moduli are observed in carbides that include Zr and Cr, implying that their addition may reduce the hardness of HECs and increase the plastic deformation ability. This trend emphasizes the clear dependence of mechanical properties on specific elemental types within the carbide composition.
To gain a deeper understanding of the relationship between elements and mechanical properties, we further investigated the effect of different amounts of elements on the elastic modulus. Fig. 2 illustrates the effects on Young's modulus, bulk modulus and shear modulus when the content of 9 transition metal elements increases from 0 to 75 at%, respectively. The red part in the violin diagram represents the Young's modulus, the blue part represents the bulk modulus, and the green part represents the shear modulus. As the concentration of certain elements increases, elements such as Ta, Nb, Ti, and V are observed to enhance the Young's modulus in carbides. Conversely, elements such as Hf, Mo, Zr, and Cr seem to decrease their Young's modulus. Consistent with the previous analysis, higher concentrations of Ta lead to the largest increase in Young's modulus due to its ability to significantly increase the bulk and shear modulus of carbides, followed by Nb. Although increasing Nb effectively improves the shear modulus of carbides, the bulk modulus shows almost no significant increase. Ti and V provide a modest increase in the Young's modulus of carbides. Unlike V, as the concentration of Ti increases, the bulk modulus of carbides gradually decreases, with the reduction outweighing the increase in shear modulus, the Young's modulus still shows an increasing trend, which is consistent with the findings of Lu et al.40 suggesting that the enhancement of Young's modulus of carbides may be primarily influenced by shear modulus, followed by bulk modulus. With increased concentrations of elements such as Hf, Mo, Zr, and Cr, the Young's modulus of carbides tends to decrease. The increase in Zr concentration significantly reduces the Young's modulus, bulk modulus, and shear modulus of carbides. An increase in Mo and Cr concentrations gradually increases the bulk modulus while decreasing the shear modulus and Young's modulus. Hf exhibits no significant impact on the shear modulus of carbides; however as its concentration increases, a significant reduction in bulk modulus is observed, leading to a slight decrease in Young's modulus. The influence of W on the mechanical properties deviates from the previous trends, as the bulk modulus significantly increases with W content. However, the shear modulus and Young's modulus initially increase and then decrease with increasing W content, showing optimal mechanical properties at around 50%.
Fig. 4a illustrates the correlation between the RF model predictions (using the Jarvis descriptor) for the bulk modulus of 495 carbides in both the training and test datasets and the bulk modulus values calculated using DFT. The RF model achieved a coefficient of determination (R2) value exceeding 0.99 on training data and 0.96 on test data, with the root mean square error (RMSE) and mean absolute error (MAE) of 1.9 GPa and 1.3 GPa on training data, indicating that the predicted bulk modulus from the RF model closely matches the results from DFT calculations. After confirming the prediction accuracy, the trained RF model using Jarvis descriptors was employed to predict the bulk modulus for 123 non-equimolar HECs, and the results are shown in Fig. 4b. The prediction accuracy for the bulk modulus of 123 non-equimolar HECs significantly decreased compared to its performance on 495 carbides, with an R2 of 0.78. Additionally, the RMSE and MAE of the model are 16.4 GPa and 14.0 GPa, respectively. As can be seen from Fig. 4b, the predicted values of the RF model are in good agreement with the DFT calculated results in the low modulus range (below 280 GPa). As the bulk modulus of carbides increases (280–360Gpa), the predicted values of the RF model are significantly higher than those of DFT calculations. Fig. 4c shows the performance of the RF model using Jarvis descriptors to predict the Young's modulus compared with DFT calculation results for 495 carbides. The model achieved an R2 of 0.92 on the training set, with an RMSE of 8.6 GPa and an MAE of 6.2 GPa. On the test set, the R2 dropped to 0.75, with RMSE and MAE increasing to 18.7 GPa and 13.8 GPa, indicating better accuracy on the training data but a decline on the test set. Then the trained RF model was used to predict the Young's modulus of 123 non-equimolar HECs and compared the results with DFT calculations, as shown in Fig. 4d. The predicted prediction accuracy of the RF model on the 123 non-equimolar HECs was significantly reduced, with an R2 below 0.6, and an RMSE and MAE of 27.8 GPa and 21.6 GPa, respectively. The prediction error was considerably higher than that on the training data, particularly in the low modulus range (<430 GPa) and high modulus range (>530 GPa), where data points became more scattered, and the error was significantly amplified when predicting the 123 non-equimolar HECs. This may be attributed to overfitting during the RF model training, as the training dataset primarily consists of low-dimensional carbides, while the trained RF model was applied to predict the Young's modulus of 123 non-equimolar HECs, the variations in composition ratios among these non-equimolar HECs introduced more complex non-linear relationships, limiting the RF model's accuracy and generalization ability. Consequently, the RF model was unable to capture the initial correlations between elements as effectively as anticipated, leading to lower prediction accuracy for non-equimolar HECs. The prediction results for the shear modulus are provided in ESI S6.†
The CrabNet model was used to predict the elastic modulus of 495 carbides and 123 non-equimolar HECs to compare its prediction accuracy with that of the RF model as depicted in Fig. 5. As shown in Fig. 5a, the prediction results of the CrabNet model are quite consistent with the results of DFT calculations. The predicted R2 of the bulk modulus exceeds 0.98 on both the training data and test data, with an RMSE and MAE of 2.5 GPa and 1.8 GPa, respectively, on the training set, which is comparable to the prediction accuracy of the RF model. Fig. 5b compares the prediction accuracy of the CrabNet model in predicting 123 non-equimolar HECs and the results from DFT calculations. The predicted R2, RMSE and MAE for bulk modulus are 0.83, 11.5 GPa and 9.0 GPa, respectively. It is evident that the prediction accuracy of the CrabNet model is significantly improved compared to that of the RF model, particularly in overcoming the problem of overestimating values in the high bulk modulus range encountered by the RF model, and shows good agreement with the results of DFT calculations, which avoids the overfitting of the ML model and reduces the MAE and RMSE of the non-equimolar HECs in bulk modulus prediction. Considering the limited experimental data on non-equimolar high-entropy carbides (HECs), we compared the bulk modulus of 123 non-equimolar HECs using DFT calculations. The bulk modulus predicted using the CrabNet model showed excellent agreement with the DFT calculations (detailed data are provided in ESI S8†). Fig. 5c shows a comparison between the CrabNet model's predictions and the DFT calculated Young's modulus. The model achieves an R2 value of 0.77 on the training data, with an RMSE of 17.6 GPa, and an MAE of 9.9 GPa. On the test data, the predicted RMSE and MAE values are 17.37 GPa and 11.72 GPa, respectively. Despite the CrabNet model showing a lower R2 on the training data compared to the RF model, the higher R2 on the test data indicates that the CrabNet model effectively overcomes the overfitting observed in the RF model. As depicted in Fig. 5d, the prediction performance of the CrabNet model on 123 non-equimolar HECs shows an R2 of 0.77, with RMSE and MAE values of 21.0 GPa and 17.4 GPa. The CrabNet model shows consistent prediction accuracy on the training data between the 495 carbides and the 123 non-equimolar HECs, with no significant decrease in R2. Additionally, there is no significant severe deviation in the high modulus range, and the RMSE and MAE for non-equimolar HECs are also lower than the RF model's results, indicating that the CrabNet model has better generalization ability than the RF model when extrapolated to quaternary non-equimolar HECs. This may be because the CrabNet model, with a neural network architecture incorporating transfer learning and self-attention mechanisms, can more effectively handle the complex non-linear relationships and data distribution variations in non-equimolar HECs. These capabilities enable it to learn correlations among elements in high-dimensional spaces, providing a powerful tool for exploring the compositional space of HECs. The prediction results for the shear modulus are provided in ESI S9.†
370 HECs based on ergodic combinations of 9 transition metal elements (Ta, Zr, Hf, V, Nb, Ti, Mo, W, and Cr). According to CrabNet's prediction results, the distributions of the Young's modulus and hardness values of the non-equimolar HECs are mapped in Fig. 6, and the points with different colors are used to distinguish the elastic strain to failure related to H/E. Fifteen types of HECs with top Young's modulus, hardness and H/E are highlighted. Ta24Hf3Nb2Ti3C32, Ta24Hf3NbTi4C32, Ta24Hf2NbTi5C32, Ta24Hf2Nb3Ti3C32 and Ta24Hf4NbTi3C32 exhibit the unique mechanical properties of ultra-high Young's modulus (>536 GPa), Ta24Hf5VNb2C32, Ta23Hf5VNb3C32, Ta23Hf4VNb4C32, Ta19Hf5V4Nb4C32, and Ta24Hf4V2Nb2C32 are found to be the hardest, with a predicted hardness greater than 29 GPa and Zr24Hf4VTi3C32, Zr24HfV4Ti3C32, Zr18Hf2V11TiC32, Zr16Hf4V11TiC32, and Zr24Hf3VTi4C32 show good wear resistance due to the high H/E (>0.06). Notably, the HECs containing more Ta elements exhibit higher Young's modulus and hardness, which is consistent with the analysis of previous DFT calculation results, suggesting that the introduction of Ta can effectively improve the mechanical properties of HECs. Contrary to expectations, high Young's modulus and high hardness did not result in high wear resistance, the H/E of HECs with the metal Zr tended to be higher than those without it, which implies that Zr can enhance the wear resistance of HECs, although previous calculations show that it has an insignificant effect on Young's modulus. The research results of Medveď et al.41 also confirmed that Zr-based composites have higher wear resistance. The addition of a small amount of the Hf element has a positive effect on increasing the Young's modulus, hardness, and wear resistance of HECs simultaneously. Similarly, a small amount of the V element enhances their hardness and wear resistance, while a small amount of the Ti element improves the Young's modulus and wear resistance. Our machine learning predictions exhibit a remarkable consistency with DFT results regarding the influence of elements on mechanical properties, which suggests that machine learning models adeptly capture complex element interactions within high-dimensional compositional spaces, enabling precise mechanical property predictions solely based on composition. This work can effectively reduce the research and development costs of HECs in the early stage of design and is expected to be applied to other high-entropy ceramic materials Fig. 6.
370 new HECs. The findings from HT-DFT calculations suggest that the introduction of additional elements such as Ta, Nb, Ti, and V may enhance the Young's modulus of HECs, and Zr-rich HECs show good performance in wear resistance, which is reflected in the prediction results of the ML model. The RF model and the CrabNet model are both trained to predict mechanical properties for non-equimolar HECs using the compositional features. The bulk modulus prediction accuracies of the CrabNet model and the RF model with Jarvis descriptors are remarkably similar on equimolar HECs. However, for non-equimolar HECs, the CrabNet model exhibits superior performance in predicting bulk modulus, with an R2 of 0.85, and RMSE and MAE values of 10.7 GPa and 8.8 GPa, respectively. For Young's modulus prediction, the CrabNet model's performance on non-equimolar HECs is significantly better than that of the RF model, with RMSE and MAE values of 21 GPa and 17.4 GPa, respectively, demonstrating better generalization ability and capacity to handle complex nonlinear relationships. The trained CrabNet model was employed to predict the mechanical properties of 566
370 HECs, including Young's modulus, hardness, and wear resistance. Fifteen novel HECs with the best mechanical properties were identified, including Ta24Hf3Nb2Ti3C32 with the highest Young's modulus of 537.4 GPa, Ta24Hf5VNb2C32 with the highest hardness of 29.4 GPa, and Zr24Hf4VTi3C32 with the best performance in wear resistance.
Our work aims to predict the mechanical properties of materials with arbitrary compositions, focusing on the intrinsic correlations among elements and avoiding complex structure-based descriptors, using the chemical formula as input. However, the valence electron concentration (VEC) of high-entropy materials profoundly influences their mechanical properties. In future research, we hope to incorporate more features based on chemical formulae to enhance the predictive accuracy of machine learning models. Additionally, the impact of compositional variations of elements on predictive accuracy is significant, as even a small amount of addition can substantially affect mechanical properties in doping. Our research provides a new path and theoretical basis for the development of high-entropy ceramics (HECs), showing potential applications to other high-entropy materials.
Footnote |
| † Electronic supplementary information (ESI) available: The data that support the findings of this study are openly available on GitHub, at https://github.com/ZhaoXi1209/HECs-Mechanical-Properties-prediction. See DOI: https://doi.org/10.1039/d4dd00243a |
| This journal is © The Royal Society of Chemistry 2025 |