Ankit 
            Anuragi‡
          
        
        
      , 
      
        
          
            Ankit 
            Das‡
          
        
      , 
      
        
          
            Akash 
            Baski
          
        
      , 
      
        
          
            Vinay 
            Maithani
          
        
       and 
      
        
          
            Sankha 
            Mukherjee
 and 
      
        
          
            Sankha 
            Mukherjee
          
        
       *
*
      
Department of Metallurgical and Materials Engineering, Indian Institute of Technology Kharagpur, Kharagpur, 721302, West Bengal, India. E-mail: sankha@metal.iitkgp.ac.in
    
First published on 29th April 2024
The manipulation of crystallographic defects in 2H-transition metal dichalcogenides (2H-TMDCs), whether pre- or post-synthesis, has garnered significant interest recently, as it holds the promise of tuning the thermal, chemical, and electronic properties of these materials. However, such desirable improvements often come at the cost of deteriorated elastic and inelastic properties, which may lead to serious concerns considering mechanical reliability issues. Therefore, persistent efforts are needed to explore the effects of energetically favorable vacancies on the mechanical properties of 2D TMDCs for an effective tuning of material properties for versatile applications. In this context, machine learning models trained on data based on molecular models can not only provide fast, efficient material models but also unearth crucial structure–property relations. However, such efforts are at an early stage of development. In this study, machine learning and deep learning techniques are used to analyze the mechanical properties of 2D transition metal dichalcogenides (TMDCs) in both pristine and defect forms. The goal is to predict failure stress, strain to failure, and strength based on chirality and strain. Various crystallographic defects were considered, and extensive molecular dynamics simulations were performed. XGBoost and densely connected neural network (DenseNet) algorithms were used to make accurate state-of-the-art predictions, and comparative evaluation and Shapley value analysis of both models are presented to improve interpretability.
In 2D TMDCs, point imperfections can serve as sites for electronic scattering, leading to alterations in the band arrangement and bending.3,5 Point defects can also cause recombination and restricted emission of photons. All of these phenomena are dependent on the unique electronic configuration of a particular imperfection. Interestingly, manipulating electrical properties in these materials through engineered chalcogen atomic defects can even lead to transitions from semiconducting to metallic behavior. Moreover, by intentionally creating chalcogen atomic defects on the outermost layer of 2D TMDCs, the generation of 2D magnets becomes feasible. For instance, introducing Se vacancies triggers the formation of Se-deficient line defects in VSe2, unexpectedly resulting in room-temperature ferromagnetism.6 Additionally, tensile deformation has been demonstrated as a viable method to adjust the electronic properties (from semiconductor to metallic behavior) and magnetic characteristics (from anti-ferromagnetism to ferromagnetism) of 2D TMDCs. Similarly, compressive and tensile strains were shown to control spin polarization and exchange coupling, useful for applications in nano-switches.7 Furthermore, the engineered nanopores within materials like MoS2 nanosheets play a crucial role in selectively allowing water molecules to pass while blocking salt ions, showcasing precise control over filtration performance and highlighting the biomimetic potential of nanotechnology to address global water scarcity challenges.8
2D TMDCs are inherently brittle, and defects can significantly deteriorate their performance, reducing their elastic moduli, strength, and toughness. For example, it was demonstrated that defects can not only impact the quasi-static strength of 2D TMDCs but also detrimentally influence their long-term performance, thus affecting reliability in device-level applications.9 Similarly, crystallographic defects were shown to significantly influence the fatigue strength of not only 2D TMDCs but 2D materials in general (such as graphene, graphene oxide, and 2D hybrid organic–inorganic perovskites).10,11 Therefore, as 2D TMDCs are pushed toward more diverse and extreme applications with a host of functionalities, understanding the role of defects in their mechanical performance needs more careful attention and predictive modeling is desired.6 While density functional theory calculations of mechanical properties are very accurate, these computations incur significant computational expenses and are limited to pristine materials at absolute zero temperatures. Alternatively, although ab initio molecular dynamics simulations enable probing materials at finite temperatures, such endeavors are prohibitively costly, and there are only a handful of such attempts for forecasting the mechanical properties of pristine 2D materials. In this context, classical molecular dynamics calculations12 emerge as a cost-effective alternative, enabling comprehensive parametric analysis encompassing factors like temperature and the strain rate on failure stress, failure strain, and toughness. Furthermore, these calculations also furnish unparalleled insights into the intricate mechanisms underpinning failure within atomically thin materials. However, thus far, these techniques have been employed to study the inelastic properties of 2D TMDCs on a case-by-case basis, which hinders efficient predictive modelling and optimization, especially at the fundamental level.
With remarkable progress in computational sciences, machine learning (ML) algorithms have enabled the development of ingenious strategies in materials design. ML models have also demonstrated their efficacy in predicting mechanical properties. By utilizing data from molecular dynamics (MD) simulations, deep learning algorithms can identify and understand the intrinsic connections between mechanical properties and various influencing factors. This, in turn, enables the prediction of crucial properties for desired materials. Notable efforts in this area include the prediction of the bulk moduli of metal–organic frameworks,13 nonlinear mechanical characteristics of cellular mechanical metamaterials,14 multi-scale constitutive elastic relations for nanoporous materials,15 microscale elastic strain field of three-dimensional (3D) high contrast elastic composites,16 prediction of dynamic crack propagation patterns in 3D solids, and mechanical property prediction of 3D spider webs.17 In the context of 2D material mechanics, however, only a limited number of attempts have been made to predict and employ ML so far. In their seminal work, Yang et al. employed graph neural networks to predict the effective tensile modulus of porous graphene.18 More recently, Jin et al. harnessed support vector machines to predict the mechanical properties of nickel–graphene nanocomposites.19 Xu et al. employed the extreme gradient boosting (XGBoost) algorithm to estimate the tensile strength of polycrystalline monolayer graphene oxide.20 Also, Malakar et al. employed ML techniques to predict the fracture properties of cracked 2D Mo- and W-based chalcogenides.21 All these endeavors offer an efficient multi-scale approach, employing machine learning models on mechanical property data derived from atomistic modeling for the purpose of rational design of structures and devices.
In this study, we have employed machine learning and deep learning techniques to study the inelastic properties of pristine and defective 2D TMDCs. The objective is to predict chirality-dependent failure stress, strain to failure, and toughness for 2D TMDCs in their pristine and defective forms as a function of strain rate. The selection of these materials was motivated by the objective to encompass semiconducting, metallic, magnetic, and nonmagnetic systems with a diverse array of chalcogen elements exhibiting varying electronegativities, as well as transition metal elements characterized by a spectrum of unpaired d electrons. Several crystallographic point defects were taken into account, including an isolated vacancy of a transition metal atom (VM), a lone chalcogen vacancy (VC), double vacancies involving chalcogen elements (V2C), and a defect comprising one transition metal atom along with two chalcogen vacancies (V1M-2C). A comprehensive set of classical molecular dynamics (MD) simulations, totaling ∼666, were conducted at strain rates ranging from 107 s−1 to 1010 s−1. Firstly, we employed the XGBoost22 algorithm due to its ability to handle complex relationships in data, effectively capture feature importance, and deliver accurate predictions for challenging structural integrity scenarios. Additionally, the densely connected neural network23 algorithm was employed to improve the declined accuracy caused by the vanishing gradient in high-level neural networks. We provide a succinct comparative evaluation of DenseNet and XGBoost models, accompanied by a comprehensive summary of the entire model architecture and its key findings. Furthermore, we conducted a Shapley value analysis to elucidate the inner workings and interpretability of both DenseNet and XGBoost models.
| Features | Values | 
|---|---|
| Metal atomic mass | Sc (44.96), Ni (58.69), V (50.94), Cr (51.99), Nb (92.90) | 
| Chalcogen atomic mass | O (16.00), S (32.06), Se (78.96), Te (127.6) | 
| Chiral directions | Armchair, zigzag | 
| Strain rates (s−1) | 1010, 109, 108, 107 | 
| Defects types | Pristine, VM, VC, V2C, V1M-2C | 
| Number of unpaired electrons | Sc (1), Ni (2), V (3), Cr (6), Nb (5) | 
| Electronegativity values of metals and chalcogens | Sc (1.36), Ni (1.91), V (1.63), Cr (1.66), Nb (1.6), O (3.44), S (2.58), Se (2.55), Te (2.1) | 
|  | ||
| Fig. 1 (a) Details regarding the DenseNet model architecture. (b) Schematic representation of the DenseNet model. | ||
As shown in Fig. 1(b), the DenseNet model was developed with three building blocks connected in succession. Each block consists of three consecutive batch normalized and regularized dense layers,32 where the output of each block is concatenated with the input layer and concatenated layer of the previous block's output to form the input of the next block. In this way, the last block contains all the blocks and layers previously used as input for the model. A single block unit is composed of one linear (Dense) unit with ‘ReLU’ activation,33 a kernel initializer, and a kernel L2 regularizer34 of 0.01. L2 regularization was applied to the model during the training process to mitigate overfitting. This regularization technique enhances the generalization ability of the model and prevents it from excessively memorizing the training data. Furthermore, as an optimizer in this study, Adam35 is employed. Adam is a stochastic gradient descent method that performs adaptive estimation of the 1st and 2nd order moments of the gradients. By leveraging Adam, the convergence of the model during training is enhanced, and it aids in finding better optima in the parameter space for improved performance in stress–strain curve prediction for 2D TMDCs.
One of the main objectives of this research was to determine how a single feature or multiple combinations of features actually change the output of a model. A game-theoretic approach was employed to understand and interpret each feature by systematically combining it with the remaining 8 features. For each feature, it was excluded from the feature set first, and various combinations or elements from the superset of features were considered, averaging over the results. These averages were then compared with the expected output of the model with all 9 features combined. The mathematical model of Shapley can be represented with the impact on model output with Shapley values given by  where S is all the subsets of the feature excluding feature i from the feature space. The difference of expectations of outputs of these feature spaces can be calculated as Shapley values to quantify the ‘impact on model output’ parameter. M is all the 9 features together as a set. Here, fx implies the expected values. The Shapley values were calculated on 5000 test data points for both DenseNet and XGBoost interpretability.
 where S is all the subsets of the feature excluding feature i from the feature space. The difference of expectations of outputs of these feature spaces can be calculated as Shapley values to quantify the ‘impact on model output’ parameter. M is all the 9 features together as a set. Here, fx implies the expected values. The Shapley values were calculated on 5000 test data points for both DenseNet and XGBoost interpretability.
 where N represents the total number of sample points, ypi denotes the predicted output for each set of samples, yi stands for the actual target data, and α represents the L2 regularisation parameter. The first term in the loss function calculates the MSE between the predicted output and the actual target data. In contrast, the second term applies L2 regularisation to penalize the weights (W) to prevent overfitting and enhance model generalization.
 where N represents the total number of sample points, ypi denotes the predicted output for each set of samples, yi stands for the actual target data, and α represents the L2 regularisation parameter. The first term in the loss function calculates the MSE between the predicted output and the actual target data. In contrast, the second term applies L2 regularisation to penalize the weights (W) to prevent overfitting and enhance model generalization.
        The coefficient of determination (R2) was used to evaluate the performance of different models. R2 is a statistical metric that indicates how well the predictions of the model fit the actual data. A higher R2 value signifies a more accurate forecast. The R2 is given by  where Y represents the average of the target data across all samples. By comparing the R2 values, we can assess the predictive performance of the model in stress–strain curve prediction for 2D TMDCs. The regularization parameter α plays a crucial role in controlling overfitting by penalizing large weights and promoting a more generalizable model.
 where Y represents the average of the target data across all samples. By comparing the R2 values, we can assess the predictive performance of the model in stress–strain curve prediction for 2D TMDCs. The regularization parameter α plays a crucial role in controlling overfitting by penalizing large weights and promoting a more generalizable model.
A notable positive correlation between the number of unpaired electrons in transition metals and their valence electrons is observed. However, this trend does not extend to the relationship between valence electrons and mechanical properties such as failure stress and toughness. In these cases, valence electrons demonstrate a negligible and inverse correlation with both toughness and failure stress. Given these findings, analysis of this study is focused on the ‘number of unpaired electrons in transition metals’ as the primary feature of interest. The approach was strategically shifted, replacing the ‘number of valence electrons in transition metals’ with the ‘number of unpaired electrons in transition metals’ as the key variables. This alteration led to a significant improvement in the predictive accuracy of both the DenseNet and XGBoost models. The enhanced performance, along with the conclusive results and insights gained from the model analyses, affirm the effectiveness of this modification.
A substantial positive correlation exists between the chalcogen atomic mass and chalcogen size. Consequently, to avoid redundancy in the model, it was a more logical approach not to include both features in our trained models. The decision to exclude the chalcogen size as a feature is rooted in the recognition that highly correlated features can introduce redundancy and potentially lead to overfitting, undermining the interpretability and generalization capabilities of the model.
The predictions generated by the DenseNet model are presented in Fig. 4 and 5. These figures display stress–strain curves, contrasting the predicted (red curve) and actual (black curve) behaviors. Each set of plots, characterized by distinct subfigures, corresponds to specific pattern types and trends as described in their respective figure captions. Fig. 4 presents the stress–strain response of pristine and defective (pristine, V2C, VC, VM, and V1M-2C, from top to bottom) V-based 2D TMDCs with different chalcogen elements (O, S, Se, and Te, from left to right) for a strain rate of 108 s−1 along the armchair direction. Similarly, Fig. 5 presents the stress–strain response of pristine and defective (V2C, VC, VM, and V1M-2C, from top to bottom) Se-based 2D TMDCs containing Cr, Nb, Ni, Sc, and V (from left to right) as the transition metal, for a strain rate of 107 s−1 along the armchair direction. Upon meticulous examination of Fig. 4 and 5, an interesting observation emerges: the overall predicted stress–strain curves across all subfigures within Fig. 4 and 5 closely mirror the corresponding actual stress–strain curves. This consistent alignment underscores the robust predictive capability of the DenseNet model. However, subtle deviations emerge near the point of failure. Notably, these discrepancies are particularly noticeable at the failure point, and minor deviations are discernible in the mid-section of the stress–strain curve. Moreover, Fig. S2 (ESI†) presents the stress–strain predictions for various pristine samples, considering both chiral orientations (armchair and zigzag) and incremental strain rates.
In the course of the hyperparameter tuning process, attention was directed towards several deep learning hyperparameters, including the activation function, batch size, and learning rate. The selection of optimal hyperparameters was guided by assessing the performance of the model on the validation set, which served as a pivotal evaluation metric. The approach of this study involved a systematic exploration of various combinations of these hyperparameters, with the primary objective being the identification of a configuration that not only maximized the performance of the model but also yielded the most favorable results. Fig. 6 displays scatter plots depicting the relationship between predicted and actual values for failure stress, failure strain, and toughness. Notably, these scatter plots are exclusively generated for the test set, and the predictions are quite good, with a highest R2 of 0.98 in the case of failure stress. However, in Fig. 6, certain data points, particularly evident in the scatter plot correlating failure stress and toughness, exhibit notable deviations from the overarching linear trend.
|  | ||
| Fig. 6 Scatter plots of predicted (using DenseNet) and actual values for (a) failure strain, (b) failure stress, and (c) toughness. | ||
Additionally, the DenseNet model yielded a favorable R2 score of 0.98, 0.98, and 0.95 on the training, validation, and testing sets, respectively, indicating its strong predictive capabilities. The R2 scores for DenseNet, incorporating the valence electrons of the transition metal as a feature in lieu of the number of unpaired electrons of the transition metal, are 0.97, 0.97, and 0.95 for the training, validation, and testing sets, respectively. Despite the comparability of R2 scores, it is noteworthy that, in terms of performance, the predictions were not as satisfactory as those achieved with the inclusion of the number of unpaired electrons of the transition metal.
The predictions generated by the XGBoost model are presented in Fig. 7 and 8. Fig. 7 presents the stress–strain response generated by XGBoost for pristine and defective (pristine, V2C, VC, VM, and V1M-2C, from top to bottom) V-based 2D TMDCs with different chalcogen elements (O, S, Se, and Te, from left to right) for a strain rate of 108 s−1 along the armchair direction. Similarly, Fig. 8 presents the stress–strain response of pristine and defective (V2C, VC, VM, and V1M-2C, from top to bottom) Se-based 2D TMDCs containing Cr, Nb, Ni, Sc, and V (from left to right) as the transition metal, for a strain rate of 107 s−1 along the armchair direction. Furthermore, the stress–strain predictions for a range of pristine samples are shown in Fig. S3 (ESI†), which takes into account incremental strain rates as well as chiral orientations (zigzag and armchair).
The stress–strain curves generated by the XGBoost model exhibit noticeable deviations from the actual curves, particularly in the vicinity of the near-failure point, similar to DenseNet predictions. The discrepancies are evident when comparing the stress–strain curves depicted in Fig. 8, where the model falls short of accurately capturing the behavior, as indicated by the superior performance of the DenseNet predictions illustrated in Fig. 5. Further examination reveals that the stress–strain curves in Fig. 7 exhibit a relatively improved correspondence with the actual data, surpassing the predictive capabilities demonstrated in Fig. 8. However, challenges persist, particularly in Fig. 8, where the XGBoost model encounters difficulties in fully capturing the nuances of the near-failure point. This discrepancy is evident as the model fails to accurately replicate the actual stress–strain path shown in the figures.
These observations suggest a need for refinement or adjustment in the XGBoost model, particularly in its capacity to precisely model the near-failure region of stress–strain curves. Addressing this issue may contribute to a more accurate representation of the material behavior, aligning the predictions of the model more closely with the actual data. As shown in Fig. 9, the feature importance plot highlights the significance of the number of unpaired electrons of the transition metal as a crucial feature for prediction. This observation emphasizes the relevance of electron configuration in influencing the material properties under consideration, substantiating its importance in the predictive capabilities of the XGBoost model.
Additionally, Fig. 10 presents scatter plots that illustrate the correlation between predicted and actual values for failure stress, failure strain, and toughness. Notably, the R2 values associated with these scatter plots do not surpass the R2 values exhibited in the scatter plots generated for DenseNet (refer to Fig. 6). This indicates a comparative performance lag in the predictive capabilities of the XGBoost model. The observed discrepancy in R2 values suggests that, in the context of failure stress, failure strain, and toughness predictions, the XGBoost model may not be performing as effectively as the DenseNet model. Further analysis and potential adjustments to the XGBoost model may be necessary to enhance its predictive accuracy in capturing the intricate relationships between input features and the material properties of interest.
|  | ||
| Fig. 10 Scatter plots of predicted (using XGBoost) and actual values for (a) failure strain, (b) failure stress, and (c) toughness. | ||
However, the XGBoost model produced exceptional R2 scores of 0.99 on the training set and 0.99 on the development set. Furthermore, an R2 score of approximately 0.93 was achieved on the test set. The stress–strain and scatter plots have been exclusively generated for the test set, and the outcomes demonstrate exceptional performance in terms of R2 score. The presence of serrations near the point of failure in the stress–strain plots generated by XGBoost can be attributed to the limitation of the model in accurately capturing the abrupt failure, which leads to the observed irregularities in the plots. However, the overall results attained from the evaluation of the unseen test set reveal the impressive accuracy and efficacy of the XGBoost model in predicting the aforementioned parameters.
Fig. 11 presents the box plots of the mean Shapley values for the XGBoost and DenseNet models. These plots illustrate the absolute impact of 9 feature metrics on the model output, providing a clear depiction of the intensity of the impact of each feature. 5000 data points for both XGBoost and DenseNet from the test data were utilized to gain inherent model interpretability, demonstrating that, with the same dataset from the test set, DenseNet can extract valuable insights and prove more robust for predicting test results compared to the XGBoost model. Fig. 11 displays the impact values of DenseNet and XGBoost, where both models exhibit distinct orders of absolute impact values for all features.
|  | ||
| Fig. 11 The impact of features on both XGBoost and DenseNet outputs for (a) and (b) failure stress, (c) and (d) toughness, and (e) and (f) failure strain, respectively. | ||
In the realm of transition metal dichalcogenides (TMDCs), the intrinsic bond strength is significantly contingent upon the charge transfer characteristics between the constituent chalcogen and transition metal atoms.40,41 Empirical evidence suggests a diminution in charge density from sulfides to tellurides, implying a gradual weakening of covalent bonds within the basal plane.42 This phenomenon underscores the pivotal role of charge transfer in modulating bond robustness, corroborated by the correlation between bond strength and the electronic configuration, specifically the number of unpaired electrons in the atomic structure.
Additionally, a larger difference in electronegativity between two bonding atoms typically makes the bond stronger, as it tends to be more ionic. This implies that 2D sulfides should be stronger than selenides and tellurides. For example, as shown in Fig. 4 and Fig. S4 (ESI†), in terms of strength, VS2 > VSe2 > VTe2 and CrS2 are stronger than CrSe2 and CrTe2. This conclusion is in agreement with previous reports on pristine 2D TMDCs wherein an identical trend was observed for elastic constants and microhardness.43,44 This same trend remains intact for samples with isolated defects as well. Future explorations should look into the effect of interacting defects.
The beeswarm plots shown in Fig. 12 provide a visual representation of all the Shapley values, with the Y-axis organizing values based on features. Each group on the Y-axis is color-coded according to feature values, with higher feature values appearing in a redder hue. These plots facilitate the comprehension of inter-feature relationships. In the XGBoost model, for instance, in the case of the number of unpaired electrons in transition metals, an increase in feature values corresponds to a rise in the Shapley value (see Fig. 12(a), (c) and (e)). This implies that higher values of unpaired electrons are associated with increased material failure stress and toughness. Conversely, materials with fewer unpaired electrons are predicted to have lower toughness in TMDCs, aligning with the previously reported impact of charge transfer. Similarly, the strain rate exhibits a positive association with failure stress, strain, and toughness, suggesting enhanced mechanical properties at higher deformation rates.
|  | ||
| Fig. 12 The impact of features on both XGBoost and DenseNet outputs for (a) and (b) failure stress, (c) and (d) toughness, and (e) and (f) failure strain, respectively. | ||
In terms of void size concerning toughness, failure strain and failure stress values, a larger value corresponds to diminished material properties. This suggests that voids act as stress concentrators, promoting crack initiation and propagation, an observation in line with previous observations.9,45,46 However, as shown in Fig. 11, the effect of void size is more pronounced on toughness and failure strain than on failure stress. Notably, the atomic mass of both metals and chalcogens also negatively impacts toughness, potentially due to its influence on material density and interatomic forces. Fig. 12(f) further reveals intriguing nuances in the ’behavior of the model concerning failure strain. Unlike toughness, the number of unpaired electrons exhibits a moderately negative influence on failure strain. This suggests a trade-off between intrinsic strength and ductility, where higher electron delocalization enhances strength but may reduce deformability. Interestingly, the strain rate again demonstrably enhances failure strain, highlighting its proportional influence on material deformation, an observation in agreement with previous reports on 2D materials.47–49 According to thermal activation theory by Eyring,50 the time to failure of a solid under load, which is directly related to the strain rate, can be represented as  where
 where  is the pre-exponential factor, τ0 is the vibrational period of atoms in the solid, ns is the number of sites available for the state transition, U0 represents the interatomic bond dissociation energy, σ is the stress applied, k is Boltzmann's constant, T denotes the absolute temperature, and finally V is the activation volume. Therefore, according to this model, for a thermally activated failure process, which is the case for 2D TMDCs, an increase in failure stress can reduce the effective energy barrier for bond breaking, thereby allowing for failure in a commensurate time at higher strain rates.
 is the pre-exponential factor, τ0 is the vibrational period of atoms in the solid, ns is the number of sites available for the state transition, U0 represents the interatomic bond dissociation energy, σ is the stress applied, k is Boltzmann's constant, T denotes the absolute temperature, and finally V is the activation volume. Therefore, according to this model, for a thermally activated failure process, which is the case for 2D TMDCs, an increase in failure stress can reduce the effective energy barrier for bond breaking, thereby allowing for failure in a commensurate time at higher strain rates.
The DenseNet model, while also recognizing the importance of the total number of unpaired electrons, assigns greater significance to the electronegativity difference. This suggests that DenseNet may be more sensitive to the nuances of atomic interactions, such as the bond strength variations caused by differences in electronegativity, which is in line with the scientific expectation that a greater difference should lead to stronger bonds. DenseNet identifies chirality as a moderately influential factor, suggesting that the arrangement of atoms in the crystal lattice can affect the mechanical behavior of TMDCs. However, previous density functional theory-based calculations on the mechanical behavior of graphene,51 MoS2, and WSe29 indicate that at the ground state (i.e., 0 K) these systems possess pronounced strength in the armchair direction compared to the zigzag direction, unlike the MD results at 300 K reported here. This could be a limitation of the interatomic potential employed or a finite temperature effect arising from different levels of anharmonicity in these materials.
Both XGBoost and DenseNet highlight the critical role of the electronic structure in determining the mechanical properties of TMDCs. However, the disparity in their assessment of features like electronegativity difference and chirality underscores the unique perspectives from which these models approach the problem. The deep learning architecture of DenseNet appears to provide a more detailed understanding of structural impacts, potentially offering superior interpretability in complex scenarios. Conversely, the faster processing and simpler structure of XGBoost may be advantageous for rapid predictions and scenarios where interpretability is less critical.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 leaves. In contrast, DenseNet utilizes an iterative backpropagation algorithm, propagating gradients throughout the network to update a considerably larger number of trainable parameters, around 3.8 million.
000 leaves. In contrast, DenseNet utilizes an iterative backpropagation algorithm, propagating gradients throughout the network to update a considerably larger number of trainable parameters, around 3.8 million.
        These findings suggest that, with the appropriate selection of features and hyperparameters, even a machine learning model such as XGBoost can achieve highly accurate predictions. While DenseNet training can be expedited by reducing computational complexity using GPUs, the experiments were conducted on a 16 GB CPU, which proved sufficient for our purposes. Overall, this work showcases the importance of optimizing model architecture, feature selection, and hyperparameters to harness the predictive capabilities of both DenseNet and machine learning models like XGBoost. These insights contribute to the ongoing advancements in the field and offer promising avenues for future research and applications.
Noteworthy is the inclusion of Shapley value analysis in our work, providing not only trends but also invaluable physical insights not readily attainable through alternative methods. Shapley analysis serves as a pivotal tool in demystifying the so-called ‘black box’ nature of machine learning models. By providing interpretable insights, Shapley analysis enhances the practical applicability of our findings, offering a nuanced understanding of the underlying physical processes. The investigation employed in this study reveals a notable correlation between an increase in the number of unpaired d electrons in transition metals and elevated values of failure stress and toughness. This trend is in harmony with previous findings,42 wherein density functional theory calculations revealed that an increased level of charge transfer from the transition metal atom to the chalcogen atoms leads to stiffer and stronger M–X bonds. A similar effect was picked up by both the machine learning models wherein the increased number of unpaired d electrons was found to promote heightened failure stress, failure strain, and toughness.
Overall, the Shapley value analysis of both DenseNet and XGBoost provided very similar feature importance trends and hinted that gradient boosting algorithms like XGBoost or AdaBoost might be good alternatives to deep neural network-based models, such as DenseNet or FFNN, in constrained resources. The specific application of Shapley aimed to interpret the models of XGBoost and DenseNet by elucidating the impact of features on the model output. SHAP variants such as TreeShap, KernelShap, GradientShap, or DeepShap may yield different Shapley values; it is anticipated that the respective features, ranked according to their Shapley values, should exhibit a consistent order. This acknowledgment underscores the recognition that diverse SHAP variants can offer distinct perspectives on feature importance, yet the overall hierarchy of influential features remains relatively stable across these different methodologies.
While this study is focused on a specific family of 2D materials, such as, in this case, dichalcogenides, the proposed methodology can be explored for other types of chalcogenide-based systems (such as monochalcogenides), MXenes, and MBenes. This stands as a future scope for us, thereby expanding the reach of these models to include a broader spectrum of heteroatomic systems. This expansion would enable a more comprehensive understanding by encompassing a diverse array of materials. Additionally, our examination has primarily considered the effect of isolated vacancies placed at the center of the samples, studied using MD. However, future work could explore more realistic scenarios where defects are distributed across the entire material, including hetero-defects with varied defect distributions. Such investigations would address the complexities arising from distributed defects and their interactions with strain fields.
Finally, it is crucial to highlight one important aspect that the interpretability of machine learning models is intricately tied to the quality of input data, often generated by classical MD. While a clearer picture could be painted using ideal strength data from ab initio calculations, it is essential to acknowledge that this approach is somewhat prohibitive in terms of cost. Mechanical property predictions through ab initio MD, although potentially insightful, have unfortunately been largely neglected due to these cost constraints. So, while our results provide valuable insights, let us approach them with a touch of nuance, appreciating the intricate interplay between machine learning and classical MD that shapes the landscape.
| Footnotes | 
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cp00974f | 
| ‡ These authors contributed equally. | 
| This journal is © the Owner Societies 2024 |