Yiwen Zhenga,
Agni K. Biswala,
Yaqi Guob,
Prakash Thakolkaranb,
Yash Kokanec,
Vikas Varshneyd,
Siddhant Kumar
b and
Aniruddh Vashisth
*a
aDepartment of Mechanical Engineering, University of Washington, Seattle, WA, USA. E-mail: vashisth@uw.edu
bDepartment of Materials Science and Engineering, Delft University of Technology, 2628 CD Delft, Netherlands
cDepartment of Materials Engineering, Indian Institute of Technology Gandhinagar, Palaj, Gandhinagar, India
dMaterials and Manufacturing Directorate, Air Force Research Laboratory, Wright-Patterson Air Force Base, OH, USA
First published on 28th July 2025
Vitrimers represent an emerging class of sustainable polymers with self-healing capabilities enabled by dynamic covalent adaptive networks. However, their limited molecular diversity constrains their property space and potential applications. Recent developments in machine learning (ML) techniques accelerate polymer design by predicting properties and virtually screening candidates, yet the scarcity of available experimental vitrimer data poses challenges in training ML models. To address this, we leverage molecular dynamics (MD) data generated by our previous work to train and benchmark seven ML models covering six feature representations for glass transition temperature (Tg) prediction. By averaging predicted Tg from different models, the model ensemble approach outperforms individual models, allowing for accurate and efficient property prediction on unlabeled datasets. Two novel vitrimers are identified and synthesized, exhibiting experimentally validated higher Tg than existing bifunctional transesterification vitrimers, along with demonstrated healability. This work explores the possibility of using MD data to train ML models in the absence of sufficient experimental data, enabling the discovery of novel, synthesizable polymer chemistries with a wide range of desirable properties. The integrated MD–ML approach offers polymer chemists an efficient tool for accurate property prediction and designing polymers tailored to diverse applications.
Since the introduction by Leibler et al.,2–5 a class of healable polymers called vitrimers have gained significant attention as a potential solution to the plastic pollution issue. Combining the reprocessability of thermoplastics and the superior mechanical properties of thermosets, vitrimers can potentially reduce plastic production and costs in polymer recycling. Vitrimers feature dynamic covalent adaptive networks (CANs), which enable building blocks to attach to and detach from each other via bond exchange reactions, imparting healability and recyclability (Fig. 1a). Vitrimers can be classified based on the types of bond exchange reactions including transesterification, disulfide exchange, and imine exchange reactions.4 However, the thermo-mechanical properties of currently available vitrimers are constrained by the limited variety of commercially available molecular building blocks (i.e., monomers) used in their synthesis. Therefore, to broaden the applications of vitrimers, it is essential to efficiently identify the available or synthesizable monomers that constitute vitrimers with desirable properties.
Traditionally, novel polymer discovery has been accomplished in a trial-and-error fashion.6 For a given set of polymers, chemists characterize the properties of each one of them by experiments and select those with desired properties. Since experimental synthesis of novel polymer chemistries is costly, molecular dynamics (MD) simulations have been favored as an alternative method to characterize polymers. MD is a simulation technique that bridges quantum mechanics and classical mechanics, and it has been extensively utilized to accelerate the discovery process.7 MD simulations have provided valuable insights into how polymer molecular structures influence mechanical properties,8 glass transition temperature,9 and self-healing performance.10 Despite these advancements, large-scale computational screening using MD or other simulation techniques remains computationally expensive even with the progress in high-performance computing.11 Consequently, the exploration of polymer design space is typically constrained and fails to cover the vast design space of molecular compositions.
Recent developments in machine learning (ML) methods have opened a new path to accelerate polymer discovery by orders of magnitude. ML models are trained using available data to predict polymer properties (labels) from molecular structures (features). The trained models are employed to predict the properties of unlabeled polymers (whose properties are unknown) and the polymers with promising predicted properties are selected for further validation. This process is also known as virtual screening and has been applied to design polymers with various properties including glass transition temperature (Tg),12 thermal conductivity,13 bandgap,14 and gas uptake properties.15 Generally, there are three considerations when applying machine learning to polymer property prediction:
• Data: high-quality datasets are necessary for training accurate ML models. The largest polymer database PolyInfo16 contains ∼18000 polymers with experimentally measured properties but lacks sufficient data on specific properties. For example, there are only 173 data points for thermal conductivity. As an experimental database, PolyInfo also lacks recently synthesized polymers. To address these challenges, high throughput MD simulations allow for creating a large and consistent dataset to train ML models. Researchers have established and curated polymer datasets of various properties including glass transition temperature,17 free volume,18 thermal conductivity,13 and ionic conductivity.19 ML models have been trained using these datasets for virtual screening of polymers and novel insights into structure–property relationships have been identified.
• Representations: monomers or polymer repeating units consist of atoms and bonds, which, in the original form, are not interpretable to ML models. To achieve accurate predictions, it is essential to select an appropriate way to convert molecular structures into machine-readable numerical feature representations. Molecular descriptors, which are theoretically derived values incorporating physicochemical information of molecules, serve as a commonly used representation.20,21 Another prevalent representation is molecular fingerprints, which are vectors with substructure occurrences as entries.22 Molecular structures can also be described using string notations such as the simplified molecular-input line-entry system (SMILES).23 Graph-based representations have drawn increasing attention due to the inherent graph nature of molecules, where atoms or substructures serve as nodes and bonds as edges. The selection of the best representation type is typically case-specific and requires empirical trials.
• ML models: the choice of ML models is partially dependent on the selected representation of polymers. For example, graph neural networks (GNNs) are specialized for graphs while transformers are more suitable for sequence-based input such as SMILES.24,25 Molecular descriptors and fingerprints are more universal and accommodate both shallow and deep ML models such as linear regression, random forest, and feed-forward neural networks. Similar to representations, the choice of best ML models is also empirical.
Despite the growing applications of ML for polymers, efforts to design vitrimers using ML remain limited. Yan et al.26 employed a variational autoencoder (VAE)-based virtual screening approach to identify vitrimers with desirable glass transition temperature (Tg) and recyclability. In their method, vitrimers were encoded as latent vectors using a pretrained VAE, while a feedforward neural network (FFNN) predicted Tg from these latent vectors. However, although the VAE was pretrained on ∼420000 drug-like molecules, the property predictor was trained on a small experimental dataset (389 data points for Tg) manually curated from the literature, resulting in limited predictive accuracy. In our previous work,17 we addressed this limitation by conducting large-scale molecular dynamics (MD) simulations to calculate the Tg of 8424 hypothetical vitrimers and calibrating calculated Tg against available experimental data. This dataset, along with an unlabeled set of ∼1 million hypothetical vitrimers, was used to train a VAE for the inverse design of vitrimers with targeted Tg. While the generative model successfully learned the distribution of valid vitrimers and generated novel chemistries, it did not explicitly account for synthesizability, making the proposed vitrimers difficult to synthesize. In contrast, the virtual screening approach ensures the synthesizability of designed vitrimers by restricting the search space to a predefined unlabeled dataset consisting of vitrimers derived from commercially available monomers.
This study focuses on establishing a ML model informed by MD data for accurate and efficient prediction of vitrimer Tg. The trained model is subsequently employed to identify novel bifunctional transesterification vitrimers composed of carboxylic acids and epoxides in an equimolar ratio, which is the widely adopted ratio used to synthesize transesterification vitrimers in the literature.27–31 Leveraging open source MD-generated data from our previous study,17 we systematically evaluate the performance of ML models in predicting vitrimer glass transition temperature (Tg). We benchmark seven ML models (linear regression, random forest, support vector regression, gradient boosting, feedforward neural network, graph neural network, and transformer) across six molecular representations, including molecular fingerprints, RDKit20 descriptors, Mordred21 descriptors, Mol2vec embeddings,32 SMILES, and graphs. Our findings indicate that an ensemble learning approach, which averages predictions from multiple models, outperforms individual models (Fig. 1a). Our analysis of feature importance further provides interpretability by identifying key factors that influence the Tg of vitrimers. Using the trained ensemble model, we screen an unlabeled vitrimer dataset containing approximately 1 million hypothetical vitrimers, identifying candidates with extreme (high and low) Tg values beyond the training domain, which are further validated through MD simulations (Fig. 1b). To ensure synthesizability, we conduct an additional screening of vitrimers composed of commercially available carboxylic acids and epoxides, selecting promising candidates for experimental evaluation. Two novel vitrimers are synthesized and characterized, demonstrating experimentally higher Tg than any bifunctional transesterification vitrimer reported in the literature (Fig. 1c). The simulated and experimentally characterized Tg values validate the ability of the model to expand the property space of available vitrimers. This study demonstrates the feasibility of using MD-generated data to train ML models in the absence of available experimental data and successfully applies this approach to the design and validation of novel vitrimers with desirable properties through experimental synthesis and characterization. The proposed framework serves as an effective tool for polymer chemists to design synthesizable polymers with tailored properties for diverse applications.
In this work, we also curate a synthesizable dataset to facilitate the experimental synthesis of screened vitrimers. This dataset is constructed by selecting 37 commercially available bifunctional carboxylic acids and 7 bifunctional epoxides from the Sigma-Aldrich website,34 yielding a total of 259 vitrimers that can be synthesized using off-the-shelf chemicals. The sizes and sources of all three datasets are provided in Table 1.
• For molecular fingerprints, the repeating units are transformed into 2048 bit Morgan fingerprints22,35 with a radius of 3 using the RDKit package.20 Each bit in the fingerprint vector represents the frequency of a specific chemical substructure within the vitrimer, a method known as frequency-based molecular fingerprints. This approach has been shown to outperform traditional binary fingerprints, where each bit only indicates the presence (1) or absence (0) of a substructure.36 We select 200 substructures that appear most frequently across the labeled dataset. Substructures with zero variance (i.e., those with identical occurrence counts across all labeled vitrimers) are removed. The final fingerprint vector consists of 195 entries, each corresponding to a specific substructure present in the vitrimers.
• Molecular descriptors for each vitrimer are computed using the RDKit20 and Mordred21 packages. Since certain descriptors are not well defined for some vitrimer chemistries, only those that are valid across both the labeled and hypothetical unlabeled datasets are retained. Additionally, descriptors with zero variance are removed. After filtering, 166 RDKit descriptors and 620 Mordred descriptors are selected.
• Mol2vec is an unsupervised learning-based approach that represents molecules as fixed-length numerical vectors by learning embeddings of molecular substructures.32 In this work, we utilize a pretrained Mol2vec model to encode vitrimers into 300-dimensional vector representations, capturing their structural and chemical characteristics.
• SMILES: each vitrimer repeating unit is converted into canonical SMILES using the RDKit package,20 where the asterisk symbol (*) denotes connection sites for the repeating unit.
• Following the approach described by Hickey et al.,37 vitrimers are represented as molecular graphs, where heavy atoms serve as nodes and bonds as edges. Each node is characterized by atomic features including atom type, number of hydrogen atoms, valency, neighbor degree and aromaticity. Edge features include the bond type, as well as indicators for whether the bond is conjugated or part of a ring. An adjacency matrix is also extracted with binary indicators of connectivity between each pair of nodes.
The trained ML models not only enable efficient Tg prediction but also provide molecular insights into the key factors influencing vitrimer Tg. To quantify feature importance, we perform Shapley additive explanations (SHAP) analysis45 on the trained LASSO, RF, XGBoost, and FFNN models using molecular fingerprints, RDKit descriptors, and Mordred descriptors. SVR is excluded from this evaluation due to high computational cost. For molecular fingerprints, RDKit descriptors and Mordred descriptors, we compute the average SHAP values (i.e., feature importance scores) across all considered ML models and training data. The most impactful substructures and descriptors are identified based on the highest absolute average SHAP values, providing interpretability into the factors governing vitrimer Tg.
After training all models and selecting the best-performing one, we proceed with screening the unlabeled datasets. Each vitrimer in these datasets is preprocessed using the previously discussed feature representations, and its Tg is predicted using the trained model. Promising candidates are identified for further validation. For candidates from the hypothetical dataset, MD simulations are performed to compute their Tg, following the same simulation and calibration protocols used in generating the labeled dataset (details provided in the SI). For candidates from the synthesizable dataset, two vitrimers are selected for experimental synthesis and characterization based on chemical intuition, synthesizability and cost.
The performance of LASSO_Mordred is compared with that of GNN and transformer models in Fig. 3a, where it also demonstrates superior prediction accuracy over the deep learning models. To further improve Tg prediction, we implement a model ensemble approach by averaging the predicted Tg values from multiple models. We find that an ensemble consisting of XGBoost with fingerprints (XGBoost_fp), XGBoost with Mordred descriptors (XGBoost_Mordred), GNN, and transformer models achieves the highest correlation with ground truth values in the test set, yielding an R2 value of 0.78 and a RMSE of 15.19 K (Fig. 3b). In contrast, an ensemble incorporating all 22 models performs worse due to redundancy in overlapping features. Notably, the four models in the best-performing ensemble encompass four major types of feature representations: molecular fingerprints, molecular descriptors, graphs, and SMILES strings. These results suggest that averaging predictions from models with diverse feature representations leads to higher accuracy than relying on any single model alone, demonstrating the effectiveness of the ensemble learning approach.
The most impactful RDKit descriptors are shown in Fig. 4b, with their definitions provided in Table S1 (SI). The descriptor with the most negative impact, the number of rotatable bonds, decreases vitrimer chain rigidity, leading to a negative correlation with Tg. The most positively correlated descriptor, VSA_Estate2, is a hybrid descriptor that captures the combined effect of the van der Waals surface area and the electrotopological states of atoms. Another notable descriptor, FractionCSP3, represents the fraction of carbon atoms that are sp3-hybridized and negatively impacts Tg due to a more aliphatic and flexible molecular structure. Several identified descriptors have also been reported in a previous study based on an experimental dataset,48 further validating the effectiveness of our analysis. The most important Mordred descriptors, along with their descriptions, are presented in Fig. S3 and Table S2 (SI). Similar to the RDKit descriptors, the most significant Mordred descriptor is the number of rotatable bonds, demonstrating consistency across different models and providing cross-validation within this study.
The insights derived from molecular fingerprints and descriptors highlight two primary factors influencing vitrimer Tg across different scales: chain mobility and molecular interactions. At the polymer chain scale, increased chain flexibility, facilitated by rotatable bonds and aliphatic structures, enhances conformational freedom and segmental motion, resulting in lower Tg. In contrast, the presence of aromatic rings and bulky side groups contributes to greater chain rigidity, restricting mobility and leading to higher Tg. At the atomistic scale, molecular interactions, such as hydrogen bonding and van der Waals forces, strengthen local interactions within the vitrimer network, increasing energy barriers to segmental motion and thereby elevating Tg. These findings offer valuable molecular design guidelines that can inform the discovery of vitrimers with tailored thermal properties.
![]() | ||
Fig. 5 Virtual screening of the hypothetical unlabeled dataset (see Table 1) and examples of discovered vitrimers whose Tg is validated by MD simulations. (a) Tg distribution of the training data and discovered vitrimers. (b) Chemical space of the training data and discovered vitrimers visualized by PCA of molecular fingerprints. (c) Molecular structures of discovered vitrimers with high and low Tg. All presented Tg values are obtained from MD simulations. |
Note that some vitrimers presented in Fig. 5c contain secondary amine groups, which are also potential reaction sites with epoxides. However, the secondary amine–epoxide reactions are less favorable than carboxyl–epoxide reactions in the presence of catalysts such as TBD during actual synthesis. Therefore, to maintain bifunctionality in the resulting vitrimers and reduce the complexity of MD simulations for Tg evaluation, we make the assumption that only carboxylic acids can react with epoxides while the reactions involving amines are ignored. While this simplification does not fully capture the actual experiments, undesired amine–epoxide reactions could be prevented by replacing nitrogen atoms with alternative heavy atoms. This strategy preserves the predicted Tg and minimizes side reactions, as demonstrated in our previous work.17
The most impactful substructure frequencies and descriptors (see Fig. 4) of the discovered vitrimers are presented in Fig. S4 (SI). The occurrence of positively correlated substructures and descriptors is higher in high-Tg vitrimers, while negatively correlated features are more prevalent in low-Tg vitrimers. This consistency cross-validates our feature importance analysis, demonstrating that the key factors identified in the training data also apply to novel vitrimers outside the training dataset. These findings further support the universality and robustness of the interpretable insights offered by our models.
The discovered vitrimers presented in Fig. 5c are currently hypothetical vitrimer candidates exhibiting desirable Tg and their self-healing capabilities remain to be evaluated. Future research could focus on quantifying the healability of these vitrimers through more complicated MD simulations, such as reactive MD. This would also allow for an investigation into the interplay between healability, Tg, and other relevant properties.
![]() | ||
Fig. 6 Experimental synthesis and characterization of the discovered novel vitrimers. (a) The trained ensemble model is employed to virtually screen 259 vitrimers sourced from the Sigma-Aldrich database.34 By integrating chemical intuition, we evaluate their synthesizability and cost, ultimately selecting two vitrimers for experimental synthesis and characterization. (b) Synthesis schemes of the vitrimers. (c) DSC results to measure the Tg of the synthesized vitrimers. (d) Tg of the vitrimers synthesized in this work and bifunctional transesterification vitrimers present in the literature.10,17,28,29,49,50 (e) Images of pristine, cut and healed vitrimer specimens, demonstrating the healability of the synthesized vitrimers in this work. |
Among the screened candidates, vitrimers based on D,L-malic acid (MA) and 2,6-naphthalenedicarboxylic acid (NA) paired with diglycidyl ether (DGEBA) are selected for synthesis, as they exhibit desired Tg values from the ML ensemble model. These acids are associated with structural features—such as aromaticity and limited rotatable bonds in NA and the hydroxyl-rich backbone in MA—that are identified in our SHAP analysis as positively contributing to Tg, reinforcing the rationale for their selection. The epoxide and acids are cured in the presence of the catalyst TBD for vitrimer synthesis (Fig. 6b). While malic acid contains hydroxyl groups in addition to carboxyl groups, these hydroxyl groups are less reactive towards esterification with epoxides due to the presence of acidic hydrogen.51,52 Therefore, both MA-based and NA-based vitrimers are processed with an equimolar ratio of carboxyl groups to epoxides during curing.
Thermal analysis through DSC measurements reveals Tg values of 348 K (75 °C) and 395 K (122 °C) for the MA-based and NA-based vitrimers, respectively (Fig. 6c). Notably, the NA-based vitrimer exhibits a higher Tg than any bifunctional transesterification vitrimer reported in the literature (see Table S3, SI) to the best of our knowledge,10,17,28,29,49,50 as shown in Fig. 6d. The MA-based vitrimer also demonstrates a relatively higher Tg compared to most existing bifunctional transesterification vitrimers. The experimental Tg of both vitrimers agrees well with the ML-predicted Tg (364 and 390 K), indicating the accuracy of our ML model in predicting the actual Tg of vitrimers. To evaluate the healability of the synthesized vitrimers, pristine specimens are cut and subsequently healed at elevated temperatures. Microscopic examination of the pristine, cut, and healed samples (Fig. 6e) demonstrates complete damage recovery, confirming the healability of these novel vitrimer chemistries. The combination of high Tg and self-healing capability validates the effectiveness of our framework in identifying vitrimers with desirable properties, thereby broadening the scope of potential vitrimer applications.
Besides Tg, self-healing efficiency is crucial for the practical applications of vitrimers. One key metric of efficient self-healing can be associated with chain mobility, suggesting that there should be sufficient difference between Tg and the topology freezing temperature (Tv) that indicates the onset of bond exchange reactions for efficient self-healing behavior. Future research could focus on designing vitrimers that exhibit both desirable Tg and reasonable healability characterized by a sufficient Tv–Tg difference. For instance, with sufficient data on Tv for vitrimers, the ML framework developed in this study could be trained to predict self-healing performance. In addition, especially for high Tg vitrimers where there may not exist sufficient separation between Tg, Tv, and decomposition temperature (Td), incorporating further chemical intuition and Td prediction ML models could facilitate the design of vitrimers balancing Tg and Td requirements with self-healing performance.
Although this work focuses on Tg prediction of vitrimers in a limited design space (bifunctional transesterification vitrimers with an equal molar ratio), the presented approach can be readily adapted to other types of vitrimers with higher functionality and various molar ratios, incorporating the effect of macromolecular structures with greater specificity and accuracy, provided sufficient data for training are available. The proposed framework provides an effective and efficient tool for polymer chemists to design synthesizable polymers with tailored properties, expanding the potential applications of vitrimers.
Supplementary information includes details of vitrimer datasets and machine learning models as well as additional results. See DOI: https://doi.org/10.1039/d5dd00239g.
This journal is © The Royal Society of Chemistry 2025 |