Open Access Article
Nguyen T. T. Duyena,
Ngo T. Que†
b and
Anh D. Phan†
*ab
aFaculty of Materials Science and Engineering, Phenikaa University, Hanoi 12116, Vietnam. E-mail: anh.phanduc@phenikaa-uni.edu.vn
bPhenikaa Institute for Advanced Study, Phenikaa University, Hanoi 12116, Vietnam
First published on 5th February 2026
We present a multiscale modeling approach that integrates molecular dynamics simulations, machine learning, and the elastically collective nonlinear Langevin equation (ECNLE) theory to investigate the glass transition dynamics of polymer systems. The glass transition temperatures (Tg) of four representative polymers are estimated using simulations and a machine learning model trained on experimental datasets. These predicted Tg values are used as inputs to the ECNLE theory to compute the temperature dependence of structural relaxation times and diffusion coefficients, and the dynamic fragility. The Tg values predicted from simulations show good quantitative agreement with experimental data. While machine learning tends to slightly overestimate Tg, the resulting dynamic fragility values remain close to experimental fragilities. Overall, ECNLE calculations using these inputs agree well with broadband dielectric spectroscopy results. Our integrated approach provides a practical and scalable tool for predicting the dynamic behavior of polymers, particularly in systems where experimental data are limited.
Several experimental techniques are commonly used to determine the glass transition temperature (Tg), including differential scanning calorimetry (DSC),10 dynamic mechanical analysis (DMA),11,12 and broadband dielectric spectroscopy (BDS).13 Each method has distinct advantages and disadvantages. DSC has been widely adopted because of its simplicity and ability to detect heat flow associated with thermal transitions. However, when the glass transition overlaps with melting or crystallization events, it is hard to determine Tg. DMA has been widely used to measure the temperature dependence of viscoelastic behaviors of polymers with significant mechanical changes typically observed near the Tg. However, it requires precise preparation of sample dimensions and careful mechanical calibration to ensure accurate results. BDS data provide detailed insight into molecular relaxation dynamics over a broad frequency range. It allows us to calculate the temperature and pressure dependence of structural relaxation times, Tg, and the dynamic fragility. Despite its capabilities, BDS requires specialized instrumentation, can be costly, and is not universally applicable to all types of materials. These limitations have led to the development of computational approaches that include molecular dynamics (MD) simulations and theoretical models to complement or substitute the experimental determination of Tg and structural relaxation time.
MD simulations provide atomistic insight into the structural and thermal behavior of polymers by tracking the time evolution of particles under realistic interatomic interactions.14 The glass transition temperature can be estimated from such simulations by analyzing the temperature dependence of volume or density. However, MD simulations are limited to short timescales, typically up to 105 picoseconds. Meanwhile, experimental observations often span milliseconds to hundreds of seconds. This means that MD simulations cannot fully capture the slow relaxation dynamics characteristic of systems near and below the Tg.
To address the timescale limitations of MD simulations, the elastically collective nonlinear Langevin equation theory was developed to describe structural relaxation in glass-forming systems. The ECNLE theory describes how local cage-scale hopping cooperates with long-range elastic displacements to govern structural relaxation. It predicts the temperature dependence of relaxation times and diffusion coefficients across a wide dynamic range from picoseconds to hundreds of seconds, which cover both simulation and experimental timescales. This theory has been successfully applied to various amorphous materials including polymers,15,16 molecular liquids, organic compounds,17,18 metallic glasses,19 and confined systems. ECNLE predictions have shown strong agreement with experimental data and/or simulation.15–19 A key input to ECNLE calculations is Tg, which provides the thermal mapping from density to temperature. However, for newly synthesized or computationally designed materials, experimental Tg values are often unavailable and this restricts the applicability of the theory.
To overcome this challenge, in this study, we present an integrated approach that combines MD simulations, machine learning, and ECNLE theory to investigate the glassy dynamics of representative polymer systems. First, MD simulations and trained ML models are used to estimate the Tg values of each polymer system based on structural or compositional information. These predicted Tg values and experimental counterparts are then used as inputs for the ECNLE theory through the thermal mapping. We subsequently compute the temperature dependence of the structural relaxation time and diffusion coefficient, and the dynamic fragility. Finally, we validate our predictions by comparing them against available experimental data. This integrated MD–ML–ECNLE approach provides a predictive and scalable platform for understanding and designing amorphous materials, particularly when experimental data are limited or unavailable.
![]() | (1) |
The dynamic free energy provides key insights into the local particle dynamics in dense fluids. At low volume fractions (Φ < 0.432), the dynamic free energy Fdyn(r) decreases monotonically with increasing r and has no local barrier. In this regime, particles are not dynamically localized and diffuse freely due to weak interparticle interactions. This corresponds to fluid-like behavior.15 As the system becomes denser, a local barrier emerges in the dynamic free energy due to stronger interactions between the tagged particle and its nearest neighbors. These interactions give rise to transient cages that temporarily confine the particle's motion. The radius of the particle cage, rcage, is determined as the position of the first minimum in the radial distribution function, g(r). This function is related to the static structure factor S(q) via an inverse Fourier transform. As density increases, the local barrier in the dynamic free energy becomes higher. The height of this barrier is given by FB = Fdyn(rB) − Fdyn(rL), where rL is the localization length and rB is the position of the barrier maximum in Fdyn(r) as illustrated in Fig. 1. The jump displacement Δr = rB − rL represents the distance that the particle must move to escape its cage.
![]() | ||
| Fig. 1 Schematic illustration of the dynamic free energy in the ECNLE theory. Key length and energy quantities are defined. | ||
As a particle attempts to escape its cage, the surrounding particles must undergo cooperative rearrangements. This process leads to a slight expansion of the cage surface and generates a radially symmetric displacement field, u(r), that propagates outward from the particle cage. Based on Lifshitz's continuous mechanics analysis,33 the displacement field is
![]() | (2) |
![]() | (3) |
The propagation of the displacement field induces vibrations in the surrounding particles. Since u(r) is small, these vibrations can be approximated as harmonic and the elastic energy associated with a single particle is given by
with K0 = |∂2Fdyn(r)/∂r2|r=rL being the spring constant. To quantify the effect of collective motion on relaxation, the total elastic energy is obtained by summing the contributions from all particles outside the cage15–30
![]() | (4) |
Based on Kramers’ theory, we can calculate the structural relaxation time via15–19
![]() | (5) |
is the magnitude of the curvature of the dynamic free energy at the peak of the barrier. The analytical expression for τs has been discussed in detail in prior studies.24–27
Note that the calculations above provide τα as a function of volume fraction Φ. To quantitatively compare with experimental data, it is necessary to map Φ to temperature. To achieve this, Mirigian and Schweizer developed a thermodynamic mapping scheme for rigid molecular liquids.20–22 This method relies on matching the dimensionless amplitude of long-wavelength density fluctuations, defined as Sexpt0(T) = ρkBTκT, where κT is the isothermal compressibility. This quantity reflects nanoscale thermal density fluctuations and can be obtained from the experimental equation of state. In the hard-sphere fluid model with the Percus–Yevick approximation, the corresponding zero-wavenumber structure factor is given by
. By equating SHS0 with Sexpt0(T),15–30 we can determine the volume fraction as a function of temperature (Φ(T)). This thermal mapping allows us to calculate the temperature dependence of τα within the ECNLE theory.
Although ECNLE calculations associated with Schweizer's thermal mapping have been successful in predicting the dynamics of many glass-forming materials21,24 without any adjustable parameter, its assumption of a universal link between local and long-range dynamics has a limitation. In reality, variations in microstructure and chemical interactions lead to significant differences in relaxation behavior, particularly in the dynamic fragility of polymers with different chemistries. To address this, Xie and Schweizer extended the ECNLE theory by introducing a non-universal coupling between local hopping and collective elastic barriers.31 They argued that the microscopic hopping distance needed for a particle to escape its cage is specific to the chemical structure and nanoscale rearrangements of the material. This chemical specificity is encoded into the theory through a single material-dependent parameter, ac, which scales the collective elastic barrier. The new elastic barrier becomes Fe,new = acFe, and the structural relaxation time in eqn (5) is modified as15–30
![]() | (6) |
The extended theory significantly improves the predictions for the glass transition temperature, dynamic fragility, and temperature dependence of τα simultaneously. However, calculations for the extended theory are highly dependent on equation-of-state data and the adjustable parameter ac, which limits its practical application.
To address the limitations of previous thermal mapping schemes, we introduced an alternative approach based on the principle of thermal expansion
![]() | (7) |
The relationship between the diffusion coefficient (D) and the structural relaxation time is described by19,35,36
![]() | (8) |
In several previous studies,19,35,36 diffusion coefficients calculated using the ECNLE theory have shown good agreement with experimental measurements for various glass-forming systems.19,35,36
The ECNLE theory has several distinct advantages over other theoretical models such as mode-coupling theory (MCT),37,38 random first order transition (RFOT) theory,39 and the Adam–Gibbs (AG) model40 to describe the thermal dependence of structural relaxation in supercooled liquids and polymers. The MCT captures short-time caging and two-step relaxation behavior and breaks down below the crossover temperature because it neglects activated hopping processes. The RFOT assumes the existence of cooperatively rearranging regions and a free-energy landscape without a precise microscopic formulation for the energy barrier. Similarly, although the AG model links relaxation time to configurational entropy, it remains largely phenomenological and lacks a clear microscopic basis. In contrast, ECNLE explicitly accounts for both local hopping out of transient cages and long-range collective elastic distortions. This dual contribution allows it to accurately model relaxation dynamics over a wide temperature range, including near and below the glass transition. In general, the ECNLE theory provides a more physical and predictive framework to understand relaxation time, fragility, and glass formation in a broad range of amorphous materials.15–30
For each polymer, simulation cells containing either 10 or 20 chains with a polymerization degree of N = 10 were generated. These structures were packed into a periodic cubic box of dimensions 60 × 60 × 60 Å3 to provide an initial mass density close to 1.0 g cm−3. The Dreiding force field was employed due to its applicability to both organic molecules and polymeric systems.42 This force field accounts for bonded interactions (bond, angle, dihedral) and nonbonded interactions (van der Waals and hydrogen bonding), enabling realistic modeling of polymer conformations and intermolecular forces.
The chosen polymerization degree of N = 10 corresponds to molecular weights of approximately 631, 683, 656, and 570 g mol−1 for PPS, PI, PPG, and PB, respectively. While these values lie in the low-to-intermediate molecular weight regime, they are comparable to the experimental molecular weights of PI (1040 g mol−1), PPG (192 g mol−1), and PB (777 g mol−1).43 This allows for meaningful comparisons of their Tg values and relaxation dynamics.43 Prior studies43 have shown that the glass transition temperature of PPG remains relatively insensitive across a broad molecular weight range (134–18
000 g mol−1), and the Tg values of PB with molecular weights near 570 and 777 g mol−1 are nearly identical. Thus, for these polymers, although the molecular weight has significant effects on the glass transition temperature and fragility,43–45 our simulation conditions are representative of experimentally accessible behavior. However, for PPS, the simulated molecular weight is significantly lower than the experimental value (∼44
000 g mol−1),43 and quantitative comparisons of Tg and fragility in this case are cautiously studied.
Following energy minimization, each polymer system was equilibrated at a high temperature selected to ensure molecular mobility and thermal stability. The equilibration temperatures were set to 340 K for PPS, 300 K for PI and PPG, and 330 K for PB. Simulations were performed in the NPT ensemble using the Nosé–Hoover thermostat and barostat for 250 ps to maintain constant thermodynamic conditions. Subsequently, the systems were cooled in a stepwise manner to target temperatures of 150 K for PPS, 100 K for PI and PPG, and 50 K for PB. The cooling process was executed over multiple temperature intervals of 20–25 K with each interval involving 200 ps of equilibration. At each step, the specific volume was averaged over the final 100 ps of simulation. To determine Tg, the temperature–volume data were analyzed by fitting two linear regimes corresponding to high- and low-temperature regimes. The intersection of these two linear fits was taken as the glass transition temperature for each polymer.
The chemical structures of the polymers were first represented using the simplified molecular input line entry system (SMILES) format,47 which encodes how atoms are connected within each molecule. These representations were then processed using RDKit48 to generate quantitative molecular fingerprints that capture key features of the molecular topology, such as the arrangement of atoms and bonds. Each fingerprint consists of 2048 binary values (0 and 1), where each bit indicates the presence or absence of a specific structural pattern. These molecular descriptors were used as the input features for training and testing the machine learning model. All dataset and code used in this study are openly available in the GitHub repository49 to ensure full reproducibility and transparency. The dataset was randomly divided into training (80%) and testing (20%) sets. To evaluate the predictive accuracy, we used two common performance metrics: the coefficient of determination (R2), which indicates how well the model captures variance in the data, and the root mean square error (RMSE), which measures the average deviation between predicted and actual Tg values.
The histograms in Fig. 2 show the frequency distributions of actual and predicted Tg from our machine learning model. For the training set (Fig. 2a), the distribution of the predicted Tg values shows excellent overlap and is nearly indistinguishable from the actual Tg distribution. Crucially, for the test set (Fig. 2b), our predictions have a strong coherence between the actual and predicted distributions. Specifically, the predicted Tg distribution in the test set successfully captures the overall shape, range, and modal features of the experimental data. This strong agreement in both training and test sets suggests that our model is reliable for predicting the Tg values.
![]() | ||
| Fig. 2 Distribution of actual and predicted glass transition temperatures of polymers for (a) training data and (b) test data. | ||
Our machine learning model, which is based on GPR, predicts Tg with R2 of 0.78 and RMSE of 19.5 K. Fig. S2 in the SI shows the predicted versus experimental Tg values for the training and test sets of the low-Tg subset to confirm comparable errors for both. This performance indicates reasonable predictive accuracy within the low-Tg range relevant to our selected polymers. Compared to a previous GPR study,28 which reported R2 of over 0.89 and an RMSE below 37 K for a full dataset spanning Tg values from 134 to 768 K, our model has a lower overall R2 but a smaller RMSE. This difference arises because we focus our training on a subset of 1586 polymers with Tg values below 320 K to better represent the temperature range of interest. As detailed in Tables SI–SIII of the SI, cross-validation shows that this choice yields slightly lower global R2 due to the narrower Tg range, but improves accuracy (lower RMSE) for low-Tg polymers and avoids the systematic upward shift in predicted Tg that occurs when higher-Tg polymers are included in the training set.
Several recent studies have applied more complex machine-learning architectures to predict polymer Tg. A graph neural network (GNN) model was specifically developed for polyimides, focusing on high-Tg materials within a relatively narrow chemical space.50 However, its maximum R2 of 0.86 implies a non-negligible amount of unexplained variance and does not provide higher accuracy than our GPR model. Park et al. combined a graph convolutional network (GCN) with a neural-network (NN) regressor to determine Tg and other properties over a dataset with a similar Tg range and distribution to our full database.51 Although their deep-learning architecture is more complex, the resulting prediction accuracy is only comparable to that of our simpler GPR approach. Cheminformatics models coupled to coarse-grained MD simulations52 can identify physical descriptors, but these simulations are time-consuming and do not always reproduce experimental behavior accurately. As a result, this approach yields only 0.47 ≤ R2 ≤ 0.74 for Tg prediction across diverse polymers and is difficult to deploy in a high-throughput setting. More recently, quantum-chemical-augmented NN/GNN frameworks53 that rely on quantum-chemistry-derived local-cluster descriptors report R2 ≈ 0.74 for Tg and require additional quantum-chemical calculations for each polymer. In contrast, our GPR model operates on simple binary fingerprints generated from SMILES strings, avoids any MD- or QC-based feature generation, and still achieves higher accuracy in the low-Tg regime of interest. This balance of accuracy, simplicity, and computational efficiency makes the GPR framework the most appropriate choice for the present work.
To further evaluate the accuracy of both ML and MD approaches, we compare their predicted Tg values with experimental data in ref. 43. As summarized in Table 1, the MD results show good quantitative agreement with experimental values and deviations are typically within 10–15 K. The GPR model predicts the Tg values for PPS and PI with moderate accuracy. The ML predictions overestimate the experimental counterparts by 20 K and 36 K, respectively. For PPG and PB, the deviations are larger, on the order of 80 K. This likely indicates both the limited number of structurally similar low-Tg polymers in the training data and the intrinsic scatter of the experimental Tg database. Hence, in this work we use the GPR model mainly to capture overall trends in the low-Tg regime and to provide an additional input for testing ECNLE predictions, rather than as a highly accurate predictor for each individual flexible polymer. While MD simulation provides more accurate absolute Tg predictions, the GPR model remains a useful and efficient tool for trend prediction and rapid screening, especially when experimental data are unavailable. A systematic exploration of more expressive architectures and additional descriptors as shown in ref. 50–53 to further reduce these errors is an important direction for future work, but is beyond the scope of this study, which focuses on testing ECNLE predictions against different Tg inputs.
| Material | Tg,expt (K) | Tg,MD (K) | Tg,ML (K) |
|---|---|---|---|
| PPS | 229 | 227 | 249 |
| PI | 185 | 221 | 240 |
| PPG | 191 | 192 | 277 |
| PB | 166 | 181 | 249 |
Fig. 4 shows the structural relaxation time as a function of 1000/T for PPS, PI, PPG, and PB. The relaxation times were first calculated as a function of volume fraction using eqn (1)–(5), and then converted to temperature using the thermal mapping described in eqn (7), where Tg is a key input. The Tg values used in this mapping were obtained from MD simulations, ML predictions, and experimental data as listed in Table 1. ECNLE predictions using experimental Tg values (Tg,expt) quantitatively agree with BDS data in ref. 43 for all polymers. When using MD-predicted Tg values (Tg,MD), the predicted relaxation times remain consistent with experimental data for most systems, although a noticeable deviation is observed for PI. This finding is predictable since the difference between Tg,MD and Tg,expt is largest for PI. In contrast, using ML-predicted Tg values (Tg,ML) leads to greater discrepancies between ECNLE results and experimental data, particularly at lower temperatures.
![]() | ||
Fig. 4 Temperature dependence of the structural relaxation time for (a) PPS, (b) PI, (c) PPG, and (d) PB calculated using eqn (1)–(5) and (7). The Tg used in the thermal mapping (eqn (7)) is obtained from MD simulations, ML predictions, and experimental data in ref. 43. Experimentally, the molecular weights obtained were 44 000 g mol−1 for PPS, 1040 g mol−1 for PI, 192 g mol−1 for PPG, and 777 g mol−1 for PB. In contrast, the corresponding values derived from MD simulation were 631, 683, 656, and 570 g mol−1, respectively. | ||
To facilitate direct comparison with experimental results, the relaxation time data from Fig. 4 are replotted as a function of Tg/T in Fig. 5. This normalized representation shows that the results from the MD simulations and the GPR model agrees more closely with the experimental data than when plotted in Fig. 4. Moreover, plotting log
τα versus Tg/T allows us to calculate the dynamic fragility, which is defined as
.54 This quantity quantifies the steepness of the increase in the structural relaxation time near Tg. Based on the value of the dynamic fragility index m, glass-forming materials are typically categorized into three groups. Materials with m < 30 are considered “strong” glass formers and their τα(T) exhibits relatively Arrhenius-like behavior. Those with m > 100 are classified as “fragile” and the temperature dependence of the structural relaxation time shows highly non-Arrhenius dynamics near Tg. Systems with m values between 30 and 100 are regarded intermediate glass formers.
![]() | ||
| Fig. 5 The logarithm of structural relaxation times as a function of Tg/T for (a) PPS, (b) PI, (c) PPG, and (d) PB from the same data as in Fig. 4. | ||
Table 2 summarizes the dynamic fragility calculated using ECNLE theory using different Tg inputs and experimental fragility values. In most cases, the calculated m values are lower than the experimental values, but the deviations remain within an acceptable range. When using experimental Tg as the input, the calculated m values for PPS and PI are within 10–15% of the experimental fragilities. Although the Tg,MD values are close to the experimental Tg, the fragilities calculated using Tg,ML actually show better agreement with experimental values across all polymers. This suggests that machine learning, even with slightly overestimated Tg values, can effectively predict the dynamic fragility. The observed differences may result from experimental uncertainties, the sensitivity of glassy dynamics to small variations in Tg, and approximations inherent in the theoretical model. We also train our ML model based on the full dataset of polymer and find that the calculated fragilities deviate more significantly from experimental values (see the SI). This further validates the importance of appropriate data selection. Overall, these findings reveal that combining ECNLE theory with Tg inputs from either MD or ML provides a reliable and practical approach for predicting the thermal behavior of polymer glasses.
| Material | mexpt | m (Tg,expt) | m (Tg,MD) | m (Tg,ML) |
|---|---|---|---|---|
| PPS | 116 | 84 | 84 | 89 |
| PI | 80 | 75.4 | 82 | 89 |
| PPG | 79 | 67 | 67 | 103 |
| PB | 84 | 64 | 67 | 92 |
To examine how specific molecular characteristics such as intramolecular flexibility, side-chain dynamics, and intermolecular interactions affect the τα(T) calculations, we adopted the method proposed by Xie and Schweizer.31 This approach uses a material-specific parameter ac to adjust the contribution of the collective elastic barrier to the total energy barrier (FB + acFe). All above calculations in this work use the default value ac = 1. Fig. 4 presents the structural relaxation times of PPS, PI, PPG, and PB as a function of 1000/T computed using the ECNLE theory with varying values of the parameter ac based on eqn (1)–(4), (6), and (7). Adjusting ac improves quantitative agreement between numerical results and experimental data. This finding indicates that accurate prediction of Tg alone is insufficient to quantitatively capture the temperature dependence of structural relaxation time, as the dynamic behavior is also sensitive to how the collective elastic barrier is treated. However, even with optimal tuning, the model does not fully reproduce the relaxation behavior over the entire temperature range. It is also important to note that the predictions of this extended approach strongly depend on the selection of ac, which currently lacks a first-principles basis and must be empirically adjusted. The dependence of this method on fitting limits its use to systems for which experimental data already exist (Fig. 6).
![]() | ||
| Fig. 6 Temperature dependence of the structural relaxation time for (a) PPG, (b) PI, (c) PPS, and (d) PB. Data points correspond to experimental data in ref. 43. Solid curves are ECNLE theoretical predictions computed using eqn (1)–(4), (6) and (7) with experimental Tg values and different ac parameters. | ||
The above results indicate that using ac = 1 in the ECNLE theory provides a reasonable prediction of τα(T). Based on this, we apply eqn (8) to compute the diffusion coefficient as a function of Tg/T for PPS, PI, PPG, and PB. The particle diameter d, which is used to calculate the diffusion coefficient, was estimated for each polymer using the atomic structure of its monomer derived from SMILES codes and processed with the RDKit software. The computed diffusion coefficients are presented in Fig. 7. Using the Tg values obtained from MD, ML, and experimental data, we find that the diffusion coefficient decreases rapidly as temperature decreases. Notably, the ECNLE predictions using Tg,MD and Tg,expt are relatively close, despite the quantitative differences between these Tg values. Although the ECNLE theory is a microscopic statistical mechanical theory and does not explicitly account for spatially heterogeneous dynamics or the decoupling behavior between diffusion and relaxation time observed near the glass transition temperature,55–57 our calculated diffusion coefficients show good agreement with experimental data for PPG and PB provided in ref. 58. This agreement reinforces the physical relevance of our predictions and suggests that, for the systems studied here, the ECNLE theory can provide a quantitatively accurate description of average translational dynamics.
![]() | ||
| Fig. 7 The predicted temperature dependence of the diffusion coefficient for (a) PPS, (b) PI, (c) PPG, and (d) PB calculated using eqn (1)–(5), (7) and (8). The Tg used in the thermal mapping (eqn (7)) is obtained from experimental, MD simulations, ML predictions, and experimental data.58 The molecular weights for PPG were 770 g mol−1 experimentally and 656 g mol−1 via MD simulation, while the values for PB were 466 g mol−1 and 570 g mol−1, respectively. | ||
τα versus 1000/T. In the Angell representation (log
τα versus Tg/T), the ECNLE results based on Tg,MD, Tg,ML and Tg,expt are all quantitatively in agreement with the experimental relaxation data. This representation can reliably estimate the dynamic fragility. In particular, the fragilities calculated using Tg,ML show better agreement with the experimental values compared to those based on Tg,MD. This suggests that machine learning provides not only trend preservation but also quantitative accuracy in fragility prediction. Although the extended ECNLE model allows tuning through a material-specific parameter ac to scale effects of collective dynamics on the glass transition, we found that the standard formulation (ac = 1) already provides reasonable accuracy without any adjustable parameters. Overall, our study clearly validates the predictivity of our integrated approach, especially when experimental data are limited.
Footnote |
| † Present address: Center for Materials Innovation and Technology, VinUniversity, Hanoi 100000, Vietnam. |
| This journal is © The Royal Society of Chemistry 2026 |