Open Access Article
Yu Li
,
Jia-ao Hou
,
Yuqi Feng,
Danyang Zhao,
Cheuk Lun Chow
* and
Denvid Lau
*
Department of Architecture and Civil Engineering, City University of Hong Kong, Hong Kong, China. E-mail: denvid.lau@cityu.edu.hk; cheuchow@cityu.edu.hk
First published on 19th June 2026
Aluminosilicate hydrate geopolymers, such as sodium aluminosilicate hydrate and potassium aluminosilicate hydrate, are low-carbon cementitious materials commonly synthesized from industrial solid wastes such as fly ash and slag, offering a sustainable cementitious material with superior mechanical and environmental performance. However, their atomic-scale mechanisms remain elusive due to limitations in experimental and conventional simulation methods. In this study, a machine learning potential model constructed within a deep potential generator framework is developed for sodium aluminosilicate hydrate, and trained on density functional theory datasets spanning from 300 K to 1000 K. The model accurately reproduces density functional theory-calculated energies and forces with errors of 0.005 eV per atom and 0.078 eV Å−1. Furthermore, transfer learning is employed to adapt the sodium aluminosilicate hydrate model to potassium aluminosilicate hydrate using a small amount of additional density functional theory data, yielding comparable accuracy and faster convergence, with errors of 0.003 eV per atom and 0.092 eV Å−1 for energies and forces. To the best of our knowledge, this work presents the first machine-learning interatomic potentials specifically developed for both N-A-S-H and K-A-S-H geopolymer gels. Structural characterization, elastic properties, and dynamic behaviors predicted by the models are benchmarked against density functional theory, classical forcefields, and experimental measurements, demonstrating robustness and transferability of the approach. The findings demonstrate that the method is highly capable of reliably capturing complex aluminosilicate systems, which provide a new atomic-level understanding of their structural and mechanical behavior, thereby establishing a robust basis for guiding the targeted design of durable, high-performance, and sustainable geopolymer materials.
![]() | ||
| Fig. 1 Workflow for developing a DPMD model tailored to variable geopolymer systems, integrating dataset preparation, model training, and application. | ||
Although geopolymers show tremendous promise in terms of material performance and environmental applications, their formation mechanisms and microstructural evolution remain poorly understood. Conventional characterization approaches, including X-ray diffraction and nuclear magnetic resonance spectroscopy, are largely limited to inferring atomic-scale structural features in an indirect manner, while dynamic processes such as ion migration and hydrogen bond formation are still beyond direct observation. High-resolution characterization methods are not only expensive but also difficult to implement widely. Against this backdrop, classical molecular dynamics (MD) simulations offer an effective computational framework for investigating thermodynamic properties, structural units, and dynamic behaviors at the atomic level.12–18 These simulations contribute to unveiling the formation mechanisms of geopolymers and guiding performance optimization. Prior studies employing non-reactive Buckingham or Morse forcefields and reactive forcefield (ReaxFF) have constructed three-dimensional sodium aluminosilicate (N-A-A) glass or network models under constant-energy, constant-pressure and constant-temperature ensembles.19–21 These investigations reveal, for instance, that increasing water content from 2.6% to 4.7% increases the sodium diffusion coefficient, from 7.82 × 10−13 m2 s−1 to 10.12 × 10−13 m2 s−1, that models with silicon-to-aluminum ratios of two and three exhibit the highest tensile strength, and that noncompact models containing nanopores and water display strengths 45% lower than that of their compact analogues.22 Additional ReaxFF simulations have demonstrated that high temperatures accelerate network polymerization, that silicon-oxygen and aluminum–oxygen bond lengths first increase then decrease, and that magnesium binds more strongly to the aluminosilicate framework than sodium while cesium shows the highest mobility.22,23 Nevertheless, empirical forcefields confront an inherent trade-off between computational efficiency and chemical fidelity. Furthermore, ab initio molecular dynamics (AIMD) simulations, which generate interatomic forces via density functional theory (DFT), can deliver unparalleled precision in describing bonding and reaction events.24,25 However, their practical applicability is typically restricted to systems containing only a few hundred atoms and simulation times on the order of tens of picoseconds, which severely limits their use for large-scale geopolymer dynamics.
Machine learning potential (MLP) has recently been recognized as an effective approach for achieving a balance between accuracy and computational efficiency in MD simulations. Trained on first-principles datasets, these models can achieve quantum-level accuracy without incurring prohibitive computational cost.26,27 Deep potential (DP), a multilayer perceptron framework, excels at capturing complex nonlinear relationships between atomic coordinates and system energy.28 Open-source packages such as deep potential molecular dynamics (DPMD) based on passive learning and Deep Potential Generator (DP-GEN) employing concurrent learning have been developed to facilitate model construction.29 The DP-GEN concurrent-learning strategy iteratively refines the training set by screening MD trajectories for force deviations outside predefined bounds and supplementing data via single-point first-principles calculations.28 Although this methodology has propelled advances in calcium silicate hydrate research, its application to geopolymer systems remains unexplored, underscoring the need for dedicated machine learning potential studies. More importantly, the unresolved challenge is not only whether existing MLP frameworks can be transferred to geopolymer systems, but whether they can be used to construct system-specific interatomic potentials for the particular amorphous aluminosilicate gels that researchers actually aim to understand and design. In conventional forcefield approaches, once a parameter set is fixed, extension to related but compositionally different systems often requires reparameterization, empirical adjustment, or the use of separate and potentially inconsistent descriptions. A MLP framework offers the opportunity to overcome this limitation by providing a more flexible yet still first-principles-grounded route for building coherent potentials tailored to the systems of interest.
This study aims to formulate a DPMD model with applicability to sodium-based and potassium-based aluminosilicate geopolymers. Starting from established sodium aluminosilicate glass structures, aluminum atoms are randomly substituted by silicon and charge neutrality is maintained through the incorporation of calcium cations. Subsequent hydration yields realistic N-A-S-H frameworks, which are geometry-optimized via density functional theory. To ensure comprehensive coverage of the potential energy surface of the geopolymer, training data are generated over a temperature range spanning 300 K to 1000 K. The resulting first-principles forces and energies form the DP-GEN training dataset. Rigorous validation of the trained model is performed, followed by benchmarking against experimental measurements, classical MD results and DFT calculations. Given the structural similarities between K-A-S-H and the N-A-S-H system, this work employs an innovative approach by leveraging a small amount of first-principles data from the K-A-S-H system to enable cross-system model transfer. The accuracy of the transferred model is then rigorously validated. The impact of this research lies in filling the critical gap of DP-GEN-based modeling in geopolymer systems, a field previously dominated by less accurate classical forcefields or computationally prohibitive AIMD. To the best of our knowledge, this is the first study to develop a DPMD model specifically tailored for both N-A-S-H and K-A-S-H geopolymer systems. The originality of this work lies in establishing a practical modeling strategy for complex amorphous geopolymer gels, where reliable potentials are constructed through system-specific dataset generation, AIMD sampling, and iterative refinement within a unified framework. By delivering a reliable and scalable simulation framework, the present study enables detailed atomistic understanding of the intrinsic links between the atomic structure and macroscopic properties in N-A-S-H and K-A-S-H, while also establishing a cost-effective transfer learning paradigm for future geopolymer research. The insights gained here are expected to inform bold, forward-looking strategies to guide the rational design and identification of advanced durable, high-performance geopolymer materials for infrastructure applications. For example, this forcefield can capture atomic scale deformation mechanisms to guide the development of materials with ultra-high mechanical strength, reliably simulate the behavior of geopolymers at 300–1000 K to optimize refractory formulations, and support systematic control of key element ratios such as the silicon aluminum ratio and sodium aluminum ratio, breaking free from the limitations of trial-and-error experiments.
Once the amorphous N-A-S-H structure is generated, a deep optimization step via DFT refines the model. Specifically, after structural relaxation and energy minimization, the initial training configurations are generated by AIMD simulations, in which atomic forces and stresses are directly computed from density functional theory and the temperature is explicitly controlled by the Nosé–Hoover thermostat. The initial AIMD sampling is performed at 500 K, which is selected as an intermediate temperature within the targeted 300–1000 K range to facilitate efficient subsequent DP-GEN exploration and convergence. Numerous iterative calculations adjust atomic positions and electronic densities to drive the system toward its lowest energy. This crucial stage not only ensures structural validity and stability but also reduces the computational demands of later first-principles runs. With the optimized structure in hand, the dataset generation phase employs fully ab initio quantum-mechanical calculations on the N-A-S-H system to extract atomic coordinates, energies and forces. Throughout the computational workflow, simulations run in an NVT ensemble governed by the Nose–Hoover thermostat that maintains temperatures between 300 K and 1000 K. The use of the NVT ensemble is based on the fact that the system volume has already been established during the preceding structural relaxation and energy minimization stage. In this step, the model is equilibrated by minimizing both atomic forces and stresses, with convergence thresholds of 0.01 eV Å−1 for the force and 2.0 kBar for the stress, thereby ensuring a mechanically stable structure with a physically reasonable volume and density before finite-temperature sampling. Sampling across this wide range delivers comprehensive mapping of the potential energy surface and captures structures and energies under diverse thermodynamic states. In the subsequent DP-GEN concurrent-learning cycles, AIMD-based exploration is carried out at four temperatures, namely 300 K, 500 K, 700 K, and 1000 K. During each iteration, simulations at all temperatures are performed using identical sampling time and interval settings, while candidate configurations are selected according to the model deviation. This sampling procedure is adaptive, where temperature regions with larger uncertainty contribute more candidate structures, whereas fewer structures are retained from well-converged regions, ensuring both balanced thermodynamic coverage and efficient data generation. After integration and filtering, the initial dataset of 1230 atomic frames initiated 17 concurrent learning iterations of the DP-GEN framework, which integrated MD-based configuration space exploration. The initial dataset contains 1230 atomic configurations and mainly serves to initialize the DP-GEN concurrent-learning process. Through 17 iterative exploration–labeling–training cycles, the dataset is progressively expanded to a final total of 16
700 configurations. The adequacy of the training set is determined not by a predefined target number of frames, but by the convergence of model accuracy and the reduction of force deviation during active learning. In the final exploration stage, 137
834 out of 140
472 explored configurations fall within the predefined trust region, corresponding to a prediction accuracy of approximately 98%, which indicates a high level of model reliability. These richly detailed datasets support both training and validation of the DPMD model and pave the way for deep understanding of how the microstructure of N-A-S-H geopolymers influences their macroscopic properties.
![]() | (1) |
During the exploration and labeling stages, deviation metrics are assessed based on an ensemble of previously trained models, with new configurations selected based on the maximum force deviation, defined as eqn (2),28
![]() | (2) |
The initial K-A-S-H structure is generated by substituting all Na+ species within the optimized N-A-S-H network with K+ counterparts. As both cations carry the same charge, the substitution preserved overall charge neutrality. The hydration environment established in N-A-S-H is preserved to approximate the gel chemistry of K-A-S-H. A similar equilibration protocol is applied, where equilibration is subsequently performed at 500 K in an NVT ensemble. This procedure yields a stable amorphous K-A-S-H configuration suitable for further refinement. DFT calculations are subsequently carried out under the same computational settings used for N-A-S-H, including the Perdew–Burke–Ernzerhof exchange–correlation functional, plane-wave cutoff energy, and total energy convergence threshold. NVT ensembles are conducted spanning temperatures from 300 K to 1000 K, with atomic coordinates, energies, and forces recorded at 5-fs intervals. From the resulting trajectories, only 330 frames are selected for training the DPMD model, significantly fewer than the 1230 frames used for N-A-S-H, thereby substantially reducing computational cost while maintaining model fidelity.
Afterwards, to capture the distinct structural features of K-A-S-H, including the coordination environment of K+ ions and changes in network polymerization, the pre-trained N-A-S-H DPMD model is adapted rather than directly reused. Parameters associated with general aluminosilicate network features, including descriptors for [SiO4] and [AlO4] tetrahedral environments and energy modules for Si–O–Al bridging oxygens, are retained to serve as the foundation for transfer learning. Using the DP-GEN concurrent learning strategy, the initial 330 frames in the K-A-S-H DFT dataset forms the starting training set. Molecular trajectories are screened for configurations with force deviations exceeding 0.012 eV Å−1, resulting in 13
157 additional single-point DFT calculations. Fig. 2(b) presents the maximum force deviation distribution obtained for the K-A-S-H system, with its potential energy surface derived via transfer learning starting from the N-A-S-H model. Compared with N-A-S-H, the distributions of K-A-S-H at all temperatures are likewise concentrated in the low-deviation region, and their curve shapes closely resemble the corresponding profiles observed for N-A-S-H, which demonstrates that the transfer-learning approach largely preserves the predictive strength of the parent model. At 300 K and 500 K, the distributions nearly coincide with those of N-A-S-H, suggesting that the transfer-learned model transitions smoothly to the K-A-S-H system under low-temperature conditions. At 700 K and 1000 K, although the distributions become slightly broader, the overall deviation levels remain low, and most snapshots still fall within a reasonable range. These results suggest that transfer learning achieves a substantial reduction in computational expense for new-system modeling while simultaneously preserving high predictive accuracy and stability in the complex configurational space under higher-temperature conditions. It is worth noting that, under high-temperature conditions, the K-A-S-H distributions exhibit greater smoothness relative to the corresponding N-A-S-H results, implying that transfer learning enhances, to some extent, the mode adaptability to different chemical environments. During training, the initial four layers of the fitting network are kept fixed, and only the parameters of the output layer are trained, and the training proceeds totally for 5 × 104 steps. A direct comparison between transfer learning and training from scratch for K-A-S-H is provided in the SI, where transfer learning is shown to achieve comparable accuracy with an approximately tenfold reduction in training steps. This transfer learning approach successfully adapts pre-trained aluminosilicate potential to a new cationic environment, providing a cost-efficient strategy for developing accurate interatomic potential across chemically similar systems.
![]() | (3) |
![]() | (4) |
indicate the corresponding energy and force values predicted by the DPMD model.
To further assess the performance of the DPMD model, lattice constants, radial distribution functions (RDFs), mean square displacement (MSD), and structural descriptors characterizing geopolymer systems are computed. RDF profiles are obtained from canonical ensemble simulations conducted via AIMD and classical MD using various potential models. MSD analysis is performed to assess atomic mobility and dynamic stability, providing complementary insight into the model's ability to reproduce realistic transport behavior.38–40 For elastic property evaluation, the stress–strain method is applied with ±5% strain, which is a value that falls within the typical range for elastic property characterization and has been widely employed in simulations targeting elastic properties of geopolymers and similar materials.1,22,31,41 Elastic tensors under compressive deformation are evaluated using large-scale atomic/molecular massively parallel simulator (LAMMPS) for both DPMD and clay forcefield (ClayFF) models, while reference results based on DFT are taken from first-principles calculations. The bulk modulus, Young's modulus, shear modulus, and Poisson ratio are subsequently determined through Voigt–Reuss–Hill (VRH) averaging. These multi-faceted validations demonstrate the robustness and cross-system applicability of the DPMD model in capturing both static and dynamic properties of geopolymer systems.
For the energy prediction, the points of both the training sets are tightly clustered around the diagonal line. This result demonstrates that the DPMD model is capable of faithfully recovering the DFT-derived energy values. The RMSE for energy in the training set is 0.003 eV per atom, and 0.005 eV per atom in the test set, both of which are fairly low compared to the literature29 and have reached the level of meV deviation, demonstrating the high precision of the proposed model in energy prediction (see Fig. S2). Regarding the force prediction, the points also show a strong adherence to the diagonal line. The RMSE for force in the training set is 0.063 eV Å−1 and 0.078 eV Å−1 in the test set, as shown in Fig. S2. These small error values imply that the model can effectively capture the interatomic force information as accurately as DFT. In terms of computational efficiency, the DPMD model offers a substantial advantage over DFT. While a single DFT calculation for a N-A-S-H system of comparable size requires significant computational resources on a high-performance cluster, the trained DPMD model generates predictions almost instantaneously. The training process, though computationally intensive, is performed only once, after which the model can be reused for large-scale simulations or repeated evaluations with negligible additional cost.
Afterwards, the predictive performance of the transfer-learned DPMD model for the K-A-S-H system is evaluated by comparing its outputs with DFT reference values, as illustrated in Fig. 4. The model parameters are obtained via transfer learning from the previously trained N-A-S-H model. The four panels correspond to total energy, as shown in Fig. 4(a), and force components along the x, y, and z directions (see Fig. 4(b)–(d)), with green points representing the training set and purple points, the test set. The dashed diagonal line in each plot denotes perfect agreement with DFT. Across all subplots, data points for both training and test sets are densely distributed along the diagonal, indicating that the transfer-learned model maintains high predictive fidelity for both energy and atomic forces in the new chemical system. For energy prediction, the clustering of points is particularly tight, reflecting minimal deviation from DFT values. For force components, strong linear correlations are also observed in all three Cartesian directions, with only slight dispersion relative to the diagonal, consistent with the inherent sensitivity of force calculations to local atomic environments. The close agreement between predictions and DFT results demonstrates that the structural and energetic features learned from N-A-S-H can be effectively generalized to K-A-S-H via transfer learning. This approach significantly reduces the amount of new DFT data and training time required, while still achieving DFT-level accuracy.
When compared with the outcomes obtained for the N-A-S-H system, the K-A-S-H model exhibits comparable RMSE values for both energy and force predictions. The RMSE for energy is 0.002 eV per atom and 0.003 eV per atom in the training and test set, respectively, and the RMSE for force is 0.090 eV Å−1 and 0.092 eV Å−1 in the training and test set, respectively, confirming that transfer learning preserves the accuracy of the original model while enabling rapid adaptation to a chemically related system, as shown in Fig. S3. This highlights the efficiency and scalability of the DP-GEN framework for extending interatomic potentials across similar material systems.
It is important to emphasize that RDF curves reflect the probability distribution of specific atomic pairs at different distances, and the peak positions correspond to average bond lengths or coordination shell locations, while the peak heights represent the degree of local ordering and the strength of interatomic interactions.42 Therefore, the strong agreement between DPMD and AIMD across various atomic pairs not only confirms that the model can reliably forecast both energy and force behaviors but also demonstrates its capability to faithfully reproduce the local structural characteristics and chemical environments associated with the N-A-S-H material system. In contrast, the deviations observed in ClayFF highlight the limitations of traditional empirical forcefields in complex inorganic polymer systems, particularly in cases involving multiple components and diverse coordination environments. The RDFs of N-A-S-H at other temperatures (500 K, 700 K and 1000 K) are shown in Fig. S4–S6, where Si–O and Al–O retain short-range coordination stability but show broadened peaks and reduced intensity with increasing temperature, while Na–O and H–O exhibit more pronounced peak attenuation and broadening, accompanied by progressive deterioration of medium-range order and enhanced ionic mobility. In summary, the developed DPMD model achieves structural prediction accuracy comparable to AIMD while offering significant computational efficiency, thus providing a reliable tool for subsequent investigations into the structure and properties of geopolymer systems. From a mechanistic perspective, the RDF results also indicate that charge-balancing cations play an important role in regulating framework compactness and rigidity. In the N-A-S-H system, the relatively localized Na–O coordination environment suggests stronger electrostatic interactions between Na+ and framework oxygen atoms, which help stabilize the aluminosilicate network and promote a more compact local structure. Such stronger cation-framework association is expected to enhance network constraint and reduce structural flexibility, thereby contributing to the comparatively higher rigidity of N-A-S-H.
Afterwards, to examine the applicability of the transfer learning strategy in the K-A-S-H system, the RDFs between different atoms at 300 K are further analyzed, as shown in Fig. 6. The results from DPMD are generally consistent with AIMD in terms of peak positions, successfully reproducing the structural features of the primary coordination shells, whereas ClayFF exhibits noticeable deviations. For the K–O, Si–O, and H–O pairs, the peak intensities obtained from ClayFF are significantly higher than those from DPMD and AIMD, indicating an overestimation of local ordering. In the case of the Al–O pair, the first peak position predicted by ClayFF is shifted, reflecting its limitations in accurately describing the tetrahedral framework. By contrast, the curve profiles and peak intensities from DPMD show excellent agreement with AIMD, demonstrating that the K-A-S-H DPMD potential derived through transfer learning from the N-A-S-H model maintains high predictive accuracy in a new cationic environment. Quantitatively, the first-peak position differences between DPMD and AIMD for K-A-S-H remain within approximately 0.02–0.03 Å for Si–O and Al–O, and within about 0.05–0.06 Å for K–O and H–O. The corresponding peak-height differences are generally within 5% for the framework pairs at 300 K. This confirms that the transfer-learned model preserves the local structural characteristics of the parent system with only limited deviations in peak sharpness and local ordering. This approach not only retains the broad generalization ability of the original model, but also enhances the overall computational efficiency compared with AIMD. In addition to validating the transfer learning strategy, the RDF comparison also reveals the structural role of K+ in the geopolymer network. Compared with Na+, K+ exhibits a broader and less localized first coordination shell with oxygen, which is consistent with its larger ionic radius, lower charge density, and weaker electrostatic binding to the aluminosilicate framework. This more delocalized coordination environment implies a looser local packing state and a broader distribution of cation-oxygen distances, thereby increasing medium-range structural variability and reducing the overall rigidity of the K-A-S-H network. These results further confirm that the transfer learning method offers strong reliability and broad applicability for cross-system modeling of complex geopolymer systems. The RDFs of K-A-S-H at 500 K, 700 K and 1000 K are shown in Fig. S7–S9, where the faster deterioration of the medium-range structure of ionic pairs at elevated temperatures is observed, which further validates the model's capability to capture temperature-dependent structural dynamic behaviors.
Similarly, the MSD of the K-A-S-H model is analyzed using different methods at 300 K, as shown in Fig. 8. At 300 K, the MSD profiles of the K-A-S-H system display marked dynamical distinctions relative to those of N-A-S-H. The MSD values of K-A-S-H are consistently higher than those of N-A-S-H, for both framework atoms (Si, Al and O) and mobile species (K and H), indicating stronger thermal vibrations and enhanced ionic mobility. This phenomenon primarily arises from the larger ionic radius and reduced charge density of K+ compared with Na+, which weakens its electrostatic interaction with framework oxygen atoms. Therefore, K+ is less tightly confined within the aluminosilicate network and experiences a lower effective migration barrier. At the same time, the weaker cation–framework interaction reduces the topological constraint exerted on the surrounding framework, allowing larger-amplitude thermal fluctuations and more pronounced local structural rearrangements. Therefore, the stronger dynamics in K-A-S-H originate not only from the intrinsic mobility of K+ itself, but also from the increased flexibility of the entire cation-framework coupled network. As a result, local structural relaxation is intensified, the thermal amplitude of the framework atoms is enlarged, and more accessible diffusion pathways are created for ionic transport. Consequently, K-A-S-H exhibits higher overall mobility than N-A-S-H under ambient conditions. In terms of methodological comparison, the results of DPMD and AIMD for K-A-S-H remain in excellent agreement, and the MSD curves of both framework and ionic species nearly overlap in magnitude and growth rate, confirming the robustness and cross-system transferability of the K-A-S-H DPMD model obtained through transfer learning based on the N-A-S-H potential. Quantitatively, for the mobile species in K-A-S-H at 300 K, the MSD difference between DPMD and AIMD remains within about 10% over the main diffusion regime. As in N-A-S-H, the framework atoms exhibit larger relative deviations because of their smaller absolute displacements, while the end-of-trajectory absolute differences remain limited, typically within about 0.5–2.0 Å2 for Si and Al, and up to 1–3 Å2 for O. This result indicates that DPMD models not only achieve near-DFT accuracy within a single system but can also be effectively extended to chemically related systems through transfer learning, thereby capturing subtle dynamical differences. By contrast, the results from ClayFF are significantly lower than those of DPMD and AIMD, with particularly pronounced underestimation of the MSD of K and H, highlighting the systematic limitations of classical forcefields in describing hydrogen bonding, proton dynamics, and the interactions between large alkali cations and the aluminosilicate framework. The MSDs of the K-A-S-H model at other temperatures are shown in Fig. S11, where atomic mobility increases markedly with temperature.
The close consistency between DPMD and AIMD not only confirms the accuracy of DPMD models but also underscores the methodological advantage of transfer learning in modeling complex inorganic polymer systems. In contrast, although ClayFF captures qualitative trends, it systematically underestimates quantitative behavior and fails to reproduce the true dynamics of N-A-S-H and K-A-S-H. Therefore, transfer-learning-based DPMD emerges as a powerful and efficient approach that combines high accuracy with computational scalability, making it an ideal tool for investigating the dynamical properties of complex inorganic polymers.
![]() | (5) |
The distributions of the q-order parameter for the N-A-S-H system at 300 K calculated using different models are shown in Fig. 9, aiming to characterize the evolution of various types of coordination, including four-fold, five-fold, tetrahedral, and octahedral configurations, under thermal perturbation. Vertical dashed lines mark the reference q values corresponding to ideal configurations, facilitating the identification of deviations at different temperatures. The distribution results reveal a high degree of consistency between DPMD and AIMD in terms of peak positions, profile shapes, and the relative proportions of various coordination motifs. This agreement confirms that the DPMD model achieves structural resolution comparable to that of first-principles methods, demonstrating reliable physical accuracy. More quantitatively, the tetrahedral peak remains centered near q = 1.0 in both DPMD and AIMD, with a peak-position difference typically smaller than about 0.02–0.04. The neighboring defective or distorted motifs in the q range of approximately 0.3–0.9 also show only limited peak-position differences, generally below 0.2. The main residual discrepancy lies in the relative intensities of low-symmetry and defective local motifs, which indicates that DPMD captures the dominant local geometry accurately while retaining only limited statistical deviation in the distribution of distorted environments. At 300 K, the overall distribution is narrow and concentrated, with sharp peaks near the ideal values for tetrahedral and three-fold structures. This indicates that Si–O and Al–O tetrahedral units maintain well-defined geometries, serving as the primary contributors to structural stability at low temperature. The fractions of five-fold and octahedral configurations are small, suggesting that the system is predominantly composed of ideal tetrahedral units with high local order.48–50 In contrast, the ClayFF simulation exhibits pronounced deviations across multiple coordination types, particularly with anomalous differences in low-coordination regions. Additionally, the tetrahedral peak in ClayFF is both shifted and diminished in intensity, indicating a reduced ability to capture the correct local geometry. These discrepancies suggest inherent limitations in classical forcefields when describing the fine structural features of geopolymer systems, likely stemming from their simplified treatment of angular potentials and electrostatic interactions. Meanwhile, the q-order parameters of the N-A-S-H system at other temperatures are shown in Fig. S12, where tetrahedral structure distributions broaden with decreasing peak heights as temperature increases, and five-fold and low-coordination structures increase. These results indicate that the N-A-S-H framework remains dominated by tetrahedral local order at low temperature, while thermal activation mainly induces progressive local distortion rather than immediate collapse of the framework. This behavior is consistent with the RDF and MSD analyses, which show that the aluminosilicate network retains comparatively strong structural constraint in the presence of Na+.
Afterwards, the distributions of q-order parameters for the K-A-S-H system at 300 K from different models are presented in Fig. 10. In comparison with the N-A-S-H system, the framework cation in K-A-S-H is K+, whose larger ionic radius and weaker electrostatic interactions result in a more pronounced structural response under thermal perturbation.51–53 At 300 K, the distributions of four-fold and tetrahedral configurations are sharply peaked near their ideal q values, indicating that the system is predominantly composed of well-defined Si–O and Al–O tetrahedral units exhibiting pronounced local ordering. The fraction of three-fold and five-fold coordination is limited, likely associated with boundary regions or oxygen-deficient sites. Octahedral configurations are nearly absent, suggesting that the framework remains structurally stable at low temperature without significant coordination transitions. In both the DPMD and AIMD results, the tetrahedral configuration exhibits a distinct peak, suggesting that the Si–O and Al–O tetrahedral units in the K-A-S-H network preserve substantial geometric ordering at ambient temperature. Moreover, the distributions in the low- and high-coordination regions are largely consistent between the two methods, with no significant anomalies observed, suggesting that DPMD accurately reproduces the local structural features captured by AIMD. Quantitatively, the main tetrahedral-related peak positions remain nearly unchanged between DPMD and AIMD, with deviations generally smaller than about 0.04. In the distorted or defective coordination regions, the peak-position differences are also limited, typically below 0.2, whereas the relative intensity deviations are somewhat larger, particularly for five-fold and low-symmetry motifs. This indicates that the transfer-learned DPMD model accurately reproduces the dominant local structural motifs while still showing modest statistical differences in the populations of highly distorted environments. It is noteworthy that, although the K+ ion possesses a relatively large ionic radius and weak electrostatic binding, which could theoretically promote framework relaxation and distortion, neither DPMD nor AIMD reveals substantial coordination transitions or structural rearrangements at 300 K. This indicates that the system maintains good structural stability under low-temperature conditions. This behavior contrasts with that observed in the N-A-S-H system, which exhibits stronger tetrahedral dominance and lower coordination diversity at the same temperature. In comparison, the ClayFF simulation yields markedly different distributions across several coordination types, with pronounced differences in the three-fold and four-fold regions, and noticeable shifts in both the position and intensity of the tetrahedral peak. These discrepancies highlight the limitations of classical forcefields in accurately describing the local geometry of the K-A-S-H framework, likely due to simplified treatments of ionic size effects and angular potentials. Therefore, DPMD demonstrates a high level of agreement with AIMD in terms of structural resolution, effectively reproducing the local coordination distributions observed in first-principles simulations. This consistency validates the reliability of DPMD as an efficient surrogate for AIMD in modeling the K-A-S-H system. The q-order parameters of the K-A-S-H model at other temperatures are shown in Fig. S13, in which tetrahedral distributions broaden with decreasing peak intensity as temperature increases, and five-fold and octahedral configurations increase significantly (more pronounced than in N-A-S-H). Compared with N-A-S-H, the more pronounced broadening of the tetrahedral-related distributions and the stronger increase in distorted coordination motifs suggest that the K-A-S-H framework is more susceptible to local rearrangement. This provides complementary structural evidence that the weaker K+-framework interaction leads to a more flexible network, which is consistent with the enhanced dynamics and lower stiffness observed for K-A-S-H.
In summary, the strong agreement between DPMD and AIMD validates the feasibility and reliability of DPMD as an efficient surrogate for ab initio simulations in geopolymer modeling. The deviations observed in ClayFF further underscore the advantages of DPMD in chemically complex systems, providing a robust foundation for large-scale and long-timescale simulations of geopolymers.
| Properties | DPMD | ClayFF | Experiment54–57 |
|---|---|---|---|
| Density (g cm−3) | 2.06 ± 0.02 | 2.91 ± 0.05 | 1.98 ± 0.12 |
| Bulk modulus (GPa) | 36.10 ± 2.55 | 100.19 ± 6.16 | 16.32 ± 4.15 |
| Young's modulus (GPa) | 36.46 ± 2.37 | 118.59 ± 5.20 | 21.31 ± 5.24 |
| Shear modulus (GPa) | 13.69 ± 1.18 | 47.58 ± 3.78 | 8.58 ± 1.62 |
| Poisson ratio | 0.33 ± 0.04 | 0.31 ± 0.06 | 0.26 ± 0.02 |
It should be noted that both DPMD and ClayFF predict moduli that are generally higher than experimental values. This discrepancy can be attributed to several factors.58 First, MD simulations are typically based on idealized, homogeneous, and defect-free structural models, whereas real materials contain pores, microcracks, heterogeneous water distributions, and impurities, all of which significantly reduce macroscopic mechanical performance. Second, the preparation methods, curing conditions, and testing protocols of experimental samples strongly influence the measured moduli; for example, water content, curing temperature, and loading rate often lead to lower experimental values. Third, approximations made during the parameterization of forcefields may result in a slight overestimation of bond strength and framework rigidity, thereby yielding higher simulated values. It is worth noting that the density and Poisson ratio were not significantly affected by similar factors. This is mainly because the two properties differ remarkably from the modulus in their physical essence: density, as a fundamental physical property that characterizes the mass per unit volume of a material, has its simulated value dependent only on the spatial arrangement density and atomic mass of atoms, and is less sensitive to microscopic defects (e.g., pores) in the model. Even when an idealized pore-free structure is adopted for simulation, the calculated density can still maintain good consistency with the apparent density of real materials through the direct calculation of system mass and volume. The Poisson ratio reflects the ratio of lateral deformation to longitudinal deformation of a material under stress, and its magnitude is mainly determined by the anisotropy of interatomic bonding. Its susceptibility to macroscopic defects such as pores and microcracks is much lower than that of the modulus and other mechanical parameters that depend on the overall structural stiffness. Therefore, the aforementioned factors that cause the overestimation of simulated modulus values are not applicable to all performance parameters, which reflects the characteristic differences between different physical quantities in simulation-experiment comparisons.
A comparison between DPMD and ClayFF clearly shows that ClayFF imposes an excessively rigid description of interatomic interactions, leading to predicted moduli that are unrealistically high and physically unreasonable, as well as an overestimated Poisson ratio. In contrast, DPMD, by employing a DPMD model that more accurately fits the energy surface and atomic forces, significantly improves the description of both structural and mechanical properties while maintaining high computational efficiency. Notably, the moduli predicted by DPMD follow the experimental trends, and its dimensionless indicators such as the Poisson ratio are in close agreement with experimental data, further confirming its reliability in complex inorganic polymer systems. Therefore, DPMD not only outperforms the traditional empirical forcefield ClayFF in predicting mechanical properties but also achieves results that are overall closer to experimental observations. From a structural viewpoint, the relatively higher stiffness of N-A-S-H can be attributed to its stronger Na+-framework interaction, which promotes a denser packing state, a more constrained local coordination environment, and more stable bridging configurations within the aluminosilicate network. These features collectively enhance resistance to deformation under applied stress. This demonstrates that DPMD provides a more reliable and efficient tool for investigating the structure–property relationships of geopolymer systems, while also revealing the inherent shortcomings of traditional forcefields in modeling complex multicomponent materials and provides a solid theoretical foundation for understanding the structure–property relationships of N-A-S-H materials.
Afterwards, to further validate the generalization capability of the DPMD model, the mechanical properties of the K-A-S-H system are calculated. The DPMD potential is obtained through transfer learning from the N-A-S-H model, in order to examine its applicability in a different cationic environment, as summarized in Table 2.59–62 The density of the K-A-S-H system predicted by DPMD is 2.33 ± 0.03 g cm−3, which lies within the experimental range of 2.23 ± 0.11 g cm−3, whereas the value obtained from ClayFF (2.60 ± 0.07 g cm−3) is noticeably overestimated. The bulk modulus, Young's modulus, and shear modulus measured by DPMD are 29.32 ± 2.47 GPa, 31.80 ± 2.70 GPa, and 12.05 ± 1.09 GPa, respectively, which are notably below the respective values from ClayFF (51.90 ± 6.07 GPa, 61.20 ± 5.31 GPa, and 20.51 ± 2.55 GPa) and much closer to the experimentally reported range.59–62 It should be noted that the moduli predicted by DPMD are still slightly higher than experimental values, primarily because the simulated systems are typically idealized, dense, and defect-free, whereas real materials contain pores, microcracks, and heterogeneous water distributions that reduce macroscopic mechanical performance. In addition, experimental conditions such as curing methods and testing protocols can also influence the measured moduli. DPMD demonstrates high accuracy and physical reliability in the K-A-S-H system, thereby confirming the robustness and transferability of the proposed transfer learning strategy for modeling complex geopolymer systems. Compared with N-A-S-H, the lower stiffness of K-A-S-H can be structurally interpreted by its weaker cation-framework coupling, lower packing compactness, and larger free volume. The larger K+ ion induces a more weakly connected and more deformable network, which reduces resistance to elastic deformation and results in lower bulk, Young's, and shear moduli. In addition, the mechanical properties of both N-A-S-H and K-A-S-H at elevated temperatures (500–1000 K) are further evaluated, and the results reveal a clear temperature-dependent evolution governed by the competition between intermediate-temperature structural rearrangement and high-temperature thermal disorder; detailed data and analysis are provided in the SI.
| Properties | DPMD | ClayFF | Experiment59–62 |
|---|---|---|---|
| Density (g cm−3) | 2.33 ± 0.03 | 2.60 ± 0.07 | 2.23 ± 0.11 |
| Bulk modulus (GPa) | 29.32 ± 2.47 | 51.90 ± 6.07 | 27.32 ± 6.58 |
| Young's modulus (GPa) | 31.80 ± 2.70 | 61.20 ± 5.31 | 26.8 ± 2.2 |
| Shear modulus (GPa) | 12.05 ± 1.09 | 20.51 ± 2.55 | 10.34 ± 1.26 |
| Poisson ratio | 0.32 ± 0.02 | 0.33 ± 0.03 | 0.31 ± 0.04 |
It should be noted that although the DPMD predictions are overall closer to experimental data, the calculated moduli remain slightly higher than experimental values. This discrepancy primarily arises from the differences between simulated and real materials: MD simulations are typically performed on idealized, dense, and defect-free structures, whereas real geopolymers contain pores, microcracks, heterogeneous water distributions, and impurities, all of which reduce macroscopic mechanical performance. In addition, experimental conditions such as curing methods, temperature, humidity, and loading protocols can significantly influence the measured mechanical properties, often leading to lower values. Despite these differences, the DPMD results still capture the experimental trends and magnitudes with good fidelity, underscoring the reliability of the model in describing realistic chemical environments and mechanical responses.
From a methodological perspective, the successful application of transfer learning is of particular significance. Its significance lies not only in reducing the cost of extending a pretrained model to a related system, but also in demonstrating a coherent route for constructing system-specific MLP for geopolymer compositions of actual scientific and engineering interest. Compared with conventional forcefield development, where parameter sets are typically fixed and system extension often depends on separate reparameterization or empirical adjustment, the present strategy enables a more flexible and internally consistent description of chemically related amorphous geopolymer gels within the same first-principles-based framework. This is particularly valuable for geopolymer research, where cation chemistry, composition, hydration state, and temperature are all closely coupled to the phenomena under investigation. Traditionally, each new system requires extensive AIMD sampling and model training, which is computationally demanding. The present results show that a DPMD model trained on N-A-S-H can be efficiently adapted to K-A-S-H with only a limited quantity of additional data, while retaining high accuracy. This not only lowers the computational burden but also reveals the scalability and flexibility of deep potential methods in multicomponent systems. Going forward, this approach can be expanded to encompass a wider spectrum of geopolymer compositions, enabling unified modeling across diverse systems that are critical for practical applications. In this broader context, the present work is important not only because it reports accurate models for two representative systems, but because it establishes a transferable route for constructing machine-learning potentials around concrete materials problems, including composition optimization, cation substitution, and temperature-dependent performance prediction in waste-derived, low-carbon geopolymer binders. Specifically, it can be expanded to geopolymers with different alkali cations, like Li+ and Cs+, which are commonly encountered in industrial alkali activators. For these systems, the key lies in leveraging the structural similarity of the aluminosilicate framework while supplementing targeted DFT data that capture the unique coordination behavior and electrostatic interactions of specific cations, especially for cations with distinct ionic radii or charge densities that may induce significant framework relaxation. Furthermore, the approach can be extended to geopolymers with variable Si/Al ratios, which exert a direct impact on the degree of polymerization and mechanical properties of the framework. For such extensions, the training dataset should cover the structural diversity arising from different Si/Al ratios, including variations in tetrahedral connectivity and charge-balancing cation distribution, to ensure that the model generalizes across stoichiometric gradients.
When extending the applicability of this approach to other geopolymers, several critical considerations must be prioritized. Firstly, dataset representativeness is essential; the supplementary DFT data for the target system should cover the key thermodynamic states and structural configurations that are specific to that geopolymer. This ensures that the transfer-learned model captures the unique features of the target system without over-reliance on the parent N-A-S-H model. Secondly, transfer learning parameter optimization is crucial, adjusting the quantity of fixed layers within the neural architecture and the size of the supplementary dataset based on the similarity between the parent and target systems. For systems closely related to N-A-S-H, fewer frozen layers and a smaller supplementary dataset may suffice; for more distinct systems, increasing the number of trainable layers and expanding the supplementary dataset can improve adaptation. Thirdly, capturing multi-component interactions is necessary for complex geopolymers: the model must be trained to account for non-bonded interactions and chemical bonds involving impurity atoms or additives, which the original model may be unable to capture comprehensively trained on pure N-A-S-H.
Furthermore, as the model continues to be refined, it can be applied to more complex and harsh service environments. For example, under high temperature, humidity, or chemically aggressive conditions, geopolymers undergo significant microstructural evolution, including local crystallization, framework rearrangement, and pore structure changes. These processes are directly linked to long-term durability and service performance. DPMD-based MD simulations can provide atomistic insights into these mechanisms, offering a theoretical basis for understanding degradation behavior under extreme conditions. Such studies are of great importance for applications of geopolymers in nuclear waste immobilization, refractory materials, and long-duration infrastructure. In addition, the high accuracy and efficiency of DPMD models provide a foundation for future multiscale simulations. By coupling atomistic simulations with mesoscale or continuum models, it will be possible to achieve cross-scale investigations from the atomic to the engineering level, thereby enabling a deeper and more integrated understanding of the structure–property correlations of geopolymers. This approach not only helps to reveal fundamental mechanisms but also provides guidance for material design and optimization. With advances in computational power and data availability, it is foreseeable that microstructural simulations with near-DFT accuracy can be realized across the entire geopolymer family, supporting future research on structure–property relationships, durability prediction, and the development of new formulations.
Supplementary information (SI): details on plane-wave cutoff energy convergence tests, training loss and RMSE evolution of the deep potential models (including transfer learning for K-A-S-H), and comprehensive structural (RDF, MSD, q-order parameters) and mechanical property analyses of N-A-S-H and K-A-S-H geopolymers under varying temperatures, Si/Al ratios, water contents, and alkaline environments. See DOI: https://doi.org/10.1039/d6ta01407k.
| This journal is © The Royal Society of Chemistry 2026 |