Open Access Article
Jie
Shen
a,
Chenyu
Wang
a,
Libin
Chen
a,
Shaoqin
Jiang
*b,
Jianhui
Chen
a,
Cuilian
Wen
a,
Bo
Wu
a,
Baisheng
Sa
*a and
Lin-Wang
Wang
*c
aState Key Laboratory of Green and Efficient Development of Phosphorus Resources, Materials Genome Institute, College of Materials Science and Engineering, Fuzhou University, Fuzhou 350108, China. E-mail: bssa@fzu.edu.cn
bLonXun KuangTeng Technology Inc, Beijing, 100083, China. E-mail: jiangshaoqin@pwmat.com
cInstitute of Semiconductor, Chinese Academy of Science, Beijing, 100083, China. E-mail: lwwang@semi.ac.cn
First published on 19th January 2026
Machine learning force fields (MLFFs) are rapidly evolving to provide molecular dynamics simulations of molecules and materials with an accuracy comparable to ab initio methods, while significantly reducing the need for computational resources. However, conventional MLFFs are generally system-specific; introducing new chemical components requires assembling a new training dataset and retraining the entire model from scratch. To address this, we present a density-based machine learning force field (DBMLFF). The key advantage of DBMLFF lies in its modular parametrization strategy: by modeling each molecular species independently, the resulting force fields achieve seamless transferability across diverse chemical environments and retaining high accuracy without the need for retraining. This significantly improves model portability and cross-system applicability. Unlike most of the statistically based MLFFs, DBMLFF is a physics-based force field with machine learning components in it. It computes intermolecular interactions directly from electron density, enabling accurate descriptions of complex non-bonded behavior. In terms of computational efficiency, DBMLFF is three orders of magnitude faster than ab initio molecular dynamics and exhibits linear scaling with system size, allowing efficient simulations of large-scale systems. These features make DBMLFF a robust tool for multi-component electrolyte MD simulations, ideal for practical electrochemical systems with variable compositions and large scales.
Molecular dynamics (MD) simulations provide a unique perspective for elucidating microscopic interactions in electrolyte systems.7,8 By constructing atomic-scale interaction models, this technique enables the resolution of activation energy barriers for lithium-ion transport, dynamic reorganization of solvent coordination structures, and formation processes of interfacial electric double layers.9–11 However, conventional simulation approaches exhibit notable limitations: classical MD based on empirical potentials fails to accurately capture complex polarization effects between ions and solvents, often resulting in systematic deviations from experimental observations. Meanwhile, ab initio MD (AIMD), while offering higher accuracy, faces computational constraints that preclude simulations at practical operational timescales.12,13 This inherent trade-off between precision and efficiency significantly hinders in-depth investigations of electrolyte systems.
The emerging machine learning force field (MLFF) technology presents a novel solution to this dilemma.14–17 By integrating quantum chemical calculations with machine learning algorithms, MLFF achieves atomic-scale accuracy while dramatically enhancing computational efficiency.18–20 However, existing MLFF methods still face challenges in engineering applications.21,22 Current MLFFs typically exhibit a “system-specific nature,” wherein any change in system composition (such as the introduction of new components) necessitates computationally expensive retraining of the entire force field.23,24 This limited scalability poses a major bottleneck in the development of multi-component electrolyte systems.25,26 On the other hand, while general-purpose MLFFs (e.g., MACE, materials adaptive convolutional equivariants)27 demonstrate potential in terms of broad coverage, their predictive accuracy in complex molecular systems (such as electrolytes) still requires further improvement. For example, there are cases where the MD trajectories using such big models diverge, producing unphysical atomic configurations, and due to the large set of parameters, such models become slow and incapable of simulating large systems. This leaves the field grappling with the trade-off between generality and accuracy. Moreover, existing MLFFs typically assume that energy depends solely on local atomic environments, limiting their ability to accurately capture long-range electrostatic interactions.28,29 The predominant solution separates long-range electrostatic energy, commonly using point charge models, from the machine learning-fitted short-range energy. Other strategies such as environment-dependent charges30 or neural-network-predicted electronegativities31 have also been proposed. However, these still rely on the point charge approximation, resembling classical force fields.32 While the point charge model performs well at large intermolecular distances without electron density overlap, its accuracy degrades significantly at intermediate distances where electron cloud overlap becomes non-negligible. To address this limitation, our work introduces an explicit representation of electron charge density, aiming to overcome the inherent constraints of the point charge model in this regime.
To address these challenges, we developed the density-based machine learning force field (DBMLFF) simulation package for energy and force prediction tasks, as well as MD calculations.33 DBMLFF calculates intermolecular interactions based on fitted electron charge densities, while intramolecular energies are modeled using traditional atomic structure-based MLFF models. In addition, a local potential polarization model based on charge density calculations is introduced to describe polarization interactions between molecules. The methodology advances the field in two key aspects: first, it enables independent parametrization of individual molecular components in electrolyte systems. Following this, each molecular model can be seamlessly transferred across different chemical environments without retraining, significantly improving transferability and applicability. Second, by incorporating electron density distribution as a fundamental descriptor, DBMLFF achieves marked improvements in modeling non-bonded interactions, reaching quantum-chemical accuracy. Validations on typical electrolyte systems demonstrate that DBMLFF not only accurately captures atomic-scale solvation structures but also delivers a computational efficiency three orders of magnitude higher than AIMD simulations. This combination of accuracy, transferability, and efficiency opens a new technical pathway for investigating mesoscale electrolyte reaction kinetics.
Classical molecular dynamics simulations utilized GROMACS40 with the optimized potential for liquid simulations for all atoms (OPLS-AA) force field.41 Force field parameters were generated using Ligpargen,42 and initial atomic configurations were constructed via PACKMOL.43 Long-range electrostatic interactions were treated by the particle–particle particle–mesh (PPPM) method,44 while van der Waals interactions were modeled with Lennard-Jones potentials. Energy minimization using the conjugate gradient algorithm was performed to eliminate unphysical configurations in initial structures.
The self-diffusion coefficients of ions were calculated from the uncorrelated mean square displacements (MSDs) of individual ions using the following equation.45
![]() | (1) |
Ionic conductivity from MD simulations is calculated using the Einstein relationship:46
![]() | (2) |
![]() | (3) |
![]() | (4) |
The enthalpy of vaporization consists of the gas phase enthalpy value and liquid phase enthalpy value, which we compute using the equation49,50
| ΔHvap = Hgas − Hliq = Ugas + PVgas − Uliq − PVliq | (5) |
| ΔHvap ≈ Ugas − Uliq + RT = Ep−gas − Ep−liq + RT | (6) |
:
2). Freshly prepared electrolytes were utilized within two weeks to minimize potential decomposition.
| σ = L/RbA | (7) |
![]() | (8) |
![]() | (9) |
![]() | (10) |
,
and
are
functions with different parameters. Due to the different element types and the number of index α used, we typically have one to two hundred features for a given center atom.
The parameterization of our intramolecular MLFF incorporates a threefold constraint based on total energy, atomic forces, and atomic energy decomposition. A distinctive aspect of our approach is the decomposition of the DFT total energy into atomic contributions, which are then used as fitting targets; this strategy significantly increases the amount of valuable data available during the fitting process and reduces the need for lengthy ab initio simulations to generate training data. While these features could serve as inputs for fully connected neural networks, we find that a linear regression model is sufficient to produce highly accurate intramolecular MLFF. This can be attributed to two main reasons: first, our focus is solely on molecular configurations near equilibrium, without chemical reactions or bond breaking/formation, resulting in a relatively low-dimensional configuration space; and second, the combination of the index α and the extended element type σ provides a large number of features, a scenario in which linear models often perform well. Furthermore, linear regression models are easy to train and can be learned on-the-fly with minimal computational cost, making them well-suited as accelerators for MD simulations. The accuracy of this approach is demonstrated by the fitting results for ethyl acetate (EA) molecules in Fig. 2a and b, where total energy predictions yield the root-mean-square error (RMSE), mean absolute error (MAE), and coefficient of determination (R2) values of 0.045 eV, 0.027 eV, and 0.998, respectively, while atomic force predictions demonstrate RMSE, MAE, and R2 values of 0.142 eV Å−1, 0.092 eV Å−1, and 0.999, respectively. It should be noted that the intrinsic molecular properties presented in Fig. 2a and b, as described by the intramolecular MLFF trained on isolated molecules in vacuum DFT data, do not include intermolecular interactions or polarization effects, which will be incorporated in subsequent stages.
![]() | ||
| Fig. 2 (a) Total energy and (b) atomic force comparisons between DFT calculations and MLFF predictions for EA molecules in various configurations. (c) Electron density isosurfaces of EA molecule from DFT (upper) and spherical charge center model fittings (lower), with isovalues of 0.046, 0.001, and 0.0001 e bohr−1.3 | ||
![]() | (11) |
We randomly selected 50 distinct atomic configurations from AIMD simulations at different temperatures and fitted their DFT calculated charge density ρDFT(r) with the above model ρm(r). These 50 configurations were fitted collectively, with each contributing thousands of volumetric grid points to the fitting process. A linear fitting to minimize ∫[ρDFT(r) − ρm(r)]2d3r was used to yield the coefficient C(k,β). We have shown a comparison of the model fitted charge density and DFT calculated charge density in Fig. 2c for EA molecules. It shows that the charge density is well fitted not only in the large density areas but also in the small density tail regions. The small tail regions are important for intermolecular interactions as they modify the electrostatic interaction, as well as induce exchange coupling between the molecules. Note that the charge density tail extends 2 Å from atoms, a range comparable to molecule–molecule or molecule–ion distances; thus, density overlaps exert a notable influence, with the induced exchange coupling serving as a major source of intermolecular attraction. Notably, such overlap-induced interactions cannot be adequately captured by the simple point charge models used in most classical force fields. Residual errors, particularly in tail regions, may propagate to affect subsequent predictions. For instance, subtle mismatches in tail density (e.g., deviations in density decay 1–2 Å from atoms) could distort intermolecular electrostatic fields, leading to Coulomb energy biases. These, in turn, alter atomic forces and molecular trajectories, distorting properties like diffusion coefficients and viscosity. Similarly, fitting errors in density overlap regions (critical for exchange coupling) may modify intermolecular attraction, impacting liquid-phase cohesive energy and causing deviations in thermodynamic properties such as enthalpy of vaporization. While Fig. 2c only shows one atomic configuration, it is important to note that the same C(k,β) coefficients can generate electron charge densities for other configurations of the same molecule. Future work may test configuration-dependent C(k,β) coefficients in eqn (11) for optimization, but overall, interaction energies derived from the fitted charge densities are satisfactory.
Having developed a fit for the electron charge density of individual molecules for any given molecular configuration, the next step is to calculate intermolecular interactions. Our approach assumes that the total charge density of a system is the sum of the individual molecular charge densities, even if there are overlaps. This assumption has been confirmed in other theories, such as symmetry-adapted perturbation theory to calculate intermolecular interaction energies,57 and in the charge patching method.58 Given this assumption, to describe the total charge density of an aggregate of closed-shell molecules, it only needs to provide the charge densities for each molecule before polarization effects are considered. Conversely, we define the polarization electron charge as the difference from the simple summation result after an ab initio self-consistent field (SCF) calculation of the whole system. To further demonstrate this point, we tested two dimethyl ether (DME) molecules, by calculating the total charge density when they are close to each other and compared it to the sum of the individual molecule densities. The results are shown in Fig. 3a–c when they approach each other. To estimate the overlap of the two molecule charge densities, we have defined an overlap charge density as:
![]() | (12) |
| ρe(r) = ρ12(r) − ρ1(r) − ρ2(r) | (13) |
Using the sum of the individual molecule charge density ρm(r) to yield the total charge density ρtotal(r), we can now calculate the intermolecular interaction as follows:
![]() | (14) |
. The functional E(ρ) uses the charge density to calculate the following energies:![]() | (15) |
| ρtotal(r) = ρe(r) + ρnucl(r) | (16) |
In summary, we have expressed intermolecular interactions by constructing the system's charge density via superposition of pre-fitted molecular densities and computing the interaction energy using an energy density functional. As shown in Fig. 3d–f, the DME–DME molecule interaction energies calculated from this approach are in close agreement with direct DFT calculations. In the direct DFT calculation, the binding energy is calculated from:
| Etot(DFT) − Em1(DFT) − Em2(DFT) | (17) |
Herein, we combine the intramolecular MLFF with an intermolecular density-based interaction model to construct a complete force field for MD simulations. Notably, polarization potential is not incorporated in the current model. However, for systems without strong electrostatic interactions (e.g., those containing no charged ions), the polarization effect is expected to be relatively weak, rendering the model reasonable. To validate this, we present the energy profile of two EA molecules during AIMD simulations, while the DBMLFF was employed to compute the potential energy (excluding nuclear kinetic energy) along these trajectories. Specifically, independent ab initio molecular dynamics simulations were performed for five pairs of EA molecules with distinct orientations and positions, each lasting 100 fs at a time step of 0.5 fs. Subsequently, the DBMLFF was used to infer the energy of all 1000 snapshots extracted from these AIMD simulations. As illustrated in Fig. 4, the energies computed by the DBMLFF exhibit quantitative agreement with the AIMD reference data (RMSE = 0.068 eV, MAE = 0.055 eV, R2 = 0.950), accurately capturing both global trends and local features.
To accurately capture electronic relaxation under local external potentials, we implement a spatially resolved local electric field model, superseding conventional uniform field approximations. The polarization energy for individual molecules is formulated as:
![]() | (18) |
| Eh(i) = ∫Vext(r)exp[−(r − Ri)2/ω2]rhd3r | (19) |
| Vext(r) = aerf(d)/d + bexp(−d2/ω2) | (20) |
By integrating the polarization model, we coupled the polarization energy from eqn (18) with coulombic, exchange, and kinetic energy terms in eqn (14) to construct the total interaction energy. Systematic evaluations were performed for EA dimer systems (Fig. 5c) and EA-Li+ complexes (Fig. 5d), covering 200 distinct configurations for each system with varying displacements and orientations. Note that we have revised the prefactor α of the van Weizsäcker kinetic energy term in eqn (15) from 0.04 to 0.025. This adjustment is based on our observation that the contribution of this kinetic energy term is consistently negative and sometimes compensates for the polarization energy to some degree. Thus, in subsequent work where the polarization energy is included, we will adopt α = 0.025. As shown in Fig. 5c and d, we calculated the intermolecular and molecule–ion interactions (obtained by subtracting the energies of isolated components from the total system energy) in two systems using both DBMLFF and DFT methods. The results demonstrate excellent agreement between DBMLFF and DFT in both systems, with RMSE of 25 meV and 77 meV, respectively. Remarkably, Li+-containing systems exhibit interaction energies reaching −2 eV magnitude. The low RMSE values, particularly in the strongly interacting Li+ system, demonstrate that our polarization model effectively captures ion–molecule interactions. Any residual error in polarization treatment inherently disrupts the delicate balance between charge-transfer attraction and short-range repulsion. Such deviations propagate through atomic forces to molecular dynamics simulations, thereby impairing the predictive accuracy of dynamic properties like diffusion coefficients and viscosity. Particularly in strongly coupled systems (e.g., concentrated electrolytes), errors tend to amplify, exerting a more pronounced impact on the prediction of key properties such as ion-pair structures, free ion concentration, and ionic conductivity.
While the LDA functional was chosen for its computational efficiency and simplicity to test our charge-density-based approach, we acknowledge its inherent limitations. The use of LDA exchange–correlation functional and the spherical charge density approximation may limit accuracy for systems with significant charge transfer or anisotropic interactions. To quantitatively assess these limitations, we compared the performance of our DBMLFF model against GGA using a model charge-transfer system (the Li+–EA complex), as shown in Fig. S9. The results indicate that while LDA underestimates the binding energy by approximately 4.3% compared to PBE-D3, the deviation between the DBMLFF predicted values and the PBE-D3 benchmarks remains within an acceptable margin. Furthermore, for anisotropic systems such as DME molecular pairs with varying orientations and positions, the spherical density approximation introduces minor deviations in intermolecular interaction energies. Therefore, the relative interaction trends predicted by our model remain consistent with those obtained from higher-level functionals.
log(N), where N is the system size. However, this global FFT constitutes only a minor fraction of the total computational cost and is therefore a secondary contribution. Notably, even classical force fields that employ methods like the fast multipole63 or particle–particle particle–mesh44 for long-range electrostatics exhibit the same N
log(N) scaling for Coulomb interactions. For typical electrolyte systems comprising over 200 molecules, DBMLFF achieves three orders of magnitude acceleration compared to AIMD methods, as quantitatively demonstrated in Fig. 6b.
To confirm the performance of DBMLFF for actual liquid aggregated states, we therefore conducted comparative simulations on an electrolyte system containing 10 EA, 15 TTE, and 5 LiFSI, with a total atomic number of 460 for the whole system. This system, characterized by strong electrostatic/dipole interactions and polarization effects, serves as an exemplary platform for methodological validation. In modeling Li+, we only performed charge density fitting (Fig. S1) without including a polarization energy term. This is because Li+, as a closed-shell ion, has highly localized electron density and negligible polarizability, and is thus treated as a source of the polarizing field rather than a polarizable species. TTE (Fig. S2) and FSI− (Fig. S3) were processed using the same fitting protocol as EA. All simulations were performed at 500 K for 5 ps under identical initial conditions to ensure sufficient mixing and mobility, with representative configurations shown in Fig. S10a. Radial distribution function analyses (Fig. 6c and d) demonstrate excellent agreement between DBMLFF and AIMD results, whereas GROMACS simulations show shortened Li–O bond lengths with intensified first peaks. To quantitatively assess the accuracy and temperature transferability of the DBMLFF methodology, we performed DFT single-point energy calculations on 150 configuration snapshots systematically sampled from three independent DBMLFF-generated molecular dynamics trajectories at 300 K, 400 K, and 500 K (50 snapshots from each 5 ps trajectory). A comprehensive comparison was conducted on potential energy surface distributions (including the polarization term) and atomic force characteristics across this thermally diverse dataset. As shown in Fig. 6e and f, the DBMLFF calculations maintain excellent agreement with DFT benchmark data across all temperature regimes: the consolidated analysis yields an RMSE of 0.003 eV per atom (MAE = 0.002 eV per atom, R2 = 0.973) for total energy, while atomic forces exhibit an RMSE of 0.207 eV Å−1 (MAE = 0.136 eV Å−1, R2 = 0.949). The consistent performance across this 200 K temperature range demonstrates the robustness of the DBMLFF approach for molecular simulations under varying thermodynamic conditions. In comparison, the MACE shows improved force prediction accuracy (RMSE = 0.354 eV Å−1, Fig. S11b) over conventional GROMACS simulations (RMSE = 1.412 eV Å−1, Fig. S11a), yet still falls short of the DBMLFF performance, further underscoring the marked superiority of the DBMLFF approach in modeling atomic-scale interactions.
To assess reliability and transferability in extended electrolyte simulations, we selected the well-studied LiFSI/EGDME/TTE system. Since DBMLFF models for LiFSI and TTE were already established in our previous work, only the newly introduced EGDME molecule required parametrization (fitting procedure shown in Fig. S4). We constructed an initial configuration by randomly packing 24 EGDME, 60 TTE, and 20 LiFSI molecules (totaling 1664 atoms) at the experimental density of 1.49 g ml−1, with a representative snapshot provided in Fig. S10b. Subsequently, we performed 700 ps MD simulations at 300 K using both DBMLFF and the classical OPLS-AA force field in GROMACS. Ion diffusion coefficients were calculated from MSD analysis during the final 550 ps following adequate equilibration (Fig. 7a and b). The MSD curves exhibit clear linear regimes throughout this period, confirming well-converged diffusive behavior. It should be noted that all reported diffusion coefficients, for both DBMLFF and GROMACS, are presented without finite-size corrections, as our primary objective was to conduct a direct comparison under identical simulation conditions. To validate the simulations against experiment, we conducted pulsed field gradient nuclear magnetic resonance (PFG-NMR) measurements on the same electrolyte. The self-diffusion coefficients derived from Stejskal–Tanner equation analysis were 1.377 × 10−10 m2 s−1 for Li+ and 1.180 × 10−10 m2 s−1 for FSI− (see NMR spectra in Fig. 7c and d). The experimental ionic conductivity was obtained from Nyquist plots (Fig. S13). As quantified in Table S1, DBMLFF demonstrates significantly better agreement with the experimental diffusion coefficients and ionic conductivity than the classical force field, confirming its superior predictive accuracy.
We further extended our assessment to ionic mobility in a lithium hexafluorophosphate (LiPF6)/ethylene carbonate (EC)/dimethyl carbonate (DMC) system (80 EC, 60 DMC, 10 LiPF6) via 2.5 ns MD simulations at 300 K (representative configuration in Fig. S10c). The fitting procedures for EC, DMC, and PF6− are detailed in Fig. S5–S7. The diffusion coefficients from MSD analysis (Fig. S14) again show that DBMLFF quantitatively aligns the established experimental trends64 and offers a marked improvement over classical force field prediction (Table S2). The observed systematic deviations may originate from the simulation errors due to limited simulation times and system sizes, the intrinsic error of the DBMLFF model, as well as the possible experimental uncertainties. In the future more systematic studies might be necessary to analyze the source of remaining errors.
To evaluate force prediction stability in extended systems, we randomly selected 20 snapshots from the DBMLFF molecular dynamics trajectories of the LiFSI/EGDME/TTE and LiPF6/EC/DMC electrolyte systems and performed a comparative analysis of atomic forces using DFT calculations and MACE predictions. The results (Fig. S15) demonstrate that in both electrolyte systems, DBMLFF achieves significantly better agreement with DFT reference forces than MACE, along with markedly lower prediction errors. More critically, the same LiFSI and TTE models were employed in both the LiFSI/EA/TTE system (Fig. 6d) and the LiFSI/EGDME/TTE system (Fig. S15a). Despite the change in solvent from EA to EGDME, which alters the chemical environment, the forces predicted by the DBMLFF models remain in excellent agreement with DFT. This robust performance, maintained without the need for retraining, strongly demonstrates the outstanding transferability of the DBMLFF framework.
Beyond electrolyte applications, we envision that this machine learning potential based on electron density descriptors will pioneer new directions in force field development for molecular simulations, particularly for systems where intermolecular interactions govern macroscopic properties. To validate the accuracy and general applicability of DBMLFF in describing intermolecular interactions, we computed evaporation enthalpies for five representative molecules. Specifically, for each molecule, we constructed a liquid system containing 100 molecules at experimental density and performed 2 ns molecular dynamics simulations at 298 K using DBMLFF and GROMACS, respectively. Each system was independently simulated three times to obtain statistical averages. The potential energy of gaseous molecules was derived from 20 ps molecular dynamics simulations at 298 K in large-sized simulation boxes (ensuring no intermolecular interactions). The fitting process of DME is shown in Fig. S8. As shown in Table S3, the evaporation enthalpies calculated using DBMLFF and the traditional force field were compared with experimental values. The results demonstrate that DBMLFF significantly outperforms the traditional force field (although traditional force fields often use such experimental vaporization energies as fitting targets), indicating its superior ability to accurately describe intermolecular interactions.
Supplementary information (SI): figures and tables that support the key findings reported in this work. See DOI: https://doi.org/10.1039/d5dd00508f.
| This journal is © The Royal Society of Chemistry 2026 |