Open Access Article
Philipp Stärk
ab,
Henrik Stooß
c,
Marcel F. Langer
d,
Egor Rumiantsev
d,
Alexander Schlaich
c,
Michele Ceriotti
d and
Philip Loche
defg
aStuttgart Center for Simulation Science (SC SimTech), University of Stuttgart, 70569 Stuttgart, Germany
bInstitute for Computational Physics, University of Stuttgart, 70569 Stuttgart, Germany
cInstitute for Physics of Functional Materials, Hamburg University of Technology, 21073 Hamburg, Germany. E-mail: alexander.schlaich@tuhh.de
dLaboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland. E-mail: michele.ceriotti@epfl.ch
eDepartment of Physics, Technical University of Munich, Garching, Germany. E-mail: philip.loche@tum.de
fAtomistic Modelling Center, Munich Data Science Institute, Technical University of Munich, Germany
gMunich Center for Machine Learning (MCML), Munich, Germany
First published on 18th June 2026
Long-range interactions and electric response are essential for accurate modeling of condensed-phase systems, but capturing them efficiently remains a challenge for atomistic machine learning. Traditionally, these two phenomena can be represented by static charges that underlie Coulomb interactions between atoms, and dynamic charges such as atomic polar tensors—aka Born effective charges—describing the response to an external electric field. We critically compare different approaches to learn both types of charges within a single model architecture, taking bulk water and water clusters as paradigmatic examples: (1) learning them independently; (2) coupling static and dynamic charges based on their physical relationship with a single global coupling constant to account for dielectric screening; (3) coupled learning with a local, environment-dependent screening factor. In the coupled case, correcting for dielectric screening is essential, yet the common assumption of homogeneous, isotropic screening breaks down in heterogeneous systems such as water clusters. A learned, environment-dependent screening restores high accuracy for the dynamic charges. However, the accuracy gain over independent dynamic predictions is negligible, while the computational cost increases compared to using separate models for static and dynamic charges. This suggests that, despite the formal connection between the two charge types, modeling them independently is the more practical choice for both condensed-phase and isolated cluster systems.
Several strategies have been developed to endow MLIPs with electric-field awareness. One class of methods learns dipoles or local dipole contributions,9–14 but these approaches face fundamental challenges: for periodic systems, dipoles are defined only modulo a polarization quantum,12,15,16 which requires care during training, as the loss function must account for this multi-valuedness. Other approaches bypass these issues by learning scalar effective charges17,18 or tensorial quantities such as Born effective charges (BECs), also called atomic polar tensors (APTs).19–21 These models couple external fields to learned charges, generating auxiliary field-dependent forces and thus avoid the conceptual ambiguities of polarization-based methods. BEC-based models perform well in capturing the linear electric response. However, these models typically neglect an important physical ingredient: the external electric-field response of atoms is intimately linked to the electrostatic interaction between atoms. Effective charges used to model long-range Coulomb forces reflect internally screened interactions, whereas BECs describe the unscreened response of the electron density to an external field. Crucially, the distinction between internal (screened) and external (unscreened) responses becomes pronounced in heterogeneous systems such as clusters, interfaces, or molecular mixtures, where assuming a single homogeneous screening value may not be sufficient.
Besides electric-field response, long-range electrostatics have already been incorporated in MLIPs using fixed charges,22,23 ML-fitted surrogate charges such as Hirshfeld charges,24,25 dipole-matching schemes,26 or global charge-equilibration networks.17,18,27,28 Recently, differentiable Ewald summation frameworks29,30 have enabled end-to-end charge learning, removing ambiguities in charge assignment.31–33 A few recent studies attempt to combine long-range electrostatics with electric-field response, for example by deriving BECs from end-to-end learned internal charges.34,35 These approaches generally assume homogeneous isotropic screening, which can provide decent qualitative behaviour in bulk systems but may fail in heterogeneous environments, limiting their applicability for general-purpose ML potentials.
This breakdown raises a central open question: should static (effective) and dynamic (BEC) charges be learned together, constrained by their physical relationship, or treated as independent quantities? Here, we address this question by systematically evaluating both coupled and uncoupled strategies in long-range ML architectures, including cases with environment-dependent screening, to clarify where and why homogeneous screening assumptions break down and how accuracy can be restored. We critically compare different models in terms of accuracy, interpretability, and computational cost, using high-quality finite-field reference data for bulk water as a prototypical polar liquid with well-characterized experimental reference data including IR spectra36 and water clusters. Water clusters are of direct scientific relevance as precursors of atmospheric ice-nucleating particles37 and as models for aerosol droplets in the context of airborne pathogen transmission.38
![]() | (1) |
α is the α-component of the system polarization density, rβi the β-component of the position of atom i, V is the system volume, Fβi the force on atom i and Eα an externally applied field. The symbols α, β ∈ {x, y, z} specify Cartesian components of the tensors. The right-hand side of eqn (1) motivates the use of these charges for describing electrostatic interactions with external fields, as it represents the corresponding linear-response coefficient—the change in force induced by an applied external field.
We consider for the moment non-periodic systems, where we define partial charges qIRi that reproduce the polarization P =
V,
![]() | (2) |
This allows us to relate BECs Zi to static charges qIRi through39
![]() | (3) |
![]() | ||
| Fig. 1 Schematic illustration of the two different model architectures. (A) Shows the architecture of an “uncoupled model”, where BECs Z and static charges q are treated independent of one another. (B) Shows a coupled architecture where the learned pseudo charges qi are related to the BECs through the polarization P, with either a global γθi = const. or local screening parameters γθi. The final output of each model is the total force per atom Fi and the electrostatic enthalpy H (see eqn (11)), which reduces to the internal energy H = U in the case of no external fields, E = 0. | ||
![]() | (4) |
![]() | (5) |
Beyond this physical definition of Ulr, we also consider a more flexible long-range formulation introduced by us in ref. 44. Here, the electrostatic potential is processed by a learnable function fθ, yielding
![]() | (6) |
This modification, referred to as “long-range representations with equivariant messages” (LOREM) has been shown to significantly reduce errors across multiple prediction tasks.44 We note that in this work we restrict the architectures to learned scalar pseudo charges only, as opposed to the more general equivariant tensorial pseudo charges proposed in ref. 44. When the learnable function is set to be V({qj,rj}j)qi, the model reduces to eqn (5) and retains interpretability of the pseudo charges. We refer to this latter version as the physical long-range architecture.
The simplest extension of long-ranged MLIPs to also predict BECs is to treat them as independent outputs without connection to the static pseudo charges. An architecture capable of this is shown in Fig. 1A. Here, the pseudo charges qi(ξi) are predicted solely for computing Ulr, while the BECs constitute a second, independent prediction target derived from the same local environment representation ξi. This design allows the model to learn static and dynamic charge responses separately, without enforcing any explicit relation between them.
Because we employ an equivariant descriptor ξi, predicting Zi becomes straightforward. To obtain the Born effective charge tensor Zαβi from the local atomic environment, we map the equivariant spherical-harmonic features of each atom through a small neural module. The descriptor (restricted to features up to two angular channels) is first processed by several dense layers applied to the spherical channels. An equivariant tensor-coupling step based on Clebsch–Gordan coefficients mixes the angular components to form all symmetry-allowed rank-2 contributions, which are subsequently linearly recombined into a 3 × 3 matrix, which is fitted to BEC labels derived from DFT (see Methods Section 5.1).45 This construction ensures that the predicted tensors transform correctly under rotations. Such an approach is essentially the extension of previous works19,46 to long-ranged architectures.
| qIRi = γqi(ξi). | (7) |
Here, the coupling parameter γ can be interpreted in terms of the high-frequency dielectric permittivity,
.34,48,49 The underlying assumption of the latter relation is that learned pseudo charges correspond to physically interpretable partial charges and has been shown to allow for a posteriori prediction of Zi in a large variety of systems, if one assumes ε∞ between 1 (isolated molecules) and 1.83 (bulk water).34,35 Here, we will refrain from interpreting the screening parameter as directly related to ε∞. Instead, we simply treat γ as a learnable parameter that scales the learned pseudo charges to infrared charges. This scaling is determined by requiring that the resulting charges reproduce BEC targets obtained from DFT calculations. However, as we will show below, the assumptions of homogeneous and isotropic screening break down in inhomogeneous environments such as interfaces, where the dielectric response is well known to become both anisotropic and spatially varying.50–53 A simple way to resolve the homogeneity assumption is to introduce a local screening factor,
| qIRi = γi(ξi)qi(ξi), | (8) |
Using the proposed coupling relations eqn (7) and (8) to predict Zi via eqn (3) is the basis for the second family of MLIPs that we study, the coupled models. These models, shown in Fig. 1B, use the predicted qi(ξi) to construct the BECs. We explore two variants of these coupled models. (i) In the coupled, global approach, the MLIP is first trained only on energies and forces. Afterwards, a single global screening parameter γ (see eqn (7)) is fitted a posteriori to minimize the BEC error on the validation set. We intend this strategy to closely follow the strategy of ref. 34 and 35. (ii) In the coupled, local approach, we drop the assumption of a uniform screening factor. Instead, the model predicts a local, environment-dependent γi (see eqn (8)), learned jointly with energies, forces, and BEC labels. This allows us to test whether relaxing the homogeneity assumption improves BEC prediction accuracy, especially in inhomogeneous systems.
For all coupled architectures an issue arises when incorporating periodic boundary conditions. A naive definition of the polarization density via eqn (2) and (7) can result in ill-defined values, due to the periodic boundaries. This problem can be mitigated in molecular systems by learning molecular dipole contributions,54 but this approach is incompatible with the atom-centered, point-charge picture adopted by the models we consider in the present work. Zhong et al.34 solved this by utilizing an artificial complex phase. We instead calculate Zi from the learned pseudo charges via their positions by
![]() | (9) |
![]() | ||
| Fig. 2 Scatter plot of diagonal components of the BECs Zαα comparing DFT values with model predictions for bulk water (A) and water clusters (B). | ||
Overall, the uncoupled and the coupled model with local screening (γi) both reproduce the BEC labels accurately for the bulk and for the clusters. The global-screening model, however, shows a clear deterioration in performance on both subsets. This stems from the global model's homogeneous-screening assumption: as expected from our discussion above, a single γ cannot simultaneously capture the different effective dielectric responses present in homogeneous bulk and inhomogeneous cluster environments. For this global-γ model—where γ was obtained by an a posteriori fit to the BEC labels of the validation-set—we find γ = 1.974. While one could reduce the error further by fitting distinct screening factors for bulk and cluster subsets, this strategy requires identifying and splitting the dataset into several classes based on their structure. As dataset size and diversity grow, such manual classification becomes increasingly impractical.
Furthermore, the local screening varies strongly in clusters. Fig. 3 shows the distributions of local screening values γi predicted by the coupled local-γi model as a function of the number of molecules Nmol per cluster. For small clusters the predicted γi values are tightly distributed, whereas the distributions broaden as cluster size increases. Notably, we observe pronounced differences for dangling OH groups, highlighted by the color-coded structures in Fig. 3A. This trend is also reflected in the bimodal distribution of hydrogen-atom γi values in cluster structures, which we attribute to the distinction between hydrogens engaged in hydrogen bonding and those exposed at the cluster surface.
![]() | ||
| Fig. 3 (A) Snapshot of water clusters with 6 and 20 molecules. Atoms are colored according to their local screening values γi. An interactive view for the whole validation set is provided online https://doi.org/10.24435/materialscloud:fs-8h. (B) Distribution of the local screening values from the coupled γi model as a function of water cluster sizes. Gray dashed line shows the average value of γi for bulk structures. | ||
In Fig. 3B, the dashed gray line indicates the mean screening value for bulk structures,
i = 1.22 ± 0.09, which the γi values approach for larger clusters, as expected. Together, these findings provide a physically interpretable explanation for the coupling between learned pseudo charges, infrared charges, and BECs. Accurate modeling in heterogeneous environments requires this coupling to depend on the local atomic environment. By contrast, a single global screening parameter fails to capture the variability present in inhomogeneous structures.
In order to estimate the general fidelity of the different MLIP architectures, we compare the different models in terms of their mean absolute errors (MAEs) on validation sets for energies, forces and BECs (Fig. 4). Discussing first the physical models, we find differences in energy and force errors to be minor, with the uncoupled model performing overall the best. As already observed in Fig. 2, the coupled global γ model performs significantly worse on BEC predictions, while the uncoupled and coupled local γi models achieve similar accuracies for predicting BECs, underscoring the need for environment dependent coupling between learned pseudo charges and BECs for these systems. Importantly, modeling the coupling increases the computational cost by a constant prefactor relative to the uncoupled model, while the asymptotic scaling with system size remains unchanged (see Fig. S1 in the SI).
So far, all models were trained utilizing the physically motivated Coulomb interaction of eqn (5). To investigate if a more expressive but less physically interpretable model utilizing eqn (6) may perform even better, we also trained models with the LOREM formulation. As can be seen in Fig. 4, this modification improves performance on all targets with a particularly strong improvement in bulk potential energy errors. The parity plot (similar to Fig. 2) for the LOREM models are presented in the SI (Fig. S2), showing qualitative agreement with the physical models and mirroring the need for an environment dependent coupling parameter to achieve good BEC accuracy on cluster and bulk structures. However, as shown in Fig. S3, the local screening values γi predicted by the LOREM models are no longer physically interpretable due to the learnable mapping fθ.
Finally, we assess the behavior of the different physical models in predicting infrared spectra from molecular dynamics simulations. Computational details are provided in method Section 5.3. Fig. 5 shows the imaginary part of the complex susceptibility for periodic bulk water at T = 300 K and for the cage and book configurations of water hexamer clusters at T = 10 K. The spectra are organized into two columns: the left column (panels A, C, E) shows the OH-bending mode around ∼50 THz, while the right column (panels B, D, F) shows the OH-stretching mode around ∼100 THz. We note that nuclear quantum effects are neglected, so comparison with experiment is qualitative, although error cancellation in GGA functionals yields reasonable agreement for bulk water. To contextualize the sensitivity of spectra to the predicted BECs, we also provide results for an analysis based on constant scalar charges. To this end, we calculate the average diagonal component of the BECs per element and assign this charge to all O and H atoms for the analysis of the entire trajectory. The results of this procedure are given by the purple lines in Fig. 5.
![]() | ||
| Fig. 5 Imaginary part of the susceptibility spectrum of periodic bulk water at T = 300 K (A + B), cage (C + D) and book (E + F) configuration of water hexamer clusters at T = 10 K. We show results for the three model architectures discussed in this work and for reference the result of a simple constant scalar charge analysis (purple line). The black dashed line shows experimental bulk reference spectrum taken from ref. 36. | ||
Focusing first on a comparison of models in the bending mode in panels A, C, E, a consistent hierarchy emerges: the fixed charge model (based on atom-type averaged charges), global screening (γ), local screening (γi), and uncoupled models estimate the bending mode amplitude in decreasing order. In bulk water, where we also provide an experimental reference, this ordering is particularly evident. The global γ model strongly overestimates the bending intensity (A) and underestimates the stretching band (B), which is additionally shifted to higher frequencies. The fixed-charge model shows similar but slightly more pronounced deviations from the experimental reference.
In contrast, both the γi and uncoupled models reproduce intensities and peak positions much more accurately, in closer agreement with the experiment. Discussing in more detail the results for the water hexamers, we find similar but slightly less pronounced trends. As the spectral features of these isomers have been discussed extensively elsewhere,36,57,58 we focus here on differences between models. Overall, all models reproduce the main spectral features, indicating consistent nuclear dynamics. Differences arise mainly in intensities and subtle frequency shifts. The bending-mode intensity again increases with decreasing model flexibility (C, E), following the same hierarchy as discussed above, while differences in the stretching region (D, F) are more subtle. The LOREM models give spectra supporting the same conclusion, as shown in Fig. S4 in the SI. In summary, these results are consistent with the model errors discussed above, in that the global screening model shows the largest deviations in predicted infrared spectra, including for the experimentally accessible bulk water case. This reinforces the conclusion that a more local treatment of the coupling between latent and dynamic charges, as realized in the γi model, or the complete decoupling in the uncoupled model, more accurately captures the charge redistribution underlying IR activity, particularly for the OH bending/stretching part of the spectrum, where the dynamic part of BEC contributions are expected to be pronounced. At the same time, the overall spectral shape is already reasonably well reproduced even with the simplified fixed-charge analysis, indicating that IR spectra are primarily governed by the nuclear dynamics and the symmetry of vibrational modes, while still reflecting the improved physical consistency of more flexible charge-coupling schemes.
In this work, we systematically examine this question within models that incorporate explicit electrostatic interactions. Using water clusters and bulk water as controlled test systems—where high-quality reference data are available and long-range polarization effects are essential—we demonstrate that even in this comparatively simple setting, static charges learned as coefficients of a Coulomb term are only correlated with Born effective charges (BECs). As we show, the relationship is not captured well with a single factor across heterogeneous environments, breaking down when fitting on systems such as clusters of varying size. We show that restoring quantitative agreement within coupled models for heterogeneous datasets requires employing spatially varying screening coefficients. While this improves accuracy, it removes the anticipated simplicity of the physically constrained approach and does not provide a clear computational advantage over directly learning BECs as an independent target. In contrast, treating dynamic charges as a separate, well-defined observable yields higher accuracy, improved infrared intensities, and lower complexity.
Even though it is tempting to avoid the explicit calculation of reference BECs, especially given that observables such as IR spectra are comparatively insensitive to moderate errors in the predicted dynamical charges, our results show that measurable and systematic differences nevertheless emerge, in particular for bulk water where experimental reference data are available. In this case, models incorporating local screening or decoupled charge responses reproduce the relative intensities and peak positions more accurately than approaches based on global screening or fixed charges. Our results indicate that whenever heterogeneous datasets are considered, quantitative accuracy is strongly improved by explicit training on BECs. For these reasons, we recommend (i) using electrostatic potentials to capture long-range interactions without attributing physical meaning to static charges, which are inherently model-dependent quantities, and (ii) learning dynamic charges explicitly and independently from well-defined ab initio BEC data.
For consistent labeling of the structures, we recalculated energies, forces and BECs for all structures with DFT at the revPBE-D3 level using the CP2K package.65 These calculations employed the revised Perdew–Burke–Ernzerhof (PBE) exchange–correlation functional, a DZVP-MOLOPT atom-centered basis set, and Goedecker–Teter–Hutter (GTH) pseudopotentials.
To compute BECs via finite electric fields we start from the right-hand-side of eqn (1), indicating that BECs can be obtained as the derivative of atomic forces with respect to externally applied fields. This relation, strictly only true in the infinitesimal limit, can be approximated via central finite differences, giving us
![]() | (10) |
![]() | (11) |
![]() | (12) |
Finally, training of all models is performed by constructing a typical force and energy loss function via an L2 norm, i.e.
![]() | (13) |
![]() | (14) |
For the short-range features, we follow the architecture described in ref. 44. All models use a cutoff radius of rc = 5.0 Å and spherical harmonics up to degree 6, with eight spherical channels. The radial dependence is represented using 32 radial basis functions, and the chemical embedding uses eight channels. We employ a cosine cutoff to ensure smooth behavior at the cutoff distance, and perform the radial expansion using basic Bernstein basis functions.70 The network uses the SiLU activation function. No message passing is applied, and for each atom a single floating-point value is predicted, representing its pseudo charge.
Training is performed using the Adam optimizer with an initial learning rate of 6 × 10−5 and a batch size of one structure. Optimization proceeds until the loss no longer improves significantly. For validation, we use a random 80
:
20 train–validation split.
For the training of the coupled models with global screening parameter γ, we first train the model purely on energy and force labels. Utilizing the pseudo charge predictions of this model (through eqn (3) for non-periodic structures and eqn (9) for periodic structures) we determine a single γ value which minimizes the absolute error on BEC predictions on the entire validation set, including clusters and bulk structures simultaneously. This value is then used for all subsequent evaluations and simulations of the model.
![]() | (15) |
![]() | (16) |
![]() | (17) |
(ω) is the Fourier transform of the polarization time series and where Lt is the length in time of the polarization P(t).
For the reference analysis method utilizing fixed charges, we use the trajectory obtained from the uncoupled model and take the average
αα of the diagonal BEC components over 50 ps per atomic species. The dipole IR spectra are then calculated via eqn (17) via
.
The atomic structures used to train the models are available from https://doi.org/10.24435/materialscloud:fs-8h. In the same entry we also provide a chemiscope visualization for the validation set of the water clusters for the local gamma models containing the structures, screening values and learned pseudo charges.
Code availability: the trained models, training scripts, CP2K input files to generate the dataset, and the code to reproduce this study is available on GitHub at https://github.com/pstaerk/si_charge_learning_bec.
| This journal is © the Owner Societies 2026 |