Open Access Article
Venkat
Kapil
*a,
Dávid Péter
Kovács
b,
Gábor
Csányi
b and
Angelos
Michaelides
*a
aYusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK. E-mail: vk380@cam.ac.uk
bEngineering Laboratory, University of Cambridge, Cambridge, CB2 1PZ, UK
First published on 20th July 2023
Vibrational spectroscopy is a powerful approach to visualising interfacial phenomena. However, extracting structural and dynamical information from vibrational spectra is a challenge that requires first-principles simulations, including non-Condon and quantum nuclear effects. We address this challenge by developing a machine-learning enhanced first-principles framework to speed up predictive modelling of infrared, Raman, and sum-frequency generation spectra. Our approach uses machine learning potentials that encode quantum nuclear effects to generate quantum trajectories using simple molecular dynamics efficiently. In addition, we reformulate bulk and interfacial selection rules to express them unambiguously in terms of the derivatives of polarisation and polarisabilities of the whole system and predict these derivatives efficiently using fully-differentiable machine learning models of dielectric response tensors. We demonstrate our framework's performance by predicting the IR, Raman, and sum-frequency generation spectra of liquid water, ice and the water–air interface by achieving near quantitative agreement with experiments at nearly the same computational efficiency as pure classical methods. Finally, to aid the experimental discovery of new phases of nanoconfined water, we predict the temperature-dependent vibrational spectra of monolayer water across the solid-hexatic-liquid phases transition.
The state-of-the-art first-principles framework for modelling aqueous interfaces9 combines classical molecular dynamics (MD) and density-functional theory (DFT). These approaches incorporate electronic quantum effects in the potential energy surface (PES)10,11 and dielectric responses12 but cannot describe quantum nuclear effects such as zero-point fluctuations, quantum tunnelling and equilibrium/kinetic isotope effects.13 State-of-the-art IR and Raman predictions include on-the-fly predictions of the total polarization and polarizabilities on classical trajectories. The SFG spectra, which require much longer trajectories to converge, are calculated with a further approximation using a surface-sensitive velocity auto-correlation formula.14 This simplification requires shorter trajectories but neglects the rigorous environment dependence of dielectric response tensors (referred to as non-Condon effects). Ambient-temperature quantum nuclear dynamics can be incorporated with first-principles path-integral (PI) quantum dynamics15,16 and machine learning potentials (MLPs).17,18 However, the PI approach introduces additional computational overhead and provides only semi-quantitative accuracy due to artefacts from approximations to exact quantum dynamics.19 Moreover, the computational overhead and accuracy artefacts become more pronounced as the temperature decreases, systematically worsening accuracy.19 These limitations create a void in the current state-of-the-art for predicting vibrational spectroscopy for aqueous systems such as crystalline and amorphous ice phases and general aqueous interfaces of solid, liquid, and solution phases.
This work presents a new ML-driven framework that dramatically enhances the accuracy and efficiency of full quantum modelling (see Fig. 1) of various vibrational spectroscopic techniques, at finite thermodynamic conditions. Our framework exploits advances in ML for predicting first-principles PES
20 and dielectric response tensors,21–23 at a lower cost than direct calculations. It further incorporates new quantum statistical mechanics methods to map quantum nuclear dynamics to classical MD
24 on an effective PES at greater accuracy and lower computational cost than previous methods. Our approach involves a straightforward training step to fit an effective quantum MLP that includes quantum nuclear effects24 and a physics-based ML model of the systems dielectric responses.17 In the prediction stage, we generate quantum trajectories via classical MD on the quantum MLPs, and predict dielectric response properties or their derivatives to model IR, Raman and SFG spectra rigorously. The ML models of the quantum PES and dielectric responses transfer to other phases and thermodynamic conditions with high accuracy, opening the door to modelling a wide range of bulk and interfacial aqueous systems that were hithertho not possible.
![]() | ||
Fig. 1 Schematic illustration of the presented approach. The first step requires a single PIMD simulation at an elevated temperature to generate training data for a MLP that encodes quantum dynamical effects. We use BPNN 20 to perform first-principles-level PIMD simulations. The second step fits the quantum MLP using PI centroid positions and PI centroid forces within the PIGS protocol.24 This potential is transferable to different phases and thermodynamic conditions. The third step generates quantum trajectories by performing classical MD on the quantum MLP. And the fourth step, applied differentiable and equivariant MACE 26 models of dielectric response functions to generate a time series of the polarization and polarizability and their derivatives. Time correlation functions of dielectric responses (derivatives) yield different vibrational spectroscopy intensities. | ||
We first demonstrate our framework’s capabilities by predicting the IR, Raman and SFG spectra of bulk water and ice with a near-quantitative agreement with experiments, at a much lower computational cost than earlier first-principles studies. Finally, to showcase the potential of our approach, we predict the temperature-dependent vibrational spectra of monolayer water across the Kosterlitz–Thouless-like two-step melting process.25
, at inverse temperature β = (kBT)−1 is estimated in terms of the standard quantum time-correlation function27 (TCF)CAB(t) = Z−1tr{e−βĤÂeiĤt/ℏ e−iĤt/ℏ}, | (1) |
Eqn (1) can be understood simplistically as sampling nuclear positions from the system's Boltzmann distribution, performing quantum dynamics of the nuclear coordinates, and estimating the correlation of first-principles dielectric response time-derivatives: A ≡ A(q) on the initial and B ≡ B(q) on the time-evolved positions. The autocorrelation of the positions, i.e. C![[q with combining dot above]](https://www.rsc.org/images/entities/b_i_char_0071_0307.gif)
(t) gives the vibrational density of states without any selection rules, C![[small mu, Greek, dot above]](https://www.rsc.org/images/entities/b_i_char_e153.gif)
(t) and C![[small alpha, Greek, dot above]](https://www.rsc.org/images/entities/b_i_char_e14c.gif)
(t) relate to IR and Raman intensities respectively,29 and the cross-correlation C
a
bc(t) relates to the second order susceptibility
which determines the SFG intensity.30 A first-principles treatment of eqn (1) for aqueous systems typically requires three sequential steps: (1) selecting an electronic structure theory to model the PES and dielectric responses, (2) performing quantum nuclear dynamics on the PES to generate trajectories, and (3) calculating the intensities of vibrational spectra as TCFs of dielectric response functions. Unfortunately, each step presents unique challenges.
The high associated computational cost is the main challenge with accurately modelling a system's electronic structure. To this extent, DFT
10,11 is typically the method of choice, providing the best compromise between accuracy and computational cost. DFT avoids the high cost of calculating electronic wavefunctions by defining the ground state energy as a functional of the total electron density.10 While the exact functional remains unknown, approximate non-local functionals of the electron density can exhibit accuracy comparable to more sophisticated wavefunction-based methods.31 For instance, so-called meta-GGA32 or hybrid functionals33 with van der Waals (vdW) inclusions,34 are considered appropriate choices for correctly modelling the PES of bulk liquid water16,32 and its interface with a vacuum.35 Despite their relatively lower cost, DFT calculations are many orders of magnitude times costlier than empirical or first-principles forcefields,36–38 and therefore hinder study of large system sizes and long trajectories. Additionally, selecting an appropriate DFT functional for a new system can require stringent benchmarks,34 typically requiring comparisons with more-accurate correlated electronic structure methods39,40 and experiments.35
Once a DFT functional is selected, it can be straightforwardly combined with methods to time-evolve the nuclei on the PES. These techniques are typically MD for classical dynamics41 and PI approaches for quantum dynamics.42–45 Unfortunately, both approaches present accuracy limitations19 and have a high cost when combined with electronic structure calculations. For instance, due to the neglect of enhanced zero-point vibrational fluctuations46 and quantum tunnelling,47 classical MD does not fully sample the anharmonic regions of the O–H bond. PI methods, on the other hand, can correctly account for dynamical effects arising from thermodynamic quantum fluctuations.45 At ambient conditions, PI methods provide semi-quantitative accuracy in describing quantum nuclear motion at a modest overhead compared to classical approaches. Unfortunately, as the temperature is reduced, for instance, to study ice, PI methods become systematically expensive,48 and their artefacts lead to worsening accuracy.19 Both classical and PI trajectories can require around a million to a billion force evaluations for converged vibrational spectra (vide infra). Therefore, the total cost of generating trajectories with accurate electronic structure calculations can be very high even for ambient-temperature liquid water16 – arguably the simplest aqueous condensed phase system – and prohibitive for more complex systems.
Finally, evaluating vibrational spectra requires dielectric responses such as polarization and/or polarizabilities along the trajectories.29,30 These quantities can be generated from additional electronic structure calculations or on the fly during the simulation at an additional cost of a density functional perturbative calculation.49,50 Estimating the TCF in eqn (1) for IR and Raman spectra is straightforward, as their intensities are related to total dielectric responses. However, for SFG, which probes the water molecules at the interface, directly evaluating eqn (1) with the total first-principles polarization and polarizabilities of a water slab is problematic as it is a sum of two equal and opposite signals from its two interfaces.51 While this problem may be alleviated using molecular dielectric responses52 and by artificially mirroring the slab35 so that the SFG signals no longer cancel, the molecular identity of atoms is invalid for reactive systems or exhibits artefacts in general. These challenges highlight the need for a general and uniquely-defined expression of SFG intensity.
In this work, we use the Behler–Parrinello high-dimensional neural network (BPNN) framework57 for representing the PES of revPBE0-D3 bulk water and its interface with a vacuum. This scheme employs hand-crafted radial and angular symmetry functions for every atom type that encode up to three body correlations as inputs to a shallow feed-forward artificial neural network.58 The revPBE0-D3 DFT level describes the structure and dynamics of bulk water in excellent agreement with experiments16,59 and the revPBE0-D3+BPNN framework has been extensively used for thermodynamic and dynamical properties of bulk,17,59 interfacial,18,35 and nanoconfined water25 with classical and PI simulations.
We eliminate these shortcomings using the approach introduced in ref. 24 by some of us. This involves a temperature elevation (Te) ansatz for accurate TCF and PI coarse-grained simulations (PIGS) to map quantum nuclear dynamics to classical MD. The Teansatz alleviates the low-temperature artefacts by writing a TCF of a ground state system exactly in terms of its high-temperature (βe = (kBT)−1 with Te > T) limit,
| CAB(t;β) = β−1βeCAB(t;βe), | (2) |
![]() | (3) |
To evaluate eqn (3) at the cost of classical dynamics, we use the PIGS technique that applies a bottom-up coarse graining approach70 to calculate FCMD. We represent this function as the sum of the Born–Oppenheimer PES and a perturbation24 that we fit analogously to a standard MLP with a so-called “delta learning” scheme.71 We sample centroid positions and the difference between the centroid forces and the Born–Oppenheimer force on the centroid from a short equilibrium PIMD simulation at high temperatures, which converges with a few replicas. Using off-the-shelf MLP fitting protocols, we obtain an effective MLP that includes quantum dynamical effects by regressing the above differential force on centroid positions. Our training step24 requires a single 5–10 ps long efficiently sampled PIMD simulation with eight replicas at Te = 500 K, which is dramatically cheaper than typical PI simulations for converging IR and Raman vibrational spectra of liquid (few ns at 300 K with 32 replicas) and ice phases (few 100 ps at 150 K with 64 replicas).
17,74,75 and Raman spectra.17,18,22 A hurdle towards first-principles SFG modelling is that its computable expressions are written in terms of molecular polarization and polarizability tensors, which are ill-defined in first-principles modelling. Recent work in ref. 51 exploits that equivariant ML models predict polarization and polarizabilities as a sum of atomic contributions. This construct allows definitions of quasi molecular dielectric responses, which are sums of atomic components for the atoms in a molecule, as proxies for molecular polarizations and polarizabilities used to predict SFG intensities as has been done using forcefields.35 Unfortunately, these local molecular proxies are no longer straightforwardly or robustly defined for systems exhibiting reactivity or with localized charges.76
Here, we provide a new expression for modelling SFG spectra that does not invoke a molecular identity for atoms and is also applicable to non-molecular and reactive systems. We take inspiration from ref. 74 that models first-principles IR spectroscopy in terms of the spatial derivative of the polarization. We rewrite the expression for the SFG susceptibility in terms of the spatial derivatives of the total polarization and polarizability tensors, which is naturally a sum of atomic contributions without invoking molecular identity of atoms. Our expression is particularly suited for first-principles modelling as the modern theory of polarization12 rigorously defines derivatives of dielectric responses. In contrast, first-principles modelling does not uniquely define molecular dielectric responses used in forcefield studies.14,52 We first reformulate the definition of the dielectric time-derivatives of a mirrored slab in terms of dielectric derivatives,
![]() | (4) |
77 model, which combines equivariance with a rigorous many-body expansion of atomic properties, leading to improved accuracy, extrapolation and data-efficiency as well as favourable scaling with the number of chemical species compared to earlier ML approaches. In contrast to ref. 74 that trains on spacial dielectric derivatives, we train our ML model on total polarization μ and polarizabilities α. Thanks to the PyTorch implementation of MACE with automatic differentiation, we can predict their analytical derivatives in a highly-efficient manner.
1. Perform an efficiently sampled PIMD simulation using the MLP PES at Te = 500 K and fit FCMD(q;βe) to the centroid forces using the BPNN framework. At this temperature, 8 replicas are sufficient to converge quantum nuclear effects.
2. Perform a 100 ps NVT simulation on FCMD(q;βe), efficiently thermostatted with a generalized Langevin equation thermostat, to sample 10 initial positions and momenta.
3. Perform a 10 × 100 ps NVE simulation on FCMD(q;βe), sampling positions every 2.0 fs.
4. Predict
,
on the NVE trajectories using the MACE models to calculate IR or Raman spectra, or
M,
M for the SFG spectrum.
5. Calculate TCFs of position for the vibrational density of states and dielectric time derivatives for IR, Raman, or SFG spectra.
We note that, even with the most accurate trajectory-based method, Matsubara dynamics, the predicted O–H stretching band is blue-shifted by 22 cm−1 and slightly broadened w.r.t. to the exact quantum mechanical result – due to quantum coherence being neglected in classical dynamics.67 Therefore, at best, PI methods will be slightly blue-shifted and slightly broadened with respect to the experimental results. Any closer agreement will indicate error cancellation. For an apples-to-apples comparison with our predictions, we shift the experimental O–H stretching band to the blue by 22 cm−1.
We model liquid water using a simulation cell from ref. 16–18 containing 64 molecules at experimental density in the NVT ensemble, Ih using a proton disordered simulation cell from ref. 17 containing 96 molecules at experimental density in the NVT ensemble, a slab of 160 molecules in vacuum from ref. 14 in the NVT ensemble, and monolayer water simulation cells containing 144 molecules from ref. 25 in the NPT ensemble at 1 GPa and temperatures 200 to 600 K.
To incorporate quantum nuclear effects for dynamics in MLPs, we use the BPNN framework with the same setup as in ref. 58 for modelling the PES of bulk water and the n2p2 code
80 for performing the training.
81 software. Following ref. 78, we use ML models trained on bulk water to predict the dielectric response tensors for the water–air interface. The success of this extrapolation can be understood in terms of the near-sightedness of aqueous dielectric response derivatives.82
83 software using the LAMMPS84 code as the force-provider. Classical and path-integral MD simulations are performed with a 0.50 fs timestep using the BAOAB
85 integration scheme extended to path-integral MD.86 Classical NVT simulations use the optimal sampling generalized Langevin equation thermostat, while the PIMD simulations use the path-integral Langevin equation thermostats with a time-constant of 100
fs.
89 and anisotropic Raman spectra88 of liquid water display a broad band at around 3400 cm−1, while the experimental isotropic Raman spectrum90 contains a doublet around 3400 cm−1 with a shoulder at 3700 cm−1.
![]() | ||
| Fig. 2 IR and Raman stretching band of liquid water at 300 K with the Te PIGS (red), MD (pink), TRPMD (blue), and CMD (green) and the experimental spectra87,88 (shifted by 22 cm−1 to account for the missing quantum coherence effects in simulation methods) in grey. All simulations are performed with a BPNN with SPC charges trained at the revPBE0-D3 level of theory. | ||
We predict these spectra using the workflow in Section 3. As shown in Fig. 2, the Te PIGS scheme gives an excellent description of the line position of the IR and the anisotropic Raman spectra in near quantitative agreement with experiments. The only disagreement, i.e. the slight broadening of peaks, is consistent with the limitations of all classical dynamics approaches as they neglect quantum coherence.67 Our approach also correctly describes the overall isotropic Raman band, including the high-frequency shoulder but does not recover the doublet.
We now compare our results with current simulation methods used to simulate the IR or Raman spectrum of water. Classical dynamics, which neglects quantum nuclear motion, is blue shifted as it suppresses sampling the anharmonic regions of the O–H bond softened due to hydrogen bonding. Typically, the frequency scale of classical results is rescaled by ∼0.96 to match the experiments or is performed with GGA DFT functionals that soften the O–H bond91 and fortuitously agree with experiments.16 State of the art for including quantum nuclear motion is partially-adiabatic61 CMD and thermostatted44 RPMD. Therefore, these methods require full PI simulations and are at least two orders of magnitude more expensive (see Fig. 6). We observe that these methods reproduce band positions and shapes with semi-quantitative accuracy in agreement with results of previous studies16–18 although their artefacts are visible at ambient conditions. The CMD spectrum is slightly red-shifted due to the onset of the curvature problem, and the TRPMD spectrum is slightly blue-shifted and significantly broadened. Comparing our classical MD predictions with previous first-principles16 and MLP classical simulations,88 at the revPBE0-D3 level of theory, which also does not recover a doublet in the isotropic Raman spectrum, we conclude that it is a shortcoming of the PES. To our knowledge, this is the first instance of first-principles modelling with a quantum description of both electrons and nuclei displaying this level of agreement for the O–H stretching band with the three experiments.
![]() | ||
| Fig. 3 IR and Raman stretching band of polycrystalline ice at 150 K with the Te PIGS (red), MD (pink), TRPMD (blue), and CMD (green) and the experimental spectra92 (shifted by 22 cm−1 to account for the missing quantum coherence effects in simulation methods) in grey. All simulations are performed with a BPNN with SPC charges trained at the revPBE0-D3 level of theory. | ||
As shown in Fig. 3, incorporating quantum nuclear dynamics with the Te PIGS technique, our predicted IR and Raman spectra are in excellent agreement with the experiment and have a systematic improvement over the classical and the state-of-the-art quantum spectra. We predicted IR and Raman recover all known line positions in quantitative agreement with the experiments. However, residual errors exist in the relative intensities and width of the peaks, particularly in the IR spectrum. These are likely due to insufficient sampling of proton disorder in our simulation cell or the level of DFT for the PES and the dielectric responses. To put into perspective the difficulty in simulating quantum nuclear effects in vibrational spectra at low temperatures, we also compare predictions of state-of-the-art methods, which are severely shifted or featureless and over three orders of magnitude more expensive (see Fig. 6).
35 and MB-pol
52 and more recently with MLPs,18,51 with less accurate quantum dynamics methods.
![]() | ||
| Fig. 4 SFG stretching band of water–air interface at 300 K calculated with mirrored dielectric derivatives (eqn (4)) and the surface-sensitive velocity–velocity autocorrelation function (ssVVACF) using the Te PIGS (red), MD (pink), TRPMD (blue), and CMD (green) and the experimental spectra94 (shifted by 22 cm−1 to account for the missing quantum coherence effects in simulation methods) in grey. All simulations are performed with a BPNN with SPC charges trained at the revPBE0-D3 level of theory. | ||
As shown in Fig. 4, by incorporating quantum nuclear effects with the Te PIGS approach and calculating the SFG susceptibility with a surface-sensitive velocity autocorrelation function,14 we can recover the line positions of the positive and negative lobes of the SFG in quantitative agreement with the experiment. The only remaining disagreement, i.e. the missing shoulder, is an artefact of approximating the environment dependence of the dielectric responses via the velocity autocorrelation. Incorporating first-principles polarizations and polarizabilities with eqn (4), we can estimate the SFG of one of the interfaces of a water slab in a vacuum. By incorporating quantum nuclear effects, we describe very well the line positions of the main dangling bond peak and its shoulder arising from intermolecular coupling. We also reproduce the negative lobe, i.e. the bonded part of the O–H stretching mode, within the statistical errors. Our predictions also systematically improve over TRPMD and CMD, which are much more computationally demanding (see Fig. 6) but also display broad and shifted peaks.
As shown in Fig. 5, the temperature-dependent changes in the character of the O–H stretching band clearly show a first-order phase transition between the flat-rhombic and hexatic phase and provide support for a continuous phase transition between the hexatic and the liquid phase. At 200 K, we observe the flat-rhombic phase with two bands in the stretching region, corresponding to the in-plane zigzagging hydrogen bonding O–H bond and the dangling O–H bond. The doublet nature of these bands arises from coherent O–H vibrations. At 300 K, the doublet of the peaks vanishes due to the thermalized homogeneous broadening of the peaks, but the presence of two distinct bands confirms the absence of molecular rotation. At 300 K, we see evidence for a hexatic phase of water with a single band with a doublet feature, suggesting that water molecules are free to rotate between the 6-fold symmetric bonded and the dangling, configurations. From 400 K to 600 K, we observe a gradual emergence of a single broad spectrum indicating an isotropic liquid. We hope these spectroscopic features will help confirm the existence of the flat-rhombic, hexatic, and liquid phases of monolayer water.
![]() | ||
| Fig. 5 Temperature-dependent vibrational density of states of monolayer water25 across the solid–hexatic–liquid phase transition at a lateral pressure of 1 GPa calculated using the Te PIGS method. The spectra are colour-coded according to temperature. The top panels (border colours indicating temperatures) correspond to snapshots of monolayer water structures. All simulations are performed with a BPNN trained at the revPBE0-D3 level of theory. | ||
Although we focused on accurate modelling by incorporating electronic and quantum nuclear effects, our framework is computationally less demanding than other approaches. As shown in Fig. 6, which illustrates the total number of force evaluations and dielectric response calculations needed for the systems studied, our approach is many orders of magnitude cheaper than a full first-principles method, and with respect to previous approaches that combined MLPs and ML dielectric models with commonly used quantum nuclear dynamics methods like TRPMD and CMD. Barring the small overhead of training the Te-FES, our framework for full quantum vibrational spectroscopy has approximately the same cost as for classical spectra, even at low temperatures where standard methods have a divergingly high computational cost. Therefore, our framework provides predictive accuracy and an efficient and practical approach towards characterising complex aqueous interfaces. We believe the developments presented in this paper will simplify the modelling of full quantum spectra of generic systems and provide the methodology to accurately study open questions in interfaces such as proton disorder at the surface of ice, the quasi-liquid layer, pH of the surface of water, and reactivity at aqueous interfaces.
| This journal is © The Royal Society of Chemistry 2024 |