Open Access Article
Margarethe Bödecker
,
Eaindra Lwin
,
Finn Knüppe
,
Noah O. Evers
,
Elisabeth Sennert
,
Moritz Niessner
,
Nils O. B. Lüttschwager
and
Martin A. Suhm
*
Institut für Physikalische Chemie, Universität Göttingen, Tammannstr. 6, 37077 Göttingen, Germany. E-mail: msuhm@uni-goettingen.de
First published on 1st June 2026
HyDRA blind challenges aim at unbiased forecasts of water vibrations in vacuum-isolated hydrate complexes of organic molecules by different computational protocols. In this work, directly after the completion of the HyDRA II challenge and ignorant of its computational results, the experimental findings for the 9 new solvates with a total of 12 hydrogen-bonded water OH stretching wavenumbers are disclosed and discussed. We further summarise and analyse the 70 entries of the training database, which were made available at the start of the challenge. Monohydrates of 4 other previously uncharacterised solvates are added to support the assignment of the blind test entries and to widen the future training database. The compounds for which gas phase monohydrate OH stretching fundamentals are reported for the first time comprise a difluorobenzonitrile, a difluorobenzoic acid, eucalyptol, an iodo-methyl-pyrazole, a methoxypyridine, Hünig's base, a pyrrolidine ethanol, TEMPONE, diethylhydroxylamine, pyrazine, pyrimidine, a triazine and a chiral chloropropionic acid. An experimental bridging approach (Bridging Experiments by Theory, or in short BEsT guess) using inexpensive harmonic density functional theory predictions from the training database for educated guesses of unknown vibrational wavenumbers is proposed to assist the future evaluation of the blind challenge. Proposals on how to compare computational predictions to experiment depending on the employed level of anharmonic perturbation are made. It will be interesting to see how different computational protocols perform in the wide range between outstanding predictions (root-mean-square deviation <10 cm−1) and placebo-quality deviations (>100 cm−1) for the 12 disclosed wavenumbers.
While electronic spectroscopy exhibits the stronger quantisation, it has been recognised from the start of modern quantum mechanics that suitable approximations for the quantum nature of nuclear motion beyond the harmonic oscillator approximation are essential as well,5 when comparing to experimental observables. This is clearly the case when hydrogen nuclei are involved. Usually, the true bottleneck for reliable predictions is not the Born–Oppenheimer or Born–Huang approximation for the separation of electronic and nuclear motion.6 Instead, it is the combination of approximations within the electronic and nuclear parts, which may sometimes create the illusion of an excellent performance (see SI chapter 1 for an example) when more or less systematic cancellation between or within electronic and nuclear approximation errors is actually in place.7,8 Systematic error cancellation can be very useful for the practical spectroscopist, but it is easily lost when extrapolating too far from the training base. Therefore, the training database for spectroscopic observables needs to be periodically extended, to reduce extrapolation in favour of interpolation and to identify the most powerful quantum-chemical approximations available.
A field of hydrogen-probing vibrational spectroscopy with particular relevance for the persistent challenge of explicit solvation modelling9 is microhydration of organic molecules in the gas phase.10–12 HyDRA (Hydrate Donor Redshift Anticipation) is a series of double-blind challenges and database extensions in this field.13,14 Cold, isolated mono- and dihydrates of organic molecules are generated in supersonic jet expansions and their intense, hydrogen bond-sensitive water OH stretching (OHb) transitions are recorded by infrared or Raman spectroscopy.15–17 To predict the transition wavenumbers (and equivalently the down- or redshift relative to the symmetric stretching fundamental of isolated water molecules,4 experimentally observed at 3657 cm−1), one may combine any electronic structure treatment of the complex with an appropriate model of the vibrational dynamics or one may use some kind of empirical training and interpolation protocol.12
Hydrogen-bonded water OH stretching states in microhydrate complexes are often nicely isolated from other fundamental vibrations, but sometimes they come close to one18 or more19 combination states which involve several quanta in other motions of the water molecule, as sketched in Fig. 1. These local perturber (lp) states typically carry negligible infrared and Raman intensity on their own (lp0), but if they share the same symmetry and some coupling with the water OHb stretch due to the large amplitude motion of the solvating water unit, they can borrow intensity and stretching character by wavefunction mixing, thus reducing the purity or intensity of the bright transition to a fraction P < 1 of its intrinsic intensity in the raw spectrum (see Fig. 1). The deperturbed location of the OHb transition (OHb0) is then close to the intensity centroid of the coupled modes involved in the intensity sharing.18 Any computational model which averages over such local couplings is therefore advised to compare its predictions to the OHb0 centroid or deperturbed (dep) band position (central trace in Fig. 1), whereas fully anharmonic coupling models can directly address the dominant OHb transition and its purity P in the raw spectrum (upper trace in Fig. 1).
By far the strongest and longest-range coupling expected for OHb is a Fermi resonance with the 2-quantum water bending overtone17 b2 of the same water molecule (generalised to gp for global perturber in Fig. 1). Basically, its influence is always there, even if it may not be as apparent in the experimental spectrum as a local resonance due to the large spectral separation and the weak intensity transfer. This low-order coupling between OHb and b2 is usually captured to some extent by second order vibrational perturbation theory approaches and their variants.20 Therefore, such leading order perturbation approaches as well as restricted variational treatments of this Fermi resonance are advised to compare their results to the locally deperturbed (dep) experimental values as well, unless they also include an explicit treatment of any local perturbers.
This is not the case for the popular scaled harmonic calculations, where the harmonic prediction ω is multiplied by some scaling factor σ due to a mix of electronic structure deficiencies and neglected anharmonic contributions, to better match experiment. Such a scaled harmonic approach is blind to any resonances, be they local (lp) or more global (gp). Therefore, uniformly scaled harmonic approaches are expected to deviate systematically whenever OHb and b2 approach each other due to strong hydrogen bonding and the resulting Fermi resonance shifts become pronounced. In this range of hydrogen bond strengths, scaled harmonic approaches trained on weaker hydrogen bonds are thus expected to be somewhat off for the right reason, when compared to either raw or locally deperturbed (dep) reference values. If OHb is observed with a wavenumber above b2, they are likely to underestimate this perturbed OHb wavenumber. If it is below b2, they are expected to overestimate the actual OHb wavenumber. By assuming an upper limit of the Fermi resonance coupling strength (W ≈ 60 cm−1, as for concerted bending overtones in cyclic water clusters17), we encode the worst case effect of this global perturbation into one side of the experimental uncertainty estimate, even if the b2 perturber remains hidden in the noise, perhaps due to excessive lifetime broadening (see Fig. 1), perhaps due to interference with intrinsic bending overtone intensity. This asymmetric extended uncertainty is thus a service to scaled harmonic approaches in the range of strong hydrogen bonding. Because the asymmetric uncertainty only applies to cases where the gp (b2) is too weak or broad to be explicitly included into an experimental deperturbation, one can drop it in the case of amine monohydrates, where the hydrogen bonds are of the right strength to redistribute significant intensity from OHb to both lp and gp (b2) states.
Are scaled harmonic approaches useful at all in the field of hydrogen bonding? For weaker hydrogen bonds, they have been quite successful by exploiting the partial cancellation of anharmonic effects, as demonstrated in the first HyDRA blind challenge.13 By looking at the shift between water monomer and complexed water instead of absolute wavenumber predictions and by adding this shift to the experimental monomer value, they circumvent the need for explicit anharmonicity along the OH bond, but they are blind to the strong b2 Fermi resonance and are thus expected to perform poorly for OHb in the neighbourhood of b2. Our proposed global deperturbation analysis (gdp) alerts to this by quoting extended uncertainties. It cannot fix the perturbation and therefore does not change the central complex OHb0 wavenumbers from the local deperturbation approach (dep), because the Fermi coupling constant remains experimentally undetermined at a distance. If, however, the Fermi resonance between OHb and b2 is already part of the local resonance analysis within the dark state approximation, such as in tertiary amine monohydrates,19 there is no need to increase the one-sided uncertainty estimate and the (dep) and (gdp) approaches are essentially identical, apart from a subtle effect on the monomer reference, which slightly feels the effect of the distant b2 state through a reduced Fermi resonance coupling constant (see SI section 2.5).
In the first HyDRA challenge, the scope was relatively narrow (10 monohydrates in the test set with only weak to medium strength hydrogen bonds) and several different strategies were quite successful. With the current HyDRA II extension to stronger hydrogen bonds and cooperative double hydration, it should become evident which approaches remain successful, which ones can be improved and which novel approaches might emerge. For this purpose, the training database was substantially extended from 10 to 70 curated experimental OHb stretching wavenumbers and from a wavenumber spread of about 200 cm−1 to more than 600 cm−1. The new test set consists of 7 monohydrate and 3 dihydrate wavenumbers. It is complemented by a bonus set of 2 monohydrates, for which the experimental assignment was particularly challenging due to either open-shell character and overlapping resonances or a missing training counterpart.
The core test set includes five nitrogen acceptors with widely different hydrogen bond acceptor strength: an aromatic nitrile (DFB), an NH-protected and iodinated pyrazole (IMP), a methoxypyridine (MOP), the bulky tertiary amine known as Hünig's base (DPA) and an amino alcohol21 (PYE). In addition to monohydrates, dihydrates are characterised for Hünig's base (only the indirectly bound water, because the directly bound water overlaps with the CH stretching region) and for the pyrazole derivative (both water molecules). The pyrazole derivative is relatively far from any training set members, in particular when it binds two water molecules. Two oxygen acceptors, a benzoic acid derivative (FBA) and a bulky ether (EUC), are more similar in their bond strength towards a single water molecule, but differ in the number of hydrogen bonds and thus cooperativity. The two bonus monohydrates involve a bifunctional nitroxyl radical22 with closely competing binding sites (OTM) and a hydroxylamine (DHA), for which no chemically similar training set member is available. The monohydrate of OTM shows evidence for a local resonance, perhaps in both conformations. DPA has conformational isomerism which may depend on hydration but is not critical for the position of the OHb fundamental, which shares intensity with 2–3 perturber states like other tertiary amines. For FBA, a weak local resonance cannot be ruled out but this possibility is absorbed in the uncertainty of the quoted band position. Table 1 and Fig. 2 present the experimental results for the test and bonus sets in a compact form.
For our experimental and assignment strategies, we largely refer to the SI (chapters 2, 4–6). Table 1 offers three sets of reference values of OH stretching fundamental wavenumbers (raw, dep, gdp) which different theory levels may want to address, as discussed above. The (pre-)publication of this table in the present work ends the HyDRA II blind challenge period for theory participants and it forms the basis of later joint and individual evaluations of the performance of the proposed computational models.
To further extend the HyDRA database beyond the existing 70 wavenumbers and the 12 new wavenumbers of HyDRA II, we also present the spectra for 4 closely related systems (Table 2). These extra systems were studied to secure some HyDRA II assignments, based on their chemical similarity and expected analogies in the spectra. For that purpose, they can be bundled with related test and training systems and their experimental trends are compared to harmonic B3LYP calculations. Parallel trends across the systems between harmonic calculation and raw as well as dep(erturbed) experimental values help to confirm smaller deperturbations due to local resonances. Details are provided in chapter 6 of the SI. As an example, Fig. 3 shows the effects of what we call a b2lib deperturbation18 on five monohydrates for related N-heterocycles. Their OHb transitions fall in a narrow range of less than 100 cm−1, where the combination band between the water bending overtone (b2) and a librational (lib) motion of the hydrogen-bonded water against the acceptor (b2lib) is also predicted. If the level coincidence is comparable to the anharmonic coupling constant (about 10 cm−1), the lp scenario described in Fig. 1 comes into play. No resonance with this potential perturber state is observed for the training member pyridine, the test set member 2-methoxypyridine, or 1,3,5-triazine as an extra system. In each case, a single OHb band is correspondingly found. However, the two extra systems pyrazine23 and pyrimidine show characteristic spectral splittings, likely due to the resonating b2lib state. In the case of pyrimidine,24 the evidence is less clear and only compatible with a very broad (short-lived) local perturber, almost hidden within the noise (SI, chapter 6). It is rewarding to see that the experimental trends after deperturbation (orange) are even better aligned with the scaled harmonic predictions than the trends for the dominant raw transitions (grey). It is further rewarding to observe this resonance, which had previously been observed mainly for ketone,18 nitroxyl,22 and tentatively also carboxylic acid25 monohydrates, at a similar wavenumber and with a similar coupling strength for the chemically different N-heterocycles. To further underscore the resonance interpretation, we compare to a Raman spectrum (SI, chapter 6), which must look basically the same, if the intensity pattern results from wavefunction mixing instead of electrical transition moments. As expected, this kind of bend-librational resonance in hydrogen-bonded water is thus quite universal, whenever the water OHb stretching vibration comes close to 3500 cm−1 and a librational mode has the right force constant for bridging the gap to the bending overtone b2 of the same water molecule. It is remarkable that a higher order anharmonic resonance in a solvating water molecule with a coupling constant on the order of 10 cm−1 is so robust across the chemical space, given the wide variation in hydrogen bond anisotropy for nitroxyl-, carbonyl- and N-heterocyclic acceptors. Perhaps, the HyDRA II challenge will motivate theoretical spectroscopists to rigorously analyse this regularity beyond our simplistic effective coupling models. In the first HyDRA challenge, none of the theoretical approaches did address this higher order resonance explicitly, although it was part of the training set and cannot be ruled out in some test set members.
![]() | ||
| Fig. 3 Trends in N-heterocycle monohydrate OHb wavenumbers from the HyDRA database14 (PYR) and from this work (TRZ, PRZ, PYM, MOP). Scaled harmonic wavenumbers (B3LYP-D3(BJ,abc)/def2-TZVP, σ = 0.98) are shown in blue, raw experimental wavenumbers are shown in grey and deperturbed experimental wavenumbers are shown in orange. For each system, the jet-spectrum is shown in grey and the assigned bands are filled. After deperturbation (coupling matrix elements Wij in grey), the experimental wavenumber trend is even better reproduced by theory than before. | ||
While we are looking forward to the most powerful computational models which are able to truly predict the HyDRA II experimental results, it makes sense to contemplate on alternative predictive methods for OHb stretching wavenumbers which are available to experimentalists who only have routine computational models at their disposal, typically based on density functional theory (DFT). The idea is to use some inexpensive but reasonably complete density functional, likely including London dispersion correction, to make harmonic predictions of the OHb stretching fundamental in the relevant complexes, and to transfer the required harmonic scaling factor(s) σ to match experiment, from one or more known training complexes to an unassigned test complex. The hope is that sufficiently similar training systems will have similar electronic structure and anharmonic errors, which may partially cancel in the difference and lead to robust forecasts. We call this family of interpolations BEsT guesses (schematically depicted in Fig. 4), where BEsT stands for Bridging Experiments by Theory. Note that they are not blind in the present case (although efforts were made to ignore the experimental test system wavenumbers when deciding for an interpolation sampling) and thus should not be compared in their performance to the future results of the HyDRA II blind challenge. BEsT guesses may include pitfalls such as incorrect predictions of the relevant global minimum structure of the complex by the inexpensive density functional or a lack of sufficiently similar compounds in the training database, but they are straightforward to obtain. BEsT guesses are experimentalist's substitutes as long as theory is not powerful enough to provide unbiased accurate predictions on a regular basis.
We explore two different BEsT guess distance metrics in the present work, using the standard harmonic B3LYP-D3(BJ,abc)/def2-TZVP model as a bridging tool. A qualitative metric is chemical similarity and a quantitative metric is harmonic wavenumber proximity, both referring to the database of 70 training wavenumbers. The starting point is to use all 70 experimental wavenumbers to obtain the average scaling factor (
) for the best overall match as well as the RMSD (root-mean-square deviation) between the uniformly scaled computed harmonic values and their experimental counterparts. This training-RMSD is a measure for the average prediction quality of the chosen scaled harmonic approach. The average scaling factor can now be applied to harmonic predictions for the full set or a partial set of blind test candidates and provides a test-RMSD in comparison to the test set experiment. This test-RMSD should be relatively close to the training-RMSD, if the training and test sets are sufficiently similar. In our case, the training- (based on 70 samples) and test-RMSD (based on 12 samples) values amount to 23.5 cm−1 and 26.1 cm−1, respectively. They are not as poor as one might have expected, given that they correspond to the performance of a single-parameter linear fit of the harmonic model to the experimental training data only.
To further improve the test-RMSD, one can now remove training set members from the procedure which are distant from the test set in terms of either chemical similarity or computed wavenumber proximity. This can be done on an individual basis for each test system, leaving only the 1–3 most related training members (A, B, C), either based on chemical similarity or based on harmonic wavenumber proximity. This small individual set of most similar training systems for a given test system is used to determine an optimised scaling factor which is then transferred to the test system. Instead of using individual matches for each test system, one can also consider the average scaling factor for the collective set of individual matches to predict all 12 hydrate wavenumbers at the same time. Depending on the diversity of the test set, this will typically result in a poorer RMSD than individual matches. This formalised procedure mimics the intuitive prediction strategy of an experimentalist, who uses a moderate quality theory model to relay pre-existing spectroscopic information from known to new systems.
Fig. 5 gives examples of such BEsT guess RMSD values for different test sets (for a more detailed analysis see chapter 7 in the SI). If one considers all 12 entries of the present blind test, there is no substantial advantage in restricting the BEsT guesses to the chemically or spectrally individually closest three training members, rather than using the merged set of training systems. The training set is apparently not close enough to the test set, such that individual outliers dominate the test-RMSD. This is confirmed by analysing the consequence of leaving away the three largest outliers in the test set. Now, there is a significant improvement in particular when the BEsT guess is performed on the individually closest three training members, down to a test-RMSD of 6 cm−1. If one alternatively restricts the BEsT guess to the 6 monohydrates with a single OHb fundamental and leaves out the cooperative systems, the predictions also get better. Chemical similarity and even more so wavenumber proximity again reduce the RMSD significantly, if one restricts the interpolation to the 3 most similar training members, down to a test-RMSD of 6 cm−1. When interpolating all 6 non-cooperative monohydrates with the same scaling factor (obtained from the merged training subset), the RMSD values are significantly larger, but still much better than for the full set of 12 mono- and dihydrate wavenumbers. Harmonic DFT OHb wavenumbers apparently cannot describe in a satisfactory way mono- and dihydrates, systems with OHb wavenumbers higher and lower than b2 or strong and weak hydrogen bonds at the same time.
The present work discloses and describes the experimental results which were largely collected in 2025 for the HyDRA II blind challenge, about one Olympiad after HyDRA I,4 together with an analysis and exploitation of the training database14 published in mid-2025. The raw spectral wavenumbers reported in this work represent our best possible reference for accurate theoretical models, but we make efforts to invite more approximate models by deperturbing for anharmonic resonances. Fig. 6 shows that the presented (deperturbed) core test (10), bonus test (2) and extra (4) wavenumbers are enclosed by the training base (70), together spanning about 80% of the previously outlined training range. Of particular interest will be the computational model performance for cooperative dihydrates, for swapped OHb-b2 wavenumber sequences and for the hydroxylamine case (with no functional group training available). Our BEsT guess analysis indicates that there is potential for models with RMSD values around 10 cm−1 or even below, at least for monohydrates of acceptors without their own OH group which could lead to harmonic mode mixing. Such a < 2% prediction performance relative to the span of red shifts would indeed be impressive for the combined challenge of getting the electronic structure right and capturing the various anharmonic effects. It would allow for a new era of vibrational spectroscopy in the field of microsolvation, where theory can start to replace selected spectroscopic experiments rather than to only assist their assignment and interpretation. On the other hand, the average of the 12 blind test wavenumbers of HyDRA II is 3431 cm−1 (compared to 3469 cm−1 for the 70 available training wavenumbers) and their RMSD from this average is 132 cm−1 (compared to 135 cm−1 for the 70 available training wavenumbers). Therefore, computational models which have an RMSD larger than about 100 cm−1 in comparison to the 12 experimental test set wavenumbers do not perform significantly better than an optimised null hypothesis (or kind of placebo) which assumes that all OHb stretching wavenumbers are the same, independent of the type of solvate molecule or number of solvent units. This RMSD threshold can be straightforwardly adapted to computational submissions which only cover one of the ≈4 × 103 allowed distinct subsets of 3-11 wavenumbers.
![]() | ||
| Fig. 6 OHb wavenumber distribution (dep) of the current HyDRA database14 (blue) together with the 16 new entries from this work. HyDRA II test (10) and bonus (2) set wavenumbers are shown in orange, 4 further additions are shown in grey. This figure illustrates that the HyDRA II test and bonus sets cover a broad wavenumber range (513 cm−1) which is still enclosed by the range of the full training set. | ||
Independent of the performance achieved by the participating computational models in HyDRA II and in the spirit of its mythological namesake,26 we anticipate two important directions on the experimental side, given the relative scarcity of experimental references for uncharged systems in this field.12 While most of the current HyDRA II database rests on spectra obtained in our Göttingen laboratory (see SI, chapter 3), the community could now make an effort to contribute carefully curated results from other laboratories. This will also help to flag potentially erroneous assignments.27,28 To our knowledge, such databases already exist for isolated molecules29 and to some extent also for a few hydrates.30 Notifications to the corresponding author are welcome and a submission form will be provided. Furthermore, efforts to extend beyond the localised high frequency OHb oscillator towards more global spectroscopic probes such as the hydrogen bond librations31 in isolated complexes should be encouraged despite their associated experimental challenges. Perhaps, (semi-)quantum matrix isolation work is sufficiently close to the gas phase32 to help in filling this gap. These librations also play a key role in understanding local resonances in the OHb stretching range, as we confirm in the present work. Therefore, quantitatively predicting the combined OH stretch/libration dynamics in microhydration is a rewarding goal.
The raw experimental spectra as well as ORCA output files are made available in GRO.data at https://doi.org/10.25625/G27IJZ. The results will be included into the HyDRA database at https://qmbench.net.
| This journal is © the Owner Societies 2026 |