 Open Access Article
 Open Access Article
      
        
          
            Beppo 
            Hartwig
          
        
       and 
      
        
          
            Martin A. 
            Suhm
 and 
      
        
          
            Martin A. 
            Suhm
          
        
       *
*
      
Institut für Physikalische Chemie, Tammannstr. 6, Göttingen, Germany. E-mail: msuhm@gwdg.de
    
First published on 23rd September 2021
The theoretical description of spectral signatures for weakly bound hydrogen contacts between alcohol groups is challenging and remains poorly characterised. By combining Raman jet spectroscopy with appropriately scaled harmonic DFT predictions and relaxation path analyses for 16 vicinal diols (ethylene glycol (ethane-1,2-diol), propane-1,2-diol, 3,3,3-trifluoro-propane-1,2-diol, rac-butane-2,3-diol, 2-methyl-propane-1,2-diol, 2-methyl-butane-2,3-diol, pinacol (2,3-dimethyl-butane-2,3-diol), 3-butene-1,2-diol, 1-phenyl-ethane-1,2-diol, trans-cyclobutane-1,2-diol, trans-cyclopentane-1,2-diol, trans-cyclohexane-1,2-diol, trans-cycloheptane-1,2-diol, cis-cyclohexane-1,2-diol, 1-(1-hydroxy-1-methylethyl)-cyclopentanol and [1,1′-bicyclopentyl]-1,1′-diol), 69 conformational assignments become possible in a two-tier approach with a 5 diol training and an 11 diol test set. The latter reveals systematic deviations for ring strain and secondary π interactions, but otherwise a remarkably robust correction and correlation model based on hybrid DFT with a minimally augmented triple-zeta basis set is obtained, whereas GGA functionals perform significantly worse. Raw experimental data in the 3560–3700 cm−1 wavenumber range as well as computed geometries of all conformations invite further vibrational and structural benchmarking at the onset of hydrogen bonding. Beyond this diol-probed threshold, the accurate prediction of hydrogen bond induced shifts of different magnitudes remains one of the challenges for DFT functionals.
Several vicinal diols have been structurally characterised by microwave spectroscopy. Examples include ethane-1,2-diol,14,15 propane-1,2-diol,16,17 butane-1,2-diol,18rac-butane-2,3-diol,19meso-butane-2,3-diol19 and catechol (benzene-1,2-diol).20,21 However, large amplitude motions which result in a tunneling splitting sometimes make the analysis of the symmetrically substituted aliphatic systems difficult.19,22–24 Vibrational spectroscopy provides a lower resolution and thus initially simpler perspective, once a systematic comparison between experiment and theory is established. This also extends to the calculation of zero point vibrational energies to properly predict the energetic ranking of conformers.
Most of the systems presented here have been previously investigated in solution10,25–44 or matrices32,35,45–47via IR spectroscopy. However, the resulting vibrational frequencies are less suited as reference data for quantum chemistry. Although the simulation of environmental effects is of much interest by itself, the isolated gas phase provides a better entry point for theory. This is particularly true for the low rotational temperatures accessible by jet spectroscopy. In contrast to low temperature matrix isolation studies,35,47 the jet expansion conserves more population of higher conformations and therefore provides richer access to the conformational landscape of these diols, besides avoiding matrix shifts.
Two diols, ethane-1,2-diol and trans-cyclohexane-1,2-diol, have already been vibrationally characterized by us in detail and lend themselves to an initial exploration, because they consist of only two conformations separated by a low barrier.13 They are complemented by the more heavily substituted pinacol (2,3-dimethyl-butane-2,3-diol) in the same conformational complexity class, such that primary, secondary and tertiary alcohol environments are represented. Mixed substitution patterns are provided by propanediol and a methyl-butanediol. After its characterization by experiment and theory and unambiguous assignment, this set of 5 training systems is used to establish linear reference correlations between theory and experiment. These correlations are tested step-by-step by investigating 11 less symmetric, unsaturated, cyclic and fluorinated vicinal diols, thus introducing secondary interactions and environments which conserve or affect the systematic correlation between theoretical (harmonic) and experimental (anharmonic) OH stretching wavenumbers.
In this work, we show that different density functionals can be used in the harmonic approximation to assign and correlate experimentally observed OH stretching fingerprints across a number of populated conformations from a database of vicinal diols. Only one of these functionals yields fitting parameters which look physically reasonable, but even in that case, it remains to be explored by higher level electronic structure and vibrational dynamics calculations whether the description of these weakly hydrogen-bonded and loosely coupled OH groups is accurate for the right reason or whether different errors cancel in a systematic way. For this purpose, the present benchmark database is made available to the theoretical chemistry community. In our opinion, only theoretical models which are able to describe this comprehensive dataset of vicinal diols sufficiently well should be considered for a reliable future modelling of carbohydrate vibrational spectra.
Conformational designators are ′ (prime), * and t specifying the OH–OH hydrogen bond arrangement or lack thereof. If no prime is added it refers to an O–OH angle which is above 120° whereas a ′ indicates that it is below 120°. This is shown for rM-M in Fig. 2. In general a ′ implies a more orthogonal OH arrangement while unprimed conformations imply a more colinear arrangement. In some cases OH dipoles orient in a more antiparallel or crossed fashion (Fig. 2) instead of forming an intramolecular hydrogen bond. In these instances a * is added. In extreme cases the OH groups are no longer in direct contact at all and the OCCO dihedral angle is anti-periplanar. This is denoted with a t. An example for CP-MM is given in Fig. 2. The t label is also used in case of di-axial conformations (t5-5) or related conformations without hydrogen bonds (t4-4). Should a label repeat, letters a, b, c, d are added i.e. rM-Ma, rM-Ma′ or CP-MMat. Note that the labels need not to be correlated, so a and a′ may denote conformations differing by more than the difference in O–OH angle. Figures of all molecular structures are provided in Section S5 of the ESI.†
All DFT calculations were done with the ORCA (version 4.2.1) program package.55,56 Only four functionals are explored in this experimentally driven work, two GGA and two hybrids to work out the generic difference between these two traditional approaches. The B3LYP,57–59 PBE0,60,61 PBE62 and BP8657,63,64 functionals were used in conjunction with Grimme's D3 dispersion correction using Becke–Johnson damping as well as three body terms.65,66 This dispersion correction will be implied in the following without explicit statement. We chose the ma-def2-TZVP Ahlrichs basis67,68 set as a compromise between accuracy and computational cost and will abbreviate it as maTZ in the following, where appropriate. In our testing the augmentation raises the computational costs only slightly and does not change the frequencies much relative to the regular triple zeta basis set but significantly helps reduce the BSSE (Basis Set Superposition Error) related to the intramolecular hydrogen bond or contact. This becomes particularly relevant once anti-periplanar arrangements of the OCCO backbones are included. This general issue was previously discussed in more detail by Reiling et al.69 for ethanediol and by Jensen70 for ethanediol and propane-1,3-diol at the MP2 level of computation. Furthermore, density fitting (RI-J) was employed for BP86 and PBE with the corresponding auxiliary basis71 set whereas for B3LYP and PBE0 none was applied.
The geometries generated with CREST were optimised and an analytical frequency calculation was carried out within the double harmonic approximation. These computed analytical frequencies were used as a basis to assign the experimental results. Because the coupling of the vicinal local OH oscillators is weak,40 there is typically no ambiguity in terms of free and bound modes. Since ORCA can only compute Raman activities for numerical frequency calculations, these were added using the RI-JCOSX72 approximation after additional energy sorting of the analytical frequency results. From these Raman activities the differential cross-sections are calculated which take setup specific factors into account. This conversion is described in more detail in ref. 73 and its ESI.
For the transition state search the NEB-CI (Nudged Elastic Band-Climbing Image) method74 was used for the initial guess, as it is implemented in ORCA. A transition state optimisation was then carried out on the converged climbing image. During the optimisation an exact Hessian calculation was carried out on every third optimisation step. All barriers are always given relative to the less stable species. The results are shown in the form of an energy-matrix (see Fig. 3 and more extensive examples in the ESI,† some with arrows to guide the reader to experimentally feasible interconversions). The diagonal elements give the relative energy to the global minimum, increasing when moving from the bottom-left towards the top-right. All off diagonal elements represent the corresponding barrier where rows indicate all relaxation pathways away from the conformer while columns indicate all pathways towards a conformer. Based on previous examples and the results of Ruoff et al.75 we expect and tentatively locate this relaxation threshold near 5 kJ mol−1. This threshold only offers a coarse orientation, due to the large amount of influence parameters like expansion conditions and the gradual nature of conformational relaxation. Example inputs for the ORCA calculations can be found in Table S2 in the ESI.†
Because the theory guidance is not perfect for individual systems, the experimental assignment task has to be split into three hierarchical stages. The first stage involves three symmetric, unquestionably assignable diols, the second one varies substitution patterns but leaves out further functionalisation or ring strain and the third stage introduces secondary chemical functionality or strain as a complicating factor. The key question is: Can such weak secondary hydrogen bond acceptors still be accommodated, before the generic failure of DFT in quantitatively describing normal hydrogen bond shifts5 bends the theory-experiment correlation?
In between these stages, there has to be a learning process which is carried out by a model which correlates harmonic theoretical OH stretching wavenumbers to experiment. Instead of two previously used types of one-parameter correlation (scaling and shifting), we start with a strongly covariant two-parameter correlation in which we subsequently fix the scaling parameter for each of four functionals at its mean value to fit only the shifting parameter. This experimentally driven parametric treatment leads to insights about a fundamental difference between GGA and hybrid functionals in describing weak hydrogen bonds in combination with the well-characterised alcohol substitution trends. For each of the two families of functionals, we use two representatives to avoid overinterpretation. In the end, the qualitatively different values of the obtained parameters in the two families are interpreted to confirm why B3LYP in hindsight has been and still is most useful for vibrational spectroscopists, because it yields fitting parameters which are closest to a diatomic spectroscopic picture of the OH bond in a weakly hydrogen-bonded environment. Here the key question is: How far can the use of evidently unphysical fitting parameters rescue the interpolating performance of other density functionals at the onset of hydrogen bonding?
The spectroscopically helpful three-tier approach requires an interplay of assignment and modelling description, which could also be useful for automated spectroscopic learning algorithms. The grossly different response of GGA and hybrid functionals to weak hydrogen bonding requires correlation figures which include the full range covered by all functionals, at the expense of subtle substitution effects, which are better followed in the linear regression coefficients and in the ESI.† A reader who is just interested in using the present experimental benchmark set for the testing of further density functionals which could replace B3LYP as perhaps the de facto standard in vibrational spectroscopy – an activity strongly encouraged by our work – can jump directly from Sections 4.2–5 before extracting the relevant experimental reference data from the ESI.†
A system with similar conformational simplicity as 0-0 and t6-6 is pinacol (MM-MM). A compact energetic overview of all three systems is given in Fig. 3. The energy difference between the ground state and the excited conformation (′) as well as the energy barrier from the excited conformation to the ground state are rather similar and quite small, allowing for significant conversion if the energetical driving force does not vanish.
Fig. 4 shows that the simulated harmonic B3LYP spectra fit quite well to the experimental ones, once they are wavenumber scaled to align the dominant, least downshifted OH stretching fundamental with experiment. The required scaling factors are not identical but rather similar, indicating some substitution dependence of either the DFT error or anharmonicity. The small deviations between the scaled harmonic theory and experiment are not systematic in sign and size. The dominance of the respective global minimum structures is quite pronounced even in a helium expansion, as shown by the shrinking factors relative to the simulation above the black arrows for non-overlapping bands which range from about 0.2 to 0.5. This relaxation is expected for the 1.8–2.3 kJ mol−1 barriers under jet conditions, despite the small predicted energy differences of 0.7–1.5 kJ mol−1. It is in qualitative contrast to the GGA predictions for t6-6 (see Table S11, ESI†). For pinacol, Lomas et al. find a similar energy difference of about 1.9 kJ mol−1 at the PBE0/cc-pVTZ//B3LYP/6-311+G(d,p) computational level.76 Olschewski et al. also find a similarly low barrier.41 A comparison of our computational results with others35,41,76 can be found in the ESI† (see Table S16). Earlier Ar-matrix FT-IR results by Dahlqvist et al. only observed the most stable conformer35 (see Table S35 in the ESI†). It is also instructive to compare the wavenumber splitting between the two OH oscillators in pinacol as a function of phase state. In different solutions and also in the gas phase at room temperature,10,25,35,39,41,77 this splitting is below 50 cm−1 and isomers cannot be distinguished due to thermal excitation. In an Ar cryomatrix,35 it is close to 50 cm−1, but the matrix isolation shift is comparable in size to spectral isomer differences and only the most stable isomer can be observed. Only jet cooling reveals two isomers separated by 11 and 14 cm−1 with oscillator splittings of 53 and 56 cm−1. It provides the best meeting point for theory, where only the B3LYP calculation predicts a (harmonic) oscillator splitting around 55 cm−1, whereas all other explored methods predict splittings between 67 and 92 cm−1.
|  | ||
| Fig. 4 Experimental spectra (plotted upwards) and simulated spectra (plotted downwards) of ethanediol (0-0),13trans-cyclohexanediol (t6-6)13 and pinacol (MM-MM). To investigate the conformational relaxation behaviour, different admixtures of argon to helium were used in case of t6-6. Saturator (TS) and nozzle temperatures (TN) are provided. The simulation is based on B3LYP calculations and Boltzmann weighted according to TN and the relative energies given in Fig. 3. Wavenumber scaling factors for the free OH mode for the most stable conformer are also given. | ||
The relaxation and thus distinction between the two isomers can be further enhanced by the addition of argon to the carrier gas (see the t6-6 case), confirming the energy sequence. The shoulder of the 3658 cm−1 signal in the MM-MM spectrum is caused by water impurities, also visible as a separate or overlapping signal for the other two diols. Separation of the water Q-branch and the actual band of interest can be achieved by monitoring the bands over extended periods of time, where the water band will continually decrease. It should be noted though that some residual water will remain for typical measurement periods.
When comparing the spectra for the three diols in Fig. 4, a progressive downshift of the bands with the degree of substitution from ethanediol to cyclohexanediol to pinacol is noticeable. It parallels the downshift observed from primary over secondary to tertiary saturated mono-ols.9,11 However, diols allow for more differentiated substitution patterns including asymmetric variants, which shall be explored in later sections together with more challenging spectral assignments. Before doing this, it appears useful to train different correlation models between experimental and theoretical band positions with the three symmetric diols 0-0, t6-6 and MM-MM.
Our approach is to take the three symmetric diols with established assignment and to initially fit all assigned transitions to a two-parameter expression
| ![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) i = hωi − a2 | (1) | 
![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) i is the experimental anharmonic wavenumber of the i-th mode and ωi the harmonic wavenumber predicted by theory. An ideal quantum-chemical method would yield h ≈ 1 and a2 ≈ (175 ± 15) cm−1 for weakly hydrogen-bonded and free OH groups.78 In reality, for a given DFT level, the two parameters are strongly correlated, washing out any subtle substitution patterns among different diols. Therefore, the h value from the global fit to 12 transitions in the three symmetric diols for a given quantum-chemical level is subsequently frozen and the different diols are further discriminated by their best a2 fits. Due to the strong parameter correlation (see exemplary covariance matrices in Table S39 of the ESI†), it is not so important which value of h is used, and we arbitrarily round and freeze the global fit value to the second decimal place when a2 is fitted. A quantitatively poor method may still allow for meaningful predictions, in particular interpolations, based on such a training fit, but the parameters may lose their physical meaning.
i is the experimental anharmonic wavenumber of the i-th mode and ωi the harmonic wavenumber predicted by theory. An ideal quantum-chemical method would yield h ≈ 1 and a2 ≈ (175 ± 15) cm−1 for weakly hydrogen-bonded and free OH groups.78 In reality, for a given DFT level, the two parameters are strongly correlated, washing out any subtle substitution patterns among different diols. Therefore, the h value from the global fit to 12 transitions in the three symmetric diols for a given quantum-chemical level is subsequently frozen and the different diols are further discriminated by their best a2 fits. Due to the strong parameter correlation (see exemplary covariance matrices in Table S39 of the ESI†), it is not so important which value of h is used, and we arbitrarily round and freeze the global fit value to the second decimal place when a2 is fitted. A quantitatively poor method may still allow for meaningful predictions, in particular interpolations, based on such a training fit, but the parameters may lose their physical meaning.
        Fig. 5 shows the result of such a strategy for the B3LYP predictions. One can see that h is slightly (but insignificantly) higher than 1.00. Because at the same time, the best fit a2 is rather close to twice the (negative) diagonal anharmonic constant for alcohols78 and the net effect of off-diagonal contributions is small,4,79 the diol OH bonds are seen to be slightly too soft for this hybrid functional. This combined statement is possible despite the strong fitting parameter correlation and associated large standard deviation. If one now freezes h = 1.01 (rounded to two decimal places from 1.008 of the two parameter all fit) and looks at the individual best fits for the three separate diols, a systematic (and now significant) trend of increasing a2 values with increasing substitution is found. This matches nicely the finding for isolated alcohols,4 but now includes weakly hydrogen-bonded situations.
Fig. 5 also contains the results for PBE0, which was shown to be slightly superior to B3LYP in terms of the independence of a2 on the substitution pattern, for h = 1.00.4 Now, including weakly hydrogen-bonded situations in vicinal diols, the deviation is actually dependent on the predicted wavenumber value, i.e. on weak hydrogen bond-like contacts. Therefore, the unconstrained fit yields h ≈ 0.84, although only the fact that h < 1 is really significant and in part corresponds to the common knowledge that PBE0 predicts significantly too stiff OH bonds. Another important contribution to the low h value is the exaggerated PBE0 OH wavenumber lowering in electron-rich environments. By arbitrarily freezing h at 0.84, one therefore obtains a2-values which are much larger and of opposite sign than one would physically expect due to anharmonicity. As a positive aspect, PBE0 shows a smaller, hardly significant dependence on substitution pattern, like in the simple alcohol case.4 This is still the case when one constrains h to 1, although to a lesser extent (see Fig. S1 in the ESI†). Therefore, we freeze h at 0.84, suggesting that PBE0 somehow systematically overestimates the frequency lowering due to weak hydrogen bonding, if the true anharmonicity is invariant to weak hydrogen bonding.
This systematic overestimation of the frequency lowering due to hydrogen bonding is even more pronounced for GGA functionals, which are exemplified in Fig. S2 of the ESI† for PBE and BP86, again including empirical dispersion correction and using the same basis set. The global two-parameter fit yields h close to 2/3, whereas the magnitude of the a2 value exceeds 1000 cm−1 to compensate for the reduced slope. This shows that the harmonic trends in GGAs are far too reactive to weak hydrogen bond interactions, if one assumes that the true anharmonicity does not change much. For correlation and prediction purposes, this is not necessarily detrimental, but any success of GGAs in predicting OH stretching frequencies is clearly for the wrong reason. Therefore, we will focus more on the two hybrid functionals in the next steps, which appear to be closer to the anharmonic reality. An overview of all fit parameters including their errors can be found in Section S4.3 of the ESI.†
0-M is of considerable interest to the astrochemical microwave community80,81,83,84 as an extension of 0-0 which has been found in interstellar clouds.85,86 A comparison of other computational data17,80–82,87,88 with ours can be found in the ESI† (Table S4). The hybrid functionals as well as the MP2 results agree with each other while the GGAs and HF fail to predict the correct energetic order as indicated by our vibrational and the rotational data.80,81 The GGAs predict the primed species to be more stable, a deficiency recurring for other diols. Our results are consistent with Lovas et al., who investigated the relaxation pathways80 and was also able to observe all conformers besides 0-Mc′. Arenas et al.81 observed all conformers besides 0-Mb′ and 0-Mc′ (more details are provided in Table S4 of the ESI,† where we also translate our nomenclature to the one introduced by Vázquez et al.89).
A first candidate to test the previously established correlation for t6-6 is rac-butane-2,3-diol (rM-M) since it shares the same substitution pattern (secondary–secondary). Given the more symmetric nature of rM-M in comparison to 0-M the conformational landscape is reduced to 5 energetically relevant, partially interconverting conformers. The rM-M* conformer is only stable at the B3LYP/maTZ level of computation and will not be assigned. The remaining conformers were assigned with satisfactory agreement to the experimental results.
The previously mentioned study by Barone and co-workers44 also investigated rM-M. The energetical landscape they found is similar to ours with regards to the relative energies as well as the interconversion pathways. A comparison of their anharmonic vibrational treatment can be found in Table S20 in the ESI.† The VPT2 approach performs significantly better while the local mode approach heavily underestimates the band positions in some instances (up to 87 cm−1), although it looked quite promising for t6-6. Jesus et al. investigated rM-M via FT-IR matrix spectroscopy in Ar and Xe.47 Their results are also compared to ours in Table S20 (ESI†) and highlight the importance of jet reference data since the matrix effects can be fairly substantial and non-uniform. They also only observe the rM-M and rM-Ma conformers. A comparison of our computational data with literature values19,36,37,44,47,88 can be found in the ESI† (Table S14). Besides the results by Wang et al.36 the computational results agree reasonably well with each other. Furthermore whether or not rM-M* is a stable species (without imaginary frequencies) depends on the method used.
Diols allow for different local substitution patterns for the same overall degree of substitution at the vicinal carbons. For example 2-methyl-propane-1,2-diol (0-MM) has the same overall degree of substitution (nD = 2) as the previously discussed t6-6 and rM-M. Therefore the derived correlation for t6-6 may have predictive power for 0-MM. Indeed, this is the case (see Fig. S15 ESI†), suggesting that the overall degree substitution is the deciding factor. Furthermore, the conformational landscape of 0-MM is similar to rM-M, including the stability issues of rM-M*/0-MM*. An energetic comparison with other tested functionals can be found in the ESI† (Table S8).
Another possible methyl substitution pattern is secondary-tertiary (nD = 3) which is realised by methyl-butanediol (M-MM). In terms of its symmetry it is analogous to 0-M which results in a very similar conformational landscape with 8 energetically relevant conformers. The B3LYP energetic ordering varies slightly but the barriers are very similar. Therefore we expect a similar relaxation behaviour. The other tested functionals also behave similarly as they did for 0-M as can be seen in the ESI† (Table S15). The assignment procedure was analogous to that of 0-M, where 6 conformers were assigned in total (see Fig. S15 ESI†).
A surprisingly large and non-monotonic energetical preference is observed for diol hydrogen bonds pointing away from or towards a fully substituted carbon site (0-MM, M-MM, MM-MM) for both hybrid functionals. While this isomerism is absent for MM-MM, it prefers the direction towards the MM site by about 1.5 (for 0-MM) or even 2.3 kJ mol−1 (for M-MM). In case of 0-M the 0 site is energetically favoured by 0.2 kJ mol−1, instead.
A final note on the B3LYP performance in the training set refers to the uniformly increasing downshift from theory with nD to match experiment. A simple correlation ![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) i = ωi − (137 + 3nD) cm−1 (all(nD)) appears to fit all substitution classes within error bars. This is reminiscent of, but more systematic than the trend in alcohols (mono-ols),4 where the average correction between primary and tertiary alcohols is close to 2 × 3 cm−1 but secondary alcohols resemble tertiary systems. We will come back to this physically looking elementary global fit alternative, although it is likely that the apparent stepwise increase of anharmonicity with degree of substitution is an artifact of the functional4 and the actual anharmonicity of the OH group is likely larger, being artificially attenuated by the intrinsically too soft OH oscillator at B3LYP level for this basis set.
i = ωi − (137 + 3nD) cm−1 (all(nD)) appears to fit all substitution classes within error bars. This is reminiscent of, but more systematic than the trend in alcohols (mono-ols),4 where the average correction between primary and tertiary alcohols is close to 2 × 3 cm−1 but secondary alcohols resemble tertiary systems. We will come back to this physically looking elementary global fit alternative, although it is likely that the apparent stepwise increase of anharmonicity with degree of substitution is an artifact of the functional4 and the actual anharmonicity of the OH group is likely larger, being artificially attenuated by the intrinsically too soft OH oscillator at B3LYP level for this basis set.
In case of PBE0 the rounded h value remains unaffected by the enlarged training set. The fits for 0-0 and 0-M are significantly separated from each other whereas t6-6, M-MM and MM-MM are clustered closely together. The relative indifference to the degree of substitution of PBE0 relative to B3LYP still prevails. The slightly lower errors of the PBE0 combined (all) fit in comparison to B3LYP indicate a more global character of PBE0, whereas the individual fits perform slightly worse. If one had to come up with a substitution-parameterised global PBE0 fit like in the B3LYP case, it would perhaps be ![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) i = 0.84ωi + (411 + 3n0) cm−1 (all(n0)), where n0 is the number of unsubstituted CH2-groups in the vicinal diol scaffold.
i = 0.84ωi + (411 + 3n0) cm−1 (all(n0)), where n0 is the number of unsubstituted CH2-groups in the vicinal diol scaffold.
Both GGAs deviate substantially from h = 1 with about 0.66 in either case and small or insignificant changes with the extension of the training set (see Fig. S2 and S3 of the ESI†). The 0-0, 0-M and t6-6 fits are still significantly separated from each other for both functionals, whereas M-MM and MM-MM cluster together for both GGAs. The individual fits for rM-M (BP86: −1228.8 ± 1.8; PBE: −1221.5 ± 1.8) and 0-MM (BP86: −1228.8 ± 2.2; PBE: −1221.5 ± 2.1) are again very close together, as in the other investigated functionals. However rM-M and 0-MM do not agree well with t6-6 for the GGA functionals. Overall, the individual fits of the GGAs do worse than those for the hybrid functionals but the combined (all) fits still perform reasonably well globally.
After this minimalistic nD-dependent training (5 species of different nD) and initial testing (2 species of the same nD) of harmonic theory-experiment correlations for vicinal diols of alkanes without any other functional groups or ring strain, it must be checked how well the models hold up when modest chemical complexity is introduced.
The conformational landscape of 0-V is the most complicated yet with 9 conformers within 5 kJ mol−1. However, many barriers are sufficiently low that relaxation under jet conditions is expected. In Table S6 of the ESI† the different tested functionals are compared showing a considerable dependence upon zero point correction. The nD = 1 correlation works quite well with the notable exception of the 0-V′ conformer, where larger deviations for the bound OH mode (O–H⋯O–H) can be found. This can be explained by the direct involvement of the π-system in the hydrogen bond arrangement exclusive to this conformer. Additionally, the two OH stretching vibrations are considerably less localised in comparison to all other conformers. In total 7 conformers were assigned.
In contrast to 0-V phenyl-ethanediol is less conformationally complex with 7 energetically relevant conformers. Additionally, the relative energies increase more quickly and an anti-periplanar orientation may become relevant. The PBE0 results of Lomas88 (Table S7, ESI†) agree with the B3LYP/maTZ energetic order. As for 0-V some relaxation is expected. The nD = 1 correlation again fits well to the experimental data with larger deviations for the Pha′ conformer. Apart from a poorer match for both OH stretching vibrations, this behaviour is analogous to that of 0-V′. The structural motif is the same for both conformers as can be seen in Fig. S25 and S26 in the ESI.†
The effect of trifluorination is tested with trifluoro-propanediol (0-F). Only 6 conformations are potentially energetically relevant with very different relative energies in comparison to 0-M and only two conformers are expected to relax significantly. This is also the first system where a primed conformer i.e. 0-F′ constitutes the global minimum, likely due to hydrogen bonding towards the fluorine atom. The GGAs yield the same energetic order as the hybrid functionals which can be seen in Table S5 in the ESI.† Other than for π systems, the involvement of fluorine in the hydrogen bond arrangement does not result in a worse wavenumber description with the nD = 1 model. In comparison to 0-V′ and 0-Pha′ the OH stretching vibrations of 0-F′ are more localised. Overall 3 conformers were experimentally assigned.
In summary, the B3LYP model yields good predictions for both 0-V and 0-Ph except for cases where the vinyl or phenyl group is directly involved in the hydrogen bonding. For 0-F, such a restriction is not required. This could indicate that the nature of the additional hydrogen bond acceptor is the deciding factor. There seems to be a systematic theoretical underestimation of the shift due to π-interaction for 0-V′ and 0-Pha′, which could be extrapolated to related systems.90
The assigned bands are plotted together with the derived fits in Fig. 7 for B3LYP and PBE0. They cluster satisfactorily around the nD = 1 (0-M) or nD = 2 (t6-6) correlation for B3LYP while in case of PBE0 due to the uniform nature of the fits no clear favourite can be declared. A B3LYP fit including only the three nD = 1 derivatives with weak additional hydrogen contact options 0-V, 0-Ph and 0-F yields a2 = (142.6 ± 0.6) cm−1. If the conformers with actual weak hydrogen bonds 0-V′ and 0-Pha′ are excluded, a2 = (141.8 ± 0.5) cm−1 moves closer to the trained nD = 1 correlation. This already indicates that in the presence of stronger hydrogen bonds the current correlations have to be modified, most likely due to deficiencies of hybrid functionals in spectroscopically describing such hydrogen bonds in a balanced way. The high substitution sensitivity for B3LYP, which likely is also a subtle and specific deficiency,4 is confirmed. Fig. S4 in the ESI† shows the less systematic results for BP86 and PBE.
Before discussing each system in more detail we briefly look at the evolution of the intramolecular hydrogen bond/contact length with increasing ring size. For the most stable unprimed species OH–O distances are predicted to be 3.34 Å for t4-4, 2.67 Å for t5-5, 2.31 Å for t6-6, 2.25 Å for c6-6 and 2.23 Å for t7-7 at the B3LYP/maTZ level. This is reflected in the larger experimental downshifts when the ring size is increased and also supports similar Raman signal spreads for t6-6, c6-6 and t7-7. For 0-0 a distance of 2.38 Å is computed (experimental estimate: 2.36 Å15), only slightly higher than for t6-6. This contrasts their substantially different spectral positions of the bound oscillators (Δ![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) ≈ 10 cm−1) clearly indicating the importance of nD besides the hydrogen bond length when judging the general downshift of different systems. Note again that the term hydrogen bond is used here in a purely phenomenological way in the context of OH stretching wavenumber downshifts, whereas bond critical points considered by some researchers to indicate hydrogen bonding require shorter distances.91
 ≈ 10 cm−1) clearly indicating the importance of nD besides the hydrogen bond length when judging the general downshift of different systems. Note again that the term hydrogen bond is used here in a purely phenomenological way in the context of OH stretching wavenumber downshifts, whereas bond critical points considered by some researchers to indicate hydrogen bonding require shorter distances.91
The rather long hydrogen bond in case of t4-4 leads to a more complicated conformational landscape in comparison to t6-6, where conformers with nothing resembling a hydrogen bond become energetically competitive. These conformers are designated with a t (e.g. t4-4t). In total 6 conformers are energetically relevant and connected by low, but sometimes wide and thus more relaxation-resistant barriers. A comparison of all computational results can be found in Table S9 in the ESI.† The B3LYP nD = 2 correlation deviates considerably from the experimental spectrum. However, the most intense bands can be matched and 4 conformers can be assigned. The difficulties of the model can be attributed to the strong ring strain found in cyclobutane. The assignment of one band remains unclear but likely belongs to t4-4′ with B3LYP significantly underestimating the shift between the two OH vibrations. Higher level electronic structure methods or vibrational treatment might resolve this issue.
For t5-5 di-axial conformations (indicated by a t) become energetically relevant for a total of 8 relevant conformers. Similar to t4-4 barriers are quite low and unlike for t6-6 conversion from di-axial to di-equatorial conformations is feasible. An energetic overview of all tested functionals can be found in Table S10 in the ESI.† The B3LYP correlation appears to fit well at a first glance. However the lack of spacing between signals makes a proper assignment difficult. VPT292 calculations carried out with Gaussian 16 (Revision A.03)93 allow to analyse the possible involvement of hot bands of the most stable conformer at a nozzle temperature of 360 K. A summary of these calculations can be found in Table S21 of the ESI.† In general an assignment of di-axial conformers is difficult and also brings the intensities and indirectly barriers into question in some instances. Therefore, we restrict ourselves to the assignment of the two most stable conformers of t5-5 and leave higher energy conformations to a future, perhaps microwave-assisted investigation.
Among all discussed systems in this section c6-6 is the conformationally simplest, with 4 relevant structures. In contrast to the other systems some significantly higher barriers exist. Relative energies can be found in Table S12 in the ESI† and include a comparison with the results of Lomas.88 The correlation predicts the band positions quite well. This suggests that axial–equatorial hydrogen bonds do not provide a challenge for the model. To gauge the relaxation behaviour spectra at different nozzle distances (dN) were recorded. Closer to the nozzle 3 conformers can be assigned while further away this drops to 2.
As for t6-6, di-axial conformations are not energetically relevant for t7-7. For some other conformations, barriers are not expected to be overcome under jet conditions. An energetic comparison of all tested methods for t7-7 can be found in Table S13 of the ESI.† The nD = 2 B3LYP model works quite well, making a total of 5 assignments possible.
The assigned bands are plotted together with the derived fits in Fig. S5 in the ESI,† for B3LYP and PBE0. It immediately becomes apparent that t4-4 cannot be described as a regular nD = 2 system. This is somewhat expected since the ring strain is significantly higher than for all other cyclic systems. For both B3LYP and PBE0 t4-4 behaves more like an nD = 0 or nD = 1 diol. For B3LYP, the t6-6 (nD = 2) correlation fits well to the other cyclic systems, while PBE0 behaves rather uniform among the fits for more than one substituent. In Fig. S6 in the ESI† the plots for BP86 and PBE show no improvement with regard to t4-4, but some larger deviations for t7-7.
CP-MM is among the few systems studied here where t conformers are energetically feasible. Furthermore, with 11 conformers within 5.2 kJ mol−1 it is also the most complex diol. Two families of conformers (t and non t) can convert to some degree within themselves but not among each other. An overview of the energetics for all tested methods can be found in Table S17 of the ESI.† The B3LYP nD = 4 model still performs reasonably well, making an assignment of 6 conformers possible. However, no t conformers are included in the assignment.
In case of CP–CP t conformers are no longer energetically feasible and the amount of conformers to be considered is reduced to 8. There are again two families that are relaxing separately (CP–CPc/CP–CPc′ and all other conformers). Similar to 0-F, primed species constitute the most stable conformers for CP–CP. An overview of the energetics for all tested methods can be found in Table S18 of the ESI.† The addition of another cyclopentyl moiety also significantly increases the deviations of the B3LYP nD = 4 model from the experiment. However, similar to t4-4 one can still make sense of the spectrum with some assignments remaining tentative, but all conformers being assigned. Additionally, it appears likely that the energetics of the 2 most stable conformers are reversed.
The assigned bands are plotted together with the derived fits in Fig. S7 in the ESI† for B3LYP and PBE0. As the plots indicate, CP-MM and progressively CP–CP are downshifted relative to the nD = 4 (MM-MM) fit. This is further quantified by fitting the CP–CP data points separately. The deviations for both hybrid functionals are very similar. Fig. S8 in the ESI† shows the results for the GGAs which behave in line with the hybrid functionals as can be seen when comparing the individual CP–CP fits. For all methods the same substitution trend is found, different from the rather universal behaviour of PBE0, BP86 and PBE for CP-free systems. Judging from these observations as well as those for t4-4 it appears that closing the cycle from one carbon atom to the other of the 0-0 basic diol unit leads to an increase in wavenumber (decrease of a2, whether it is positive for B3LYP or negative for the other functionals) while closing it at the same carbon atom leads to a decrease in wavenumber (increase in a2). It may be speculated that for trans-cyclopropane-1,2-diol a2 would decrease even further while it should increase for [1,1′-bicyclobutyl]-1,1′-diol. Apart from these ring strain effects, the diol OH stretching wavenumbers can be modeled surprisingly well with a simple linear model and even better with some explicit dependence on the degree of substitution for the hybrid functionals. This shall be analysed in more quantitative detail in the next section.
In Fig. 8 the hybrid functionals are compared. As was previously alluded to in Section 4.2 PBE0 is less dependent upon the substitution patterns which can also be seen from the all fit boxplot. The individual fits for each degree of substitution (relevant for conformational assignment) show a smaller deviation from the experimental results for B3LYP whereas the global fit performs better in case of PBE0. For all fits except nD = 4 the median centers around 0 cm−1, showing no clear trend for over- or underestimation. In case of the nD = 4 substitution a general trend for both functionals towards overestimation can be found as indicated by the median being around +2.5 cm−1. Fig. 8 also shows the pattern specific all fits (all(nD) and all(n0)) as introduced in Section 4.4. In case of B3LYP the median increases slightly while the spread decreases substantially. In case of PBE0 the median decreases slightly while the spread does not significantly change. This confirms the substitution dependence of B3LYP, whereas PBE0 behaves largely independent of the degree of substitution.
Fig. 9 shows the behaviour of the two tested GGAs. Both functionals perform similarly and significantly worse than in particular B3LYP for substitution-dependent fits. For the global fit, a somewhat opposite trend may be seen despite strongly unphysical GGA parameters, but PBE0 still performs best. The performance contrast between the combined (all) fit and the substitution specific ones for B3LYP further emphasises its unique behaviour in comparison to the GGAs, but there are also qualitative similarities (overestimation for nD = 4,1, underestimation for nD = 2). Overall, the global fit for BP86 and PBE could still be used for assignment purposes in cases where signals are well separated. However, a good substitution-dependent fit is a substantial assignment aid within a given diol or a class of related diols.
As can be seen from Fig. 11, the deviations for the GGAs are much larger with a broader spread than for the hybrid functionals. Both behave very similar to each other. A general trend towards overestimation with an increase of the degree of substitution can again be observed. Abberations for CP–CP and t4-4 are reminiscent of those for PBE0.
The average errors for the shifts between free and bound OH stretching fundamentals should be compared to the experimental median (43 cm−1) and extreme values (0–81 cm−1) for this shift in all assigned cases. From this perspective, the GGA performance is not very satisfactory. A normalised error evaluation based on the median absolute deviation (MAD) and the mean absolute error (MAE) can be found in the ESI† (Fig. S12), which further highlights this fact. Regular MAD and MAE variants can be found in Fig. S11 of the ESI.†
The plot illustrates that h is close to the ideal value of 1 for B3LYP while the other functionals suggest values lower than 1, with PBE0 still being somewhat close. It also indicates an individual fitting of each of the four classes. This is illustrated in Fig. S6 and S7 of the ESI.† One could also think of mixing the B3LYP and PBE0 results with similar weights, to obtain the best overall correlation between theory and experiment and at the same time rather physical h and a2 values.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 13) is observed. For liquid diol hydrogen bonded dimer motifs, the deviations are likely larger.94 The systematic substitution dependence (nD = 0 to nD = 2) observed for B3LYP persists in the most stable dimer. While these findings confirm that the presented correlations should not be extrapolated, they also indicate significant potential for a refined DFT-based model which includes stronger hydrogen bonds.
13) is observed. For liquid diol hydrogen bonded dimer motifs, the deviations are likely larger.94 The systematic substitution dependence (nD = 0 to nD = 2) observed for B3LYP persists in the most stable dimer. While these findings confirm that the presented correlations should not be extrapolated, they also indicate significant potential for a refined DFT-based model which includes stronger hydrogen bonds.
      
      
        
        In case of π and F hydrogen contacts, discussed in Section 4.5, the derived correlation for B3LYP still works sufficiently well, unless a π system cooperatively enhances the hydrogen bond. It is rewarding that the best slope of the linear experiment-theory correlation is very close to 1 for B3LYP and the offset qualitatively corresponds to the expected diagonal anharmonicity, whereas this is not the case for PBE0, PBE and BP86. On the other hand, the latter three functionals give offsets which depend less systematically on substitution (cf.Fig. 5 and Fig. S2 (ESI†), Fig. 8 and 9), which probably corresponds to slightly more physical behaviour.78
All tested functionals follow a trend where a cyclic connection of one carbon atom to the other of the ethane-1,2-diol subunit results in an excessive increase of the wavenumber once ring sizes as small as cyclobutane are reached. Additionally, closing the ring on the same carbon atom with some ring strain leads to a progressive decrease of the OH stretching wavenumber relative to the best fits for open systems (cf. Fig. S7 and S8 ESI†). For all functionals a trend towards overestimation of intramolecular splittings induced by the weak hydrogen bond can be found with an increase of the degree of substitution, and also with the shortness of the intramolecular hydrogen bond. B3LYP performs best, followed by PBE0, while GGAs significantly overestimate such splittings (cf.Fig. 10 and 11), probably due to delocalisation error.95
The overall good performance of B3LYP with the compelling multiplicative scaling factor h of 1 and a systematically substitution dependent anharmonicity correction a2 in eqn (1) should not be overinterpreted in terms of physical content, because the diagonal anharmonicity effects are likely larger and more uniform.4,78 More physical a2 values can be obtained by fixing h at 1.01, leading to an all(nD) fit of the form ![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) i = 1.01ωi − (175 + 3nD) cm−1. This is also in line with small expected off-diagonal anharmonicity contributions for alcohols.4,79 In the presence of ring strain, further systematic correction factors would have to be included. This simple formula is close to the best interpolation of the experimental data and at the same time semiquantitatively encodes the two remaining B3LYP deficiencies, i.e. the need to upscale the somewhat too soft computed harmonic frequencies by up to 1% and to make the (now reasonable) anharmonic correction somewhat too dependent on the degree of substitution. To illustrate the correlation between harmonic and anharmonic deficiencies, different fits with set h values can be found in the ESI† (Table S40). To underscore that the specific numbers depend on the basis set without changing the essential conclusions, we also include the h = 1.01 and h = 1.00 fits for the Gaussian16 B3LYP calculations using the may-cc-pVTZ96–98 basis set as used in ref. 4. In this case, an all(nD) fit of the form
i = 1.01ωi − (175 + 3nD) cm−1. This is also in line with small expected off-diagonal anharmonicity contributions for alcohols.4,79 In the presence of ring strain, further systematic correction factors would have to be included. This simple formula is close to the best interpolation of the experimental data and at the same time semiquantitatively encodes the two remaining B3LYP deficiencies, i.e. the need to upscale the somewhat too soft computed harmonic frequencies by up to 1% and to make the (now reasonable) anharmonic correction somewhat too dependent on the degree of substitution. To illustrate the correlation between harmonic and anharmonic deficiencies, different fits with set h values can be found in the ESI† (Table S40). To underscore that the specific numbers depend on the basis set without changing the essential conclusions, we also include the h = 1.01 and h = 1.00 fits for the Gaussian16 B3LYP calculations using the may-cc-pVTZ96–98 basis set as used in ref. 4. In this case, an all(nD) fit of the form ![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) i = 1.01ωi − (198 + 2nD) cm−1 or
i = 1.01ωi − (198 + 2nD) cm−1 or ![[small nu, Greek, tilde]](https://www.rsc.org/images/entities/i_char_e0e1.gif) i = 1.00ωi − (160 + 2nD) cm−1 would be more appropriate, indicating that the B3LYP deficiencies can be somewhat attenuated by the choice of a different basis set.
i = 1.00ωi − (160 + 2nD) cm−1 would be more appropriate, indicating that the B3LYP deficiencies can be somewhat attenuated by the choice of a different basis set.
For practical purposes, a vibrational spectroscopist who wants to quickly predict OH stretching fundamentals for a single new vicinal diol is well advised to pick a diol with the same degree (and kind) of alkyl substitution from our database, do a B3LYP-D3 difference calculation between the new and the reference molecule and add this difference to the experimental value of the reference system.
Evidently, the main goal of this work is more far-reaching in different directions. Our deposited experimental spectra and structure files (ref. 99 and 100 and ESI†) help to expand the experimental benchmark sets for alcohols4 and provide a strong basis for the future testing of anharmonic frequency calculations as well as different underlying electronic structure methods for OH groups engaged in weak hydrogen bond like environments. It remains to be seen whether similarly successful and simple correlations between harmonic theory and experiment can be found for medium strength hydrogen bonds, where the DFT delocalisation error95 grows, but anharmonic effects also become more important and perhaps less cancelling. It may be necessary to move to wavefunction-based methods and adequate anharmonic treatments to disentangle these two contributions.5,78,79 Another avenue may be the improvement of conformational abundance predictions by adding single point energies from higher level electronic structure methods to the numerous DFT-predicted minima and saddle points reported in this work. However, the illustrated complexity of the relaxation processes in the context of non-equilibrium supersonic jet preparation of vicinal diols likely shifts that further into the future and will profit from further microwave characterisation of the cold conformations.
| Footnote | 
| † Electronic supplementary information (ESI) available: Experimental details as well as the raw spectra (http://10.25625/CVAGRL), relative electronic and zero point corrected energies, additional plots, assignment arguments and results as well as structures including the xyz files (http://10.25625/VJF95K) are provided. See DOI: 10.1039/d1cp03367k | 
| This journal is © the Owner Societies 2021 |