Sönke
Oswald
and
Martin A.
Suhm
*
Institut für Physikalische Chemie, Universität Göttingen, Tammannstr. 6, 37077 Göttingen, Germany. E-mail: msuhm@gwdg.de
First published on 15th August 2019
An experimental benchmark data base on rotational constants, vibrational properties and energy differences for weakly and more strongly hydrogen-bonded complexes and their constituents from the spectroscopic literature is assembled. It is characterized in detail and finally contracted to a more compact, discriminatory set (ENCH-51, for Experimental Non-Covalent Harmonic with 51 entries). The meeting points between theory and experiment consist of equilibrium rotational constants and harmonic frequencies and energies, which are back-corrected from experimental observables and are very easily accessible by quantum chemical calculations. The relative performance of B3LYP-D3, PBE0-D3 and M06-2X density functional theory predictions with a quadruple-zeta basis set is used to illustrate systematic errors, error compensation and selective performance for structural, vibrational and energetical observables. The current focus is on perspectives and different benchmarking methodologies, rather than on a specific theoretical method or a specific class of compounds. Extension of the data base in chemical, observable and quantum chemical method space is encouraged.
Our simple approach complements the more sophisticated strategy to directly and accurately predict experimental anharmonic observables for very small molecules with high level methods.57–61 By focusing on the smallest molecules with 1–3 vibrational degrees of freedom, no significant compromise in the vibrational treatment is required, such that the quality of the electronic structure approach can be directly assessed. For the smallest hydrogen-bonded complexes, minor compromises in the electronic structure level and in the vibrational treatment still provide excellent matches between theory and experiment.57,61 Such matches are very encouraging for the success of modern numerical quantum chemistry, but they lack the scalability to chemically more relevant system sizes with potentially new binding contributions such as many-body dispersion effects or chirality-selective forces.
Our proposal also complements the strategies for medium-sized molecules of, say, 5–20 atoms, to assume the correctness of vibrational perturbation theory62 or to limit the expansion of the anharmonic force field to its leading terms.63 These and related methods are very valuable and powerful tools for well-chosen systems and degrees of freedom, but they quickly fail whenever secondary minima on the potential energy hypersurface come into reach. Adding a methyl rotor can deteriorate their performance even for apparently unrelated degrees of freedom or at least demands tailored computational treatment.
Our approach avoids the high computational and man-power barrier associated with such detailed calculations and is meant to replace the frequent but unsatisfactory practice of directly matching harmonic predictions with anharmonic experiment in a “comparing apples-to-oranges”-fashion. The proposed back-correction rules are designed to be generally valid and sufficiently conservative, but occasional exceptions are to be expected for very floppy systems such as quasistructural molecules64 and complexes.65 However, outliers could also be due to experimental error. To give just a few examples, direct absorption measurements of hydrogen-bonded clusters may not always provide unambiguous size and conformation assignments.66 The omnipresent methyl groups in organic complexes may complicate the unique rotational constant determination by microwave spectroscopy67 and also the dissociation energy determination by stimulated-emission-pumping/resonant 2-photon ionization.14 Multi-experimental approaches are particularly promising in this context.68,69 It is crucial to remain skeptical on the experimental and the theoretical side70 in such benchmark comparisons and both sides profit from a critical discussion.
Standard Newtonian molecular dynamics simulations are not an option for anharmonic high frequency vibrations, although this is frequently claimed in the literature.80–82 There will always be a specific simulation temperature for which the average classical frequency matches the quantum mechanical anharmonic value for a specific mode, but this temperature exceeds several 1000 K for a hydride stretching fundamental and changes from mode to mode.44 At room temperature or even below,80 hardly any high frequency mode anharmonicity is probed by Newtonian dynamics. Agreement with experiment is then the result of a fortuitous cancellation between electronic structure error and vibrational error without any benchmarking insight. We note that a more elaborate procedure based on time-integrated normal modes83 claims to recover a larger fraction of the anharmonicity of high frequency modes from classical trajectories even at room temperature.
A meeting point at the harmonic level is much more universal to explore, because any quantum chemical method which can provide minimum structures of molecular systems also allows for the computation of the Hessian matrix of local second derivatives. Rigorous anharmonic calculations, on the other hand, fall short of being routine for any system with more than 4 atoms.84,85 Vibrational perturbation theory62 may represent an alternative particularly for stiff high frequency modes, where it appears to be quite successful even in hydrogen-bonded situations.86 However, this is difficult to prove rigorously in the absence of analogous results from fully converged wavefunction methods and potential energy hypersurfaces of spectroscopic accuracy for the same systems. This is particularly true whenever large amplitude motion like methyl rotation comes into play. Vibrational CI methods also partially lose their intrinsically high performance when it comes to large amplitude motion.87 A similar situation applies to rotational constants, which are much harder to predict including anharmonic zero point motion than at equilibrium.88 It has been remarked more than 15 years ago89 that “one of the traditional tasks of rotational spectroscopy, to provide unambiguous benchmarking data for the development of theoretical descriptions is considerably hindered, in particular for the most weakly bound clusters.” To a large extent, this is still true today, although there are several very accurate theory-experiment comparisons for small systems90 and semiexperimental non-covalent bond lengths have recently been proposed.91 Rotational constants are likely to involve less methodical bias than singled-out bond distances in a polyatomic molecule and we therefore concentrate on these collective observables. Again, a meeting point at the equilibrium structure level would be very helpful for the theoretical community, but it comes at the price of losing the enormously high accuracy of rotational spectroscopy, which includes zero-point motion. As long as the shift of the meeting point and its resulting accuracy loss does not spoil the discriminatory power of the experimental benchmark, we consider this acceptable. In the end, a large number of soft experimental constraints can be more useful for a ranking of approximate methods than a small number of too rigorous ones, which theory cannot meet, anyway.
We propose the following classes Xn of fairly conservative constraints for the systems under investigation for back-correction of experimental, intrinsically anharmonic data to harmonic or equilibrium structure estimates (see Fig. 1):
![]() | ||
Fig. 1 Illustration of the back-correction procedures Xn proposed in the text with a microwave spectrum of jet-cooled HP and its dimer,46 an FTIR spectrum of jet-cooled HP/NN mixtures,47 and schematic energy diagrams for the dissociation of NN from a hydrogen bond donor like HP or HF and for two interconvertible HPHPNN isomers.26 |
• R1: Experimental ground state rotational constants of rigid molecules are back-corrected to equilibrium values by +1(±1)% to account for typical harmonic and anharmonic averaging effects, building on previous suggestions and evidence.90,92,93 For diatomics, extrapolation of constants in vibrationally excited states allows for much more accurate equilibrium values (denoted R1D) than polyatomic molecules (R1P).
• R2: Experimental rotational constants of weakly bound complexes are back-corrected by +1(±3)% to account for potentially more pronounced anharmonic effects in both directions.
• V1: Experimental fundamental OH stretching wavenumbers of monomers ν1 are corrected for diagonal anharmonicity x11 from an overtone measurement and given an additional error bar of ±20% of the diagonal anharmonicity constant due to off-diagonal contributions, which are often small and relatively unsystematic for isolated OH modes. Again, diatomics (V1D) represent a special case because of the absence of off-diagonal corrections, as opposed to polyatomic molecules (V1P).
• S1: Complexation shifts of OH stretching wavenumbers relative to the monomer should involve a high degree of anharmonic cancellation between the monomer and the complex. Nevertheless, we add a safety margin of 50% in both directions, because it is known from an analysis of the simplest organic hydrogen bond that diagonal and off-diagonal effects tend to cancel but are individually quite large.94
• S2: Spectral shifts between different isomers of a complex (or of a flexible molecule) or of different components in a degenerate mode coupling pattern are likely to involve little anharmonic correction due to systematic cancellation, because the messenger is the same. If the complexation shift (S1) is small, it is unlikely that it induces very different anharmonicities. If the complexation shift is large, the uncertainty in the isomer shifts also becomes larger. Therefore, we introduce a S1-proportional error bar of ±10% for back-correction to harmonic isomer shifts. Within a flexible molecule, a proper experimental reference point is usually missing and one has to resort to S2 between isomers instead of S1 for the error estimation.
• E1: Dissociation energies of OH-bonded dimers are typically too small in the harmonic limit and anharmonic treatments bring them closer to the infinite nuclear mass limit, but intramolecular effects may counteract this trend. We back-correct experimental dimer dissociation values to harmonically estimated ones by subtracting up to 10% and adding up to 5%, which should cover the expected variation for typical hydrogen bonds.
• E2: Energy differences between different isomers of a complex can be affected strongly by anharmonic corrections. Even if the total energy difference is small, it can be the net effect of rather different and thus differently anharmonic interactions. Also, experiment often can only provide the sign of the energy difference with some certainty. Tentatively, we add an error bar of ±50% to the anharmonic value, although well-designed cases of isomerism are likely to involve smaller errors95 and worst cases will not be covered by this uncertainty.
All these back-conversion errors are added to the individual experimental error bar, which can be significant for low resolution and more indirect observables but is usually negligible for high resolution spectroscopy data. We emphasize that these back-correction rules do not depend on the theory level to be tested but rather apply to the experimental data themselves. Due to the dependence of even the most straightforward anharmonicity corrections for diatomics on the employed model Hamiltonian, the Born–Oppenheimer approximation and other subtleties, we somewhat arbitrarily set a lower limit for the experimental error bar at 1 MHz for rotational constants (which is certainly conservative for polyatomic molecules96) and 1 cm−1 for vibrational term values.
There is always the possibility that the error bar for an individual back-correction from experiment to the meeting point with theory is not conservative enough. Single outliers must be compensated for by a sufficiently large size of the benchmark database. A few examples of well-characterized experimental outliers for small molecular systems shall be briefly discussed.
The ground state rotational constant of H2 is lower than the equilibrium constant97 by 2.5%, whereas R1 assumes 0–2%. For molecular complexes, mixed theoretical-computational analyses which rigorously connect the equilibrium value of rotational constants with the ground state value are not very abundant.89 An extreme example is Ar-HBr, which, according to the analysis in ref. 98 has a qualitatively different preferred structure at equilibrium (Ar-BrH) than at zero point level (Ar-HBr), which leads to ZPVE effects on the order of 30% for rotational constants. Complexes with several equivalent accessible minima such as HCl–H2O may show similar deviations.89 Also, the A rotational constant of quasilinear complexes may experience large ZPVE corrections due to centrifugal effects and the asymptotic divergence at linear configurations.20 Furthermore, there are cases where molecular rotation couples to internal rotation, occasionally leading to enormous deviations between the experimental A constant and the equilibrium one.65 The R2 constraint used in this work may thus encounter exceptions, but should still be valid as a rule for not too floppy and not too linear polyatomic complexes.
Experimental OH stretching wavenumbers for monomers usually have a narrow range of anharmonicity contributions, unless there is significant intramolecular hydrogen bonding. Even in the latter case, diagonal and off-diagonal shift contributions tend to attenuate each other. Furthermore, the OH stretching mode only shows major resonances once shifted below about 3 μm. Therefore, it is actually difficult to find molecules where off-diagonal anharmonicity contributions exceed 10% of the diagonal correction. Formic acid is a case where the 10% are reached for the net effect, because all off-diagonal corrections point in the same direction.30 Complexation shifts in OH stretching wavenumbers are particularly difficult to assess, because rigorous analyses are scarce. In the case of the methanol dimer,99 there is a near cancellation of different contributions which have an individual magnitude up to 30% of the total shift. Formic acid dimer is an extreme case,30 where diagonal and off-diagonal terms also tend to cancel, but framework resonances contribute very significantly to the OH stretching intensity in a shift range of (500 ± 200) cm−1. For weaker hydrogen bonds, cancellation also appears to be active.100 Thus, the safety margin of 50% in S1 appears generally justified.
Dimer dissociation energies are among the most fundamental quantities which a computational method for non-covalent interactions must reproduce, but unfortunately they are rarely accessible to experiment.36,55,101 Dissociation changes six low frequency intermolecular modes into ZPVE-free degrees of freedom. Therefore it is particularly susceptible to anharmonic effects and there can be qualitative effects such as in the He dimer which is not bound in the harmonic approximation, being barely bound in experiment and in accurate anharmonic calculations. In the field of classical hydrogen bonding, the dimer of HF is a pronounced case with an anharmonic contribution to the dissociation energy of about 9%. Anharmonicity typically increases the dissociation energy, as those modes created by the interaction usually have negative anharmonicities. This may be amplified by intramolecular stretching modes involved in the interaction, whose anharmonicity typically increases by the interaction. However, for the formic acid dimer, there are indications that the net effect of anharmonicity on the dissociation energy may only be on the order of 1%30,102 despite diverging earlier assumptions.103 Finally, there are even rare cases where the anharmonic dissociation energy is larger than the electronic dissociation energy.104
The main purpose of this perspective is to provide an initial experimental benchmark set and not so much to test a large number of theoretical methods against it. The latter is left to interested theory groups and we provide cartesian coordinates for all investigated systems in the ESI.† However, as a start, we compare the performance of two established hybrid functionals for structural, energetic and vibrational properties of hydrogen bonds, PBE0 and B3LYP, both with Grimme's D3 dispersion correction,105 as well as the popular M06-2X hybrid functional with heavy empirical fitting. The D3 correction or some other way of including long range dispersion is essential for hydrogen-bonded systems once they reach a certain size and it would be easy to demonstrate poor performance by leaving it away.46,66
Once a theoretical value (xQ) at computational level Q for an observable rotational constant, wavenumber or energy based on electronic structure and harmonic curvature is available for a compound C, it is compared with the recommended best experimental back-corrected value (xexp.) for the same quantity. The difference is divided by the estimated experimental uncertainty (Δxexp.) based on the back-correction strategy Xn and on experimental errors. The ratio between the theoretical deviation and the experimental error is squared. This removes the sign of the deviation and at the same time weighs small deviations (which might still suffer from limitations of experimental back-correction) much less than large deviations:
![]() | (1) |
The resulting individual dimensionless δXn2(C,Q) values can be averaged over different compounds C and different back-correction classes for observables Xn (R1, R2, V1, S1, S2, E1, E2) for a given method Q (B = B3LYP-D3, P = PBE0-D3, M = M06-2X, all employing the def2-QZVP basis set, using the Gaussian09 program suite106), or over all compounds (providing their number #, optionally followed by a D for diatomics and a P for polyatomics). Such averages are further marked with a bar, . Furthermore, averages over all classes
or families of classes (e.g.
) may be formed. These overall rankings can only be reasonably balanced if the different classes have similar discriminatory power. Therefore, diatomics usually have to be treated separately, as they would otherwise dominate the benchmark due to their straightforward, much more rigorous modeling of anharmonicity. Alternatively, for a given set of m methods Q to be compared, one can normalize each individual δ2(Q) by the value averaged over all m investigated methods. The resulting Δ2(Q) values are close to 0 for outstandingly good and fall between 1 and m for the poorer methods. However, the assessment of the experimental and back-correction error bar and thus a key ingredient of the present approach is lost, whereas the best estimate for the back-correction is kept. Another possible approach, still widely used, is to compare apples to oranges by directly taking the anharmonic experimental value as a reference for the computed harmonic or even electronic (in the case of rotational constants) value. We discourage this practice for obvious reasons, but provide some results for the popular shortcut, denoted as ω2(Q) and Ω2(Q). Given the bold approximation involved, a careful assessment of experimental error bar is superfluous in this case and we thus focus on Ω2(Q) values normalized by all considered methods.
By partial averaging, one can identify methods which are particularly successful for, e.g., structural or vibrational benchmarks and also remove classes with strongly deviating discriminatory power. In this way, rather comprehensive and detailed judgements of the performance of theoretical methods for growing benchmark datasets can be obtained. For example denotes the relative performance of M06-2X for 30 rotational constants in molecular complexes. If it is less than 1, M06-2X performs better than the average of all investigated methods.
In terms of the size of the benchmark database, the strategy is to start with some similarities in the considered systems to identify experimental outliers. In the present case, we start with a database size of 82 entries. After validation and analysis, it is compactified to 51 entries with enhanced discriminatory power based on a total of only 5 molecules and their complexes. Because the quantum chemical effort typically scales sublinearly with the number of benchmarked observables, a small number of systems with a large number of observables offers a computational advantage. In particular, harmonic frequency shifts and dissociation energies draw from the same Hessian calculations. Future efforts may focus on such cost-effective extensions to other observables related to intermolecular interactions such as electric moments, polarizabilities107 or spectral intensities,108 but should always be open for the addition of new systems with different method-discriminating properties.
System | R1/R2 | V1 | S1 | S2 | E1 | E2 | Σ |
---|---|---|---|---|---|---|---|
NN | 1 | 1 | — | — | — | — | 2 |
OO | 1 | 1 | — | — | — | — | 2 |
HF | 1 | 1 | — | — | — | — | 2 |
dHF | 1 | 1 | — | — | — | — | 2 |
FA | 3 | 1 | — | — | — | — | 4 |
dFA | — | 1(0) | — | — | — | — | 1(0) |
AA | 3(0) | 1(0) | — | — | — | — | 4(0) |
dAA | — | 1(0) | — | — | — | — | 1(0) |
gTE | 3(0) | 1(0) | — | — | — | — | 4(0) |
tHP | 3 | 1 | — | — | — | — | 4 |
gHP | — | 1(0) | — | 1(0) | — | — | 2(0) |
HFNN | 1 | — | 1 | — | 1 | — | 3 |
FANN | — | — | 1 | — | — | — | 1 |
dFANN | — | — | 1(0) | — | — | — | 1(0) |
AANN | — | — | 1(0) | — | — | — | 1(0) |
dAANN | — | — | 1(0) | — | — | — | 1(0) |
tHPNN | 3 | — | 1 | — | — | — | 4 |
FAOO | — | — | 1 | — | — | — | 1 |
AAOO | — | — | 1(0) | — | — | — | 1(0) |
tHPOO | — | — | 1 | — | — | — | 1 |
HFHF | 1 | — | 2 | — | 1 | — | 4 |
dHFHF | 1(0) | — | 1(0) | — | 1(0) | — | 3(0) |
HFdHF | 1 | — | 1 | — | 1 | 1 | 4 |
FAFA | 6(3) | — | — | — | 1 | — | 7(4) |
AAAA | — | — | — | — | 1(0) | — | 1(0) |
gTEgTE | 3(0) | — | 2(0) | — | — | — | 5(0) |
tHPtHP | 3 | — | 2 | — | — | — | 5 |
gHPtHP | 3(0) | — | — | — | — | — | 3(0) |
tHPtHPNN | — | — | — | 2 | — | 1 | 3 |
gHPgHPgHP | 3 | — | — | 2 | — | — | 5 |
Σ | 41 (25) | 11 (6) | 17 (10) | 5 (4) | 6 (4) | 2 | 82 (51) |
The following discussion thus focuses on the 74 polyatomic benchmark entries from Table 1. Here, experimental and back-correction errors are much closer to the model deviations for the three investigated DFT functionals. The largest outliers are found in the dissociation energy of HFNN, the weakly bound complex between HF and NN. It was experimentally determined by the Miller group at 4.76(2) kJ mol−1,26 obviously including the anharmonic zero point energy of the complex. Based on the back-correction rule E1, we add 5% and subtract 10% of the value to obtain a best estimate for the harmonically corrected dissociation energy of the complex at 4.64(36) kJ mol−1, or 4.64(38) kJ mol−1 after adding the experimental error. The harmonically corrected B value for the dissociation energy of 5.9 kJ mol−1 is thus far too high, resulting in δE12(HFNN,B) = 10.2. This is actually the largest of all 74 × 3 δ2 entries in the present data base, all other values being single digit all the way down to less than 0.1. There is other experimental evidence that B overbinds hydrogen-bonded complexes of NN,55 even without correction for anharmonicity.
Table 2 lists values for different sub-categories Xn. They range from 0.1 (perfect match between back-corrected experiment and theory within the back-correction uncertainty) to about 3. Characteristically, the two extremes are often covered by the same methods. M describes rotational constants of polyatomic molecules best (0.1) and OH vibrational frequencies worst (3.2). B describes polyatomic OH vibrations best (0.1) and hydrogen bond dissociation energies of their complexes worst (3.1). This underscores the large extent of error cancellation in such methods, because a method which gives the right answers for the right reasons should perform systematically well for different observables with such a close relationship as it does exist for bond lengths and bond vibrations. It is noteworthy that on average
, all methods show the smallest deviations for rotational properties (0.8) and the largest errors for energetical quantities (2.2). This suggests that energetical quantities are particularly helpful for benchmarking, but unfortunately also the hardest to obtain by experiment. The best discrimination (
) between the three methods is found to behave similarly. For rotational properties, the average standard deviation is smallest (0.9), for energetical and vibrational properties it is larger (1.7 and 1.5, respectively). It will be interesting to see in the future whether accurate wave function based methods perform more systematically for the different properties and corresponding studies are invited.7 All relevant back correction data are available in the ESI.†
# | Xn | B | P | M |
![]() |
|
---|---|---|---|---|---|---|
12 (6) | R1P | 2.0 (2.2) | 0.4 (0.7) | 0.1 (0.2) | 0.9 (1.0) | 1.5 (1.6) |
25 (15) | R2 | 0.6 (0.7) | 0.9 (1.0) | 1.0 (0.8) | 0.8 (0.8) | 0.7 (0.5) |
37 (21) | R1P, R2 | 1.0 (1.2) | 0.7 (0.9) | 0.7 (0.6) | 0.8 (0.9) | 0.9 (0.8) |
7 (2) | V1P | 0.1 (0.2) | 2.1 (3.3) | 3.2 (4.7) | 1.8 (2.7) | 2.3 (3.4) |
17 (10) | S1 | 1.7 (2.0) | 2.8 (2.8) | 0.7 (0.7) | 1.7 (1.8) | 1.5 (1.6) |
5 (4) | S2 | 0.4 (0.3) | 0.5 (0.6) | 0.3 (0.4) | 0.4 (0.4) | 0.5 (0.5) |
29 (16) | V1P, S1, S2 | 1.1 (1.3) | 2.2 (2.3) | 1.2 (1.1) | 1.5 (1.6) | 1.5 (1.5) |
6 (4) | E1 | 3.1 (4.1) | 2.6 (3.3) | 1.6 (1.6) | 2.3 (3.0) | 0.6 (1.9) |
2 | E2 | 1.5 | 1.4 | 2.2 | 1.7 | 0.6 |
8 (6) | E1, E2 | 2.7 (3.2) | 2.3 (2.6) | 1.6 (1.8) | 2.2 (2.6) | 1.7 (1.5) |
74 (43) | All | 1.3 (1.5) | 1.5 (1.7) | 1.0 (1.0) | 1.3 (1.6) | 1.2 (1.5) |
After these general observations, a few specific findings deserve discussion, because they can help to condense and later also to extend the present benchmark. The poor performance of B for rotational constants for fluorinated alcohols has already been remarked. The corresponding δ2 values are an order of magnitude larger than for carboxylic acids and for the other DFT methods. M misses the harmonic OH stretching frequencies most prominently. These poor performances disappear when moving to molecular complexes, where B performs particularly well for rotational constants and M for vibrational shifts. This is related to the well-known general overestimation of hydrogen bond-induced downshifts of the OH stretching frequency for most density functionals. The M functional compensates partly for this overestimate, but at the price of missing the monomer values. Unless one adopts a very pragmatic attitude, this is not satisfactory. Returning to the initial discussion of HFNN, we back-correct the observed downshift22 of the HF stretching mode of 43 cm−1 to a loose harmonic estimate of 43 ± 22 cm−1, because of diagonal and off-diagonal anharmonic contributions (rule S1). Nevertheless, this back-correction window is missed badly by B (92) and P (102 cm−1), somewhat in line with the overestimated dissociation energy. Already two decades ago, it was shown that such deviations should be blamed more on DFT than on anharmonic corrections.111 The best harmonic CCSD(T) estimate for the downshift at that time was 64 cm−1, within our experimental back correction and significantly below the B and P predictions. At the time, the harmonically corrected dissociation energy was predicted at 3.9 kJ mol−1, somewhat below our back correction estimate of 4.6(4) kJ mol−1. Nowadays, one could carry these calculations to even higher level to see whether our back-correction is applicable to such a highly anharmonic system.
A more recent test of the back-correction reliability is possible for HFHF.57,112 For this system, a combination of harmonic CCSD(T) basis set limit estimate and vibrational perturbation theory has yielded a surprisingly accurate match for the two HF stretching downshifts, within a few cm−1. While there may still be some error compensation at play, it will not be very large. The underlying harmonic downshifts are less than 30% larger than the anharmonic ones, well within the S1 back-correction window of ±50% despite a substantial floppiness of this dimer. For the dissociation energy, an elaborate theoretical study61 suggests that the anharmonic zero point energy correction to the dissociation energy is on the order of +6(1)%, again well within the +10% error margin of the E1 criterion.
We note that the difference in performance of the tested DFT functionals for the open shell complexes of OO as compared to the closed shell system NN is not very significant, despite the simple UHF treatment in the former. This may deserve further investigation and extension of the open shell database to more sensitive quantities.113
These performance details are graphically summarized in Fig. 3. For rotational (top panel), vibrational (central panel) and energy quantities (bottom panel), the values from Table 2 are plotted for each of the methods and the results for subcategories are connected with those for the full category. The poor performance for monomer rotational constants for method B is evident, as is the poor performance of M for monomer vibrational fundamentals and the generally poorer performance for energies. In some sense, P yields the most balanced performance within a category – its polygons are best centered and of intermediate size.
![]() | ||
Fig. 3 ENCH-51 polyatomic ![]() |
Fig. 4 is also instructive. In the top panel, it compares the overall rotational, vibrational and energy performance of the different methods. M, although quite erratic in detail (Fig. 3), has the best overall performance, as its polygon is inscribed in those of the others. The lower panel compares the average BPM performance over all methods with the standard deviation among the methods. They match quite well, indicating a moderate discriminatory power of the investigated observables for the three functionals. Only for energies, the average deviation is larger than the standard deviation. This may hint at a systematic error, either in the experiment or in the functionals, but the E data base is currently too small for significant conclusions. A general conclusion from these figures is that future efforts to broaden the data base should probably focus on energies and vibrational properties.
![]() | ||
Fig. 4 ENCH-51 polyatomic ![]() ![]() ![]() |
![]() | ||
Fig. 5 Systems covered by the compactified benchmark database. Monomers are circled and complexes are represented by connecting lines. The dashed line marks a meaningful next extension. |
The current data set should thus be seen as the first step into a continuous refinement procedure. We denote it ENCH-51 for Experimental Non-Covalent Harmonic database with currently 51 entries. As long as future extensions make sure that the number of entries is never repeated, this allows for a unique identification. Once these proposed experimental extensions gain momentum, an online repository of the experimental benchmark values is planned in the context of a local research training group on benchmarking. It should be open to external suggestions for systems and experimental data corrections and it could be extended to performance compilations of quantum-chemical methods contributed by interested theoretical groups based on the cartesian coordinates of starting structures provided in the ESI.†
Most of the error bars used in this work to translate experimental quantities into easily accessible electronic structure data rely on a reasonably systematic cancellation between different vibrational contributions which can sometimes be as large as or even larger than the combined error bar. Therefore, outliers are unavoidable. The ENCH strategy is to compensate for such potential outliers by a large number of carefully chosen benchmark data.
We recommend ENCH for a simple test of the suitability of new or old quantum chemical models to describe noncovalent interactions.114 A side use could be to focus on a certain type of observable and to pragmatically choose the best suited method for this observable, irrespective of its performance for other observables. This is still more satisfactory than the popular scaling for or ignoring of anharmonic nuclear motion effects. However, a method which provides a balanced and reasonably good description of several observables is to be preferred over a method which performs very well for a specific observable only. The latter is most likely due to fortuitous error cancellation which should not be relied upon, although it can be useful as a spectroscopic assignment aid.
ENCH relies on experimentalists providing more reference data for its molecular entries14,40,107 and on computational chemists adding quantum chemical models to its performance tables.6–8,12,60,92 In particular, an investigation of the current 51 data entries at the highest consistently affordable wave function theory level would be instructive to test the degree of realism of the back-correction rules in a more rigorous way.
Footnote |
† Electronic supplementary information (ESI) available: Keywords for calculations, spreadsheets connecting the experimental data with benchmark parameters and cartesian coordinates of the benchmark structures. See DOI: 10.1039/c9cp03651b |
This journal is © the Owner Societies 2019 |