Thomas
Stadelmann
,
Chantal
Balmer
,
Sereina
Riniker
* and
Marc-Olivier
Ebert
*
Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 1-5, 8093 Zürich, Switzerland. E-mail: sriniker@ethz.ch; marc-olivier.ebert@org.chem.ethz.ch
First published on 14th September 2022
1H and 13C chemical shifts of 35 small, rigid molecules were measured under standardized conditions in chloroform-d and in tetrachloromethane. The solvent change mainly affects carbon shifts of polar functional groups. This difference due to specific interactions with CDCl3 cannot be adequately reproduced by DFT calculations in implicit solvent. The new dataset provides an accurate basis for the validation and calibration of shift calculations, especially with respect to improved solvent models.
δcalci = σcalcTMS − σcalci | (1) |
Instead of TMS, any chemical shift determined with respect to TMS can be used as reference, if the corresponding shielding calculation has been carried out at the same level of theory:
δcalci = δexpref + σcalcref − σcalci | (2) |
To compensate for shortcomings of the theoretical method in specific electronic environments, one can also employ multiple standards for different types of carbons (e.g. based on hybridization).3 In organic chemistry, the most popular way to convert shieldings into chemical shifts is to use a large set of reference data and perform a linear regression between calculated shieldings and experimental shifts:
![]() | (3) |
If the calculation reproduced all shieldings perfectly, a slope of −1 would be obtained with the intercept being equivalent to σcalcTMS. Chemical shifts are very sensitive to the chemical environment. They also depend on temperature, solute concentration,4,5 protonation state and the presence of impurities.6 Therefore, control over the conditions at which the reference shifts are recorded is important for a consistent dataset. A popular reference set is the one from Tantillo et al.,7 which consists of 1H and 13C shifts of 80 small and relatively rigid organic molecules. Scaling factors for numerous functional/basis set combinations can be found on the authors’ webpage.8 The chemical shifts, however, were collected from different sources and experimental conditions (i.e. temperature, concentration, and purity) are not indicated. In addition, the dataset contains multiple chlorinated compounds whose 13C chemical shifts are affected by relativistic effects that cannot be accounted for by standard DFT. Hehre et al.9 used a reference set of 2000 molecules to derive an elaborate empirical correction scheme for 13C chemical shifts calculated at the inexpensive ωB97X-D/6-31G* level of theory. Also here, no special attention was paid to solvent and concentration effects or standardized experimental conditions.
Given a high-quality reference set, there are different types of errors and approximations that impact the quality and reliability of the resulting shift prediction. One source of error is the chosen DFT method itself (functional and incomplete basis set). It is possible to go beyond DFT and perform e.g. a coupled-cluster calculation for improved accuracy, but such calculations are computationally extremely expensive and only feasible for very small systems. Numerous studies attempted to determine the best combination of functional and basis set for chemical shift calculations.7,10–13 Presumably, some combinations are only better than others due to fortunate error compensation. In terms of the applied level of theory, a good compromise between accuracy and computational cost are double-hybrid functionals, giving mean absolute relative errors as low as 1.9% for the calculated shieldings compared to coupled-cluster calculations.14 Another potential source of error when comparing calculated shieldings with experimental chemical shifts are specific intermolecular interactions with impurities like water or the solvent itself (e.g. via hydrogen bonds), which are not trivial to account for in DFT calculations.15–17 Also vibrational contributions to the chemical shift are usually neglected.
topshim 1h rga lockoff o1p = 0.2 selwid = 0.5 durmax = 120 |
If peaks could not be assigned unambiguously, 13C-HSQC, 13C-HMBC, DQF-COSY, NOESY and PSYCHE18 spectra were recorded as needed. Processing of the spectra was done with Bruker TopSpin™ version 4.1 (Bruker Biospin AG) and MestreNova 14.1 (Mestrelab Research). A line broadening of 0.1 Hz for 1H and 0.5 Hz for 13C spectra was used. The data points were extended by zero-filling to 64k data points for 1H spectra and to 256k data points for 13C spectra.
All spectra were referenced to the signal of internal TMS set to 0 ppm in both solvents. , the susceptibility corrected chemical shift of TMS in CCl4 referenced to external TMS in CDCl3 was calculated using eqn (4).19
![]() | (4) |
Assigned NMR spectra in CDCl3 and CCl4 for compounds 1–35 are provided in the NMReData21 format as well as in separate MestreNova files together with the DFT optimized coordinates in vacuum. The data can be downloaded free of charge from https://doi.org/10.3929/ethz-b-000570841.
Analysis of the data was done in a Jupyter Notebook39 using Python 3.9.12.40 The functionalities of the matplotlib 3.5.1,41 nglview 3.0.3,42 numpy 1.21.6,43 openbabel 3.1.1,44 and scipy 1.5.345 packages were used. DFT calculations were performed on the Euler cluster at ETH Zürich.
After averaging magnetically equivalent nuclei, 141 1H and 170 13C chemical shifts were obtained for the 35 molecules and unambiguously assigned in CDCl3 (Table 1) and CCl4 (Table 2). Exchangeable protons are listed but were not included in this study. First, we compared the chemical shifts in CDCl3 with previously published data in the literature. Then, we investigated the chemical shift differences between the two solvents, followed by a detailed comparison with DFT calculated values.
Compound | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 |
---|---|---|---|---|---|---|---|---|---|---|---|
a First value corresponds to cis-configured 1H. b First value corresponds to 1H on the same side as the nitrile group. c Exchangeable proton. d First value corresponds to proR 1H. e First value corresponds to axial 1H. | |||||||||||
1 | 4.33 | ||||||||||
62.5 | |||||||||||
2 | 1.59 | 4.42 | |||||||||
12.3 | 70.5 | ||||||||||
3 | 1.30 | 2.36 | |||||||||
10.5 | 10.9 | 120.7 | |||||||||
4 | 2.03 | 6.87 | 6.15 | 9.50 | |||||||
18.7 | 153.9 | 134.7 | 193.9 | ||||||||
5 | 1.33 | 2.70 | |||||||||
20.0 | 19.8 | 123.8 | |||||||||
6 | 4.83/4.92a | 5.85 | 1.01 | ||||||||
108.9 | 149.9 | 33.7 | 29.2 | ||||||||
7 | 2.88 | 3.81 | |||||||||
74.8 | 74.5 | 153.1 | 53.0 | ||||||||
8 | 1.34 | 1.08/1.01b | |||||||||
122.2 | −3.5 | 7.1 | |||||||||
9 | 2.50 | 2.27 | 4.35 | ||||||||
177.6 | 27.8 | 22.2 | 68.5 | ||||||||
10 | 3.87 | 4.25 | 2.50 | ||||||||
215.0 | 70.6 | 66.9 | 37.1 | ||||||||
11 | 1.98 | 3.82 | 4.23 | ||||||||
13.8 | 165.5 | 54.6 | 67.4 | ||||||||
12 | 7.03 | ||||||||||
164.1 | 136.5 | ||||||||||
13 | 3.78 | ||||||||||
67.1 | |||||||||||
14 | 2.50 | 3.98 | |||||||||
206.5 | 42.9 | 67.9 | |||||||||
15 | 8.24 | 7.56 | 7.70 | ||||||||
148.2 | 123.2 | 129.3 | 134.6 | ||||||||
16 | 4.61c | 6.83 | 7.25 | 6.94 | |||||||
155.4 | 115.3 | ||||||||||
17 | 3.81 | 6.91 | 7.30 | 6.95 | |||||||
55.1 | 159.6 | 113.9 | 129.5 | 120.7 | |||||||
18 | 10.03 | 7.89 | 7.54 | 7.64 | |||||||
192.4 | 136.4 | 129.8 | 129.0 | 134.5 | |||||||
19 | 7.67 | 7.48 | 7.61 | ||||||||
118.9 | 112.5 | 132.2 | 129.1 | 132.8 | |||||||
20 | 3.00 | 6.49 | 8.23 | ||||||||
39.0 | 154.2 | 106.6 | 149.9 | ||||||||
21 | 9.25 | 8.76 | 7.34 | ||||||||
159.1 | 156.9 | 121.6 | |||||||||
22 | 6.05 | 2.24 | |||||||||
180.2 | 113.8 | 165.4 | 19.8 | ||||||||
23 | 2.85 | 1.31/1.08d | 5.99 | 0.95/1.61e | |||||||
41.7 | 48.5 | 135.4 | 24.6 | ||||||||
24 | 2.60 | 1.53/1.82d | 1.80/1.44d | 2.66 | 1.55/1.73d | 2.05/1.84d | |||||
218.3 | 49.9 | 24.2 | 27.2 | 35.3 | 37.7 | 45.3 | |||||
25 | 1.03 | 1.03 | 2.14 | 1.53/1.79d | 1.71/1.79d | 1.39/1.56d | 1.14 | ||||
23.4 | 47.4 | 21.7 | 45.4 | 41.7 | 25.0 | 31.9 | 54.2 | 14.6 | 223.4 | ||
26 | 5.88 | 2.17 | 1.04 | 2.20 | 1.94 | ||||||
199.9 | 125.6 | 160.3 | 45.3 | 33.6 | 28.3 | 50.8 | 24.6 | ||||
27 | 10.10 | 8.00 | 7.85 | ||||||||
190.6 | 138.8 | 129.9 | 132.9 | 117.7 | 117.7 | ||||||
28 | 2.58 | 6.90 | 2.32 | 10.57 | |||||||
20.5 | 141.5 | 130.5 | 143.8 | 21.5 | 130.0 | 193.0 | |||||
29 | 0.91 | 2.35/1.85d | 2.09 | 0.84 | 0.96 | 1.95/1.34d | 1.68/1.41d | ||||
9.3 | 57.7 | 219.7 | 43.3 | 43.1 | 46.8 | 19.8 | 19.2 | 27.1 | 29.9 | ||
30 | 8.14c | 7.22 | 6.56 | 7.41 | 7.20 | 7.12 | 7.65 | ||||
127.9 | 135.8 | 124.1 | 102.7 | 111.0 | 122.0 | 119.8 | 120.7 | ||||
31 | 1.88 | 1.75 | |||||||||
28.3 | 37.8 | ||||||||||
32 | 3.22 | 5.49 | 5.49 | 1.68/2.18d | 2.73 | 2.88 | 5.98 | 5.94 | 2.78 | 1.30/1.48d | |
54.8 | 132.0 | 132.1 | 34.7 | 41.2 | 46.2 | 132.4 | 136.0 | 45.2 | 50.4 | ||
33 | 2.70 | 8.00 | 7.52 | 7.48 | 7.85 | 7.71 | 7.37 | 7.32 | |||
19.4 | 134.3 | 132.6 | 124.1 | 125.7 | 125.5 | 128.5 | 133.5 | 126.4 | 125.6 | 126.6 | |
34 | 7.66 | 7.30 | 7.49 | 7.53 | |||||||
193.9 | 134.2 | 144.5 | 124.4 | 129.1 | 134.7 | 120.3 | |||||
35 | 8.01 | 7.46 | 8.43 | ||||||||
131.7 | 128.2 | 125.3 | 126.2 |
Compound | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 |
---|---|---|---|---|---|---|---|---|---|---|---|
a First value corresponds to cis-configured 1H. b First value corresponds to 1H on the same side as the nitrile group. c Exchangeable proton. d First value corresponds to proR 1H. e First value corresponds to axial 1H. | |||||||||||
1 | 4.27 | ||||||||||
61.6 | |||||||||||
2 | 1.59 | 4.34 | |||||||||
12.0 | 69.5 | ||||||||||
3 | 1.32 | 2.29 | |||||||||
10.4 | 10.5 | 118.1 | |||||||||
4 | 2.03 | 6.72 | 6.07 | 9.43 | |||||||
18.1 | 149.9 | 134.9 | 190.3 | ||||||||
5 | 1.34 | 2.62 | |||||||||
19.9 | 19.4 | 121.3 | |||||||||
6 | 4.78/4.86a | 5.77 | 1.01 | ||||||||
109.0 | 149.0 | 33.4 | 29.1 | ||||||||
7 | 2.72 | 3.76 | |||||||||
73.4 | 74.6 | 151.6 | 51.9 | ||||||||
8 | 1.27 | 1.07/0.96b | |||||||||
120.1 | −3.1 | 6.9 | |||||||||
9 | 2.36 | 2.22 | 4.25 | ||||||||
173.7 | 27.0 | 22.0 | 66.7 | ||||||||
10 | 3.73 | 4.16 | 2.39 | ||||||||
211.2 | 69.7 | 66.0 | 36.4 | ||||||||
11 | 1.88 | 3.70 | 4.11 | ||||||||
13.2 | 163.4 | 54.4 | 66.5 | ||||||||
12 | 6.96 | ||||||||||
162.7 | 135.7 | ||||||||||
13 | 3.57 | ||||||||||
66.5 | |||||||||||
14 | 2.39 | 3.89 | |||||||||
202.1 | 42.5 | 67.3 | |||||||||
15 | 8.23 | 7.53 | 7.64 | ||||||||
148.3 | 123.2 | 128.7 | 133.4 | ||||||||
16 | 4.27c | 6.70 | 7.14 | 6.82 | |||||||
155.2 | 114.9 | ||||||||||
17 | 3.76 | 6.78 | 7.17 | 6.83 | |||||||
54.3 | 159.2 | 113.4 | 128.9 | 120.2 | |||||||
18 | 9.97 | 7.83 | 7.49 | 7.56 | |||||||
189.3 | 136.6 | 129.2 | 128.5 | 133.4 | |||||||
19 | 7.63 | 7.45 | 7.55 | ||||||||
117.0 | 113.5 | 131.7 | 128.6 | 131.7 | |||||||
20 | 2.99 | 6.37 | 8.08 | ||||||||
38.8 | 153.4 | 106.1 | 149.5 | ||||||||
21 | 9.11 | 8.65 | 7.23 | ||||||||
158.9 | 156.0 | 120.7 | |||||||||
22 | 5.85 | 2.20 | |||||||||
176.7 | 113.8 | 162.9 | 19.3 | ||||||||
23 | 2.83 | 1.31/1.06d | 5.92 | 0.95/1.59e | |||||||
41.4 | 48.3 | 135.0 | 24.4 | ||||||||
24 | 2.50 | 1.54/1.77d | 1.77/1.44d | 2.63 | 1.50/1.70d | 1.96/1.74d | |||||
212.4 | 48.8 | 23.8 | 27.2 | 35.0 | 37.3 | 44.5 | |||||
25 | 1.00 | 0.99 | 2.09 | 1.47/1.75d | 1.68/1.79d | 1.37/1.50d | 1.10 | ||||
23.3 | 46.7 | 21.6 | 45.1 | 41.4 | 24.9 | 31.4 | 53.4 | 14.6 | 218.5 | ||
26 | 5.75 | 2.10 | 1.03 | 2.07 | 1.91 | ||||||
195.5 | 125.8 | 155.7 | 45.1 | 33.2 | 28.3 | 50.3 | 24.1 | ||||
27 | 10.05 | 7.96 | 7.82 | ||||||||
188.0 | 138.5 | 129.3 | 132.3 | 118.0 | 116.2 | ||||||
28 | 2.56 | 6.81 | 2.30 | 10.50 | |||||||
20.7 | 141.1 | 130.6 | 142.6 | 21.7 | 130.5 | 190.5 | |||||
29 | 0.86 | 2.25/1.75d | 2.03 | 0.83 | 0.95 | 1.93/1.34d | 1.62/1.39d | ||||
9.2 | 56.8 | 214.3 | 42.6 | 42.9 | 46.3 | 19.7 | 19.1 | 27.1 | 29.5 | ||
30 | 7.91c | 7.08 | 6.44 | 7.25 | 7.06 | 6.99 | 7.50 | ||||
127.6 | 135.5 | 122.8 | 102.9 | 110.3 | 121.7 | 119.6 | 120.5 | ||||
31 | 1.88 | 1.75 | |||||||||
28.0 | 37.5 | ||||||||||
32 | 3.17 | 5.40 | 5.40 | 1.59/2.14d | 2.69 | 2.84 | 5.89 | 5.84 | 2.75 | 1.28/1.47d | |
54.5 | 131.5 | 131.7 | 34.7 | 41.0 | 45.9 | 131.9 | 135.7 | 44.9 | 50.1 | ||
33 | 2.68 | 7.90 | 7.42 | 7.39 | 7.74 | 7.60 | 7.27 | 7.22 | |||
19.2 | 133.3 | 132.4 | 123.6 | 125.2 | 125.0 | 128.2 | 133.4 | 126.2 | 125.0 | 126.1 | |
34 | 7.62 | 7.26 | 7.42 | 7.48 | |||||||
191.2 | 134.2 | 144.0 | 124.1 | 128.6 | 133.6 | 119.6 | |||||
35 | 7.92 | 7.38 | 8.34 | ||||||||
131.5 | 127.9 | 124.8 | 125.9 |
While our shift data in CDCl3 and the literature values do not differ much on average, the spread for some of the datapoints is still as high or higher as the accuracy one would like to achieve in a chemical shift calculation by DFT (≤1 ppm for 13C and ≤0.1 ppm for 1H). Thus, the observed differences are still too large for the validation of computational approaches. Especially for the study of solute–solvent interactions or other weak effects, the use of compiled literature values is not optimal since their influence on the chemical shifts is expected to be on the same order of magnitude as the spread in experimental values. The homogeneous sets of chemical shifts measured in this study will therefore serve as valuable reference to both validate in silico methods and to investigate the influence of specific solvation effects.
To a minor extent, the change in chemical shift of TMS itself should also influence the observed chemical shift differences. We have experimentally determined this contribution which corresponds to the susceptibility corrected chemical shift of TMS in CCl4 referenced to external TMS in CDCl3. In Fig. 2,
is indicated as horizontal line. Chemical shifts of positions that are only marginally affected by the solvent change should symmetrically scatter around this line. For the majority of the sp3-positions this is indeed the case.
To visualize in more detail which nuclei experience the largest solvent-induced change in chemical shift, we selected all 13C and 1H chemical shifts which differ between the two solvents by more than 1.0 and 0.1 ppm, respectively (see Chart 1). For 13C, the marked nuclei agree with positions where – according to the general concepts used in organic chemistry – one would expect the largest change in partial charge upon protonation or interaction with a H-bond donor. Again, the trend is less clear for 1H resonances.
Overall, inspection of the two datasets confirms that there is a clear effect of specific solvent–solute interactions on the chemical shifts – even for these relatively apolar solvents –, which is important to consider when comparing them with computed values.
DFT method | Exp. solvent | 1H | 13C | ||||
---|---|---|---|---|---|---|---|
All | sp2 | sp3 | All | sp2 | sp3 | ||
PBE0 | CCl4 | 0.11 | 0.10 | 0.09 | 1.37 (1.27) | 1.47 (1.27) | 1.05 (1.04) |
PBEP86 | CCl4 | 0.11 | 0.11 | 0.09 | 1.27 (1.05) | 1.27 (0.86) | 0.91 (0.88) |
PBE0 | CDCl3 | 0.12 | 0.11 | 0.10 | 1.50 (1.27) | 1.69 (1.32) | 1.12 (1.08) |
PBEP86 | CDCl3 | 0.12 | 0.11 | 0.10 | 1.78 (1.09) | 1.75 (1.06) | 0.97 (0.91) |
For the 1H shifts, only a very small change in RMSD (0.01 ppm) is observed when comparing to CDCl3 or CCl4 data. This might seem surprising as the experimental 1H chemical shifts in the two solvents differed by a RMSD of 0.08 ppm. However, some of the offset between the two datasets can be compensated by changing the intercept in the linear regression (eqn (3)) without a significant increase in RMSD.
A large part of the potential shortcomings of the vacuum calculations can be masked by this mechanism. For the 13C chemical shifts, on the other hand, a clear increase in the deviation from experiment (0.13–0.51 ppm) can be seen when going from CCl4 to CDCl3 data. The sp2 carbons are thereby more affected than the sp3 carbons (as expected from the results in Fig. 2). Again, the differences are not of the order of magnitude expected from the experimental comparison (RMSD of 1.40 ppm, see Table S2 in the ESI†). But here, the compensation by the regression procedure is not as efficient as for 1H. Part of the reason for this might be that the differences in experimental 13C shifts between the two solvents systematically increase towards lower field. This is not the case for the 1H shifts. The effect of the functional (PBE0 or PBEP86) is negligible for the 1H shifts as also noted recently by Oliveira et al.49 For 13C, PBEP86 performs better than PBE0 for the values recorded in CCl4 and worse for the values recorded in CDCl3. Mean absolute deviations (MAD) and maximum absolute deviations (max. AD) are given in Tables S3 and S4 (ESI†). Exchanging the basis sets (using PBE0/pcSseg-3 and PBEP86/cc-pVTZ) shows that PBE0 profits only marginally from the larger basis set. For PBEP86/cc-pVTZ, the 13C RMSD increases slightly by ca. 0.1 ppm (Table S31, ESI†).
To assess the baseline error of the DFT method, independent of the solvent, we can focus the analysis only on the 13C nuclei that showed no or a very small solvent effect in experiment (nuclei without mark in Chart 1, 135 13C shifts). Table 3 shows that for this reduced set the deviation between calculation and experiment becomes similar for the two solvents, indicating a baseline error for PBEP86 of ca. 1 ppm for 13C chemical shifts (and 1.3 ppm with PBE0).
We have also calculated the vacuum chemical shifts using eqn (1) (TMS as single reference, Table S6, ESI†). The RMSDs from the experimental values are consistently larger than for the shifts obtained using eqn (3). The largest increase (up to five times) is seen for 13C shifts. RMSD values are only comparable for sp3 protons, i.e. for protons with shifts similar to the reference. The same trend can be seen if single resonances in adamantane (31) or anthracene (35) are used as reference (Table S9, ESI†). But for the latter compounds, 13C RMSDs are significantly smaller than for TMS. This behaviour is not surprising as eqn (1) assumes a linear relation with a fixed slope of −1 between chemical shift and calculated shielding and TMS lies at the edge of the sampled chemical shift range. With increasing distance from the reference, small deviations from the expected slope of −1 lead to large deviations between calculation and experiment. In this regard the peripheral position of TMS is far from ideal. Further, the difference between the results obtained with adamantane and TMS as reference points to an issue in the treatment of silicon, similar to the relativistic effects of chlorine that cannot be properly accounted for in standard DFT.13
If only closely related electronic environments are investigated, the use of eqn (1) or (2) is justified. But e.g. for the validation of constitution or configuration of a newly isolated, structurally diverse natural product, the use of eqn (3) is clearly preferable.
It is instructive in this respect, to use eqn (3) with m fixed at −1. This enforces the theoretically expected slope of −1 but is independent of a specific single reference. For 1H, the RMSDs are about twice as large as in Table 3 but differences between the two datasets (CDCl3 and CCl4) and the different functional/basis set combinations are small. Combinations using the double hybrid functional perform slightly better. The 13C RMSDs for the reduced datasets vary between 1.11 and 3.80 ppm. Interestingly, changing the basis from cc-pVTZ to pcSseg-3 invariably leads to a doubling of the RMSD, irrespective of the functional. The best combination (PBEP86/cc-pVTZ) leads to values very close to the ones in Table 3 (1.11 ppm for CDCl3 and 1.17 ppm for CCl4). Inspection of the residual plots (see ESI†) shows that for calculations with pcSseg-3, the slope of σcalcvs. δexp deviates more distinctly from −1 than for cc-pVTZ. Since this deviation cannot be compensated if m is fixed, cc-pVTZ performs significantly better.
![]() | ||
Fig. 3 Difference between the difference in experimental shifts in CDCl3 and CCl4 (Δδexp) and the difference in calculated PBE0 shielding with implicit CHCl3 and CCl4 (Δδcalc = −Δσcalc) plotted against the experimental shifts in CDCl3. Note: since the experimental shift differences are mostly positive (see Fig. 2) a positive value for Δδexp − Δδcalc typically corresponds to an underestimation of the absolute shift difference by the calculation. The solid grey horizontal lines indicate ![]() |
After conversion to chemical shifts using eqn (3), the RMSDs from the experimental values in the two solvents were again investigated. While the agreement with experimental 1H shifts improves slightly compared to the vacuum calculations (0.01–0.03 ppm), the use of an implicit solvent model only leads to an improvement for the CDCl313C shifts calculated with PBEP86. Table 4 summarizes the performance of the two DFT methods with the corresponding implicit solvents. The MAD and max. AD values are given in Tables S12 and S13 (ESI†). As the limitations of the CPCM model (lack of local H-bonding capacity) are expected to be more severe for chloroform than for CCl4, it is surprising to see that the agreement with experimental data in CCl4 is negatively affected by CPCM. Possible reasons for these findings are deficiencies in the CPCM implicit solvent model and/or more favourable error compensation in the vacuum calculations. For comparison, CPCM shieldings were also calculated for structures optimized in vacuum. Interestingly, lower RMSD values were obtained for the 13C shifts, whereas no difference was observed for the 1H shifts (for evaluation metrics see ESI†). This suggests that – at least for the common procedure using eqn (3) – geometries optimized at the BP86/def2-tzvp level using CPCM provide no advantage compared to geometries optimized in vacuum.
DFT | Exp. | 1H | 13C | ||||
---|---|---|---|---|---|---|---|
Method | Solvent | All | sp2 | sp3 | All | sp2 | sp3 |
PBE0 | CCl4 | 0.09 | 0.08 | 0.08 | 2.07 | 1.84 | 1.09 |
PBEP86 | CCl4 | 0.10 | 0.11 | 0.07 | 1.53 | 1.21 | 0.99 |
PBE0 | CDCl3 | 0.10 | 0.10 | 0.08 | 1.87 | 1.85 | 1.10 |
PBEP86 | CDCl3 | 0.10 | 0.12 | 0.06 | 1.34 | 1.21 | 0.96 |
Our experimental data show that specific interactions with the solute are present even in such apolar solvents. Especially the 13C shifts of carbonyl and nitrile groups are differentially affected by the two solvents. A direct comparison of calculated shieldings with chemical shifts recorded in CDCl3 and CCl4 implies that DFT in implicit solvent is not able to reproduce experimental values equally well for all functional groups. The likely reason are specific interactions with the solvent (e.g. H-bonds) that cannot be adequately described by implicit solvation. By converting the shieldings to chemical shifts using the common multi-standard regression (eqn (3)), the errors resulting from neglecting directed solvent-interactions are redistributed over the whole shift range, also affecting functional groups not engaging in interactions with the solvent. In the worst case, selective deficiencies of the solvation model can lead to higher inaccuracies in chemical shift prediction for all atoms. This effect may have added to the ambiguous outcome found in this study regarding the usefulness of implicit CHCl3. For 1H, there was no significant advantage from using an implicit solvent model. For 13C, the double-hybrid functional PBE86 with implicit solvent provided the best agreement with chemical shifts recorded in CDCl3, while the implicit solvent was detrimental to the performance of PBE0. Importantly, our data also imply that the explicit treatment of solute–solvent interactions will be necessary for an even more accurate chemical shift prediction. Ignoring an incomplete representation of the solvent shell potentially obscures the direction to further improvement of DFT chemical shift prediction and hampers its fair assessment.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2cp03205h |
This journal is © the Owner Societies 2022 |