Shamil K.
Latypov
*,
Fedor M.
Polyancev
,
Dmitry G.
Yakhvarov
and
Oleg G.
Sinyashin
A.E. Arbuzov Institute of Organic and Physical Chemistry, Kazan Scientific Center, Russian Academy of Sciences, Kazan, Russian Federation. E-mail: lsk@iopc.ru
First published on 16th February 2015
The aim of this work is to convince practitioners of 31P NMR methods to regard simple GIAO quantum chemical calculations as a safe tool in structural analysis of organophosphorus compounds. A comparative analysis of calculated GIAO versus experimental 31P NMR chemical shifts (CSs) for a wide range of phosphorus containing model compounds was carried out. A variety of combinations (at the HF, DFT (B3LYP and PBE1PBE), and MP2 levels using 6-31G(d), 6-31+G(d), 6-31G(2d), 6-31G(d,p), 6-31+G(d,p), 6-311G(d), 6-311G(2d,2p), 6-311++G(d,p), 6-311++G(2d,2p), and 6-311++G(3df,3pd) basis sets) were tested. On the whole, it is shown that, in contrast to what is claimed in the literature, high level of theory is not needed to obtain rather accurate predictions of 31P CSs by the GIAO method. The PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d) level can be recommended for express estimation of 31P CSs. The PBE1PBE/6-31G(2d)//PBE1PBE/6-31G(d) combination can be recommended for routine applications. The PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d) level can be proposed to obtain better results at a reasonable cost. Scaling by linear regression parameters significantly improves results. The results obtained using these combinations were demonstrated in 31P CS calculations for a variety of medium (large) size organic compounds of practical interest. Care has to be taken for compounds that may be involved in exchange between different structural forms (self-associates, associates with solvent, tautomers, and conformers). For phosphorus located near the atoms of third period elements ((CH3)3PS and P(SCH3)3) the impact of relativistic effects may be notable.
Having strong evidence that the 1H/13C/15N GIAO calculations are reliable and very helpful in practice, it would be also desirable to extend the approach to other nuclei. From this point of view 31P CSs are very attractive because, on the one hand, phosphorus is contained in many practically important compounds of organic, bioorganic and inorganic chemistry. On the other hand, the 31P CS is extremely sensitive to the electronic structure and range within 1000 ppm, therefore even small changes in the structure are strongly reflected in its CS.36 Thus, if there was a reliable method to predict 31P CSs, it could be useful as an additional tool for structural elucidation of novel phosphorus containing compounds.
However, despite the great need for such a tool, it is still unclear if 31P CSs calculated in the framework of the GIAO method are reliable. In fact, there are relatively few reports on 31P NMR CS calculations using quantum chemical methods.37–55 In several systematic studies it was shown48–55 that a satisfactory agreement between calculations and experiments is observed when quite a “heavy” basis set or high level of theory‡ was used although deviations and exceptions were also found. It is necessary to stress that all these studies were focused only on the restricted types of small size compounds.
Summarizing the literature one can conclude that obviously there is some progress in quantum chemical 31P CS calculations. However some methodological questions are still left to be answered concerning the scopes and limitations of the method, in particular for practical applications. Moreover, if to take into account that authors used either high levels of theory or very “heavy” basis sets it can hardly be applied to medium and large size compounds of practical interest.
Nevertheless in spite of limitations, it seems that with some care the method can be applied at least to phosphorus in most types of environment, although several problems have to be cleared up. First, the additional shell in phosphorus may complicate the calculation of NMR parameters in frames of the GIAO method or may necessitate using “reach” basis sets and/or high level of theory, which may dramatically increase computational requirements, and as a result, it may become inapplicable to real size organic compounds. Therefore there should be some compromise between the cost and the quality, and it seems that for a variety of systems “cheaper” calculations may be sufficient.
Second, in most cases the shielding calculations are performed for individual molecules, i.e. actually vacuum conditions are modeled, when there are no collisions and interactions with other molecules. This raises the question of how the data obtained in the vacuum will apply to solutions.
Third, in practice one operates not by shielding but with CSs, which for phosphorus are referenced in respect of 75% H3PO4 in water. However, no data are available for this standard in the gas phase, which at first glance makes it difficult to use this compound as a reference in the calculations.
In this work we try to analyze the influence of the level of theory (CS//geometry) on the quality of CSs with the aim of finding the compromise of being “light” enough to be applied to large compounds and still giving precision of practical value. To achieve this goal we will be guided by the next “road map” that comprises three goal-driven questions: (1) is it possible to calculate 31P CSs with reasonable accuracy, and what is the minimum level of theory required? (2) Will the required level of theory (applied to small models) be applicable to “not small” organophosphorus compounds of practical interest? (3) What are complicated cases?
The problem of the optimal choice for “combination” is the key point in CS calculations. These calculations consist of two steps: geometry optimization followed by the magnetic shielding calculation. Therefore, the first task is to find the optimal method for geometry optimization, which is then used in the CS calculations. Second, the influence of level of theory on the resulting CSs should be analyzed. To this end, the calculations using the small model molecules (Fig. 1) that cover a wide range of structural types and particularities in organophosphorus chemistry were run with different “combinations”. Theoretically calculated data have to be compared with gas-phase absolute shielding values. However, on the one hand, there are only very few models for which gas-phase data are available. On the other hand, almost all 31P NMR measurements are carried out in solution. Therefore, the solution CSs referenced to H3PO4 were used in correlation analysis. The calculated absolute shielding values were converted into CSs by referencing to H3PO4 calculated under the same conditions. This combination of the gas-phase and the solution data poses some problems that will be discussed especially in the paper below. The test set of molecules excludes H3PO4 itself.
On the whole, calculated data for most of the model compounds at different levels of theory (for geometry optimization) correlate well with experimental values, e.g. in Fig. 2a the plot of calculated (PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d)) versus experimental 31P CSs is shown (results of the calculations for selected combinations are given in Table 1). The correlation coefficient (R2) is high enough (0.93) although in several cases there is a notable deviation from the line. It is worth stressing that there are two points that lie in very low/high field regions (−525.0 (P4, 27) and 494.0 (P2H2, 26) ppm) that inevitably increase the R2 value due to the large band of correlated data (Table S1, ESI†) and thus masks discrepancies in the main region. To analyze in more detail this significant part of the plot, R2 values with the exception of extremely low/high field regions were recalculated (second column Table S1, ESI†). Indeed R2 values become lower (0.874–0.927) in these cases. As one can see (Fig. 2a) there are several points that deviate remarkably from linear correlation: −298.2 ([PCl6]−, 22), −143.7 ([PF6]−, 21), −207.0 (FCP, 28), −13.9 (H3PO, 1), 191.2 (CH3PCl2, 23), 275 (PN, 30), 217 (PCl3, 24), 124.5 (P(SCH3)3, 34) and 59.1 ((CH3)3PS, 32) ppm. The variation of parameters at geometry optimization and magnetic shielding calculations do not improve correlations (Tables S1 and S2, ESI†) and these points always deviate remarkably from the rest of the cases. Probably in these points either additional effects interfere or the GIAO method fails to work well. Therefore these “difficult points” were excluded from the correlation analysis so far, and we consider the possible reasons for that disagreement in more detail later. Thus, except these latter compounds, the correlation improves notably for the rest of the 23 models (Fig. 2b). These “normal” models were used for further analysis in order to reveal key factors that influence the quality of calculated 31P CSs.
No. | Compound | 6-31G(d)//6-31G(d)b | 6-31G(2d)//6-31G(d)b | 6-311G(2d,2p)//HF/6-31G(d)a,b | 6-311G(2d,2p)//6-31+G(d)b | Exp. | Solvent | Ref. | ||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Unscaled | Scaled | Unscaled | Scaled | Unscaled | Scaled | Unscaled | Scaled | |||||
a DFT PBE1PBE was used, except for the third column, where HF/6-31G(d) was used for geometry optimization. b First row – the basis set used for CS calculation, second row – the basis set used for geometry optimization. c In ppm, referenced to H3PO4. d R 2 and RMSE are the correlation coefficient and the root-mean-square error, respectively; calculated without a low/high field region and difficult cases (in italic). e Linear regression parameters (data for all combination are given in the ESI). f Not available. | ||||||||||||
1 | H 3 PO | −49.4 | −50.5 | −84.5 | −56.4 | −62.7 | −51.2 | −71.2 | −53.0 | −13.9 | H 2 O | 56 |
2 | H3PO2 | −0.2 | 3.1 | −25.8 | 4.1 | −1.4 | 5.9 | −9.1 | 4.9 | 12.5 | H2O | 57 |
3 | H3PO3 | 6.0 | 9.8 | −8.5 | 21.9 | 3.5 | 10.6 | 2.5 | 15.7 | 3.0 | HCl | 58 |
4 | H 3 PO 4 | 0.0 | 3.3 | 0.0 | 30.7 | 0.0 | 7.3 | 0.0 | 13.4 | 0.0 | H 2 O | 59 |
5 | CH3P(O)(OH)2 | 24.1 | 29.5 | 17.2 | 48.4 | 28.9 | 34.2 | 29.0 | 40.4 | 24.8 | H2O | 60 |
6 | (CH3)2P(O)OH | 34.0 | 40.3 | 21.7 | 53.0 | 42.1 | 46.5 | 39.4 | 50.1 | 49.4 | CH3OH | 61 |
7 | (CH3)3PO | 4.9 | 8.6 | −13.3 | 17.0 | 14.9 | 21.2 | 6.7 | 19.6 | 36.2 | C6H6 | 62 |
8 | (CH3)2P(O)H | −8.4 | −5.9 | −31.0 | −1.3 | −4.5 | 3.0 | −11.7 | 2.5 | 20.5 | CH3OH | 63 |
9 | (CH3CH2)2P(O)H | 17.8 | 22.7 | −5.4 | 25.1 | 18.6 | 24.7 | 13.2 | 25.7 | 41.0 | CHCl3 | 64 |
10 | H2P(O)OCH3 | 2.9 | 6.4 | −22.5 | 7.5 | −4.6 | 3.0 | −7.5 | 6.4 | 19.2 | C6H6 | 65 |
11 | HP(OCH3)2 | 166.8 | 185.0 | 146.7 | 181.8 | 194.8 | 189.0 | 172.8 | 174.4 | 171.5 | naf | 66 |
12 | PH3 | −225.1 | −241.9 | −264.7 | −241.9 | −269.0 | −243.7 | −284.7 | −251.9 | −266.1 | Gas phase | 67 |
13 | (H2P)2PH | −137.5 | −146.5 | −179.8 | −154.5 | −205.2 | −184.2 | −178.1 | −152.6 | −162.6 | (CH3)2CO | 68 |
14 | (H2P)2 | −191.2 | −205.0 | −233.6 | −209.9 | −229.1 | −206.5 | −239.1 | −209.5 | −203.6 | (CH3)2CO | 68 |
15 | H2PCH3 | −155.9 | −166.6 | −191.7 | −166.8 | −182.3 | −162.8 | −193.1 | −166.6 | −163.0 | na | 69 |
16 | PH2(C6H5) | −121.1 | −128.6 | −158.0 | −132.0 | −145.8 | −128.7 | −156.8 | −132.7 | −122.0 | na | 36 |
17 | H2PCF3 | −114.5 | −121.5 | −149.0 | −122.8 | −143.8 | −126.8 | −155.9 | −131.9 | −129.0 | na | 69 |
18 | HP(CH3)2 | −104.6 | −110.7 | −133.1 | −106.4 | −112.3 | −97.5 | −123.0 | −101.2 | −99.0 | na | 69 |
19 | CH3P(N(CH3)2)2 | 67.9 | 77.2 | 51.1 | 83.3 | 79.6 | 81.5 | 70.9 | 79.5 | 86.4 | CHCl3 | 70 |
20 | CH2CHPF2 | 214.3 | 236.7 | 195.0 | 231.5 | 231.1 | 222.9 | 230.8 | 228.5 | 219.5 | na | 71 |
21 | [PF 6 ] − | −107.5 | −113.8 | −116.8 | −89.6 | −133.6 | −117.4 | −133.1 | −110.7 | −143.7 | C 6 H 6 | 72 |
22 | [PCl 6 ] − | −83.1 | −87.3 | −140.4 | −113.9 | −141.6 | −124.8 | −143.0 | −119.9 | −298.2 | CH 2 Cl 2 | 73 |
23 | CH 3 PCl 2 | 224.1 | 247.4 | 176.6 | 212.5 | 216.7 | 209.4 | 211.9 | 210.9 | 191.2 | CHCl 3 | 74 |
24 | PCl 3 | 292.7 | 322.1 | 229.3 | 266.8 | 258.5 | 248.4 | 261.1 | 256.7 | 217.0 | Gas phase | 48 |
25 | Si(PH2)4 | −192.4 | −206.3 | −230.8 | −207.0 | −218.6 | −196.7 | −233.3 | −204.0 | −205.0 | C6H6 | 48 |
26 | H 2 P 2 | 601.7 | 658.7 | 528.5 | 575.0 | 594.3 | 561.7 | 596.7 | 569.5 | 494.0 | na | 75 |
27 | P 4 | −502.4 | −544.0 | −569.7 | −556.0 | −583.0 | −536.6 | −582.4 | −529.4 | −525.0 | Gas phase | 49 |
28 | F–CP | −319.6 | −344.9 | −360.1 | −340.2 | −350.0 | −319.2 | −357.9 | −320.2 | −207.0 | na (193 K) | 76 |
29 | H–CP | −24.7 | −23.6 | −75.4 | −47.0 | −18.9 | −10.4 | −26.7 | −11.5 | −32.0 | na | 77 |
30 | PN | 336.4 | 369.7 | 286.3 | 325.5 | 323.1 | 308.6 | 351.7 | 341.2 | 275.0 | Gas phase | 38 |
31 | 3,4-Dimethyl-phosphorine | 187.1 | 207.1 | 152.5 | 187.7 | 206.8 | 200.2 | 197.4 | 197.4 | 187.9 | CHCl3 | 78 |
32 | (CH 3 ) 3 PS | 14.9 | 19.5 | −8.7 | 21.7 | 25.1 | 30.7 | 10.3 | 23.0 | 59.1 | CHCl 3 | 50 |
33 | (CH3O)3PS | 82.0 | 92.6 | 58.2 | 90.6 | 74.8 | 77.0 | 70.9 | 79.5 | 73.0 | na | 50 |
34 | P(SCH 3 ) 3 | 182.0 | 201.5 | 132.8 | 167.4 | 161.6 | 158.0 | 154.9 | 169.9 | 124.5 | CCl 4 | 79 |
R 2 | 0.987 | 0.987 | 0.989 | 0.989 | 0.991 | 0.991 | 0.993 | 0.993 | ||||
RMSEd (ppm) | 14.6 | 13.4 | 12.3 | 10.9 | ||||||||
Slopee | 0.918 | 1.000 | 0.971 | 1.000 | 1.072 | 1.000 | 1.073 | 1.000 | ||||
Intercepte | −3.0 | 0.0 | −29.8 | 0.0 | −7.8 | 0.0 | −14.4 | 0.0 |
The close examination of theoretical versus experimental CS correlations for this set of model compounds lets us come to some conclusions on main factors that influence the quality of calculations. Regardless of the basis set used for CS calculations (PBE1PBE/6-31G(d) and PBE1PBE/6-311++G(2d,2p)) similar dependencies of R2 values from the geometry optimization method were observed (e.g. the results of the PBE1PBE/6-31G(d) method for CS calculations is presented in Fig. 3. Data with other basis sets used to calculate CSs are given in Fig. S1, S2 and Tables S1 and S2, ESI†). First, only the inclusion of an additional diffuse function slightly improves correlations and the use of better basis sets has almost no effect. Second, surprisingly, the HF level, in general, produces high enough correlation coefficients comparable with DFT and MP2 results (Fig. 3). Third, the MP2 level improves correlation slightly but it essentially increases the time needed for calculations. Fourth, in general both popular functionals, B3LYP and PBE1PBE, give similar results although the latter seems to be slightly more preferable.
Fig. 4 R 2 values (the geometry optimized at the PBE1PBE/6-31G(d) level) against the indicated functional/basis set approach used in CS calculations of 23 representative molecules. |
As one can see, in this case the R2 depends more strongly on the level of theory, on the quality of the basis sets and even on the functional. All three sets (for all basis sets used for geometry optimization) demonstrate similar dependencies. Analysis of these data allows us to reveal key factors that influence the correlation coefficients. First, the inclusion of additional diffuse functions makes the correlation worse. Second, the inclusion of additional polarization functions on heavy atoms (2d) or protons (p) augments the correlation. Third, MP2 calculations in some cases (6-31G(d,p) and 6-311G(d)) improve correlations (the MP2 calculations were possible to run only up to the 6-311G(d) basis set. Heavier basis sets become impractical even for small compounds, vide infra). Fourth, the use of triple split valence basis sets in conjunction with additional polarization functions (6-311G(2d,2p) or 6-311++G(3df,3pd)) improve the correlation coefficient, thus R2 becomes similar to the value obtained from MP2 calculations using moderately heavy basis sets (6-31G(d,p) and 6-311G(d)).
From these data it looks like the PBE1PBE functional is superior than B3LYP for CS calculations. But perhaps this might be due to the fact that B3LYP CS calculations were carried out on geometry optimized by the PBE1PBE functional and not by B3LYP, i.e. due to not using a “parent” functional (different for optimization and CS calculations). To prove or disprove this hypothesis we carried out a correlation analysis for the data obtained using the B3LYP functional for both the CS calculations and geometry optimization as well (using the 6-31G(d) basis set). In this case the R2 (B3LYP/6-31G(d)//B3LYP/6-31G(d)) = 0.982 that is clearly lower than the correlation for PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d) (R2 = 0.987) and even for the B3LYP/6-31G(d)//PBE1PBE/6-31G(d) combinations (R2 = 0.985). Thus, the PBE1PBE functional indeed is preferable to use particularly at the CS calculation step.
It is also important that DFT (PBE1PBE) calculations, particularly using heavy basis sets, result in good R2 values which are close to MP2 results. The HF level gives a correlation similar to DFT results when small basis sets are used. However, using triple split valence basis sets HF calculations of CS are notably worse.
Thus, CS calculations at the PBE1PBE/6-31G(d) level are good enough. Inclusion of additional polarization functions should improve the correlation. A notable improvement is observed for heavy basis sets, like 6-311G(2d,2p) or 6-311++G(3df,3pd).
At the same time it is necessary to stress that for most of the “combinations” slopes of linear approximation lines are not equal to 1, and these lines do not cross the co-ordinate origin (e.g. for selected combinations, see Table 1), i.e. calculations suffer from disadvantage due to systematic errors. This sort of disagreement is easily eliminated via linear scaling. But before the phase-state problem and the question of correct reference have to be considered which may also be concerned with the above limitations.
Fig. 5 Relative computational costs against the indicated basis set approach (at DFT PBE1PBE level of theory) used at geometry optimization (a) and CSs calculations (b) for 35. |
The CS calculation step being less time consuming, heavier basis sets can be easily applied. The PBE1PBE/6-31G(2d) level for CS calculations is a good compromise between the cost and the quality. If one needs more accurate data, the 6-311G(2d,2p) basis set can be recommended since such calculations are still affordable. Further augmentation of the basis set only leads to an unreasonable increase of computational costs (Fig. 5b).
To sum it up the PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d) level can be recommended for express estimation of 31P CS. The PBE1PBE/6-31G(2d)//PBE1PBE/6-31G(d) combination can be recommended for routine applications. The PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d) level should be used to obtain better results at a reasonable cost (Table 1). If one deals with a very large molecule, PBE1PBE/6-311G(2d,2p)//HF/6-31G(d) can be recommended as a compromise.
In order to answer this question, let us consider in detail the possible contributions to shielding in media. In general, the shielding of a particular atom in the environment can be written in the following form:
σ = σvacuum + σdia + σpolar + σrovibrat + σspecificsolventeffect | (1) |
And CS is determined according to eqn (2):
δ(X) = σ(stand) − σ(X) + δ(stand) | (2) |
The second term in eqn (1) arises because the magnetic field in solution is different from the one in vacuum. However, this term for the most common solvents is small (0.3–0.8 ppm). Moreover, it is nearly the same both for the molecule X and for the standard and, therefore, its contribution to the CS can be neglected.
The third term in eqn (1) accounts for the influence of the solvent’s polarity on the electron density distribution. From this point of view it can be expected that more polar molecules can be more sensitive to these effects than the less polar ones. In order to estimate the magnitude of the possible contribution due to the polarity of the solvent, one can try to calculate this contribution theoretically and measure experimentally.49
According to calculations in frames of the polarizable continuum model (PCM)80 for several model compounds (H3PO4 (4, 0.02 D), P(OEt)3 (36, 1.50 D), (CH3)3PO (7, 4.32 D)), the magnetic shielding difference in vacuum and in benzene (ε = 2.3) should be negligible (less than 0.7 ppm) for non-polar molecules (Table S8, ESI†). Only for a relatively polar molecule ((CH3)3PO (4.32 D)), this contribution may be notable (up to 7 ppm). According to the calculations, the increase of solvent’s polarity (e.g. ε (CHCl3) = 4.7) should not change the shielding significantly (Table S8, ESI†).
On the other hand, at a fixed magnetic field, the changes in CSs of compounds P(OEt)3 (36), HP(O)(OEt)2 (37) and ClP(O)(OEt)2 (38) do not exceed 0.8 ppm upon the transition from non-polar (C6H6, ε = 2.3) to moderately polar (CHCl3, ε = 4.7) solvents (Table S8, ESI†). Thus, the contribution of the polarity of the medium to phosphorus shielding is also insignificant and should not exceed 7 ppm. Therefore, the dependence of CS on the polarity should be even less and may be neglected in quantum-chemical calculations.
The next is the rovibrational term in eqn (1), which is not taken into account in quantum chemical calculations. According to estimations, this term is also small (less than 10 ppm)81 and if to take into account that there is a similar contribution to the shielding of the standard, the impact of this term in CS should also be small.
The most ambiguous is the situation with the last term in eqn (1) which reflects the contributions from specific intermolecular interactions. The experimentally measured CSs correspond to the thermodynamic ensemble of molecules interacting with each other and/or with the solvent molecules. In most cases, the interactions are weak dispersion ones of the title and solvent molecules, and the gas phase approximation is correct. A different situation may occur in the presence of specific intermolecular interactions. In these cases, the changes in geometry (due to association, self-association, tautomeric or conformational transformations) can be significant and may influence CSs notably, so that the calculated and experiment data can dramatically disagree. In such cases, in order to verify the absence of significant effects of intermolecular interactions and self-association on CSs, it is necessary to examine the concentration dependence of the CSs or solvent dependence (polar/nonpolar). In each such case, it should be considered individually.
Thus, if there are no specific interactions, the calculated CSs for vacuum should well reproduce experimental values.
Alternatively, the secondary reference like PH3 is often used because its value in the gas phase is documented. But, in fact, its 31P CS strongly depends on the phase state as well (−266.1 and −238 ppm in the gas and the neat liquid, respectively)48 implicating notable association effects in solution. In other words, PH3 is also not a good reference. Thus, in fact, both references widely used in calculations, are not perfect.
However, to our opinion, in fact there is no need to refer exclusively to the gas phase data. After all, if one tries to be fully correct and refer to the standard in the gas phase, the results of calculations for the compound of interest should also be compared only with its gas phase data, which is impossible in most of the cases.
On the other hand, for most “normal” systems referencing relative to the generally accepted standard H3PO4 gives CSs, which can be compared with experimental data obtained in solution. That is, the calculated data correlate well the experiment carried out in solution with some under- or over-estimation (depending on the “combination” used). As a last resort, the empirical correction can be applied to overcome this problem.
δscaled = (δunscaled − intercept)/slope | (3) |
It is interesting to note that the scaled CSs for the frequently used secondary reference, PH3, agree better with its experimental value in liquid than in the gas phase implying that referencing to its gas phase would produce worse CSs for the whole set of model compounds.
In general, the reasons for such discrepancies may be divided into two types: the drawback of theoretical models and the interference of additional medium effects in a real experiment that is not accounted for. For example, it was supposed that the disagreement between experimental and calculated 31P CSs for PCl3 and PN might be due to the inadequate description of their structures at the DFT level. Therefore if to take into account that the paramagnetic contribution to the magnetic shielding in these molecules is large, CS will strongly depend on the bond lengths.47
On the other hand, relativistic effects arising from magnetic shielding of the nucleus located in the vicinity of the atoms of third period elements may also be strong.5,51,54 The relativistic spin–orbital interaction is not accounted for in our calculations and therefore the problems for (CH3)3PS and P(SCH3)3 might be well due to the contribution from relativistic effects of vicinal sulfur atoms. The same reason may account for the discrepancy observed for PCl3.
Some of the above “exceptions” may be due to the “incorrect” description of the molecular system in calculations. Namely, if molecules in solution are prone to association or self-association, the isolated molecular model will not be correct.53,55 Intra- or intermolecular coordination involving phosphorus results in a dramatic 31P nuclear shielding amounting to approximately 150 ppm upon changing the phosphorus coordination number by one.55 Thus, the impact of associated forms in solution may dramatically change the observed 31P CSs.
For example, the accurate prediction of 31P NMR CSs of ion pair systems requires consideration of the full system.49 Therefore, the deviations observed for the small ions, [PCl6]− and [PF6]−, presumably, are due to considerable solvent or counterion effects.48
In a similar way, intermolecular coordination may be responsible for the discrepancies observed for PN, P2H2 and FCP. In solution these molecules may exist not only in a monomolecular form but also in the associated form, as well. The calculations performed for the (PN)3 complex in fact demonstrate that, on the one hand, the trimer is much more stable than the monomer. On the other hand, while theoretical CSs (PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d)) of the monomer deviates to the lower field (336.4 ppm) from the experimentally observed one (275.0 ppm), the value of the trimer shifts in the opposite direction (262.1 ppm). Thus, the experimental value lies somewhere in between these calculated values for two structural forms, i.e. calculated and experimental CSs agree if one supposes that there is a fast exchange between these forms. Similar considerations can also be applied to P2H2 and FCP molecules as well.
Another reason for the discrepancy to occur may be due to the system that is involved in intramolecular exchange processes. In this case consideration of only one form may not be enough. For example, the calculations performed for phosphine oxide (H3PO, 1a) underestimate the 31P CS by ca. 36 ppm (Fig. 6). However, if to suppose that this form is in exchange with the acid tautomer58,87–95 the contradiction will be resolved.
Fig. 6 Equilibrium of phosphine oxide (1a) and phosphinous acid (1b) and schematical representation of 31P spectra of 1. |
According to the calculations, these two forms are close in energy in vacuum (Table S9, ESI†). In H2O solution, e.g. in the frame of PCM, the preference is expected to invert although the energy gap is still small (Table S9, ESI†), i.e. the populations of these two forms should be close or comparable. Further on, if to take into account that the calculation for phosphinous acid (1b) predicts the 31P CS at a notably lower field (ca. 34 ppm), the experimental value corresponds to the intermediate between these tautomeric forms which may be in fast exchange in the NMR time scale (Fig. 6).
This hypothesis is also strongly supported by the finding that a similar pentafluorophenyl derivative in solution exists in equilibrium with the phosphinous acid (C6F5(C2F5)POH, 39a) and phosphane oxide (C6F5(C2F5)P(O)H, 39b) tautomers.96 Due to higher barrier of exchange in this case both tautomers are observed separately in 31P NMR spectra (80.6 and −1.9 ppm, respectively). To this end, the calculations using PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d) for the pentafluorophenyl derivative predicts that 31P CSs are in good agreement with experiment (71.4 and −10.8 ppm for acid and oxide tautomers, respectively).
In the case of other acids (e.g. H3PO2 (2), H3PO3 (3)) no deviations from experiment were observed suggesting a strong preference to one tautomeric form (Table S9, ESI†). Energy analysis for these acids also supports this conclusion, viz. there is an essential preference to one dominant form and solvent has only an insignificant effect on the energy gap. Thus, in solution these acids are in one form and calculations well describe their NMR CSs.
The second question – will the required level of theory (established for small models) work well “for larger organophosphorus compounds of practical interest”? And will it be within the limits of computational resources? To answer these questions typical representatives of several classes of phosphorus compounds with a molecular weight of about 200–300 Da (Fig. 7) were analyzed. Calculations were run on PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d), PBE1PBE/6-31G(2d)//PBE1PBE/6-31G(d) and PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d) levels of theory. A linear scaling procedure was applied to GIAO calculated CSs (Table S10, ESI†).
Fig. 7 Structures of “large compounds” 35, 40–48 (a) and the correlation of experimental versus calculated at PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d) 31P NMR CS for 35, 40–48 (b). |
On the whole, calculated values agree well with experimental data for all combinations tested (e.g. for the PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d) combination, see Fig. 7 and Table 2; for the rest, see Table S10, ESI†). In all cases correlations are close to linearity (R2 = 0.989–0.996) and RMSE are small (9.0–11.6 ppm).
No. | δ unscaled | δ scaled | δ exp | Solvent | Ref. |
---|---|---|---|---|---|
a At the PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d) level. b In ppm. c Not available. | |||||
35 | 32.1 | 43.3 | 54.8 | C6H6 | 97 |
309.6 | 302.0 | 322.9 | |||
40 | −172.4 | −147.3 | −157.7 | nac | 98 |
63.9 | 72.9 | 76.7 | |||
41 | −129.3 | −107.2 | −110.0 | na | 99 |
42 | −68.6 | −50.5 | −54.5 | na | 100 |
43 | −41.9 | −25.7 | −29.7 | CHCl3 | 101 |
84.1 | 91.7 | 100.5 | |||
−7.1 | 6.7 | −10.6 | |||
58.7 | 68.1 | 75.7 | |||
44 | 22.5 | 34.3 | 24 | C6H6 | 102 |
45 | 74.3 | 82.6 | 84.1 | CHCl3 | 103 |
−36.5 | −20.6 | −22.6 | |||
46 | 15.0 | 27.4 | 27.6 | CHCl3 | 104 |
5.8 | 18.8 | 18.3 | |||
47 | −23.1 | −8.1 | −10.2 | CHCl3 | 105 |
48 | 36.8 | 47.7 | 38.7 | CHCl3 | 106 |
R 2 | 0.996 | 0.996 | |||
RMSE (ppm) | 9.0 |
Fig. 8 Possible structural isomers of 45 (a) and Δδ = δcalc − δexp for isomers of 45 (b). Calculations at PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d) level of theory. |
According to the calculations, the 31P CSs essentially depend on the isomeric structure varying within 100 ppm (Table S11, ESI†). However, only in the case of the A isomer, the differences between theoretical and experimental 31P CSs are small (less than 3 ppm) while in other cases deviations are high (Fig. 8b). Thus the analysis of 31P CSs allows a simple and unequivocal assignment of the isomeric form of 45 to structure A that is in full agreement with X-ray data.103
To sum it up, the PBE1PBE/6-31G(d)//PBE1PBE/6-31G(d) level can be recommended for express estimation of 31P CS of organophosphorus compounds. The PBE1PBE/6-31G(2d)//PBE1PBE/6-31G(d) combination can be recommended for routine applications. The PBE1PBE/6-311G(2d,2p)//PBE1PBE/6-31+G(d) level can be proposed to obtain better results at a reasonable cost. In the case of very large molecules PBE1PBE/6-311G(2d,2p)//HF/6-31G(d) can be recommended as a compromise. Scaling by linear regression parameters significantly improves results.
Care has to be taken for compounds that may be involved in exchange between different structural forms (self-associates, associates with solvent, tautomers, and conformers) and therefore, experimental 31P CSs may correspond to the exchange averaged values. In such suspicious cases the disagreement between calculated and experimental data most likely “says” that the problem formulation is incorrect.
Problems for phosphorus located near the atoms of third period elements ((CH3)3PS and P(SCH3)3) may be due to the impact of relativistic effects that was not accounted for in our calculations.
31P CSs can be safely used to establish fine structural peculiarities such as isomeric and tautomeric structures.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c5cp00240k |
‡ Here and further in the text “higher level of theory” means either higher level (HF–DFT–MP2) and/or more reach basis set. “Heavy combination” means level of theory used either for optimization of geometry or for chemical shift calculation steps. |
§ Parameters of the linear scaling equation for different calculation levels are given in the ESI.† |
This journal is © the Owner Societies 2015 |