Benchmark fragment-based 1 H, 13 C, 15 N and 17 O chemical shift predictions in molecular crystals

Joshua D. Hartman; Ryan A. Kudla; Graeme M. Day; Leonard J. Mueller; Gregory J. O. Beran

doi:10.1039/C6CP01831A

View PDF VersionPrevious ArticleNext Article

Open Access Article

This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

DOI: 10.1039/C6CP01831A (Paper) Phys. Chem. Chem. Phys., 2016, 18, 21686-21709

Benchmark fragment-based ¹H, ¹³C, ¹⁵N and ¹⁷O chemical shift predictions in molecular crystals†

Joshua D. Hartman ^a, Ryan A. Kudla ^a, Graeme M. Day ^b, Leonard J. Mueller ^a and Gregory J. O. Beran *^a
^aDepartment of Chemistry, University of California, Riverside, California 92521, USA. E-mail: gregory.beran@ucr.edu; Tel: +1 951-827-7869
^bSchool of Chemistry, University of Southampton, Highfield, Southampton, UK

Received 18th March 2016 , Accepted 11th July 2016

First published on 19th July 2016

Abstract

The performance of fragment-based ab initio¹H, ¹³C, ¹⁵N and ¹⁷O chemical shift predictions is assessed against experimental NMR chemical shift data in four benchmark sets of molecular crystals. Employing a variety of commonly used density functionals (PBE0, B3LYP, TPSSh, OPBE, PBE, TPSS), we explore the relative performance of cluster, two-body fragment, and combined cluster/fragment models. The hybrid density functionals (PBE0, B3LYP and TPSSh) generally out-perform their generalized gradient approximation (GGA)-based counterparts. ¹H, ¹³C, ¹⁵N, and ¹⁷O isotropic chemical shifts can be predicted with root-mean-square errors of 0.3, 1.5, 4.2, and 9.8 ppm, respectively, using a computationally inexpensive electrostatically embedded two-body PBE0 fragment model. Oxygen chemical shieldings prove particularly sensitive to local many-body effects, and using a combined cluster/fragment model instead of the simple two-body fragment model decreases the root-mean-square errors to 7.6 ppm. These fragment-based model errors compare favorably with GIPAW PBE ones of 0.4, 2.2, 5.4, and 7.2 ppm for the same ¹H, ¹³C, ¹⁵N, and ¹⁷O test sets. Using these benchmark calculations, a set of recommended linear regression parameters for mapping between calculated chemical shieldings and observed chemical shifts are provided and their robustness assessed using statistical cross-validation. We demonstrate the utility of these approaches and the reported scaling parameters on applications to 9-tert-butyl anthracene, several histidine co-crystals, benzoic acid and the C–nitrosoarene SnCl₂(CH₃)₂(NODMA)₂.

1 Introduction

Hydrogen, carbon, nitrogen and oxygen-containing functional groups play a central role in the structures, chemical reactivity, and solubility of biological and pharmaceutical compounds.¹ Advances in instrumentation and methodology over recent decades have made nuclear magnetic resonance (NMR) spectroscopy a particularly potent tool for investigating structural features associated with ¹H, ¹³C, ¹⁵N, and ¹⁷O nuclei. However, even with modern multi-dimensional NMR experiments, structure elucidation can prove challenging due to the complexity of the spectra and the subtle effects of chemical environment on the chemical shifts.

Computational tools play an increasingly prominent role in NMR spectral assignment and structure elucidation. Early ab initio chemical shielding prediction began with small cluster models, often employing simple charge-embedding schemes to mimic the crystal environment.^2,3 The inherent limitations of such models resulting from their approximate treatment of the crystal lattice limited their widespread application. On the other hand, periodic density functional theory (DFT) is well-suited for modeling chemical shieldings in extended crystal systems, and the plane wave DFT-based gauge-including projector augmented wave (GIPAW) method^4–6 has now become the method of choice for chemical shift prediction for molecular crystals.

The success of the GIPAW technique has contributed significantly to the rapidly expanding field of NMR crystallography, which combines solid state NMR, diffraction methods and ab initio chemical shielding predictions to solve crystal structures.^7–11 However, despite the widespread success of plane wave DFT methods, they suffer from two main limitations. First, plane wave calculations are limited to GGA-type density functionals in practice. Hybrid density functionals can offer improved accuracy for NMR chemical shift prediction,^12–14 but they typically require at least an order of magnitude more computational effort to evaluate in a plane wave basis compared to a GGA functional. In contrast, the cost premium for hybrid density functionals in Gaussian basis sets is typically less than a factor of two.

The second limitation lies in the mapping of absolute shifts obtained from calculations to empirically determined chemical shifts. This mapping is generally performed using a simple linear regression model relating experimental and chemical shifts. However, the linear regression parameters are specific to a given functional/basis set combination.¹⁵ Regression models obtained from plane wave/pseudopotential GIPAW calculations in periodic crystals are not transferable to chemical shieldings computed from all-electron models in non-periodic systems, such as an enzyme active site.

By decomposing the molecular crystal into a series of interacting molecules, fragment methods provide an accurate, low-cost alternative to plane wave techniques for computing a variety of chemical properties, including NMR chemical shieldings.^12,16–20 Fragment methods pave the way for routine use of hybrid density functionals or perhaps even a high-accuracy wave function-based correlation treatment of magnetic properties. Further, fragment methods allow the same density functionals, basis sets and empirically derived scaling parameters to be applied across systems ranging from molecular crystals to molecules in solution or even biomolecules.

We have recently shown that both GIPAW and an electrostatically embedded two-body fragment model reproduce the experimental ¹³C isotropic shifts in a set of 25 organic molecular crystals to within a root mean square error of 2.1–2.2 ppm when using the PBE functional.¹³ However, using a hybrid density functional like PBE0²¹ or B3LYP²² in the fragment model instead decreases the error by a third to 1.4 ppm.¹³ The same study also found good performance for the principal components of the shielding tensors. Furthermore, the chemical shielding scaling parameters obtained here have been successfully applied to the NMR characterization of the 2-aminophenol quininoid intermediate in the mechanism of tryptophan synthase.²³

Building upon the success of fragment-based methods in the context of ¹³C chemical shift predictions,¹³ the present work explores the application of fragment, cluster, and combined cluster/fragment approaches to predicting the isotropic chemical shifts of ¹H, ¹⁵N and ¹⁷O nuclei, and it compares the performance of these models to the widely used GIPAW approach. For the sake of consistency, we also slightly revise the earlier ¹³C chemical shift test results¹³ here using the identical geometry optimization protocol and sets of density functionals across all four nuclei. These four nuclei were chosen because of their ubiquity in organic and biological systems and their importance in NMR studies of these systems. Their widespread use in NMR also means that relatively large sets of experimental shifts could be drawn from studies found in the literature. Because reliable shielding tensor data is harder to find for these nuclei, we focus only on isotropic shifts in this work.

Previous work has shown that the many-body expansion converges more slowly for ¹⁵N and ¹⁷O compared with ¹H and ¹³C.¹² It is particularly important, therefore, to assess the viability of fragment methods for these nuclei. To do so, we have compiled three new benchmark sets of molecular crystals consisting of 80 ¹H, 51 ¹⁵N, and 28 ¹⁷O experimentally measured isotropic chemical shifts. These benchmark sets augment our previously developed ¹³C set consisting of 169 shifts. These new sets include diverse chemical shifts that span the ranges observed in most common biological and pharmaceutical species.^24,25 The use of molecular crystals with well-defined and largely static structures helps mitigate the influence of confounding variables such as solvation or conformational dynamics. It enables more direct assessment of fragment-based methods and produces linear regression parameters that properly account for an explicit chemical environment. Because common benchmark data sets are extremely useful for validating and comparing models developed by different researchers, complete optimized crystal structures and tabulated chemical shifts for these test sets are provided as ESI.†

Developing a robust set of chemical shielding scaling parameters for ¹H, ¹⁵N, and ¹⁷O nuclei is challenging. ¹⁵N and ¹⁷O exhibit broad chemical versatility, and the shifts of all three nuclei types can be affected by proton dynamics, particularly in hydrogen bonding situations. Nitrogen, for example, bonds with most other elements, has oxidation numbers ranging from −3 to +5, coordination numbers from 1 to 6, and bond orders up to 3.²⁴ Such diversity manifests in a chemical shift range spanning ∼1100 ppm. An even greater diversity in chemical shifts is observed for oxygen, with the chemical shifts for water and dioxygen separated by nearly ∼2000 ppm.²⁵ However, in the context of organic molecular crystals and biologically relevant applications, these ranges span ∼400 and ∼1000 ppm for ¹⁵N and ¹⁷O, respectively.²⁶ Proton isotropic shifts are generally limited to a more modest ∼20 ppm range. On the other hand, hydrogen is much more susceptible to dynamics such as methyl group rotation or rapid proton exchange between hydrogen-bonded partners or even nuclear quantum effects. In all cases, care must be taken to adequately represent the desired range of chemical shieldings and to avoid errors introduced in the calculations by neglecting the dynamical averaging occurring in the NMR experiments.

Despite efforts to include a wide variety of chemical environments in each molecular crystal test set, biases resulting from the composition of the test set could impact both the reported accuracy of a given method and the general transferability of the scaling parameters. To assess the degree to which the composition of the training and validation sets impact performance and transferability, a series of statistical cross-validation numerical experiments were performed. In agreement with cross-validation results obtained previously for ¹³C chemical shifts,¹³ the analysis here reveals relatively little impact of training/validation set composition on both overall accuracy or scaling parameters.

Using these benchmark sets, we examine the impact of electrostatic embedding, two-body fragment and cluster cut-off distance, and the choice of density functional on the chemical shift predictions. Specifically, we compare the hybrid density functionals PBE0 and B3LYP, the GGA-based functionals PBE²⁷ and OPBE,²⁸ as well as the meta-GGA functional TPSS and its hybrid variant TPSSh.²⁹ The meta-GGA and hybrid meta-GGA functionals performed well for ¹⁵N chemical shift tensors in a recent study using symmetry-adapted cluster calculations.¹⁴

Although this work primarily focuses on the fragment-based approaches, we have also benchmarked GIPAW across the same test sets for comparison. Despite its widespread use for chemical shift prediction, extensive GIPAW benchmarks for nuclei like hydrogen and oxygen are hard to find. The test sets developed here provide scaling parameters for mapping GIPAW chemical shieldings for these four nuclei to chemical shifts, which may prove useful in other applications.

Through these benchmarks, we demonstrate that with the PBE functional, the fragment-type approaches perform competitively with plane wave GIPAW, especially for hydrogen, carbon, and nitrogen. As we found in our previous study of ¹³C chemical shifts,¹³ however, hybrid density functionals provide statistically improved accuracy over GGAs. In particular, fragment PBE0 calculations out-perform GIPAW PBE ones in terms of the root-mean square (rms) errors and the maximal errors for ¹H, ¹³C, and ¹⁵N. Oxygen chemical shifts, which are more sensitive to many-body effects, prove more difficult to model with the fragment approach. Nevertheless, the ¹⁷O cluster/fragment PBE0 results are almost as accurate as the GIPAW PBE ones, with the differences in root-mean-square error between the two functionals being comparable to typical experimental uncertainties in ¹⁷O chemical shifts. Given the high computational expense associated with using hybrid functionals in plane wave GIPAW calculations, these results provide further support for using fragment-based approaches instead of GIPAW methods for NMR chemical shielding predictions in organic molecular crystals.

Finally, we provide a collection of illustrative examples demonstrating the utility of fragment-based methods coupled with the chemical shielding scaling parameters derived from these training sets. First, we help assign the previously unpublished ¹H/¹³C heteronuclear correlation spectrum for the 9-tert-butyl anthracene ester (9-TBAE) molecular crystal. Second, the prediction of both ¹H and ¹⁵N isotropic shieldings for a particularly challenging set of histidine co-crystals is assessed. Third, we briefly examine issues of proton exchange dynamics using ¹⁷O chemical shielding predictions applied to crystalline benzoic acid. Finally, we assess the accuracy of predicted ¹⁷O chemical shieldings for a challenging organometallic C–nitrosoarene whose experimental chemical shift³⁰ lies far outside the range of oxygen chemical shifts included in the test set.

2 Theory

2.1 Many-body expansion for ab initio shielding tensor calculations

The chemical shielding tensor describes the screening of the external magnetic field experienced at the nucleus by the surrounding electron density. This change in local magnetic field at the nucleus results directly from interaction between the external magnetic field and the local electron density. Formally, the NMR chemical shielding tensor element σ_αβ for atom A is defined as the second derivative of the electronic energy with respect to the α-th component of the external magnetic field B_α and the β-th component of the nuclear magnetic moment on atom A, μ^A_β:


	(1)

As discussed previously,¹² differentiating the many-body decomposition of the energy allows one to express the total chemical shielding tensor [small sigma, Greek, tilde]

of atom A on molecule i in a crystal as a sum of one-body, two-body, and higher-order terms,


	(2)

where σ^A_i is the shielding tensor for atom A on the isolated monomer i, Δ²σ^A_ij is the two-body contribution to the shielding tensor arising from the interaction of monomer i with monomer j,


Δ²σ^A_ij = σ^A_ij − σ^A_i − σ^A_j	(3)

and Δ³σ^A_ijk is the three-body contribution to the shielding tensor and is defined as


Δ³σ^A_ijk = σ^A_ijk − Δ²σ^A_ij − Δ²σ^A_ik − Δ²σ^A_jk − σ^A_i − σ^A_j − σ^A_k	(4)

Note that because atom A lies on monomer i and not monomer j or k, terms like σ^A_j and σ^A_k in eqn (3) and (4) are actually zero.

2.2 Fragment and cluster methods

For each atom on each symmetrically unique molecule in the unit cell, the summations in eqn (2) are carried out over all unique sets of fragments (dimers, trimers, etc.). Evaluation of the one- and two-body contributions can be done inexpensively, but the cost of the three-body and higher terms grows rapidly, giving rise to a significantly larger computational burden if contributions beyond two-body are included. However, assessment of the many-body expansion for ¹³C chemical shielding tensors has demonstrated that the contributions from three-body and higher terms are small relative to the inherent errors from density functional theory, especially if electrostatic embedding models are employed to mimic the crystal lattice.^12,13 Accordingly, a two-body fragment model approximates the shielding tensor as


	(5)

where σ^A,emb._i and Δ²σ^A,emb._ij are the one and two-body contributions with each monomer and dimer calculation carried out in an electrostatic embedding environment which will be discussed below.

A two-body fragment-based calculation in a molecular crystal is carried out by defining a cut off radius R_c around the asymmetric unit, as illustrated in Fig. 1. The chemical shielding tensor for each atom in the asymmetric unit is then approximated by calculating the two-body contributions for all dimers involving that monomer in the asymmetric unit and any other monomer lying within the defined cut off according to eqn (5). For example, using the labeling and cut-off defined in Fig. 1, the total chemical shielding tensor for atom A on monomer i is given by:


	(6)


	(7)

By focusing these many-body expansions on molecules in the asymmetric unit, the fragment approach readily exploits space group symmetry to achieve additional computational savings. Nevertheless, it still computes chemical shielding tensors for every symmetrically unique atom in the crystal. Further computational savings can be achieved by exploiting locally dense basis sets, which use smaller basis sets on all atoms outside the molecules in the asymmetric unit, as described in Section 3.2.


	Fig. 1 Schematic illustrating the application of fragment and cluster-based methods to molecular crystal systems. R_c denotes the cut off distance for a given model. Each cell is labeled for ease of reference.

If a two-body fragment method proves insufficient, a cluster-based calculation can be constructed by including all monomers within the defined cut off radius R_c in a single supermolecular chemical shielding calculation. This effectively amounts to summing the many-body expansion through all orders for the subset of molecules lying closest to the molecule of interest. Additional chemical shielding contributions from more distant molecules in the crystal lattice can then be approximated in a pairwise (two-body) fashion. Specifically, one computes the two-body contributions between the molecule of interest and any other molecule outside the initial cluster but inside a second two-body fragment cut off (orange region in Fig. 1). Cluster-based calculations improve the treatment of local many-body effects, albeit with a substantial increase in computational cost since DFT methods typically scale O(N³) with system size for moderately sized systems.

Electrostatic embedding provides a secondary strategy for approximating both long-range and many-body effects which can be beneficial for both cluster and fragment-based calculations. In the present work, we use a simple electrostatic embedding model based upon point charges obtained from Gaussian distributed multipole analysis (GDMA).^31–33 GDMA point charges are positioned at each atomic center for every monomer within a user-defined charge-embedding radius of molecule i in the central unit cell. The presence of these point charges mimics many-body effects by polarizing the monomer and dimer fragment calculations. Long-range electrostatic effects are captured by extending the charge embedding cut off well into the surrounding lattice. Previous work has demonstrated that a 30 Å charge-embedding cut off ensures convergence in the calculated shieldings,¹² and that cut off is used here as well.

2.3 Computational efficiency

For a typical small-molecule organic crystal, the computational efficiency of GIPAW PBE and fragment 2-body PBE are similar. Our software implementation automatically fragments the crystal, generating the necessary jobs which can then be run with an external electronic structure package, such as Gaussian. The software then combines the results of those fragment jobs into the final set of shielding tensors. For a crystal with a single molecule in the asymmetric unit (Z′ = 1) and a 6 Å two-body cut off, the fragmentation procedure generates one monomer calculation and typically ∼20–25 dimer calculations.

Perfect computational efficiency comparisons between GIPAW and the fragment approach are difficult due to differences in the basis sets, grids, etc. Nevertheless, for a typical small-molecule organic crystal with a handful of molecules in the unit cell, two-body fragment PBE and GIPAW PBE require similar amounts of total CPU time. Several computational features of the fragment method are notable, however. First, the cost premium for using the hybrid functionals in the fragment approach is typically only ∼50%, making them routinely applicable. In contrast, hybrid functionals are rarely employed in GIPAW due to their high computational expense.

Second, due to the local nature of the two-body interactions and exploitation of space group symmetry, the cost of the fragment approach depends on the number of molecules in the asymmetric unit, rather than the total number of molecules in the unit cell. The time to compute chemical shielding tensors for a four-molecule unit cell crystal and an eight-molecule cell polymorph will be about the same, as long as Z′ = 1 for both. Doubling the number of molecules in the asymmetric unit from Z′ = 1 to Z′ = 2 will double the cost of the shielding tensor calculation.

Third, the fragment approach achieves high parallel efficiency. Each of the two dozen or so fragment jobs can be run simultaneously with minimal internode communication. Each individual fragment job can be run in parallel across 1–2 dozen processor cores. Together, these two tiers of parallelization enable the chemical shielding tensor calculation to be run very efficiently on hundreds of processor cores. If one has several hundred processor cores available, chemical shifts on a Z′ = 1 crystal structure of a 70–80 atom molecule can be obtained within a few hours, irrespective of the size of the unit cell.

The cluster/fragment approach is substantially more expensive than the two-body fragment, since it requires computing chemical shifts for a large 10–15 molecule cluster. Still, such calculations are very feasible for clusters containing hundreds of atoms or more due to efficient and/or linear-scaling chemical shift algorithms.^34–36

3 Computational methods

3.1 Crystal structures

Separate benchmark sets of molecular crystal structures were constructed for hydrogen, nitrogen and oxygen to augment our previously developed¹³ carbon set. Specific molecular crystals were chosen based on the availability of both high-resolution room temperature NMR chemical shift data in the literature with unambiguous spectral assignment as well as high-quality X-ray or neutron diffraction crystal structures. Structural data for each compound in the study was obtained either from the literature or from the Cambridge Structure Database (CSD) maintained by the Cambridge Crystallographic Data Center.

The hydrogen, carbon, nitrogen and oxygen benchmark sets consist of 13, 25, 24 and 15 crystal structures, respectively. In the interest of providing a uniform set of scaling parameters obtained using a single computational procedure outlined below, we update our predicted ¹³C shifts here for the carbon test set, though the differences in the ¹³C predicted shifts here and in our earlier work¹³ are trivial (less than 0.1 ppm difference in root-mean-square error). Each of the species in the hydrogen, nitrogen, and oxygen sets are depicted in the ESI,† along with their common names and CSD reference codes. The carbon set species were shown previously.¹³ Each set was chosen to include a variety of intermolecular interactions which are representative of common interactions in pharmaceutical and biological compounds. Experimental chemical shifts for ¹H and ¹³C are reported relative to neat TMS under magic angle spinning (MAS) conditions,³⁷ the ¹⁵N shifts are given relative to external solid NH₄Cl also under MAS, and ¹⁷O shifts are relative to liquid water. Conversions to other chemical shift scales can be accomplished as described by ref. 37; for expediency, several of the most common conversions are given in the ESI.† The CSD reference codes and experimental references for NMR data for the three test sets are:

• Hydrogen (13 structures, 80 shifts): CIMETD,¹ INDMET,³⁸ URACIL,³⁹ co-crystal of 4,5-dimethylimidazole and 3,5-dimethylpyrazole,⁴⁰ AMBACO05,⁴¹ PHBARB06,⁴² IPMEPL,⁷ COYRUD11,⁴³ FPAMCA11,⁸ BAPLOT01,⁸ WEZCOT,⁸ FLUBIP,⁴⁴ ZIVKAQ.⁴⁵

• Carbon (25 structures, 169 shifts, see ref. 13): MBDGAL02, MEMANP11, MGALPY01, MGLUCP11, XYLOBM01, SUCROS04, RHAMAH12, FRUCTO02, GLYCIN03, LALNIN12, LSERIN01, LSERMH10, ASPARM03, LTHREO01, GLUTAM01, LTYROS11, LCYSTN21, NAPHTA36, ACENAP03, TRIPHE11, HXACAN09, INDMET, SULAMD06, ADENOS12, PERYTO10.

• Nitrogen (24 structures, 51 shifts): BITZAF,⁴⁶ GEHHAD,⁴⁶ GEHHEH,⁴⁶ GEHHIL,⁴⁶ LHISTD02,⁴⁷ LHISTD13,⁴⁷ TEJWAG,⁴⁷ GLYCIN03,²⁶ FUSVAQ01,⁴⁸ CYTSIN,⁴⁸ THYMIN01,⁴⁸ URACIL,⁴⁸ CIMETD,¹ BAPLOT01,⁴⁹ compound 1,⁵⁰ LTYRHC10, CYSCLM11, LSERIN01, GLUTAM01, ASPARM03, LCYSTN21, ALUCAL04, GLYHCL01, LGLUAC11.

• Oxygen (15 structures, 28 shifts): LALNIN12,²⁶ ALAHCL,²⁶ VALEHC11,²⁶ LTYRHC10,²⁶ CYSTIN,⁵¹ ACANIL03,⁵² BZAMID07,⁵³ GLYCHL01,²⁶ LGLUTA03,⁵⁴ MBNZAM10,⁵² FEQYUM,⁵⁵ LILEUC10,⁵⁵ PHALNC01,⁵⁵ CYSCLM11,⁵⁵ TPEPHO02.⁵⁶

A number of the nitrogen chemical shifts (those without a citation given above) were measured and reported here, as discussed in Section 4.

3.2 Computational techniques

Experimental crystal structures were subjected to all-atom geometry optimizations with fixed lattice parameters using the freely available, open-source Quantum Espresso software package.⁵⁷ All geometry optimizations were performed using the PBE²⁷ density functional and the D2 dispersion correction,⁵⁸ ultrasoft pseudopotentials with a plane wave cut off of 80 Ry, and typically a 3 × 3 × 3 Monkhorst–Pack k-point grid. This grid typically corresponds to a spacing of 0.04 Å⁻¹ between nearest k-points and, for all but a few structures, a spacing no larger than 0.07 Å⁻¹. Larger numbers of k-points were used in some of the smaller unit cells as needed based on chemical shift convergence tests. See ESI,† for details. We used the pseudopotentials H.pbe-rrkjus.UPF, C.pbe-rrkjus.UPF, N.pbe-rrkjus.UPF, O.pbe-rrkjus.UPF, S.pbe-n-rrkjus_psl.0.1.UPF, F.pbe-n-rrkjus_psl.0.1.UPF, Cl.pbe-n-rrkjus_psl.0.1.UPF, I.pbe-n-rrkjus_psl.0.2.UPF, Na.pbe-spn-rrkjus_psl.0.2.UPF, K.pbe-n-mt.UPF, and P.pbe-n-rrkjus_psl.0.1.UPF from http://www.quantum-espresso.org.

The use of fixed room-temperature experimental lattice parameters effectively compensates for the appreciable increases in volume that occur between the minimum electronic energy structure and the finite temperature structure.^59,60 Note that the use of fixed experimental lattice parameters reduces sensitivity to the specific dispersion correction. The root-mean-square errors in the predicted ¹³C chemical shifts obtained here using PBE-D2 geometries differ by less than 0.1 ppm from those obtained previously for the same crystals using PBE-TS geometries.¹³

Molecular crystal fragmentation through two-body was carried out using our hybrid many-body interaction (HMBI) code.^16,61,62 Individual fragment shielding tensor calculations were performed using Gaussian09⁶³ with the B3LYP, PBE0, PBE, OPBE, TPSS, and TPSSh density functionals. Calculations were performed using a locally dense basis set and the GIAO approximation. Unless otherwise stated, a 6 Å fragment cut off R_c was used for both fragment and cluster/fragment models, and a 4 Å cluster was used in cluster/fragment calculations. All calculations used a 6-311+G(2d,p) basis^64–67 for all atoms on the central monomer of interest, a 6-311G(d,p) basis for all neighboring atoms out to 4 Å, and a 6-31G^68,69 basis for all atoms beyond 4 Å. This locally dense basis set^70,71 combination has proved effective in previous studies^12,72 and is simply referred to here as our “mixed basis”. The ATZP basis⁷³ was used on the tin atom for the nitrosoarene in Section 7.4.

As described in previous work,¹² a large DFT integration grid consisting of 150 radial and 974 Lebedev angular points was used to approach rotational invariance and mitigate the introduction of noise from fragment contributions given by symmetrically equivalent molecules with different orientations. After constructing the raw shielding tensors via the fragment or cluster/fragment approach, the tensors are symmetrized and diagonalized to compute the principal components. Isotropic shieldings are reported as the average of these diagonal values. Note that if one is only interested in predicting the isotropic shifts, one can use the raw isotropic shieldings for each fragment instead of the full tensors. However, the tensor approach used here is more general.

Distributed multipoles computed with the GDMA package^33,74 were used to construct the embedding environment. The GDMA charges were calculated using the same functional and 6-311+G(2d,p) basis set as the chemical shielding calculation. The GDMA point charges were then placed on all molecules lying within 30 Å of any atom in the asymmetric unit cell, as described in previous work.^12,13

Gauge-including projector augmented wave (GIPAW) chemical shielding calculations were performed at the same optimized PBE-D2 (Quantum Espresso) geometries without further relaxation. Calculations were performed using CASTEP⁷⁵ with the PBE functional, ultrasoft pseudopotentials generated on-the-fly and an 850 eV plane wave basis set cut off. Electronic k-points were sampled on a Monkhorst–Pack grid to give a maximum separation between k-points of 0.05 Å⁻¹. The basis set cut off and k-point density were chosen based on chemical shift convergence tests (see ESI†). The basis set plane wave cut off energy was chosen to converge absolute chemical shifts to within 0.4 ppm and, more importantly, relative chemical shifts to 0.05 ppm or better. Chemical shielding was found to be fairly insensitive to the density of the k-point grid. Full space group symmetry was used in all GIPAW calculations.

3.3 Chemical shift linear regression and statistical cross-validation

The experimentally observed chemical shift δ_i represents the difference between the absolute chemical shielding σ_i of nucleus i and the absolute shielding σ_ref of a reference compound. Thus, comparison of predicted shifts with experimental NMR spectra requires mapping between the computed absolute shieldings σ_i and the experimentally referenced chemical shift δ_i. Numerous techniques exist for performing this mapping.¹⁵ Here we adopt a linear regression approach which addresses the shift referencing and helps correct for systematic errors in the calculations.


δ_i = Aσ_i + B	(8)

In the absence of any systematic error, A would take a value of −1, and B would simply be the absolute shielding of the reference compound σ_ref. However, obtaining these parameters via a linear least-squares fit between the calculated and experimental data for each of the test sets provides scaling parameters for each type of nucleus which partially mitigate systematic errors present in the calculations. Further, assuming these regression parameters are fitted to a sufficiently broad and representative test set, they can be used to scale predicted shifts obtained for compounds not included in the test set and can even be applied in the context of non-crystalline systems. Fig. 2 illustrates the application of this approach using the hydrogen, carbon, nitrogen and oxygen test sets using PBE0 fragment two-body calculation.


	Fig. 2 Plot of experimental vs. calculated isotropic shifts for the (a) ¹H, (b) ¹³C, (c) ¹⁵N and (d) ¹⁷O benchmark sets. Predicted shifts were obtained from two-body fragment calculations at the PBE0/mixed basis level with a 6 Å two-body cut off distance.

In an effort to assess the robustness of the linear regression parameters, exhaustive N-choose-5 cross-validation was performed for each test. The cross-validation procedure consists of partitioning the N crystal structures into a training set with N − 5 training structures and 5 validation structures. In this way, 1287, 53 [thin space (1/6-em)] 130, 42504 and 3003 different partitionings were formed for the hydrogen, carbon, nitrogen and oxygen test sets, respectively. Linear regression and error analysis was then performed for each partitioning to obtain distributions of errors and linear regression parameters.

4 Experimental methods

Most experimental chemical shifts considered here were obtained from the literature. However, we found relatively few examples of high-quality, small-molecule crystals with primary amine nitrogen chemical shifts in the literature. Therefore, new experimental ¹⁵N NMR chemical shifts for many amino acids (structures 8 and 16 through 24 in the nitrogen test set) are reported here. The experimental ¹H and ¹³C chemical shifts for 9-tert-butyl anthracene (Section 7.1) are also reported here for the first time.

4.1 9-tert-Butyl anthracene ester

Two-dimensional ¹H, ¹³C heteronuclear-correlation (HETCOR) experiments⁷⁶ were performed at 14.1 T (600.01 MHz ¹H, 150.87 MHz ¹³C) on a Bruker AV600 spectrometer equipped with a triple resonance 1.3 mm MAS probe with a sample spinning rate of 50 kHz (±2 Hz). Less than 2 mg of microcrystalline sample were packed into each rotor. For these experiments, cross polarization (CP) was established using a 2 ms contact time with nutation frequencies of 125 kHz for ¹H and 75 kHz for ¹³C; high power ¹H decoupling during ¹³C acquisition was implemented using XiX (125 kHz, 2.85 τ_r).⁷⁷ Chemical shifts were indirectly referenced to neat TMS using an external sample of adamantane in which the ¹H resonance was set to 1.87 ppm and the down-field ¹³C peak to 38.48 ppm.^78,79

4.2 ¹⁵N solid-state NMR

¹⁵N cross-polarization magic-angle-spinning (CPMAS) solid-state NMR experiments were performed at 9.4 T (¹H frequency 400.37 MHz, ¹⁵N frequency 40.57 MHz) on a Bruker AVANCE III spectrometer equipped with a double resonance 4 mm MAS probe, spinning at a MAS rate of 8 kHz. 83 kHz ¹H π/2 and decoupling pulses were used throughout, along with a 2 ms CP and high power (83 kHz) ¹H decoupling during acquisition. During CP the ¹⁵N nutation rate was set to 46 kHz and the ¹H nutation rate ramped from 31–41 kHz. For each spectrum, 2048 complex data points with a dwell of 20 μs (spectral width 50 kHz, total acquisition time 41 ms) were acquired with a recycle delay between 4 and 60 s. Chemical shifts were referenced to external ¹⁵NH₄Cl set to 0.0 ppm. Samples of nitrogen test set crystal structures 8, 16, and 21 were obtained from Sigma Aldrich, structure 22 from Acros Organics, 18 from Alfa Asar, structures 20 and 24 from Fisher Scientific, and structure 19 from MP Biomedical. These crystal samples were used directly from the supplier without recrystallization. The crystal structures of the samples were confirmed via powder X-ray diffraction.

5 Results and discussion

We begin by first examining the performance of fragment and cluster/fragment models for each of the four nuclei, assessing both the impact of electrostatic embedding as well as two-body cut-off distance. Second, we compare the predicted shifts for fragment, cluster and cluster/fragment approaches across six different commonly used density functionals. Third, the accuracy of the various fragment-type approaches relative to the widely used GIPAW is assessed. Fourth, we examine the statistical robustness of the linear regression parameters using statistical cross-validation. Finally, we apply fragment methods along with our scaling parameters to four chemically interesting problems which involve species not included in the benchmark test sets.

5.1 Performance of fragment and cluster/fragment models

To ensure that the chemical shift predictions are well-converged with respect to the fragment contributions included, we first examine the impact of the two-body cut off distance for both fragment and cluster/fragment models. Fig. 3 illustrates the root-mean-square errors in the isotropic chemical shifts relative to experiment for (a) hydrogen, (c) carbon, (e) nitrogen and (g) oxygen as a function of two-body cut-off distance. Linear regression models of the form presented in eqn (8) were applied separately to each model/cut-off combination using the experimental isotropic shifts for the respective test sets (see ESI† for details). All calculations were performed using the locally dense basis set defined previously in Section 3.2.


	Fig. 3 Assessment of two-body cut off distance dependence as well as relative performance of fragment and cluster/fragment models on the rms errors (left) and error distributions (right) for the calculated isotropic shifts relative to experiment for all hydrogen (a and b), carbon (c and d) nitrogen (e and f) and oxygen (g and h) structures. Calculations were performed with PBE0 and the locally dense basis defined in Section 3.2. Electrostatic embedding was employed unless otherwise specified. Cluster-based calculations reported in the error distributions (right) use a 4 Å cluster. Error distributions given in (b), (d), (f), and (h) for fragment and cluster/fragment calculations were obtained using a 6 Å two-body cut off.

All four nuclei demonstrate a dramatic reduction in rms errors once sufficient two-body terms are included to capture all hydrogen-bonding partners (2–3 Å two-body cut off). As noted previously for ¹³C,¹³ reasonable convergence is achieved for both hydrogen and nitrogen once all nearest-neighbor molecules are included in the fragment calculation using a 4 Å two-body cut-off.

As a result of the inherently local character of the chemical shielding, two-body contributions decrease rapidly with increasing distance between the two molecules. Extending the two-body cut off beyond 6 Å has a very small impact on the predicted ¹H, ¹³C, and ¹⁵N isotropic shifts (Fig. 3a, c and e). Isotropic shifts for ¹⁷O display a slightly greater dependence on long-range two-body contributions (Fig. 3g) and could perhaps benefit slightly from an 8 Å two-body cut off. Nevertheless, for the sake of consistency with calculations on other nuclei, we adopt a 6 Å cut off throughout.

Fig. 3a and b show that both electrostatic embedding and cluster-based calculations provide minimal improvements in the predicted ¹H isotropic shifts compared to a simple, un-embedded fragment two-body model. Carbon benefits slightly more from electrostatic embedding,¹³ while nitrogen and oxygen demonstrate a much greater dependence on it. Fragment-based calculations for ¹⁵N and ¹⁷O without point charge embedding (not shown) proved unreliable, with absolute errors almost uniformly exceeding 20 ppm. These results indicate a greater sensitivity of ¹⁵N and ¹⁷O nuclei to both long-range electrostatics as well as local many-body effects when compared with ¹H and ¹³C. For ¹⁵N, electrostatic embedding is largely sufficient for capturing local many-body effects, as evidenced by the relatively uniform behavior of the fragment and cluster/fragment approaches both in terms of overall rms error (Fig. 3e) as well as the error distributions (Fig. 3f).

In contrast, explicit treatment of local many-body contributions using a small cluster becomes much more important for ¹⁷O chemical shieldings. Both cluster and combined cluster/fragment methods improve the accuracy of predicted isotropic ¹⁷O shifts relative to the two-body fragment model. Cluster/fragment calculations using a 4 Å cluster uniformly improve the overall rms errors relative to two-body fragment calculations by ∼2 ppm (Fig. 3g). These results highlighting the impact of local many-body effects on ¹⁷O shieldings agree with earlier work indicating larger three-body contributions for ¹⁷O compared with ¹H, ¹⁵N and ¹³C.¹² Of course, measuring oxygen chemical shifts experimentally is also challenging, due to the line broadening resulting from the quadrupolar interaction. Experimental uncertainties range ∼0.5–5 ppm^26,51–55 and likely contribute to the errors seen for oxygen here.

The error distributions presented as box plots in Fig. 3b, d, f and h demonstrate similar trends to those observed for the rms errors. Hydrogen, carbon, and nitrogen show comparable performance between fragment and cluster/fragment methods. On the other hand, oxygen clearly shows a noticeably broadened error distribution for the two-body fragment methods compared with cluster-based approaches. Further, individual chemical shifts converge at the same rate with respect to the two-body cut off as the rms errors presented in Fig. 3 show (see Fig. S6 in ESI†). We therefore conclude that in the presence of electrostatic embedding, a two-body fragment approach with a 6 Å cut off is the method of choice for calculating ¹H, ¹³C, and ¹⁵N shifts. On the other hand, high accuracy ¹⁷O calculations benefit from the use of a central 4 Å cluster and perhaps a longer 8 Å two-body cut off.

5.2 Relative performance of DFT functionals

We now assess the performance of various commonly used density functionals for NMR chemical shift prediction in molecular crystals using these models. Specifically, we benchmark two GGA functionals (PBE, OPBE), two hybrid density functionals (PBE0, B3LYP) and a meta-GGA (TPSS) and its hybrid variant (TPSSh). TPSS and TPSSh performed well in a recent molecular crystal study.¹⁴ Using the fragment, cluster, and combined cluster/fragment models, rms errors for each of the ¹H, ¹³C, ¹⁵N, and ¹⁷O chemical shifts on their respective molecular crystal test sets are given in Fig. 4. Table 1 summarizes the rms errors along with linear regression parameters for each nucleus and density functional included in the present work. Results for GIPAW PBE are also included in the table and figures and will be discussed in Section 5.3.


	Fig. 4 Root-mean-square errors broken down by functional and method for (a) ¹H, (b) ¹³C, (c) ¹⁵N and (d) ¹⁷O isotropic chemical shifts.

Table 1 Linear regression parameters and rms errors for the ¹H, ¹³C, ¹⁵N and ¹⁷O test sets using two-body fragment, cluster/fragment (with a 4 Å cluster), and GIPAW calculations. All cluster and fragment calculations employ the mixed basis, electrostatic embedding and a two-body fragment cut off of 6 Å. We recommend these scaling parameters be used for general applications

Atom	Functional	Two-body fragment			Cluster/fragment			GIPAW
Atom	Functional	RMSE	Slope	Intercept	RMSE	Slope	Intercept	RMSE	Slope	Intercept
Hydrogen	OPBE	0.34	−0.9391	29.28	0.36	−0.9324	29.05
	PBE	0.34	−0.9335	29.05	0.36	−0.9260	28.78	0.43	−0.8730	27.19
	TPSS	0.33	−0.9366	29.46	0.35	−0.9291	29.20
	TPSSh	0.33	−0.9294	29.27	0.35	−0.9238	29.07
	PBE0	0.34	−0.9169	28.69	0.35	−0.9111	28.49
	B3LYP	0.33	−0.9210	28.95	0.35	−0.9140	28.71

Carbon	OPBE	1.90	−1.0603	194.78	1.91	−1.0592	194.66
	PBE	2.09	−1.0273	180.43	2.12	−1.0240	180.25	2.18	−0.9902	169.19
	TPSS	2.16	−1.0442	185.91	2.21	−1.0419	185.75
	TPSSh	1.84	−1.0179	184.94	1.86	−1.0162	184.82
	PBE0	1.48	−0.9676	179.58	1.47	−0.9661	179.49
	B3LYP	1.51	−0.9702	173.83	1.52	−0.9685	173.70

Nitrogen	OPBE	6.11	−1.1296	215.63	6.52	−1.1067	215.41
	PBE	5.48	−1.0808	197.53	5.83	−1.0609	197.72	5.40	−1.0165	184.98
	TPSS	5.55	−1.1061	206.39	5.90	−1.0859	206.27
	TPSSh	4.97	−1.0797	205.63	5.26	−1.0616	205.52
	PBE0	4.20	−1.0201	197.84	4.06	−0.9997	197.15
	B3LYP	4.20	−1.0177	191.02	4.34	−0.9996	191.18

Oxygen	OPBE	11.01	−1.1461	278.91	8.85	−1.1347	281.53
	PBE	11.56	−1.1440	262.61	8.79	−1.1291	264.15	7.20	−1.0663	248.30
	TPSS	11.35	−1.1632	276.67	8.88	−1.1505	278.32
	TPSSh	10.63	−1.1265	278.45	8.33	−1.1153	280.05
	PBE0	9.80	−1.0607	270.18	7.55	−1.0502	271.60
	B3LYP	9.96	−1.0646	264.15	7.48	−1.0551	266.32

Fig. 4 reveals several insightful trends. First, the three hybrid functionals demonstrate improved rms errors relative to the GGA-type and meta-GGA functionals across all functionals and atom types. The PBE0 and B3LYP hybrid functionals clearly give rise to the lowest rms errors for ¹³C, ¹⁵N and ¹⁷O, notably outperforming the GGA, meta-GGA and meta-hybrid functionals. For ¹H, the differences in rms error across the different functionals are a trivial few hundredths of a ppm. Second, the trends regarding the relative performance of the different computational models reported for PBE0 in the previous section hold across all functionals for ¹H, ¹³C and ¹⁵N nuclei. Root-mean-square errors for the GGA-type functionals differ by only a couple tenths of a ppm across fragment, cluster and cluster/fragment calculations. For oxygen, on the other hand, the rms errors for the two-body fragment model are up to several ppm larger than those for the cluster or cluster/fragment models, reiterating the importance of capturing many-body effects when predicting ¹⁷O chemical shifts.

Third, it was recently reported that the meta-GGA density functional TPSS demonstrated improved agreement with experiment relative to GGA functionals using a GIAO/symmetry-adapted cluster model in predicting the principal components of the chemical shielding tensor for ¹³C, ¹⁵N, ¹⁹F and ³¹P nuclei.¹⁴ Here, no such behavior is observed for the isotropic ¹³C and ¹⁷O chemical shifts. For nitrogen, the meta-GGA TPSS does perform slightly better than PBE or OPBE, but the differences are ∼0.1 ppm or less between TPSS and OPBE. For ¹H, all six functionals are essentially indistinguishable from one another in terms of accuracy.

Holmes et al.¹⁴ also observed that PBE0 performed worse than the GGA and meta-GGA functionals they tested for nitrogen, while it and B3LYP perform the best here. The reasons for the discrepancy between that study and the current one are unclear, but the studies do differ in a few notable ways. First, ref. 14 focuses on principal components of the chemical shielding tensors, while the work here is based on isotropic chemical shifts. On the one hand, shielding tensor principal components contain more detailed information about the local environment that is averaged out in the isotropic shift, making them a more sensitive probe for the structure and more challenging for models to predict. On the other hand, measuring tensor components is inherently less precise than extracting isotropic chemical shifts, which could increase the experimental errors in the tensor data used in the benchmarking. Second, no van der Waals dispersion correction was employed when performing all-atom crystal structure optimizations in ref. 14. Dispersion is critical in molecular crystals,^80–83 though freezing the lattice parameters at their experimental values partially mitigates problems arising from its neglect.⁸⁴ Third, the clusters used in ref. 14 typically contain 13–17 molecules, which are similar in size to the 4 Å clusters used here, and did not employ charge-embedding. As seen in Fig. 3, the effects of charge embedding and longer-range interactions beyond 4 Å are modest but non-zero. While further investigation is needed to sort out these details, hybrid functionals stand out here as the functionals of choice for NMR chemical shift calculations for all nuclei under consideration.

Fourth, the slopes in the regression lines of the best-performing hybrid functionals mostly deviate from the ideal −1 by a few percent. Such deviations are fairly typical¹⁵ and they help compensate for various systematic errors, including the omission of dynamics, vibrational averaging, etc.^85–89 Interestingly, the deviations from −1 in the hydrogen slopes are much larger at 8–9%. This might reflect the particular importance of dynamics and nuclear quantum effects for hydrogen.⁹⁰

Finally, we examine the degree to which similarities in the predicted chemical shifts among various computational models and density functionals extend to individual atoms. To provide visual context for the scatter plot, the shaded region in these plots indicates plus or minus the rms error for the method on the y-axis. Fig. 5 plots the correlations between a collection of top-performing model/density functional combinations. Previous results for ¹³C isotropic chemical shifts demonstrate relatively uniform performance at the individual atom level across both functional class and fragment model,¹³ and comparable results are seen here in Fig. 5c and d. Fig. 5a illustrates the correlations in the errors for the isotropic ¹H shifts relative to experiment for PBE0 fragment and cluster/fragment calculations. Fig. 5b plots the similar comparison between fragment-based PBE0 and B3LYP calculations. The corresponding linear fits demonstrate both high correlation coefficients as well as slopes near unity, indicating that fragment and cluster/fragment models using either PBE0 or B3LYP predict very similar ¹H shieldings. Similar results are observed for ¹⁵N and ¹⁷O when comparing across hybrid functionals (see Fig. 5f and h).


	Fig. 5 (left) Correlation of the errors predicted with the two-body fragment and cluster/fragment PBE0 models and (right) correlation of the errors between the PBE0 and B3LYP functionals with two-body fragment (or cluster/fragment for oxygen) models. The shaded region indicates ±RMSE from the model on the y-axis for a visual guide.

On the other hand, both ¹⁵N and ¹⁷O show slightly reduced correlation for fragment and cluster/fragment errors (Fig. 5e and g). In the case of ¹⁷O these results are not surprising given the reduction in rms errors observed for cluster-based approaches. However, the differences in the predicted ¹⁵N isotropic shifts between fragment and cluster/fragment methods is more surprising given the uniform performance in terms of rms errors.

5.3 Comparison of fragment and GIPAW approaches

Given the widespread use of plane wave GIPAW (particularly with the PBE functional) for chemical shift prediction in molecular solids,⁶ it is interesting to compare the accuracy of fragment-based approaches against GIPAW ones. The right column in Fig. 3 plots the error distributions for GIPAW PBE across the four test sets. Fig. 4 plots the rms errors for GIPAW PBE on each test set against the various fragment models and functionals described in the Section 5.2, and Table 1 summarizes the resulting linear regression parameters and rms errors.

Consider first how GIPAW and fragment approaches compare when using the same PBE functional. As shown in Table 1, GIPAW PBE predicts the chemical shifts with rms errors of 0.43 ppm for ¹H, 2.2 ppm for ¹³C, 5.4 ppm for ¹⁵N, and 7.2 ppm for ¹⁷O. We previously demonstrated that GIPAW and the two-body fragment PBE model behave very similarly for ¹³C.¹³ This observation is reiterated here, where the two-body fragment model obtains an rms error of 2.1 ppm for ¹³C versus 2.2 ppm for GIPAW. Similarly, both the two-body fragment and GIPAW models perform about the same for ¹⁵N with PBE (RMSE 5.5 ppm).

Interestingly, GIPAW PBE (RMSE 0.43 ppm) performs notably worse for ¹H than the two-body fragment model in the same functional (RMSE 0.34 ppm). Though this difference is fairly small in absolute terms, the larger GIPAW errors stand out notably from the highly consistent results obtained for the fragment-type methods with different functionals in Fig. 4. The reasons underlying the larger GIPAW ¹H errors are unclear. Of course, the experimental uncertainties for ¹H are often ∼0.1 ppm, so the difference between the fragment and GIPAW approaches may not be statistically significant.

Only for oxygen does GIPAW PBE (RMSE 7.2 ppm) significantly out-perform the two-body fragment PBE model (RMSE 11.6 ppm). As discussed above, many-body effects are particularly important for oxygen, and these are only approximated via point charge embedding in the two-body fragment model. Switching to the combined cluster/fragment PBE model improves the oxygen chemical shifts substantially, reducing the rms error down to 8.8 ppm. Still, even this smaller rms error remains ∼25% larger than the GIPAW one.

As discussed in Section 5.2, a key advantage of the fragment approaches is that they enable routine use of hybrid functionals for chemical shift prediction with only modest additional computational effort. For the nuclei other than hydrogen, hybrid functionals appreciably improve the quality of the chemical shift predictions. In ¹³C, two-body PBE0 or B3LYP rms errors of 1.5 ppm are about a third smaller than the GIPAW PBE ones (2.2 ppm). For ¹⁵N, the two-body fragment PBE0 and B3LYP errors are 1.2 ppm (∼20%) smaller than those from GIPAW PBE. For ¹⁷O, switching to a hybrid functional reduces the rms error to 7.5–7.6 ppm, eliminating most of the difference in the errors between the cluster/fragment and plane wave methods. Indeed, the ∼0.3 ppm difference in statistical errors between cluster/fragment PBE0 and GIPAW PBE is comparable to or smaller than the magnitude of the uncertainties in the experimental oxygen chemical shifts.

How these rms errors translate into the overall error distributions can be seen in the box plots in Fig. 3. For ¹H, ¹³C, and ¹⁵N, two-body fragment PBE0 exhibits both more small errors (as indicated by the smaller box sizes, which delineate the middle 50% of the data) and smaller-magnitude maximum errors (as indicated by the box whiskers). For oxygen, the GIPAW PBE and cluster/fragment PBE0 have a similarly sized boxes (i.e. many of the errors are comparable), but the worst errors have larger magnitude with the cluster/fragment approach.

It is also interesting to compare correlations between individual shifts predicted with GIPAW and the fragment approaches. Several such plots are shown in Fig. 6. To provide visual context for the scatter plot, the shaded region in these plots indicates plus or minus the GIPAW PBE rms error. For ¹³C, individual shifts predicted with GIPAW and two-body fragment PBE are very similar, as demonstrated by the excellent correlation in Fig. 6c. Virtually all of the variation between the two models is within the magnitude of the rms errors for the models (shaded region). On the other hand, individual two-body fragment PBE0 ¹³C shifts are somewhat different (and statistically better) than the GIPAW PBE ones (Fig. 6d). A moderate fraction of these differences are larger than the magnitude of the rms errors for GIPAW. For ¹H, GIPAW PBE correlates relatively poorly with either two-body fragment PBE or PBE0 (Fig. 6a–b), as might be expected from the overall larger rms errors seen with GIPAW. On the other hand, differences in the individual predicted shifts between the fragment and plane wave models are largely comparable in magnitude to the GIPAW rms error.


	Fig. 6 The left column plots the correlation between the fragment PBE and GIPAW PBE isotropic shift errors for each nucleus, while the right column compares fragment PBE0 and GIPAW PBE. For oxygen, the cluster/fragment data is shown. The shaded region indicates ±RMSE from GIPAW for a visual guide.

For ¹⁵N, the correlation between two-body PBE and GIPAW is moderate (Fig. 6e), despite their statistically similar errors. In other words, the two models do predict moderately different shifts for individual atoms, but the variations in individual shifts between the two models generally appear to be comparable to or less than the overall rms errors of ∼5.5 ppm. Just as for carbon, the fragment PBE0 shifts correlate less well with GIPAW shifts than the PBE ones do (Fig. 6f). Again, as with carbon, PBE0 shifts differ from PBE ones for individual atoms, and those differences lead to smaller statistical errors across the test sets. Finally, for oxygen, the correlations with GIPAW PBE are reasonable for both cluster/fragment PBE and PBE0 (Fig. 6g–h), despite the differences in the rms errors for the different models.

Taken together, these results indicate that individual chemical shifts predicted by the different models are generally similar. For the most part, the variations seen for individual chemical shifts among the different models are on par with the statistical errors relative to experiment observed in these test sets.

Finally, the errors seen here compare well with those found in other solid-state benchmarks. Previous DFT cluster and/or GIPAW benchmark studies in molecular crystals found errors of ∼0.3 ppm for hydrogen⁷ and ∼1.5–2 ppm for carbon.^7,14,91 Fewer statistics on large sets of molecular crystals are available for ¹⁵N and ¹⁷O nuclei. Holmes et al.¹⁴ report rms errors of ∼15 ppm for principal components of ¹⁵N shielding tensors, compared to errors of less than ∼5 ppm for isotropic shifts here. Isotropic shifts are generally much easier to predict correctly, and the ratio of ∼3 between the errors in the principal components and isotropic shift is consistent with the analogous ratios observed in ¹³C benchmarks.^13,91 A different cluster model study by Rorick et al. reports a mean absolute deviation of 13 ppm for a dozen ¹⁷O isotropic shifts spanning a broad range of organic and inorganic environments.⁵⁶ Our results are also compatible with smaller-scale GIPAW studies on nitrogen^92–94 and oxygen nuclei.^{92,93,95–97} See the review article by Bonhomme et al.⁶ for additional GIPAW examples.

In summary, the results presented in these sections clearly demonstrate excellent performance for fragment-based NMR chemical shielding calculations for ¹H, ¹³C, ¹⁵N and ¹⁷O isotropic chemical shieldings. Hybrid functionals like B3LYP and PBE0 consistently out-perform GGA functionals for all four nuclei. Statistically, the use of hybrid functionals allows fragment-based approaches to nearly match (¹⁷O) or improve upon (¹H, ¹³C, and ¹⁵N) the quality of isotropic chemical shifts obtained with GIPAW PBE. Coupled with the general utility of Gaussian basis set wave function-based ab initio shielding calculations, the present work provides compelling support for more widespread use of fragment-based methods.

6 Statistical cross-validation of the regression models

When training a model against a set of benchmark results, as is done here for the linear regressions that convert absolute chemical shieldings to observed chemical shifts (eqn (8)), there is always the danger that the fit parameters will not transfer well to systems that were not part of the training set. Accordingly, it is important to validate the performance of the models on species not included in the training set.

Instead of arbitrarily partitioning the test sets collected here into distinct training and validation subsets, we assess the robustness of the linear regression parameters obtained here using exhaustive N-choose-k cross validation trials for each nucleus and density functional. Here, N is the number of crystals in the benchmark set, and k is the number of crystals in the validation subset. The cross-validation procedure then considers all possible ways of partitioning the N crystals into N − k training crystals and k validation crystals. For example, the oxygen set contains N = 15 crystal structures. Setting k = 5 results in 3003 possible combinations in which the regression model is trained on 10 of the crystal structures and then validated on the remaining 5 structures. Comparing the statistical performance of the model over all 3,003 combinations to the original value obtained by fitting to the entire benchmark set provides a measure of the robustness of the linear regression models. This same approach was applied previously to our carbon test set.¹³ That earlier work found that k values between 5 and 9 gave very similar cross-validation results in the 25 crystal ¹³C test set.

Table 2 summarizes the mean rms errors and linear regression parameters obtained across the thousands of fits. Standard deviations in the mean linear regression parameters are also reported. Comparison of these values with those obtained from the original fits against the entire benchmark sets (listed in Table 1) provides a measure of robustness of the regression models and chemical shift error estimates.

Table 2 Cross-validation analysis for isotropic shifts. Data reported for ¹H, ¹³C and ¹⁵N nuclei was obtained using the charge-embedded two-body fragment method (R_2b = 6 Å, R_emb = 30 Å). Cluster/fragment results are reported for ¹⁷O (R_2b = 6 Å, R_emb = 30 Å, 4 Å cluster). Uncertainties in the slope and deviation represent the standard deviations in the fit parameters observed over all cross-validation fits

Atom	Functional	Original RMSE	Cross-validation RMSE	Mean slope (A)	Mean intercept (B)
Hydrogen	OPBE	0.34	0.37	−0.9382 ± 0.0134	29.26 ± 0.32
	PBE	0.34	0.36	−0.9327 ± 0.0127	29.03 ± 0.30
	TPSS	0.33	0.36	−0.9358 ± 0.0124	29.45 ± 0.30
	TPSSh	0.33	0.36	−0.9287 ± 0.0119	29.25 ± 0.29
	PBE0	0.34	0.36	−0.9160 ± 0.0116	28.67 ± 0.28
	B3LYP	0.33	0.36	−0.9327 ± 0.0127	29.03 ± 0.30
	PBE (GIPAW)	0.43	0.47	−0.8731 ± 0.0182	27.20 ± 0.47

Carbon	OPBE	1.90	1.91	−1.0602 ± 0.0023	194.77 ± 0.25
	PBE	2.09	2.13	−1.0273 ± 0.0028	180.42 ± 0.27
	TPSS	2.16	2.22	−1.0441 ± 0.0031	185.89 ± 0.35
	TPSSh	1.84	1.87	−1.0178 ± 0.0024	184.93 ± 0.29
	PBE0	1.48	1.51	−0.9676 ± 0.0014	179.58 ± 0.15
	B3LYP	1.51	1.49	−0.9701 ± 0.0016	173.83 ± 0.18
	PBE (GIPAW)	2.18	2.25	−0.9901 ± 0.0028	169.18 ± 0.25

Nitrogen	OPBE	6.11	6.40	−1.1220 ± 0.0058	215.37 ± 0.82
	PBE	5.48	5.75	−1.0755 ± 0.0049	197.44 ± 0.68
	TPSS	5.55	5.81	−1.1001 ± 0.0051	206.23 ± 0.70
	TPSSh	4.97	5.20	−1.0747 ± 0.0043	205.50 ± 0.62
	PBE0	4.20	4.41	−1.0200 ± 0.0033	197.83 ± 0.50
	B3LYP	4.20	4.42	−1.0176 ± 0.0034	191.00 ± 0.50
	PBE (GIPAW)	5.40	5.65	−1.0117 ± 0.0046	184.95 ± 0.64

Oxygen	OPBE	8.85	10.29	−1.1393 ± 0.0267	281.52 ± 1.22
	PBE	8.79	10.06	−1.1331 ± 0.0233	264.06 ± 1.33
	TPSS	8.88	10.19	−1.1548 ± 0.0248	278.28 ± 1.31
	TPSSh	8.33	9.45	−1.1188 ± 0.0209	280.03 ± 1.27
	PBE0	7.55	8.42	−1.0525 ± 0.0151	271.56 ± 1.19
	B3LYP	7.48	8.36	−1.0574 ± 0.0150	266.26 ± 1.19
	PBE (GIPAW)	7.20	8.14	−1.0687 ± 0.0152	248.21 ± 1.21

The mean cross-validation rms error for ¹³C and ¹⁵N nuclei across all functionals is only ∼1–5% larger than the corresponding value obtained by fitting against the entire set. Larger increases in the rms error are observed for ¹H and ¹⁷O nuclei, ranging from ∼6–8% and ∼9–15%, respectively. These differences likely result from a combination of test set size, inherent challenges in predicting chemical shieldings for certain nuclei, as well as uncertainties in the experimental data. Note too that the overall cross-validation behaviors of the fragment and GIPAW approaches are qualitatively similar.

Comparing the mean linear regression parameters obtained via cross-validation (Table 2) with those reported in Table 1 also reveals close agreement in the regression parameters. The mean slopes differ only in the third decimal place, while the intercepts generally agree to within a few hundredths of a ppm. The standard deviations in the cross validation parameters are roughly one order of magnitude larger. For ¹³C, ¹⁵N and ¹⁷O, these uncertainties in the regression model parameters have minimal effect on the predicted chemical shifts. Proportionally, the uncertainties in the ¹H scaling parameters are much larger, however, the relatively uniform rms errors for each partitioning indicate a degree of compensation between the slope and intercept.

Overall, these tests demonstrate the robustness of the regression model parameters obtained in the benchmarks here. The reported scaling factors in Table 1 should prove useful in future molecular crystal studies or in other cases such as biological systems where the explicit treatment of the environment is necessary for accurate chemical shift prediction. The next section discusses four illustrative applications which apply these scaling parameters to systems which were not included in the benchmark sets.

7 Applications

The purpose of developing the regression models in the previous sections is to enable chemical shift prediction in new systems which are not included in the test sets. The following sections provide example applications of the regression models for each of the four nuclei studied here. In assessing the quality of the predictions, one should consider the distributions of errors observed in the benchmark test sets. As shown in Fig. S5 in the ESI,† the error distributions are somewhat Gaussian, particularly for the larger test sets. Notably, 65–75% of the errors fall within one standard deviation of the mean (zero error), and 92–97% of the errors fall within two standard deviations. These percentages compare favorably with the 68% and 95% probabilities expected for data lying within one and two standard deviations in an ideal normal distribution.

Accordingly, as a rule of thumb, we consider any predicted shift that lies within twice the root-mean-square error of the experimental value to be in reasonable agreement. For example, based on two-body fragment (or cluster/fragment for oxygen) data from Table 1, this means that individual PBE0 shift errors of 0.68 ppm for ¹H, 3.0 ppm for ¹³C, 8.4 ppm for ¹⁵N, and 15.1 ppm for ¹⁷O should be considered acceptable. Larger errors will sometimes occur, of course, but relatively rarely (∼5% of the time). When considering larger sets of predicted shifts, one might similarly hope that the rms errors would be similar in magnitude to the test set rms errors. One could attempt to make this latter argument more rigorous using a χ² test, for instance, but we do not do so here.

7.1 9-tert-Butyl anthracene ester

The 9-tert-butyl anthracene ester (9-TBAE) has been the subject of recent experimental interest in light of its unique photochemical properties. Previous work demonstrated a photodimerization-induced expansion of the molecular crystal nanorods of up to 15%.^98,99 Although initial evidence characterized the photodimerized product as a metastable intermediate, the mechanism for the expansion remains unclear. In the present work we present tentative ¹H and ¹³C assignments in the heteronuclear correlation (HETCOR) spectra for the 9-TBAE monomer (Fig. 7).


	Fig. 7 HETCOR ¹H/¹³C spectrum for 9-TBAE (top) and zoom focusing on the aromatic region (bottom). The location of the experimental ¹³C peaks are given by red dashed lines. Predicted cross-peak locations are illustrated in green for protons directly bound to the carbon and in purple for the nearest neighboring proton.

A previous X-ray diffraction study of the 9-TBAE crystal structure revealed two distinct configurations of the t-butyl side chain with a 69%/31% occupancy ratio at low temperature, which disappears above 100 K.⁹⁹ Hydrogen-only plane wave geometry optimizations with fixed lattice parameters were performed on each configuration (see Section 3.2 for further details). Fragment-based NMR chemical shielding calculations using the PBE0 functional and mixed basis described above with a 6 Å two-body cut off were performed on both optimized crystal structures. The predicted chemical shieldings were converted to chemical shifts using the test-set-derived PBE0 scaling parameters presented in Table 1. The predicted chemical shifts for each configuration were then weighted according to the site occupancy to obtain final predicted chemical shifts for assigning the spectrum. The methyl carbon and its hydrogen shifts were also averaged under the assumption of fast dynamics.

The upper panel of Fig. 7 illustrates the HETCOR spectrum with the experimental peaks labeled in red and the corresponding predicted peaks in blue. While unambiguous assignment of C_Me, C₁, C₂ and H_Me is possible upon inspection of Fig. 7, the aromatic region requires more careful analysis. The lower panel in Fig. 7 shows an expansion of the aromatic region of the HETCOR spectrum. The experimental ¹³C shifts are indicated as dashed red lines. Predicted cross-peaks to carbon are illustrated using green triangles for directly bound hydrogen atoms and purple for nearest-neighbor hydrogens. The predicted cross-peaks agree well with the experimental HETCOR spectrum, with the largest ¹³C error approximately ∼2.5 ppm.

Tables 3 and 4 present tentative spectral assignments based on the fragment NMR calculations. Most carbon shifts in the aromatic region can be plausibly assigned based on the sequential ordering of the predicted carbon shifts. Two pairs of carbon atoms, C₈/C₁₀ and C₄/C₉, are difficult to resolve unambiguously due to their very similar experimental shifts, but our predicted shieldings provide candidate assignments. C₃ has no directly bound hydrogen and should exhibit weak cross-peaks to neighboring protons, so it was assigned to the weak peak at 129.22 ppm. C₄ and C₉ exhibit similarly weak cross-peaks, but were assigned to the larger peak at 127.70 ppm due to their predicted overlapping resonances. Based on these assignments, two-body fragment PBE predicts an overall ¹³C rms error of only 1.57 ppm. GIPAW PBE calculations lead to the same assignments, albeit with a larger ¹³C rms error of 2.17 ppm relative to experiment.

Table 3 Experimental ¹³C isotropic shifts (in ppm) along with tentative carbon assignments based on two-body fragment PBE0 NMR calculations using PBE0 and a mixed basis. GIPAW PBE shifts are also listed. See Fig. 7 for atom numbering

Atom	Expt.	2-Body PBE0	Error	GIPAW PBE	Error
C_Me	28.63	28.52	0.11	24.05	4.58
C₁	83.21	83.27	−0.06	85.00	−1.79
C₂	169.94	171.43	−1.49	170.06	−0.12
C₃	129.22	126.69	2.53	126.95	2.27
C₄	127.70	125.26	2.44	124.95	2.75
C₅	124.20	123.28	0.92	122.79	1.41
C₆	128.84	127.89	0.95	127.57	1.27
C₇	123.42	122.38	1.04	122.10	1.32
C₈	130.94	129.73	1.21	129.39	1.55
C₉	127.70	125.25	2.45	125.57	2.13
C₁₀	130.62	129.16	1.46	129.01	1.61
RMSE			1.57		2.17

Table 4 Possible cross-peak assignments of the 9-TBAE HETCOR spectrum, based on the carbon assignments from Table 3, along with the corresponding predicted ¹H chemical shifts. All shifts in ppm. Listed atom pairs correspond to hydrogens either directly bonded to the carbon or on nearest-neighbor carbons

¹H/¹³C expt.	Cross-peak assignment	2-Body PBE0 ¹H	GIPAW PBE ¹H
0.68/28.63	H_Me/C_Me	0.89	0.67
0.65/83.21	H_Me/C₁	0.89	0.67
0.79/169.94	H_Me/C₂	0.89	0.67
7.41/129.22	H₅/C₃	7.45	6.82
7.72/127.70	H₅/C₄, C₉	7.45	6.82
7.72/127.70	H₁₀/C₄, C₉	7.22	6.42

7.70/124.20	H₅/C₅	7.45	6.82
7.70/124.20	H₆/C₅	7.76	7.27

7.31/128.84	H₅/C₆	7.45	6.82
	H₆/C₆	7.76	7.27
	H₇/C₆	6.77	6.42

7.37/123.42	H₆/C₇	7.76	7.27
	H₇/C₇	6.77	6.42
	H₈/C₇	6.95	6.33

6.90/130.94	H₇/C₈	6.77	6.42
	H₈/C₈	6.95	6.33
	H₁₀/C₈	7.22	6.42

6.90/130.62	H₈/C₁₀	6.95	6.33
6.90/130.62	H₁₀/C₁₀	7.22	6.42

Given the degree of overlap in the proton resonance for the aromatic region, spectral assignment of the ¹H shifts proves more difficult. Table 4 lists the prominent, experimentally observed cross peaks in the HETCOR spectrum. Combining the carbon assignments and the assumption that the resonances will be dominated by contributions involving either directly bonded or nearest-neighbor hydrogen atoms, we propose the possible assignments listed in the table. Based on the assignments for C₈ and C₁₀, for instance, the two cross peaks at 6.9 ppm for ¹H likely correspond to H₈/C₈ and H₁₀/C₁₀ correlations, perhaps with smaller contributions from dipolar couplings to nearest-neighbor hydrogens (e.g. H₇/C₈, H₁₀/C₈, and H₈/C₁₀). As shown in Fig. 7 and Table 4, the predicted two-body fragment PBE0 ¹H shifts for these assignments are generally within a few tenths of a ppm of the experimental values.

Performing the analysis with ¹H GIPAW PBE shifts instead of two-body PBE0 leads to the same qualitative cross-peak assignments. For the hydrogen shifts associated with the methyl, C₆, and C₇ atoms, GIPAW PBE performs comparably well or up to a few tenths of a ppm better than two-body PBE0. On the other hand, GIPAW PBE underestimates the hydrogen shifts by ∼0.5–1 ppm for cross-peaks associated with C₃, C₄, C₅, C₈, and C₁₀. Overall, the ¹H and ¹³C isotropic chemical shifts predicted with either the two-body fragment PBE0 model or GIPAW PBE can help assign the HETCOR spectrum of 9-TBAE. However, the fragment PBE0 predictions provide moderately better agreement with experiment.

7.2 ¹⁵N Chemical shift predictions in histidine co-crystals

Hydrogen bonding between imidazole and carboxylate moieties occurs frequently in biological systems. Given that both functional groups have similar pK_a values, enhanced proton mobility is often observed.¹⁰⁰ In an effort to characterize these ubiquitous interactions better, a recent study used solid-state NMR spectroscopy to probe the magnetic properties of a collection of histidine-containing molecular co-crystals.¹⁰⁰

Here, we investigate the ability of our fragment-based chemical shift predictions to discriminate among the two imidazole ring nitrogens and between the different crystal environments found in four such co-crystals: L-histidine perchlorate (H1), L-histidine monohydrochloride monohydrate (H2), L-histidine hydrogen oxalate (H3), and L-histidine hydrogen oxalate co-crystals (H4). The hydrogen bonding and CSD reference codes for these crystals are shown in Fig. 8.


	Fig. 8 Hydrogen bonding in optimized histidine structures. N–O bond lengths are given in Å.

Each co-crystal was subjected to an all-atom geometry optimization using fixed lattice parameters, as described in Section 3.2. Both GIPAW PBE and fragment PBE0 chemical shielding calculations were performed on each of the optimized structures. The raw shieldings were scaled according to eqn (8) using the test-set derived scaling parameters reported in Table 1.

A comparison of the experimental and predicted ¹⁵N isotropic shifts for each of the histidine co-crystals is given in Fig. 9. The predicted ¹⁵N isotropic shifts systematically overshoot the experimental values (Fig. 9) by between 3–9 ppm for fragment PBE0 and 2–10 ppm for GIPAW PBE. The fragment PBE0 rms error of 6.9 ppm is larger than the nitrogen test set rms error of 4.2 ppm (Table 1), but it still lies well within the expected error distribution. In fact, close examination of the predicted shifts for comparable sp² hybridized nitrogen atoms hydrogen-bonded to a carboxylate group in the nitrogen benchmark set (e.g. see structures 3, 5, 6, 7, and 9 in the ESI†) reveals a similarly large rms error. In other words, these particular nitrogen environments prove slightly harder to model than other ones. For comparison, cluster/fragment PBE0 predicts the shifts with an rms error of 6.1 ppm, while GIPAW PBE gives an rms error of 6.6 ppm.


	Fig. 9 (a) Comparison between two-body fragment PBE0 predicted and experimental imidazole nitrogen chemical shifts for the histidine co-crystals H1–H4 shown in Fig. 8. (b) Comparison of GIPAW PBE versus experiment.

As a side note, the systematic nature of the over-estimation of the chemical shifts with all three methods means that one could reduce the rms errors by directly fitting the predicted shieldings to the experimental shifts viaeqn (8). For example, doing so with the two-body PBE0 data reduces the error to 2.7 ppm. Of course that approach lacks the transferability of the scaling models developed here.

The differences between the δ and ε N chemical shifts in a given co-crystal range from 3–13 ppm. Except for L-histidine perchlorate (H1), the predicted chemical shifts correctly order the δ and ε nitrogen shifts, which would be important when using these calculations to assign the histidine nitrogen features. H1 exhibits the smallest difference between the two nitrogen shifts (3.4 ppm), and both the fragment PBE0 and the GIPAW PBE results incorrectly order those shifts.

A more stringent test comes from considering the ordering of the chemical shifts across all four co-crystals. Fragment PBE0 orders most of the shifts correctly, except for those in H1 and the ε nitrogen in H4. The δ nitrogens in H2, H3, and H4, all of which exhibit short N–O distances, are over-estimated by 8–9 ppm, while the ε nitrogen in H4 has the smallest error of 3 ppm, leading to the incorrect ordering. GIPAW does better for H4_ε, correctly predicting that it occurs further downfield from H2_ε and H3_δ. Overall, despite imperfect reproduction of the qualitative trends, the predicted chemical shifts derived from the nitrogen scaling parameters provide a useful tool for investigating how chemical environment impacts these ¹⁵N chemical shifts.

7.3 Benzoic acid

In the crystalline state, benzoic acid molecules form symmetric carboxylic acid dimers (Fig. 10). There are two possible configurations for the hydrogens in the carboxylic acid dimer, and these configurations inhabit slightly different environments in the crystal,¹⁰¹ as shown in Fig. 10. In configuration A, the protonated oxygen is closer to the meta hydrogen in the neighboring co-planar dimer, while in configuration B it is closer to the ortho hydrogen.


	Fig. 10 Predicted cluster/fragment PBE0 ¹⁷O chemical shifts for benzoic acid. Though the isolated dimers would be symmetric, symmetry of the two configurations is broken in the crystal. The protonated oxygen is closer to either the (a) meta hydrogens in configuration A or (b) ortho hydrogens of the neighboring benzoic acid dimers in configuration B.

Given four oxygens per dimer and two unique dimer configurations, one might expect up to eight oxygen chemical shifts. However, the oxygens diagonal to one another within a given dimer are equivalent by symmetry, which reduces the number of potential shifts to four. At room temperature, fast proton exchange further dynamically averages over configurations A and B. In the end one expects to observe two unique ¹⁷O shifts. The experimental NMR spectrum shows a single, broad ¹⁷O peak at 230 ppm under magic angle spinning conditions.⁵³ The carboxyl carbon appears at 173–174 ppm.^102,103

Such dynamical effects can be modeled by computing the chemical shifts for the two different possible proton positions and Boltzmann averaging over them. This has been done for the carboxylic acid dimers in aspirin and salicyclic acid,¹⁰⁴ for instance. We perform similar analysis here. The chemical shift tensors were averaged over configurations A and B according to:


σ_O1 = P^Aσ^A_CO + P^Bσ^A_C–OH	(9)


σ_O2 = P^Aσ^A_C–OH + P^Bσ^B_CO	(10)


σ_C = P^Aσ^A_C + P^Bσ^A_C	(11)

where P^A and P^B are Boltzmann weights for the two configurations,


	(12)

To test the performance of the scaling models derived here, configuration A was generated by optimizing the experimental benzoic acid crystal structure (CSD code BENZAC02) using periodic PBE-D2 (all atom, fixed lattice parameters). Configuration B was generated by transferring the hydrogens to the opposite oxygens and re-optimizing. The resulting energies predict that configuration B is more stable by 1.13 kJ mol⁻¹, so P^A = 0.387 and P^B = 0.613 at 298 K. Fragment, cluster/fragment, and GIPAW NMR chemical shielding calculations were then performed using each density functional. For each model, the final isotropic chemical shifts were obtained by diagonalizing the tensors obtained from eqn (9)–(11), averaging the principal components to obtain isotropic shieldings, and converting the shieldings to shifts according to the linear regression parameters reported in Table 1.

Fig. 10 shows the two unique ¹⁷O chemical shifts predicted for each of the two configurations at the cluster/fragment PBE0 level. The shifts resulting from room-temperature Boltzmann-averaging according to eqn (9)–(12) are summarized in Table 5 for all six functionals using the fragment and cluster/fragment models. As expected from previous results¹³ and the discussion in Sections 5.1 and 5.2, the carboxyl carbon shift predicted with each functional varies by only ∼0.1–0.3 ppm between the fragment and cluster/fragment methods. The fragment approach shifts differ from the GIPAW PBE value by 1–1.5 ppm, but they still lie within 1–2 ppm of the experimental ¹³C shift. Similarly good agreement is seen for OPBE, PBE0, and B3LYP.

Table 5 Predicted and experimental carboxyl-¹³C and ¹⁷O isotropic chemical shifts for benzoic acid

Functional	¹³C fragment	¹³C cluster/fragment	¹³C GIPAW	¹⁷O fragment	¹⁷O cluster/fragment	¹⁷O GIPAW
a Ref. 102 and 103. b Ref. 53. Experimental spectrum showed a single broad, unresolved peak.
OPBE	175.5	175.3		215.2, 233.4	216.4, 238.8
PBE	175.3	175.1	173.8	211.2, 229.6	213.1, 235.8	211.6, 237.8
TPSS	176.7	176.5		212.9, 231.3	214.5, 237.2
TPSSh	176.7	176.5		213.0, 231.6	214.6, 237.3
PBE0	175.4	175.2		212.0, 231.0	213.7, 236.5
B3LYP	175.7	175.4		210.8, 229.7	212.8, 235.7
Experiment		173–174^a			230^b

Analysis of the oxygen atoms is slightly more difficult, since the experimental study did not resolve the two distinct shifts. Nevertheless, the predicted two-body fragment PBE0 shifts (213 and 231 ppm) and the cluster/fragment PBE0 shifts (214 and 237 ppm) are reasonably consistent with the broad experimental peak assigned to 230 ppm. They are also within a couple ppm of the GIPAW PBE predictions of 212 and 238 ppm. Similar shifts are obtained with fragment-based approaches for most of the other functionals, too. In summary, the fragment approaches predict these carboxylic acid atom shifts in good agreement with both GIPAW and experiment.

7.4 C–nitrosoarene complex

C–nitroso compounds represent an important class of organic compounds with chemical, biological and pharmaceutical relevance.^30,105 Of particular interest for the present work is the wide span of chemical shieldings nitrosoarene–metal complexes display. For instance, p-NMe₂C₆H₄¹⁵NO, p-[¹⁵N]-nitroso-N,N-dimethylaniline (NODMA) has one of the largest ¹⁵N chemical shift anisotropies known.^106,107 In the present work, we examine the ¹⁷O chemical shielding for SnCl₂(CH₃)₂(NODMA)₂, hereafter referred to as compound 2 and depicted in Fig. 11. Strong tin–oxygen interactions give rise to an experimental ¹⁷O isotropic chemical shift of 717 ppm relative to liquid water.³⁰ The largest chemical shift included in the ¹⁷O test set is approximately 350 ppm, therefore the chemical shift for compound 2 lies well outside the range of shifts which comprise the test set. This test case allows us to simultaneously assess the accuracy of fragment methods in the context of organometallic molecular crystals and the accuracy of the scaling parameters when applied to chemical environments significantly different from those included in the test set.


	Fig. 11 Structure of compound 2: SnCl₂(CH₃)₂(NODMA)₂.

Compound 2 exhibits static disorder about the oxygen–nitrogen bond. The two dominant configurations were observed with occupancy rations of about 3 [thin space (1/6-em)] :1 in the X-ray crystallography. We performed all-atom optimizations with fixed lattice parameters on each of the three possible crystal structures for compound 2 (CSD code BISVII01) as outlined in Section 3.2. The two minor configurations had stabilities of +2.5 and +9.9 kJ mol⁻¹ relative to the major one, and the corresponding room temperature Boltzmann factors suggest populations of approximately 72%, 26%, and 1%, in good agreement with the 3 [thin space (1/6-em)] :1 ratio cited experimentally for the two most important structures. The original experimental study of this nitrosoarene ascribes the oxygen shift of 717 ppm to the dominant configuration,³⁰ so we focus on that structure here. Though oxygen chemical shift calculations benefit appreciably from explicit treatment of many-body effects via the cluster/fragment approach, pairwise-only fragment calculations with a 6 Å two-body cut off and electrostatic embedding were performed instead due to the large size of this system. Table 6 reports the absolute and scaled isotropic shifts and the error in the predicted shifts relative to the experiment.

Table 6 Predicted isotropic ¹⁷O chemical shifts (in ppm) for compound 2 using a two-body fragment method and scaling parameters reported in Table 1. The experimental shift is 717 ppm

Functional	Absolute shielding	Scaled shift	Error
OPBE	−428.6	770.1	53.1
PBE	−444.9	771.6	54.6
TPSS	−399.7	741.6	24.6
TPSSh	−404.5	734.1	17.1
PBE0	−450.5	748.0	31.0
B3LYP	−452.1	745.4	28.4
GIPAW (PBE)	−499.6	781.0	64.0

First, we observe that hybrid functionals reproduce the ¹⁷O isotropic shieldings to within ∼30 ppm, while GGA functionals exhibit errors nearly double that. Interestingly, the TPSS and TPSSh notably out-perform the other functionals here, with errors of only 25 and 17 ppm, respectively. The ∼30 ppm errors for the hybrid and ∼20 ppm errors for the meta-GGA/hybrid functionals are large but are plausibly within the tails of the error distributions one expects for oxygen with the two-body fragment method based on the test set results (cf.Fig. 3h). On the other hand, GIPAW PBE dramatically overestimates this shift by 64 ppm, which is an order of magnitude larger than the GIPAW oxygen test set rms errors.

Second, the errors reported in Table 6 reflect contributions from both the fragment DFT calculations and the extrapolation of the scaling parameters well-outside the range for which they were fitted. Although the statistical cross-validation studies demonstrate low sensitivity of the regression parameters to the choice of fitting set, those studies included only shifts in the ∼50–350 ppm range. If, hypothetically, one were to expand the test set to include compound 2, comparing the resulting linear regression parameters with those from the initial test set would provide further insights into the robustness of the regression parameters.

Fig. 12 illustrates the differences in the linear regression parameters for the ¹⁷O test set upon including compound 2. The magnitude of the differences in regression parameters in Fig. 12 directly correlate with the magnitude of the errors given in Table 6. Accordingly, both the TPSS and TPSSh density functionals demonstrate the smallest variations in the regression parameters upon including compound 2. With these new regression parameters for the cluster/fragment model, individual predicted shifts in the oxygen test set would vary by an rms ∼2 ppm. However, the overall rms errors versus experiment would increase by only a few tenths of a ppm.


	Fig. 12 Differences in linear regression parameters for the ¹⁷O test set resulting from the inclusion of compound 2 for the (a) slopes and (b) intercepts.

Table 7 Linear regression parameters for the ¹⁷O test set with compound 2 included. Errors in the predicted shieldings for compound 2 using the linear regression parameters from the expanded test set

Functional	Slope	Intercept	Error
OPBE	−1.0794	276.42	22.0
PBE	−1.0765	261.05	22.9
TPSS	−1.1316	275.55	10.9
TPSSh	−1.1052	277.63	7.7
PBE0	−1.0250	269.02	13.7
B3LYP	−1.0319	263.27	12.8
PBE (GIPAW)	−0.9934	247.48	26.7

These results raise the question of whether compound 2 should be included in the test set. Its inclusion would improve the prediction for its oxygen chemical shift (Table 7) and would extend the range of oxygen chemical shifts upon which the regression parameters are included. That could improve the realm of applicability of the regression parameters. On the other hand, due to its extreme 717 ppm chemical shift, this single data point has an outsized effect on the regression line through the more typical ∼50–350 ppm range of oxygen shifts. For that reason, we excluded this shift from the test set. Even without including this particular shift in the test set, the ∼20–30 ppm errors obtained for this oxygen with the fragment approach remain tolerable, demonstrating the broad range of applicability of the test-set-derived parameters. Of course, if one is interested in chemical shifts outside the more typical range, one might wish to include compound 2 and perhaps other similar species in constructing the regression model.

8 Conclusions

In conclusion, a series of benchmark calculations assessing the performance of fragment, cluster and combined cluster/fragment models for predicting ¹H, ¹³C, ¹⁵N, and ¹⁷O isotropic chemical shifts in molecular crystals using a variety of density functionals have been carried out. Test sets have been assembled for each nucleus which enable one to validate chemical shift predictions against experiment. The following key conclusions can be drawn from this work.

• Fragment, cluster, and combined cluster/fragment methods using a 6 Å two-body cut off and a 4 Å cluster size demonstrate comparable performance for ¹H, ¹³C, and ¹⁵N isotropic shift prediction. However, local many-body effects result in improved accuracy (∼2 ppm reduction in rms error) for predicted ¹⁷O isotropic shifts using a cluster-based approach. Accordingly, the two-body fragment approach can be used for most applications, but a cluster/fragment approach should be used for oxygen when higher accuracy is needed.

• Hybrid functionals out-perform GGA-type functionals for ¹H, ¹³C, ¹⁵N, and ¹⁷O nuclei, regardless of the model used. Among the functionals tested, the hybrid functionals PBE0 and B3LYP stand out as the functionals of choice for modeling NMR chemical shifts. Whether the improved performance of hybrid functionals is seen for other NMR active nuclei will be explored in future work. The benchmarks here do not reproduce earlier findings which found particularly good performance for the TPSS and TPSSh meta-GGA/hybrid functionals. Those functionals do perform particularly well for the unusual oxygen chemical shift in nitrosoarene compound 2, though it is unclear how broadly one should generalize from that single data point.

• The fragment-based approaches exhibit accuracy that is highly competitive with GIPAW. Two-body fragment PBE0 and B3LYP out-perform GIPAW PBE on the ¹H, ¹³C, and ¹⁵N test sets by 20–30%. For ¹⁷O, where many-body effects are particularly important, the cluster/fragment PBE0 and B3LYP models produce rms errors that are about half a ppm (8%) worse than those from GIPAW PBE.

• Linear regression parameters mapping absolute chemical shieldings to observable chemical shifts have been provided for six different density functionals (OPBE, PBE, TPSS, PBE0, B3LYP, and TPSSh) using both fragment and cluster/fragment methods and the locally dense 6-311+G(2d,p)/6-311G(d,p)/6-31G basis combination. Mean scaling parameters obtained via statistical cross-validation (Table 2) are in excellent agreement with those obtained directly from the test set (Table 1), with only small variations in the regression parameters for any of the nuclei and density functionals included in the analysis. For general applications, we recommend the use of the linear regression parameters presented in Table 1.

• The applicability of these regression parameters for each nucleus to systems not included in the test set was demonstrated on several systems, including assignment of the ¹H and ¹³C HETCOR spectrum of 9-TBAE, investigation of the nitrogen chemical shifts in histidine co-crystals, analysis of the carboxylic acid group shifts in benzoic acid, and prediction of the unusually far downfield oxygen chemical shift of oxygen bound to a tin atom in a C–nitrosoarene.

These models provide practical, high-accuracy alternatives to existing plane wave methods in organic molecular crystals. In the future, it will be interesting to explore the performance of these models more widely. Fragment methods generally perform well in systems with sufficiently large band gaps,^108–110 and so one might expect the chemical shift prediction methods developed here to be effective in a variety of non-metallic systems. On the other hand, some nuclei (like ¹⁷O) are more sensitive to many-body effects, so the impact of the truncating the many-body expansion should be tested on other nuclei.

Acknowledgements

Funding for this work from the National Science Foundation (CHE-1362465 for J. H. and G. B., NSF DMR-1508099 for R. K. and L. M.), the National Institutes of Health (R01GM097569 for L. M.), and supercomputer time from XSEDE (TG-CHE110064 to G. B.) are gratefully acknowledged. GMD thanks the European Research Council under the European Union's Seventh Framework Programme (FP/2007-2013)/ERC through grant agreement no. 307358 (ERC-stG-2012-ANGLE), and acknowledges the ARCHER UK National Supercomputing Service via UK's HEC Materials Chemistry Consortium membership, which is funded by the EPSRC (EP/L000202). We thank Robert Young for many helpful discussions. We thank B. P. T. Fokwa and H. Park for their assistance with the powder X-ray diffraction experiments.

References

A. S. Tatton, T. N. Pham, F. G. Vogt, D. Iuga, A. J. Edwards and S. P. Brown, CrystEngComm, 2012, 14, 2654 RSC.
M. B. Ferraro and J. C. Facelli, J. Mol. Struct., 2002, 603, 159–164 Search PubMed.
D. Stueber, Concepts Magn. Reson., Part A, 2006, 28, 347–368 CrossRef.
C. Pickard and F. Mauri, Phys. Rev. B: Condens. Matter Mater. Phys., 2001, 63, 245101 CrossRef.
J. R. Yates, C. J. Pickard and F. Mauri, Phys. Rev. B: Condens. Matter Mater. Phys., 2007, 76, 024401 CrossRef.
C. Bonhomme, C. Gervais, F. Babonneau, C. Coelho, F. Pourpoint, T. Azais, S. E. Ashbrook, J. M. Griffin, J. R. Yates, F. Mauri and C. J. Pickard, Chem. Rev., 2012, 112, 5733–5779 CrossRef CAS PubMed.
E. Salager, G. M. Day, R. S. Stein, C. J. Pickard, B. Elena and L. Emsley, J. Am. Chem. Soc., 2010, 132, 2564–2566 CrossRef CAS PubMed.
M. Baias, C. M. Widdifield, J.-N. Dumez, H. P. G. Thompson, T. G. Cooper, E. Salager, S. Bassil, R. S. Stein, A. Lesage, G. M. Day and L. Emsley, Phys. Chem. Chem. Phys., 2013, 15, 8069–8080 RSC.
K. Kalakewich, R. Iuliucci and J. K. Harper, Cryst. Growth Des., 2013, 13, 5391–5396 CAS.
L. J. Mueller and M. F. Dunn, Acc. Chem. Res., 2013, 46, 2008–2017 CrossRef CAS PubMed.
M. Dračínský and P. Hodgkinson, RSC Adv., 2015, 5, 12300–12310 RSC.
J. D. Hartman and G. J. O. Beran, J. Chem. Theory Comput., 2014, 10, 4862–4872 CrossRef CAS PubMed.
J. D. Hartman, S. Monaco, B. Schatschneider and G. J. O. Beran, J. Chem. Phys., 2015, 143, 102809 CrossRef PubMed.
S. T. Holmes, R. J. Iuliucci, K. Mueller and C. Dybowski, J. Chem. Theory Comput., 2015, 11, 5229–5241 CrossRef CAS PubMed.
D. J. T. Michael Lodewyk and M. R. Siebert, Chem. Rev., 2012, 112, 1839–1862 CrossRef PubMed.
G. J. O. Beran and K. Nanda, J. Phys. Chem. Lett., 2010, 1, 3480–3487 CrossRef CAS.
K. Kitaura, E. Ikeo, T. Asada, T. Nakano and M. Uebayasi, Chem. Phys. Lett., 1999, 313, 701–706 CrossRef CAS.
H.-J. Tan and R. P. A. Bettens, Phys. Chem. Chem. Phys., 2013, 15, 7541–7547 RSC.
D. M. Reid and M. A. Collins, Phys. Chem. Chem. Phys., 2015, 17, 5314–5320 RSC.
Q. Gao, S. Yokojima, D. G. Fedorov, K. Kitaura, M. Sakurai and S. Nakamura, J. Chem. Theory Comput., 2010, 6, 1428–1444 CrossRef CAS.
C. Adamo and V. Barone, J. Chem. Phys., 1999, 110, 6158 CrossRef CAS.
P. J. Stephens, F. J. Devlin, C. F. Chabalowski and M. J. Frisch, J. Phys. Chem., 1994, 98, 11623–11627 CrossRef CAS.
R. P. Young, B. G. Caulkins, D. Borchardt, D. N. Bullo ch, C. K. Larive, M. F. Dunn and L. J. Mueller, Angew. Chem., Int. Ed., 2016, 55, 1350–1354 CrossRef CAS PubMed.
J. Mason, Chem. Rev., 1981, 81, 205 CrossRef CAS.
V. Lemaitre, M. E. Smith and A. Watts, Solid State Nucl. Magn. Reson., 2004, 26, 215–235 CrossRef CAS PubMed.
C. Gervais, R. Dupree, K. J. Pike, C. Bonhomme, M. Profeta, C. J. Pickard and F. Mauri, J. Phys. Chem. A, 2005, 109, 6960–6969 CrossRef CAS PubMed.
J. P. Perdew, K. Burke and M. Ernzerhof, Phys. Rev. Lett., 1996, 77, 3865 CrossRef CAS PubMed.
Y. Zhang, A. Wu, X. Xu and Y. Yan, Chem. Phys. Lett., 2006, 421, 383–388 CrossRef CAS.
J. Tao, J. Perdew, V. Staroverov and G. Scuseria, Phys. Rev. Lett., 2003, 91, 146401 CrossRef PubMed.
G. Wu, J. Zhu, R. Wang and V. Terskikh, J. Am. Chem. Soc., 2010, 132, 5143–5155 CrossRef CAS PubMed.
A. J. Stone, Chem. Phys. Lett., 1981, 83, 233–239 CrossRef CAS.
A. J. Stone and M. Alderton, Mol. Phys., 1985, 56, 1047–1064 CrossRef CAS.
A. J. Stone, J. Chem. Theory Comput., 2005, 1, 1128–1132 CrossRef CAS PubMed.
C. Ochsenfeld, J. Kussmann and F. Koziol, Angew. Chem., Int. Ed., 2004, 43, 4485–4489 CrossRef CAS PubMed.
J. Zienau, J. Kussmann and C. Ochsenfeld, Mol. Phys., 2010, 108, 333–342 CrossRef CAS.
S. Loibl, F. R. Manby and M. Schütz, Mol. Phys., 2010, 108, 477–485 CrossRef CAS.
R. K. Harris, E. D. Becker, S. M. Cabral de Menezes, P. Granger, R. E. Hoffman and K. W. Zilm, Pure Appl. Chem., 2008, 80, 59–84 CrossRef CAS.
J. P. Bradley, S. P. Velaga, O. N. Antzutkin and S. P. Brown, Cryst. Growth Des., 2011, 11, 3463–3471 CAS.
A. C. Uldry, J. M. Griffin, J. R. Yates, M. P. Torralba, M. D. Maria, A. L. Webber, M. L. Beaumont, A. Samoson, R. M. Claramunt, C. J. Pickard and S. P. Brown, J. Am. Chem. Soc., 2008, 130, 945–954 CrossRef CAS PubMed.
M. Sardo, S. M. Santos, A. A. Babaryk, C. Lopez, I. Alkorta, J. Elguero, R. M. Claramunt and L. Mafra, Solid State Nucl. Magn. Reson., 2015, 65, 49–63 CrossRef CAS PubMed.
R. K. Harris and P. Jackson, J. Phys. Chem. Solids, 1987, 48, 813–818 CrossRef CAS.
A. Abraham, D. C. Apperley, T. Gelbrich, R. K. Harris and U. J. Griesser, Can. J. Chem., 2011, 89, 770–778 CrossRef CAS.
E. Carignani, S. Borsacchi, J. P. Bradley, S. P. Brown and M. Geppi, J. Phys. Chem. C, 2013, 117, 17731–17740 CAS.
J. R. Yates, S. E. Dobbin, C. J. Pickard, F. Mauri, P. Y. Ghi and R. K. Harris, Phys. Chem. Chem. Phys., 2005, 7, 1402–1407 RSC.
R. K. Harris, P. Hodgkinson, V. Zorin, J. N. Dumez, B. E. Herrmann, L. Emsley, E. Salager and R. S. Stein, Magn. Reson. Chem., 2010, 48, S103–S112 CrossRef CAS PubMed.
S. Sharif, D. Schagen, M. Toney and H. Limbach, J. Am. Chem. Soc., 2007, 129, 4440–4455 CrossRef CAS PubMed.
Y. Wei, A. C. Dios and A. E. McDermott, J. Am. Chem. Soc., 1999, 121, 10389–10394 CrossRef CAS.
J. Z. Hu, J. C. Facelli, D. W. Alderman, R. J. Pugmire and D. M. Grant, J. Am. Chem. Soc., 1998, 120, 9863–9869 CrossRef CAS.
E. D. Smith, R. B. Hammond, M. J. Jones, K. J. Roberts, J. B. Mitchell, S. L. Price, R. K. Harris, D. C. Apperkey, J. C. Cherryman and R. Docherty, J. Phys. Chem. B, 2001, 105, 5818–5826 CrossRef CAS.
Z. J. Li, Y. Abramov, J. Bordner, J. Leonard, A. Medek and A. V. Trask, J. Am. Chem. Soc., 2006, 128, 8199 CrossRef CAS PubMed.
G. Wu, S. Dong, R. Ida and N. Reen, J. Am. Chem. Soc., 2002, 124, 1768 CrossRef CAS PubMed.
K. Yamada, S. Dong and G. Wu, J. Am. Chem. Soc., 2000, 122, 11602–11609 CrossRef CAS.
S. Dong, K. Yamada and G. Wu, Z. Naturforsch., A: Phys. Sci., 2000, 55, 21–28 CrossRef CAS.
V. Lemaitre, K. Pike, A. Watts, T. Anupold, A. Samoson, M. Smith and R. Dupree, Chem. Phys. Lett., 2003, 371, 91–97 CrossRef CAS.
K. J. Pike, V. Lemaitre, A. Kukol, T. Anupold, A. Samoson, A. P. Howes, A. Watts, M. E. Smith and R. Dupree, J. Phys. Chem. B, 2004, 108, 9256–9263 CrossRef CAS.
A. Rorick, M. Michael, L. Yang and Y. Zhang, J. Phys. Chem. B, 2015, 119, 11618–11625 CrossRef CAS PubMed.
P. Giannozzi, S. Baroni, N. Bonini, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, G. L. Chiarotti, M. Cococcioni, I. Dabo, A. Dal Corso, S. de Gironcoli, S. Fabris, G. Fratesi, R. Gebauer, U. Gerstmann, C. Gougoussis, A. Kokalj, M. Lazzeri, L. Martin-Samos, N. Marzari, F. Mauri, R. Mazzarello, S. Paolini, A. Pasquarello, L. Paulatto, C. Sbraccia, S. Scandolo, G. Sclauzero, A. P. Seitsonen, A. Smogunov, P. Umari and R. M. Wentzcovitch, J. Phys.: Condens. Matter, 2009, 21, 395502 CrossRef PubMed.
J. Antony and S. Grimme, Phys. Chem. Chem. Phys., 2006, 8, 5287–5293 RSC.
Y. N. Heit, K. D. Nanda and G. J. O. Beran, Chem. Sci., 2016, 7, 246–255 RSC.
Y. N. Heit and G. J. O. Beran, Acta Crystallogr., Sect. B: Struct. Sci., Cryst. Eng. Mater., 2016, 72 DOI:10.1107/S2052520616005382.
G. J. O. Beran, J. Chem. Phys., 2009, 130, 164115 CrossRef PubMed.
S. Wen and G. J. O. Beran, J. Chem. Theory Comput., 2011, 7, 3733–3742 CrossRef CAS PubMed.
M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, M. A. Robb, J. R. Cheeseman, G. Scalmani, V. Barone, B. Mennucci, G. A. Petersson, H. Nakatsuji, M. Caricato, X. Li, H. P. Hratchian, A. F. Izmaylov, J. Bloino, G. Zheng, J. L. Sonnenberg, M. Hada, M. Ehara, K. Toyota, R. Fukuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, H. Nakai, T. Vreven, J. A. Montgomery Jr, J. E. Peralta, F. Ogliaro, M. Bearpark, J. J. Heyd, E. Brothers, K. N. Kudin, V. N. Staroverov, R. Kobayashi, J. Normand, K. Raghavachari, A. Rendell, J. C. Burant, S. S. Iyengar, J. Tomasi, M. Cossi, N. Rega, J. M. Millam, M. Klene, J. E. Knox, J. B. Cross, V. Bakken, C. Adamo, J. Jaramillo, R. Gomperts, R. E. Stratmann, O. Yazyev, A. J. Austin, R. Cammi, C. Pomelli, J. W. Ochterski, R. L. Martin, K. Morokuma, V. G. Zakrzewski, G. A. Voth, P. Salvador, J. J. Dannenberg, S. Dapprich, A. D. Daniels, Ã. Farkas, J. B. Foresman, J. V. Ortiz, J. Cioslowski and D. J. Fox, Gaussian 09 Revision A.1, 2009, Gaussian Inc., Wallingford CT Search PubMed.
R. Krishnan, J. S. Binkley, R. Seeger and J. A. Pople, J. Chem. Phys., 1980, 72, 650–654 CrossRef CAS.
A. D. McLean and G. S. Chandler, J. Chem. Phys., 1980, 72, 5639–5648 CrossRef CAS.
M. J. Frisch, J. A. Pople and J. S. Binkley, J. Chem. Phys., 1984, 80, 3265–3269 CrossRef CAS.
T. Clark, J. Chandrasekhar, G. W. Spitznagel and P. v. R. Schleyer, J. Comput. Chem., 1983, 4, 294–301 CrossRef CAS.
W. J. Hehre, R. Ditchfield and J. A. Pople, J. Chem. Phys., 1972, 56, 2257–2261 CrossRef CAS.
P. C. Hariharan and J. A. Pople, Theor. Chim. Acta, 1973, 28, 213–222 CrossRef CAS.
D. B. Chesnut and K. D. Moore, J. Comput. Chem., 1989, 10, 648–659 CrossRef CAS.
D. B. Chesnut, B. E. Rusiloski, K. D. Moore and D. A. Egolfs, J. Comp. Chem., 1993, 14, 1364–1375 CrossRef CAS.
J. D. Hartman, T. J. Neubauer, B. G. Caulkins, L. J. Mueller and G. J. O. Beran, J. Biomol. NMR, 2015, 62, 327–340 CAS.
L. S. C. Martins, F. A. L. de Souza, G. A. Ceolin, F. E. Jorge, R. C. de Barredo and C. T. Campos, Comput. Theor. Chem., 2013, 1013, 62–69 CrossRef CAS.
GDMA, Distributed Multipole Analysis of Gaussian Wavefunctions, version 2.2.09, A. J. Stone, http://http://www-stone.ch.cam.ac.uk/pub/gdma/, accessed may 28, 2014 Search PubMed.
S. J. Clark, M. D. Segall, C. J. Pickard, P. J. Hasnip, M. I. Probert, K. Refson and M. C. Payne, Z. Kristallogr., 2005, 220, 567–570 CAS.
P. Caravatti, L. Braunshweller and R. R. Ernst, Chem. Phys. Lett., 1983, 100, 305–310 CrossRef CAS.
A. Detken, E. H. Hardy, M. Ernst and B. H. Meier, Chem. Phys. Lett., 2002, 356, 298–304 CrossRef CAS.
C. R. Morcombe and K. W. Zilm, J. Magn. Reson., 2003, 162, 479–486 CrossRef CAS PubMed.
R. K. Harris, E. D. Becker, D. M. Cabral, M. Sonia, P. Granger, R. E. Hoffman and K. W. Zilm, Solid-State NMR Spectrosc., 2008, 33, 41–56 CrossRef CAS PubMed.
B. Civalleri, C. M. Zicovich-Wilson, L. Valenzano and P. Ugliengo, CrystEngComm, 2008, 10, 405–410 RSC.
A. Otero-de-la Roza and E. R. Johnson, J. Chem. Phys., 2012, 137, 054103 CrossRef CAS PubMed.
L. Kronik and A. Tkatchenko, Acc. Chem. Res., 2014, 47, 3208–3216 CrossRef CAS PubMed.
G. J. O. Beran, Chem. Rev., 2016, 116, 5567–5613 CrossRef CAS PubMed.
J. van de Streek and M. A. Neumann, Acta Crystallogr., Sect. B: Struct. Sci., 2010, 66, 544–558 CAS.
J.-N. Dumez and C. J. Pickard, J. Chem. Phys., 2009, 130, 104701 CrossRef PubMed.
I. D. Gortari, G. Portella, X. Salvatella, V. S. Bajaj, P. C. A. van der Wel, J. R. Yates, M. D. Segall, C. J. Pickard, M. C. Payne, M. Vendruscolo, I. De Gortari, G. Portella, X. Salvatella, V. S. Bajaj, P. C. A. van der Wel, J. R. Yates, M. D. Segall, C. J. Pickard, M. C. Payne, M. Vendruscolo, I. D. Gortari, P. C. A. V. D. Wel and X. J. R. Yates, J. Am. Chem. Soc., 2010, 132, 5993–6000 CrossRef PubMed.
M. Dračínský and P. Hodgkinson, CrystEngComm, 2013, 15, 8705 RSC.
T. E. Exner, A. Frank, I. Onila and H. M. Möller, J. Chem. Theory Comput., 2012, 8, 4818–4827 CrossRef CAS PubMed.
A. M. Teale, O. B. Lutnaes, T. Helgaker, D. J. Tozer and J. Gauss, J. Chem. Phys., 2013, 138, 024111 CrossRef PubMed.
M. Dračínský, P. Bouř and P. Hodgkinson, J. Chem. Theory Comput., 2016, 12, 968–973 CrossRef PubMed.
S. T. Holmes, R. J. Iuliucci, K. T. Mueller and C. Dybowski, J. Chem. Phys., 2014, 141, 164121 CrossRef PubMed.
C. Gervais, R. Dupree, K. J. Pike, C. Bonhomme, M. Profeta, C. J. Pickard and F. Mauri, J. Phys. Chem. A, 2005, 109, 6960–6969 CrossRef CAS PubMed.
J. Zhu, A. J. Geris and G. Wu, Phys. Chem. Chem. Phys., 2009, 11, 6972–6980 RSC.
L. A. O'Dell, R. W. Schurko, K. J. Harris, J. Autschbach and C. I. Ratcliffe, J. Am. Chem. Soc., 2011, 133, 527–546 CrossRef PubMed.
J. R. Yates, C. J. Pickard, M. C. Payne, R. Dupree, M. Profeta and F. Mauri, J. Phys. Chem. A, 2004, 108, 6032–6037 CrossRef CAS.
I. De Gortari, M. Galván, J. Ireta, M. Segall, C. J. Pickard and M. Payne, J. Phys. Chem. A, 2007, 111, 13099–13105 CrossRef CAS PubMed.
A. Wong, A. P. Howes, J. R. Yates, A. Watts, T. Anupõld, J. Past, A. Samoson, R. Dupree and M. E. Smith, Phys. Chem. Chem. Phys., 2011, 13, 12213–12224 RSC.
R. O. Al-Kaysi and A. M. Muller, and C. J. Bardeen, J. Am. Chem. Soc., 2006, 128, 15938–15939 CrossRef CAS PubMed.
L. Zhu, A. Agarwal, J. Lai, R. Al-Kaysi, F. Tham, T. Ghaddar, L. Mueller and C. Bardeen, J. Mater. Chem., 2011, 21, 6258–6268 RSC.
X. Song, C. M. Rienstra and A. E. McDermott, Magn. Reson. Chem., 2001, 39, S30–S36 CrossRef CAS.
R. Li, J. A. Zeitler, D. Tomerini, E. P. J. Parrott, L. F. Gladden and G. M. Day, Phys. Chem. Chem. Phys., 2010, 12, 5329 RSC.
J. Kempf, H. W. Spiess, U. Haeberlen and H. Zimmermann, Chem. Phys., 1974, 4, 269–276 CrossRef CAS.
S. Nagaoka, T. Terao, F. Imashiro, A. Saika, N. Hirota and S. Hayashi, Chem. Phys. Lett., 1981, 80, 580–584 CrossRef CAS.
X. Kong, M. Shan, V. Terskikh, I. Hung, Z. Gan and G. Wu, J. Phys. Chem. B, 2013, 117, 9643–9654 CrossRef CAS PubMed.
W. Rice, C. Schaeffer, B. Harten, F. Villinger, T. South, M. Summers, L. Henderson, W. Bess, L. Arthur, S. McDougal, L. Orloff, J. Mendeleyev and E. Kun, Nature, 1993, 361, 473–475 CrossRef CAS PubMed.
M. Lumsden, G. Wu, R. Wasylishen and R. Curtis, J. Am. Chem. Soc., 1993, 115, 2825–2832 CrossRef CAS.
R. Salzmann, M. Wojdelski, M. McMahon, R. Havlin and E. Oldfield, J. Am. Chem. Soc., 1998, 120, 1349–1356 CrossRef CAS.
M. S. Gordon, D. G. Fedorov, S. R. Pruitt and L. Slipchenko, Chem. Rev., 2012, 112, 632–672 CrossRef CAS PubMed.
M. A. Collins and R. P. A. Bettens, Chem. Rev., 2015, 115, 5067–5642 CrossRef PubMed.
K. Raghavachari and A. Saha, Chem. Rev., 2015, 115, 5643–5677 CrossRef CAS PubMed.

Footnote

† Electronic supplementary information (ESI) available: All crystal structures used, experimental chemical shifts, convergence tests, detailed benzoic acid results, and the predicted fragment PBE0 and GIPAW PBE chemical shifts are reported for each of the test sets studied here. See DOI: 10.1039/c6cp01831a

Click here to see how this site uses Cookies. View our privacy policy here.

Benchmark fragment-based 1H, 13C, 15N and 17O chemical shift predictions in molecular crystals†