Hannah E.
Kerr
a,
Helen E.
Mason
a,
Hazel A.
Sparkes
b and
Paul
Hodgkinson
*a
aDepartment of Chemistry, Durham University, Science Site, Durham DH1 3LE, UK. E-mail: paul.hodgkinson@durham.ac.uk
bSchool of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK
First published on 2nd August 2016
The crystal structure of a new 1:
2 caffeine–citric acid hydrate cocrystal is presented. The caffeine molecules are disordered over two positions, with the nature of the disorder confirmed to be static by 13C solid-state NMR. NMR linewidths in statically disordered systems reflect the distribution of local chemical environments, and this study investigates whether the disorder contribution to 13C linewidths can be predicted computationally. The limits of NMR crystallography calculations using density functional theory are tested by investigating how geometry optimisation conditions affect calculated NMR parameters. Careful optimisation is shown to reduce differences between 13C constants of symmetry-related sites to about 0.1 ppm. This is just sufficient to observe a correlation between calculated and experimental linewidths, and also show that systematic errors associated with geometry optimisation do not compromise other applications of “NMR crystallography”. In addition, the unit cell enthalpies calculated after careful optimisations provide insight into why the disordered structure is adopted.
Molecular solids involving caffeine are frequently observed to involve disorder due to the nearly symmetrical structure of the caffeine molecule. This is exemplified by the two forms of anhydrous caffeine. The β form crystallises in C2/c and has two and a half independent molecules in the asymmetric unit; one fully ordered and the others exhibiting different orientational disorder over two positions. This challenging structure was determined using single crystal X-ray diffraction data in combination with high field 13C solid-state NMR (SSNMR).5 Above its glass transition, the α form is a rotator phase of high symmetry (space group is Rc) with the caffeine molecules orientationally disordered on their lattice sites. This disorder is at least partially frozen out below the glass transition, and so the 13C lineshapes of both α and β forms at room temperature are significantly broadened compared to caffeine hydrate, in which the caffeine molecules are ordered.6,7 Caffeine disorder has also been observed in several caffeine-containing cocrystals,8–10 though the nature of the disorder has yet to be determined.
For 13C shifts calculated via the gauge-including-projector-augmented-wave (GIPAW) approach,11 experimental chemical shifts and calculated shieldings are generally found to agree within about 2 ppm.4 Similar levels of agreement have been observed using alternative approaches to calculating 13C shifts in solids.12,13 The fact that different correlations are obtained between experimental shifts and calculated shieldings for different types of carbon12–14 implies, however, that a significant fraction of this uncertainty is systematic. In the case of sites with the same chemical environment (including the limit of symmetry-related sites), the systematic “uncertainties” should be much lower, and are likely to be dominated by incomplete optimisation of the geometry, which is a pre-requisite for obtaining good correlations between experimental and calculated parameters. Here we assess whether these uncertainties can be reduced to the point such that 13C linewidths can be predicted for disordered systems, using a previously unreported hydrated cocrystal between caffeine and citric acid (1:
2) as a model system.
Empirical formula | C20H30N4O18 |
Formula weight/g mol−1 | 614.48 |
Temperature/K | 120(2) |
Crystal system | Monoclinic |
Space group | C2/c |
a/Å | 16.7407(6) |
b/Å | 9.5561(3) |
c/Å | 16.5918(6) |
β/° | 90.525(3) |
Volume/Å3 | 2654.17(16) |
Z | 4 |
ρ calc/g cm−3 | 1.538 |
μ/mm−1 | 0.137 |
F(000) | 1288.0 |
Crystal size/mm3 | 0.44 × 0.29 × 0.14 |
Radiation | Mo Kα (λ = 0.71073 Å) |
2Θ range for data collection/° | 5.478 to 61.266 |
Index ranges | −23 ≤ h ≤ 23, −13 ≤ k ≤ 13, −23 ≤ l ≤ 23 |
Reflections collected | 13![]() |
R int | 0.0531 |
Data/restraints/parameters | 3787/1/240 |
Goodness-of-fit on F2 | 1.050 |
Final R indexes [I ≥ 2σ (I)] | R 1 = 0.0457, wR2 = 0.1000 |
Final R indexes [all data] | R 1 = 0.0630, wR2 = 0.1108 |
Largest diff. peak/hole/e Å−3 | 0.38/−0.30 |
All calculations used the Perdew–Burke–Ernzerhorf functional, and either Vanderbilt ultrasoft pseudopotentials (USP)20 or the on-the-fly-generated ultrasoft pseudopotentials (OTFG). The centre of mass was fixed and a cut-off energy of either 400 eV or 700 eV was used, which is above the fine cut-off of 340 eV for the hardest element (oxygen). Integrals were taken over the Brillouin zone using a Monkhorst–Pack grid with a minimum k-point sample spacing of 0.1 Å−1, corresponding to one k-point. Geometry optimisations were carried out using the Broyden–Fletcher–Goldfarb–Shanno scheme21 using the different conditions given in Table 2.
Namea | Cut-off energy/eV | dEelectronic/10−5 eV | dEgeometry/10−5 eV | |F|max/10−2 eV per atom | Mean ΔE between pairs/eV |
---|---|---|---|---|---|
a Codes used: D – dispersion correction applied, O – on-the-fly-generated pseudopotentials used, W – convergence window increased from 2 to 10, C – unit cell optimised. b |F|max converged to within 0.1 × 10−2 eV per atom but did not reach the 0.01 × 10−2 eV per atom tolerance. | |||||
[1] | 400 | 1 | 2 | 5 | 0.036 |
[2] | 700 | 0.1 | 0.2 | 1 | 0.009 |
[2-D] | 700 | 0.1 | 0.2 | 1 | 0.012 |
[3] | 700 | 0.001 | 0.002 | 0.01b | 0.005 |
[3-D] | 700 | 0.001 | 0.002 | 0.01b | 0.004 |
[3-DO] | 700 | 0.001 | 0.002 | 0.01b | 0.002 |
[3-DW] | 700 | 0.001 | 0.002 | 0.01b | 0.004 |
[3-DC] | 700 | 0.001 | 0.002 | 0.01b | 0.005 |
All 288 atomic positions were optimised with the unit cell parameters fixed, except in case [3-DC] where the unit cell parameters were allowed to relax. dEelectronic is the threshold (energy per atom) for convergence of the electronic self-consistent field minimisation in each geometry optimisation step, while dEgeometry is the corresponding convergence threshold between successive geometry optimisation steps. The latter parameter cannot be meaningfully tightened beyond the former. |F|max is the convergence parameter on the maximum atomic force, while the convergence window sets the number of successive iterations within which both the energy and force tolerances are achieved for the optimisation to be considered converged. The dispersion correction used here is the Tkatchenko–Scheffler (TS) scheme,22 which has been shown to return reliable distance and unit cell parameters in molecular crystals.23,24 No finite basis correction was applied in the variable unit cell calculations.
Since the initial structures were obtained by manual manipulation of the atomic co-ordinates, an initial crude geometry optimisation (using CASTEP v6.0) was used to provide reasonable starting points for the subsequent tight optimisations (all performed in CASTEP v8.0). Table S1 in the ESI† gives additional information on the time taken and the number of successive iterations required to complete the geometry optimisations.
NMR calculations used OTFG pseudopotentials and cut-off energy of 700 eV in all cases. The k-point sample spacing was 0.05 Å−1 (12 k-points) with an offset of (0.25, 0.25, 0.25) in fractional co-ordinates. Simulated disorder spectra were calculated using the MagresPython25 library to read in the CASTEP outputs and pNMRsim26 to create the spectra.
D–H⋯A | d D–H/Å | d H⋯A/Å | d D⋯A/Å | D–H–A/° |
---|---|---|---|---|
1 1 − x, 1 − y, 1 − z; 2 ½ − x, ½ − y, 1 − z; 3 ½ − x, ½ + y, ½ − z; 4x, −1 + y, z; 5x, 1 + y, z. | ||||
O4–H4⋯O91 | 0.82(2) | 2.13(2) | 2.885(1) | 154(2) |
O7–H7⋯O102 | 0.92(3) | 1.63(3) | 2.550(8) | 175(2) |
O7–H7⋯N22 | 0.92(3) | 1.82(3) | 2.735(9) | 171(2) |
O1–H1A⋯O83 | 0.87(2) | 1.93(2) | 2.787(2) | 166(2) |
O1–H1B⋯O2 | 0.87(2) | 1.95(2) | 2.795(2) | 167(2) |
O5–H5⋯O14 | 0.91(1) | 1.69(1) | 2.591(2) | 169(3) |
O3–H3⋯O65 | 0.83(2) | 1.93(2) | 2.708(2) | 157(2) |
The disorder of the caffeine over the 2-fold rotation axis in C2/c means that the pairs of carbons C8/C8i and C10/C10i are crystallographically indistinguishable and share the same label, see Fig. 1(ii). They are, however, chemically distinct and are distinguished in the 13C NMR spectrum. No changes were observed in variable-temperature 13C SSNMR experiments between 22 °C and 70 °C conducted at a low magic-angle spinning rate of 3 kHz to highlight any effect of dynamics on spinning sideband patterns. Additionally, no peaks were visible in spectra acquired with short (1–5 s) recycle delays, confirming that the caffeine disorder is static rather dynamic.
The caffeine linewidths are slightly broader than the ordered caffeine hydrate, but sharper than those of the highly disordered anhydrous β-caffeine, see Fig. 2. The quantitative prediction of NMR linewidths is complicated by magnetic susceptibility effects. While magic-angle spinning is effective at removing the effects of local variations of the bulk magnetic susceptibility, χ, the anisotropy of the bulk magnetic susceptibility (ABMS) tensor is not fully averaged, resulting in a uniform but orientation-dependent shift of the NMR frequencies.28–30 Quantitative calculation of this effect on the linewidths of polycrystalline samples is non-trivial,30 but we have previously shown that the parameter |4πΔχ|, where Δχ is the anisotropy of the magnetic susceptibly tensor, provides a useful metric for the line broadening due to ABMS.31 This factor, calculated from the CASTEP-calculated susceptibility tensor, is 1.1 ppm for CCA compared to 2.0 ppm for the 1:
1 cocrystal.15 Since the linewidths for CCA are 0.09 ppm broader on average than those for the 1
:
1 co-crystal, this implies that broader lines for CCA are indeed associated with static disorder. The linewidths due to the static disorder is small, however, and their computational prediction is a challenging test of NMR crystallography protocols.
The current system is an ideal test case, since the 16 simulated-disorder models form 8 symmetry-related pairs of structures whose NMR parameters should be identical in the limit of perfect geometry optimisation. After the initial crude optimisation, however, there is no discernible pattern to the calculated lattice energies, see Fig. S1 in the ESI,† and tighter convergence of the structures is required.
Fig. 3 shows that the final energies of the 16 structures depend on the parameters of the final geometry optimisation step. Tightening the energy tolerances by one order of magnitude compared to the default, [2] and [2-D], results in the 1111 pair (yellow) converging to approximately 0.03 eV higher in energy than the other structures. Other matching pairs emerge as the convergence criteria are further tightened, [3], [3-D] and [3-DO].
![]() | ||
Fig. 3 Final energies of each structure following different optimisations. Symmetry-related pairs discussed in the main text are coloured/shaded for clarity. |
A slightly different pattern of energies is observed when the unit cell parameters are allowed to vary, [3-DC]; the 1212 pair (blue) converge to lower energy rather than the 1122 pair (green). This correlates with a larger percentage change in the α and γ unit cell angles for 1212 compared to the other simulated pairs, see Fig. S2 in the ESI.† In the material, however, the unit cells are those of the average unit cell, and so the optimisations in which the unit cell parameters are constrained to those of the mean unit cell from XRD are more physically meaningful.
Including dispersion correction systematically lowers all the energies, but does not change the relative energetic ordering. Using OTFG pseudopotentials with dispersion correction also results in the same pattern of energies, although the energy gaps between pairs (see Table 2) and different sets (see Fig. 3) are noticeably smaller. Increasing the convergence window so that the convergence criteria need to be matched over 10 iterations, [3-DW], does not improve the convergence of the final energies of symmetry-related pairs. Instead, none of the calculations converge with respect to the energy due to the over-tight termination conditions. This set of optimisations is not discussed further.
The degree of convergence between symmetry-related pairs can also be tested using the root-mean-square difference (RMSD) between the positions of the non-H atoms. As might be expected, these show the same trends, Fig. S3,† as observed in Fig. 3. The pseudo-rotation axis shown in Fig. 1 is not precisely retained following geometry optimisation, see Fig. 4, but these deviations are within the boundaries of the thermal ellipsoids i.e. the optimised structures are indistinguishable from the original crystal structure within the experimental uncertainties.
Optimisations including dispersion correction consistently lower the mean 13C shielding difference. The pattern in Fig. 5(ii) generally reflects the energy differences in Fig. 3; the smallest differences in energy between symmetry-related pairs are seen for [3-D] and [3-DO] and similarly these optimisations result in the lowest mean 13C shielding differences. The fewer iterations needed for convergence of the optimisations with dispersion correction (cf. Table S1†) may reflect a smoother energy landscape for geometry optimisation, but further work would be required to confirm this speculation.
Fig. 6 examines the relationship between the geometry optimisation and agreement with experimental chemical shifts. The calculated 13C shieldings of all simulated-disorder structures were averaged for each carbon site and then referenced using δcalc = mσcalc + c, where m and c are determined by linear regression of the average calculated 13C shieldings against the experimental 13C chemical shifts. Due to the overlap of the citric acid resonances with those of excess citric acid only the caffeine carbon atoms were considered. Given that the effects of imperfect geometry optimisation are of the order of 0.1 ppm, it is unsurprising that there is no observable correlation between the geometry optimisation protocol and the quality of agreement with experimental chemical shifts. [3-DO] shows a significantly larger mean difference compared to the optimisations using the USP pseudopotentials. This is consistent with the systematic limitations of DFT being the major source of disagreement with experiment.13,14,32 Including dispersion correction has no significant effect, in keeping with previous observations of only marginal improvement in the agreement of 15N shifts.33
![]() | ||
Fig. 6 Root mean squares differences between experimental 13C caffeine shifts and average calculated shieldings after referencing as described in the text. |
The mean difference between the shieldings of chemically related 13C sites caused by geometry optimisation conditions is significant in the simulation of CCA linewidths, see Fig. 7(i). A full description of the linewidth simulation method is given in the ESI† but, in brief, the 13C linewidths were simulated by summing the spectra of the 16 simulated-disorder structures from the [3-DO] geometry optimisation yielding a single simulated-disorder spectrum, Fig. 7(ii). The homogeneous linewidth34 of each site is taken as 1/π with a Lorentzian lineshape, where the
values are measured from spin-echo experiments, see Table S2.† The inhomogeneous linewidth (incorporating quadrupolar broadening contributions from adjacent 14N nuclei and ABMS effects) is taken from the inhomogeneous linewidths measured on the ordered 1
:
1 cocrystal and is modelled with a Gaussian lineshape, see Table S3.† There is a reasonable correlation between calculated and experimental linewidths (R2 = 0.54), but the uncertainties associated with geometry optimisation clearly limit the quality of the correlation (the uncertainties in the experimental linewidths are less than a few Hz) i.e. within the size of the symbols used.
In the case of caffeine citric acid hydrate, the uncertainties associated with geometry optimisation are significant compared to the limited line-broadening associated with the disorder. In other systems, however, such as β-caffeine, the effects of disorder on the NMR spectrum are much stronger and should be computable. The approach described may be particularly valuable where 2D NMR is being used to probe the correlation between shifts in disordered systems.34
Tight control of the geometry optimisation is also essential when trying to compare lattice energies, since the uncertainties are of a similar magnitude to the differences in lattice energies of polymorphic forms.36 Although the absolute energies of the systems are highly dependent on the optimisation protocol and DFT parameters, the relative energies of the symmetry-related pairs converge well as the optimisation criteria are tightened, giving some confidence that the lattice energies between the different (but chemically very similar) disordered structures can be meaningfully compared, cf.Fig. 3. Although the pair of structures based on the 1212 configuration, S1212, are calculated to be about 2.5 kJ mol−1 lower in energy per unit cell than a disordered structure, Sdisord, based on an equally weighted random distribution of configurations, there is an “entropic” bias towards the disordered structure (which has 16 configurations compared to 2). Very crudely, the “Boltzmann ratio” for Sdisord/S1212, is about 3 at ambient temperature (kBT = 2.5 kJ mol−1). Similar energetic arguments have recently been used to understand the presence of disorder in some, but not all, solvates of droperidol,37 and the presence of disorder in β-caffeine but not isocaffeine.19 Although crystallisation is a complex phenomenon, largely controlled by kinetic factors, kinetics are also likely to favour the less ordered structure. Hence the first-principles calculations provide insight into why this system adopts a disordered rather than an ordered structure.
Footnote |
† Electronic supplementary information (ESI) available: Including details of the optimisation calculations, correlation plots between the energies and atom displacements of the simulated disorder structures and details of the 13C NMR linewidth simulation method. CCDC 1471476. Original research data can be accessed through DOI: 10.15128/r1np193917h. For ESI and crystallographic data in CIF or other electronic format see DOI: 10.1039/c6ce01453d |
This journal is © The Royal Society of Chemistry 2016 |