Daniel M.
Dawson
*,
Iain A.
Smellie
and
Sharon E.
Ashbrook
*
School of Chemistry, EaStCHEM and Centre of Magnetic Resonance, University of St Andrews, St Andrews, KY16 9ST, UK. E-mail: dmd7@st-andrews.ac.uk; sema@st-andrews.ac.uk
First published on 1st April 2024
We use a combination of one- and two-dimensional solid-state nuclear magnetic resonance (NMR) spectroscopy and density functional theory (DFT) calculations to obtain a full assignment of the 1H and 13C signals for solid (+)-usnic acid, which contains two molecules in the asymmetric unit. By combining through-space 1H–1H correlation data with computation it is possible to assign signals not just to the same molecules (relative assignment) but to assign the signals to specific crystallographic molecules (absolute assignment). Variable-temperature measurements reveal that there is some variation in many of the 13C chemical shifts with temperature, likely arising from varying populations of different tautomeric forms of the molecule. The NMR spectrum of crystalline (+)-usnic acid is then compared with that of ground Usnea dasopoga lichen (the source material of the usnic acid). The abundance of usnic acid is so great in the lichen that this natural product can be observed directly in the NMR spectrum without further purification. This natural sample of usnic acid appears to have the same crystalline form as that in the pure commercial sample.
Small-molecule solid-state NMR spectroscopy is often more challenging than its liquid-state counterpart, owing to the fact that crystal packing can lead to the number of crystallographically distinct molecules in the unit cell (Z′) being greater than one, necessitating assignment of the resulting spectroscopic signals to not only their corresponding chemical sites within the molecule, but also to the correct crystallographically distinct (but chemically identical) molecule within the cell. This can be particularly problematic where Z′ is large: for example, Webber et al. characterised campho[2,3-c]pyrazole, for which Z′ = 6 and the signal for each of the 11 chemical sites was split into six by the crystallographic inequivalences.9 Absolute crystallographic assignment is typically assisted by density functional theory (DFT) calculations although, as demonstrated below, care must be taken to use experiment to verify DFT-based assignments wherever possible, as the errors are often on the order of the small splitting of the signals arising from crystallographic inequivalences. In such cases, it can be advantageous to use advanced NMR methodology to identify the signals from adjacent H–H, H–C and C–C pairs and confirm (or refute) the DFT-based assignments. However, such an approach requires access to specialist hardware (such as a probe capable of achieving magic angle spinning (MAS) rates ≥60 kHz for 1H–1H correlations with usefully high resolution) and experimental times can be very long (multiple days for 13C–13C correlations at natural abundance). Despite these practical challenges, such experiments can prove invaluable. As an early and illustrative example, Harris and co-workers were able to assign all 38 13C signals observed for the β-form of testosterone (Z′ = 2) using a 13C–13C Incredible Natural Abundance DoublE Quantum Experiment (INADEQUATE) correlation spectrum at natural abundance.10
The present work combines one- and two-dimensional 1H and 13C solid-state NMR spectroscopy with periodic DFT calculations to characterise the pure solid form of (+)-1 (with Z′ = 2) and achieve an experimentally verified absolute assignment of signals to the crystal structure, as well as demonstrating the observation of signals from crystalline 1 in a sample of wild lichen (Usnea dasopoga11).
![]() | ||
Fig. 2 Photograph of a representative sample of wild U. dasopoga collected for this work. See the ESI† for a larger reproduction of this figure. |
Calculations generate the shielding tensor in the crystal frame, σ, from which the calculated isotropic shielding is given by σiso = (1/3)Tr{σ}, and the isotropic chemical shift by σiso = –(σiso – σref)/m, where σref is a reference shielding and m is a scaling factor. In this work, values of m = 1.015 and σref = 172.07 for 13C and m = 1.102 and σref = 30.55 for 1H were obtained as described below.
Fig. 4 shows the 13C DEPTQ NMR spectrum of (+)-1 in CDCl3 solution, and the corresponding 13C cross polarization (CP) MAS NMR spectrum of solid (+)-1. As the solid form contains two crystallographically (and, hence magnetically) inequivalent molecules (shown in red and blue in Fig. 3b), one might expect twice the number of signals to be observed for the solid. On initial count, only 33 signals are observed, compared to the expected 36, suggesting that some signals from the two distinct molecules are overlapped. The assignment of the solution-state spectrum is known and is indicated in Fig. 4.7 Given the similarity of the peak positions in the solid-state spectrum, it is reasonable to take this order of assignments as a starting point for assigning the signals for the solid. Three pairs of signals for the solid appear to be overlapped: those of carbons C2 (106.8 ppm), CA (30.6 ppm) and CB (29.8 ppm). The challenge of assigning the 13C NMR spectrum in the solid phase is not one simply of chemistry, but of assigning signals, from chemically identical species to the correct magnetically distinct molecules. Harris et al. demonstrated a strategy to address this problem for β-testosterone, which, like (+)-1 contains two distinct molecules in the unit cell.10 Using a combination of the two-dimensional 13C INADEQUATE and DFT calculations, the authors were able to observe and assign all 38 13C signals. INADEQUATE requires adjacent pairs of 13C nuclei, which, at natural abundance (ca. 1%), gives low overall sensitivity and leads to experimental times on the order of days, even with signal enhancement by CP.
![]() | ||
Fig. 4 Expansions of the overlaid 13C CP MAS NMR spectrum of solid (+)-1 (black) and the 13C DEPTQ NMR spectrum of (+)-1 in CDCl3 solution (CH3 and CH positive, CH2 and C negative). Assignments of the solution-phase spectrum are shown in red. The signal at 28.9 ppm indicated with a dagger (†) arises from an unidentified impurity. For the complete spectra, see the ESI.† |
Fig. 5 shows expansions of the 13C CP INADEQUATE NMR spectrum of (+)-1, and several observations can be made at this stage. First, the assumption that the signals for CA are overlapped is incorrect, as correlations between CA and C6a are observed for the methyl signals at 31.9 and 30.5 ppm, whereas correlations between CD and C9b are observed for the methyl signals at 32.5 and 30.8 ppm. Second, with the mixing time chosen for this experiment (3.84 ms) cross peaks for the C4a signals (at 176.7 and 179.8 ppm) are very weak and this may hinder full assignment (additional experiments with mixing times of 3.2 and 4.48 ms, shown in the ESI†, did not yield further correlations for C4a). Finally, to have a sweep width in the indirect dimension spanning δ1 = 115.6 ppm (the lowest anticipated shift for CC and C8) to δ1 = 312.0 ppm (the highest anticipated shift for C2 and C2a), a rotor-synchronised t1 increment of τr/2 = 40 μs was used, resulting in spinning sidebands being observed for some signals, which clutter the spectrum but do not interfere with assignment.
![]() | ||
Fig. 5 Expansions of the 13C CP INADEQUATE NMR spectrum of (+)-1 (9.4 T, 12.5 kHz MAS). Positive contours are shown in black and negative contours in red. Spinning sidebands are coloured grey and marked with asterisks (*). The 13C CP MAS NMR spectrum from Fig. 4 is shown along δ2 as a guide to the eye. The correlations between CA and C6a are highlighted in blue and those between CD and C9b in green. |
In principle, assignment of signals in an INADEQUATE spectrum is trivial, with pairs of peaks at the same δ1 corresponding to carbon atoms in the same molecule. In practice, for (+)-1 some assignments were more challenging. In particular, C2, C9b, C9a and C6 have shift differences of 0.20, 0.35, 0.33 and 0.31 ppm between the two molecules, respectively, which is less than the limiting resolution in the indirect dimension. For these signals, close inspection of the lineshapes in the INADEQUATE spectrum allowed tentative assignments to be determined, and these assignments were internally consistent with other unambiguous assignments. As such, two larger fragments of the molecules could be identified from the INADEQUATE experiment, comprising C4a, C4, C3 and C2 in the first fragment and C6, C7, C8, C9, C9a and CC in the second fragment. Additionally, cross peaks linking C6a and CA could be identified, but it was not possible to unambiguously include these in the second large fragment. Closer inspection of the signal for C2 allowed identification of correlations between C2 and C2a to be added to the first fragment, which also adds CB to this fragment, since it has not proven possible to distinguish more than one signal for CB in any of the experiments in this work. Owing to the shift difference between the two CD signals, the fragment containing CD and C9b could be identified but not connected to either of the large fragments, as a result of the small shift differences for both C9a and C9b, the fact that the correlation between C9b and C4a was not observed. The resolution was also insufficient to identify the C9b-C1 pairs. This left two large fragments, the acyl group C6a-CA, the C9b-CD pair with two isolated carbons, C1 and C5 unassigned. Further connection of these fragments was not possible from the INADEQUATE spectrum alone.
Owing to the presence of strong homonuclear dipolar couplings, solid-state 1H NMR spectra are typically of much poorer resolution than 13C. Fig. 6a shows the 1H MAS NMR spectrum of (+)-1, recorded at a spinning rate of 60 kHz. Even at this rapid rotation rate, resolution of the signals for the two distinct molecules is not possible, with the exception of the hydrogen-bonded H between C7(OH) and C6(O), which has shifts of 12.0 and 12.4 ppm. Processing with exponential weighting of −200 Hz (shown in the ESI†) further reveals that the signal for the hydrogen-bonded proton between C9(OH) and C1(
O) has contributions at 11.0 and 10.8 ppm. The CH3 region of the spectrum clearly contains multiple signals, but these are too overlapped to yield useful information. A 1H–13C INEPT experiment, shown in Fig. 6b, reveals pairs of spins that interact via the J coupling (note that the delay in the INEPT experiment is τ = 1/4J and here, τ = 0.33 ms, indicating J = 750 Hz, i.e., there is a small residual dipolar contribution to the coupling). This experiment allows assignment of the 1H signals from the CH3 groups and shows that although the two signals from C4 have a shift difference of 1.9 ppm, their attached protons have the same chemical shift.
![]() | ||
Fig. 6 (a) 1H (14.1 T, 60 kHz MAS) NMR spectrum of (+)-1 (black) and 1H (9.4 T, CDCl3) NMR spectrum of (+)-1 with peak assignments (red, s = solvent, † = impurity). (b) Expansions of the 1H–13C (14.1 T, 60 kHz MAS) INEPT NMR spectrum of (+)-1 with the 13C CP MAS NMR spectrum from Fig. 4 shown along the top as a guide to the eye. (c) Expansions of the 1H–13C (9.4 T, 12.5 kHz MAS) CP-HETCOR spectrum of (+)-1. Signals from nuclei near the C9(OH)–C1(![]() ![]() ![]() |
To gain further insight into longer-range 1H–13C interactions, a heteronuclear correlation (HETCOR) experiment was recorded, with magnetization transfer by CP, using the through-space dipolar interaction. By using a short contact time of 0.35 ms, the spin–spin interactions observed are largely intramolecular rather than intermolecular. Fig. 6c shows the CP-HETCOR spectrum and despite the dramatic broadening of signals in the 1H dimension (as a consequence of the lower MAS rate of 12.5 kHz), the spectrum reveals valuable information about the carbon spins near to the O–H⋯O hydrogen bonds. The signals highlighted in green in Fig. 6c correspond to C6a, C6 and C7, which are adjacent to the C7(OH)–C6a(O) hydrogen bond with δH = 12.0 and 12.4 ppm. By noting that the 1H signal with higher chemical shift correlates to C6a at 200.69 ppm and to C7 at 162.16 ppm, it is possible to join the fragment containing C6a and CA to the larger molecular fragment. The signals highlighted in red in Fig. 6c correspond to the carbons adjacent to the C9(OH)–C1(
O) hydrogen bond (with δH = 10.8 and 11.0 ppm): C9, C9a, C9b and C1. Although the difference in 1H chemical shifts here is small, it is still possible to use these correlations to extend the large molecular fragment to include both C1 and the fragment containing C9b and CD. The signals highlighted in gold in Fig. 6c correspond to C2a, C2 and C3, which are adjacent to the C3(OH)–C2a(
O) hydrogen bond (δH = 18.9 ppm). However, these signals provide no additional information since there is no resolution of the 1H chemical shifts and the 13C signals were assigned fully by analysis of the INADEQUATE spectrum. At this stage, as shown in the ESI,† there are two large fragments of the molecule for which assignment of all 1H and 13C signals has been possible. The first fragment contains C2, C2a, CB, C3, C4 and C4a. The second contains C6, C6a, CA, C7, C8, C9, C9a, C9b, CD and C1. Only C5a cannot be assigned to either fragment and it was not possible to connect either C4a to C9b or C1 to C2.
One further relatively routine experiment presents itself in the solid-state NMR toolkit for small organic molecules; the 1H–1H single-quantum–double-quantum (SQ–DQ) correlation experiment. In this work, double quantum coherences were excited and converted using the BAck-to-BAck sequence,26 which uses the dipolar interaction and can, therefore, show both intramolecular and intermolecular 1H–1H interactions. Indeed, given the sparse 1H network within the structure of 1, many of the shortest H–H distances (i.e., those giving rise to the strongest dipolar couplings) in the solid are intermolecular. Given that the two molecules within the asymmetric unit are arranged such that some H–H distances are notably different, the 1H SQ–DQ correlation experiment provides a means of absolute assignment of the signals to their crystallographic sites, rather than simply the relative assignment to molecular fragments that has been discussed so far.
Fig. 7a shows the 1H SQ–DQ correlation spectrum of (+)-1. The resolution of the CH3 region of the spectrum is not sufficient to allow unambiguous information to be extracted and many of the more obvious correlations observed do not add any new information to the spectroscopic assignment discussed above.
![]() | ||
Fig. 7 (a) Expansions of the 1H (14.1 T, 60 kHz MAS) SQ–DQ correlation spectrum of (+)-1. The δ1 = 2δ2 autocorrelation diagonal is indicated with a grey dashed line, the signals corresponding to H4–C9(OH) are indicated with a pink box and the signals corresponding to correlations with C7(OH) (discussed in more detail in the text) are indicated with blue boxes. (b and c) The local environment (in the DFT-optimised structure) around the C9(OH)–C1(![]() |
However, the pair of signals at (δ2 = 5.7 ppm, δ1 = 16.8 ppm) and (11.1 ppm, 16.8 ppm), corresponding to H4 (5.7 ppm) and one of the C9(OH) (11.1 ppm) distinguish unambiguously between the two molecules in the crystal structure. As shown in Fig. 7b, for the optimised structure of (+)-1, H4 in one molecule (the lower, as presented in Fig. 3a) is 3.17 Å from the nearest H4 (in a different dimer) and, as shown in Fig. 7c, the other C9(OH) has no nearby H4. Similar information is also provided from the correlations observed for the C7(OH) signals at 12.4 and 12.0 ppm, highlighted in pink in Fig. 7a. For the C7(OH) in the upper molecule in Fig. 3a, the closest C3(OH) is 3.5 Å, the closest H4 is 3.8 Å and there are no nearby C9(OH), whereas for the lower molecule, there is a C9(OH) within 4 Å and the C3(OH) and C4 are further away at 3.8 and 4.4 Å, respectively. This is consistent with the observed cross peaks (note that additional signals might be expected for all H–H distances of around 4 Å, and there is some limited evidence of these just above the baseline of the spectrum, not shown in Fig. 7a). By combining the evidence from the 1H SQ–DQ correlation experiment with the assignments of the 1H and 13C signals discussed above, it is possible to achieve an absolute crystallographic assignment for the largest molecular fragment, which contains both C9(OH) and C7(OH). However, there remains no unambiguous experimental means of connecting these fragments to C5a or the smaller fragment.
As there is a fragment of the molecule for which the 1H and 13C chemical shifts are known, as discussed above, this information was used to provide “internal referencing” of the DFT-calculated magnetic shielding. In this method, calculated σiso values are plotted against experimental δiso, with the slope and intercept used to determine m and σref (as described in the experimental details). For the fragment of the structure for which experimental shifts were known, this resulted in σref and m of 30.55 ppm and 1.102, respectively for 1H, and 172.07 ppm and 1.015, respectively, for 13C (note that for 1H, data for C3(OH) and H4 were also included because these have the same experimental δiso in both molecules). A small error of around 1–2% of the chemical shift range of a nucleus is typical of periodic DFT calculations and, indeed, in this case mean absolute errors (MAEs) of 0.62 and 1.77 ppm were obtained for 1H and 13C (around 3% and 0.7% of the respective shift ranges), allowing confident assignment of 13C signals from the second fragment of the molecule that are separated by at least 1.77 ppm. The second part of the molecule contains C4 and C4a, which have shift differences of 1.9 and 3.1 ppm, respectively. This allows absolute crystallographic assignment of the signals from C2, C2a, C3, C4 and C4a (and CB) and gives an overall MAE for the 13C shifts of 1.52 ppm. Assuming the alternative assignment of this second fragment to the molecule gives a slightly larger MAE of 1.60 ppm. However, we note that there is around a 6 ppm error in the calculated values of δiso for C4a, which may make this an unreliable indicator. C5a has an experimental shift difference of 0.97 ppm between the two molecules, allowing tentative assignment of its signals and completing the absolute assignment of the 13C NMR spectrum. For 1H, H4 and C3(OH) have the same experimental δiso in both molecules, with means that the only 1H signals to assign are those of methyl group B. These have experimental shifts of 2.8 and 2.6 ppm, and calculated shifts of 2.7 and 2.1 ppm for the two separate molecules, which allows a tentative assignment to be made.
Table 1 summarises the 13C chemical shifts for (+)-1 in the solid and solution and compares these with the periodic DFT calculation for the solid. Table 2 provides a similar summary for the 1H spectroscopic data, although the lower resolution of the CH3 region of the spectrum leads to lower confidence in assignment of the experimental shifts, especially for methyl group B. Note that the calculated shifts for the CH3 groups are the average of the three H within the group, each of which is magnetically distinct in the static structure, but which will rapidly interconvert owing to unhindered rotation about the C–CH3 bond at room temperature. This approximation further adds to the error in the computed 1H δiso.
C | δ (CDCl3) | Molecule 1 δ (ppm) | Molecule 2 δ (ppm) | ||
---|---|---|---|---|---|
Exp. | DFT. | Exp. | DFT | ||
2a | 201.8 | 205.1 | 206.7 | 204.3 | 205.0 |
6a | 200.3 | 200.7 | 198.9 | 199.5 | 197.6 |
1 | 198.1 | 197.3 | 195.2 | 196.9 | 194.2 |
3 | 191.7 | 192.5 | 188.7 | 193.3 | 188.8 |
4a | 179.4 | 176.7 | 182.3 | 179.8 | 184.9 |
7 | 163.9 | 162.2 | 163.6 | 164.3 | 165.7 |
9 | 157.5 | 159.8 | 159.8 | 157.8 | 157.3 |
5a | 155.2 | 155.9 | 156.4 | 154.9 | 154.5 |
8 | 109.4 | 108.5 | 108.9 | 110.3 | 111.4 |
2 | 105.2 | 106.7 | 106.7 | 106.9 | 107.3 |
9a | 104.0 | 104.9 | 104.9 | 104.5 | 104.4 |
6 | 101.5 | 100.6 | 100.2 | 100.9 | 100.5 |
4 | 98.4 | 100.2 | 99.3 | 98.3 | 97.7 |
9b | 59.1 | 58.7 | 62.2 | 58.6 | 61.3 |
D | 32.1 | 30.6 | 31.2 | 32.5 | 32.8 |
A | 31.3 | 31.9 | 30.3 | 30.5 | 28.5 |
B | 27.9 | 29.8 | 30.9 | 29.8 | 29.7 |
C | 7.5 | 7.1 | 5.5 | 8.1 | 5.5 |
H | δ (CDCl3) | Molecule 1 δ (ppm) | Molecule 2 δ (ppm) | ||
---|---|---|---|---|---|
Exp. | DFT. | Exp. | DFT | ||
3(OH) | 18.86 | 18.9 | 18.5 | 18.9 | 17.9 |
7(OH) | 13.33 | 12.0 | 13.7 | 12.4 | 12.8 |
9(OH) | 11.04 | 11.0 | 11.0 | 10.8 | 11.4 |
4 | 6.00 | 5.6 | 5.4 | 5.6 | 5.5 |
A | 2.70 | 1.9 | 2.3 | 2.5 | 2.2 |
B | 2.68 | 3.0 | 2.7 | 2.5 | 2.1 |
C | 2.12 | 2.3 | 0.7 | 0.8 | 2.0 |
D | 1.78 | 1.6 | 1.7 | 1.8 | 1.5 |
Despite the overall reasonable MAE values quoted above, it is worth looking in more detail at the calculated δiso for 13C. Fig. 8a shows a plot of the calculated chemical shifts against the experimental values and, on the scale of the whole spectrum, the agreement is reasonable (R2 = 0.999). However, when the calculated peak positions are superimposed on the CP MAS NMR spectrum (as in Fig. 8b) or the INADEQUATE spectrum (shown in the ESI†), it is clear that the calculated shifts do not adequately describe certain regions of the spectrum, especially for C4a and the methyl region. A possible source of the apparent inaccuracy of the DFT calculations could be dynamics, particularly of the CH3 groups in this case. The DFT calculation has an inherent temperature of 0 K and, hence, neglects thermal effects such as dynamics. Various computational strategies can be undertaken to more accurately model dynamic behaviour of molecules, although these approaches are more limited and often computationally costly for periodic calculations.24 An alternative experimental approach is to carry out variable-temperature (VT) NMR experiments, in which temperature-dependent changes in chemical shift can indicate the presence of dynamics.
![]() | ||
Fig. 8 (a) Plot of DFT calculated 13C δiso against the experimental values for (+)-1. The ideal 1![]() ![]() |
Fig. 9 shows the change in peak positions, δδ, for 13C signals from (+)-1 as a function of temperature (from 260 K to 330 K). All of the signals display a small temperature dependence, but some show a much larger effect. In particular, the acyl group comprising CB and C2a, and backbone carbon C2 exhibit larger δδ than other carbons in the molecule with comparable chemical environments. While these shift differences may arise from molecular motion, this part of the molecule also exhibits tautomerism in the solution phase, as shown in Fig. 10, which has been explored by molecular DFT calculations.27 Given that the two tautomers A and B are very close in energy in the molecule, it seems reasonable to assume that they will both also be present in the solid state. As different sets of signals are not observed for the different tautomers, it can be suggested that rapid interconversion between tautomers A and B occurs on the NMR timescale, (i.e., a low energy barrier between the tautomers). As such, it is likely that the changes observed experimentally arise from variation in the populations of tautomers A and B, rather than from increasing the population of C, which would require reorientation of the acyl group within the sterically confined environment of the solid crystal. This proposal is consistent with the observation that tautomers C, C′ and D do not appear to contribute to the room-temperature solution-phase NMR spectra.27 The tautomerism of 1 could also explain some of the discrepancies between the calculation (only tautomer A) and experiment (an interconverting mixture of tautomers A and B).
![]() | ||
Fig. 10 Proposed tautomers of 1, with the relative gas-phase energies calculated by Galasso27 indicated. The region of the molecule undergoing changes between the various tautomers is highlighted in red. |
![]() | ||
Fig. 11 13C CP MAS NMR spectra (9.4 T, 12.5 kHz MAS) of ground lichen (black) and commercial (+)-1 (red). |
Variable-temperature 13C CP MAS NMR experiments showed that the region of the molecule capable of exhibiting tautomerism (carbons 2, 2a and B) gives rise to the largest temperature-dependent chemical shifts. However, a detailed investigation of the effects of tautomerism in the solid form of (+)-usnic acid is beyond the scope of the present work.
Finally, a sample of powdered lichen was characterised by 13C CP MAS NMR. This experiment presents a rare observation of a natural product in its parent tissue (facilitated by the high concentration of usnic acid in many lichen species) and allowed confirmation that the usnic acid is present in the same crystalline form in the lichen as in the pure solid.
Footnote |
† Electronic supplementary information (ESI) available: further details of solid-state NMR experiments, complete one-dimensional 13C NMR spectra, additional 13C INADEQUATE NMR spectra and more detailed interpretation, and further comparison of 13C NMR spectra of solid (+)-usnic acid and U. dasopoga powder. See DOI: https://doi.org/10.1039/d4cp01127a |
This journal is © the Owner Societies 2024 |