Nicholle G. A. Bella, Graeme Rigga, Sarah Mastersb, Juraj Bellaa and Dušan Uhrín*a
aEastChem School of Chemistry, University of Edinburgh, West Mains Road, Edinburgh, EH9 3JJ, UK. E-mail: dusan.uhrin@ed.ac.uk
bDepartment of Chemistry, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
First published on 5th September 2013
We have developed novel NMR methods for the measurement of heteronuclear residual dipolar couplings (RDCs) in molecules with severely overlapping NMR resonances. These and other methods enabled us to obtain 31 RDCs for α-D-cellobiose and 24 RDCs for β-D-cellobiose. The interpretation of the data in the approximation of a rigid disaccharide structure, using RDCs and interglycosidic 3J coupling constants, yielded conformation that is very close to that determined using X-ray crystallography. However, depending on which ring was used to calculate the order parameters, the dihedral angle ψH varied up to 30° or 40°, while the ϕH angle was always the same. This indicates residual flexibility of the glycosidic linkage between the two monosaccharide rings and was observed for both α- and β-D-cellobiose. The RDC analysis using rigid fragments rather than a complete molecule has thus shown that the glycosidic bond of cellobiose is not completely rigid and exhibits low-level flexibility. The sources of this flexibility are discussed and evidence presented to support a hypothesis that it is associated with the ψ more than the ϕ angle.
The SVD approach allows rigorous characterisation of the alignment of rigid fragments and is more likely to reveal flexibility, rather than hiding any discrepancies between the experiment and the theory in the “structural noise”. Using cellobiose, 4-O-(β-D-glucopyranosyl)-β-D-glucopyranose (see the inset to Fig. 1), a disaccharide that has rather “rigid” glycosidic linkage, we present here novel approaches that allow the detection of low-level molecular flexibility.
Fig. 1 800 MHz 1H spectra of cellobiose. Expansions of the anomeric and aglyconic signals of the isotropic, (a) + (c), and aligned, (b) + (d), samples. The structure of cellobiose, together with the ring and atom numbering used, is shown in the inset. |
Cellobiose is the disaccharide repeating unit of cellulose, a key component of plant cell walls. Its conformation has been the subject of continuous research by both theoreticians and experimentalists producing a considerable amount of data. Theoretical studies have identified conformational energy minima of cellobiose,15–24 which are characterised by certain combinations of glycosidic dihedral angles ϕ and ψ. Using the ϕH (H1′–C1′–O4–C4) and ψH (C1′–O4–C4–H4) definitions of dihedral angles, the most stable conformer of cellobiose is referred to as syn-ϕH/syn-ψH (ϕH–ψH ∼ 60°), while the identified metastable states are anti-ϕH/syn-ψH (ϕH ∼ 180°/ψH ∼ 60°) or syn-ϕH/anti-ψH (ϕH ∼ 60°/ψH ∼ 180°). Although all theoretical models show the existence of these minima, their populations vary dramatically depending on the method used for their calculation. For example, molecular dynamics simulations using CHARMM25 with the Carbohydrate Solution Force Field (CSFF)19 identified the anti-ϕH conformer as the most stable, while replica exchange dynamics using the AMBER/GLYCAM force field18 placed 99% of the molecules at 300 K into the syn-ϕH/syn-ψH region and ∼1% and ∼0% into the anti-ψH and anti-ϕH regions, respectively. Similar results were obtained with the GROMOS 45A4 force-field26 with the artificially enhanced sampling around the glycosidic bond using the local elevation umbrella method.20 However, this time the populations of the anti-conformers were approximately reversed. In contrast to these results, quantum mechanical calculations in the gas-phase predicted the dominance of the anti-ϕH/syn-ψH conformer.15,16 The existence of this conformer in the gas phase was later confirmed (in the presence or absence of a single water molecule) using vibrational spectroscopy under molecular beam conditions in conjunction with quantum chemical calculations.17 Recent DFT calculations using the COSMO solvation model,21 while still favoring the anti-ϕH/syn-ψH conformer, have reduced the energy difference between the anti-ϕH and syn-ϕH states. Subsequent use of the SMD solvation model has placed the syn-ϕH conformer 2 kcal mol−1 below the minimum of the anti-ϕH conformer.22
Substantiated by the analysis of inter-residue ROEs and hetero- and homonuclear three-bond interglycosidic coupling constants, experimental NMR studies in water have consistently identified syn-ϕH/syn-ψH as the major conformer of cellobiose.27–31 Investigations by the Widmalm group28,31 concluded that the syn-conformer is present at ∼93%, while the minor anti-ϕH and anti-ψH states are populated to 0–2% and 5–7%, respectively. Although in a broad agreement with some of the theoretical studies,18,20 given the very small populations of the minor conformers, one has to be cautious when interpreting the experimental NMR. This is because such interpretation relies on a number of simplifications, such as the two-spin approximation, the assumption of uncorrelated molecular motions or isotropic reorientation, all of which are used in the analysis of proton cross relaxation rates. The Karplus curves parameterisation used for the interpretation of scalar coupling constants is also only an approximation. Most crucially, the analysis of averaged NMR parameters associated with flexible systems relies on the use of computed structural models. Given the variations between the different force fields and solvation methods demonstrated above, the time averaging of theoretical parameters and their “agreement” with experimental data cannot therefore be taken as a definite proof of the existence of conformational flexibility. Approaching the problem without any preconception, one has to ask two questions: (i) is the glycosidic linkage of cellobiose rigid or does it exhibit some flexibility? (ii) if flexible, what is the nature of the underlying motion? In this work we have employed RDCs and scalar coupling constants to answer the first question and to hypothesize on the second.
All spectra of aligned and isotropic cellobiose were recorded at 297 K on an 800 MHz Bruker Avance spectrometer equipped with a TCI triple-resonance cryoprobe with z gradients. Twenty 1D intensity modulated 13C spectra were collected for the determination of one-bond splittings using the pulse sequence of Fig. 2a or b in a pseudo 2D manner. The following parameters were used: number of scans 16 × 12 = 192 (12 repetitions of each experiment in a cyclic manner to ensure long-term averaging of spectrometer instabilities), an acquisition time of 0.68 s, and a relaxation time of 1.5 s. The BIRDd,X pulse33,34 was optimised for 1JCH = 155 Hz by setting Δ = 1/21JCH. The evolution interval T was set to 4, 6, 8, 10 ms and then 150 to 165 ms in 1 ms increments. Pulsed field gradients, Go–G3, were applied over 500 μs at 33, 13, 18 and 27%, respectively. The total acquisition time was 2.75 hours.
Fig. 2 Pulse sequences of intensity modulated 13C-detected 1D experiments for the measurement of one-bond, (a) and (b), and long-range, (c) and (d), proton–carbon coupling constants. Narrow and wide rectangles represent 90° and 180° rectangular pulses, respectively, with phase x, unless stated otherwise. T is the variable evolution interval, Δ = 1/21JCH, Δ1 = 1/41JCH, Δ2 = 1/4nJCH. The gradients were applied for a period of 0.5 ms at the following strengths: G0 = 33%, G1 = 13%, G2 = 18%, G3 = 27%. The following phase cycling was as follows: φ1 = 8x, 8(−x); φ2 = y, −y; φ3 = 2x, 2(−x); φ4 = 4x, 4(−x) and ψrec = 4(x, −x), 4(−x, x). |
The long-range coupling constants were measured using the pulse sequence of Fig. 2c (or 2d). The following parameters were used: number of scans 16 × 60 = 960 (60 repetitions of each experiment in a cyclic manner), an acquisition time of 0.68 s, and a relaxation time of 1.5 s. A 30 ms Gaussian selective 180° 1H pulse was applied. The refocusing interval (Δ2 = 1/4nJCH) was optimised for a 5 Hz coupling constant. The evolution interval, T, was set from 50 to 525 ms in 25 ms increments, Go–G3 were applied over 500 μs at 33, 13, 18 and 27%, respectively.
The RDCs were determined as a difference between the splittings measured in the aligned and isotropic media. The sign of RDCs was, in most cases, determined unambiguously based on the known sign of the scalar couplings. Where this was not possible, e.g. DH2,H4, the sign was implied during the calculation of the alignment tensor. The experimental splittings were determined by a three parameter fit (Io, KCH, and T2) of signal intensities derived from the series of intensity modulated spectra using eqn (1):
I = I0 sin(π1KCHT)exp[−(T + 2Δ)/T2] | (1) |
I = I0 sin[πnKCH(T + τeff)]exp[−(T + τ180)/T2] | (2) |
Calculations were performed using the resources of the National Service for Computational Chemistry Software (NSCCS) (http://www.nsccs.ac.uk) utilizing the Gaussian 03 suite of programs.35 Coordinates for the molecular structure of cellobiose were taken from a neutron diffraction structure of β-D-cellobiose.36 For the analysis of α-D-cellobiose, the coordinates of the reducing ring were replaced with those of the neutron diffraction structure of α-D-glucose.37 The heavy atom coordinates were fixed at these positions. The position of the hydrogen atoms was then optimized using the B3LYP functional38–40 with the 6-31G* basis set.41–43
These experiments can be characterised as 1D refocused INEPT50 experiments with a BIRDd,X pulse33,34 applied amid a spin-echo interval. Exclusive inversion of 13C-attached protons during this variable time interval refocuses proton–proton (or long-range proton–carbon) scalar and dipolar couplings, while at the same time preserves the signal modulation by 1KCH (1JCH or 1JCH+1DCH). This modulation can take place while the magnetisation is on the carbon (Fig. 2a) or the proton (Fig. 2b) spins. Fitting the signal intensities of individual carbon resonances according to eqn (1) yields the values of 1KCH.
The pulse sequences of Fig. 2a and b can easily be modified for the measurement of long-range proton–carbon splittings (nKCH). By inverting “selectively” a single 12C-attached proton during the evolution and refocusing intervals (pulse sequences of Fig. 2c and d), the signal intensities of all carbons that are long-range coupled to the inverted proton are modulated by nKCH. It may seem that the use of these methods is limited due to the extensive overlap of proton resonances discussed above. However, the selectivity condition only requires the chemical shift of the inverted proton to be sufficiently separated from the chemical shift of other protons with which it shares a mutual coupling. In addition, if more than one inverted proton is long-range coupled to the same carbon, this carbon cannot be analysed using eqn (2). Also, one has to consider the 1H spectra of individual 13C-isotopomers, as large one-bond couplings change the position of resonances. Nevertheless, the limitations of the use of this experiment can often be reduced to the requirement for proton resonances within one monosaccharide ring to be sufficiently separated. While for isotropic samples it is straightforward to ascertain if this is the case, care must be taken when this experiment is applied to the aligned samples.
1KCH splittings were measured using the pulse sequences shown in Fig. 2a and b. Both pulse sequences yielded identical values; the long-range splittings were consequently measured only by the pulse sequence of Fig. 2c. An example of the application of these methods to cellobiose is shown in Fig. 3. As can be seen from Fig. 3a–c, a very small difference between JC1′β, H1′β (162.51 Hz) and JC1′α, H1′α (162.41 Hz) is detectable using these methods. Altogether 14 out of 20 1DCH of CH pairs were measured for the four monosaccharide rings (Table 1). The remaining six RDCs, (1DC3′,H3′, 1DC4′,H4′ and 1DC5′,H5′ of rings Iα and Iβ) were affected by higher order effects in the 13C-satellite spectra and were therefore excluded from the subsequent analysis. Also measured were 13 nDCH intra-ring couplings and three out of four inter-ring 3DCH couplings across the glycosidic linkage. Unfortunately 3DC1′,H4β was not accessible because of strong coupling between H3 and H4 of ring IIβ. An example of the analysis of a long-range splitting is shown in Fig. 3d.
Fig. 3 (a) C1′α and C1′β signals from the intensity-modulated spectra of the isotropic sample of cellobiose (pulse sequence of Fig. 2a), as a function of the evolution time T; (b) intensity of the C1′β signals shown in (a) fitted to eqn (1); (c) expansion of the boxed area from (b) showing both signals; (d) fitting (eqn (2)) of the signal intensities of C1′α acquired on the aligned sample using the pulse sequence of Fig. 2c with the selective inversion of H4α. |
Atoms | α-D-Cellobiose | β-D-Cellobiose | ||||||
---|---|---|---|---|---|---|---|---|
Ring Iα (non-reducing) | Ring IIα (reducing) | Ring Iβ (non-reducing) | Ring IIβ (reducing) | |||||
Exp. | Calc. | Exp. | Calc. | Exp. | Calc. | Exp. | Calc. | |
a Not included in calculations because of higher order effects.b Using RDCs of individual monosaccharides separately.c Using RDCs of both monosaccharides including the interglycosidic RDCs.d Based on 20 RDCs of ring II.e Based on 11 RDCs of ring II. | ||||||||
One-bond proton–carbon couplings | ||||||||
C1–H1 | 12.60 | 13.13 | −9.40 | −9.26 | 12.38 | 12.80 | 12.67 | 12.74 |
C2–H2 | 12.65 | 12.30 | 13.87 | 13.88 | 12.71 | 12.40 | 13.05 | 12.82 |
C3–H3 | a | 13.96 | 13.77 | a | 13.18 | 13.20 | ||
C4–H4 | a | 13.19 | 13.37 | a | 12.67 | 12.47 | ||
C5–H5 | a | 13.65 | 13.43 | a | 12.66 | 12.88 | ||
Long-range proton–carbon couplings | ||||||||
C1–H2 | 0.61 | 0.51 | — | 0.26 | 0.35 | 1.13 | 0.89 | |
C2–H3 | −0.21 | 0.06 | 0.23 | 0.23 | 0.1 | 0.47 | ||
C3–H1 | −0.3 | −0.26 | ||||||
C3–H2 | 0.2 | 0.06 | −0.32 | −0.51 | ||||
C3–H4 | 0.23 | 0.40 | ||||||
C4–H3 | 0.58 | 0.49 | ||||||
C5–H1 | −0.63 | −0.65 | ||||||
C5–H4 | 0.87 | 0.99 | ||||||
Proton–proton couplings | ||||||||
H1–H2 | 1.84 | 1.76 | −0.63 | −0.82 | 1.48 | 1.64 | 1.93 | 1.90 |
H1–H3 | −6.26 | −6.35 | ||||||
H2–H3 | 1.04 | 1.12 | 1.56 | 1.46 | 1.31 | 1.42 | ||
H2–H4 | −5.63 | −5.63 | ||||||
H3–H4 | 1.34 | 1.58 | ||||||
One-bond carbon–carbon couplings | ||||||||
C1–C2 | −0.85 | −0.76 | 0.45 | 0.38 | −0.89 | −1.08 | 0.35 | 0.19 |
C2–C3 | −0.98 | −1.00 | a | a | −0.95 | −0.76 | a | a |
C3–C4 | 0.43 | 0.75 | −0.55 | −0.41 | 0.65 | 0.42 | −0.05 | −0.10 |
C4–C5 | −0.75 | −1.13 | 0.1 | 0.35 | −1.1 | −1.25 | 0.15 | 0.04 |
C5–C6 | 0.7 | 0.59 | −0.1 | −0.29 | 0.6 | 0.52 | 0.1 | 0.38 |
Inter-ring couplings | ||||||||
Exp. | Calc. | Exp. | Calc. | |||||
C1′–H4 | −1.03 | −0.73 | a | a | ||||
C4–H1′ | −1.2 | −1.48 | −1.05 | −0.99 | ||||
C1′–C4 | −0.35 | −0.45 | −0.48 | −0.48 | ||||
Number of intra-ring RDCs (Number of unique RDCs) | ||||||||
11(7) | 20(15) | 12(9) | 12(6) | |||||
Order matrix parameters based on the analysis of individual rings | ||||||||
104Szz | 3.74 | 4.72 | 4.07 | 4.10 | ||||
104Syy | −2.95 | −3.09 | −2.92 | −3.00 | ||||
104Sxx | −0.79 | −1.6 | −1.15 | −1.10 | ||||
104 GDO/η | 3.94 ± 0.10/0.58 ± 0.06 | 4.80 ± 0.06/0.31 ± 0.02d | 4.20±0.04/0.44±0.02 | 4.24 ± 0.12/0.46 ± 0.05 | ||||
4.58 ± 0.18/0.41 ± 0.06e | ||||||||
rmsdb | 0.10 | 0.14d | 0.08 | 0.12 | ||||
0.14e | ||||||||
(ϕr, ψr, rmsd) | (−10°, −30°, 1.10) | (−10°, 0°, 0.25) | (−10°, −40°, 0.41) | (−10°, 0°, 0.31) | ||||
(ϕr, ψr, rmsd) | (−10°, −10°, 1.60) | (−10°, 0°, 0.65) | (−10°, −10°, 1.36) | (−10°, 0°, 0.71) | ||||
Order matrix parameters based on simultaneous analysis of both rings | ||||||||
104 GDO/η | 4.78 ± 0.05/0.31 ± 0.01 | 4.30 ± 0.05/0.43 ± 0.02 | ||||||
rmsdc | 0.19 | 0.19 |
The heteronuclear RDCs obtained by these experiments were supplemented by 10 nDH,H, 18 1DC,C and two 2DC,C RDCs measured using intensity-based selective 1D methods51,52 and a 2D INADEQUATE experiment53 described previously. The experimental details of these measurements together with illustrative examples are given in Fig. S2 and S3 (ESI‡).
GDO has been proposed as a convenient parameter for assessing the flexibility of moieties connecting rigid molecular fragments.7 When applied to carbohydrates, GDOs of rigid monosaccharide rings connected by rigid glycosidic linkages are expected to be identical. Our calculations show that the GDOs of rings Iβ and IIβ differ by 1% while GDOs of rings Iα and IIα differ by 23%. Therefore based solely on this criterion, β-D-cellobiose appears to be a rigid molecule, while α-D-cellobiose shows signs of flexibility. In order to investigate this apparent discrepancy further, we analysed both molecules in the approximation of a rigid structure. This involved exploring the conformational space described by the glycosidic torsion angles ϕ and ψ in a search for the best fit between the experimental and theoretical RDCs. The results presented below use angles ϕr, ψr, which are the angles relative to those of the X-ray structure of β-D-cellobiose36 [(ϕr, ψr) = (0°, 0°) = (ϕH, ψH) = (+44°, −12°)].
Fig. 4 2D rmsd maps between the experimental and theoretical RDCs based on the simultaneous analysis of RDCs of both rings as a function of ϕr, ψr angles. (a) α-D-cellobiose, the minima were observed at (ϕr/°, ψr/°, rmsd/Hz) = (0, 0, 0.23), (60, 60, 0.35) and (180, −30, 0.56); (b) β-D-cellobiose, the minima were observed at (ϕr/°, ψr/°, rmsd/Hz) = (0, 0, 0.21), (120, 0, 0.29) and (60, 60, 0.35). The conformers identified by the molecular mechanics calculations20 are indicated here (and in Fig. 5–7) by * (syn-ϕH/anti-ψH) and + (anti-ϕH/syn-ψH); a white cross at (0, 0) indicates the X-ray structure of cellobiose.36 |
In the second approach, the order matrices were calculated using the RDCs of ring I, while the rmsds were based on the RDCs of ring II and vice versa. Two sets of rmsds for each of 121 structures of α- and β-D-cellobiose calculated in this way are presented in Fig. 5. A more detailed inspection of the global minima in steps of 10° (Fig. S4, ESI‡) yielded values of (ϕr/°, ψr/°, rmsd/Hz) = (−10, −30, 1.10) and (−10, 0, 0.25) based on the alignment of ring Iα and IIα, respectively. Analogous analysis of β-D-cellobiose generated global minima at (−10, −40, 0.41) and (−10, 0, 0.31), based on the alignment of rings Iβ and IIβ, respectively. Several observations can be made based on the inspection of these 2D maps. (i) All maps (Fig. 5) show a better definition of minima contained in deeper valleys compared to when RDCs of both rings were considered at the same time (Fig. 4); (ii) general trends are identical for both α- and β-D-cellobiose. In all four cases a global minimum was found at ϕr = −10°, while ψr showed two values: ψr = 0° (based on the alignment of rings I) or ψr = −30° (or −40°) (based on the alignment of rings II); (iii) the position of the minima changed compared to when RDCs of both rings were considered simultaneously.
Fig. 5 2D rmsd maps between the experimental and theoretical RDCs as a function of ϕr, ψr angles. (a) α-D-cellobiose, the rmsds for ring II are based on the order parameters of ring I. The main minima occur at (ϕr/°, ψr/°, rmsd/Hz) = (0, −30, 1.16) and (60, 90, 1.39); (b) α-D-cellobiose, the rmsds for ring I are based on the alignment of ring II. The minima occur at (ϕr/°, ψr/°, rmsd/Hz) = (0, 0, 0.50), (60, 90, 1.02) and (180, −60, 0.98); (c) β-D-cellobiose, rmsds for ring II are based on the alignment of ring I. The main minima occur at (ϕr/°, ψr/°, rmsd/Hz) = (0, −30, 0.55), (90, 90, 0.75), (−90, 60, 0.84), (−150, −90, 0.98) and (120, 0, 1.24); (d) β-D-cellobiose, the rmsds for ring I are based on the alignment of ring II. The minima occur at (ϕr/°, ψr/°, rmsd/Hz) = (0, 0, 0.31), (60, 90, 0.89) and (120, 0, 1.10). |
These results indicate that the approximation of a rigid structure is not valid for cellobiose. If cellobiose was completely rigid, identical results would have been obtained regardless of which ring was used to calculate the order matrix parameters. The most interesting result of this analysis is the fact that both α- and β-D-cellobiose show identical trends, i.e. one set of dihedral angles based on the RDCs of the non-reducing ring and another set based on those of the reducing ring. This is despite the fact that GDOs of both rings of β-cellobiose were identical, while those of α-cellobiose differed. It is thus evident that the second treatment of RDCs is preferable. Simultaneous consideration of RDCs of both rings produces an “average” structure, characterised by shallow definition of the minima on the RDC rmsd maps. It should be noted that considering the cellobiose rings individually also produces local minima (see Fig. 5). Do these minima have a physical meaning? Can they be interpreted as signs of the flexibility of the glycosidic linkage? It has been observed previously54 that a random combinations of dihedral angles can yield small rmsds between the theoretical and experimental RDCs. This is a consequence of the mathematical relationship between the order matrix and RDCs and does not necessarily mean that these minima have a physical meaning. Using the approximation of a rigid structure the relevance of these minima can be investigated through the comparison with other experimental parameters, such as scalar inter-residue coupling constants, as illustrated next.
Using the appropriate Karplus type equations56,57 the combined rmsds between five experimental and theoretical couplings (see ESI‡) were calculated as a function of ϕr and ψr dihedral angles for α-D-cellobiose. Very similar results were obtained for β-D-cellobiose, although here only one of the two interglycosidic 3JCH coupling was available (data not shown). When only more readily measurable 3JCH couplings are used, a J-based 2D rmsd map of α-D-cellobiose (Fig. S5, ESI‡) shows four sets of minima for each dihedral angle. When both 3JCH and 3JCC couplings are considered, a J-based 2D rmsd map of α-D-cellobiose (Fig. 6) shows four minima, at (ϕr/°, ψr/°, rmsd/Hz) = (−10, −6, 0.22), (−181, −6, 0.28), (−10, 163, 0.57) and (−181, 163, 0.63), where the global minimum at (ϕr, ψr) = (−10, −6°) is very close to the coordinates of the X-ray structure. The appearance of several minima is caused by the periodic nature of the Karplus curves and limited accuracy of their parameterisation. Fig. 6 also shows the positions of the anti-ϕ and anti-ψ conformers as predicted by molecular modelling. Although distinct, these are close to two local minima shown on this map. Hence the usefulness of J-couplings in distinguishing between syn and anti-conformers is likely limited.
Fig. 6 Combined rmsds between the experimental and theoretical 3JCH and 3JCC coupling constants as a function of ϕr, ψr angles. The minima are located at (ϕr/°, ψr/°, rmsd/Hz) = (−10, −6, 0.22), (−181, −6, 0.28), (−10, 163, 0.57) and (−181, 163, 0.63). |
When RDCs based rmsds of α-D-cellobiose (Fig. 5) were combined with J based rmsds (Fig. 6), the global minima identified individually by each method reinforced each other and dominated the rmsd landscape (Fig. 7). Their closer inspection in steps of 10° based on the alignment of ring Iα or IIα (Fig. S6, ESI‡) positions them at (ϕr/°, ψr/°, rmsd/Hz) = (−10, −10, 1.60) or (−10, 0, 0.65), respectively. Analogous analysis of β-D-cellobiose yielded similar results (data not shown), but is not as informative because the 3JC1′, H4 coupling constant cannot be measured for this anomeric form. Addition of interglycosidic J couplings thus (i) eliminated the secondary minima produced by the analysis of the RDCs only, and (ii) moved the global minima to within 10°of the X-ray structure.
Fig. 7 2D rmsd maps between the experimental and theoretical RDCs and J coupling as a function of ϕr, ψr angles of α-D-cellobiose. (a) The rmsds of ring II were based on the alignment of ring I. The global minimum was found at (ϕr/°, ψr/°, rmsd/Hz) = (0, 0, 2.86); (b) the rmsds for ring I were based on the alignment of ring II. The global minimum was found at (ϕr/°, ψr/°, rmsd/Hz) = (0, 0, 1.61). |
In summary, the interpretation of RDCs and inter-ring J couplings in the approximation of a rigid glycosidic linkage yielded a solution conformation of cellobiose that is very close to that observed by X-ray in the solid state. Based on this analysis, it is very unlikely that the secondary minima on the RDC rmsd maps have a physical meaning, however, as both RDCs and J couplings report on an average conformation in solution this analysis does not rigorously exclude such a possibility.
Fig. 8 Sauson–Flamsteed plots for (a) α- and (b) β-D-cellobiose. Full and dotted lines represent the directions of the PAFs for the ring I and II, respectively. (c) and (d) show the orientation of PAFs based on ring I (blue) and ring II (orange) in α-D-cellobiose and β-D-cellobiose, respectively. Differences (Δ) in the eigenvalues (%) and orientations (°) between the two corresponding order matrices are also given. |
This analysis clearly shows that despite identical (β-D-cellobiose) or similar (α-D-cellobiose) GDOs of rings I and II, the alignment of these rings is not identical. This was found to be the case for both α- and β-D-cellobiose and is the reason why the analysis of RDCs in the approximation of a rigid glycosidic bond produced different dihedral angles (Fig. 5) depending on which ring was used to calculate the order matrix. Overall, the above analysis implies that some flexibility is associated with the glycosidic linkage of cellobiose.
An exact solution of an order matrix requires as a minimum five RDCs,12,60 hence their quality can only be assessed if additional RDCs are available. This exercise is only meaningful if a sufficient number of unique RDCs, i.e. measured along non-collinear directions, can be obtained. In hexopyranose chairs of carbohydrates many bond vectors, e.g. axial CH vectors, have, at least to a first approximation, identical orientation. When such degenerate RDCs are excluded for rings Iβ and IIβ, a unique set of 9 (out of 12) and 6 (out of 12) RDCs is obtained, respectively; while 7 (out of 11) and 15 (out of 20) can be classed as unique for rings Iα and IIα, respectively. We therefore have a sufficient number of RDCs to characterise the alignment of α- or β-D-cellobiose rings. In addition, the very small rmsds (0.08 to 0.14 Hz) between the experimental and theoretical RDCs indicate that the obtained RDCs are both accurate and precise.
Unsurprisingly, the largest set of unique RDCs was collected for the IIα ring. This is due to the equatorial position of the Hα anomeric proton of α-D-cellobiose and a very good dispersion of 1H chemical shifts of this ring. Is this the reason why the GDO of this ring is the largest and differs most from GDOs of the other rings? In order to answer this question, we have recalculated the order matrix of ring IIα using a reduced set of RDCs. When only 7 out of 15 unique RDCs, and the same types as available for ring Iα were used, the GDO parameter changed from 4.82 ± 0.06 to 4.58 ± 0.18. The error bounds of the two GDOs just touch and the GDO based on the reduced set of RDCs is less precise. However, it is still larger than the average GDO of rings Iβ and IIβ (4.22) or the GDO of ring Iα (3.94). Monte Carlo simulations show a comparable spread of the eigenvector orientations (Fig. 8a and b) for all rings, with an exception of the Szz component of ring IIα. This is smaller due to the largest number of unique RDCs measured. In all instances the spread of eigenvectors is much smaller than the differences between the orientations of the Syy or Sxx principle axes for any of the rings. We can therefore conclude that the experimental RDCs are measured accurately and precisely and that the order matrix parameters derived for individual monosaccharide rings are meaningful.
Is there any evidence in our data for a specific interaction of either α- or β-D-cellobiose with the C12E5–hexanol–water aligning medium? Focusing initially on the strength of the alignment, there is practically no difference between the Iβ and IIβ rings, while larger differences were observed for rings Iα and IIα, the Szz and Sxx components in particular (Table 1). However, as seen in Fig. 8, the orientation of PAFs changes in an identical manner when switching the alignment from one based on the RDCs of the non-reducing to that of the reducing rings for both α- and β-D-cellobiose. The largest principal component does not change its orientation at all. Furthermore this orientation is practically identical for all four rings. At the same time the PAF rotates by about 24° (±4°) around the Szz axis. It is worth noting that 3JH1′,C4, which is sensitive to the ϕ angle, was identical (3.95 and 3.97 Hz) for both anomeric forms of cellobiose. Unfortunately, 3JC1′,H4, which is sensitive to the ψ angle could only be measured for α-D-cellobiose due to the higher order between H3 and H4 protons of β-D-cellobiose. The other ψ-dependent coupling constant, 3JC1′,C5, was small (2.1 Hz) and identical for both anomeric forms of cellobiose. All available corresponding interring RDCs (Table 2) were very similar between α- and β-D-cellobiose indicating that the conformation of the glycosidic linkage is the same for both molecules.
This analysis suggests that (i) the nature of the interaction with the C12E5–n-hexanol–D2O medium is very similar for both anomeric forms of cellobiose (ii) this interaction either affects the conformation of the glycosidic linkage of both forms equally, or it does not affect it at all (iii) the stronger alignment of the IIα ring (larger Szz and consequently larger 1DCH) is likely caused by steric factors reflecting the configuration of the anomeric center. The C1 axial OH group of ring IIα strengthens the alignment relative to that of ring IIβ, in which the OH group is equatorial.
By virtue of its magnitude, a large coupling constant cannot be an average of smaller numbers. In cellobiose, the 3JC2′,C4 reporting on the ϕ angle is large (3.1 Hz, the maximum on the corresponding Karplus curve is 3.65 Hz) as is the 3JC1′,H4 reporting on the ψ angle (5.0 Hz, the maximum on a corresponding Karplus curve is 5.6 Hz). These values imply that a dominant conformer of cellobiose (i) either occupies a limited ϕ/ψ space characterized by large interglycosidic coupling constants, i.e. the syn conformer of the X-ray structure, or (ii) that the glycosidic linkage interconverts between states characterized by large coupling constants across the glycosidic bond. Fig. 6 illustrates the fact that scalar coupling constants cannot fully distinguish between the syn and anti-conformers as both are characterised by large interglycosidic coupling constants. However, the anti-conformers referred to here have dihedral angles (ϕr, ψr) = (−181, −6) or (−10, 163), which differ from those calculated [(ϕr, ψr) = (+136, 12) or (4, −150)] by molecular dynamics simulations.20 As the corresponding interglycosidic couplings of the computed conformers are small, their population must be low; otherwise lower experimental coupling constants across the glycosidic bond would be expected.
In the approximation of a rigid glycosidic linkage and using RDCs only (Fig. 5), we have shown that ϕr = −10°, while ψr = −30°/−40° or 0° depending on which rings are used to calculate the order matrix. It must be emphasized that this interpretation of the experimental data is fundamentally flawed because the underlying assumption it relies on – the rigidity of the glycosidic linkage – is not fulfilled. Therefore the obtained dihedral angles cannot be taken at their face values. We also note that ψr = −30°/−40° is not compatible with the large 3JC1′,H4 (5.1 Hz) measured for α-D-cellobiose. Nevertheless, our analysis of RDCs and interglycosidic J couplings allows us to conclude that the prevailing conformation of cellobiose in solution is very similar to that observed in the solid state using X-ray crystallography.
The most important finding of our RDC investigation of cellobiose is the fact that the order parameters calculated based on the RDCs of rings I and II differed slightly (Fig. 8). Our calculations only assumed rigidity of individual monosaccharide rings. No other assumptions were made; no parameterisation or models were needed to interpret the experimental data. These admittedly small differences are indicative of segmental motions of a partially flexible glycosidic linkage. They can be a consequence of local variations of the glycosidic angles of about ±30° often seen in MD simulations of carbohydrates.28 It is possible that a stochastic averaging of local motions will yield different alignment parameters for individual rings and thus a different set of dihedral angles based on the analysis of RDCs in the approximation of a rigid structure as was the case here. Alternatively, the partial flexibility can also be a consequence of an occasional transition to an anti-conformer on either of the glycosidic angles. The detected flexibility can also be a combination of both motions. However, based on the treatment of the experimental data presented here, it is not possible to determine which is the case for cellobiose. We would like to note in passing that experimental parameters that could be potentially very useful in assessing the relative populations of the syn- and anti-conformers in carbohydrates are 1JCH coupling constants of anomeric and aglyconic carbons.61 Differences of up to 6 Hz were predicted between the two anomeric forms. However, their current parameterisation is not accurate and therefore these coupling constants cannot be used in conformational analysis of the glycosidic linkage at present, despite the fact that they are easily measureable with high accuracy.
Finally, several approaches were put forward to date for the interpretation of RDCs in flexible carbohydrates. These utilise SVD type analysis10 or predict the alignment without using the experimental RDCs.9,62 Studying cellobiose using these methods is, in our opinion, a challenging task due to the limited flexibility of this molecule and the existence of a dominant conformer as demonstrated by others and us. Nevertheless, these approaches can in principle shed more light on the nature of the motion underlying the limited flexibility of cellobiose.
It has been suggested that the stabilizing exo-anomeric effect66 tends to make the ϕ angle more rigid relative to the ψ angle. There is theoretical and experimental evidence to support this assertion. Tvaroška and Carver have analyzed 2-methoxytetrahydropyrans67 (ϕ angle) and 1,4-dimethoxytetrahydropyrans68 (ψ angle) using ab initio methods and found a deep and narrow global minimum for the ϕ angle, while a broad double-minimum spanning 80° was identified for the ψ angle. Based on these results the authors concluded that the ψ angle is more flexible than the ϕ angle.
A survey of 21 crystal structures of cellobiose and its related oligosaccharides69 showed that the variations of ϕ angles (up to 29°) are half of that for ψ angles (up to 58°). It is interesting to note that the acetylated carbohydrates cluster together (Fig. S7, ESI‡) and are separated from the other structures in their ψH angles. Nevertheless, even if this group was excluded, the range of ψ angles is substantial (up to 39°). Such spread of ψ angles fits well with a broad double-minimum predicted for the ψH values by Tvaroška and Carver.68 Another interesting observation is that the changes in ψ and ϕ angles in this set of related molecules are correlated (Fig. S7, ESI‡). This may also be the case in solution. In summary, the spread of ψH angles frozen out in the solid state indicates a possibility that a larger range of ψH could be explored in solution.
Overall, the above data suggest that the flexibility of the glycosidic linkage of cellobiose is more likely associated with the ψ rather than the ϕ dihedral angle. Longer cellulose oligosaccharides will need to be studied to decide if this flexibility can be attributed to the end-ring effects or if it is a genuine property of the ψ angles in cellulose.
We have suggested that larger flexibility is more likely associated with the ψ rather than ϕ angle. This could be due to occasional transition from syn- to trans-conformers on either angles, or stochastic local averaging of the glycosidic dihedral angles. From the perspective of cellulose, the movements detected in cellobiose can be exaggerated because both monosaccharide rings of cellobiose are effectively the “terminal” residues of cellulose. In order to establish if this is the case, detailed studies of longer cellulose oligosaccharides will be needed. Such oligosaccharides will likely require partial deuteration and/or 13C labeling in order to provide high quality experimental data suitable for their detailed conformational analysis.
The NMR methodology for accurate and precise measurement of large numbers of very small RDCs developed here is applicable to the conformational study of molecules ranging from simple organic molecules to complex carbohydrates. At the same time, we have shown that the detailed analysis of the alignment parameters of rigid molecular fragments can reveal low-level flexibility of molecules.
Footnotes |
† Celebrating 300 years of Chemistry at Edinburgh. |
‡ Electronic supplementary information (ESI) available: Sample preparation details, definition of the order parameter and calculations of rmsds, the 13C spectrum of cellobiose, examples of the determination of HH and CC splittings, 2D RDC or J-coupling constants-based rmsd maps, and a survey of dihedral angles of cellobiose related carbohydrates. See DOI: 10.1039/c3cp52987h |
This journal is © the Owner Societies 2013 |