Paúl Pinillos‡, Ander Camiruaga‡, Fernando Torres-Hernández, Francisco J. Basterrechea, Imanol Usabiaga* and José A. Fernández*
Department of Physical Chemistry, Fac. of Science and Technology, University of the Basque Country (UPV/EHU), Barrio Sarriena s/n, 48940, Spain. E-mail: josea.fernandez@ehu.es; i.usabiaga@outlook.es
First published on 7th February 2023
Sugars, together with amino acids and nucleobases, are the fundamental building blocks of a cell. They are involved in many fundamental processes and play especially relevant roles as part of the immune system. The latter is connected to their ability to establish a collection of intermolecular interactions, depending on the position of their hydroxyl groups. Here we explore how the position of the OH in C4, the anomeric conformation and the nature of the substituent affect the interaction with phenol, which serves as a probe of the preferred site for the interaction. Using mass-resolved excitation spectroscopy and density functional calculations, we unravel the structure of the dimers and compare their conformation with those found for similar systems. The main conclusion is that the hydroxymethyl group has a very strong influence, guiding the whole aggregation process, and that the position of the substituent in C4 has a stronger influence on the final structure of the dimer than the anomeric conformation.
To understand the whole recognition process, information regarding two aspects of the system is essential: a deep knowledge of the structure of mono- and poly-saccharides and of the interactions that they can establish with other molecules, such as the side chains of amino acids. Many research groups are contributing to the knowledge of the structure of saccharides. Since the pioneering works of Simons’ group using laser spectroscopy,7 the main advances have come from the high resolving power of microwave spectroscopy.8–15
On the other hand, the large size of the aggregates between monosaccharides and other molecules usually makes it necessary to use other techniques to extract structural information. Two techniques are commonly used: NMR and mass-resolved spectroscopy in jets. While the former gives important information regarding the behaviour of the saccharides in solution,16 the latter enables the characterization of the most stable structures in the absence of external perturbations.17 Yet, such studies are not easy. First, the saccharides usually do not contain a chromophore and it is therefore necessary to modify the molecule to include an aromatic substituent. This is usually done in the anomeric carbon, to avoid spontaneous α/β isomerization. Indeed, monosaccharides present a linear form, especially in solution, which allows them to interconvert between anomers. Second, sugars cannot be transferred to the gas phase by simple warming. They usually require a desorption system, which introduces additional noise and limits the maximum signal intensity achievable. Signals become less stable and the spectra noisier, complicating their acquisition and interpretation. This is probably one of the reasons why not many groups have engaged in the study of the structure of sugar aggregates.
Our group has contributed to this field with the study of the aggregation preferences of glucose (Glc), galactose (Gal) and several derivatives with a collection of molecules: from sugar derivatives to drugs, using several mass-resolved laser spectroscopic techniques.5,6,18–20
Here, we extend those previous studies to the aggregation preferences of Gal derivatives with phenol (PhOH, Scheme 1) and compare the results obtained with those found for the aggregation of Glc derivatives with PhOH. As can be seen, Glc and Gal differ only in the position of the hydroxyl group attached to C4. Such a small structural difference produces a re-accommodation of the remaining OH groups in the molecule and a change in the orientation of the intramolecular hydrogen bond network, also modifying the way in which the sugar unit interacts with other molecules.
Scheme 1 Structures of phenol, phenyl-β,D-glucopyranose, methyl-β,D-glucopyranose, methyl-α,D-glucopyranose and their galactose analogues.
Two derivatives of each monosaccharide were included in this study: with an –OMe and with an –OPhe group in the anomeric carbon, either in the α or in the β configuration. The comparison between the structures of the aggregates with each of the two substituents allowed us to evaluate the tendency of the sugar towards stacking, a type of interaction necessary to understand the aggregation of nucleobases. On the other hand, past studies demonstrated that the subtle structural variation introduced by the α/β conformation of the anomeric substituent resulted in very different aggregation structures and interaction energy values.6 Therefore, we also analyse here the anomeric effect on the structure of the aggregates.
Once the final set of structures was obtained, their IR spectra were simulated using the normal modes generated in the corresponding calculation and an algorithm that accounts for anharmonicity by introducing a scaling factor (0.953 for CHs and 0.9385 for OHs for the calculations at the M06-2X/6-311++G(d,p) level, and 0.968 for CHs and 0.9535 for OHs for the calculations at the B3LYP-D3/6-311++G(d,p) level). The broadening in the OH stretching transitions introduced by the formation of the hydrogen bonds was also included using a polynomial expression:
Δν = 1 + 80 × (3600 − XFreq)/(3600 − 3200)
Finally, the laser bandwidth was also introduced by convolving the spectrum (Lorentzian function for each frequency) with a Gaussian function of 5 cm−1 FWHM.
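The simulation procedure described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' code, and it rests on several assumptions that the text does not settle: the broadening expression is applied to the scaled frequency, Δν is taken as the Lorentzian full width at half maximum, the width is clamped at 1 cm−1 (the raw expression becomes negative above about 3612 cm−1), and CH modes are given a fixed 1 cm−1 width. The function names and the single mode in the example are hypothetical.

```python
import math

# Scaling factors quoted in the text for each level of theory.
SCALE = {
    "M06-2X": {"CH": 0.953, "OH": 0.9385},
    "B3LYP-D3": {"CH": 0.968, "OH": 0.9535},
}

def oh_width(freq):
    """Broadening expression from the text, clamped at 1 cm-1 (assumption)."""
    return max(1.0, 1.0 + 80.0 * (3600.0 - freq) / (3600.0 - 3200.0))

def lorentzian(x, x0, fwhm):
    """Area-normalized Lorentzian centred at x0."""
    hw = fwhm / 2.0
    return (hw / math.pi) / ((x - x0) ** 2 + hw ** 2)

def gaussian_kernel(step, fwhm=5.0):
    """Discrete, unit-sum Gaussian kernel modelling the 5 cm-1 laser bandwidth."""
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    half = math.ceil(4.0 * sigma / step)
    weights = [math.exp(-0.5 * (i * step / sigma) ** 2) for i in range(-half, half + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def simulate_ir(modes, grid, functional="M06-2X"):
    """modes: list of (harmonic_freq, intensity, kind), kind being 'OH' or 'CH'.
    grid: evenly spaced wavenumbers in cm-1."""
    raw = [0.0] * len(grid)
    for freq, intensity, kind in modes:
        scaled = freq * SCALE[functional][kind]
        width = oh_width(scaled) if kind == "OH" else 1.0  # CH width: assumption
        for i, x in enumerate(grid):
            raw[i] += intensity * lorentzian(x, scaled, width)
    # Convolve the Lorentzian spectrum with the Gaussian laser profile.
    kernel = gaussian_kernel(grid[1] - grid[0])
    half = len(kernel) // 2
    out = []
    for i in range(len(grid)):
        acc = 0.0
        for j, w in enumerate(kernel):
            k = i + j - half
            if 0 <= k < len(grid):
                acc += w * raw[k]
        out.append(acc)
    return out

# Hypothetical example: one OH stretch with a harmonic frequency of 3700 cm-1.
grid = [2800.0 + 0.5 * i for i in range(2001)]
spectrum = simulate_ir([(3700.0, 1.0, "OH")], grid)
peak = grid[spectrum.index(max(spectrum))]
```

With these choices a hydrogen-bonded OH stretch scaled to 3200 cm−1 receives an 81 cm−1 Lorentzian width, whereas a band at 3600 cm−1 or above keeps a 1 cm−1 width and is therefore dominated by the laser bandwidth.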
The spectra of PhGlc and PhGal appear very close in energy, but at slightly higher wavenumbers. They are very similar, with well-defined transitions that hide the contribution from several conformational isomers.
The conformational variability of hexoses arises from the flexibility of the hydroxymethyl group and the orientation of the intramolecular hydrogen bond network: either clockwise or anticlockwise. Previous studies using mass-resolved excitation spectroscopy (MRES) already identified three isomers of PhGlc, differing in the relative orientation of the hydroxymethyl group,33 and another three isomers of PhGal, although in the latter, one of the isomers is based on a different orientation of the hydrogen bond (hbond) ribbon.34
In stark contrast to the spectra of the monomers, those of the dimers are unstructured absorptions, with a very limited number of discrete features. The broadening seems to be connected to the nature of the aggregation partner. Previous reports on the MRES of sugar units and on their aggregates demonstrated that even disaccharides tagged with a chromophore present discrete spectra.3 A background absorption starts appearing in the electronic spectra of the monohydrates35 and becomes more evident in the dihydrates36,37 to become unstructured absorptions in the dimers with larger molecules.38 However, the loss of vibronic structure does not hamper recording a clean and discrete mass-resolved IR spectrum, as demonstrated previously.38
Slightly larger differences between experiment and predictions were found for β-MeGlc⋯PhOH, for which the calculations predict a smaller shift for the OH stretching of the phenolic hydroxyl group. Apparently, the experimental structure presents a stronger hbond than predicted. The simulation for the second most stable isomer (option 2 in the figure) better reproduces the position of the OPhH stretch (Fig. S5, ESI†), but at the expense of a worse agreement with the bands around 3600 cm−1. Those bands are due to the stretching of the OH groups involved in mild interactions and therefore, present the lowest anharmonicity. In consequence, they should be described more accurately by the computational methods used. In conclusion, we prefer to maintain the assignment to the most stable structure.
Interestingly, a single isomer was found for each aggregate, except for α-MeGlc⋯PhOH, for which the experimental traces recorded probing different wavenumbers point to the existence of at least two isomers. Determining the exact number of isomers for these systems is not an easy task, as double resonance techniques such as UV/UV hole burning cannot be used, owing to the unresolved nature of the electronic spectra. However, the IR spectra were recorded at several wavelengths, always obtaining the same results, except for the above-mentioned case of α-MeGlc⋯PhOH.
Fig. 3 Assigned structures of PhOH aggregates with β-PhGlc, β-PhGal, β-MeGlc, β-MeGal, α-MeGlc and α-MeGal, computed at the M06-2X/6-311++G(d,p) level. Numbers in brackets are the relative energy and the binding energy in kJ mol−1 calculated at the specified temperatures. The rest of the computed structures may be found in the ESI.†
There is a clear preference for phenol to act as a proton donor to the sugar unit, independently of the anomer and/or the anomeric substituent. Also interesting is the strong propensity for the interaction of PhOH with the hydroxymethyl group. Although the calculations demonstrate that several conformers with very similar stability are possible, it seems that the molecules always find a low-energy-barrier path to the global minimum, except for α-MeGlc⋯PhOH.
Interestingly, the axial/equatorial position of the hydroxyl moiety in C4 also modifies the attack angle of the phenol, through the alteration of the position of the hydroxymethyl group. Such modification is amplified and transmitted to the substituent, mainly because, in all the systems studied, there is a clear preference for maximizing the OPhH⋯O interaction while the CH⋯π contacts appear as secondary interactions. This is clearly seen comparing β-MeGlc and β-MeGal aggregates: in both cases PhOH “attacks” one of the lone pairs of the O6H. However, the latter is below the plane of the sugar ring in β-MeGlc to interact with O4H and above the plane in β-MeGal. This difference is mainly a rotation of the C4C5C6O6 dihedral angle, from −71 to 71 degrees, and results in a completely different orientation of the PhOH, which in the former interacts with the CringH atoms and with the CβH3 in the latter. A similar shift in the position of the PhOH was observed for α-MeGlc/Gal.
Dimer | Binding energy 0 K (kJ mol−1) | σ(PhOH)/cm−1 | σ(OH)sugar/cm−1 |
---|---|---|---|
β-PhGlc⋯PhOH | −44.3 | 3390 | 3595, 3645 |
β-PhGal⋯PhOH | −45.6 | 3360 | 3600, 3606, 3632, 3645 |
β-MeGlc⋯PhOH | −43.0 | 3420 | 3529, 3642 |
β-MeGal⋯PhOH | −48.4 | 3405 | 3548, 3596, 3636 |
α-MeGlc⋯PhOH I | −44.2 | 3380 | 3562, 3594, 3634 |
α-MeGlc⋯PhOH II | −50.3 | 3415 | 3521, 3618, 3643 |
α-MeGal⋯PhOH | −50.9 | 3390 | 3541, 3594, 3616 |
The anomeric conformation does not seem to have a strong influence on the binding energy. The largest changes in these values seem to be related to the conformation of the substituent in C4, at least for the methyl-substituted species. It is clear from the table that the α-/β-MeGal species present ∼10% higher binding energy values than their homologous Glc species.
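The percentage quoted above can be checked directly against the 0 K binding energies in the table. The short sketch below does this; the dictionary keys and the helper `relative_gain` are illustrative names, not part of the original work.

```python
# Binding energies at 0 K (kJ mol-1), copied from the table above.
binding = {
    "beta-MeGlc": -43.0,
    "beta-MeGal": -48.4,
    "alpha-MeGlc I": -44.2,
    "alpha-MeGlc II": -50.3,
    "alpha-MeGal": -50.9,
}

def relative_gain(gal, glc):
    """Percentage by which the Gal dimer is more strongly bound than the Glc dimer."""
    e_gal = abs(binding[gal])
    e_glc = abs(binding[glc])
    return 100.0 * (e_gal - e_glc) / e_glc

for gal, glc in [
    ("beta-MeGal", "beta-MeGlc"),
    ("alpha-MeGal", "alpha-MeGlc I"),
    ("alpha-MeGal", "alpha-MeGlc II"),
]:
    print(f"{gal} vs {glc}: {relative_gain(gal, glc):.1f}% stronger binding")
```

Run on the table values, the gain is about 13% for the β pair and about 15% for α-MeGal relative to isomer I of α-MeGlc, but only about 1% relative to isomer II, so the size of the effect for the α anomers depends on which α-MeGlc isomer is taken as the reference.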
The shift in the position of the OH stretching band is usually a good indicator of the strength of a hydrogen bond. As can be seen in Fig. 4, two different correlations may be established between these two parameters: one for α-/β-MeGal plus β-MeGlc and a second linear correlation with a smaller slope for the rest of the species plus β-MeGlc. The second isomer assigned for α-MeGlc does not fit well in this general picture, although it is closer to the values for α-/β-MeGal. The steeper slope of the red fit means that the increase in binding energy produces a smaller shift in the position of the stretching of the phenolic OH, indicating that the extra energy may come from other interactions instead of the reinforcement of the intermolecular hydrogen bond. Indeed, when the strength of the interactions is computed using the bond critical points, all the values fit to a single straight line (Fig. 5).
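The two correlations can be reproduced approximately from the tabulated values with an ordinary least-squares fit. The sketch below is only an illustration: the grouping of the dimers follows our reading of the paragraph above (β-MeGlc belongs to both sets and isomer II of α-MeGlc is left out), and it is not known whether the fits in Fig. 4 were obtained in exactly this way.

```python
def linear_fit(points):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# (phenolic OH stretch / cm-1, binding energy at 0 K / kJ mol-1), from the table.
data = {
    "beta-PhGlc": (3390, -44.3),
    "beta-PhGal": (3360, -45.6),
    "beta-MeGlc": (3420, -43.0),
    "beta-MeGal": (3405, -48.4),
    "alpha-MeGlc I": (3380, -44.2),
    "alpha-MeGal": (3390, -50.9),
}

# Grouping assumed from the text; beta-MeGlc is shared by both correlations.
group_gal = ["alpha-MeGal", "beta-MeGal", "beta-MeGlc"]
group_rest = ["beta-PhGlc", "beta-PhGal", "alpha-MeGlc I", "beta-MeGlc"]

slope_gal, _ = linear_fit([data[name] for name in group_gal])
slope_rest, _ = linear_fit([data[name] for name in group_rest])

print(f"MeGal set:  {slope_gal:.3f} kJ mol-1 per cm-1")
print(f"other set:  {slope_rest:.3f} kJ mol-1 per cm-1")
```

With this grouping the slope for the MeGal set is roughly six times larger than for the remaining dimers, which is in line with the statement that, for the former, a given gain in binding energy is accompanied by a comparatively small shift of the phenolic OH stretch.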
Fig. 4 Binding energy of the dimers studied in this work vs. position of the stretching of the phenolic OH.
Fig. 5 Strength of the hbonds in the systems studied in this work vs. shift in the position of the OH stretching. The hbond strength was estimated using the bond critical points. The numbers correspond to the structures in Fig. 3. The values calculated for the O4H⋯OPhH interaction in structures 7 and 8 were added in red. The red circle labelled as “Free Glc/Gal” corresponds to the values for the OH groups not directly bonded to PhOH in Glc and Gal. Values for PhOH and water OH stretching were also added for comparison.
Comparison of the results obtained in the present work with those from the monohydrates may help us understand the influence of the interactions due to dispersive forces on the final structure of the aggregates. However, one of the problems encountered in such a comparative study is that some of the published works are already close to 20 years old and call for a re-investigation, especially because the computational levels used to interpret the experimental results were not as accurate as the state-of-the-art functionals. Indeed, the carbohydrate–aromatic interaction is considered a test for DFT calculations, because of the delicate balance between the interactions involving OH and CH groups.41
Nevertheless, there is general agreement that water inserts in the weakest intramolecular hbond of the sugar.42 In Glc and mannose, water tends to interact with O4H as a proton-acceptor and, at the same time, to donate to O6, forming an eight-membered ring.33,35,43,44 Additional studies on Glc-water found a secondary isomer with the water molecule trapped between the O6H and the ring's oxygen atom,34 which is precisely the preferred conformation of Gal monohydrates.36 Thus, in the case of the sugar–water interaction, the preferred solvation site changes with the monosaccharide, while in the case of the dimers with phenol, the global minimum always has the phenol molecule acting as proton-donor to the sugar's O6H. This is true at least for the two monosaccharides studied in this work. This may be the result of several factors: phenol is a better proton-donor than water, and therefore, it may be more efficient for the system to place the phenol molecule at the tail of the hydrogen-bond network. It is also a poorer hydrogen bond acceptor, and therefore, it may not be able to insert between the O4H and O6, as water does in its interaction with Glc. Furthermore, by adopting the position shown in Fig. 3, it may be able to maximize other secondary interactions.
To evaluate the relative weight of each of these factors, it would be interesting to extend the studies on phenol–sugar dimers to other monosaccharides, in which water forms what have been termed “insertion structures”: isomers in which water inserts between O3H and O4H or between O2H and O3H, breaking an intramolecular hydrogen bond.43,45–48
Regarding the sugar–aromatic interaction, several studies demonstrated that the optimal interaction takes place with the hydrogen atoms of C3, 4 and 5.41,49–54 These interactions are maximized in those monosaccharides in which all the OHs are on one side of the plane of the ring, such as in fucose.53 Interestingly, a computational study on fucose–phenol dimer using CCSD(T) and extrapolating to the basis set limit, predicted a structure for the global minimum substantially different from those in Fig. 3, with the phenol forming part of an O5H⋯OPhOHH⋯Oring hydrogen bond network and the aromatic ring pointing away from the sugar. Conversely, in all the systems studied here, phenol acts as a proton-donor to the hydroxymethyl group. Assuming that the authors did not overlook any isomer, an always present risk in this kind of complex system with a multitude of local minima, the reason for this difference with the dimers studied in this work may be the absence of a hydroxymethyl group in fucose. This functional group may completely modify the conformational landscape. Certainly, the global minimum of fucose–water presents an insertion structure, as mentioned above,46 differing from the structures found for Glc/Gal-water.33,37
Comparison between the structures in Fig. 3 and those found for monosaccharide–aromatic (aromatic = benzene, toluene or indole) aggregates40,41,49–54 shows that the hydroxyl group is a game changer. While in the former systems the aromatic always lies parallel to the saccharide ring, the phenol–sugar interaction is guided by the optimization of the hydrogen bond. Only if the orientation of the O6H allows it can phenol adjust the position of its aromatic ring to also maximize the interaction with the aliphatic hydrogens of the monosaccharide, such as in α-/β-Glc⋯PhOH. It is also worth mentioning that the interaction with phenol fixes the orientation of the hydroxymethyl group in a single position, collapsing the collection of rotamers observed for the monosaccharides into a single species. This is also in line with the observations of several authors, who connect the interaction with the solvent with a simplification of the conformational landscape of the sugar units.36
The other determining factor of the structure adopted in the aggregate is the position of the O4H, axial in galactose and equatorial in glucose. Such a small difference propagates through the intermolecular hydrogen bond network and determines the orientation of the O6H group, to which phenol is anchored, strongly influencing its final position. This observation is in agreement with previous studies that attributed to the intramolecular hydrogen bond network a kind of amplification effect that facilitates the “reading” of the small structural differences between sugars by the receptor, improving the sensitivity and specificity of the sugar–receptor interaction.
Footnotes
† Electronic supplementary information (ESI) available: Additional computed structures and predicted IR spectra. See DOI: https://doi.org/10.1039/d2cp06036a
‡ These two authors contributed equally to this work.
This journal is © the Owner Societies 2023