Open Access Article
This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

Discovery of selective monosaccharide receptors via dynamic combinatorial chemistry

Miguel Alena-Rodriguez ab, Marcos Fernandez-Villamarin a, Ignacio Alfonso b and Paula M. Mendes *a
aSchool of Chemical Engineering, University of Birmingham, Edgbaston, Birmingham, West Midlands B15 2TT, UK. E-mail: p.m.mendes@bham.ac.uk
bDepartment of Biological Chemistry, Institute for Advanced Chemistry of Catalonia, IQAC-CSIC, Jordi Girona 18-26, 08034 Barcelona, Spain

Received 4th January 2024 , Accepted 11th April 2024

First published on 12th April 2024


Abstract

The molecular recognition of saccharides by synthetic hosts has become an appealing but elusive task in the last decades. Herein, we combine Dynamic Combinatorial Chemistry (DCC) for the rapid self-assembly and screening of virtual libraries of receptors, with the use of ITC and NMR to validate the hits and molecular modelling to understand the binding mechanisms. We discovered a minimalistic receptor, 1F (N-benzyl-L-phenylalanine), with considerable affinity for fructose (Ka = 1762 M−1) and remarkable selectivity (>50-fold) over other common monosaccharides. The approach accelerates the discovery process of receptors for saccharides.


Introduction

The precise recognition of saccharides, more commonly known as carbohydrates or sugars, has gained significant relevance in the past years due to their key role in the appropriate functioning of organisms. Alteration of their normal physiological levels, whereas they are free or conjugated with more complex biomolecules, is associated with the development of diseases such as diabetes, Crohn's disease, or cancer.1–3 Therefore, the recognition of saccharides by artificial hosts represents an excellent approach for the molecular study of these diseases.

Despite the apparent simplicity of sugars, these molecules are a challenging target in biological samples. The three most common monosaccharides are glucose, fructose, and galactose, which are isomers with a 6-carbon backbone and different arrangements of their functional groups. The similarity among sugars, together with the fact that they are normally heavily hydrated in aqueous environments,4,5 has made the selective recognition of carbohydrates a challenging task that has attracted researchers for the last 35 years.6,7 From the first sugar-receptor ever designed by Aoyama in 1988,8 with discrete affinities and selectivities for their target, and unable to work in aqueous media to probably the most powerful sugar receptor synthesised yet, developed by Davis in 2019,9 the field of molecular recognition of monosaccharides have witnessed enormous progression. However, these advances were mostly based on rational design and ultimately, proof and error of individual candidates.

Herein we propose an approach that could potentially accelerate the discovery process. The core of this methodology is the well-known technique named Dynamic Combinatorial Chemistry (DCC).10–14 DCC relies on the dynamic generation of interconvertible species via reversible reactions between starting building blocks (BBs), and the ability of this system to respond to external stimuli such as a change in temperature, pH, or the addition of an external molecule (template). This last situation will be exploited in this work, the template being a carbohydrate that will trigger the self-assembly and system-enrichment of the best possible hosts with the highest affinity for a given sugar template (Fig. 1).


image file: d4ob00015c-f1.tif
Fig. 1 Schematic representation of DCC protocol. The figures on top represent the starting building blocks in the Dynamic Combinatorial Library (DCL) reversibly reacting with one another. In the bottom, a screening of the DCLs in absence (blank experiment) and in presence (templated experiment) of the saccharide template (represented as a puzzle piece). The library member with the highest affinity for the template gets amplified in the templated experiment (highlighted in green).

DCC has been thoroughly implemented in the successful discovery of receptors for a number of biomolecules: peptides,15 DNA,16 RNA,17 proteins,18 complex glycosaminoglycans19 and even whole living cells.20 Nevertheless, this technique has been far less explored in the discovery of receptors for monosaccharides,21 perhaps due to their aforementioned difficulties. In this work, we present a novel workflow that combines the rational selection of BBs to perform DCC experiments with the individual synthesis and validation by ITC and NMR of the most promising candidates. The knowledge obtained from these analyses with regards to the most relevant features to achieve binding can then be implemented in the re-design of further optimised Dynamic Combinatorial Libraries (DCLs) that will produce even better fitting receptors.

Results and discussion

In order to design the DCL we considered BBs that meet three crucial requirements: (i) possess functional groups capable of reversible exchange, (ii) contain groups potentially able to interact with the templates, and (iii) can offer multivalency, as this is known to enhance binding affinities. In terms of the geometrical shape of the desired receptors, there are a few designs that have proven to be effective in the past for the recognition of sugars. From 3D cage-like structures,9,22 to linear oligomers that fold to encapsulate their target.23 However, we pursued a more simple design that could be afforded with inexpensive and readily available starting materials. With this in mind, we selected the BBs shown in Fig. 2a. Isophthalaldehyde (2) as the central scaffold would ensure reversibility and multivalent library members upon imine formation reaction with BBs A, B, W, F, and D. This set of molecules would enrich the library with an array of functionalities capable of creating interactions with the target saccharides. In particular, boronic acids (A) are known to bind diols,24 extensively present in saccharides. Phenylalanine (F) and aspartic acid (D) were recently reported to have excellent binding properties for sugar-containing glycans.25 Furthermore, tyrosine (represented by 4-(aminomethyl)phenol, B) and tryptophan (W) are also known to be extensively present in binding pockets of carbohydrate-recognition lectins.26 In summary, the BBs selected ensured a DCL rich in functionalities able to participate in both covalent and non-covalent interactions with the target saccharides. Particularly, we promoted the possibility of CH–π interactions as these have proven to drive molecular recognition processes with glycans and carbohydrates.27,28
image file: d4ob00015c-f2.tif
Fig. 2 (a) Building blocks selected for our DCC experiments. (b) Imine formation reaction in equilibrium followed by irreversible reduction with NaCNBH3. The non formation of 2X-OH was confirmed by LC-MS. (c) Protocol for the syntheses of 1F and 1D. (d) Representation of the pyranose (left) and furanose (right) forms of monosaccharides in solution. α-D fructose drawn as an example.

Since imines are not stable in the conditions required for the analysis of the DCLs,29 we reduced them with NaCNBH3 prior analysis. This reduction step is irreversible and therefore the equilibrium composition was frozen, as shown in Fig. 2b. NaCNBH3 was chosen as it is a mild reducing agent that effectively reduces imines to their corresponding secondary amines (2X, Fig. 2b).30 The products of reduction of the aldehyde groups in 2X (2X-OH, Fig. 2b) were not detected by LC-MS in any DCC experiment (ESI). With these BBs and the conditions previously optimised by us, we performed different DCC experiments employing four isomeric sugars as templates: D-glucose, D-mannose, D-galactose and D-fructose. Each experiment consisted of a positive and a negative experiment, performed as detailed in ESI. The DCLs were analysed by LC-MS and library members were detected and quantified according to the corresponding (M + H)+ ions formed in the extracted ion chromatogram (EIC). The potentially competing reductive amination reaction between the amine building blocks and the open forms of the saccharides was ruled out since the corresponding by-products were not detected by LC-MS in any of the assayed libraries.

Amplification (A) values were calculated as the division between the LC-MS peak intensity (in the EIC) in presence and in absence of the template, meaning that values A > 1 correspond to actual amplification of the species in the presence of the template while values A < 1 means the opposite. This methodology therefore infers that A values are inevitably related to the binding constant of a library member and a template, but it is worth highlighting that when comparing different DCC experiments, A value is rather a function of selectivity and not of affinity for the template of study in each experiment.31 In other words, A2DD-glucose = 3.43 means that 2DD should have a better binding affinity (higher Ka) than any other virtual DCC member with A < 3.43 in the experiment with glucose as template. However, A1F-fructose = 2.11 could correspond (and in fact it does, see below) with a higher Ka value for the corresponding 1F-fructose complex.

DCC experiments were performed in triplicate and the average A values, plus the standard deviation, are plotted in Fig. 3a–d (left Y axes, coloured bars). While both the net A values and their deviations varied quite drastically across the different experiments, the trend remained the same in experiments with glucose, mannose, and galactose, with library member 2DD standing out as the best receptor in all the three cases. The experiment with fructose resulted in a much different output with 2D and 2F as the most promising receptors in a more competitive environment. In order to correlate A values with affinity for the template, library members 2D, 2F, 2DD, and 2FF were individually synthesised by reductive amination, and they were tested by ITC to find their binding constants towards the saccharides of interest.


image file: d4ob00015c-f3.tif
Fig. 3 DCC results (amplification, left Y axes, coloured bars) and ITC results (Ka, right Y axes, red dots) plotted together for the experiments with (a) glucose, (b) mannose, (c) galactose, and (d) fructose. *The molecules tested by ITC were 1F and 1D but the plotted ‘A’ values in those positions in the bar graph were those of 2F and 2D, respectively.

Commercially available amino acids D and F were assayed too. As the main objective of this research was to create a simple methodology for the rapid screening and testing of hits, we decided to simplify the synthesis of monosubstituted products 2D and 2F. Benzaldehyde (1), instead of isophthalaldehyde (2), was employed (Fig. 2c) thus avoiding the extra synthetic steps or tedious purification protocols derived from the presence of an extra reactive aldehyde in the starting material. Hence, the molecules tested by ITC were 1D and 1F, differing from the actual library members in the DCC experiment in the lack of a CHO group in meta in the central aromatic ring. However, that group being distant from the potential recognition region of the molecule, we believe that its absence should not affect the overall binding properties of the receptors. Notably, ITC and DCC results were in good agreement which suggests that this substitution did not endanger the usability of the protocol (Fig. 3).

In the experiments with glucose, mannose, and galactose as templates 2DD was the best receptor, as predicted by DCC. Ka values were in the three cases in the range of 45 M−1 (Table 1). The similar behaviour of the three monosaccharides did not come as a surprise since they are isomeric structures, differing only in the spatial orientation of their hydroxyl groups. More interesting results were found in the experiment with fructose. The dissimilarity observed by DCC between fructose and the other sugars was supported by ITC data, with the highest Ka values being significantly higher than for the other saccharides. Remarkably, Ka1D-fructose and Ka1F-fructose were 1548 ± 98 and 1762 ± 162 M−1, respectively. To the best of our knowledge, the binding constant exhibited by 1F for D-fructose (1762 M−1) is among the highest yet reported by synthetic small molecules for such saccharide in water environments. It surpasses reported values of phenylboronic acid (PBA, Ka = 560 M−1) or o-amino PBA (Ka = 1640 M−1),32 both relying on covalent bond formation with the sugar. However, there are examples of receptors of this type that outperformed 1F, for instance, the bis-boronic acid reported by Fossey et al. that binds D-fructose with Ka = 130 × 103 M−1.33

Table 1 Affinity values (Ka, M−1), measured by ITC, of the set of molecules and sugars tested, and selectivity of molecule 1F for fructose calculated by dividing the affinity of 1F for fructose over its affinity for the other saccharides tested
  D-Glucose D-Galactose D-Mannose D-Fructose
D 0.0 0.0 10.7 ± 1.2 170.7 ± 39.6
F 0.0 0.0 0.0 1136.0 ± 148.3
1D 0.0 0.0 11.7 ± 0.7 1548.3 ± 98.1
1F 0.0 17.9 ± 2.6 31.9 ± 2.1 1762.0 ± 162.4
2DD 44.5 ± 2.0 56.7 ± 0.3 47.7 ± 3.5 840.7 ± 39.0
2FF 0.0 0.0 0.0 289.0 ± 21.0
 
Selectivity >100 98.4 55.2 1.0


Fructose is an isomer to the other tested sugars, but it is the only ketose of the series, which produces a considerably higher amount of its five-membered furanose forms in comparison with the other saccharides tested (28.6% vs. 0.1–6%)34,35 and a different structural arrangement around the anomeric carbon (see furanose and pyranose forms of D-fructose in Fig. 2d). It was also interesting to find that 1D and 1F show better binding affinity than their bidentate analogues 2DD and 2FF (and better than D and F alone), suggesting that our DCC approach led to the optimal size for the receptors. The ITC titration graph of molecules 1D and 1F with D-fructose also revealed that the binding is mainly entropy driven (ΔS dominates ΔG, ESI). For molecules like these ones that are small and relatively flexible, it can be hypothesized that this relatively strong binding may be attributed to desolvation processes. Another interesting and unexpected outcome was the positive behaviour of 2DD in the experiments with glucose, mannose, and galactose. Posing only 1 aromatic unit, 2DD outperformed other competitors with greater ability to offer CH–π interactions. It could be proposed that H bond formation was the preferential intermolecular force between 2DD and such sugars. In terms of selectivity, the binding affinity of 1F for fructose was 55 times higher than for mannose and 98 times higher than for galactose (Table 1). Its affinity for glucose was too weak to be accurately measured with our ITC equipment, which suggests that the selectivity of 1F for fructose over glucose should be even higher (>100-fold) than over galactose.

Molecule 1F was further analysed by NMR titrations to understand the mechanisms behind the binding phenomena. From the binding constants obtained from ITC, we estimated that the optimal concentrations of receptor and sugar to maximize the formation of 1F-fructose complex was 10 mM for both. Unfortunately, due to low solubility of 1F, titrations were performed at lower concentrations, which limits the applicability of the results. 1F was kept constant at 0.9 mM and different concentrations of fructose (below, above, and at 0.9 mM) were tested. The maximum displacements of chemical shifts were observed at equimolar concentrations. Under these conditions all the signals in 1F shifted upfield, although they did it to different extents. Aromatic protons (7.45–7.20 ppm) experienced very small change (Fig. 4, blue dotted line). It was also appreciated a subtle modification in the shape of some of those peaks (Fig. 4, dotted rectangles) although this region of the spectrum is too complex to be analysed in detail. Both groups of protons next to the secondary amine (i.e., benzylic, and alpha amino acid protons, green and red lines in Fig. 4, respectively) suffered the largest displacements (Δδ = 0.02 ppm). Finally, the beta protons of amino acid (orange line in Fig. 4) shifted Δδ = 0.008 ppm. The peaks of fructose did not shift at all. These results identify the most perturbed central moiety of 1F as key for the binding. The combined implication of the aromatic residues should not be disregarded though, especially considering that, as outlined before, the molecule with two aromatic rings (1F) outperforms those with three and one of them (2FF and F).


image file: d4ob00015c-f4.tif
Fig. 4 Stacked 1H NMR spectra (400 MHz, carbonate buffer in D2O, 298 K) of receptor 1F (bottom), fructose (middle) and the equimolar mixture of 1F and fructose (top). The structure of the receptor 1F is drawn and the shift of the signals can be visualised with the coloured dotted lines.

Considering the limited information obtained from the NMR titration experiments, we undertook molecular modelling studies on the intriguing [1F-fructose] complex. To this aim, we used the anionic carboxylate form of 1F and different isomers of D-fructose. We hypothesized that 1F is able to bind native forms of fructose according to the minor changes observed in the saccharide 1H NMR signals upon titration with 1F. Thus, we carried out Monte Carlo conformational searches with OPLS4 force field minimizations in implicit water followed by DFT geometry optimizations with water PCM solvation model (Fig. 5, see ESI for details).


image file: d4ob00015c-f5.tif
Fig. 5 Optimized structures for the complexes formed between 1F and β-fructopyranose (left) or β-fructofuranose (right) collectively accounting for >90% of sugar isomers in water. Monosaccharides are displayed with green C-atoms and H-bonds are depicted by black dashed lines. Computed relative overall energies (ΔErel), binding energies (ΔEbind = Ecomplex − (E1F + Esugar)) as well as solvation energies (ΔEsolv = EwaterEgas) of the corresponding complexes are given in kcal mol−1. For other representative optimized structures, see ESI.

The results support the favourable formation of stable supramolecular complexes, as reflected in the negative values of ΔEbind. The structures are stabilized by hydrogen bonding interactions between sugar OH groups and 1F-carboxylate, in addition to potential C–H⋯π contacts between the substrate and the aromatic rings of the host. In the case of the five-member ring, the phenylalanine side chain is mainly implicated while for the pyranose form (most abundant isomer and most stable complex), both aromatic rings are pointing towards the sugar (Fig. 5). This defines a tweezer-like arrangement of carboxylate and aromatic rings that would efficiently de-solvate both host and guest, explaining the entropically-driven binding obtained experimentally by ITC. Actually, the highly negative solvation energies (ΔEsolv) reflect the key role of the aqueous medium for the stabilization of the structures. Moreover, the most stable complex bears concerted H-bonds between carboxylate of 1F and both hemiacetal–OH and C1–OH of β-fructopyranose, suggesting a potential explanation for the observed ketose selectivity over the aldoses where this specific arrangement is not possible.

Conclusions

In summary, this study provides a robust DCC methodology for the rapid discovery of receptors for the most common monosaccharides in Nature. Library member 2DD -two units of aspartic acid connected in meta to a benzene ring- was found to be the best receptor among the library members for D-glucose, D-mannose, and D-galactose, though with low selectivity among sugars. The D-fructose-templated DCC experiments identified molecule 1F (N-benzyl-L-phenylalanine), as the best host, with a considerably large binding constant (Ka = 1762 M−1) and selectivity (>50-fold) over the other saccharides tested. These are remarkable results considering that all tested sugars are isomers. Virtually any type of saccharide could be studied with this methodology. Likewise, the complexity and size of the DCL can be extended up to their analytical limitations, making this approach easily scalable with the potential to discover new selective saccharide receptors.

Author contributions

M. A. conducted the experimental work, P. M., I. A. and M. F.-V. reviewed the manuscript.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

This work is financially supported by Prostate Cancer UK (RIA17-ST2-020) and MCI/AEI/10.13039/501100011033 (ref. project PID2021-128411NB-I00).

References

  1. D. Bruen, C. Delaney, L. Florea and D. Diamond, Sensors, 2017, 17, 1–21 CrossRef PubMed.
  2. F. Clerc, et al. , Gastroenterology, 2018, 155, 829–843 CrossRef CAS PubMed.
  3. J. Munkley and D. J. Elliott, Oncotarget, 2016, 7, 35478–35489 CrossRef PubMed.
  4. G. V. Oshovsky, D. N. Reinhoudt and W. Verboom, Angew. Chem., Int. Ed., 2007, 46, 2366–2393 CrossRef CAS PubMed.
  5. A. Davis, Nature, 2010, 464, 169–170 CrossRef CAS PubMed.
  6. T. D. James, M. D. Phillips and S. Shink, The Molecular Recognition of Saccharides, 2006 Search PubMed.
  7. A. P. Davis, Chem. Soc. Rev., 2020, 49, 2531–2545 RSC.
  8. Y. Aoyama, T. Yasutaka, T. Hiroo and H. Ogoshi, J. Am. Chem. Soc., 1988, 110, 635–637 CrossRef.
  9. R. A. Tromans, et al. , Nat. Chem., 2019, 11, 52–56 CrossRef CAS PubMed.
  10. J. M. Lehn, Chem. – Eur. J., 1999, 5, 2455–2463 CrossRef CAS.
  11. J.-M. Lehn and A. V. Eliseev, Science, 2001, 291, 2331–2332 CrossRef CAS PubMed.
  12. F. B. L. Cougnon and J. K. M. Sanders, Acc. Chem. Res., 2012, 45, 2211–2221 CrossRef CAS PubMed.
  13. M. Mondal and A. K. H. Hirsch, Chem. Soc. Rev., 2015, 44, 2455–2488 RSC.
  14. P. Frei, R. Hevey and B. Ernst, Chem. – Eur. J., 2019, 25, 60–73 CrossRef CAS PubMed.
  15. K. C. Nicolaou, et al. , Chem. – Eur. J., 2001, 7, 3824–3843 CrossRef CAS.
  16. A. M. Whitney, S. Ladame and S. Balasubramanian, Angew. Chem., Int. Ed., 2004, 43, 1143–1146 CrossRef CAS PubMed.
  17. A. Bugaut, J. J. Toulmé and B. Rayner, Org. Biomol. Chem., 2006, 4, 4082–4088 RSC.
  18. L. Monjas and A. K. Hirsch, Future Med. Chem., 2015, 7, 2095–2098 CrossRef CAS PubMed.
  19. D. Carbajo, et al. , J. Med. Chem., 2022, 65, 4865–4877 CrossRef CAS PubMed.
  20. D. Carbajo, Y. Perez, J. Bujons and I. Alfonso, Angew. Chem., Int. Ed., 2020, 54, 17202–17206 CrossRef PubMed.
  21. M. Rauschenberg, S. Bomke, U. Karst and B. J. Ravoo, Angew. Chem., Int. Ed., 2010, 49, 7340–7345 CrossRef CAS PubMed.
  22. C. Zhai, et al. , Chem. – Eur. J., 2023, 29, e202300524 CrossRef CAS PubMed.
  23. N. Chandramouli, et al. , Nat. Chem., 2015, 7, 334–341 CrossRef CAS PubMed.
  24. X. Wu, et al. , Chem. Soc. Rev., 2013, 42, 8032–8048 RSC.
  25. Z. Chen, et al. , NPG Asia Mater., 2018, 10, e472 CrossRef CAS.
  26. H. C. Siebert, et al. , Proteins: Struct., Funct., Genet., 1997, 28, 268–284 CrossRef CAS.
  27. K. L. Hudson, et al. , J. Am. Chem. Soc., 2015, 137, 15152–15160 CrossRef CAS PubMed.
  28. L. L. Kiessling and R. C. Diehl, ACS Chem. Biol., 2021, 16, 1884–1893 CrossRef CAS PubMed.
  29. J. T. Goodwin and D. G. Lynn, Angew. Chem., Int. Ed., 1992, 6, 951–952 Search PubMed.
  30. C. F. Lane, Synthesis, 1975, 135–146 CrossRef CAS.
  31. P. T. Corbett, J. K. M. Sanders and S. Otto, Chem. – Eur. J., 2008, 14, 2153–2166 CrossRef CAS PubMed.
  32. H. R. Mulla, N. J. Agard and A. Basu, Bioorg. Med. Chem. Lett., 2004, 14, 25–27 CrossRef CAS PubMed.
  33. W. Zhai, L. Male and J. S. Fossey, Chem. Commun., 2017, 53, 2218 RSC.
  34. T. Barclay, et al. , Carbohydr. Res., 2012, 347, 136–141 CrossRef CAS PubMed.
  35. K. Inoue, et al. , Molecules, 2011, 16, 5905–5915 CrossRef CAS PubMed.

Footnote

Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4ob00015c

This journal is © The Royal Society of Chemistry 2024