Open Access Article
Weizhun
Yang
a,
Sherif
Ramadan
ab,
Jared
Orwenyo
c,
Tayeb
Kakeshpour
a,
Thomas
Diaz
a,
Yigitcan
Eken
a,
Miloslav
Sanda
d,
James E.
Jackson
*a,
Angela K.
Wilson
*a and
Xuefei
Huang
*aef
aDepartment of Chemistry, Michigan State University, 578 South Shaw Lane, East Lansing, MI 48824, USA. E-mail: jackson@chemistry.msu.edu; wilson@chemistry.msu.edu; xuefei@chemistry.msu.edu
bChemistry Department, Faculty of Science, Benha University, Benha, Qaliobiya 13518, Egypt
cDepartment of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742, USA
dDepartment of Oncology, Georgetown University, Washington DC 20057, USA
eDepartment of Biomedical Engineering, Michigan State University, East Lansing, MI 48824, USA
fInstitute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
First published on 31st August 2018
N-Linked glycopeptides have highly diverse structures in nature. Herein, we describe the first synthesis of rare multi-antennary N-glycan bearing glycan chains on 6-OH of both α1,6- and α1,3-linked mannose arms. To expedite divergent generation of N-glycan structures, four orthogonal protective groups were installed at the branching points on the core tetrasaccharide, which could be removed individually without affecting one another. In addition, the synthetic route is flexible, allowing a bisecting glucosamine moiety to be introduced at a late stage of the synthesis, further expanding the diversity of sequences that could be achieved. The bisecting glucosamine unit significantly reduced the glycosylation yields of adjacent mannoses, which was attributed to steric hindrance imposed by the glucosamine based on molecular modelling analysis. The N-glycans were then transformed to oxazoline donors and ligated with a glycopeptide acceptor from haptoglobin promoted by the wild type Arthrobacter endo-β-N-acetylglucosaminidase (Endo-A). Endo-A exhibited interesting substrate preferences depending on donor sizes, which was rationalized through molecular dynamics studies. This is the first time that a glycopeptide bearing a bisecting N-acetyl glucosamine (GlcNAc), the rare N-glycan branch, and two LewisX trisaccharide antennae was synthesized, enabling access to this class of complex glycopeptide structures.
During the past decade, tremendous advances have been made in synthesis of highly complex N-glycans with many innovative methods developed.14–22 However, to date, N-glycans bearing a branch at 6-OH of mannose E (α1,3-arm) have not been synthesized. Given the structural complexity of N-glycans, synthesis of asymmetric antennae at each branching point remains a highly challenging task. To address this, we report our strategy enabling synthetic access to rare structures of N-glycans bearing antennary glycans on 6-OH of mannose E. To generate diverse glycan structures, a core tetrasaccharide was designed with potential branching points strategically protected with temporary protective groups that can be orthogonally removed. Furthermore, suitable conditions have been identified for the late stage installation of the bisecting GlcNAc moiety. The reducing ends of the synthetic N-glycans were then converted to oxazoline and transferred to a GlcNAc functionalized glycopeptide by endo-β-N-acetylglucosaminidase (Endo-A),23–28 providing access to glycopeptides bearing these rare N-glycan sequences.
To improve the overall efficiency of glyco-assembly, our N-glycan synthesis design hinges on divergent modifications of a key tetrasaccharide 1 bearing four protective groups, i.e., tert-butyl-diphenyl silyl (TBDPS), fluorenylmethyloxycarbonyl (Fmoc), 2,2,2-trichloroethoxycarbonyl (Troc) and levulinoyl (Lev), which are strategically placed at the potential branching points on mannoses D and E and may be removed orthogonally. Orthogonal deprotections have been shown to be powerful strategies for divergent synthesis of a large number of glycans including N-glycans.16,29–31 The free 4-OH of the mannose-C unit in 1 was kept free for late stage installation of the bisecting GlcNAc.
:
2.5%
:
2.5%) solution for 2.5 h. After filtration, the resins were washed with trifluoroacetic acid (2 × 10 mL), and the volume of the combined filtrates was concentrated to 1 mL, then absolute Et2O (15 mL) at 0 °C was added dropwise to the residues. The precipitates were separated from the mother liquor by centrifugation and washed with cold Et2O (10 mL). The crude products were dissolved in methanol and treated with NaOMe to cleave the ester protecting group on sugar, then neutralized with AcOH and the solvent was removed. Finally, the crude mixture was dissolved in H2O and subjected to a SUPELCOSILTM LC-18 HPLC column (25 cm × 10 mm, 5 mm) or (25 cm × 4.6 mm, 5 mm) for purification. The solvent systems used were A (0.1% TFA in H2O) and B (acetonitrile (MeCN)) with detection at 220 nm and 254 nm. Mass spectra were obtained by ESI mass spectra (Water Xevo G2-S Q-TOF LC-MS instrument).
:
CH3OH (1/1) was added hydrazine acetate (1 mmol), which was prepared in situ by mixing acetic acid and 51% hydrazine (4/1). The reaction was stirred at room temperature for 3 h, after which it was diluted with ethyl acetate and washed with sat. NaHCO3 and brine. The organic phase was dried (Na2SO4), filtered, and the filtrate was concentrated under reduced pressure. The residue was purified by flash column chromatography.
:
1, 5 mL) was added zinc dust (200 mg). The reaction was stirred at room temperature for 20 min, after which it was filtered by celite and the filtrate was concentrated under reduced pressure. The residue was purified by flash column chromatography.
:
nBuOH (5 mL, 1/4) was stirred at 90 °C overnight. The volatiles were evaporated, and the residue was purified by Sephadex LH-20. The crude product was dissolved in 0.5 mL pyridine followed by the addition of 0.05 mL acetic anhydride-1,1′-13C2 and the reaction was stirred overnight. The residue was purified by Sephadex LH-20. The products were then de-acetylated using sodium methoxide in MeOH (1.5 mL) overnight. The reaction mixture was neutralized by IR-120, filtered and concentrated in vacuum, and purified by Sephadex LH-20. To the products dissolved in MeOH
:
H2O
:
DCM (4/1/1, 2 mL), Pd(OH)2 (100% by weight) was added, and the mixture was hydrogenated overnight. The reaction mixture was filtered through cotton and concentrated. The residues were purified by G-15 gel filtration chromatography using water as the eluent. The collected solution containing the product was lyophilized to obtain the product as a white powder.
The volumes of N-glycan oxazolines 39 and 41 were calculated using the van der Waals volume QSAR descriptor of MOE, using a connection table approximation to calculate 2D molecular descriptors. The compounds were then non-covalently docked with the docking program within MOE. Binding poses were refined using an induced fit refinement method.
The geometries of the N-glycan oxazoline compounds were optimized using the Gaussian 16 program package. The optimizations were performed using the AM1 method.38 The obtained Mulliken charges were used with the antechamber module of Amber 16 in the generation of parameters for the N-glycan compounds. The systems were prepared using the Leap module of AmberTools16 (ref. 32) under the AMBER ff14SB and GAFF force fields. Each enzyme complex was solvated in a 14 Å cube of TIP4P-Ew water beyond the solute and 100 mM sodium chloride. The systems were relaxed under NVT conditions over six minimization procedures with decreasing restraints on the protein of 500.0, 200.0, 20.0, 10.0, 5.0 kcal mol−1 Å−2 to no restraints. The systems were then heated to 300 K over 30 ps. Atomistic molecular dynamics simulations were performed for 30 ns at 300 K and 1 atm using AMBER 16. The SHAKE algorithm constrained bonds involving hydrogen.39 The trajectories were produced using Langevin dynamics and the pressure of the system was regulated with isotropic position scaling. Long-range electrostatic effects were modeled using the particle-mesh Ewald method with a 10 Å cutoff.
The produced trajectories were analyzed using AMBER 16 and visualized with MOE and the UCSF Chimera package. Free energy of binding was calculated for every picosecond using the Poisson Boltzmann model form the MMPBSA.py module of AmberTools and AMBER 16. The relative free energy trends between models were compared so solute entropy was neglected.
27 (Scheme 1). Upon removal of the p-methoxybenzyl (PMB) group from 2, the glycosylation between the newly generated acceptor 3 and donor 4 went smoothly as promoted by N-iodosuccinimide (NIS) and TfOH to provide trisaccharide 5 in 88% yield. Trisaccharide 5 was treated with 80% AcOH to remove the benzylidene moiety followed by glycosylation with donor 7 to afford tetrasaccharide 1 bearing four different protecting groups at the branching points.
![]() | ||
| Scheme 1 Synthesis of tetrasaccharide 1 bearing different protective groups at the four strategic branching points. | ||
We next focused on the introduction of bisecting GlcNAc to the free 4-OH of tetrasaccharide 1. Glycosylation of the 4-OH group is known to be challenging as it is flanked by two glycans.18,40,41 Prior strategies to synthesize N-glycans with the bisecting glucosamine unit typically introduced the bisecting residue at 4-OH first followed by the removal of the protective group on 6-OH and installation of the α1,6-arm.27,42–47 The direct glycosylation of an acceptor containing free 4-OH with both α1,3- and α1,6-arms is attractive as it reduces protective group manipulations needed. Unverzagt and coworkers successfully developed this approach although a large excess (10 equiv.) of glycosyl donors was needed to overcome the low nucleophilicity of the 4-OH.18,40,41
In our synthesis, we first tested the direct glycosylation of tetrasaccharide 1 with 1.5 equiv. of glucosamine donor 8 (ref. 48) in the presence of NIS/TfOH. Surprisingly, while the desired pentasaccharide was detected from the reaction mixture by mass spectrometry, a pentasaccharide side product containing an iodide group was also formed, which likely resulted from electrophilic substitution of the product by the electron-deficient iodonium ion generated from NIS. The side product could not be separated from the desired pentasaccharide. While NIS/TfOH is a promoter system widely used to activate thioglycosides,49 the side reaction of iodination was not unprecedented as the Yu group reported aglycon iodination during NIS promoted thioglycosylation of a steroid derivative.50 As an alternative to NIS/TfOH, we tested the reaction of 1 and 8 using p-TolSCl/AgOTf51 as the promoter. In this case, however, a pentasaccharide side product containing a p-TolS moiety was found.
We hypothesize that the most likely location in tetrasaccharide 1 that was electrophilically substituted would be the methyl or methylene group α to the ketone carbonyl of the Lev. As electron-withdrawing protective groups, i.e., Troc and Lev, are on the same mannose unit of 1, the acidity of α-H of ketone may be higher. This consideration led us to switch the protective groups of Fmoc and Lev on the mannosyl donors, i.e., using new donors 9 and 12. The coupling between donor 9 and disaccharide acceptor 3 went smoothly, which followed by subsequent benzylidene removal gave the trisaccharide 11 in 77% yield (Scheme 2). Selective glycosylation at the primary hydroxyl group of 11 by mannose donor 12 generated tetrasaccharide 13. The 4-OH of tetrasaccharide 13 was protected as the O-acetate (tetrasaccharide 14) to synthesize N-glycans without the bisecting moiety.
The bisecting glucosamine was successfully introduced via the reaction of 13 with 2 equiv. of donor 8 with NIS/TfOH promoter to form pentasaccharide 15 (Table 1, entry 1). Although the yield was a modest 37%, the undesired iodide substitution products were not detected. When 3 equiv. of donor 8 was used, the yield increased to 53% (Table 1, entry 2). Further increasing the amount of donor did not improve the yield significantly, but rendered purification more complicated. The major side product was due to the reaction of succinimide generated from NIS with the activated glycosyl donor, which competed with the formation of the desired product. The promoter p-TolSCl/AgOTf was utilized next instead of NIS/TfOH. With 1.5 equiv. of donor 8, the desired pentasaccharide 15 was obtained in 44% yield (Table 1, entry 3). Increasing the amount of donor from 1.5 to 3.0 equiv. led to a significant increase of the yield of glycosylation to 81% (Table 1, entries 3–5). This compared favorably with several literature approaches18,40,41 for introduction of the bisecting glycan as good glycosylation yield was obtained without the need for a large excess (10 equiv.) of the glycosyl donor.
With the key pentasaccharide 15 in hand, the possibility of orthogonally removing each of the temporary protective groups was explored. As shown in Scheme 3, treatment of 15 with triethylamine selectively removed the Fmoc group in 95% yield without affecting any other protective groups. The Lev group in 15 could be deprotected by hydrazine acetate, while Troc and TBDPS moieties were cleaved by Zn/AcOH and HF·pyridine respectively in good yields.
![]() | ||
| Scheme 3 The four temporary protective groups on key pentasaccharide 15 could be orthogonally removed without affecting one another. | ||
With the orthogonality of protective group removal established, glycosylations of the newly liberated hydroxyl groups were tested for the construction of asymmetrically branched N-glycans. To understand the impact of the bisecting glycan on glycosylation, both pentasaccharide 17 and tetrasaccharide 20 obtained from Lev removal of 14 were utilized as acceptors.
Glycosylation of pentasaccharide 17 with donor 8 promoted by NIS/AgOTf gave hexasaccharide 21 in 53% yield (Scheme 4). Other promoter systems such as NIS/TMSOTf, NIS/TfOH and p-TolSCl/AgOTf were also tested, but did not lead to better yields. Compound 21 was treated with Et3N followed by glycosylation by disaccharide 23 and removal of TBDPS, affording octasaccharide 24 in 43% overall yield. LewisX trisaccharide thioglycosyl donor 26 was prepared through a one pot procedure in gram scale.52 Introduction of donor 26 onto the primary OH of octasaccharide 24 proceeded to give undecasaccharide 27 in 33% yield, where 21% of the unreacted acceptor was recovered in the reaction.
To synthesize glycans without the bisecting glycan, analogous glycosylations with donors 8, 23, 26 followed by subsequent deprotections were performed on the tetrasaccharide acceptor 20 (Scheme 4). The penta-, hepta- and deca-saccharides 22, 25 and 28 were efficiently synthesized in 71%, 70% and 74% yields respectively, significantly higher than the analogous yields for forming 21, 24 and 27 (53%, 43% and 33%). These results demonstrate that the bisecting glucosamine negatively impacted glycosylation yields at branching points.
To better understand the effect of the bisecting glycan, molecular modeling studies were performed. Both glycans 17 and 20 were subjected to molecular dynamics (MD) simulations at 900 K. From the resulting MD frames, 1000 conformers were extracted and optimized using the AM1 semi-empirical method. The conformers within 6.0 kcal mol−1 from the lowest energy conformation were selected for comparisons. It was noteworthy that the glycan 17 was more rigid than 20, as there were fewer distinct conformers of 17 than 20 in this energy range (Table S1†). Conformational analysis showed that the free 2-OH of compound 17 was much more sterically hindered than that in compound 20. The average number of atoms within 5 Å of the OH group in the low energy conformers of 17 was quantified and found to be significantly higher than that in compound 20 (42.9 vs. 35.9 Table S1†). Furthermore, the OH group was pointing at the bisecting glucosamine unit in many conformers of 17 (Fig. 2). There were no additional hydrogen bond donors interacting with the free OH group in the optimized structures of 17 compared to that in 14. Therefore, the lower yields of glycosylation of 17 were mainly attributed to the increased steric hindrance around the OH group caused by the bisecting glucosamine unit.
As N-glycans are attached to proteins, we next explored the possibility of introducing the rare N-glycan to peptides with haptoglobin peptide (amino acids 233–254)53,54 as a representative acceptor. The transglycosylation strategy by endoglycosidases23–28 pioneered by the Wang lab can enable convergent synthesis of glycopeptides by transferring glycosyl oxazoline donors to peptide acceptors bearing a GlcNAc on an asparagine residue. While the successful transfer of a bisected N-glycan by the endoglycosidase Endo-A has been reported,27 the donor tested did not contain α1,3- or α1,6-glycan branches. Furthermore, it is unclear how the bisecting GlcNAc may impact the transglycosylation reaction.
To synthesize N-glycans for haptoglobin glycopeptides, compounds 15 and 14 were first treated with Zn/AcOH and HF/Py giving diols 29 and 30. LewisX trisaccharide thioglycosyl donor 26 glycosylated diols 29 and 30 using the NIS/AgOTf promoter system followed by subsequent Lev and Fmoc removal producing undeca- and deca-saccharides 31 and 32 in 40% and 51% yields respectively (Scheme 5a). To prepare tri-antennary N-glycan, the Lev group in 29 and 30 was selectively removed. The resulting triols were glycosylated with LewisX trisaccharide thioglycosyl donor 26 leading to tetradeca- and trideca-saccharides 33 and 34 (Scheme 5b).
The four oligosaccharides 31–34 were deprotected by first incubating with ethylenediamine in n-butanol at 90 °C,31 which removed all ester and carbonate groups as well as the Phth to generate free amines (Scheme 6). To facilitate future quantification by mass spectrometry,55,56 the free amines were protected with 13C labeled acetic anhydride to introduce 13C labels into the GlcNAcs. Subsequent hydrogenolysis with Pearlman's catalyst under a hydrogen atmosphere57 led to free N-glycans 35–38.
With the free N-glycans in hand, they were transformed to the respective oligosaccharide oxazolines 39, 40, 41 and 42 using a mild chloroformamidinium dehydrating agent 43 (Scheme 7a), which were directly subjected to the enzymatic transglycosylation reactions with GlcNAc bearing haptoglobin glycopeptide 44 as the acceptor (Scheme 7b). Glycosyl oxazoline 40 was first tested using endoglycosidases Endo-A (wild type) and Endo-M (wild type), and their respective Endo-A N171A, Endo-M N175A and Endo-M N175Q mutants as catalysts. Interestingly, we found that wild type Endo-A was able to transfer the glycan to give glycopeptide 46 in 73% overall yield, while all other enzymes tested failed to generate significant quantities of desired products. Next, the bisecting GlcNAc containing oxazoline 39 was utilized as the donor. Similar to donor 40, Endo-A promoted the transglycosylation reaction efficiently producing glycopeptide 45 in 65% yield. These results suggest that the presence of the bisecting GlcNAc does not negatively affect the donor abilities of 39 and glycosyl oxazolines containing bisecting GlcNAc and complex glycans such as rare α1,3- and α1,6-branches can be suitable donors for transglycosylation to glycopeptides.
To test the limit of the Endo-A enzymes, tri-antennary N-glycan oxazolines 41 and 42 bearing three LeX moieties were subjected to transglycosylation reactions with acceptor peptide 44. However, no desired glycopeptides were observed with Endo-A. Nor did the Endo-M or any of the Endo-A N171A, Endo-M N175A and Endo-M N175Q mutants tested give desired glycopeptides.
To better understand the divergent behavior of glycosyl oxazolines 39vs.41, computational studies were performed on their complexes with Endo-A. The initial coordinates of Endo-A were obtained from the Protein Data Bank34 (PDB ID: 3FHA)35 and N-glycan oxazolines 39 and 41 were docked into the binding pocket of Endo-A using the docking program within Molecular Operating Environment v.2016.08 (MOE).36 Binding poses were scored using the GBVI/WSA ΔG scoring function. It is known that W93, N171, E173, Y205, F125, W216, F243, W244 and Y299 are the active site residues interacting with the donor substrate.35 Poses with the oxazoline ring positioned within range of these residues were simulated using atomistic MD with the AMBER 16 software package.32 The lowest energies poses with the oxazolines remaining in the binding pockets were obtained and analyzed.
MD simulations demonstrate that the binding energy of 39 and Endo-A is on average −81.44 ± 11.10 kcal mol−1 (Table 2). In contrast, the binding energy of 41 is significantly lower at −56.24 ± 9.95 kcal mol−1, suggesting the much weaker binding of 41 by Endo-A. In crystal structures of Endo-A as well as that of Endo-A complexed with a tetrasaccharide oxazoline substrate,35 the indole rings of W216 and W244 are perpendicular to each other, possibly forming a gate sealing off the active site. When oxazoline 39 was docked into Endo-A, the indole of W244 was moved to be parallel to that of W216 to allow the substrate to enter the active site. In all poses generated from subsequent MD simulations, the indole of W244 returned to the original perpendicular orientation to W216 (Fig. 3a). However, in the complex of the tri-antennae bearing donor 41 with Endo-A, the additional antenna of 41 is located between these two tryptophan residues, possibly preventing the rotation of the indole ring. As a result, the two indoles of W244 and W216 remained close to parallel to each other (Fig. 3b). This may have precluded the formation of a closed active site for catalysis, reducing the yields for transglycosylation.
| Compound | Binding pose | Binding energy (kcal mol−1) |
|---|---|---|
| 39 | 1 | −72.97 ± 6.04 |
| 2 | −94.00 ± 9.15 | |
| 3 | −77.36 ± 7.96 | |
| Average | −81.44 ± 11.10 | |
| 41 | 1 | −52.08 ± 11.26 |
| 2 | −60.17 ± 11.56 | |
| 3 | −55.26 ± 7.95 | |
| 4 | −58.80 ± 7.83 | |
| 5 | −54.86 ± 11.17 | |
| Average | −56.24 ± 9.95 | |
Footnote |
| † Electronic supplementary information (ESI) available: Synthetic procedures, characterization data, Table S1 as well as NMR spectra. See DOI: 10.1039/c8sc02457j |
| This journal is © The Royal Society of Chemistry 2018 |