Open Access Article
Kudakwashe
Nyandoro
a,
Charles M. G.
Lamb
a,
Haoran
Yu
b,
Jian
Shi
a and
Derek
Macmillan
*a
aDepartment of Chemistry, UCL, 20 Gordon Street, London, WC1H 0AJ, UK. E-mail: d.macmillan@ucl.ac.uk
bSchool of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China
First published on 17th October 2022
Homogeneous glycoprotein syntheses have become possible in the last decade due to advances in chemical ligation strategies, particularly Native Chemical Ligation (NCL). For native glycoproteins this still requires laborious and technically challenging syntheses of glycopeptide components, combined with multi-segment ligation reactions. Here we explore new reactions between sugar-linked acyl transfer auxiliaries and peptide thioesters. We show that native glycoproteins are difficult to produce using this approach but various related analogues are accessible. The results show that site-specific neoglycoconjugation is a viable route to simply glycosylated proteins, which may be extended using well-documented enzymatic processes.
The carbohydrate moiety confers desirable properties upon the protein to which it is attached. These include prolonged longevity in the blood, which is particularly relevant to therapeutic proteins, reducing the dosing regimen and consequently the potential for intolerance and error, as well as protecting the protein backbone from proteolysis, and can target the protein to the appropriate site within the body.4 The N-glycans in the Fc domain of antibodies are also crucial for directing effector function.5
Despite over 100 years of carbohydrate chemistry, the biosynthetic process has not been replicated using organic synthesis. Formation of the N-glycopeptide bond typically involves amide bond formation between suitably protected aspartic acid and glycosylamine building blocks in a process introduced by Lansbury.6 Further advances have increased the efficiency of N-glycoside bond formation7 but N-glycoprotein chemical synthesis remains the endeavour of specialized synthetic groups that not only construct the short peptides decorated with complex sugars, but also combine those short peptides into larger bioactive proteins, using chemoselective coupling processes such as native chemical ligation (NCL).8 In an alternative strategy full length proteins of biological origin, and with N-acetylglucosamine (GlcNAc) pre-installed at a specific location as a result of pre-existing cellular glycosylation, are redesigned using endo-glycosidases.1,9 An important challenge is how to attach this first required GlcNAc residue in recombinant proteins, at a site of your choice?
Here we explore a new paradigm for semi-synthetic glycoprotein synthesis by uniting synthetic monosaccharide building blocks with fully recombinant protein, containing the unnatural amino acid “ThioD” (1),10 or “BnE” (2),11 and their corresponding hydrazides (Scheme 1).
This work encouraged us to prepare sugar-linked acyl transfer auxiliaries that may enable site-specific attachment of GlcNAc to fully recombinant proteins with pre-installed thioesters such as ThioD (Scheme 2). To this end, a reductive amination between glycosylamine 2a16 and aldehyde 317 was investigated, but found to produce a mixture of products including imine 4 and over-reduced product 5, with the desired product 6 forming in only 36% yield at best. This was not unexpected, considering the dearth of examples of reductive aminations featuring glycosylamines in the literature. However, once isolated, p-methoxybenzyl (PMB) cleavage, using 3-nitro-2-pyridylsulfenyl chloride (NPysCl) produced the mixed disulfide 7 in 41% yield. In an attempt to overcome the troublesome reductive amination, a transamination process was also investigated, using the fully unprotected glycosyl amine 2b.18 A test transamination with benzylamine proceeded efficiently and, following purification was observed to be exclusively β-configured. Upon extension to benzylic amine 8,19 the reaction was found to proceed similarly and 9 was isolated in 40% yield. Once more the S-PMB protecting group was exchanged to form the mixed disulfide. However 10 proved unstable to chromatography and was used without further purification. For comparison, our methylene extended building block 11 was also prepared. Whilst unnatural, it has precedent as a valuable glycopeptide analogue.20 As expected, the synthesis of 11 was more straightforward. Glycosyl nitrile 1221 was reduced by catalytic hydrogenation in the presence of Boc2O allowing isolation of pure 13 in 82% yield.20a,22 Utilising 13 in place of 2a simplified the reductive amination and subsequent steps, confirming that the hemi-aminal motif present within 2 was the source of synthetic complications. It is noteworthy that, en-route to 7, 10 or 11, the intermediates were not significantly exposed to aqueous conditions. The NPys unsymmetrical disulfide 11, formed from treatment of 14 with NPysCl, existed as a 2
:
1 mixture of desired product and symmetrical disulfide, which complicated analysis. We found that 11 could be reduced with tris-carboxyethylphosphine (TCEP) and the thiol 15 was isolated by preparative HPLC, which, although adding an additional step and compromising yield, afforded a compound that was significantly simpler to characterise. In contrast, 7 or 10 were not stable to the same reducing conditions. Nevertheless, both were used in NCL reactions, assuming that their disulfides would be reduced under ligation conditions.
With building blocks in hand, we turned our attention to model ligation reactions using preformed peptide thioesters.
In these experiments (Fig. 1) C-terminal peptide thioester 16, derived from β-interferon residues 76–83 (sequence: H-STGWNETIG-SCH2CH2SO3H) bearing a natural glycosylation site (N80) was initially employed. The C-terminal thioester was first investigated to avoid complicating factors that may arise from additional reagents used for in situ thioester formation at the sidechain, and potentially problematic aspartimide formation. Otherwise we considered it an acceptable model for thioD containing peptides. 16 was incubated with sugar-linked auxiliaries 7, 10 and 15. Reactions progressed to consume 16 but, in our hands, and under various reaction conditions (varying pH, thiol additive, and temperature), only 15 furnished the expected product, with 17 being isolated as the major species. When using 7 or 10, the only peptide-based material that could be identified in the reaction was 18, derived from hydrolysis of the glycosyl amine followed by ligation to the unglycosylated auxiliary. LC-MS also showed two signals, (labelled 7*) corresponding to 7 (by mass) which may have arisen from anomerisation. We reasoned that, since the expected product is a glycosyl amide and relatively stable, hydrolysis of 7, followed by ligation of the free auxiliary was the likely source of the undesirable conjugate. We attempted to combat this by conducting the ligation in organic solvent (DMF, Et3N)23 but no reaction was observed under these conditions. This phenomenon was not unique to the trimethoxybenzyl (Tmb) class of acyl transfer auxiliary. We encountered similar difficulties in the preparation and ligation of “naturally-linked” conjugates to the 2-mercapto-2-phenethyl auxiliary developed by Seitz.24 Again, the introduction of a methylene spacer to afford 19 (Fig. 2a) solved the problem of building block hydrolysis during ligation. 19 was prepared in a similar manner to 15, employing reductive amination between 13 and aldehyde 20.2519 was obtained upon exposure of 21 to triethylsilane and TFA in DCM. To make progress towards protein glycosylation we further examined ligation reactions between 15 or 19 and free ThioD and reactions were complete within 24 h (see ESI†). Whilst encouraging, it proved difficult to isolate the conjugates to free ThioD and so ligation using 19 was also performed against peptide 16 (Fig. 2b). In this case the product 22 was easily isolated and the auxiliary removed to afford the methylene bridged N-glycopeptide analogue 23 in 74% yield. In a final model reaction 19 was ligated to peptide 24 (H-STGWNETIG-OH), exploring reaction at the N80 asparagine (bold and underlined) residue of β-interferon (Fig. 3a). Because the sidechain thioester is not stable to Fmoc-based SPPS, it was produced in situ from the corresponding hydrazide.14 In contrast to the reaction with 16, the ligation product was clearly observed as an inseparable pair of epimers, which returned to a single species, 25, following cleavage of the racemic auxiliary (see ESI†). Confident that 15 and 19 served as reliable building blocks we turned to recombinant protein modification although, due to the ease of synthesis and use of 19 relative to 15, only his building block was progressed.
![]() | ||
| Fig. 2 (a) Synthesis of sugar-linked auxiliary 19. Reagents and conditions (i) TFA, DCM, 3 h, rt, 96%, then 20, NaB(OAc)3H, DCM, 2% v/v acetic acid, rt, 4 h, 50% over 2 steps. (ii) TFA, Et3SiH, DCM, rt, 2 h, 96%. (b) Ligation reaction between 19 and 16 with accompanying HPLC traces of isolated products. Reagents and conditions (iii) 19 (1.3 equiv.), 0.1 M Na phosphate; pH 7, 6 M guanidine.HCl, 0.2 M MPAA, 60 mM TCEP, 25 °C, 16 h, 34%. (iv) 0.5 M TCEP, 2 M morpholine (pH 8.5), rt, 5 h, 74%.24b | ||
Sperm whale myoglobin, expressed from bacterial cells co-transformed with plasmid pMyo4TAG26 and cognate ThioD tRNA synthetase (ThioDRS)10 was an available model system and so was employed in proof of concept experiments. Using published procedures for protein expression we could isolate the protein containing ThioD (26) in yields of approximately 1 mg L−1 and characterise it by SDS-PAGE and mass spectrometry (Fig. 3b). The presence of the thioester was confirmed by treatment of the protein, typically obtained at concentrations of 18.5 μM, with 2.5% v/v hydrazine hydrate, which showed complete conversion to the hydrazide within 2 h at room temperature (see ESI†). In contrast to reaction with hydrazine, myoglobin reacted only slowly with 19. The reaction progressed to approximately 20% conversion after 6 h as judged by LC-MS despite employing a large excess of 19, and heating to 37 °C. Heating to 37 °C for prolonged periods of time was generally unhelpful resulting in visible sample precipitation, loss of the of the protein's characteristic red/brown colour, and deterioration of the MS Signal. Encouragingly, only initial protein thioester 26, product 27a, and disulfide linked adduct between 27b and 19, were the major species observed in reactions by LC-MS. However, despite significant attempts to improve the reaction by (i) raising or lowering the reaction temperature (ii) conducting the reaction in 3 M guanidine hydrochloride, (iii) addition of MPAA, we were unable to improve the data. Unlike in our model systems there was no evidence of an MPAA thioester intermediate. We considered that the low conversion may be due to either the increased steric bulk of 19 and protein 29 relative to the model systems, or the oxidising nature of the model protein. Whilst 19 was significantly more stable towards oxidation relative to 15, it still contains a benzylic thiol that is susceptible to oxidation. However, the oxidised dimeric form of 19 was not observed to accumulate significantly under the reaction conditions in the first 6 h and conducting reactions in the presence of 10 mM EDTA had a negligible effect on the outcome. Evidence of expected by-products such as thioester hydrolysis or aspartimide formation were only observed after significantly longer reaction times (>6 h). Just as we were keen to investigate whether extending the glycoside by a methylene unit aided ligation of GlcNAc to model peptides, we additionally became interested in whether extending the protein sidechain from Asp to Glu might allow us to overcome the obstacle of low ThioD reactivity towards 19. The corresponding “ThioE” is not available and so we prepared Glu thioesters from the benzyl ester (BnE) using the method recently reported by Xuan and co-workers.11 From an inhouse PylRS mutant (N311A), we were able to recreate the BnE RNA synthetase (BnERS) by simple site directed mutagenesis of this template. We confirmed that the enzyme selectively introduced BnE (Fig. 4a) and once again expressed myoglobin containing the unnatural amino acid. Using an adaptation of the published procedure we could convert the BnE containing protein to the γ-glutamyl hydrazide (Fig. 4b). This procedure required exposing the protein to 30% v/v hydrazine hydrate and heating the sample at 37 °C for 2.5 h. Removal of excess hydrazine was accomplished by snap-freezing and lyopillisation of the reaction mixture. The presence of a viable protein hydrazide 28 was confirmed by reducing the pH of the sample to approximately pH 4 and incubating with 1 M GlcNAc at 40 °C for 4 h. In a reaction analogous to that recently reported at Asp β-hydrazides15 we observed near quantitative conversion to novel neoglycoprotein 29 (Fig. 4c). We did not test the longer-term stability of this linkage and neither confirm the β-stereochemistry at the anomeric linkage nor that the cyclic nature of sugar had been preserved, although preceding studies suggest both features are retained.27
![]() | ||
| Fig. 4 (a) Selectivity of the γ-Benzyl glutamic acid pylRS mutant was confirmed by co-transformation with plasmid pREPpylT for resistance to 50 μg ml−1 chloramphenicol in the presence (left) and absence (right) of 2 mM BnE.26 (b) Purified benzyl ester is converted to the γ-hydrazide by exposure to 30% v/v hydrazine hydrate at 37 °C for 2 h. (c) The presence of the hydrazide was confirmed by GlcNAc bioconjugation. Reagents and conditions: 28 (18.5 μM), 50 mM Na phosphate buffer; pH 4, 1 M GlcNAc, 40 °C, 4 h >80%. (d) Ligation between 28 and 19. Reagents and conditions: 50 mM Na phosphate buffer; pH 4, 1 mM NaNO2, 0 °C, 20 min, then 19, 2.0 M guanidine.HCl 50 mM MPAA, 25 mM TCEP, pH 7, 25 °C, 4 h. | ||
Confident that the hydrazide 28 should serve as a suitable thioester precursor we investigated ligation with 19 using both oxidative28 and pyrazole forming29 methods for hydrazide activation. In preliminary investigations, and in contrast to experiments with model peptide 24, only oxidative activation of the hydrazide with NaNO2 proved successful. However, as with thioD containing protein 26, only partial conversion to the desired neoglycoconjugate was observed after 3 h. A significant quantity of material had also failed to activate and/or hydrolysed during hydrazide activation and ligation, which is a disadvantage of not utilising a preformed thioester. However the results suggested that the nature of the amino acid itself (Asp vs. Glu) was not the only factor affecting the extent of protein modification. Since the reactions between protein samples and 19 gave rise to a mixture of starting material and product, no attempt was made to remove the auxiliary.
Overall, we have demonstrated an important proof of concept, uniting sugar-linked auxiliaries, via an amide linkage, with recombinant protein sidechains. After cleavage of the auxiliary the glycoconjugate is extended by a single methylene unit when compared to a natural N-linked oligosaccharide. Unlike cysteine based “tag and modify” approaches32 the introduction of ThioD or the corresponding benzyl esters of aspartic and glutamic acids don't create the potential for disulfide scrambling and provide access to stable amide linked products. Owing to its more sustainable and less synthetically demanding components, as well as the potential to combine with developed enzymatic GlcNAc extension protocols,9,30 this approach may be developed into a useful glycoprotein and neoglycoprotein synthesis strategy.
:
EDT: water (95
:
2.5
:
2.5) for 4 h. The resin was filtered off and to the filtrate added to diethyl ether (12 mL) which induced peptide precipitation. The mixture was centrifuged at 4000 rpm, 4 °C for 20 min. The ether layer was decanted and fresh Et2O (12 mL) was added followed by further centrifugation. Ether was then decanted, and the peptide precipitate was dissolved in a minimum volume of water and purified by preparative reverse phase (RP) HPLC. Fractions containing product were identified by LC-MS and lyophilised to afford pure peptide.
This peptide (10 mg, 9.4 μmol) was dissolved in 0.1 M sodium phosphate buffer (pH 7.0) to a concentration of 1 mg mL−1. To this solution was added 10% w/v MESNa (1.0 g, 6.1 mmol), 10% w/v hydrazine acetate (1.0 g, 10.9 mmol) and 0.5% w/v TCEP (0.05 g, 0.2 mmol). The reaction was mixed on an Eppendorf thermomixer (600 rpm) at 50 °C for 30 h. The crude mixture was purified by RP-HPLC (tR = 26.0 min) and fractions containing product were lyophilised to afford a pure peptide hydrazide (8.0 mg, 87%) as a white solid. ESI-HRMS: m/z calculated for C41H63N13O15 [M + H]+ 978.4645, found [M + H]+ 978.4634.
Finally, the peptide C-terminal hydrazide (14 mg, 14.3 μmol) was dissolved in 7.0 mL of ligation buffer comprised of 0.1 M sodium phosphate buffer pH 3.0–4.0 and 6.0 M guanidine.HCl. The solution was cooled to −15 °C and NaNO2 (0.7 mL of 0.2 M stock solution dissolved in deionised water) was then added dropwise and the reaction was stirred at −15 °C for 20 min.28 The pH of the reaction was then adjusted to pH 7.0 by dropwise addition of 2 M NaOH. MESNa (2.3 g, 14 mmol) was dissolved in 6M guanidine.HCl (7.0 mL) and added to the reaction mixture followed by TCEP to final concentration of 30 mM. The reaction was stirred at room temperature for 2 h. The crude mixture was purified by RP-HPLC (tR = 27.5 min) and fractions containing 16 were lyophilised to afford pure peptide thioester (9.0 mg, 60%) as a white solid. ESI-HRMS: m/z calculated for C43H66N11O18S2 [M + H]+ 1088.4029, found [M + H]+ 1088.4015.
22 (1.8 mg, 1.25 μmol) was dissolved in 0.5 mL solution of 2 M morpholine in deionised water. TCEP (62.5 mg, 0.25 mmol) was added to the reaction and the pH was adjusted to pH 8.5 using 2 M NaOH. The mixture was placed in an Eppendorf thermomixer (750 rpm) and shaken at room temperature. After 5 h the reaction was complete and product isolated via preparative HPLC (tR = 31.0 min). Lyophilization resulted in a white solid of 23 (1.2 mg, 74%). ESI-MS: m/z calculated for C56H84N13O23 [M + H]+ 1306.58, found [M + H]+ 1307.0.
To form the hydrazide 50–60% v/v hydrazine hydrate (200 μmol) was added to (8 mg, 7.59 μmol) of peptide benzyl ester dissolved in 8 mL of water. The mixture was stirred at room temperature for 1 h then purified by RP-HPLC (tR = 26.0 min) and lyophilised to yield 24 (6.8 mg, 92%). ESI-HRMS: m/z calculated for C41H63N12O16[M + H]+ 979.4485, found [M + H] 979.4484.
000 rpm for 15 min. The protein was purified by Ni2+ affinity chromatography, eluting from the column in 50 mM Tris.HCl, 0.5 M NaCl, 250 mM imidazole; pH 8.0. Prior to analysis and ligation reactions, samples were buffer-exchanged into 50 mM sodium phosphate buffer (pH 8.0), and concentrated, using an Amicon Ultra-15 centrifugal filter unit (3000 MWCO) according to the manufacturers instructions. Yields were typically 0.9 mg L−1 after 6 h induction. After concentration to 1.0 mL, typical working protein concentrations of 18.5 μM were obtained. Expected mass for MbThioD (26) M = 18554.1 Da, found M = 18553.0 Da. Expected mass for Mb4BnE M+, 18487.2 Da, found M = 18497.5 Da. Note: The isolated protein containing BnE has a mass anomaly of +10 Da, presumably arising from a post translational modification, but all subsequent masses are correct relative to each other).
Mai
Hoang, M. D. Vu, K. Pasunooti, C.-F. Liu and X.-W. Liu, Angew. Chem., Int. Ed., 2016, 55, 10363–10367 CrossRef PubMed.Footnote |
| † Electronic supplementary information (ESI) available: Experimental procedures for synthesis of 7, 10, 11, 15 and 19. LC-MS data for characterisation of model peptides and spectra of selected compounds. See DOI: https://doi.org/10.1039/d2ob01633h |
| This journal is © The Royal Society of Chemistry 2022 |