Marta
Fernández-González
,
Omar
Boutureira
,
Gonçalo J. L.
Bernardes
,
Justin M.
Chalker
,
Matthew A.
Young
,
James C.
Errey
and
Benjamin G.
Davis
*
Department of Chemistry, University of Oxford, Chemistry Research Laboratory, 12 Mansfield Road, Oxford, OX1 3TA, United Kingdom. E-mail: Ben.Davis@chem.ox.ac.uk; Fax: +44 (0) 1865 275674; Tel: +44 (0) 1865 275652
First published on 20th October 2010
Combined chemical tagging followed by Endo-A catalysed elongation allows access to homogeneous, elaborated glycoproteins. A survey of different linkages and sugars demonstrated not only that unnatural linkages can be tolerated but they can provide insight into the scope of Endo-A transglycosylation activity. S-linked GlcNAc-glycoproteins are useful substrates for Endo-A extensions and display enhanced stability to hydrolysis at exposed sites. O-CH2-triazole-linked GlcNAc-glycoproteins derived from azidohomoalanine-tagged protein precursors were found to be optimal at sterically demanding sites.
Recombinant expression systems, chemoenzymatic and chemical methods have all emerged as powerful techniques that can resolve the supply issues of homogeneous glycoprotein production.7–12 One powerful approach is to remodel a glycoprotein enzymatically: the initial heterogeneous glycoform mixture is treated with endoglycosidase (“Endo”) to trim off the variable portions of the attached oligosaccharides, yielding a single protein glycoform with N-acetylglucosamine (GlcNAc) residue(s) at N-linked glycosylation site(s) (Route A). Subsequent enzyme-mediated glycosylation of the remaining single GlcNAc residues can then produce a homogenous sample of a desired glycoprotein (Scheme 1, Route A).10,13–16 Endo-β-N-acetylglucosaminidases hydrolyze the glycosidic bond in the N,N′-diacetylchitobiose core of N-linked glycans. Some of them, such as Endo-A17 and Endo-M18 also possess significant transglycosylation activity and have been used, for example, to remodel the N-glycoprotein RNaseB.19,20 Importantly, the efficiency of glycosylation has been valuably improved21–26 by the use of sugar oxazolines as donor substrates and mutant Endo enzymes. Very recently the re-engineering of bacterial glycosylation has been usefully combined with remodeling to extend this method.16 So far, these methodologies have been applied to a limited number of proteins with pre-existing, biologically-defined glycosylation sites, which in turn requires the successful incorporation and recognition of glycosylation peptide consensus motif (NxS/T). In some cases this motif may not be recognised, leading to failed or incomplete glycan incorporation and lowered yields.16
![]() | ||
Scheme 1 Existing remodeling (A) approach compared with chemoenzymatic (B) strategy, disclosed here, for endoglycosidase-catalysed protein glycosylation. |
The power of the Endo-catalysed approach would therefore be enhanced by combination with methods that would allow more free-ranging glycosylation site control and the ability to incorporate unnatural or altered motifs. In this way, a system can be envisaged that would allow Endo-catalysed glycosylation on proteins in which (a) the first GlcNAc (or other) residue that is required had been positioned with complete freedom and (b) the use of other glycans. This would also allow a broad exploration of Endo substrate tolerance in glycoprotein construction and the possible discovery of more efficient substrates and more stable products.
We describe here a method (Scheme 1, Route B) that achieves this using chemical positioning of first sugars. This combined chemoenzymatic approach consists of site-selective convergent chemical glycosylation of a protein scaffold with a single GlcNAc (or other sugars) followed by Endo-catalysed glycosylation and allows greater flexibility.27 The strategy brings with it the potential to use sugars other than GlcNAc as glycosylation acceptors, as well as linkages beyond the δ-amide of Asn. Such ‘unnaturally’ linked acceptors for Endo-catalysed glycosylation have been explored in short peptides28,29 and may bring with them the potential for enhanced yield as a result of reduced product hydrolysis by Endo. We not only use this method here to map the tolerance and efficiency of Endo but use it to discover unnatural glycoprotein linkages that are more stable (and hence more efficiently formed) than the natural.
Entry | Acceptor | 1/μmol | Endo-A [mU/g] | Conda | t mc min | 2–11b Conv [%] |
---|---|---|---|---|---|---|
a Conditions: 0.2 μmol acceptor, NaPi buffer (50 mM) and A: pH 6.0, c 0.2 mM, 27 °C; B: pH 7.0, c 0.2 mM, 27 °C; C: pH 7.0, c 2 mM, 21 °C; D: pH 7.0, c 0.2 mM, 21 °C. b t mc: time of maximal conversion. | ||||||
1 |
![]() |
4 × 0.6 | 120 | A | 29 | 25 |
2 |
![]() |
4 × 0.6 | 70 | A | 48 | 78 |
3 |
![]() |
0.6 + 0.3 | 20 | A | 26 | 40 |
4 | 5 × 0.6 | 110 | A | 48 | 83 | |
5 | 5 × 0.6 | 110 | B | 48 | 75 | |
6 | 0.6 | 30 | C | 1 | 74 | |
7 |
![]() |
0.6 | 30 | D | 48 | 7 |
8 | 0.6 | 30 | C | 4 | 50 | |
9 |
![]() |
1.9 | 60 | D | 120 | 28 |
10 | 0.6 | 30 | C | 3 | 34 | |
11 |
![]() |
0.6 | 30 | C | 5 | 33 |
12 |
![]() |
0.6 | 30 | C | 24 | 0 |
13 |
![]() |
0.6 | 30 | C | 24 | 0 |
14 |
![]() |
2.0 | 30 | C | 20 | 89 |
15 |
![]() |
0.6 | 30 | C | 0.5 | 75 |
16 | 0.6 | 30 | E | 0.5 | 98 |
Having shown the potential of unnatural linkages and sugars, the convergent, chemoenzymatic strategy outlined in Scheme 1, Route B was demonstrated on peptide substrates. Fragments of cancer antigen protein gp9036 bearing alkyne or azide tags, and fragments of subtilisin Bacillus lentus (SBL) bearing a Cys-tag were assembled using Fmoc chemistry (Scheme 2). GlcNAc moiety was then incorporated convergently using Cu(I)-catalysed [3 + 2] cycloaddition to give 12, 13a,37,38 and in the case of Cys-peptide 14a, by conversion of Cys to Dha using O-mesitylenesulfonylhydroxylamine (MSH)32 followed by conjugate addition of GlcNAc-SH. Subsequent Endo-A glycosylation gave the corresponding elaborated glycopeptides 12–14b. Consistent with the prior observations on amino acid models, S- and triazole also proved to be efficient substrate linkages in peptides: >98% conversions of 13a and 14a were observed after 2 and 7 h, respectively. Lower conversion (70%) was observed in the case of triazole-glycopeptide 12a (from alkynyl tag Hpg).
![]() | ||
Scheme 2 Endo-A catalysed transglycosylation of glycopeptides 12–14; see ESI† for full details. |
Entrya | Acceptor | [15–24a]/mg mL−1 | t/h | Max. conv [%] |
---|---|---|---|---|
a See ESI† for full details. | ||||
1 |
![]() |
1.0 | 6 | 15b [>95] |
2 |
![]() |
0.29 | 4 | 16b [35] |
3 |
![]() |
0.29 | 6 | 17b [0] |
4 |
![]() |
0.4 | 24 | 18b [10] |
5 |
![]() |
0.4 | 24 | 19b [25] |
6 |
![]() |
0.29 | 2 | 20b [80] |
1.0 | 2 | 20b [89] | ||
7 |
![]() |
0.29 | 2 | 21b [26] |
1.0 | 2 | 21b [28] | ||
8 |
![]() |
0.5 | 2 | 22b [<10] |
9 |
![]() |
0.29 | 2 | 23b [63] |
0.43 | 2 | 23b [55] | ||
1.0 | 2 | 23b [54] | ||
10 |
![]() |
1.0 | 6 | 24b [70] |
The potential to incorporate fluorinated glycans into proteins gives an unnatural motif that not only might modulate Endo-associated kinetics but is a possible strategy for labeling proteins.41 In this context disulfide-forming glyco-PTS42 reagents have been used to introduce PET-reagent FDG (2-fluoro-2-deoxy-Glc) into short peptides.43 We were pleased to see that the Endo-catalysed glycosylation was successful for both 19FDG as an acceptor and its C-2 epimer the 2-F-Man variant (Table 2, entries 4, 5, 8 and 9). Surprisingly, Man-2F-proteins 19 and 23a were better acceptor substrates than corresponding Glc-2F-proteins 18 and 22a (yields up to 63%) and even better in some cases than corresponding Glc-proteins e.g., 21a and 23a (Table 2, entries 7 and 9). This further highlighted the striking promiscuity of Endo discovered with this approach. However, it should be noted that the conversion times established here for Endo-catalysed reactions make this method more compatible with NMR-labeling (19F) than for example radio/PET (18F), given the short half life of 18F.41
To probe the effects of linker upon transglycosylation activity and kinetics, three synthetic glycoproteins bearing different linkages (S-, SS- and SeS-) but identical acceptor sugar GlcNAc were compared (Fig. 1). These gave a clear insight into the higher transglycosylation efficiencies observed for S-linked glycoproteins. Thus, SeS-glycoproteins were not only faster acceptors for Endo-A glycosylation than S- and SS-substrates (Fig. 1) but also for the subsequent hydrolysis of product. Such undesirable hydrolytic behaviour has hampered other Endo-catalysed approaches.14,15 In contrast, S-glycoprotein product 15b was stable and formed with full conversion after 6 h; the corresponding SeS-glycoprotein 20b was hydrolyzed by 60% back to 20a in the same period (Fig. 1).
![]() | ||
Fig. 1 Comparison of linkage type: time course of Endo-A catalysed glycosylation with S-, SeS-, and SS-linked SBL-GlcNAc. |
Finally, we set out to compare the S- and triazole linker effects directly. Accordingly, three glycoproteins were constructed that differ only by the linkage bearing the GlcNAc acceptor (25–28a). A model protein, Np276 from Nostoc punctiforme,44 was used to make this comparison. Aha, Hpg, and Cys were incorporated into position 61 as ‘tagged’ proteins or precursors of tagged proteins. This required the construction of appropriate mutant gene sequences to allow site-selective tag positioning (Table 3). GlcNAc was subsequently attached to these tags as follows. Triazole-synthesis chemistry from Aha (azide tag) and Hpg (alkyne tag) gave 25a and 26a, respectively. Conversion of Cys61 to dehydroalanine as a tag and conjugate addition of GlcNAc thiol provided 27a. Position 61 in Np276 was more hindered than 156 in SBL and this was reflected in the longer reaction times required for complete conjugation (see ESI†).
These three differently-linked GlcNAc-ylated protein variants allowed direct comparison at a single site in one protein of triazole- and thioether-linked acceptors. We considered such a hindered site an alternative testing ground for the relative efficiency of each linker in Endo-A catalysed transglycosylations. Interestingly, while the thioether-linked GlcNAc on SBL derivative 15a was an efficient acceptor for the Endo-A catalysed glycosylation, the same thioether-linked sugar at position 61 of Np276 was a poor acceptor and provided no detectable product after 6 h. In contrast, substrate 25a, the O-triazole-linked GlcNAc variant derived from Aha at the same position, did show activity and 40% conversion to the glycosylated product was observed even at this hindered site. No product was observed with the Hpg-derived N-triazole-linked GlcNAc variant 26a. From these results, we can draw the following conclusions. Firstly, it appears that for hindered sites such as position 61 in Np276, the Aha-derived triazole, containing a hydroxymethyl spacer, is superior to both the thioether-linked GlcNAc and the Hpg-derived triazole variants. One factor to account for this difference is simply the relative accessibility of each linker: the Aha derived triazoles are more exposed from the protein surface than the thioether derived from Dha. The triazole motif in 25a is also more spaced from the GlcNAc residue as compared with 26a; this difference is apparently sufficient to lead to 40% conversion for 25a and yet no reaction for 26a. The position on the protein surface is also critical. In our next experiment (entry 4), by moving the Aha azide tag to a more accessible position 101, both the initial installation of GlcNAc and the subsequent Endo-A catalysed transglycosylation were more efficient. Indeed, 28a was then completely consumed in an Endo-catalyzed reaction with 1 after only 4 h.
Footnote |
† Electronic supplementary information (ESI) available: Detailed experimental procedures, including 1H and 13C NMR spectra for all new compounds and ESI-MS for all protein samples. See DOI: 10.1039/c0sc00265h |
This journal is © The Royal Society of Chemistry 2010 |