Ward
Doelman
and
Sander I.
van Kasteren
*
Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands. E-mail: s.i.van.kasteren@chem.leidenuniv.nl
First published on 29th July 2022
Protein glycosylation is a key post-translational modification important to many facets of biology. Glycosylation can have critical effects on protein conformation, uptake and intracellular routing. In immunology, glycosylation of antigens has been shown to play a role in self/non-self distinction and the effective uptake of antigens. Improperly glycosylated proteins and peptide fragments, for instance those produced by cancerous cells, are also prime candidates for vaccine design. To study these processes, access to peptides bearing well-defined glycans is of critical importance. In this review, the key approaches towards synthetic, well-defined glycopeptides, are described, with a focus on peptides useful for and used in immunological studies. Special attention is given to the glycoconjugation approaches that have been developed in recent years, as these enable rapid synthesis of various (unnatural) glycopeptides, enabling powerful carbohydrate structure/activity studies. These techniques, combined with more traditional total synthesis and chemoenzymatic methods for the production of glycopeptides, should help unravel some of the complexities of glycobiology in the near future.
This is particularly poignant in immunology, where protein glycosylation plays a major role in many key processes, like regulation of immunotolerance,3,4 uptake of antigens,5,6 the creation of neo-epitopes,7,8 and even the protection of the foetus.9 The precise study of these processes has been hampered by a dearth of well-defined reagents. In some cases, well-defined glycans, in the absence of any protein scaffold, can be used to study these functions. For instance, the carbohydrate binding specificity of many lectins has been determined using synthetic oligosaccharides in the absence of a protein scaffold.10,11 However, for many processes, the combination of carbohydrate and protein is key. For example, antigen (cross-)presentation is thought to be enhanced by uptake of the antigen via lectins12,13 and therefore requires a covalent attachment of the protein and the (specific) oligosaccharide.
Entire glycoproteins, bearing a well-defied glycan, are formidable synthetic targets. Their use is therefore often not feasible. Synthetic, glycosylated peptides can offer a useful alternative; a compromise between reality and synthetic accessibility for biochemical studies. Additionally, synthetic glycosylated peptides in themselves are being actively developed as vaccine candidates.14 These have been shown to induce glycosylation-specific immune responses.15,16 Examples of antigens that have been tried as targets for glycopeptide vaccines are the envelope glycoprotein of human immunodeficiency virus (HIV),17 as well as certain tumor associated antigens.18
From a synthetic perspective, the synthesis of glycosylated peptides is still a challenge in itself, as it involves the combination of two types of bio-organic chemistry: peptide synthesis and carbohydrate chemistry. While these two fields are not inherently incompatible, there are challenges to overcome for successful glycopeptide synthesis. This review will focus on the synthesis of glycopeptides and glycopeptide conjugates useful for immunological studies. To give a good overview of the field both early and recent developments will be highlighted.
The simplest form of O-glycosylation is the introduction of β-GlcNAc onto serine or threonine (Fig. 1, 1). This is a regulatory modification that is, unlike the more complex forms of O-glycosylation, reversable.21 Despite this, it seems to be an immunologically relevant peptide modification, as O-GlcNAcylated peptides have been shown to be presented by major histocompatibility complexes (MHC) in vivo.22 The non-dynamic types of O-glycosylation start with the introduction of an α-linked N-acetyl galactosamine (GalNAc) residue (2), also called the Tn-antigen, onto serine or threonine. This single carbohydrate can then be enzymatically elaborated towards more complicated structures. For instance, the introduction of galactose onto the 3-OH creates the T-antigen (3), while 2,6-sialylation creates the STn-antigen (4). More elaborate structures are produced by enzymatic extension of different core structures. The four O-glycan cores most important in humans are also shown in Fig. 1 (3, 5–7). The biosynthesis and biochemical functions of these more complex O-glycosylation patterns are, as of yet, poorly understood.23 It cannot, for example, be predicted from the structure where on the protein this modification will be introduced, nor is the glycosylation machinery well defined. Besides these common modifications, glycosylation of serine and threonine with mannose, fucose, galactose and xylose have also been shown to exist within the human glycoproteome.1
One of the most well-studied examples of protein O-glycosylation is the mucin family of glycoproteins. The members of this family are produced primarily by epithelial cells and are either cell-membrane bound or secreted into the extracellular matrix. While elaborate O-glycosylation is suspected to exist on many proteins, the total amount and structural diversity found within this protein family is striking. Aberrant glycosylation of mucins, resulting in the proteins being decorated with the more simplistic forms of O-glycosylation (2–4), is found in several types of cancer.24 Targeting of these hypoglycosylated proteins by the immune system is a very active area of research for the development of onco-immunology based cancer treatments.25
N-Glycosylation is biochemically distinct from O-glycosylation. Unlike O-glycosylation, the location of N-glycosylation can be easily predicted: introduction of N-glycans occurs at asparagine residues that are part of the consensus sequence Asn-Xxx-Ser/Thr, with Xxx representing any amino acid except proline.1 Furthermore, the structure of the glycan is very different from those seen for O-glycans. Every N-glycan has the same core structure, that is Manα1-6(Manα1-3)Manβ1-4GlcNAcβ1-4GlcNac (8), containing a rare β-mannoside linkage (Fig. 2).26 Generally, this core structure is further elaborated with additional glycosylation. One commonly seen N-glycan structure is the so-called “high mannose”, or Man9, N-glycan (9), a modification that is introduced early in the biosynthesis of N-glycosylated proteins.26 Remodeling and elaboration of these immature N-glycans by the cellular glycosylation machinery leads to more elaborate N-glycans, the so-called “complex” type, such as a so-called biantennary and sialylated structure 10.
1. Total chemical synthesis, where the entire molecule is produced using organic transformations;
2. Chemoenzymatic synthesis, where parts of the synthesis are accomplished using enzymatic transformations;
3. Conjugation based approaches, where the covalent linkage of peptide and oligosaccharide is achieved using one of various chemical approaches. These approaches usually yield a glycoconjugate containing one or more unnatural linkages.
While the first two approaches produce glycopeptides identical to naturally glycosylated peptides, this last approach will produce neoglycopeptides that, while not identical to native molecules, can be used to study the biochemical functions of glycosylation in some instances.14,27–32
For some applications, like the development of vaccines against glycosylated epitopes, having the entire, natively glycosylated amino acid in the peptide sequence is of crucial importance. This is necessary to produce antibodies and T-cell responses against a specific antigen in vivo without producing antibodies against unnatural linker-structures.33 However, the majority of lectin interactions only require certain oligosaccharide substructures derived from larger O- or N-glycans.10,29,30,34 For these areas of research, where the interactions under study only depend on a substructure of the glycan, simplified glycopeptides or glycoconjugates can be used. These can mimic some of the effects of natural glycosylation and can be used in place of natively glycosylated peptides. Specific areas of (immunological) interest these conjugates have seen use are the study of lectin mediated peptide uptake,13,28 modulation of cytokine production29,35 or targeting of specific cell-types.27,36 The increased synthetic accessibility of these glycoconjugates enables the synthesis and evaluation of more variants, in turn enabling more in-depth studies of glycan function. Therefore, conjugation methods to synthesize non-natively glycosylated peptides in a facile (compared to synthesis of native structures) manner will be described in more detail.
![]() | ||
Fig. 3 (A) TFA mediated deglycosylation observed by Unverzagt and Kunz.37 During TFA mediated deprotection of a peptide containing a trisaccahride (11) cleavage of the α-fucoside bond was observed yielding partially deprotected glycopeptide 12. A peptide containing an acetyl protected trisaccharide (13), when treated under similar conditions, did fully retain the fucose residue. (B) Base mediated β-elimination of an O-glycosylated threonine residue as observed by Zhang et al.40 |
The use of particular protecting groups on the sugar part of the glycopeptide can help alleviate these problems, but require careful consideration during synthesis of the glycosylated amino acid building blocks. O-Acetyls are most often employed, as they improve the acid stability of the molecule, while enabling removal under mild enough conditions to avoid the aforementioned base-mediated side reactions. Treatment with dilute solutions of NaOMe,42 NaOH43 or hydrazine44 can often be used to selectively liberate the protected glycoside without affecting the peptide or glycosidic bonds.
These days, peptides containing (simple) O-glycosylation patterns are synthesized as a matter of course, with many useful building blocks readily available from commercial sources. This, of course, is made possible by decades of development, that are briefly highlighted here. The initial development of synthetic methods to produce O-glycosylated peptides started in the 1980s.45–47 Kunz introduced the building block approach described above, synthesizing the first of Fmoc-protected O-glycosylated amino acids, using esters to protect the sugar hydroxyl groups (Fig. 4A). The common approach to introduce the α-glycosidic bond between the amino acid and the galactosamine, that is glycosylation with a 2-azido donor followed by reduction and acetylation, was used. These amino acids were used to produce several tripeptides decorated with both Tn- and T-antigen structures.47 Paulsen then applied a similar building block to Fmoc-SPPS, producing an octapeptide bearing five threonine residues modified with α-GalNAc,42 opening up the way towards more complicated glycopeptides.
![]() | ||
Fig. 4 (A) First synthesis of the Tn- (20) and T-antigen (21) as Fmoc-protected amino acids, as described by Kunz.47 (B) Synthesis of a Sialo-Tn antigen building block (26) and use of this building block in Fmoc-SPPS, as described by Kihlberg and coworkers.49 Reagents and conditions: (a) Fmoc-Ser-OBn or Fmoc-Thr-OBn, Ag+, 30–65% (b) i) NaBH4, NiCl2 (ii) Ac2O, pyridine, 60–86% (c) NBS, Bu4NOTf, DCM, −28 °C, 71% (α/β mixture) (d) (i) AcSH, pyridine, 67% (ii) AcOH, MeOH, THF, 60 °C, 86% (iii) DMP, TsOH, 85% (e) MeSBr, AgOTf, DCM, MeCN, −78 °C, 49% (f) H2, Pd/C, EtOAc, 88% (g) (i) Fmoc-SPPS (ii) TFA, PhSMe, HSCH2CH2SH (iii) NaOMe, MeOH (iv) NaOH, H2O, (v) RP-HPLC, 12%. |
The synthesis of glycopeptides bearing sialic acid containing O-glycans, a key modification highly abundant on certain tumors,48 would take another decade.43,49 As an example, the synthesis of a sialyl Tn-antigen building block (26) and incorporation into a peptide fragment from the HIV glycopeptide gp12050 (27) by Kihlberg is given in Fig. 4B.49 This synthesis, like the one outlined above, used an azide function on the galactose C-2 to mediate formation of the α-glycosidic bond, followed by transformation to the desired acetamide function. This synthesis also exemplifies an early use of an acid labile protecting group, in the form of the isopropylidene, on a glycosylated amino acid building block. Since the sialic acid containing amino acid is introduced as the methyl ester, final deprotection has to be carried out using a hydroxide solution, in order to obtain the free acid.
With the synthesis of the glycopeptides via Fmoc-SPPS reaching greater maturity, the complexity, as well as the diversity, of the oligosaccharide structures has increased.51–55 For instance, the group of Westerlind developed a microarray containing a wide variety of mucin-protein derived glycopeptides, enabling the profiling of the antibody response of mice vaccinated with different MUC-glycopeptide based vaccine candidates.56
Besides mucin-derived peptides, other O-glycosylated peptides have been the object of study. As an example, glycopeptide ligands for the P- and E-selectin receptors on white blood cells are an important class of O-glycosylated peptides.57 P-selectin binds to the sialyl-LewisX (sLeX) tetrasaccharide (Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ), a synthetically challenging target on its own. Total synthesis of a 12-mer peptide, derived from the protein P-selectin glycoprotein ligand-1 (PSGL-1), containing a threonine residue glycosylated with a hexasaccharide containing the sLeX structure was achieved by Baumann et al.58
Recently, renewed interest in the development of O-glycosylated amino acid building blocks has been seen. Several novel acid-labile protecting group strategies have been developed, enabling a one-step deprotection of the resin-bound glycopeptide. These approaches use electron-rich benzyl ethers59,60 or silyl ethers.49,61,62 While these are promising developments simplifying the synthesis of O-glycosylated peptides, they have only been applied to peptides bearing mono- or disaccharides, and have yet to be proven in the synthesis of glycopeptides bearing larger protected glycans.
Early work on N-glycosylated peptides used a similar building block approach as was used for the synthesis of O-glycosylated peptides. The group of Ogawa managed the first synthesis of a tripeptide carrying the core-pentasaccharide.63 This synthesis, outlined in Fig. 5, starts with the complete synthesis of the pentasaccharide azide 37. In this synthesis, no particular care was taken to selectively produce the β-mannoside linkage in compound 30, preferring instead to separate the two diastereomers. After allyl ether deprotection (34) and introduction of the final two mannose moieties (36), the phthalimide functions were converted to the acetamides (37). To introduce the (orthogonally protected) asparagine residue, the anomeric azide was selectively reduced by hydrogenation over Lindlar catalyst to obtain the glycosyl amine. This intermediate was then condensed with anhydride 38 in the same pot, to limit hydrolysis of the sensitive anomeric amine. TFA mediated deprotection of the tert-butyl ester produced Fmoc-amino acid 40, usable for Fmoc-SPPS of glycopeptides. In this case, however, solution phase peptide synthesis was employed, with EDC mediated coupling to dipeptide 41 yielding the protected tripeptide 42. Fmoc-deprotection followed by hydrogenolysis yielded the desired deprotected glycopeptide 43.
Further development of this building block approach by Unverzagt produced a pentapeptide decorated with a heptasaccharide N-glycan, again by in solution peptide synthesis.64,65 This glycosylated amino acid, bearing no protective groups on the sugar hydroxy functions, was later also used in Fmoc-SPPS, where the choice of solid support turned out to be critical to mediate effective introduction of this extremely polar amino acid.66 When using glycosylated amino acids without hydroxyl-group protection, side reactions, in the form of unwanted acylation, limit the maximum length of the peptide that can be synthesized.67 However, when the glycoside was protected with acetyl groups, via on-resin acetylation, another set of unspecified side-reactions still prevented the synthesis of N-glycosylated peptides longer than 20 amino acids.68
To avoid these problems, the total synthesis of N-glycosylated peptides is usually carried out via an convergent approach. This is most often accomplished using the so-called Lansbury aspartylation (Fig. 6).69,70 In this reaction, a glycosylamine derivative of the oligosaccharide is condensed with a partially protected peptide, containing an activated aspartic acid residue. This approach is not free of side reactions, with aspartimide formation and the associated formation of iso-aspartate and loss of stereochemistry of the aspartic acid residue being key problems.71
Still, in this manner the total synthesis of highly complex N-glycosylated peptides was achieved, with the chemical synthesis of homogeneously glycosylated Erythropoietin by Danishefsky and coworkers the most famous example.72 Also, a glycopeptide derived from the HIV envelope protein, a potential vaccine candidate, was produced by total synthesis. This peptide was used to isolate a so-called broadly neutralizing antibody (BnAb) from the plasma of a HIV patient, as well as induce production of antibodies in a Rhesus Macaque model.73 However, to further avoid the side reactions associated with chemical introduction of N-glycans or N-glycosylated amino acids, as well as the laborious work of producing said glycan by synthesis, many examples of N-glycopeptide synthesis have instead been carried out using chemoenzymatic and semisynthetic means.
![]() | ||
Fig. 7 (A) general overview of ENGase mediated glycosylation of a peptide containing an N-GlcNAc asparagine residue with an oxazoline donor. (B) ENGase mediated synthesis of several N-glycosylated derivatives of the cytomegalovirus derived peptide pp65491–509 by McIntosh et al.79 Both synthetic oxazoline donor 45 as well as semisynthetic oxazoline donor 46, produced from an N-glycopeptide obtained from soy flour, were enzymatically introduced onto glycopeptide 44, producing vaccine candidates 47 and 48. |
ENGase mediated synthesis has been used to synthesize a peptide derived from the HIV glycoprotein gp120, containing two core pentasaccharides,80 and a vaccine candidate glycopeptide against cytomegalovirus (CMV), decorated with either a core-pentasaccharide (47, Fig. 7B) or a full high mannose-type glycan (48). As the peptide sequence under investigation contains two distinct N-glycosylation sites, the effect of glycosylating one or both sites was evaluated. The found that while glycosylation of Asn507 had negligible inhibitory effect on T-cell activation, introduction of the N-glycans onto both Asn495 and Asn507 completely inhibited T-cell activation.79
ENGase mediated glycopeptide synthesis has also used to synthesize lectin binding peptides; Yamaguchi et al. produced, by total chemical synthesis, an oxazoline donor of a bis-phosphorylated high-mannose type N-glycan (50), and used this to produce a cyclic peptide containing two phosphorylated N-glycans (51, Fig. 8). This molecule had a nanomolar affinity for the cation-independent mannose-6-phophate receptor (CI-MPR). The cyclic nature of the peptide was found to be critical, as linear glycopeptide 52 had a 20-fold lower affinity for the CI-MPR.81 Besides the synthesis of N-glycosylated peptides, ENGase catalyzed glycosylation has also been extensively applied to the production of single glycoform N-glycosylated proteins, including remodeling of the N-glycans on the Fc-domain of antibodies.82–85
![]() | ||
Fig. 8 ENGase mediated synthesis of a CI-MPR binding cyclic phosphorylated glycopeptide 51 by Yamaguchi et al.81 Reagents and conditions: (a) Endo A N171A, Tris buffer (pH 7.1) (b) (i) DTT (ii) Iodoacetamide. |
Another area where biochemical methods have improved the study of N-glycosylated peptides is in the production of the N-glycosylated asparagine building blocks and oxazoline donors required for the above syntheses. Methods have been developed to isolate N-glycopeptide carrying specific N-glycans from biological sources. This was first demonstrated by the isolation of sialoglycopeptide (SGP) from egg yolk, a hexapeptide bearing a biantennary sialoglycan.86 This peptide was used directly with Endo M (one of the ENGases) to produce a glycosylated form of the CD52 peptide,87 as well as being chemically transformed into an oxazoline donor for other ENGase mediated reactions.88,89 Furthermore, after peptidase mediated liberation of the glycosylated asparagine followed by selective introduction of an Fmoc-group and protection of the sialic acid residues as benzyl esters67 or phenacyl (Pac) esters,90 this semisynthetic glycosylated amino acid has been used in SPPS for the facile production of N-glycosylated peptides.91,92 These protecting groups inhibited the acid mediated hydrolysis of the sensitive glycosidic bond of sialic acid, enabling both Fmoc- and Boc-SPPS approaches. A high mannose type N-glycan (Man9GlcNAc2-Asn) can, in a similar manner, be isolated from soybean flour93,94 or egg yolks,95 and has also been used in ENGase mediated glycotransfer reactions79,96 as well as SPPS.97
Another enzymatic approach that has furthered the synthesis of N-glycopeptides is enzyme-based glycan remodeling. This is the enzymatically removal and/or addition of specific glycosides to an N-glycan. Enzymes have most commonly been employed to install sialic acids onto peptides bearing synthetic complex-type N-glycans,65 or for introduction of the so-called core-fucose moiety (α-fucosylation of the 6-OH of the reducing-end GlcNAc in the core pentasaccharide).98 More extensive remodeling of N-glycosylated asparagine has also been reported. The group of Kajihara used partial acidic hydrolysis of glycosylated asparagine obtained from egg-yolk SGP, followed by further enzymatic hydrolysis to produce various N-glycosylated asparagine residues. These were subsequently used in peptide synthesis.99 More recently, the group of Boons showed the production of various N-glycosylated asparagine structures by enzymatic remodeling of the biantennary N-glycan isolated from egg yolk.100,101 This method has, however, not yet been employed for the synthesis of N-glycosylated peptides.
Chemo-enzymatic approaches have also been applied to the synthesis of O-glycans, by enzymatically elaborating synthetic O-glycosylated peptides. These approaches start by the chemical synthesis of a peptide containing an α-GalNAc-Ser/Thr as a starting point for enzymatic introduction of additional saccharides. Sialic acids have been introduced in a stereoselective manner using this approach,102 a feat that can be challenging using organic synthesis techniques.103 More complex oligosaccharides have also been made chemoenzymatically; for instance, a glycosulfopeptide derived from PGSL-1 has been produced by enzymatic elaboration of an α-GalNac containing synthetic peptide.104 Producing O-glycosylated peptides from non-glycosylated starting material has also been accomplished, as shown by the group of Clausen, who synthesized various glycoforms of a 60-mer MUC1 repeat peptide.105 Two of the peptides produced in this manner were then used to immunize mice, where production of specific antibodies was observed. Using the various differently glycosylated MUC1 peptides produced by the authors, they were able to precisely determine the specificity of these antibodies.105,106
The simplified N-glycan mimics, as the most natural-like of the simplified glycans, still retain some key properties of the glycans they are mimicking, i.e. the presence of a glycosidic bond with asparagine. Peptides produced via the simplified N-glycan approach are capable of mimicking certain functional aspects of N-glycans, while circumventing production of the entire, synthetically complex natural N-glycan. The group of Kunz has, for example, developed glycopeptide ligands for the lectin E-selectin, containing asparagine linked Sialyl-LeX determinants (53, Fig. 9A).108,109 Simplified N-glycans, bearing other, non-sialylated and immunologically important Lewis-type sugars, like LeX29,110 and LeA
111 have also been described. These kinds of simplified N-glycans can be used for functional studies. For instance, LeX is known to bind to the lectin Dendritic Cell-Specific Intercellular adhesion molecule-3-Grabbing Non-integrin (DC-SIGN), inducing a tolerogenic response from the targeted dendritic cell.112 We produced a peptide, derived from the protein myelin oligodendrocyte glycoprotein (MOG), decorated with a simplified LeXN-glycan on the native N-glycosylation site (54).29 This molecule was able to bind to DC-SIGN and elicit a tolerogenic response in vitro.
![]() | ||
Fig. 9 Examples of simplified N-glycopeptides. (A) Synthetic N-glycopeptide bearing simplified N-glycans produced by Filser et al.108 (53) and us (54).29 By including only the part of the N-glycan required for lectin binding, the synthetic complexity of glycopeptides can be limited without compromising on functional activity. (B) Structure of the native (55) and stabilized (56) variants of a neoglycoepitope derived from collagen type II synthesized by Gustafsson et al.113 Peptide 56 was still able to induce a T-cell response, despite the non-native glycosidic linkage. |
Stabilized building blocks were developed not to simplify chemical synthesis, but to increase (bio)chemical stability of the glycopeptide. These approaches are occasionally used to suppress the chemical side reactions described earlier. More commonly, stabilized building blocks are applied in glycopeptide synthesis to prevent the enzymatic hydrolysis of natural linkages in biological systems.21,114 These actions of cellular enzymes can be a complication when studying the effect of peptide glycosylation in biological systems. To suppress enzymatic degradation, stabilized glycosylated amino acid building blocks have been developed, based on S- or C-glycosidic linkages. Cys(β-GlcNAc), as mentioned before, is a good functional mimic of Ser(β-GlcNAc), that is resistant to hydrolysis by the glycosidase O-GlcNAcase.107 An S-glycoside analogue of a Tn-antigen containing MUC1-peptide has also been developed and tested as a vaccine candidate. The antibodies obtained from mice immunized with this peptide were able to recognize a human tumor cell line, indicating that the unnatural linkage does not necessarily impede the ability of a synthetic glycopeptide to induce an immune response to the native antigen.115 This result further exemplifies the ability of S-glycopeptides to mimic the functions of O-glycopeptides. C-Glycosides have been similarly used in immunological studies. For instance, the group of Kihlberg produced a glycopeptide epitope, derived from type II collagen containing a galactosylated δ-hydroxylysine (55, Fig. 9B), as well as a C-glycoside mimic of this moiety (56).113 This peptide may play a key role in the pathogenesis of rheumatoid arthritis (RA).116,117 The C-glycoside analogue 56 was able to activated T-cell hybridomas, although at a lower level compared to the natural peptide. Stabilized glycopeptides can also bind lectins; for instance, C-mannoside decorated peptides have been successfully used as DC-SIGN ligands.118
Finally, sometimes glycosylated building blocks bearing very little resemblance to the natively glycosylated amino acids are used. Here, a glycoside is coupled, usually via the anomeric lactol function, to a linker bearing a carboxylic acid. These can be coupled on-resin or in solution to the N-terminus of the peptide or to the sidechain amine of lysine residues. These linkers can take the form of a simple hydroxylated carboxylic acid, as used by Kantchev et al.119 These molecules were used to decorate vaccine peptides with glycans and study the effect of lectin mediated uptake on antigen (cross-)presentation. An alternative glycoside derivatization strategy was developed by the group of Roche (Fig. 10). Here, the free reducing-end lactol of a glycoside (57) is reacted with the amine group of glutamic acid (58), forming the hemiaminal. Cyclization with the sidechain carboxylic acid formed the pyroglutamate derivative, stabilizing the N-glycosidic bond (60).120 After hydrogenolysis of the benzyl ester protecting the carboxylic ester (61) the glycans were coupled in solution phase to various oligolysine peptides (producing, for example, 62). Various fluorescently labeled glycoclusters were build using this method. These were then used to study the binding specificity of the mannose receptor.28 The same group later further expanded on this approach, using it to target antigenic peptides to DC-SIGN in order to improve antigen cross-presentation. Here, they showed a glycan-dependent effect on T-cell activation.121
![]() | ||
Fig. 10 Example synthesis of the fluorophore labeled oligomannose clusters synthesized by Frison et al.28 Reagents and conditions: (a) imidazole, NMP, 50 °C (b) BOP, imidazole (c) H2, Pd/C (d) Lys-Lys-Lys-Lys-Ala-Cys(S-Pyc)-NH2, HBTU, HOBt, DiPEA, NMP (e) (i) TCEP, NMP, NaPi pH = 7.2 (ii) Fluorescien-iodoacetamide. |
All of these strategies still rely on introduction of glycosyl building blocks into the peptide on the solid support or in solution. This often requires large amounts of the glyco-building block under investigation and are often limited by low peptide yields. In order to limit the need for large amounts of precious oligosaccharide, conjugation methods were developed, where the carbohydrate part of the molecule can be conjugated to the peptide after SPPS and HPLC purification. This requires far less of the precious oligosaccharide, enabling the use of more complex sugars.
Another class of thiol selective chemistry that can be applied to cysteine is the photoinduced thiole–ene reaction,127 which has been used to produce glycoconjugate 76 using allyl glycoside 75.128,129 Alternatively, the reactivity of thiols can also be used in the opposite direction. The thiol–ene reaction can also be performed by reacting a thiol glycoside (78) to an alkene containing peptide (76), producing neoglycopeptide 79.130 The nucleophilicity of thioglycosides was also exploited to synthesize peptides containing S-glycosylated cysteine residues in an unconventional manner. The nucleophilic opening of an aziridine amino acid (80) on solid support was used to produce an S-glycoside mimic of the Tn-antigen (82).131
However, the above-described approaches all rely on thiolate functions to mediate chemoselective ligation, making these approaches incompatible with peptides containing natural cysteine residues. Earlier attempts at making glycoconjugation reactions orthogonal to common peptide functionalities hinged on the reactivity between the aldehyde function present in carbohydrates and oxy-amine functionalities (Fig. 12). As a simple example, the oxy-amine derivative of serine (83) can be used to chemoselectively ligate a free lactol.132 The downside of this ligation approach is that it primarily forms the ring-opened configuration of the reducing end glycoside (85).133 When the nitrogen in this oxy-amine is alkylated (86), the cyclic form of the glycoside is conserved after ligation (87).134,135 Another approach, that more closely mimics the structure of N-glycans, uses the hydrazide derivative of aspartic acid (88). Conjugation with the free lactol of GlcNAc (84) leads to the formation of an anomeric β-hydrazide (89), a close mimic of a native N-glycan, with an additional nitrogen in between the glycoside and the asparagine residue.132 The condensation of oxy-amines with carbonyl functionalities has also been used in the form of a chemoselective ligation between a peptide containing a ketone containing amino acid (90) and a hydroxylamine-glycoside (91).136
![]() | ||
Fig. 12 Various glycoconjugation approaches used for the synthesis of neoglycopeptides, based on the unique reactivity of oxy-amine and hydrazide derivatives. |
However, the most popular metal-catalyzed cross-coupling reaction used for the production of glycopeptide conjugates is the copper-catalyzed azide–alkyne cycloaddition (CuAAC). In this reaction, an alkylic azide reacts with an alkyne group to form a triazole structure. The reaction is mediated by Cu(I) catalysis and compatible with a wide variety of solvents including water and other polar media usable for the dissolution of unprotected peptides. Soon after the reactions’ discovery,139,140 the group of Rutjes produced the first triazole-linked glycopeptide by reacting protected dipeptides, containing the unnatural amino acid propargylglycine, with acetyl-protected 1-azido-glucose and -cellobiose.141 The utility of the reaction for the modification of unprotected peptides was discovered soon after. Walsh produced several variants of the peptide antibiotic tyrocidine containing one or more propargylglycine residues. These were, in unprotected form, conjugated with several unprotected sugars, modified with an anomeric azidoethanol function, displaying for the first time the utility of the CuAAC reaction in the late-stage introduction of glycosylation onto a peptide scaffold.142 This was further exemplified by the work of Davis34 and Brimble,143,144 producing various unnatural glycopeptides.
To further streamline the synthesis of glycoconjugate peptides via CuAAC, novel methods to produce glycosyl azides from reducing sugars were developed. Shoda introduced the reagent 2-chloro-1,3-dimethylimidazolinium chloride (DMC) for the activation of anomeric lactol towards nucleophiles like sodium azide, producing exclusively the β-azide on glucose-configured reducing sugars (Fig. 13A).145 This method enables the use of oligosaccharides isolated from natural sources, like the biantennary N-glycan isolated from egg yolk described in section 2, to be conveniently conjugated to peptides containing one or more alkyne handles. Fairbanks further improved on this approach by utilizing a more efficient reagent for the introduction of the azide, 2-azido-1,3-dimethylimidazolinium hexafluorophosphate (ADMP), and using this approach to synthesize several non-natively glycosylated MUC1 derivatives, among them compound 95, bearing a biantennary sialoglycan (Fig. 13B).146
![]() | ||
Fig. 13 (A) DMC145 or ADMP146 mediated synthesis of azidoglycosides from unprotected sugars in water. (B) One pot synthesis of a glycoconjugate MUC1-peptide (95) using ADMP mediated conversion of lactol 93 into the glycosyl azide. CuAAC conjugation in the same pot with alkyne-containing peptide 94, where Thr16 was replaced with propargylglycine, yielded the desired glycoconjugate.146 Reagents and conditions: (a) DMC, 2,6-lutidine, NaN3, H2O (b) ADMP, Et3N, D2O, MeCN (c) 94, CuSO4, ascorbic acid. |
The CuAAC mediated synthesis of glycoconjugates was soon utilized in the synthesis of more glycosylated antigenic peptides. The group of Brimble used CuAAC to synthesize six different MUC1 derived peptides containing 1–5 Tn-antigen mimics, introduced using a late-stage click reaction between unprotected peptides and glycosides (Fig. 14A).147 More recently, CuAAC-based approaches have been used to rapidly produce large libraries of antigenic peptides containing various oligosaccharides, in order to study the role peptide glycosylation has in antigen (cross)-presentation. The group of Codée investigated the binding of mannosylated gp100 antigen, an often studied cancer epitope,148 to the lectins DC-SIGN31 and Langerin.149 To study the effects of both multivalency and carbohydrate configuration, large libraries of glycopeptides bearing 1,2,3 or 6 azide handles, usable for the introduction of alkyne-modified oligomannosides, were produced. While the higher valency conjugates showed an increase in affinity for the targeted receptors, this did not translate into improved presentation of the antigens to T-cells.31,149
![]() | ||
Fig. 14 (A) Synthesis of MUC1 derived neo-glycopeptides bearing click-based Tn-antigen mimics by Lee et al.147 Selected threonine and/or serine residues were replaced with propargyl-glycine in the peptide sequence, and a CuAAC reaction with the Tn-antigen mimic 97 facilitated the synthesis of neoglycopeptides. (B) Synthesis of an antigenic peptide derived from chicken ovalbumin bearing a glycocluster of mannose-6-phosphate receptor binding glycans by Reintjes et al.32 By synthesizing a peptide bearing multiple azidonorleucine residues on the N-terminus, followed by CuAAC conjugation using propargyl glycoside 100, the effect of multivalent glycan presentation on antigen uptake could be studied. |
In a similar manner, stabilized ligands for the mannose-6-phosphate receptor, in the form of mannose-6-phophonate (100), conjugated to epitopes derived from chicken ovalbumin (99), were produced (Fig. 14B).32 All of these works were based on the synthesis of peptides bearing several azidonorleucine residues, conjugated to different carbohydrates all bearing an anomeric propargyl group.
CuAAC-based neoglycopeptide synthesis strategies have now shown to be able to quickly deliver various unnatural but functional glycoconjugate peptides. It is expected that these methods will be further exploited in the future to elucidate the complex roles of lectins in the immune system.
This journal is © The Royal Society of Chemistry 2022 |