Open Access Article
Yixuan
Xie‡
ab,
Siyu
Chen‡
a,
Michael Russelle
Alvarez
a,
Ying
Sheng
a,
Qiongyu
Li
a,
Emanual
Maverakis
c and
Carlito B.
Lebrilla
*ad
aDepartment of Chemistry, University of California, Davis, Davis, California, USA. E-mail: cblebrilla@ucdavis.edu
bDepartment of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
cDepartment of Dermatology, University of California, Davis, Sacramento, California, USA
dDepartment of Biochemistry, University of California, Davis, Davis, California, USA
First published on 11th March 2024
Cell membrane glycoproteins are generally highly fucosylated and sialylated, and post-translational modifications play important roles in the proteins' functions of signaling, binding and cellular processing. For these reasons, methods for measuring sialic acid-mediated protein–protein interactions have been developed. However, determining the role of fucose in these interactions has been limited by technological barriers that have thus far hindered the ability to characterize and observe fucose-mediated protein–protein interactions. Herein, we describe a method to metabolically label mammalian cells with modified fucose, which incorporates a bioorthogonal group into cell membrane glycoproteins thereby enabling the characterization of cell-surface fucose interactome. Copper-catalyzed click chemistry was used to conjugate a proximity labeling probe, azido-FeBABE. Following the addition of hydrogen peroxide (H2O2), the fucose-azido-FeBABE catalyzed the formation of hydroxyl radicals, which in turn oxidized the amino acids in the proximity of the labeled fucose residue. The oxidized peptides were identified using liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS). Variations in degree of protein oxidation were obtained with different H2O2 reaction times yielding the acquisition of spatial information of the fucose-interacting proteins. In addition, specific glycoprotein–protein interactions were constructed for Galectin-3 (LEG3) and Galectin-3-binding protein (LG3BP) illustrating the further utility of the method. This method identifies new fucose binding partners thereby enhancing our understanding of the cell glycocalyx.
When present, sialic acids most commonly modify the nonreducing termini of cell-surface glycans. Several approaches have been developed to investigate sialic acid-mediated interactions.5 However, far fewer analytical methods exist to characterize fucose–peptide interactions, due in part to the greater diversity of fucose linkages. Fucose–peptide interactions are typically studied using glycan arrays;6 however, this method provides only binding information with no spatial resolution for the chemical interactions. Another recently developed method to determine fucose-mediated interactions is FucoID, which utilizes in situ proximity-based transfer of fucosylated biotin using fucosyltransferases.7 This method utilizes fluorescence-based methods to detect the fucosylated biotin and is devoid of glycan structural information. Wibowo et al. synthesized photoactivatable glycopolymers to mimic cellular fucosylated glycoproteins, allowing the isolation of fucose-binding proteins through photo-cross-linking.8 While informative, this technique alone cannot elucidate the complex interactions of fucosylated glycoproteins on the cell membrane. More recently, Sun et al. integrated protein–protein cross-linking and enzymatic reactions to investigate cell-surface glycoprotein interactions.9 Similar crosslinking tools have also been developed in this laboratory for capturing sialylated glycoproteins and proximal proteins for mass spectrometry (MS) identification.10 This cross-linking method provides site-specific information about the interacting proteins; however, it may not be applied to fucose-associating proteins because fucose is labile during tandem mass spectrometry (MS/MS) analysis, making the detection of cross-linked products potentially unreliable.
In addition to the aforementioned techniques, proximity-based labelling strategies also have great potential for interrogating glycan–protein interactions.11,12 In our previous studies, we have employed an iron-based proximity tag, FeBABE, to identify the sialic acid environment on the cell surface.13 Similarly, Meyer et al. developed a photo labeling tool to reveal the sialylated glycoproteins in regulating cell-surface transporters.14 However, the same technique is difficult to apply for mapping fucose-associating proteins because the corresponding 6-azido-fucose (6AzFuc) cannot be efficiently incorporated into fucosylated glycans, while 6-alkynyl-fucose (6AlkFuc) blocks the activity of fucose transferases, which in turn alters the glycan profile.15,16 Of particular interest to the current study, Kizuka et al. reported a novel 7-alkynyl-fucose (7AlkFuc) probe, which can be metabolically incorporated into cell-surface fucosylated glycoproteins with low toxicity.17 We reasoned that an alkynyl group on fucose could be used as the handle for attaching the oxidative reagent FeBABE. The FeBABE catalyst motif when attached onto fucose residues of glycoproteins acts as a proximity-labeling probe allowing fucosylated glycoproteins and their binding partners to form stable complexes, followed by Fe-redox chemistry to oxidize the neighboring residues.18
Herein, we describe a proximity-based method to decipher the cellular fucosylated glycoprotein-associating environment in situ (Scheme 1). We termed the method “protein oxidation of fucose environments” (POFE). Briefly, a synthetic azido-FeBABE (AzFeBABE) probe was conjugated to the cell-surface fucosylated glycoproteins via copper-catalyzed azide–alkyne cycloaddition (CuAAC) click chemistry. Upon treatment with low-concentration hydrogen peroxide (H2O2), the incorporated iron catalyzed the generation of hydroxyl radicals, which oxidized amino acid residues residing in close proximity to the labeled fucose. The oxidized peptides were then identified using MS-based proteomic analysis. Furthermore, the hydroxyl radicals diffused to varying distances with different H2O2 incubation times, and thus, the spatial environment of fucose would be revealed by quantitative LC-MS/MS analysis.
To validate the incorporation and optimize the conditions of 7AlkFuc treatment for PNT2 cells, the alkynyl-modified glycoproteins were coupled with an azide-rhodamine reporter tag using CuAAC click chemistry and visualized by SDS-PAGE with in-gel fluorescence scanning. The signal increased with rising 7AlkFuc concentration and reached a plateau when the concentration reached 100 μM (Fig. S2†). Generally, with 100 μM treatment for 72 hours, the fucose bioorthogonal reporter was efficiently incorporated into cell fucosylated glycoproteins.
The localization of 7AlkFuc-containing glycoproteins was analyzed by labeling with coumarin azide and visualized using confocal microscopy. PNT2 cells were grown on a glass-bottom dish and treated with 7AlkFuc for three days. The cells were then incubated with 3-azido-7-hydroxycoumarin for 1 hour and labeled using CellMask™ Deep Red, a marker for the cell plasma membrane, for confocal imaging. The 7AlkFuc-treated cells showed a strong signal (red) after reaction with coumarin azide (Fig. S3†). This signal overlapped well with the membrane signal, indicating the incorporation of 7AlkFuc on the cell plasma membrane. In comparison, 6AlkFuc-treated cells displayed a much lower signal, consistent with the glycomic results obtained from LC-MS/MS analysis suggesting much lower incorporation (Fig. S4†). Notably, 6AzFuc was found mostly enriched inside the cells (Fig. S5†) as opposed to the membrane, which may be explained by its high incompatibility with being transferred onto cell proteins rendering the membrane incorporation pathway inefficient. This largely depends on the performance of glycosyltransferases in transporting the corresponding active unnatural sugar-nucleotides into glycoproteins. Some reporters such as 6AlkFuc can even induce unexpected inhibition of these carbohydrate-active enzymes.21,22
We further validated the results of POFE by performing gene ontology analysis of the oxidized proteins and annotating them based on cellular components (ESI 1 and 2†).23 The majority of the oxidized proteins (128 proteins in PNT2 and Caco2) were designated as cell membrane components with a small fraction (12) of proteins in the extracellular space. Thus, nearly all the proteins were indeed found in the cell membrane or at least associated with the membrane. Two proteins – P29692 (EEF1D) and P61513 (RL37A) – were annotated as cytosolic. However, these proteins were oxidized after 15 minutes of incubation, at which point the hydroxyl radicals could have diffused into the cell.
In addition to PNT2 cells, we similarly tested the POFE method with Caco-2 cells. To identify the oxidized proteins with high confidence, we filtered the results generated by the software Byonic with the following criteria: Score > 300, DeltaMod score > 10.0, and log
ProbValue determined from the test > 2. We looked at oxidation modifications and intensities of peptides containing specific amino acid residues: +15.994915 to M, C, W, Y, F, H, L, I, R, V, T, P, K; +13.9793 to L, I, R, V, P, K; +31.9898 to M, W, Y, F, C; +47.9847 to C; −15.9772 to C; −23.0106 to H; −22.032 to H; −10.032 to H; −43.0534 to R; −30.0106 to D, E; and +47.9847 to C (ESI 3 and 4†).13,24 Furthermore, we manually annotated the spectra of peptides detected in the control, H2O2, and FeBABE–H2O2 experiments (Fig. S7†). The experiments yielded more than 150 oxidized peptides, corresponding to more than 80 proteins from each of the PNT2 and Caco-2 cells (Fig. 1b). Less protein oxidation was observed for Caco-2 cells due to the lower 7AlkFuc incorporation.25 Indeed, the efficiency of oxidation by the same probe on different cell lines can vary to a large extent. For example, PNT2 cells offered a much more effective incorporation for both ManNAz and 7AlkFuc than Caco-2 and similarly yielded more oxidation.26
The oxidized proteins using the fucose-centered probes were compared to our previous results based on sialic acid probes. For PNT2 cells, many of the glycans were sialofucosylated (containing both sialic and fucose residues). Indeed, comparison of the proteins oxidized with the fucose probe yielded a 60% overlap with the sialic acid probe (Fig. 1c). In contrast, comparison of the same fucose probe with different cell lines, Caco-2, yielded less than 30% similarities (Fig. 1d). The results can be rationalized by the differences in cell-surface glycosylation of the two cell lines. N-Glycan composition analyses using LC-MS yielded varying distributions of fucosylated-only, sialylated-only glycans, and sialofucosylated glycans. Specifically, PNT2 cells were mainly associated with sialofucosylated glycans (>36% total relative abundances) with only 7% of the glycans being fucosylated-only glycans (Fig. 1a). For Caco-2, we observed the reverse, with 40% of fucosylated-only glycans and 40% of sialofucosylated glycans. Consequently, the higher percentage of asialo-fucosylated glycans in Caco-2 cells resulted in a larger portion of unique proteins in the fucose environment that were not identified in the sialic acid environment.
Gene ontology enrichment analyses for Molecular Function were conducted using STRING-DB, applying a 0.400 interaction score and 0.05 FDR value cutoff.23
Initially, a small group of proteins (16 proteins in total) were observed to be oxidized after 5 minutes of incubation, all of which were fucosylated according to a separate glycoproteomic analysis. The results suggested that the hydroxyl radicals were indeed generated near the fucose and in turn oxidized the fucosylated proteins themselves. These proteins included membrane glycoproteins such as BSG (basigin), CD44, and EGFR (epidermal growth factor receptor), which are well known to be present and abundant on the PNT2 cell membrane.30
After 10 minutes of reaction, additional glycoproteins and several non-glycosylated proteins such as CTNB1 (catenin beta-1) were oxidized. To understand further the nature of the protein complexes on the cell membrane, we annotated the functions of the proteins that were oxidized after each successive time-point (Fig. 2b). The interaction networks for both PNT2 (Fig. 2c and S10†) and Caoc-2 (Fig. 2d and S11†) grew larger with each succeeding time-point, with the number of proteins involved in carbohydrate-binding increasing considerably after 15 minutes of incubation. This result suggested that proteins involved in these pathways were being oxidized due to their interactions with the fucosylated glycoproteins. As shown in Fig. 3a and b (Fig. S12 and S13†), we evaluated the interactions of a specific glycoprotein, LEG3, and found a similar pattern from both PNT2 and Caco-2, in which there is an increase in carbohydrate-binding, cell adhesion, and signaling proteins after 15 minutes of incubation time. Collectively, these results suggest a time-dependent propagation of the oxidation reaction based on proximity to fucosylated source glycoproteins.
We further annotated the oxidized proteins based on cellular component analysis. Cellular component analysis showed that all these proteins were found in the extracellular space or plasma membrane (Fig. S14a†). Additionally, we explored the membrane topology of these proteins using DeepTMHMM,31 and found that a large fraction of them contained transmembrane domains (Fig. S14b†). The results further confirm that these interactions are occurring on the plasma membrane and the highly interactive network in the cell glycocalyx. We combined the data from the first three layers, we constructed a highly interactive protein network through STRING analysis,23 and visualized the interactome using the Cytoscape software (Fig. S15†).32
After 30 minutes an additional 20 proteins were identified, however only four were annotated as plasma membrane proteins. The remainder were annotated as cytoplasm and nucleus proteins (data not shown). This was possibly due to the hydroxyl radicals diffusing across the membrane and labeling the intracellular proteins after 15 minutes.
In summary, more than 65% of the proteins interacting with the fucosylated glycans of the core proteins were annotated as binding proteins according to the Gene Ontology (GO) function annotation (Fig. S16a†). The binding functions were further sub-grouped into cell adhesion molecule binding, cadherin binding, protein-containing complex binding, and exogenous protein binding (Fig. S16b†). In addition, nearly 40% of the identified proteins were associated with cell adhesion molecule binding, consistent with the fact of fucose being highly involved in cell adhesion.32 In addition, dysregulation of fucose has been known to impair the attachment of colorectal cancer cells and affect key cell functions modulating tumor progression.33 Similarly, we found that the fucose-proximal proteins are enriched in proteins related to cell–cell adhesion. Also enriched was the biological function pathway of L1CAM (neural cell adhesion molecule L1) interactions. Indeed, it has been reported that core fucosylation impacted L1CAM proteolytic cleavage and the ability of L1CAM-supported melanoma metastasis.34 Overall, these results demonstrated that the current method can effectively map proteins within the surface fucosylated glycoprotein environment.
Afterwards, we incorporated N-glycan residues on LG3BP–Asn551 to determine the effect of specific N-glycan residues on the glycan–protein interactions. We modeled the following tetra-antennary N-glycans on Asn551: undecorated Hex7HexNAc6, core-fucosylated Hex7HexNAc6Fuc1, sialylated Hex7HexNAc6NeuAc1, and core-fucosylated Hex7HexNAc6Fuc1NeuAc1. After the equilibration, we monitored the frequency and types of contacts between the N-glycan and amino acid residues (Fig. 4a). First, we observed a drastically higher frequency of hydrogen-bonding interactions between the N-glycan and protein upon the addition of either a fucose (Fig. 4b) or a sialic acid (Fig. 4c) residue; the effect becomes more pronounced with the addition of both sialic acid and fucose (Fig. 4d). Based on the types of interactions, the sialic acid in Hex7HexNAc6NeuAc1 contributes additional hydrogen bonding and electrostatic interactions, specifically with a buried His217 residue in LEG3. On the other hand, fucose itself contributes to several hydrogen-bonding interactions. Furthermore, in the binding between LEG3 and LG3BP–Hex7HexNAc6 and Hex7HexNAc6NeuAc1, the galactose residues contribute the most hydrogen-bonding interactions, which is consistent with the previous results.37 In contrast, upon addition of core-fucose to the N-glycan, the N-acetylglucosamine residues then contribute the most hydrogen-bonding interactions. As shown in Fig. S19,† further comparison of the fucosylated N-glycan structures with undecorated N-glycan shows significantly different conformations (RMSD = 0.826, 309 atoms). This result suggests that the addition of core-fucose to the N-glycan changes the conformation of the N-glycan significantly, such that it exposes the core N-acetylglucosamine residues to more interactions, specifically with LEG3–Arg183.
When mapping these LG3BP–N-glycan and LEG3–amino acid residue interactions on the domain map of LEG3 (Fig. S20†), we observed several interesting features. Across the different N-glycan structures, several amino acid interactions are conserved: mannose interacting with LEG3–Gly13, Gly21, Trp22, Pro23, and galactose residues interacting with LEG3–Glu205, Pro206, Asp207, and Gln220. The addition of fucose in the N-glycan increases the number of interactions with the Galectin domain of LEG3, specifically with residues Arg183, Glu184, Glu185, Asn222, and Arg224. On the other hand, adding sialic acid to the N-glycan includes more interactions with Lys210, and specifically the electrostatic interaction with His217. Based on these results, it can be observed that the composition of the N-glycan in LG3BP drastically changes the molecular interactions with LEG3. Overall, by integrating the information from the POFE method with glycoproteomic analysis, specific fucose-mediated interactions can be investigated in detail.
:
1, followed by argon flashing to remove the oxygen. The structure of the product was determined by LC-MS analysis in negative ionization mode.
000×g for 30 minutes, and the supernatant was purified using porous graphitic carbon (PGC) on an SPE plate. The glycan samples were dried and reconstituted in 30 μL of nanopure water. The sample (5 μL) was injected and analyzed with an Agilent 6520 Accurate Mass Q-TOF LC/MS equipped with a PGC nano-chip (Agilent, CA). A binary gradient using solvent A with 3% (v/v) ACN and 0.1% (v/v) formic acid in water and solvent B with 90% (v/v) ACN and 1% (v/v) formic acid in water was applied to separate N-glycans at a 300 nL min−1 flow rate. The resulting chromatograms of glycans were extracted with the MassHunter Qualitative Analysis B08 software (Agilent, CA). N-Glycan compounds were identified using GlycoNote (https://github.com/MingqiLiu/GlycoNote), which contains the accurate mass and formula of human N-glycans, and the N-glycan structures were confirmed through tandem MS fragmentation.
The dried peptides were reconstituted with 0.1% (v/v) FA in water and separated on an UltiMate WPS-3000RS nanoLC system using a C18 column (3 μm, 0.075 mm × 250 mm) coupled with a Fusion Lumos Orbitrap Mass Spectrometer (Thermo Scientific, CA). A binary gradient was applied using 0.1% (v/v) formic acid in (A) water and B 80% acetonitrile: 0–5 min, 4–4% B; 5–133 min, 4–32% B; 133–152 min, 32–48% B; 152–155 min, 48–100% B; 155–170 min, 100–100% B; 170–171 min, 100–4% B; 171–180 min, 4–4% B. The instrument was run in data-dependent mode with 1.8 kV spray voltage, 275 °C ion transfer capillary temperature, and the acquisition was performed with the full MS scanned from 700 to 2000 in positive ionization mode. Stepped higher-energy C-trap dissociation (HCD) at 30 ± 10% was applied to obtain tandem MS/MS spectra with m/z values starting from 120. Glycopeptide fragmentation spectra were annotated using Byonic software. Common modifications, including cysteine carbamidomethyl, methionine oxidation, asparagine deamidation, and glutamine deamidation, were assigned.
:
100 protease inhibitor, 0.25 M sucrose, and 20 mM HEPES–KOH (PH 7.4).
000×g for 45 min at 4 °C to extract the cell membrane. The pellets were washed stepwise with 500 μL of 0.2 M Na2CO3 solution and 500 μL of water through the same ultra-centrifugation, to obtain the cell membrane fraction.
The peptide samples were reconstituted with 0.1% (v/v) FA in water and separated on an EASY-nanoLC system (Thermo Scientific, CA) using the Acclaim PepMap C18 column (3 μm, 0.075 mm × 250 mm) at a flow rate of 300 nL min−1. Water containing 0.1% formic acid and 80% acetonitrile containing 0.1% formic acid were used as solvents A and B, respectively. Peptides were resolved using the following binary gradient: 0–135 min, 2–25% B; 135–173 min, 25–40% B; 173–180 min, 40–100% B. The peptides were analyzed on a Q-Exactive Plus Orbitrap (Thermo Scientific, CA) with the full MS scanned from 200 to 2000 in positive ionization mode. The charge range of the precursor ions was 2 to 7 and the top 15 of them were selected for higher-energy collisional dissociation (HCD) fragmentation with stepped collision energy. Each selected precursor was excluded for 10 seconds after the fragmentation. The MS/MS spectra were collected for product ions with m/z over 130.
Prob. > 2.0, and DeltaMod score > 10. Alkylation of cysteine with carbamidomethylation was assigned as a fixed modification. Deamidation of asparagine and glutamine, methylation of lysine and arginine, and acetylation of protein N-terminus were assigned as rare variable modifications. Oxidation modifications were selected as common variable modifications according to previous settings: +15.994915 to M, C, W, Y, F, H, L, I, R, V, T, P, K; +13.9793 to L, I, R, V, P, K; +31.9898 to M, W, Y, F, C; +47.9847 to C; −15.9772 to C; −23.0106 to H; −22.032 to H; −10.032 to H; −43.0534 to R; −30.0106 to D, E; and +47.9847 to C.24,40,41 After the identification, the peptides were quantified using the Byologic software (Protein Metrics, CA), by measuring the extracted ion chromatograms (XICs) of the oxidized peptides. Oxidized peptides were further annotated for pathway enrichment. Gene ontology enrichment analyses for Molecular Function were conducted using STRING-DB, applying a 0.400 interaction score and 0.05 FDR value cutoff.23
In the future, we expect that the glycan-based proximity labeling approach can be applied to other glycoforms after development of new carbohydrate reporters, and eventually, the combination of these techniques will provide a comprehensive view of the cellular glycocalyx environment and extend our knowledge in glycobiology.
Footnotes |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3sc06432h |
| ‡ Yixuan Xie and Siyu Chen contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2024 |