Open Access Article
Priyanka
Bansal
,
Ahmed
Ben Faleh
,
Stephan
Warnke
and
Thomas R.
Rizzo
*
Laboratoire de Chimie Physique Moléculaire, École Polytechnique Fédérale de Lausanne, EPFL SB ISIC LCPM, Station 6, CH-1015, Lausanne, Switzerland. E-mail: thomas.rizzo@epfl.ch
First published on 21st January 2022
While glycans are present on the surface of cells in all living organisms and play key roles in most biological processes, their isomeric complexity makes their structural characterization challenging. Of particular importance are positional isomers, for which analytical standards are difficult to obtain. We combine ultrahigh-resolution ion-mobility spectrometry with collision-induced dissociation and cryogenic infrared spectroscopy to determine the structure of N-glycan positional isomers. This approach is based on first separating the parent molecules by SLIM-based IMS, producing diagnostic fragments specific to each positional isomer, separating the fragments by IMS, and identifying them by comparing their IR fingerprints to a previously recorded spectral database. We demonstrate this strategy using a bottom-up scheme to identify the positional isomers of the N-linked glycan G0-N, in which a terminal N-acetylglucosamine (GlcNAc) is attached to either the α-3 or α-6 branch of the common N-glycan pentasaccharide core. We then use IR fingerprints of these newly identified isomers to identify the positional isomers of G1 and G1F, which are biantennary complex-type N-glycans with a terminal galactose attached to either the α-3 or α-6 branch, and in the case of G1F a fucose attached to the reducing-end GlcNAc. Starting with just a few analytical standards, this fragment-based spectroscopy method allows us to develop a database which we can use to identify positional isomers. The generalization of this approach would greatly facilitate glycan analysis.
The inherent complexity of glycan analysis has led to the use of various experimental approaches. Nuclear magnetic resonance (NMR) and X-ray crystallography can be extremely valuable for glycan structural determination, but they require large amounts of highly purified samples, which limits their application.8–10 Single-stage mass spectrometry (MS) is by its very nature blind to isomers and must be combined with other techniques to extract structural information from isomeric glycans. One approach is to selectively fragment isomers using collision-induced dissociation (CID) in MSn schemes to elucidate glycan sequence and branching patterns.11–13 However, interpretation of fragmentation data is not always straightforward, and MSn alone cannot distinguish among all isomeric forms.14 It is therefore often used in conjunction with separation methods such as capillary electrophoresis (CE) and liquid chromatography (LC).15–20 Despite its extensive use in glycan analysis, the identification of subtly different glycan isomers by LC-MS remains challenging.21
The use of various IMS technologies together with tandem MS has been reported for the analysis of isomeric glycans, including positional isomers.22–32 While ultrahigh-resolution IMS based on structures for lossless ion manipulation (SLIM)33,34 has been applied to achieve separation of different kinds of glycan isomers,35,36 the assignment of a drift peak to a precise isomeric form often remains elusive.35,37
We have previously demonstrated that cryogenic vibrational spectroscopy can be used for the identification of glycan isomers based on their unique infrared (IR) absorption spectrum.38–40 Such fingerprint spectra are robust identifiers, since they are inherent to a given molecule and largely insensitive to experimental conditions. An IR spectrum can thus positively identify a compound even when the spectra of other possible isomeric species are not known. The combination of IR fingerprinting with SLIM-based ultrahigh-resolution cyclic IMS was subsequently demonstrated to be a powerful tool for the analysis of glycan mixtures.41–44 However, any identification method that relies on a database, such as IR spectral fingerprinting, requires access to pure analytical standards. For many complex glycan structures, such standards are either expensive to synthesize or unavailable, which raises the need to create reference data from glycans in isomeric mixtures. Our recently developed IMS-IMS approach, where fragments of isomer-separated species can undergo further isomer separation before spectroscopic interrogation,45 poses an ideal starting point for the development of such methods.
In this work we describe a technique to identify positional isomers of underivatized N-glycans with a reduced need for pure analytical standards. To this end, we have developed an experimental protocol based on isomer separation using ultrahigh-resolution IMS followed by fragmentation and IR fingerprinting of structurally diagnostic fragments specific to each positional isomer. We then construct an IR spectral database for the parent glycans and their fragments, the latter of which we use to identify positional isomers from an isomeric mixture. We demonstrate the principle of this new approach by following a bottom-up scheme to identify the positional isomers of the N-glycans G0-N, G1 and G1F.
:
N2 mixture (80
:
20) is pulsed into the trap to help confine and cool them. During this process, the ions form weakly bound clusters with one or two N2 molecules, which are used to perform messenger-tagging IR spectroscopy.46,48 The tagged ions are irradiated for 50 ms with a continuous wave, mid-IR, fiber-pumped laser (IPG Photonics), operated at 1 W output power with a linewidth of ∼1 cm−1. Photons are absorbed by the N2-tagged ions when the frequency of the incident infrared light is resonant with a vibrational transition of the molecule, leading to redistribution of vibrational energy and causing the N2 tag to detach. The ions are then extracted towards a commercial time-of-flight mass spectrometer (Tofwerk) to measure their mass spectrum. We obtain an IR spectrum by monitoring the depletion of N2-tagged molecules as a function of the laser wavenumber. All ions were studied in their singly sodiated charge state, and our spectral database was constructed for the sodiated species. Since we are trying to determine the glycan primary structure and not 3-dimensional structure, we are free to choose whatever charge state is most convenient.
![]() | ||
| Fig. 1 (a) Structure of the glycans used in this work, using CFG notation. (b) Glycan fragment notation according to Domon and Costello.49 | ||
We use glycan notation issued by the Consortium for Functional Glycomics (CFG). The nomenclature of glycan fragments follows that of Domon and Costello, which is summarized in Fig. 1b.49
In step I of Fig. 2, reference IR fingerprints of small analytical standards are recorded after IMS to separate α and β reducing-end anomers26,35,42 or coexisting gas-phase conformers. In step II, positional isomers are identified by first separating them by IMS and then subjecting them to CID to create structurally diagnostic fragments from each isomer peak. The diagnostic species for isomers A′ and B′ are labeled A and B, respectively. Fragments are then identified based on their IR fingerprint spectrum that has been recorded in step I, and the precursor positional isomers A′ and B′ can subsequently be assigned. Further IMS isomer separation of fragments might be necessary if the presence of positional fragment isomers cannot be excluded.
Once an isomer is identified based on the IR fingerprints of its fragments, its own IR fingerprint is recorded and stored in the database in step III. This serves two purposes: (1) in a subsequent encounter of this species, it can be directly identified by its IR fingerprint without the need to analyse its fragments; and (2) its fingerprint can potentially be used to identify structurally diagnostic fragments of larger positional isomers. Further in step III, isomers identified by the database can serve as precursor structures to create other fragments that were not accessible through fragmentation of smaller precursor species, such as species a′ and b′ in Fig. 2. It is important to note that these diagnostic fragments need not correspond to molecules that exist in solution. The proposed method can thus serve both to identify isomers of glycans and to construct an IR fingerprint database in a bottom-up fashion.
An important assumption on which we base our identification scheme is that the dissociation of a single covalent bond is more likely to occur under the CID conditions applied here than the dissociation of two bonds. Therefore, only Y3α and Y3β fragments of G0-N were taken into consideration.
Figure 3(a) shows the arrival time distribution (ATD) of singly sodiated G0-N ions (m/z 1136) obtained from the commercial sample, where two distinct peaks were observed after one mobility-separation cycle (10 m drift path). While it might be tempting to assume that these two drift peaks correspond to the two positional isomers of G0-N, they might also be a result of the two reducing-end anomers for one of the species42 or different conformational states of the molecules.
Structurally diagnostic fragment ions (m/z 771) corresponding to the Man-2 species in the initial database were generated upon CID of each mobility-selected drift peak, and their resulting IR fingerprints are depicted in Fig. 3(b) (tan). The two fingerprint spectra are strikingly similar in position and intensity of the resolved absorption bands, with the only difference being the precise position of three bands in a wavenumber region corresponding to free OH-stretching vibrations (3620 cm−1–3670 cm−1). While both spectra have similarities with the database spectrum of Man-2(6), neither of them poses a perfect match that would confirm its presence. On the other hand, the similarity of the fingerprint spectra can be an indicator that these species represent the two reducing-end anomers42,50 that were not separated for the Man-2(6) standard. Under this assumption, we prepared a synthetic mixture in a 40/60 ratio of the two fragment IR fingerprints, corresponding to the relative intensities of the drift peaks in the ATD in Fig. 3(a). The resulting spectrum represents an extremely good match to the database fingerprint of Man-2(6), depicted in orange and grey, respectively, in Fig. 3(b). By assigning fragments from both mobility features of G0-N to Man-2(6), the content of the analysed sample was determined to be constituted solely by one positional isomer of G0-N, namely G0-N(3), despite the specifications from the supplier. Consequently, the two drift peaks observed for G0-N are likely a result of the two reducing-end anomers.42Figure 3(c) summarizes the identification procedure.
In accordance with step III in the scheme of Fig. 2, we recorded the IR fingerprint spectra of both G0-N(3) species (Fig. S3†), which can then be used for its identification in complex samples without having to analyse its fragments again. One should also conclude from these results that sample description from suppliers for subtly different isomers should be cautiously evaluated.
To obtain an IR fingerprint of the other positional isomer, G0-N(6), we applied our CID scheme to the glycan G0, which should yield both positional isomers of G0-N as fragments (Fig. 4(a)). Lacking any positional isomers, singly sodiated G0 ions were fragmented without prior ion-mobility separation. Fragments with m/z 1136, generated upon the loss of a GlcNAc monosaccharide, can exist in three isomeric forms Y4α, Y4β, and C4, where Y4α and Y4β correspond to G0-N(6) and G0-N(3), respectively (Fig. 4(a)). Following fragmentation of G0, the m/z 1136 fragments underwent one additional cycle of mobility separation. The resulting ATD, which features three well-separated peaks, is displayed in Fig. 4(b). The IR spectra of all three peaks are shown in Fig. 4(c), where they are paired with their best matching database fingerprint (displayed in grey, where applicable). A simple visual comparison of spectra confirms the assignment of the first two peaks in the ATD to the two isomers of G0-N(3) (i.e. fragment Y4β) for which we previously obtained the reference spectrum shown in Fig. S3.† The glycan corresponding to the C4 fragment is available commercially, and its IR fingerprint was measured and added to the database after mobility separation, however it does not correspond to that of the third mobility species in the fragment ATD (compare the bottom two spectra in Fig. 4(c)). Moreover, the molecule corresponding to the C4 fragment has an arrival time in between that of the second and third drift peak in the fragment ATD of Fig. 4(b) (Fig. S4†). We can therefore, by exclusion, assign the third drift peak and its spectrum in Fig. 4(c) to the Y4α fragment, which corresponds to the glycan G0-N(6), and store its fingerprint in the database to be used either for its future identification or to identify a structurally diagnostic fragment of a larger glycan. It should be noted that the spectrum of this isomer represents a mixture of both reducing-end anomers.
Isomer separation of singly sodiated G1 ions was performed prior to fragmentation, and the m/z 1136 fragments were sent through one additional cycle of ion mobility separation. An ATD of G1 after one separation cycle is shown in Fig. 5(a), where the drift-time windows selected for CID are highlighted. The ATD of the fragments generated from the first mobility peak of the G1 ions (blue), exhibits two features, whereas the fragment ATD generated from the second mobility region (pink) shows a single peak at longer drift times.
The IR spectra of all individual peaks in the fragment ATDs were recorded and compared to best-matching database IR fingerprints (Fig. 5(b)). Visual comparison confirms that both fragments generated from the first mobility peak (peach and green) correspond to the two species of G0-N(3) for which two isomers (presumably the reducing-end anomers) were added to the database, whereas fragments generated from the second mobility region (cyan) correspond to the database entry of G0-N(6). Based on the identification of these structurally diagnostic fragments, we can assign the first and second mobility region in the ATD of G1 to the positional isomers G1(6) and G1(3), respectively. The procedure is summarized in Fig. 5(c).
Spectra for both positional isomers G1(3) and G1(6) were recorded (Fig. S5†) and added to the IR fingerprint database and can be used for subsequent identification of G1 positional isomers or as structurally diagnostic fragments of yet larger glycans. It should be noted that under the separation conditions used here, the reducing-end anomers of G1 are completely unresolved for the positional isomers G1(6) and partially resolved for G1(3). However, given the partial separation observed for the latter, a few more cycles would be necessary to fully resolve these anomers, which might be useful if anomer-resolved reference spectra are required for fragment identification.
In accordance with the scheme described in Fig. 2, ion-mobility separation of G1F was performed prior to fragmentation. Figure 6(a) shows the ATD of singly sodiated G1F ions after four separation cycles and the drift-time windows selected for fragmentation are highlighted in different colours (red and purple). Following fragmentation, structurally diagnostic fragments (m/z 1501) underwent one additional cycle of mobility separation. The ATD of fragments generated from first two mobility peaks of G1F taken together (red), displays a single peak with a hint of an unresolved shoulder on its left. The ATDs obtained after fragmenting each of the parent drift peaks separately confirms the presence of two overlapping features (see Fig. S6†). The resulting fragment ATD from the second mobility region of G1F (purple) exhibits two partially resolved peaks.
The IR spectra of the fragments are shown in Fig. 6(b), where they are paired with their best matching database fingerprint (displayed in grey), which we obtained as described in the previous section (see Fig. S5†). Visual comparison confirms that the spectrum of fragments (pink) generated from the first region in the ATD of G1F corresponds to G1(3) and the fragment (blue) generated from the second region in the ATD of G1F corresponds to G1(6). This identifies the first two peaks in the ATD of G1F as belonging to the G1F(3) positional isomer, and the broader ATD peak arriving at longer drift time to the G1F(6) positional isomer. The assignment procedure is summarized in Fig. 6(c).
We recorded the IR fingerprints of the newly identified positional isomers of G1F (Fig. S7†), which can now be used either for their identification in complex mixtures or as structurally diagnostic fragment for yet larger glycans. It is important to note that the spectra for first two peaks, assigned as G1F(3), are very similar in their band positions and intensities, with the only noticeable difference being the bands at 3660 cm−1 and 3655 cm−1. This furthermore supports the hypothesis that these two drift peaks likely correspond to the two reducing-end anomers of G1F(3). Under the separation conditions used here, the broad peak assigned to be the G1F(6) isomer could not be fully resolved, and therefore the resulting IR spectrum represents a mixture of both reducing-end anomers.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/d1an01861b |
| This journal is © The Royal Society of Chemistry 2022 |