Kousik
Chandra
a,
Samah
Al-Harthi
a,
Fatimah
Almulhim
a,
Abdul-Hamid
Emwas
b,
Łukasz
Jaremko
*a and
Mariusz
Jaremko
*a
aBiological and Environmental Science and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia. E-mail: Mariusz.jaremko@kaust.edu.sa
bCore Laboratories, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
First published on 25th May 2021
Here, we implemented and validated a suite of selective and non-selective CPMG-filtered 1D and 2D TOCSY/HSQC experiments for metabolomics research. They facilitated the unambiguous identification of metabolites embedded in broad lipid and protein signals. The 2D spectra improved non-targeted analysis by removing the background broad signals of macromolecules.
The advantage of NMR metabolomics is the simultaneous monitoring of the entire sample content, including metabolites and other entities like macromolecules. However, it is also often a limitation. Real samples are composed of hundreds or thousands of different compounds with a very high dynamic range of concentrations and signal shape. As a result, the severe overlap of signals from a few highly concentrated components dominates multivariate data analysis, such as principal component analysis (PCA).3 However, it is essential to also study the minor components as they dominate the biochemical and physiological properties and can be utilized as specific biomarkers of a disease or state.4 Targeted analysis explicitly monitors a predefined set of compounds and biomarkers.5 The greatest challenge in the targeted analysis is the presence of large macromolecules like proteins, lipids, and membranes that cover the signals from the small metabolites.6 For example, it is highly challenging in human blood serum samples to identify and quantify NMR signals arising from branched-chain amino acids (BCAAs) like valine, leucine, and isoleucine (resonating at 0.92–0.99 ppm). This is because the lower signals arising from BCAAs are substantially overlapped by the broad lipid signal resonating at 0.89 ppm (–CH3 groups).2 Since the BCAA signals bear some lipid components, relaxation-based editing has been commonly used to precisely observe the slowly relaxing molecules in bio-fluid.7,8
To address this challenge, we combine and implement total correlation spectroscopy (TOCSY)9-based magnetization transfer, transverse relaxation editing within a minimum set of pulse programs and subsequently validate this set for robust targeted and non-targeted analysis of bovine serum. There has been phenomenal advancement regarding the fast and efficient acquisition of homonuclear experiments for labile sites in proteins10 based on the HADAMARD principle.11 Recently significant advancement was made feasible by the application of PANSY (Parallel NMR SpectroscopY) experiments to obtain sufficient spectral information for the structure elucidation of small organic molecules.12 Diffusion ordered (DO) NMR spectroscopy has combined with statistical total correlation spectroscopy (STOCSY)13 for complex bio-mixture analysis.14 Here we take an approach that combines the previously developed advanced features like CPMG or TOCSY and 13C dispersion in the indirect dimension etc., and the main aim is to present the minimum number of experiments to encompass the full spectrum of targeted and non-targeted NMR based metabolomics analysis. This approach has the advantage of easy implementation and general applicability.
The minimum set of pulse sequences was selective 1D 1H/2D 1H–1H CPMG TOCSY for targeted analysis (Fig. 1). For non-targeted analysis, we used non-selective 2D 1H–1H CPMG TOCSY, and 2D 13C–1H CPMG HSQC TOCSY (Fig. 1). This is the minimum set of pulse sequences required for complete metabolite identification in NMR-based metabolomics. Efficient relaxation editing was achieved using the CPMG (Carr–Purcell–Meiboom–Gill) pulse train15 on transverse magnetization. For targeted analysis, a selective 1D 1H experiment with some known resonance for the desired molecule is required. Ideally, the spectrum should also contain molecular structural information. TOCSY experiments contain the complete spin-system connectivity, as they are capable of correlating all possible through bond-coupled nuclei within the spin network of a given molecule. The extent of magnetization transfer controls the degree of correlation during the TOCSY experiment mixing time. This parameter provides the opportunity to improve the signal assignment. Several mixing sequences have been proposed, and among them, DIPSI16 and MLEV based schemes17 are the two most used homonuclear sequences. We have chosen the DIPSI-218 sequence for TOCSY transfer. Among the MLEV sequences,19,20 MLEV-1721 is the most common one with delays inserted21 into the primitive elements22 of the mixing sequence to suppress the cross-relaxation effects. DIPSI sequences have been demonstrated to provide better coherence transfer than the MLEV-17 sequence.18 In one theoretical study, it was observed that DIPSI-sequences achieve better coherence-transfer efficiency than the MLEV in the absence of relaxation effects for two-spin systems.23 Later experimentally also it was observed than DIPSI-2 leads to more rapid and complete cross polarization for a given radiofrequency power.16 This was valid for longitudinal or transverse magnetization. DIPSI-2 could induce magnetization transfer with a minimum phase cycle step and can give absorption-mode 2D correlation spectra.16
For small molecules, a normally homonuclear 2D version of TOCSY is used, while for large macromolecules, heteronuclear (15N/13C) edited 3D is implemented. The 1D version of the homonuclear version using selective RF pulses has been reported.24 The most obvious advantage is selective irradiation; spectral crowding is relieved as the magnetization transfer is restricted within the spin system of the selected resonance. The TOCSY mixing time determines the degree of magnetization transfer through the spin network. As a result, the 1D spectrum contains only the closed spin network peaks exclusive of the excited resonance offset. There is a high chance of exciting multiple spins in highly overlapped spectral regions, and in that case, multiple spin networks are observed. However, the spectra assignment is feasible because the peak pattern and the intensity ratio of a given spin network can be used from any compound library database. The advantage is that even if some overlap is present, the situation is still much simpler than with non-selective spectra. Hence the search window becomes smaller and unambiguous assignment becomes feasible. As a result, selective TOCSY experiments have been implemented in mixture analysis and metabolomics.25–27 Various modifications have been proposed to the selective TOCSY experiment for better selectivity, magnetization transfer, and artifact-free spectra generation. The mixing time has been varied, and from that, modulations of intensities have been employed for molecular structure determination in TOCSY optimized mixture elucidation (TOOMIXED).28 We have combined the previous developments here. 1D selective 1H TOCSY employs a selective 180° shape pulse (Reburp)29 sandwiched between two z gradients to achieve selective excitation of any peak of the desired target molecule. Subsequently, the TOCSY spinlock is applied with a z-filter. For spinlock, DIPSI was used. The shape pulse was flanked by Gaussian-shaped pulsed z-field gradients of 1 ms duration each. A Reburp-shaped selective 180° inversion pulse with durations of 300 ms was the most effective for selective excitations among most overlapped regions. We also used 80 ms Reburp; however, in highly overlapped methyl regions, the 300 ms selective pulse worked better for bovine serum (Fig. S1, ESI†). The Reburp was chosen for its better off-resonance effect and higher sensitivity30,31 compared to the normally used Gauss pulse (Fig. S2, ESI†). It is worth mentioning that as we chose a more selective pulse, the sensitivity decreased (Fig. S3, ESI†). However, for a one-dimensional experiment, this does not matter. We get good sensitivity even with a 300 ms selective pulse with recycle delay of 5 seconds in 2 minutes (Fig. S4, ESI†). Since selective 1D 1H TOCSY is generally used for targeted analysis, it is more important to achieve clean spectra for the chosen metabolites. The optimal TOCSY mixing time was found to be 80 ms. Simultaneously, a swept-frequency adiabatic chirp 180° pulse32 and the low gradient were applied before and after spinlock for suppressing zero quantum coherences.33 Because of the applied gradient along the z-axis, zero quantum coherences evolve and refocus at 2αTf where α varies between 0 and 1 along the z-axis of the tube height, and Tf is the duration of the adiabatic pulse. For the time (1 − 2α)Tf, zero quantum coherences evolve and acquire different phases along the tube and get canceled. Subsequently, the magnetization is brought into the transverse plane by applying a 90° pulse, and the CPMG spin-echo sequence is employed for the T2 editing of large bio macromolecules. The CPMG spin-echo sequence (τ–180°–τ) consists of spin-echo time (τ, half of the delay between two successive 180° pulses), one hard 180° pulse, and several repetitive loops (n). We kept the spin-echo time as 1 ms. A total of 120 ms T2-relaxation delay is sufficient to attenuate the resonances from large macromolecules and small molecules bound to them due to large differences in their respective tumbling times. It is crucial to note that keeping a longer relaxation time decreases the sensitivity and the appropriate spacing between the refocusing pulses necessary for probe safety and to avoid sample heating. For a selective 1D 1H experiment, the sensitivity is not a problem as this is less time-consuming, and a longer T2-filter of 240 ms can easily be used. Usage of CPMG is especially required for less selective pulses like the conventionally used 40 ms Reburp (Fig. S5, ESI†). However, for multidimensional experiments, the experimental time increases, and appropriate optimization should be taken for minimal T2-filter and selective pulse duration.
We implemented 1D selective 1H CPMG TOCSY on bovine serum samples dissolved in D2O. As a demonstrative example, we chose the methyl region, where a broad lipid peak and protein resonance dominates, severely crowded sugar regions, and one less crowded region (marked as a–c in Fig. 2, respectively). We could filter out the distinct signals of valine, glucose, and succinate from these regions, as shown in Fig. 2. The corresponding offsets are mentioned within the panel (a1–c1). The chemical shifts along with peak pattern and peak intensities can easily be matched in the standard metabolite library for each candidate. We found excellent agreement with the metabolite libraries provided by Chenomx,34 as shown in Fig. S6 (ESI†). For the highly congested sugar regions, where chemical shifts differ very minutely, the chances of exciting multiple resonances are very high. For example, when the offset was chosen at 3.2701 ppm for one of the glucose peaks, the resultant spectra did not match the library's standard glucose peak patterns, however, shifting the offset to 3.4233 ppm unambiguously assigned the resonances of glucose (see Fig. S7, ESI†). In the less crowded region, things were more straightforward, as depicted for succinate resonance. In fact, we need to find one isolated peak for the given compound. For example, the offset at 1.045 ppm gives a clear pattern of valine only. However, we find that shifting the offset to 3.623 or 2.275 ppm excites multiple peaks. Still, based on the TOCSY pattern, these chemical shifts can be confirmed (Fig. S8, ESI†). Things are much simpler for isolated resonances like lactate, where the shifting of the offset between 4.119 and 1.333 ppm yields identical spectra as here both the peaks are in the isolated region. But the advantage is, still, one can assign a peak in the crowded region based on the TOSCY pattern. Additionally, adding CPMG helped in the complete removal of broad peaks of macromolecules. This technique retained all the advantages of selective TOCSY and proved highly robust in all the scenarios demonstrated above.
Our main aim was to provide a complete set of experiments for an unambiguous assignment for targeted and non-targeted analysis. For the second case, it is desirable to observe all the resonances in a single spectrum. In that case, in the one-dimensional NMR of complex biofluids, the situation becomes limited due to signal overlap from various metabolites. A computational approach was proposed from an analysis perspective where spectral deconvolution fits individual peaks in crowded spectra.35 Commercially available software like Chenomx34 also provides the option to fit the metabolites’ peaks in the libraries to adjust the effects of temperature and pH on chemical shifts. However, from the data acquisition approach, using multidimensional NMR techniques is most suitable as the signal could be spread in two to three dimensions, which increases the specificity of the signals. Since biological samples of metabolomics rely on natural abundance, 2D NMR is feasible here.36 This provides the opportunity to use a nucleus other than 1H, with hetero-nuclei like 13C being the most obvious alternative. Here we implanted CPMG on both non-selective homonuclear and heteronuclear TOCSY. On the homonuclear 2D 1H–1H CPMG TOCSY, data were collected in a phase-sensitive manner with coherence selection achieved using an Echo/Antiecho gradient pair.37 The water suppression was achieved using a 3-9-19 pulse sequence along with the gradient pair before detection.38,39 We also employed excitation sculpting for water suppression. 3-9-19 methods work better close to the water signal at 4.5 ppm. However, some off-resonance peaks around 2.2 ppm showed better sensitivity upon excitation sculpting (Fig. S9, ESI†).
Additionally, pre-saturation of the water signal was employed during the relaxation delay (Fig. 1b). T2 editing of large bio-macromolecules was achieved by applying a CPMG spin-echo sequence directly after the first pulse, bringing magnetization in the transverse plane. As discussed earlier, the optimized value of 120 ms T2 delay along 80 ms mixing time was used for optimal sensitivity. Fig. S5 (ESI†) demonstrates that CPMG TOCSY retains the same peaks as 2D TOCSY. The overlay figure shows that the broad signal from lipid and protein molecules gets reduced in the highlighted regions. Overall the spectra become much cleaner, as evident from the highlighted region of the overlay of the trace taken at 3.8 ppm (Fig. S10(d), ESI†), and signatures from metabolites become more prominent. Since moving to multidimensional NMR increases the measurement time, certain methodological developments have been proposed. Here we employed non-uniform sampling, which yielded excellent resolution (Fig. S11, ESI†) in half of the measurement time.40 In fact, we can go down easily to 25% NUS as most of the peaks are retained until that point (Fig. S12, ESI†). Below this, with 12.5% data, some artefacts start coming (Fig. S12, ESI†). Earlier selective optimized flip angle short transient (SOFAST) techniques have been implemented for experimental time reduction.36 However, for the non-targeted analysis TOCSY experiments performed here, this was not an option. We also achieved 13C based 2D 13C–1H CPMG HSQC TOCSY, which provides excellent carbon resolution in the indirect dimension, as evident from Fig. S13 (ESI†).
We provide a complete set of experiments, which are CPMG and TOCSY based for both targeted and non-targeted analysis. The main aim was unambiguous identification and metabolic profiling. We combined the advantage of selective excitation for targeted analysis, multidimensional resolution for both proton and carbon, along with CPMG and TOCSY. We implemented TOCSY on HSQC, the most sensitive heteronuclear 2D experiment for metabolomics, along with CPMG to retain the benefit of enhanced carbon resolution. The experimental data obtained using this technique leave all opportunities open for advanced pattern matching algorithms for assignment. To this end, Chenomx,34 Amix, metaboanalyst,41 or COLMAR web server42 could easily be used individually or in combination depending upon the requirements. The peak list of both proton and carbon can narrow down the search criteria in the available standard databases, such as HMDB, BMRB, or MMCD.
Recent reports suggest that selective TOCSY is quantitative26 and even more accurate because the selective experiment releases the spectral crowding.26 Metabolic profiling commonly employs the quantification of any species based on the precise evaluation of signal integrals. The selective TOCSY experiment detects a single-spin system from the highly congested spectra of biofluid. As a consequence, the assessment of signal integral becomes more accurate. Selective TOCSY has previously been employed to obtain correct concentrations for a set of metabolites over a complete dynamic range in honey samples.26 The absolute concentration determinations can be achieved by calibrating the TOCSY peak intensities using an external standard. A similar approach is used in 2D 1H–1H CPMG COSY also for BCAAs.2 Based on these observations, we expect a positive quantitative ability for this minimal set of experiments.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d1mo00118c |
This journal is © The Royal Society of Chemistry 2021 |