M.
Mayzel
a,
K.
Kazimierczuk
b and
V. Yu.
Orekhov
*a
aSwedish NMR Centre, University of Gothenburg, Box 465, S-405 30 Göteborg, Sweden. E-mail: vladislav.orekhov@nmr.gu.se
bCentre of New Technologies, University of Warsaw, Banacha 2C, 02-097, Warsaw, Poland
First published on 23rd June 2014
Non-uniform sampling offers a dramatic increase in the power and efficiency of magnetic resonance techniques in chemistry, molecular structural biology, and other fields. Here we show that use of the causality property of an NMR signal is a general approach for major reduction of measuring time and quality improvement of the sparsely detected spectra.
It is well known that the Fourier transform of a causal time signal S(t) leads to a spectrum, whose real and imaginary parts can be produced from each other using the Kramers–Kronig relations also known as the Hilbert transform.10 The Kramers–Kronig relations are illustrated in Fig. 1. Signal SFID(t) (Fig. 1a) and the corresponding spectrum in Fig. 1b are related via the Fourier transform. The spectrum in Fig. 1d is produced from the one in Fig. 1b by zeroing its imaginary part. The inverse Fourier transform of the real spectrum in panel d gives a complex time domain signal (Fig. 1c), whose real and imaginary parts are essentially even and odd parts of the real and imaginary components of the FID (Fig. 1a), respectively. Thus, the signal in Fig. 1c can also be produced by the time reversal and complex conjugate of the FID.
(1) |
In the following, we call the SVE(t) signal in eqn (1) virtual-echo (VE). The original signal SFID(t) can be obtained from SVE(t) by zeroing the signal for negative time. Direct transition from panel d to panel b in Fig. 1 is done by the Hilbert transform. In practice, the Hilbert transform algorithm takes the detour d → c → a → b (Fig. 1) in order to use the computationally efficient fast Fourier transform.
The spectrum (Fig. 1d) obtained from the VE representation (Fig. 1c) consists of the traditionally looking real part and zero imaginary part. Depending on the signal phase, the real part can contain absorption, dispersion, or a mixture of the both modes. Given a priori, the phase, eqn (1) allows us to obtain the time domain signal corresponding to the pure absorption spectrum and, thus, to construct a sparsifying transform that produces a significantly darker spectrum than the traditional Fourier transform of the original FID.
Obtaining NMR spectrum from a time-domain signal is a typical example of the mathematical inverse problem. When all data points in the signal are present, the solution of the problem is trivial and is given by the Discrete Fourier Transform (DFT). In the case of NUS, most of the data in the time-domain signal are missing and the unconstrained inverse problem has an infinite number of solutions (i.e. spectra). A unique and “correct” spectrum is obtained by introducing additional assumptions such as minimal power, maximum entropy, maximal sparseness, etc. The VE presentation is equally applicable to traditional fully sampled and NUS signals. When the former is processed using DFT, FID and VE presentations lead to the equivalent spectra as illustrated in Fig. 1. However, when reconstructing spectra from the NUS signal and in some other cases,11 use of the Kramers–Kronig relations, namely path a → c → d in Fig. 1, represents a significant advantage over the traditional processing, which is a → b → d.
Fig. 2 demonstrates the benefits of the VE signal for two modern spectra recovering algorithms used for the NUS signal: spectroscopy by Integration of Frequency and Time Domain (SIFT)9 and Compressed Sensing by Iterative Reweighted Least Squares (CS-IRLS).4,12 Similar results for the alternative CS algorithm, Iterative Soft Thresholding (CS-IST),4,13,14 are presented in Fig. S3 (ESI†). Both CS algorithms and SIFT can be applied without modifications to either the traditional FID or VE signal. With SIFT making use of the prior knowledge about positions of dark regions in a spectrum and CS searching for the darkest among all possible spectra consistent with the measured data, both methods are expected to benefit from the darker representation of the spectrum provided by VE.
For a given number of NUS measurements, quality of the SIFT reconstruction improves, when the larger fraction of the spectrum area is free from signals and contains only the baseline noise. In our calculations, the signal-free area is defined by a mask, which excludes rectangles of defined size around all peaks in the spectrum. This corresponds, for example, to a set-up in relaxation and kinetics studies,15 where the peak positions are known and only their intensities or integrals need to be defined. Fig. 2a and b show reconstructions of a 2D 1H–15N HSQC spectrum of human alpha-synuclein obtained using only 15% of the data from the full experiment.
By avoiding broad dispersion peaks, the VE signal ensures that a larger fraction of the spectrum is “dark” and thus SIFT produces a much better spectrum (Fig. 2b and Fig. S4, ESI†) and more accurate peak intensities in comparison to the reconstruction from the original FID (Fig. 2e and Fig. S5, ESI†). Fig. 2e (inset) illustrates that prior information about the signal phase does not have to be exact. For the SIFT example, the peak intensities in the VE reconstruction obtained for the uncorrected up to 15° phase are still better reproduced than those measured in the spectrum calculated for the traditional FID representation. A similar behaviour is also observed for the CS algorithms. For most of the multidimensional experiments, zero order phases for the indirect spectral dimensions are known and thus can be corrected in the time domain to values close to zero prior to the spectrum reconstruction.
Similarly to SIFT, CS also assumes that the major part of a spectrum is dark. However, no assumption is made about the exact location of the dark regions, which creates an apparently unsolvable combinatorial problem. Yet, it has been recently reformulated as a relatively simple task of spectral lp-norm (0 < p ≤ 1) minimization:16
(2) |
|F|lp = (|F1|p + |F2|p + ⋯ + |FN|p)1/p | (3) |
In the present paper p = 1 is used for the IST algorithm13 and lp-norm with p iteratively approaching 0 for the IRLS algorithm.4,17 The use of the CS method in NMR spectroscopy has been commented recently by many authors,4,5,18,19 with important conclusions on the limited applicability to non-random sampling20 and superior performance of non-convex lp-norms (p < 1).19,21
Here we apply the CS IRLS algorithm4 to reconstruct a 3D HNCO spectrum sampled at the level of 0.7%, without VE (Fig. 2c) and with VE in both indirect dimensions (Fig. 2d). It can be seen that VE improves the reconstruction significantly by providing better line shapes, more accurate peak intensities (Fig. 2f), and revealing low intensity signals. Fig. S3 (ESI†) shows a notable improvement for the 2D 1H–15N HSQC spectrum of intrinsically disordered protein alpha-synuclein processed with CS-IST.
The effect can be explained using the basic CS theorem, binding the number of properly reconstructed spectral points, which is essentially a measure of spectrum darkness, with the sampling level.16 With the VE, fewer points contribute to each peak in the spectrum and thus relatively low sampling level is sufficient to fulfil the condition for the successful CS reconstruction. It should be emphasized that the striking advantage of the VE demonstrated in Fig. 2 and Fig. S3–S5 (ESI†) is mostly due to the very low sampling level. Without the VE, high quality reconstructions by CS and SIFT are also possible, but require at least twice as many sampling points for the presented spectra (inset in Fig. 2f and Fig. S4, ESI†).
As pointed out by Donoho et al.,8 there is an unambiguous relationship between the darkness of the NMR spectrum and the quality of the spectral reconstruction by the maximum entropy or minimum l1-norm minimisation. It is therefore likely that most of the related methods including FM-reconstruction,22 MINT,6 hmsIST,14 QME,7etc. will also benefit from the VE signal.
We show that the causality property of the NMR signal can be exploited to dramatically enhance the performance of the CS, SIFT and probably many other algorithms commonly used for the reconstruction of NUS spectra. Our findings open a way for significant reduction in measurement time and improvement of the quality of NUS spectra and thus should increase the power and appeal of multidimensional NMR spectroscopy in multitude of its existing and future applications. The method is particularly useful for short living systems, time resolved measurements, and high-dimensional experiments on intrinsically disordered proteins.
The work was supported by the Swedish Research Council (research grant 2011-5994); Swedish National Infrastructure for Computing (grant SNIC 001/12-271); Polish National Centre of Science (grant DEC-2012/07/E/ST4/01386); Polish Ministry of Science and Higher Education (grant IP2011 023171); and Foundation for Polish Science, TEAM programme. We thank Dina Katabi and Haitham Hassanieh (Dept Electr Eng & Comput. Sci., Massachusetts Institute of Technology) for an inspiring discussion and Anna Zawadzka-Kazimierczuk (Biological and Chemical Research Centre, University of Warsaw established from EU Regional Development Fund) for the HSQC spectrum of alpha-synuclein; The EU FP7 Bio-NMR project (contract 261863); The Knut and Alice Wallenberg foundation project NMR for Life.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c4cc03047h |
This journal is © The Royal Society of Chemistry 2014 |