Miroslav
Kloz
*ab,
Jörn
Weißenborn
a,
Tomáš
Polívka
c,
Harry A.
Frank
d and
John T. M.
Kennis
a
aDepartment of Physics and Astronomy, VU University Amsterdam, De Boelelaan 1081, 1081HV Amsterdam, The Netherlands. E-mail: miroslav.kloz@eli-beams.eu
bELI-Beamlines, Institute of Physics, Na Slovance 2, 182 21 Praha 8, Czech Republic
cUniversity of South Bohemia, Branišovská 1645/31a, 370 05 České Budějovice, Czech Republic
dDepartment of Chemistry, University of Connecticut, 55 North Eagleville Road, Storrs, CT 06269, USA
First published on 29th April 2016
A new method for recording femtosecond stimulated Raman spectra was developed that dramatically improves and automatizes baseline problems. Instead of using a narrowband Raman source, the experiment is performed using shaping of a broadband source. This allows locking the signal into carefully crafted watermarks that can be recovered from measured data with high fidelity. The approach uses unique properties of Raman scattering, thus allowing a direct recording of stimulated Raman signals with robust rejection of baselines and fixed-pattern-noise. Low cost technology for generating required pulse-shapes was developed and demonstrated. The methodology is applicable to any Raman experiment but primarily targets Femtosecond Stimulated Raman spectroscopy (FSRS) where a lack of robust methods for parasitic signal rejection has been a major obstacle in the practical development of the field in the last decade. The delivered improvement in FSRS experiments was demonstrated by recording evidence that the so-called S* state of carotenoids in solution corresponds to the optically forbidden S1 state of a sparsely populated carotenoid conformation.
The core of the technique relies on the principle that a very complex (rapidly oscillating) pseudorandom function with a vanishing integral (with a mean value close to or equal to zero) manifests a very low level of correlation with any normally encountered function, while being highly correlating with itself, but only in the case of perfect alignment. This allows recording and localizing a very weak watermark on top of a very strong carrier signal, and at the same time converting it into a sharp localized peak by a simple convolution procedure. The resonant spike is a direct product of the watermark amplitude and width, so even a very weak amplitude watermark can be detected given that it is inscribed within a sufficiently wide interval of data. Yet, the efficiency of a single watermark detection is still at a certain point limited by the watermark to carrier signal ratio, as the watermark can interfere with the carrier signal in an unpredictable way (Fig. 1A). A typical (continuous) baseline function has a very low level of correlation with a rapidly oscillating pseudorandom wavelet, however a certain correlation is in practice always present (i.e., a perfect non-correlation is equally improbable as a perfect correlation).
In Fig. 1A we can see that convolution of the signal (with an inscribed weak watermark) by the model watermark removes the carrier signal and produces a sharp peak in the center of the watermark location in the data. However, a substantial pseudorandom background around the peak still remains. This pseudorandom noise accompanying the resonant spike originates partly from the interaction between the watermark and the baseline function and partially from the interaction within the watermark itself, as explained below. The watermarking technique is based on statistics and so for a finite width of the watermark, it has only a finite performance. Any pseudorandom wavelet convolutes with an uncorrelated (baseline) function simply into another pseudorandom wavelet (as any function multiplied by a random function turns into a random function). The background is thus suppressed, but not entirely eliminated with the remaining offset turned into a random function. This holds also for the interaction of the watermark with itself in case it is not perfectly aligned. When two identical pseudorandom functions are aligned to each other they obviously perfectly correlate. In this case, their mutual product is a positive-only function and sums to a large positive number, and corresponds to the spike in the convoluted signal. However, their random structure causes that even the smallest mutual shift results in a complete loss of correlation with the level that any different pseudorandom function would manifest. In convolution, one basically performs a sliding scalar product of two vectors while filling zeros on the tail of the convoluting vectors to match the lengths for each shift. There is no correlation within the pseudorandom wavelet itself (that is basically a definition of a random function). The result is that when the watermark in the signal and the model convoluting watermark are just partially overlapped, it does not produce a zero but rather a random number of a magnitude equal to the average amplitude of the pseudorandom wavelet.
The interval where the wavelet in the signal and the model convoluting wavelet have at least partial overlap in convolution corresponds to twice the width of the wavelet and left and right part are mirror images (which does not become entirely clear from Fig. 1A because the interaction of the watermark with the strong carrier function dominates the convoluted signal). The reason is that head-to-tail correlation is identical to the tail-to-head correlation. The resonant spike (produced when the wavelet in the signal and the convoluting valet are perfectly aligned) is thus surrounded by a symmetric interval of a pseudorandom signal that has twice the width of the wavelet. It is shown clearly in Fig. S7 in the ESI† where the carrier signal is on purpose minimized. The pseudorandom tails of the resonant spike do not scale with the width of the watermark while the resonant spike does. They are thus less significant as the watermark gets wider, but never disappear for a single specific finite watermark. For example if the watermark consists of 10 pixels and has an average amplitude of 1, then in the convoluted signal the location of the wavelet will be manifested by a peak of the magnitude of 10, which is in the middle of the 20 pixel wide interval of the symmetric pseudorandom signal with the average amplitude of 1. Importantly, these pseudorandom tails are different for each specific wavelet structure. If we have the possibility to cycle various different pseudorandom watermarks inscribed to an identical location on top of the carrier signal and average the results (Fig. 1B and Fig. S7 in ESI†) we can improve the fidelity and suppress the pseudorandom tails. While the resonant spikes add fully constructively, the tails simply average out. Ultimately 100% fidelity of watermark detection is achieved with unlimited possibility to cycle various watermarks (Fig. 1C).
The carrier signal can amplify the non-resonant tails in case it is substantially stronger than the watermark (as it is the case in Fig. 1 but not in Fig. S7, ESI†), but they remain random. Thus, the premise still holds that when various watermarks are cycled, the tails are gradually averaged out. The width of the watermark and the ratio between the baseline and the watermark amplitude influence the efficiency of watermark detection. However, if the possibility exists to cycle watermarks in an unrestricted way any baseline can be rejected. This is the key property of watermarking as described above: it draws absolutely no assumption with respect to the baseline structure.
We recognized an important analogy between digital watermarking and the Raman experiment. In both cases the goal is to detect a fine structure on top of a strong, broadband and generally unspecified background. In the Raman experiment, fluorescence or stimulated emission can be treated as the carrier signal while the Raman signal itself can be seen as a watermark. The key problem in current Raman spectroscopy is that it depends on employing a spectrally-narrow Raman excitation that leads to a single specific manifestation of the Raman signal. The advantage of such an approach is that the recorded spectrogram represents a direct image of the vibrational spectra. Nowadays, however, in the era of digitalized detection, this benefit dropped in importance. Implicit data can be automatically converted to an explicit signal provided that the correct routine exists. The traditional approach of accumulating the signal by repeating the same experiment with a fixed narrow Raman excitation leads to the improvement of the signal, but the background is constantly accumulated as well. If we could instead accumulate the data by cycling the Raman signal as pseudorandom watermarks, and use the high fidelity of pseudorandom watermark recovery, we can in principle accumulate only the desired signal (as in Fig. 1C). This paper shows that especially for femtosecond stimulated Raman experiments this is indeed possible. If we watermark the pulses used to generate the Raman signal (Raman pulse-pump: “Rp”) we can directly watermark the Raman signal as well. This is illustrated in Fig. 2.
Raman peaks in the Stokes region are in fact red-shifted replicas of the spectral envelope of the Raman pulse, convoluted by the intrinsic bandwidth of the given Raman transition. When the Rp pulse has a fine spectral structure (but broader than the intrinsic width of the Raman transition), this structure will be reproduced for each Raman active transition. Undesired non-Raman contributions to the signal (e.g. fluorescence) might to some extent depend on the spectral shape of the pulse, but they are not manifested as a direct replica of the complex pulse watermark. Fine spectral watermarking of the Rp pulse thus watermarks the Raman signal but not the non-Raman contributions.
A drawback of the basic approach shown in Fig. 2 is the impossibility to create positive and negative signals at the same time, since Raman scattering is always signal gain (or always signal loss as in the case of anti-Stokes stimulated Raman loss11). Inscribing a signal into a watermark of an unrestricted shape requires access to predefined positive and negative features at the same time. This can be solved by inscribing the watermark in two stages as a difference between two complementary Raman experiments. This is illustrated in Fig. 3.
The procedure is the following: a pseudorandom watermark W0 with a zero integral is generated and divided into negative (W−) and positive (W+) parts. Two separate Raman experiments, one with Rp shape W− and the other with W+ shape, are performed. The difference between these two signals is then the W0 watermarked signal. Because the overall intensity and central wavelength of the W− and W+ are identical, the non-Raman contributions to the signal should be nearly identical for both experiments. As a result, the differential watermarking does not only allow accessing arbitrary watermark structures, but it also greatly suppresses the carrier signal and fixed pattern noise. Note that this method preserves both the sign and peak shapes of a particular Raman spectrum, so when recorded structures are for instance dispersive, this is not going to be lost by watermarking. The differential watermarking is in fact a merger of watermarking and the wavelength modulation technique proposed for Raman experiments in 2011.8 The main advantage of watermarking in comparison with the 2011 method is a substantially higher overall robustness but also more specifically robustness in capturing broadband and narrowband signals at the same time.
The watermarking approach in δ-representation is effectively a procedure how to process a cluster of traditional Raman experiments. The only condition is that these experiments have to be performed with different wavelengths of the Raman pulse (pump) (Rp). The crucial advantage is then naturally that experimentally it does not bring any alteration to the traditional FSRS experiment. Any theory and assumption valid to normal FSRS experiments is thus valid also for watermark-processed signals. A disadvantage is that it does not bring increased efficiency to the Raman pulse generation. However, in a research environment where the optical input pulse power is typically not the limiting factor we believe it is the best choice to harness the watermarking approach.
The procedure consists of four elementary steps: (1) shifted signal manifold recording, (2) scrambling signal into watermarks, (3) convolution of the scrambled signal by the ideal watermark forms, (4) summation of the convoluted signals. These are going to be described below:
![]() | ||
Fig. 4 Watermarking in δ-representation: a special chopper blade is designed to record several shifted Raman spectra both in the ground state and the excited state. |
The result of this experimental arrangement is 14 Raman signals slightly shifted to each other based on the dispersion of the bandwidth in the pulse shaper and on the separation of the slits on the blade (in our blade slit shifts for one millimeter for each cluster). The interval of spectral modulation in our set-up was ∼15 cm−1. This was chosen as the spectral bandwidth of the used Rp pulse. When the modulation interval is comparable with the effective resolution of the experiment, the watermarking approach is believed to be the most efficient. A too weak modulation decreases the power of the method (reduces the efficiency of baseline rejection) and a too strong modulation means that the collected manifold of signals does no longer form a full basis on a given interval; that means we do not fully sample the watermarking interval. In the latter configuration, we recorded artifacts such as interference patterns between neighboring peaks and the method can lead to spurious peak-shapes. In general, the minimal interval of the wavelength/wavenumber modulation (Δν) should not be larger than the spectral resolution the experiment targets.
The signal scrambled into watermarks is created by multiplying each of the 14 signals by a chosen pseudorandom number and summing them up. For the success of the method it is essential that the created watermark has a vanishing integral, which means that the 14 applied pseudorandom numbers have to sum into zero. The operation can be conveniently performed by matrix multiplication. When a set of pseudorandom numbers defining the watermark is vector “W” and the recorded manifold of signals is matrix “Dr” (“D” stand for “data”, “r” stand for “real”) then the signal scrambled into watermark “S” is simply:
W(1,n) × Dr(n,p) = S(1,p) | (1.1) |
The second element needed is the model watermark “M”. This is constructed in the same way as the scrambled signal:
W(1,n) × Dm(n,m) = M(1,m) | (1.2) |
S * M = filtered signal | (1.3) |
The entire process is summarized and visualized in Fig. 5.
![]() | ||
Fig. 5 The shifted Raman spectra are fed into a watermark filter that numerically creates a pseudorandom watermarked signal by a linear superposition of measured signals and detects the watermarks by convolution of the signal with the model watermark. For simplicity of the scheme the displayed model watermark M consists just out of 3 elements instead of 14 actually generated by the blade displayed in Fig. 4. |
The filter has two inputs: a manifold of the shifted signal (for simplicity illustrated on 3 signals instead of the used 14) and information about shifts between the signals in the manifold. The output is a vector of filtered data. When the manifold of signal enters the filter, a set of random numbers is generated that scrambles the manifold into a single signal. The same is done for the idealized signal that is based on known spectral shifts of the recorded manifold. The real signal locked in the watermark and the idealized watermark are then convoluted, resulting in the extraction of the watermark. This is repeated and averaged for a number of watermarks until it converges to a stable solution.
The second issue is the alteration of the peak shapes. If the watermark (pseudorandom function) is artificially shaped to have a precisely vanishing integral, it means that an intrinsic correlation within the watermark is introduced. If one part of the watermark is positive, the other part is forced to compensate for it. As a result, the convolution of the signal by the watermark with a vanishing integral forces also the convoluted signal to have vanishing integral. All recovered Raman peaks are then accompanied by a proportional amount of negative signal on the sides of the peak following the width of the watermark, giving the peaks a characteristic sombrero shape (not to be confused with the precisely defined “Mexican hat wavelet” or “Ricker wavelet” which is identical to the second derivative of the Gaussian function). The ratio between the peak amplitude and the amplitude of the negative contribution (a precise shape of the “sombrero”) is determined by the ratio between the width of the resolved peaks and the width of the watermark. From this we can conclude that the spectral width of the watermark needs to be substantially broader than the broadest Raman structures we want to resolve. If the watermark is much broader than the embedded peak, the effect is negligible (100:
1 in the case of Fig. 1C; there is no noticeable negative signal compensating the positive peak). In the case of the data presented in Fig. 5 and 6, the watermark is about 7× broader than the resolved peaks, making the negative contributions insignificant in terms of data interpretation, even though they are detectable (see positive tails of the negative CS2 solvent peak bleach around 650 cm−1 in Fig. 5). It must be stressed that this effect is associated with the resolved peaks themselves and not with the baseline. Thus, if the Raman signal gets weaker, this effect becomes also proportionally weaker. The effect is constant for a given experimental realization and no spurious peaks arise this way. The sombrero peak structure can be avoided simply by not using watermarks artificially tailored to have a perfectly vanishing integral. However, the baseline suppression is then compromised. Another alternative is to work with even broader watermarks.
The third issue to discuss is the standard “white noise” (frequency-unspecific noise). In watermarking, random noise appears in exactly the same way as in a traditional FSRS experiment. Convolution of the watermark with another random function simply blurs the watermark and decreases the contrast of recovered peaks. This is natural since watermarking can separate desired from undesired signals but obviously cannot spell a signal out of nothing. Watermarking, however, suppresses any frequency-specific or constant noise (e.g. periodic oscillations in the signal or a fixed pattern noise). In our experience, the frequency-specific noise and the fixed pattern noise typically dominate the genuine white noise, especially in thoroughly averaged data. This causes watermarking to effectively appear to reduce the noise in comparison to an equivalently long traditional FSRS experiment, however it does not alter the fundamental signal-to-noise ratio in the raw signal in any direction. It only conveniently automatically suppresses unwanted signals that manifest some regularity.
Let us first consider the signal improvements from our proposed method. Fig. 6 shows the watermarked spectra (magenta lines) of spirilloxanthin as compared to raw, unprocessed FSRS spectra (cyan and orange lines) at a time delay of 1 ps. The left panel shows the signals in absolute values, while in the right panel a constant offset was subtracted from the raw FSRS spectra. We observe that the watermarked signal is essentially baseline free, whereas the raw FSRS spectra show Raman features on a steeply sloping baseline signal that has an overall larger amplitude than the Raman signals. In particular, below we will focus on the region around 1770 cm−1, which in the watermarked spectrum shows a small but important positive band and corresponds to an upshifted CC stretch of spirilloxanthin. In the raw FSRS spectra, this signal cannot be discerned well as a result of the steeply sloping baseline and the higher noise. We draw two conclusions:
(i) generation of watermarked signals by the approach described in Fig. 3–5 removes the background with great robustness. Moreover, resolving the Raman peaks using convolution by the inscribed watermark with vanishing integral perfectly removes any offset broader than the wavelet. As a result, two independent consecutive mechanisms contribute to the offset removal. One originates from using watermarking as such and the other from the differential approach that we use to lock the Raman signals to the watermark.
(ii) the fixed-pattern-noise and normal time-dependent noise in the signal are greatly suppressed. This is again a result of two independent processes: first, the fixed pattern noise affects both |W+| and |W−| signals comparably, and their subtraction thus implies fixed-pattern noise suppression. Second, the watermarking approach delocalizes the detection over an interval of pixels (the entire watermark carries information about the Raman mode spectral location). In effect, any residual fixed pattern noise, possibly even dead pixels in the detector plus the time-dependent noise, is averaged over all pixels spanning the spectral interval of the watermark. As a result, if needed, the signal is smoother and can be averaged for a much longer period prior to reaching boundaries set by systematic artifacts. We recorded transient Raman signals with sensitivity up to 10−6 (∼1 μOD) based only on 1.5 seconds of effective averaging (Fig. S4 in the ESI†).
We now consider the issue of the S* state in carotenoids and demonstrate how our new method is instrumental in its solution. Carotenoids are known to have a specific response to photoexcitation: from the ground state they are excited to the S2 state that decays within hundreds of femtoseconds to a weakly radiative S1 state, which is inaccessible from the ground state because the S0 → S1 transition is symmetry-forbidden.13,17 For carotenoids with long π-electron conjugated chains, a clear signal, decaying slower than the S1 state, has been reported.13,14,16,18,19 The origin of this signal, usually denoted as the S* signal or the S* state, remains a subject of a considerable debate;20–22 proposals include it being either a separate excited state,14,15,19,21,23 a hot ground state24 or a Bu− state.14 A crucial observation from ultrafast Raman spectra of carotenoids is the CC stretch that is strongly upshifted in the S1 excited state (in the case of β-carotene from 1526 cm−1 in the ground state to 1778 cm−1 in the S1 state) due to vibrational coupling between the S0 and S1 states.25,26 The vibrational signature of the S* state has not been recorded so far, but due to the large difference between ground and excited state frequencies of the C
C stretch, the FSRS experiment has the potential to provide a rigorous test whether the S* signal originates from the ground or the excited state.25–29 Here we compare FSRS spectra of β-carotene (where the S* signal is very weak or absent, at least upon excitation at the red side of the absorption band) and spirilloxanthin (where the S* state signature is well-documented from femtosecond transient absorption experiments) in CS2 as a 3.2 ps component evolving independently of the S1 state that has a lifetime of 1.5 ps.14,18 High fidelity and improved sensitivity of the watermark-resolved FSRS allowed us to record the S* signature in spirilloxanthin.
Complete datasets are displayed in the ESI† for inspection in Fig. S2 and S3. Fig. 7 presents a zoom into the region where major S1 and S* signatures are manifested, for β-carotene in the left panel and spirilloxanthin in the right panel. The S1 state signature of spirilloxanthin can be seen to shift within the first two picoseconds from 1743 cm−1 to about 1771 cm−1 where it almost entirely decays within 10 ps. No shift of this kind can be observed for β-carotene measured under exactly the same experimental conditions whose S1 CC stretch is at 1778 cm−1 regardless of the delay time, and it decays significantly slower. Thus, the 1771 cm−1 feature of spirilloxanthin is explained as due to the C
C stretch in the S* state which decays slower (∼3 ps) than the S1 C
C stretch at 1743 cm−1. Note that the observed dynamics (dynamic up-shift of the C
C vibrational band) cannot be explained as arising from vibrational cooling in the spirilloxanthin S1 state: it is well established that its S1 state depopulates with a time constant of 1.5 ps,14,18 therefore the 3 ps phase must represent a distinct other electronically excited state.
There is a universal relation between the length of the conjugated backbone and the properties of the S1 state of carotenoids.30 As the backbone gets shorter, the S1 state energy increases, its lifetime is prolonged and the CC ground state vibrational frequency becomes larger13,31 (see Fig. S1, ESI†). The combination of a prolonged lifetime and a larger C
C frequency of the S* signal compared to the S1 state (Fig. 6) thus implies that the S* signal must have an origin in an S1-like excited state associated with a carotenoid conformation having a shorter effective conjugation length. Such an observation of the S* state signature rules out the assignment of the S* signal to a hot ground state. Instead, it supports the interpretation of the S* state as the S1 state of certain carotenoid conformations that in a small fraction exists already in the ground state.21 This is further supported by the observation of a high energy shoulder of the spirilloxanthin C
C ground state peak that can be clearly seen when the ground state C
C band is overlapped with a shifted β-carotene C
C peak (Fig. S6, ESI†); there is asymmetrical broadening towards higher energies which likely corresponds to the presence of a ground state population with shorter effective conjugation. All these results demonstrate greater resolving power of FSRS spectra recorded by the spectral watermarking approach and strongly suggest that S* phenomena in free carotenoids correspond to particular S1 states of less populated conformations with a shorter effective conjugation length.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c6cp01464j |
‡ Alternative would be constructing its own convolution matrix that would include the nonlinearity of the Raman shift with respect to the spectral sampling interval. In principle this might be more computationally effective, but associated with pitfalls related to unequal sampling and difficult to understand and program. |
§ Because we effectively investigate 14 different samplings over the same pixel distribution, it can be justified to do slight extrapolation when converting wavelengths into wavenumbers. We used a 1024 pixel detector to cover the interval ∼2000 cm−1, but we project the signal into the axis with intervals of 1 cm−1. Our output processed signal then has roughly twice the resolution than each of 14 raw signals at the camera. We did not observe any artifact associated with this extrapolation. See e.g. Fig. SX (ESI†) for the inspection of smoothness of the data. |
This journal is © the Owner Societies 2016 |