Open Access Article
Fivos
Perakis
a and
Christian
Gutt
b
aDepartment of Physics, AlbaNova University Center, Stockholm University, S-106 91 Stockholm, Sweden. E-mail: f.perakis@fysik.su.se
bDepartment Physik, Universität Siegen, D-57072 Siegen, Germany. E-mail: christian.gutt@uni-siegen.de
First published on 27th August 2020
In this perspective article we highlight research opportunities and challenges in probing structural dynamics of molecular systems using X-ray Photon Correlation Spectroscopy (XPCS). The development of new X-ray sources, such as 4th generation storage rings and X-ray free-electron lasers (XFELs), provides promising new insights into molecular motion. Employing XPCS at these sources allows to capture a very broad range of timescales and lengthscales, spanning from femtoseconds to minutes and atomic scales to the mesoscale. Here, we discuss the scientific questions that can be addressed with these novel tools for two prominent examples: the dynamics of proteins in biomolecular condensates and the dynamics of supercooled water. Finally, we provide practical tips for designing and estimating feasibility of XPCS experiments as well as on detecting and mitigating radiation damage.
The new X-ray sources, such as the 4th generation storage rings and X-ray free-electron lasers (XFELs), opens up new research opportunities for making molecular movies with unprecedented resolution. The new synchrotron sources feature the innovative Multi-Bend Achromat (MBA) design1–4 providing orders of magnitude increase in coherent X-ray flux. The advancing XFELs feature super-conducting accelerator technologies which boost further the brilliance of the current XFEL sources reaching MHz repetition rates.5–8
X-ray Photon Correlation Spectroscopy (XPCS)9–16 is a coherent scattering technique that benefits particularly from this increase in coherent flux holding the potential of capturing structural dynamics at variable length scales, ranging from the nanoscale down to atomic scales.17–20 One distinct advantage of XPCS experiments employing the new X-ray sources is the possibility to study dynamics in transient samples (e.g., deeply supercooled water) which are only stable for a short period of time. Furthermore, XPCS allows to track heterogeneous systems in which the dynamics is evolving in time such as during phase transition phenomena.
Here, we give a perspective on future exciting science questions in connection to biological aqueous solutions that can be addressed with XPCS at the new X-ray sources.21–23 Specifically, we discuss the importance of understanding biomolecular condensates on the nanoscale and the role of the heterogeneities of liquid and supercooled water in biological activity. Furthermore, we highlight the state of the art XPCS capabilities and provide practical considerations for designing XPCS experiments.
Collective dynamics in concentrated protein systems, in particular at non-equilibrium conditions, are of fundamental interest in connection to protein crystallization, protein phase separation and glass transition. However, it is still an open question to which extent these concepts can be applied to describe membrane-less protein compartments in cells.27 Such compartments occur when proteins condense into a dense phase that resembles liquid droplets28,29 which coexist with a dilute phase, a transition termed as liquid–liquid phase separation (LLPS). Biomolecular condensates are omnipresent in cells and are typically investigated in temperature-concentration phase diagrams (Fig. 1a) where the liquid–liquid coexistence curve, or binodal (solid line), denotes the condition at which the two phases may coexist. The spinodal (dashed line) indicates the region of instability in which the system must undergo demixing through spinodal decomposition. Fluctuations become largest near the critical point, denoted in Fig. 1a.
The formed condensates are considered important for concentrating proteins or nucleic acids and facilitating spatiotemporal regulation of cellular function.30 The LLPS may be interrupted by the slow dynamics upon approaching the glass transition line and resulting in an arrested state with a percolating gel network.31 However, so far only a few protein systems have been studied with respect to the kinetics of LLPS and the interplay with glass formation.25,32–35 In addition, very little is known about the dynamics of proteins during the phase transition and upon a deep quench leading to the arrested state,36 as well as aggregate formation (e.g., fibrilles) via protein denaturation.37
Monitoring the dynamics of formation is thus of key importance for advancing our understanding in this field. This in turn requires to resolve an exceptionally large window of time and length scales for which XPCS is ideally suited.
Liquid water is qualitatively different from many other liquids, since for instance water's thermodynamic properties diverge as the liquid is cooled, a behavior often called “anomalous”.41–44 Various thermodynamic scenarios have been proposed to explain these anomalies. One of the scenarios suggests the existence of two phases of liquid water: a high density liquid (HDL) and a low density liquid (LDL),45–47 as depicted in the temperature–pressure phase diagram in Fig. 1b. The HDL liquid is assumed to exhibit a more disordered hydrogen bond network, contrary to the LDL which is assumed to be locally and transiently tetrahedral. The two liquids are hypothesized to be separated by a phase coexistence line, the binodal (solid line) ending in a critical point deeply in the supercooled regime. Simulations indicate that this is indeed the case for many water models,48,49 although it has also been suggested that water can be explained as a single liquid too.50 Even though direct experimental evidence are still not available, there are however increasing experimental indications of the hypothesized critical point, such as the observation of the Widom line in supercooled water,51 which is the extension of the binodal line beyond the critical point (dotted line – Fig. 1b). In this part of the phase diagram, liquid water is depicted as a mixture of fluctuating HDL and LDL-like transient domains, the amount of each population depends on temperature and pressure.
This picture rises important questions such as what is the size and lifetime of these domains? What is the corresponding dynamics of the HDL-like and LDL-like species and what is the HDL–LDL fluctuation time? It has also been extensively debated whether the observed high- and low-density amorphous (HDA and LDA) ice forms are related to two distinct liquid forms through a glass transition.52 Recent dynamic measurements using XPCS indicate that there is a transition between the supercooled HDL and LDL liquids and the corresponding HDA and LDA glassy states in the ultraviscous regime.53 Does this imply that the corresponding dynamics will exhibit a similar behaviour at the high pressure region of the phase diagram?
This deeply supercooled range coincides with a biologically relevant temperature around T = 220 K, termed as the protein dynamic transition.54,55 Below this temperature, many proteins lose their conformational flexibility and biological function. Many studies associate this transition to the intrinsic temperature dependence of the motions of atoms in the protein itself, which experience a cross-over from a harmonic to an anharmonic regime56,57 or cold-denaturation.58,59 Alternatively, it has been proposed that the dynamics are dictated by the hydration water molecules, which is hypothesized to experience a fragile-to-strong transition at this temperature range60 and also coincides with the observation of the Widom line at this temperature range, observed both experimentally in supercooled water,51 and in several water simulations of both pure water48,49 and in protein solutions.61 Is this a coincidence, or does the maximum of water's heterogeneities at biological relevant temperatures signifies causality? Such a connection would directly link properties of supercooled water, which are idiosyncratic and anomalous, to biological function and can fundamentally change the way we view water as a specific requirement for life.
The measured signal in XPCS experiments is the normalized intensity autocorrelation function
![]() | (1) |
sin(2θ/2)/λ being the scattering vector, depending on the wavelength λ and the scattering angle 2θ. A schematic of such experiment is shown in Fig. 2. The time delay between two consecutive time frames is denoted t and 〈⋯〉 is the ensemble average over all equivalent delay times t and pixels within a certain range of the absolute value |
|.
The intermediate scattering function f(Q,t) yields information about collective dynamics and is modelled in an exponential form:
| f(Q,t) = exp[−(Γ(Q)t)α(Q)], | (2) |
An example is shown in Fig. 3a, where we have calculated the intensity autocorrelation function g2(Q,t) for Bovine Serum Albumin (BSA) protein at Q = 0.01 Å−1, assuming simple diffusive dynamics. Being able to measure collective diffusion coefficients D(Q) via Γ(Q) = D(Q)Q2 on molecular length scales is one of the key advantages of XPCS. It is especially important as macromolecular crowding exerts large effects on thermodynamics and kinetic processes64 such as protein stability and enzyme activity. Here, the simultaneous access provided by XPCS to both Γ(Q) and the static density correlation S(Q) yields detailed information about for example electrosteric and solvent-mediated hydrodynamic interactions H(Q).65H(Q) can be accessed via H(Q) = D(Q)S(Q)/D0 with D0 denoting the free diffusion coefficient. Thus, experimental XPCS results can directly be compared to simulation results and help to benchmark properties and importance of, e.g., macromolecular crowding, patchy interactions, dynamic cluster formation.23,25
![]() | ||
| Fig. 3 The intensity autocorrelation function g2(Q,t) calculated for (a) Bovine Serum Albumin (BSA) protein at Q = 0.01 Å−1. The calculation utilizes the Stokes–Einstein relation and assumes diffusive behaviour and the different curves correspond to increasing protein concentration. By changing the Q range one can effectively probe different length scales of protein dynamics, as depicted in the inset. In this case, the dynamics take place from nanoseconds to microseconds. (b) The g2(Q,t) of liquid and supercooled water estimated from the intermediate scattering function obtained by molecular dynamics.62 Here, the momentum transfer Q = 2.0 Å−1 corresponds to atomic length scales and the correlation decay consists of three separate regimes: the early ballistic motion and the later diffusive regime. Between these two regimes appear cage effects, where molecules experience a transient confinement by their neighbors, which become increasingly pronounced upon supercooling. | ||
In XSVS the speckle contrast β thus depends on the exposure time T via67
![]() | (3) |
![]() | (4) |
Thereby, XSVS can map out dynamics on timescales as fast as 100 fs at XFEL sources.10,62,68–70 In addition, XSVS provides a higher signal-to-noise ratio (SNR) than movie mode XPCS71 and is thus ideally suited to probe dynamics with doses well below 1 kGy. It has been demonstrated that XSVS can be performed with very small count rates of 10−2 to 10−3 photons per pixel,72 given that the response function of the detector is well understood.73,74
Under very low count rate conditions the Poisson-distributed discrete nature of the photon detection process has to be taken into account when analyzing the speckle contrast. Here, the probability P of k photons being registered within one speckle is given by a convolution of the gamma and the Poisson distribution. This results in the negative-binomial distribution also known as Poisson–Gamma distribution,
![]() | (5) |
![]() | (6) |
Thus, we can determine information about Γ(Q) by mapping out the number of 2 photon events for example as a function of XFEL pulse duration or as a function of detector exposure time. This is especially relevant for time scales from femtoseconds to nanoseconds which are too fast to be captured by the frame rate of 2d detectors but which will become accessible by split-and-delay line XPCS at FELs or by inter-pulse correlation at the upgraded MBA storage rings.
![]() | (7) |
![]() | (8) |
![]() | ||
| Fig. 4 Two-time correlation function c2(Q,t1,t2) of deeply supercooled water originating from high density amorphous (HDA) ice at T = 120 K, which is near the glass transition temperature of HDA. One can see that the dynamics are highly heterogeneous from the fluctuations along the diagonal. The corresponding dynamic susceptibility determined by c2 shown in the inset, which is a quantitative measure of dynamical heterogeneity (adapted with permission from ref. 53). | ||
The new MBA lattices push the brilliance Br of storage rings by 2–3 orders of magnitude leading to an increase of the coherent X-ray flux Fc as given by
![]() | (9) |
With this increase an up to six orders of magnitude better temporal resolution can be achieved, pushing the temporal window into the regime of microseconds and possibly even faster towards tens of nanoseconds via correlations between single SR pulses. MBA based storage rings also provide smaller source sizes, which allow to use large and coherent beams important for reducing photon density and mitigating beam damage.22
XPCS experiments can also be performed at superconducting XFELs such as the LCLS-II or the European XFEL taking advantage of the MHz repetition rate and the high-pulse brilliance of these sources. By using the split-and-delay technique10,70,81,82 time scales on the order of the pulse duration on the 100 fs scale and below are accessible. In this way XPCS will allow to record movies of molecular motion over a broad range of timescales.
An important prerequisite of XPCS experiments is the availability of fast pixelated X-ray detectors. As the signal to noise ratio (SNR) scales with the speckle contrast small pixels matching the size of the speckles are needed for maximizing the SNR. Typically, these are sizes well below 100 microns. The frame rate needs to be adapted to the time window of interest with the possibility of tracing faster dynamics by speckle visibility techniques. Another important aspect is that for low intensity XPCS the photon statistic needs to be preserved by the detection electronics.73 These are quite high demands on detector technology and additional efforts in detector development are needed to be able to fully exploit the possibility of the new X-ray sources for XPCS.
Recording series of coherent scattering patterns with 4 megapixel and with high frame rates over many days results in very large data sets with sizes easily exceeding tens to hundreds of terabytes. A fast online analysis during the experiment and the in-depth off-line analysis present a challenge due to the shear amount of data.83 Here, advanced analysis schemes such as FPGA based correlators,84 the use of smart data bases including harmonized schemes for meta-data need to be developed as well. Tools such as neural networks and machine learning are promising,85,86 especially for identification of certain events and for model aware analysis of correlation maps. These tools will be of invaluable help for coping with the data avalanche in XPCS.
From a time/lengthscale perspective there is a partial overlap of the two techniques. At synchrotron sources, XPCS features a broad range of “slow” dynamics, ranging from microseconds to hours. At XFELs, the accessible timescale of XPCS or XSVS implementations spans from femtoseconds to microseconds, as we discuss in the following sections. On both cases, by changing the momentum transfer, one can probe variable lengthscales ranging from atomic scales to micrometers. In the case of neutron scattering and spectroscopic techniques, such as neutron spin echo and neutron back scattering, the corresponding time window ranges from picoseconds down to hundreds of nanoseconds, whereas the accessible length scales can access from atomic to nanometric scales.
In addition, neutron techniques usually require larger sample volumes than those used at X-ray measurements, due to the larger beam focus size and low neutron scattering cross-section. This limitation poses a challenge for using rapid supercooling techniques, such as the evaporative cooling,51 which require micron-sized droplets. In addition, due to the extremely long scanning time required when using neutrons, it is difficult to follow heterogeneous dynamics in transient samples, such as e.g., the spinodal decomposition during the liquid–liquid phase separation in biomolecular condensates34 or during the liquid–liquid phase transition in ultraviscous water.53 Similar constrains are posed for measuring very slow dynamics, e.g., when examining samples near the glass transition, where dynamics can vary orders of magnitude within a narrow temperature range.
The SNR for the autocorrelation function g2(Q,t) depends on the average intensity per pixel Ipix the speckle contrast β, the number of pixels Npix, the number of frames Nframes and the number of repetitions Nrepvia87,88
![]() | (10) |
Considering that the number of frames Nframes = T/tfr with tfr being the single-frame exposure time and T being the total accumulated time for Nframes frames, we find in combination with eqn (10) that SNR ∝ Fc × (tfr × T)1/2. This scaling implies that an increase in coherent flux Fc by one order of magnitude gives access to two orders of magnitude faster dynamics for the same SNR.13 For radiation sensitive samples with a maximum tolerable dose one can show that the SNR is modified to SNR ∝ (Fc × tfr × T)1/2 implying that an increase in coherent flux Fc by one order of magnitude gives access to one order of magnitude faster dynamics for the same SNR.22
![]() | (11) |
By the relation D(Q) via Γ(Q) = D(Q)Q2 and utilising the eqn (1) and (2), one can derive the expected g2(Q,t) functions for a given momentum transfer Q, as shown in Fig. 3a for the case of Bovine Serum Albumin (BSA) protein, with hydrodynamic radius RH ≈ 3.5 nm.93 The timescales here reflect the capabilities of European XFEL, which due to the MHz repetition rate allows to reach pulse separation down to 220 ns. The viscosity changes depicted can arise from temperature changes, as the viscosity of the solvent can change by more than one order of magnitude upon supercooling, as well as due to increasing protein concentration. In the case of highly concentrated protein solutions, the dynamics can also deviate from the expected Stokes–Einstein behaviour, due to protein–protein interactions and crowding effects, as discussed in the introduction.
A similar estimation can be performed for liquid water (Fig. 3b) on the basis of molecular dynamic simulations.62 In this case the momentum transfer Q relates to atomic lengthscales and the dynamics exhibit distinct regimes: the sub-100 fs timescales relate to ballistic motion, where molecules move nearly independent of each other, followed by cage effects, which become more pronounced upon supercooling. The diffusive dynamics arise on picosecond timescales, as indicated in the schematic. These timecales can be probed with XPCS with the development of the split-and-delay approach,10,70,81,82 where two X-ray pulses are separated by a tunable time delay, which can be varried from a few fs to hundreds of ps.
![]() | (12) |
the differential cross section and ΔΩpix = (P/L)2 is the solid angle covered by a pixel of size P at a sample-detector distance L. The parameters used here are summarised in Table 1 and were chosen to match typical experimental conditions at the Materials Imaging and Dynamics (MID) instrument of the European XFEL. Here, we set Fc = 108 photons per pulse, in order to minimize any radiation damage effects, discussed in the following section. One can in principle go up to Fc = 3 × 1011 photons per pulse using the pink beam, whereas in the case of using a Si(111) monochromator the flux is estimated to be over Fc ≈ 109 photons per pulse. The pixel size (AGIPD) is P = 200 μm and the sample-detector distance is set at L = 8 m. The sample thickness d = 1 mm corresponds to that of a typical capillary with corresponding transmission Tsample = 0.60155 for water at photon energy E = 10 keV.
| Parameter | Symbol | Value |
|---|---|---|
| Incident flux (attenuated) | F c | 108 photons per pulse |
| Photon energy | E | 10 keV |
| Beam diameter | α | 10 μm |
| Energy resolution | ΔE/E | 10−4 |
| Pixel size | P | 200 μm |
| Sample-detector distance | L | 8 m |
| Sample thickness | d | 1 mm |
| Sample transmission | T sample(E) | 0.60155 |
| BSA absolute intensity | α p | 4.84 m2 kg−1 |
| Concentration | C | 250 mg ml−1 |
| Momentum transfer | Q | 0.01 Å−1 |
| Scattering intensity per pixel | I (Q = 0.01 Å−1) | 0.01 photons per pixel |
The differential cross section per unit volume is defined as:22
![]() | (13) |
2·Δρ2 relates to the molar mass M, specific volume
and scattering contrast Δρ. The estimation is done for a BSA solution with αp = 4.84 m2 kg−1 (see ref. 94) and concentration C = 250 mg ml−1. A spherical form factor P(Q) = |3·J1(QR)/(QR)|2 was chosen with a radius matching the hydrodynamic radius of BSA, where J1 is the Bessel function of the first kind. The estimation is performed for simplicity in the dilute limit, where the structure factor is unity. A more precise treatment of the protein form and structure factor takes into account the non-spherical shape of the proteins and the intermolecular interaction potential.95 Here, however, for our demonstration purpose this approach suffices and yields an estimate of I(Q) in the order of 0.01 photons per pixel for Q = 0.01 Å−1. Being able to resolve signals with lower photon density than 0.01 photons per pixel (i.e., 1 photon per 100 pixels) is challenging due to experimental reasons73,74 and a detailed level of data analysis and filtering is required in order to clear the signal from parasitic contributions, such as detector noise, photon energy fluctuations and pointing jitter of the source.62,68 However by utilising the combination of new detector technology, high coherence sources and novel data analysis techniques, such as the application of machine learning,85 one can potentially overcome such difficulties and resolve signals with even lower photon density corresponding to atomic length scales. In this case however, it is very important to consider the influence of incident radiation on the sample, as discussed in the following section.
By carefully designing the experimental parameters one can also optimise the SNR and minimize the dose.22 Here, we provide an educated guess of the dose when using a high repetition X-ray laser, such as the European XFEL, for performing protein XPCS. The absorbed dose D can be calculated via
![]() | (14) |
Using the parameters detailed in Table 1 we estimate the dose D as a function of beam size α and number of pulses per bunch train Npulses, as shown in Fig. 5. The upper panel depicts the European XFEL pulse structure, where pulses are separated by 220 ns and are grouped in bunch trains, where each bunch train is separated by 100 ms from the next one. For the MID instrument, a typical number of pulses per bunch can reach up to 350 pulses, which matches the number of pulses that can be read within a single train.66 The lower panel shows the estimated dose, where the dashed lines correspond to typical critical doses of 1 kGy and 10 kGy, respectively. The blue region indicates the low-dose limit (D < Dc), emanating from a larger beamsize and fewer pulses/bunch train, and the red region indicates the high-dose regime (D > Dc). In order to perform XPCS at least two pulses per bunch train are required, with variable spacing between them, so that the g2 function can be composed. The calculation presented in Fig. 5 indicates that for a typical beamsize of α = 10 μm, using 2 pulses would correspond to a dose of ∼0.2 kGy which is below the critical doses discussed above. Furthermore, the use of 10 pulses would correspond to ∼1 kGy and 100 pulses to ∼10 kGy. Lowering the X-ray flux allows to reduce the dose although it becomes increasingly challenging to resolve the speckle pattern for scattering intensities below 0.01 photons per pixel, as discussed in the previous sections. An alternative is to increase the focus size, which however will impact the speckle contrast as discussed in the following section. Finding the optimum SNR for a given critical dose within the experimental parameters is thus an optimization problem which can be solved by algorithms taking all mentioned effects into account.22
One of the most established approaches in order to detect radiation damage is to measure the SAXS lineshape as a function of photon flux and deduce radiation damage from changes in the protein radius of gyration.96 For XPCS measurements it is important to, in addition, investigate whether the X-ray beam influences the underlying dynamics. At synchrotron sources, one can achieve this task by measuring the g2 function for different photon fluxes and in this way find the regime where the dynamics become independent of the flux.53 At XFELs one can take a similar approach and sort and bin the detected scattering patterns as a function of the XFEL intensity.62 In addition, it is very helpful to have a second detector so that one can record the wide-angle X-ray scattering (WAXS), which again can allow to deduce any changes in the scattering intensity as a function of flux, in addition to the dynamic information.53,62 Finally, being able to capture the two-time correlation function can also allow for insight into potential beam induced effects, as seen previously in oxide glasses.102
We calculate the speckle contrast β for the parameters listed in Table 1, as shown in Fig. 6. The solid line depicts β as a function of momentum transfer Q for the experimental arrangement where a Si(111) monochromator is used, whereas the dashed line corresponds to the β obtained with a “pink beam”. The corresponding relative photon energy bandwidth ΔE/E is depicted in the figure caption, whereas the schematic above the figure shows speckle patterns for three characteristic contrast values. Despite the relatively rapid decrease of contrast, our estimation indicates that such experiments would benefit by using a monochromator in order to resolve the dynamics at higher momentum transfer Q, whereas further improvement can be implemented by using the XFEL in seeding mode.103
Furthermore, understanding supercooling and vitrification mechanisms of biological solutions is crucial for improving cryopreservation techniques used in medical applications. This is a key step in the storage of frozen organs and cells used in cancer research or in the cryopreservation of microorganisms, such as viruses and bacteria. When performed successfully, rapid quenching will lead to the formation of a perfect glass, free from crystal impurities that can damage the biological content. Even though the majority of studies have so far focused on optimising vitrification based on structural probes, such as transmission electron microscopy or X-ray diffraction techniques, the dynamical aspects are still largely unexplored. With XPCS we can gain new insight into the dynamics of supercooled water and protein solutions.
With several experimental stations being operational or being built in the coming years we expect that XPCS will contribute substantially to addressing open questions in molecular systems. Especially, making XPCS data FAIR104 and openly available will advance the field tremendously by enabling experimental reproducibility and by allowing direct comparison to simulations. Such a step can facilitate the implementation of more sophisticated modeling and data analysis algorithms based on machine learning methods85,86 which will become invaluable in the future in order to cope with rapidly increasing data rates and increasing experimental complexity.
| This journal is © the Owner Societies 2020 |