Open Access Article
Emiliano De Santis *ab, Thomas Mandl cd, Jocky C. K. Kung efg, Khon Huynh hi, Steven Daly j, Lorenza A. D'Alessandro fg, Luke MacAleese e, Charlotte Uetrecht fg, Erik G. Marklund b and Carl Caleman *ck
aDepartment of Physics, University of Rome Tor Vergata and INFN, I-00133 Rome, Italy. E-mail: edesantis@roma2.infn.it
bDepartment of Chemistry - BMC, Uppsala University, Box 576, SE-751 23 Uppsala, Sweden
cDepartment of Physics and Astronomy, Uppsala University, Box 516, SE-751 20 Uppsala, Sweden. E-mail: carl.caleman@physics.uu.se
dUniversity of Applied Sciences Technikum Wien, Höchstädtplatz 6, A-1200 Wien, Austria
eInstitut Lumière Matière (iLM), Université Claude Bernard Lyon 1 & CNRS, UMR5306, F-69100, Villeurbanne, France
fCSSB Centre for Structural Systems Biology, Deutsches Elektronen-Synchrotron DESY, Leibniz Institute of Virology, University of Lübeck, Notkestraße 85, 22607 Hamburg, Germany
gInstitute of Chemistry and Metabolomics, University of Lübeck, Ratzeburger Allee 160, 23562 Lübeck, Germany
hLeibniz Institute of Virology (LIV), Martinistraße 52, 20251 Hamburg, Germany
iSchool of Biomedical Engineering, International University, Vietnam National University, Ho Chi Minh City, Vietnam
jMS Vision, Televisieweg 40, 1322 AM Almere, The Netherlands
kCenter for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, Notkestraße 85, 22607 Hamburg, Germany
First published on 30th May 2025
Structural biology is witnessing a transformative era with gas-phase techniques such as native mass spectrometry (MS), ion mobility, and single-particle imaging (SPI) emerging as critical tools for studying biomolecular assemblies like protein capsids in their native states. SPI with X-ray free-electron lasers has the potential to allow for capturing atomic-resolution structures of proteins without crystallization. However, determining particle orientation during exposure remains a major challenge, compounded by the heterogeneity of the protein complexes. Gas-phase Förster resonance energy transfer (FRET) offers a promising solution to assess alignment-induced structural perturbations, providing insights into the stability of the tertiary structure under various activation methods. This study employs molecular dynamics (MD) simulations to explore chromophore integration's effect on ubiquitin's structure and alignment properties in vacuum. Ubiquitin serves as an ideal model due to its small size, well-characterized properties, and computational simplicity. By investigating chromophore placement, we identified optimal sites for monitoring gas-phase denaturation and unfolding processes, advancing SPI applications and a broader understanding of protein stability in the gas phase.
Both static fields8,9 and laser fields10–12 have been explored for small molecules and have recently also been applied to proteins.13 Strong alignment fields inevitably affect protein structures,14 although simulations identify experimental conditions under which alignment occurs before denaturation.15
Beyond simulations, experimental tools are needed to evaluate alignment efficiency and measure structural perturbations induced by alignment fields. This is especially important in the gas phase, where such tools are lacking. Förster resonance energy transfer (FRET) represents a promising technique to address these challenges.16,17 Because it relies on dipole–dipole interactions between donor and acceptor chromophores, FRET is highly sensitive to changes in donor–acceptor distances, making it suitable for probing the effects of alignment fields on protein structure. Gas-phase FRET18–30 and variants of FRET, including action self-quenching,31 offer potential for sensitive and efficient measurements. Notably, these techniques, when applied to strategically engineered protein mutants, could elucidate stepwise unfolding processes under various activation methods, such as collisions, electric fields, or thermal activation. Grafting chromophores at two deliberately chosen sites within the protein can provide insight into the differential stability of tertiary substructures in the gas phase. Therefore, a deeper understanding of how chromophores influence protein stability and structural properties in the gas phase is essential. Although similar questions have been extensively studied in solution-phase FRET, their relevance to gas-phase systems remains underexplored. Addressing this knowledge gap could provide valuable insights into the interplay between chromophore placement, protein stability, and alignment efficiency.
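The distance sensitivity mentioned above can be illustrated with the standard Förster relation, E = 1/(1 + (r/R0)^6). The following is a minimal sketch rather than part of the study's workflow: the function name and the Förster radius `R0_ANGSTROM` are hypothetical placeholders, and no value for R0 is taken from this work.

```python
def fret_efficiency(r_angstrom: float, r0_angstrom: float) -> float:
    """Standard Förster transfer efficiency, E = 1 / (1 + (r / R0)^6)."""
    if r_angstrom < 0 or r0_angstrom <= 0:
        raise ValueError("distance must be >= 0 and R0 must be > 0")
    return 1.0 / (1.0 + (r_angstrom / r0_angstrom) ** 6)


# Hypothetical Förster radius, used only for illustration (not from the paper).
R0_ANGSTROM = 50.0

for r in (25.0, 50.0, 75.0):
    # The efficiency falls steeply once r passes R0 because of the sixth power.
    print(f"r = {r:5.1f} Å  ->  E = {fret_efficiency(r, R0_ANGSTROM):.3f}")
```

At r = R0 the efficiency is exactly one half by construction, so the steepest response to structural change occurs for chromophore pairs whose separation is close to R0.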
In this study, we employ classical molecular dynamics (MD) simulations to investigate the impact of chromophore integration on protein structure and alignment properties in vacuum. We focus on ubiquitin, a small globular protein, for several reasons: (i) its small size allows us to explore structural effects more prominently than in larger proteins; (ii) ubiquitin is well-characterized experimentally and has been studied using FRET-related techniques;23,32 and (iii) its computational simplicity and extensive literature on vacuum stability make it an ideal model.3,9,33,34 We also focus on a model with the same chromophore on both mutation sites, to mimic action self-quenching experiments31 rather than traditional FRET experiments. This reduces complexity both in the simulations and in the future real-world experiments that we plan to perform. The primary questions we aim to answer are: (1) how does the integration of chromophores affect ubiquitin's structure in the gas phase? (2) how does the placement of the chromophore affect the field-alignment properties of ubiquitin? (3) which chromophore sites are optimal for monitoring gas-phase denaturation and unfolding processes? By addressing these questions, we hope to establish a robust framework for studying protein stability and alignment in the gas phase, advancing the application of SPI and related techniques to structural biology.
The initial wild-type (WT) ubiquitin structure was obtained from the Protein Data Bank (PDB ID: 1UBQ).40 The mutants were generated by substituting, in the WT sequence, the amino acid at the position of interest with a cysteine residue, which was subsequently functionalized with the ATTO520 chromophore (Fig. 1). In each mutant, two residues were replaced by an ATTO520-tagged cysteine: residue 73 was replaced in all cases, and additionally one of the residues at position 6, 20, 35, or 48 was substituted. Accordingly, we labeled the mutants based on the non-73 replacement as ubi6, ubi20, ubi35, and ubi48.
To make the simulations resemble a potential experiment, the additional amino acid sequence ELALVPR was appended to the C-terminus of each variant. We anticipate that a cleavable HIS-tag will be needed for protein production and purification, and the ELALVPR sequence is what would be left behind after purification and the enzymatic cleavage reaction.
The simulation strategy, identical for all simulations, was as follows. First, the ubiquitin structures were relaxed by a steepest descent minimization followed by a 60 ns simulation in water at physiological pH in the isothermal–isobaric ensemble. The temperature was kept constant at 300 K using the v-rescale thermostat41 with a 0.1 ps coupling time. The pressure was kept constant at 1 bar using the Berendsen barostat42 with a 1 ps coupling time and an isothermal compressibility of 4.5 × 10⁻⁵ bar⁻¹. The TIP4P water model was used for water molecules.43 Periodic boundary conditions were imposed on the system, and the Particle Mesh Ewald algorithm44 was employed to treat the long-range Coulomb interactions. The MD integration time step was 2 fs. From each of these bulk simulations, five different structures (replicas) were extracted and used as starting points for the gas-phase simulations. For each replica, we then assigned the amino acid protonation states to match the net charges expected from mass spectrometry experimental data. We thus modeled each system with net charges of +6e, +7e, and +8e. The choice of which amino acids are most likely to be protonated in vacuo was driven by their different gas-phase basicities.45 A comprehensive list of all the amino acid charges for all the simulated systems is shown in Table 1. Relaxation in vacuum was performed using the steepest descent algorithm. We then performed 250 ps simulations with the Berendsen thermostat42 at temperatures of 300 K, 400 K, 500 K, 700 K and 1000 K. Note that these “temperatures” are used purely as surrogates for increasing levels of experimental internal energy deposition and should not be taken as literal gas-phase temperatures. The MD integration time step in vacuum was 0.5 fs. Long-range electrostatic forces were captured by using no cutoffs for nonbonded interactions, and no periodic boundary conditions were applied.
A total of 375 independent simulations were performed (five molecules, three net charges, five replicas, and five temperatures). Bulk simulations were performed using the Gromacs 2019.1 simulation package (ref. 46). Gas-phase simulations were performed using Gromacs 4.6 (ref. 47) compiled in double precision.
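The bookkeeping behind these numbers can be checked with a short sketch. It enumerates the simulation grid described above and converts the stated run lengths and time steps into integration-step counts. The variable and function names are illustrative only and are not taken from the authors' scripts.

```python
from itertools import product

# Simulation grid as described in the text.
variants = ["WT", "ubi6", "ubi20", "ubi35", "ubi48"]
net_charges_e = [6, 7, 8]
replicas = [1, 2, 3, 4, 5]
temperatures_k = [300, 400, 500, 700, 1000]

# One gas-phase run per (variant, charge, replica, temperature) combination.
runs = list(product(variants, net_charges_e, replicas, temperatures_k))
print(len(runs))  # 5 * 3 * 5 * 5 combinations


def n_steps(duration_ps: float, dt_fs: float) -> int:
    """Number of MD integration steps for a given duration and time step."""
    return round(duration_ps * 1000.0 / dt_fs)


bulk_steps = n_steps(60_000, 2.0)   # 60 ns in water at a 2 fs time step
vacuum_steps = n_steps(250, 0.5)    # 250 ps in vacuum at a 0.5 fs time step
print(bulk_steps, vacuum_steps)
```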
Table 1 …, their colors reflect the cartoon representation shown in Fig. 1. The locations of the amino acid partial charges along the linear sequence are indicated for the three different net charges of the simulated systems. The charges are given in units of the elementary charge, e
The experimentally feasible chromophore positions (at amino acids 6, 20, 35, 48 and 73) overlap reasonably well with the regions that our simulations identify as the most dynamic parts of the protein (see Fig. 2). Therefore, for the rest of the study, we focused on four ubiquitin mutants (ubi6, ubi20, ubi35 and ubi48), along with WT ubiquitin. Henceforth, we will refer to both WT ubiquitin and the mutants as variants. Fig. 1 and Table 1 illustrate the variants and the positions in the sequence where the chromophores were introduced.
Fig. 1 Cartoon representation of the structures of the ubiquitin variants prior to vacuum exposure. Secondary structure elements are shown in orange, green and cyan for helix, extended and coil regions, respectively. The chromophores are shown in licorice representation. Their colors match the positions in the linear sequence presented in Table 1.
Fig. 2 Average distance difference between all alpha carbons in WT ubiquitin, comparing simulations at 325 K and 275 K. White markers indicate the positions where the chromophores are placed.
Fig. 3 (a) In the left column, intra-variant RMSD values (extracted from the diagonal of the pairwise RMSD matrix) are plotted for each ubiquitin variant, reflecting the mean structural variability across replica simulations. In the right column, WT-referenced RMSD values, calculated relative to the wild-type structure, illustrate the divergence of each variant from the reference. In both columns, each row corresponds to a system at a distinct net charge (q = +6e, +7e, +8e), and RMSD values are plotted as a function of simulation temperature. RMSD values are expressed in Å, with lower values (<5 Å) indicating high structural similarity and higher values (>5 Å) reflecting significant divergence. The complete pairwise RMSD matrix is presented in Fig. S1 of the ESI,† while additional detailed comparisons among the replicas can be found in Fig. S2–S16 (ESI†). (b) Global hierarchical clustering of the five ubiquitin variants using the UPGMA algorithm on the distance matrix D defined in eqn (1). Branch heights (in Å) correspond to the average Cα RMSD between variant pairs: shorter branches denote greater structural similarity.
From the figures, we can clearly see that, for all charge states and for simulations at temperatures up to 500 K, the average RMSD between the structures is mostly below 10 Å. Within the same mutant, the RMSD is mostly below 5 Å, showing low sample heterogeneity. This holds for all three charges.
Another general trend is that the RMSD increases slightly with the charge. This is in line with experimental evidence for many gaseous proteins, including ubiquitin.51–54 The trend does not hold for every variant at every temperature and charge, but it does hold for the RMSD averaged over all variants at a given charge state and temperature.
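For readers who want to reproduce this kind of pairwise comparison, the sketch below computes a Cα RMSD after optimal rigid-body superposition and assembles a pairwise matrix over a set of frames. It assumes Kabsch superposition, which is a common choice but is not stated in the text as the fitting procedure actually used. The function names are illustrative, and the coordinates in the demonstration are random placeholders rather than simulation data.

```python
import numpy as np


def kabsch_rmsd(p, q) -> float:
    """RMSD between two (N, 3) coordinate sets after optimal superposition."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    if p.shape != q.shape or p.ndim != 2 or p.shape[1] != 3:
        raise ValueError("both inputs must have the same (N, 3) shape")
    # Remove translation by centering both sets on their centroids.
    p0 = p - p.mean(axis=0)
    q0 = q - q.mean(axis=0)
    # Kabsch: SVD of the covariance matrix gives the optimal rotation.
    u, _, vt = np.linalg.svd(p0.T @ q0)
    # Guard against an improper rotation (reflection).
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    rot = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    diff = p0 @ rot.T - q0
    return float(np.sqrt(np.mean(np.sum(diff**2, axis=1))))


def pairwise_rmsd(frames) -> np.ndarray:
    """Symmetric matrix of Kabsch RMSDs between all pairs of frames."""
    n = len(frames)
    mat = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            mat[i, j] = mat[j, i] = kabsch_rmsd(frames[i], frames[j])
    return mat


# Demonstration with placeholder coordinates (not simulation data).
rng = np.random.default_rng(0)
coords = rng.normal(size=(76, 3)) * 10.0
angle = np.deg2rad(40.0)
rot_z = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                  [np.sin(angle), np.cos(angle), 0.0],
                  [0.0, 0.0, 1.0]])
moved = coords @ rot_z.T + np.array([5.0, -3.0, 2.0])   # rigid-body copy
noisy = coords + rng.normal(scale=1.0, size=coords.shape)  # perturbed copy
rmsd_matrix = pairwise_rmsd([coords, moved, noisy])
print(np.round(rmsd_matrix, 3))
```

A rigid-body copy of a structure yields an RMSD of essentially zero, so any non-zero entry in the matrix reflects internal structural change rather than overall rotation or translation.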
To assess the overall structural relationships among our five ubiquitin variants, we first constructed a “global” distance matrix D. Each entry

$$D_{ij} = \frac{1}{N_{\mathrm{cond}}} \sum_{(q,T),\; T \le 500\,\mathrm{K}} \overline{\mathrm{RMSD}}_{ij}(q,T) \qquad (1)$$

where $\overline{\mathrm{RMSD}}_{ij}(q,T)$ denotes the average of the submatrix comparing the five frames of variant i to those of variant j under condition (q,T), and Ncond is the total number of (q,T) pairs. We chose the 500 K cutoff because our initial RMSD analyses showed that, below this temperature, all variants retained the native fold across charge states, thus focusing the clustering on biologically relevant conformational variability rather than high-temperature denaturation. This global distance matrix D was then subjected to agglomerative hierarchical clustering using the Unweighted Pair Group Method with Arithmetic Mean (UPGMA55) as implemented in the SciPy library.56 The resulting dendrogram (Fig. 3(b)) visualizes the mutual structural similarity of the five variants, with branch heights corresponding to mean RMSD distances in Å. Across all charges and temperatures, the dendrogram shows:
• ubi35 and ubi48 form the tightest pair (smallest branch height ≈1.9 Å), indicating these two site-specific variants are most structurally similar.
• WT joins that cluster next (merge height ≈2.5 Å), showing that the native protein more closely resembles the ubi35/48 mutants than the ubi6 or ubi20 variants.
• ubi6 merges at a higher RMSD (≈4.1 Å), and ubi20 is the most divergent, joining only at the top of the tree (≈4.7 Å).
Thus, on average across all simulated conditions, ubi35 and ubi48 preserve the native fold most faithfully, while ubi6—and especially ubi20—induces the largest conformational shifts. Table 1 lists the modification sites; no obvious positional trend explains the observed deviations from WT, implying a complex interplay of local and long-range effects.
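The clustering step can be sketched with SciPy as follows. This is an illustration under stated assumptions, not the authors' script. The per-condition matrices are hypothetical and were chosen only so that the resulting merge heights resemble those quoted above. The helper `global_distance_matrix` is an illustrative name, and setting the diagonal to zero is an assumption made so that the matrix is a valid distance matrix for SciPy.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import squareform

labels = ["WT", "ubi6", "ubi20", "ubi35", "ubi48"]


def global_distance_matrix(block_means: dict) -> np.ndarray:
    """Average per-condition variant-by-variant mean-RMSD matrices.

    block_means maps a condition (q, T) to an (n, n) array whose entry (i, j)
    is the mean RMSD between the frames of variants i and j.
    """
    d = np.mean(np.stack(list(block_means.values())), axis=0)
    d = 0.5 * (d + d.T)          # enforce exact symmetry
    np.fill_diagonal(d, 0.0)     # assumption: zero self-distance for clustering
    return d


# Hypothetical values in Å (NOT the paper's data), for illustration only.
base = np.array([
    [0.0, 4.1, 4.7, 2.5, 2.5],
    [4.1, 0.0, 4.7, 4.1, 4.1],
    [4.7, 4.7, 0.0, 4.7, 4.7],
    [2.5, 4.1, 4.7, 0.0, 1.9],
    [2.5, 4.1, 4.7, 1.9, 0.0],
])
offset = 0.2 * (1.0 - np.eye(len(labels)))
block_means = {(6, 300): base + offset, (7, 300): base - offset}

D = global_distance_matrix(block_means)

# UPGMA corresponds to average linkage on the condensed distance matrix.
Z = linkage(squareform(D), method="average")
merge_heights = Z[:, 2]
first_pair = {labels[int(Z[0, 0])], labels[int(Z[0, 1])]}
print(first_pair, np.round(merge_heights, 2))
```

Each row of `Z` records one merge, and its third column is the branch height, that is, the average distance between the two clusters being joined.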
The results indicate that at temperatures up to 700 K, the distance between the chromophores is rather well defined and stable. The effect of increasing the charge state on the inter-chromophore distance is barely noticeable at 500 K (and only for mutant ubi48) and becomes more significant starting from 700 K. In other words, it is only at higher temperatures that charge-state-induced unfolding becomes significant. Overall, this suggests that the experimental selection of an arbitrary charge state (usually guided by pragmatic signal-to-noise considerations) may not be of paramount importance for experiments targeting the structural characterization of a given protein sample. This is promising for experiments measuring the chromophore distances, such as FRET performed at room temperature. At 1000 K, on the other hand, the spread in distances is wide, and it will be hard to draw any reasonable conclusion about the state of the molecule based solely on the chromophore distances. At this temperature the proteins are denatured, and from the figure it is clear that higher charges result in a more unfolded protein.
Fig. 6 Secondary structure analysis of ubiquitin is shown for the crystal structure, PDB ID: 1UBQ (top) and simulations performed at 500 K (bottom). Each line represents a replica for a specific mutant and net charge of the systems, where coil, extended, and helix structures are depicted in cyan, green, and orange, respectively.
In Fig. 8, we present the intrinsic dipole of ubiquitin as a function of time, temperature, and net charge in the simulated systems. The dipole moment $\vec{\mu}$ was computed as $\vec{\mu} = \sum_{i=1}^{N} q_i \left(\vec{r}_i - \vec{r}_{\mathrm{COM}}\right)$, where $q_i$ is the partial charge of atom i and $\vec{r}_i$ is its position, $\vec{r}_{\mathrm{COM}}$ is the center of mass of the protein, and N is the total number of atoms. This definition ensures translational invariance and is commonly used in simulations of charged systems.
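The dipole definition given in the text (partial charges weighted by positions relative to the center of mass) can be written as a few lines of NumPy. This is a minimal sketch with illustrative names. The three-atom system is a toy example and not data from the simulations. Coordinates are assumed to be in Å and charges in units of e, so the result is in e·Å.

```python
import numpy as np

E_ANGSTROM_TO_DEBYE = 4.80320  # 1 e·Å expressed in debye (approximate)


def dipole_moment(charges, positions, masses) -> np.ndarray:
    """Dipole vector relative to the center of mass, in e·Å."""
    q = np.asarray(charges, dtype=float)
    r = np.asarray(positions, dtype=float)
    m = np.asarray(masses, dtype=float)
    com = (m[:, None] * r).sum(axis=0) / m.sum()
    return (q[:, None] * (r - com)).sum(axis=0)


# Toy system with a net charge of +1 e (placeholder values).
charges = [0.6, 0.7, -0.3]
positions = np.array([[0.0, 0.0, 0.0],
                      [1.5, 0.0, 0.0],
                      [0.0, 2.0, 0.5]])
masses = [12.011, 14.007, 15.999]

mu = dipole_moment(charges, positions, masses)
shift = np.array([10.0, -4.0, 7.5])
mu_shifted = dipole_moment(charges, positions + shift, masses)

# Referencing the center of mass makes the result independent of where the
# molecule sits in space, even though the net charge is non-zero.
print(np.linalg.norm(mu) * E_ANGSTROM_TO_DEBYE, np.allclose(mu, mu_shifted))
```

Without the center-of-mass reference, the dipole of a system with non-zero net charge would change with the choice of coordinate origin, which is why the reference point matters for the charged proteins considered here.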
We observe that for net charges of +7e and +8e at temperatures of 300 K and 400 K, the intrinsic dipole of the mutants consistently surpasses that of the WT, thus confirming our speculation. For a net charge of +8e, the dipole of the mutants is also larger than that of the WT at 500 K. However, this trend is not observed for a net charge of +7e. Among the mutants, ubi48 shows the smallest change in dipole due to the presence of the chromophore. Thus, the dipole enhancement effect is more pronounced at high charge states and low temperatures, which is interesting for experiments performed at room temperature on samples with dimensions similar to those of ubiquitin. This effect might nevertheless be less pronounced for the larger proteins and protein complexes classically investigated by native mass spectrometry, whose typical masses and charge states are both 10- to 100-fold larger than those of ubiquitin.
In classical molecular dynamics the charges are fixed during the simulations, and any effects caused by charge migration (due to a proton or an electron changing its position) are neglected. This includes processes such as proton hopping or charge scrambling, which are known to occur in the gas phase and can influence the detailed structure and dynamics of highly charged or partially unfolded proteins. While recent methods have begun to address such effects,63 there is currently no widely accepted or computationally efficient approach to accurately model charge mobility within standard MD frameworks. Nonetheless, fixed-charge MD simulations have been shown to offer qualitatively reliable insights into gas-phase protein behavior, particularly in capturing relative stability trends.64–67
Our simulations are also limited in that they consider only one charge distribution per molecule and total charge state, and the model does not allow for any charge transfer. In experiments over ensembles of molecules (such as mass spectrometry experiments where all ions with a given mass-to-charge ratio are simultaneously selected), multiple populations of charge distributions might be present at the same time, potentially affecting the secondary and tertiary structures of the protein and producing conformational ensembles within the same charge state. However, by including an initial equilibration in solution at physiological pH, we selected charge distributions that we believe provide a realistic and representative model of experimental conditions.
Furthermore, it is worth pointing out that the applied simulation “temperatures” serve as tunable proxies for varying levels of internal vibrational energy deposition (e.g., in action-FRET experiments), rather than literal gas-phase temperatures. Because each protein's energy uptake and redistribution depend on chromophore placement, net charge, and the specifics of the excitation mechanism, there is no one-to-one mapping between an MD temperature and a precise experimental energy input, reinforcing their role as qualitative surrogates for relative activation.
Our simulations show that introducing the chromophore affects the dynamics of the protein. For all chromophore positions and charges, a general conclusion is that the structures are well preserved and similar to the WT up to 500 K (moderate energy deposition).
Moreover, we notice that the mutant in which the distance between the two chromophores is most affected by temperature is ubi6. At low temperatures (up to 500 K) the distance is below 20 Å, but at 1000 K (high energy deposition) the distance is around 30 Å (Fig. 5). This indicates that ubi6 might be the best mutant to use for measuring the structural stability in a FRET experiment. This conclusion holds for all charge states, but is more obvious at +7e and +8e than at +6e. In ubi20 the chromophores are also separated by more than 30 Å at 1000 K, but since they start further apart than the chromophores in ubi6, the difference is smaller. Interestingly, ubi6 and ubi20 are also the mutants that show the greatest structural deviation from WT in our clustering analysis, suggesting a correlation between chromophore displacement and overall structural divergence. Judging from our simulations, it is not straightforward to use chromophores at different positions in ubiquitin to monitor the temperature-induced unfolding using FRET. Our simulations do not reveal a pattern where the two chromophores in the protein are separated in a way that corresponds to their positions in the amino acid sequence, at least at the temperatures where the protein structure is close to the original, folded state. At 1000 K, however, the distance between the chromophores mirrors their positions in the amino acid chain. Adding the chromophores in general increases the dipole of the system, which could be beneficial for dipole orientation; however, our simulations only show an increase for the two higher charge states (see Fig. 8). Since the structure of the proteins is not drastically affected by the presence of the chromophores, it might be a good idea to consider adding chromophores to proteins for dipole orientation.
In summary, our main findings are: (i) all mutants are structurally stable at temperatures up to 500 K; (ii) below this temperature threshold, dendrogram analysis reveals that ubi35 and ubi48 retain conformations most similar to WT, while ubi20 diverges significantly; (iii) functionalization with chromophores increases the overall dipole moment without substantially altering the protein structure, making chromophores promising tools for monitoring and potentially assisting dipole alignment in gas-phase proteins.
Footnote
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5cp01297j
This journal is © the Owner Societies 2025