Karen
Dos Santos‡
a,
Alessio
Bartocci‡
bcd,
Natacha
Gillet
e,
Sandrine
Denis-Quanquin
e,
Amandine
Roux
ef,
Eugene
Lin
a,
Zeren
Xu
a,
Raphael
Finizola
e,
Pauline
Chedozeau
a,
Xi
Chen
a,
Cédric
Caradeuc
a,
Mathieu
Baudin
ag,
Gildas
Bertho
a,
François
Riobé
h,
Olivier
Maury
c,
Elise
Dumont
*ij and
Nicolas
Giraud
*a
aUniversité Paris Cité, Laboratoire de Chimie et Biochimie Pharmacologiques et Toxicologiques, UMR CNRS 8601, Paris, France. E-mail: nicolas.giraud@u-paris.fr
bDepartment of Physics, University of Trento, Via Sommarive 14, I-38123 Trento, Italy
cINFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Via Sommarive 14, I-38123 Trento, Italy
dInstitut de Chimie de Strasbourg, UMR 7177, CNRS, Université de Strasbourg, Strasbourg Cedex 67083, France
eUniv. Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F69342, Lyon, France
fPolyvalan SAS, Lyon, France
gLaboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL Université, Sorbonne Université 45 Rue d’Ulm, 75005 Paris, France
hUniv. Bordeaux CNRS, Bordeaux INP, ICMCB UMR 5026, F-33600 Pessac, France
iUniversité Côte d’Azur, CNRS, Institut de Chimie de Nice, UMR 7272, 06108 Nice, France. E-mail: elise.dumont@univ-cotedazur.fr
jInstitut Universitaire de France, 5 rue Descartes, 75005 Paris, France
First published on 1st May 2024
The supramolecular interaction between lanthanide complexes and proteins is at the heart of numerous chemical and biological studies. Some of these complexes have demonstrated remarkable interaction properties with proteins or peptides in solution and in the crystalline state. Here we have used the paramagnetism of lanthanide ions to characterize the affinity of two lanthanide complexes for ubiquitin. As the interaction process is dynamic, the acquired NMR data only reflect the time average of the different steps. We have used molecular dynamics (MD) simulations to get a deeper insight into the detailed interaction scenario at the microsecond scale. This NMR/MD approach enabled us to establish that the tris-dipicolinate complex interacts specifically with arginines and lysines, while the crystallophore explores the protein surface through weak interactions with carboxylates. These observations shed new light on the dynamic interaction properties of these complexes, which will ultimately enable us to propose a crystallization mechanism.
In this paper, we report a comprehensive structural analysis of the dynamic association between a lanthanide complex and a protein. Capitalizing on our recent work demonstrating the combined use of paramagnetic NMR and molecular dynamics (MD) to study interactions with intrinsically disordered peptides,17 we extend our approach to get insight into another example of a weakly interacting system involving a more rigid protein, ubiquitin, and two complexes of interest, namely tris-dipicolinate and crystallophore (Fig. 1). First, ubiquitin is a small protein that plays a critical role in regulating various cellular processes, including protein degradation,18 DNA repair,19,20 and signal transduction,21,22 which makes it a key component in maintaining cellular homeostasis. Second, the tris-dipicolinate lanthanide complex [Ln(DPA)3]3−, which exists as a racemic mixture of two enantiomers that readily interconvert in water at room temperature,23 has been shown to develop supramolecular interactions with proteins.24–28
Fig. 1 Structures of (A) the tris-dipicolinate lanthanide complex, [Ln(DPA)3]3− and (B) the crystallophore [Ln–Xo4]+.6 |
It presents several advantages for probing protein structure and interactions: (i) there is no need to modify the protein chemically because [Ln(DPA)3]3− binds non-covalently to it, (ii) the recovery of the protein can be achieved easily via dialysis and (iii) in many cases, the interaction process results in a fast exchange with respect to chemical shifts between the bound and the free states, which allows for a simple monitoring using, for instance, a NMR titration protocol.28 The interactions between ubiquitin and a series of derivatives of [Ln(DPA)3]3− were successfully described by Wei et al. to demonstrate the potential of using lanthanide complexes as promising non-covalent tags for obtaining angle and distance restraints of proteins in structural biology applications.27 Wei et al. have notably observed that all derivatives of [Ln(DPA)3]3− interact with the first region including residues K6, I44, and H68. They have also reported that positively charged residues R72 and R74 from the C-terminal region are also prone to interact with the complexes; however, they have proposed that under the experimental conditions of their analysis this interaction is a transient one due to the flexibility of the C-terminal tail and did not identify it as a potential secondary interaction site with ubiquitin. Third, the crystallophore (Xo4) lanthanide complex [Ln–Xo4]+, which consists of a cationic Ln3+ with a macrocyclic triazacyclononane bis-picolinate ligand (Fig. 1), has been shown to be a powerful agent for protein crystallography, with optimized properties for nucleation, phasing in X-ray crystallography, and luminescence.6,29–32
One fundamental issue that is raised in this study is the weakness of the interactions that are transiently created between the lanthanide complex and the protein of interest, which often leads to an overall low affinity between the two partners. This supramolecular process yields in general small and sometimes undetectable perturbations of standard NMR spectra recorded for such samples. To overcome this limitation and gain a better insight into this interaction mechanism, we have implemented paramagnetic NMR experiments, using suitably chosen lanthanide metals to generate paramagnetic relaxation enhancements (PREs) or pseudo-contact shifts (PCSs) that are tailored to highlight the regions of the protein involved in the interaction with the lanthanide complex. Furthermore, we have computed all-atom molecular dynamics simulations spanning microseconds, to interpret our NMR data in light of the dynamic picture yielded by the resulting MD trajectories.
To build a more accurate picture of the regions of the protein that are at the interface with the lanthanide complex upon interaction, we have calculated for each N–H correlation of ubiquitin the peak intensity ratio Ipara/Idia, where Ipara and Idia are the intensities of a given correlation in the protein with and without the paramagnetic species, respectively. Fig. 3A shows the evolution of the resulting intensity ratio profile for increasing amounts of [Gd(DPA)3]3−.
Three different regions can be highlighted, which show a lower Ipara/Idia ratio upon addition of [Gd(DPA)3]3−: (i) K6 to T12, (ii) Q40, R42, F45, A46, G47, K48 and Q49, and (iii) T66, H68, L71, R72, L73, R74 and G76. This observation suggests (i) that the paramagnetic complex comes close to these residues during the interaction process and (ii) that there is more than one interaction site in the ubiquitin protein under our experimental conditions.
Fig. 4 shows the shift variation induced on amide protons by the addition of 10 equivalents of [Yb(DPA)3]3− to the ubiquitin sample. Two regions with significant chemical shift variations can be identified, including residues (i) G47–K48–Q49 and (ii) T66–L67, while other residues around these regions exhibit smaller variations. It should be reminded here that on the one hand the shifted signals arise from a fast exchange process of the protein between a free and a “bound” state. It is also possible that this “bound” state can be described as a dynamic association process that would also contribute to partially averaging the observed chemical shift variations by modulating the dipole–dipole interaction between the unpaired electron from the Yb center and the amino-acid N–H group in the protein.34
On the other hand, the sign and the amplitude of those variations, which reflect the anisotropy of the magnetic susceptibility tensor of the lanthanide ion, depend on the geometry of each interaction site in the protein. A thorough analysis of these variations is out of the scope of this paper, but we can however conclude that the use of this paramagnetic lanthanide ion allows for mapping rather accurately the main regions involved in the weak interaction with the tris-dipicolinate complex.
In conclusion for this part, under the experimental conditions of our analysis, the combination of PRE and PCS analyses reveals three main regions in ubiquitin that are involved in an interaction process with [Ln(DPA)3]3−.
Given the amount of paramagnetic species that are present in the solution, the amplitude of the observed relaxation effect suggests a very weak though real interaction between [Gd–Xo4]+ and ubiquitin, hence the average relaxation enhancement that is even slightly greater than the one observed with [Gd(DPA)3]3−, all other factors being equal. However, no specific interaction site can be identified through a noticeable perturbation of its spectral signature, which can be explained by two factors: (i) overall, [Gd–Xo4]+ has a net charge of +I compared to the −III charge of [Gd(DPA)3]3−, which can lead to a weaker and thus more diffuse electrostatic interaction mechanism over the surface of the protein; and (ii) although [Gd(DPA)3]3− is known to interact dominantly with cationic residues such as lysine or arginine, [Gd–Xo4]+ is expected to interact with anionic chemical moieties such as carboxylate groups.29
At this stage, another viewpoint is needed to complement NMR analysis with a dynamic picture of the association process. In a previous paper,17 we explored the potential of molecular dynamics for proposing an interaction scenario that can bridge the gap between the transient supramolecular interactions involving a mobile partner, and the average picture yielded by NMR. In the following, we perform MD simulations to decipher the genesis of the binding between the two lanthanide complexes under investigation and ubiquitin.
The different behaviors of the two lanthanide complexes are also evident from the per-residue normalized contact probabilities given in Fig. S8 (ESI†). For [Ln(DPA)3]3−, the relative population of the ligand–protein complex is almost equally shared between two main interaction sites, located around residues: H68–G47 and R42–R72–R74 (Fig. 6). A third, less populated binding site corresponds to the interaction with the surface, solvent-exposed lysine K6. The cluster analysis performed on the concatenated 3.8 μs (see ESI†) allowed us to extract the most representative structures of the ubiquitin–lanthanide complex system in line with the contact probability. The latter are shown in Fig. 6, which depicts the two major binding sites (see also Fig. S9 and S10, ESI†). It is worth noting that these sites do not correspond to a single residue, but to several that cooperatively help to bind the lanthanide tris-dipicolinate complex.
Fig. 6 Structure of ubiquitin with regions of higher contact probability with the [Ln(DPA)3]3− highlighted in red colour. Expanded regions showing the 3 most representative structures (1, 2 and 3) of the interaction between ubiquitin and [Ln(DPA)3]3− obtained from cluster analysis. Within the same protein structure, different complexes are captured interacting with the protein. The relative population percentages are 27%, 27% and 25% for structures 1, 2 and 3, respectively. Binding sites H68 and R42–R74 are reported as orange and red green lines and residues are listed in Table S3 (ESI†). |
The normalized contact probability (see Fig. S8, ESI†) corroborates that most of the interactions rely on positively-charged (such as K6, K11, K33, R42, H68, R72 and R74) and neutral (such as Q40, G47 and Q49) residues, while no interaction is likely to occur between [Ln(DPA)3]3− and negatively charged residues. In addition, these results suggest that the ubiquitin:[Ln(DPA)3]3− interaction is neither rigid nor gives rise to a single well-defined adduct, instead demonstrating a highly dynamic association between the lanthanide complex and the protein.
Our MD simulations also support that from 2 to 4 [Ln(DPA)3]3− interact with one ubiquitin (see Fig. S11 and S12, ESI†). Conversely, for [Ln–Xo4]+ we observe more than 50% of the relative population of complexes interacting with ubiquitin in one representative structure (Fig. 7), with the most probable stoichiometry values ranging between 1 and 2, reflecting a lower affinity. As expected, [Ln–Xo4]+ molecules are mainly found to interact with negatively charged sidechains, (i) glutamate (E18, E34, and E64) and (ii) aspartate (D32, D51, D58). Few contacts with other sidechains (K33 and Q40) and even hydrophobic residues (G35 and L73) are however also observed. Furthermore, the analysis of the contact probabilities (Fig. S8, ESI†) yields a more accurate picture of the latter interaction process. Indeed, it should be noted that the average contact probability is significantly lower for [Ln–Xo4]+ than that for [Ln(DPA)3]3−, which suggests that [Ln–Xo4]+ explores a broader surface of ubiquitin, with a significantly shorter residence time for each interaction site, highlighting the different behavior of this complex.
PRE – [Gd(DPA)3]3− | PCS – [Yb(DPA)3]3− | Molecular dynamics | |
---|---|---|---|
Site #1 | A46/G47/K48/Q49/T66/H68 | I44/G47/K48/Q49/T66/L67/H68 | I44/A46/G47/K48/Q49/T66/L67/H68 |
Site #2 | R42/R72/L73/R74 | R72/R74 | R42/R72/L73/R74 |
Site #3 | K6/T12 | — | K6: K11/T12 |
For the first interaction site, H68 should be emphasized because the protonation state of its side chain, which is pH dependent, is known for regulating the ubiquitin-binding.35 This residue belongs to the main interaction site identified by Wei et al. for the same supramolecular system.27 Furthermore, the second interaction site located at the C-flexible tail contains two positively charged arginine residues R72 and R74. It should be noted here that these residues are known for interacting with aromatic groups via cation–π interactions.36,37 More generally, most of the proteins that have been shown to bind with lanthanide complexes present an arginine residue in their binding site.28 In this study, we observe that this C-flexible tail indeed plays a role in the binding of [Ln(DPA)3]3− to ubiquitin. Our molecular dynamic simulations also enlighten us on the role of R42 in this interaction site. The pair of residues R72 and R74 forms a molecular tweezer that is flexible enough to host two tris-dipicolinates, which are further stabilized by the interaction with R42.38 We note that R72 and R74 have also been investigated by Wei et al. They have determined that the affinity of these residues for DPA ligands decreases significantly at higher pH values (from 6.4 to 7.6), which prevented them from considering these amino-acids as a primary binding site.27 It is also worth noting that several lanthanide complexes were used in their study except [Yb(DPA)3]3−, i.e., with protons at positions ortho and para on the aromatic ring, as in the present work. For the third interaction site, our MD simulations also show that lysine residues such as K6 and K11 are able to make transient contacts with the complex, as evidenced by the interaction footprint obtained along our MD simulations (see Fig. S8, ESI†). Such contacts characterize a transient site residue, typically solvent-exposed, which is in general too weak to be reflected in chemical shift or relaxation perturbation (except for a small relaxation enhancement detected for K6). K33, R54 and K63 play this role as well, as shown in Fig. S10 (ESI†).
To perform a more quantitative analysis of the observed perturbations of TOCSY correlations, we have fitted the evolution of selected measured 1H shifts upon the addition of [Yb(DPA)3]3− with titration curves modeling the effect of the interaction process (Fig. S3, ESI†). Relying on the results of MD simulations and NMR measurements, for each interaction site described above we have selected a set of residues that have been identified as belonging to this site, and we have chosen to fit their 1H shift evolutions together to determine one unique dissociation constant (Kd) value, hence improving the accuracy of the parameter adjustment. We have also assumed that because the structure of ubiquitin is not altered by the interaction process, the three interaction sites are independent. From the analysis of these titration curves it was possible to determine the average values of the dissociation constant of the order of 19 ± 1 mmol L−1 for Site #1, 7.3 ± 0.7 mmol L−1 for Site #2, and 11 ± 8 mmol L−1 for Site #3, reflecting overall a rather weak association process between ubiquitin and [Yb(DPA)3]3−. Noteworthily, these values are higher than the one reported by Wei et al. (Kd = 0.3 mmol L−1).27 The average shifts measured for the different steps of the titration experiment and the resulting titration curves can be found in the ESI† (see Table S2 and Fig. S3).
Finally, our data provide an interesting insight into residues that do not belong to one of the interaction sites highlighted above but play a particular role in the stability of the protein, or even in the supramolecular interaction process itself. On the one hand, Q41 is a widely conserved, crucial residue in the ubiquitin family of proteins as it forms the beginning of the β-strand and its side chain is tightly packed against the α-helix.39 This residue is most rigidly locked in the core of the protein, with its amino protons forming stable hydrogen bonds with the backbone carbonyl oxygens of K27 and I36, respectively.40 Interestingly, the latter hydrogen bond I36–Q41 is known to be decisive in defining the native, folded state of ubiquitin.41 Moreover, the side chains of residues Q41, L43, and F45 point towards the hydrophobic core, while the side chains of R42 and I44 are solvent-exposed.38 Beyond being essential for protein stability as described above, our NMR data suggest that the region around Q41 also participates in the protein–[Ln(DPA)3]3− supramolecular interaction. Indeed, although Q41 does not directly interact with the tris-dipicolinate ligand, this region presents residues, such as Q40 and even R42, that can interact with the negatively charged [Ln(DPA)3]3−. Indeed, these positively charged residues have already been reported as bridge residues between binding sites, due to their ability to coordinate ligands at the protein surface.42–44 We surmise that the use of a non-polarizable force field may also lead to overshooting the formation of salt-bridges with respect to other non-covalent interactions. Our MD simulations in the μs-time range are not prone to capture all microsecond-to-millisecond motions. On the other hand, particular attention should also be paid to residues K11, K27, K29, E34, and D52 that participate in the salt bridges within the α-helix anchor to the β-sheet.45,46 The K11–E24 salt bridge is a second critical interaction defining the ubiquitin folded, native state. Interestingly, most of these residues – with the exception of K11 – are not affected by the presence of the paramagnetic center, which suggests that the protein structure is not significantly changed upon binding. Similarly, no signal perturbation was detected in the residues V17 to A28 that form the α-helix.
Our combined NMR/MD investigation offers a direct comparison between the contrasted modes of action of [Ln(DPA)3]3−vs. [Ln–Xo4]+, which is fully consistent with the overall behavior of the crystallophore toward proteins described elsewhere, with a more versatile and tunable mode of action. Indeed, solid state crystallography data indicate that [Tb–Xo4]+ is able to induce a large variety of interactions with (i) the negatively charged residue via direct coordination to the lanthanide ion (i.e. carboxylate), (ii) positively charged residues (arginine and lysine) via hydrogen bonding and (iii) aromatic residues (tryptophan and phenylalanine) via hydrophobic interaction.29
On the other hand, we have shown for the [Ln–Xo4]+ complex that molecular dynamics allows for describing the main characteristics of the sites of the protein, which are visited, even furtively, by the complex when it explores its surface. We have notably been able to understand that [Ln–Xo4]+ targets negatively charged functions such as carboxylate groups, which is coherent with our NMR observations even if their interpretation is more challenging for such highly dynamic association processes. This ability to offer a description, even qualitative, of very weak interaction processes is important, because it has been demonstrated, for example, that the remarkable crystallization properties of [Ln–Xo4]+ are linked to its ability to interact not too strongly with proteins. However, probing the direct environment around these chemical functions has been shown to be out of the reach of the NMR experiments that were chosen to monitor this interaction process, i.e. 2D 1H–1H TOCSY and 1H–15N HSQC. We note that 1H–13C HSQC spectra might give further information about possible interaction sites involving carboxylate groups, although this analytical strategy would require a 13C-labelled protein sample to address the sensitivity issue raised by the actual protein concentration. The use of lanthanide center with stronger paramagnetism is also explored in our group to address this issue.
The development of such an approach combining MD and NMR is therefore strategic, since it will pave the way for the modelling of not only the structural, but also the dynamic basis of supramolecular mechanisms such as the crystallization of proteins in the presence of this type of crystallization agent.
The 2D 1H–1H TOCSY spectra were recorded using a MLEV-17 spin-lock sequence with a mixing time of 70 ms and a solvent suppression scheme. Typically, spectra were acquired with 256 t1 increments, 1024 data points, and a relaxation delay of 0.8 s and 8 with 32 scans.
The 2D 1H–15N HSQC spectra were obtained using a standard phase-sensitive edited HSQC sequence with gradients and adiabatic pulses. Typically, 256 t1 increments were acquired, with 4096 data points, a relaxation delay of 2 s, and 16 scans. All spectra were zero-filled in the F1 spectral dimension to 1024 data points followed by a forward linear prediction of 400 points. The baseline of the FID was corrected using a Gaussian function to further suppress the residual water signal. Finally, a square sine bell window function (SSB = 2) was applied in both dimensions prior to the Fourier transformation.
Binding site search onto the protein surfaces follows a computational protocol recently exploited.42–44,50 Ubiquitin monomer structure with 76 residues was taken from an unpublished, high-resolution PDB provided by Dr E. Girard IBS, Grenoble. Protonation states of ubiquitin residues were assigned using the H++ server48 at pH 6.5. Ubi–[Ln(DPA)3]3− and Ubi–LnXo4 with 10 ligands (5 Δ and 5 Λ enantiomers) were immersed in parallelepiped boxes containing 56940 and 65397 TIP3P49 water molecules, respectively. Na+ and Cl− counterions were added to correspond to a concentration of 0.15 M and to mimic the experimental ionic strength of about 0.17 M, leading to simulation boxes with dimensions of 130 × 123 × 123 Å3 and 134 × 130 × 128 Å3 (shown in Fig. S5, ESI†).
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cp00463a |
‡ K. Dos Santos and A. Bartocci contributed equally to this work as the first authors. |
This journal is © the Owner Societies 2024 |