Jesika
Schilder
a,
Wei-Min
Liu
a,
Pravin
Kumar
b,
Mark
Overhand
a,
Martina
Huber
b and
Marcellus
Ubbink
*a
aLeiden Institute of Chemistry, Leiden University, Gorlaeus Laboratories, Einsteinweg 55, 2333 CC Leiden, The Netherlands. E-mail: m.ubbink@chem.leidenuniv.nl
bDepartment of Physics, Huygens-Kamerlingh Onnes Laboratory, Leiden University, Niels Bohrweg 2, 2333 CA Leiden, The Netherlands
First published on 3rd September 2015
Paramagnetic NMR is a useful technique to study proteins and protein complexes and the use of paramagnetic relaxation enhancement (PRE) for this purpose has become wide-spread. PREs are commonly generated using paramagnetic spin labels (SLs) that contain an unpaired electron in the form of a nitroxide radical, with 1-oxyl-2,2,5,5-tetramethyl-2,5-dihydropyrrol-3-ylmethyl methane thiosulfonate (MTSL) being the most popular tag. The inherent flexibility of the SL causes sampling of several conformations in solution, which can be problematic as over- or underestimation of the spatial distribution of the unpaired electron in structural calculations will lead to errors in the distance restraints. We investigated the effect of this mobility on the accuracy of protein–protein docking calculations using intermolecular PRE data by comparing MTSL and the less mobile 3-methanesulfonilthiomethyl-4-(pyridin-3-yl)-2,2,5,5-tetramethyl-2,5-dihydro-1H-pyrrol-1-yloxyl (pyMTSL) on the dynamic complex of cytochrome c and cytochrome c peroxidase. No significant differences were found between the two SLs. Docking was performed using either single or multiple conformers and either fixed or flexible SLs. It was found that mobility of the SLs is the limiting factor for obtaining accurate solutions. Optimization of SL conformer orientations using intra-molecular PRE improves the accuracy of docking.
To convert the observed relaxation rates into distances, the correlation time (τc) of the vector r that connects the paramagnetic centre and the nucleus is required. τc depends of the rotational correlation time of this vector (τr) as well as the longitudinal electronic relaxation time (τs), according to τc−1 = τr−1 + τs−1.4,5 For spin labels, the contribution of τs is small and τc is dominated by τr. Any motions that change the vector will have an effect on τc, such as protein tumbling, SL mobility, local protein dynamics, and, for intermolecular PREs measured in a complex, the motions of one protein relative to the other.6 Thus, it is not straight forward to determine τc but, fortunately, due to the sixth power dependence of the PRE on the distance r between spin label and the nucleus, errors in τc result in only small errors in the distances.7,8
The differences between the free energies of SL conformations are often smaller than the thermal energy in the sample, meaning that the position of the unpaired electron is spatially distributed over an area determined by the occupied SL conformer orientations.6 Over- or underestimation of the spatial distribution of the free electron will lead to errors in the apparent mean distance, [〈r−6〉−1/6].7 To solve this problem, the SL can be treated as an ensemble of non-self-interacting conformers during simulated annealing calculations.7 In this case, the number of conformers required to make up the ensemble depends on the actual spatial distribution of the SL in solution as well as the precision required to match the experimental data for nearby nuclei, which will experience the strongest PRE. However, using an ensemble could potentially generate worse results if non-realistic conformer orientations are used, leading to an inaccurate structure of the protein complex.6 This can be overcome by first determining the most favourable conformer orientations experimentally using intra-molecular PRE data, followed by fixing the SL conformers in those positions during docking calculations based on inter-molecular PRE data. This has yielded good results using as few as one SL conformer for multi-domain proteins8,9 and has also been used to study DNA–protein interactions.10
Due to its wide commercial availability, as well as the availability of a suitable diamagnetic control, the most commonly used SL is 1-oxyl-2,2,5,5-tetramethyl-2,5-dihydropyrrol-3-ylmethyl methane thiosulfonate (MTSL). However, MTSL has a rather long linker consisting of five single bonds from the peptide Cα atom to the 3-pyrolline ring (Fig. 1B), resulting in substantial flexibility and dynamics.
Fig. 1 Molecular structures of three methanesulfonothioate based tags attached to a cysteine side chain. (A) MTS (diamagnetic control), (B) MTSL and (C) pyMTSL.11 |
In order to limit tag dynamics, the pyrolline ring of MTSL can be modified to increase the tag rigidity.6 This was successfully done via the addition of a pyridyl group to position 4 of the pyrolline ring resulting in 3-methanesulfonilthiomethyl-4-(pyridin-3-yl)-2,2,5,5-tetramethyl-2,5-dihydro-1H-pyrrol-1-yloxyl (pyMTSL; also called HO-3606) (Fig. 1C). It was reported that this addition restricts the SL movement, such that the PRE data could be fit using just one conformer.11
The yeast cytochrome c (Cc) and cytochrome c peroxidase (CcP) complex is a well-studied, highly dynamic electron transfer complex. CcP catalyzes the reduction of hydrogen peroxide to water using two electrons it receives from Cc.12 The crystal structure was determined in 1992, showing how Cc is docked on CcP.13 The orientation of Cc relative to CcP in the complex in solution was determined by Volkov et al. in 2006 by attaching MTSL at five positions on CcP. After confirming that the presence of the SLs did not interfere with complex formation, the PRE effects were measured for nuclei in Cc. The orientation of Cc was determined using rigid body docking with an ensemble of four MTSL conformers per position in orientations selected to represent the width of the ensemble of sterically allowed conformers, yielding a position of Cc that was close to the one observed in the X-ray crystal structure (RMSD = 2.2 Å for the Cα atoms of Cc).14 The data also provided evidence for the presence of an encounter complex, in which Cc assumed other orientations relative to CcP. In 2010 Bashir et al., described similar rigid body docking calculations in which the MTSL conformers at each position were treated as an ensemble of non-self-interacting conformers that could move freely during the docking, as could the nearby amino acid side chains. The result was a less precise protein complex ensemble, in which extreme positions for Cc were found within the ensemble when all the MTSL conformers were simultaneously orientated to one side or another; this effect appeared to have been averaged out in the previous study when the conformers were fixed in four dispersed orientations.15
In this work, we revisit the Cc–CcP complex using MTSL and pyMTSL tags at three positions close to the binding interface on the surface of CcP. The aim of this work is to establish whether PRE data alone are sufficient for accurate rigid body docking of two proteins that form a dynamic complex. The Cc–CcP complex spends approximately 30% of the time in an encounter state;14,16–19 therefore, the PRE represent not only the well-defined stereospecific state but also the encounter state. We investigate the role of SL mobility by comparing MTSL to pyMTSL, as well as by comparing the use of a single SL conformer to the use of an ensemble of conformers when the orientations are either fixed or mobile during the calculations. The position of Cc in the X-ray crystal structure has been used as the benchmark, under the assumption that the well-defined state of the complex in solution is very similar to the one observed in the crystalline state. It is concluded that the results of PRE based docking are highly dependent on the choice of SL conformers. Thus, the SL flexibility appears to be the limiting factor for obtaining accurate results. SL ensembles of conformers optimized by intra-molecular PRE yield the best results.
The cw-EPR spectra were simulated using Matlab version 7.14.0.739 (Natick, Massachusetts, USA) and the EasySpin package version 4.5.5.23 For all simulations, the following spectral parameters were used: g = [2.00906, 2.00687, 2.00300]24 and the hyperfine tensor parameters AXX = AYY = 13 MHz. Isotropic rotation was assumed in all cases, to reduce the number of simulation parameters and get a better overall picture of the relative changes in rotational properties of the spin labels. Usually a superposition of more than one component was required to simulate the spectra. One of these components has a rotation correlation time of several tens of ps and a contribution of less than 0.5% to the total simulation and is therefore assigned to a small fraction of residual free spin label in the sample.
The 1H and 15N resonance assignments were obtained from previous studies for Cc27–29 and from our previous work for CcP.16 In the early stages of this work for the intra-molecular PREs of TC288-(py)MTSL, we noticed that residue 120 experienced a very strong PRE (the paramagnetic peak disappeared from the spectra) while residues 123 and 124 did not experience any PREs. It was concluded that residues 123 and 124 had been swapped with those of residues 101 and 102. BMRB entry 19884 containing these backbone resonance assignments has been updated.
The calculated R2,para values were then converted into distances using (eqn (1)):18
(1) |
The precision of the ensemble was quantified by the average of the pairwise RMSD values of members of the ensemble and its mean structure. The accuracy was assessed by comparing the mean structure of the calculated structure ensemble to the crystal structure of the complex, using both the root mean squared deviation (RMSD) and distance root mean squared deviation (DRMS) for these two structures, which is defined as (eqn (2)):34
(2) |
(3) |
Fig. 2 Locations of SLs attached on the surface of CcP at positions 38 (teal), 200 (blue) and 288 (green) (nitroxide oxygen atom in red). (A) The locations of three MTSL tags shown on CcP (white) with respect to the binding site of Cc. The relative orientations of MTSL (B) or pyMTSL (C) around the binding site on the surface of CcP are also shown; Cc is shown as a pink cartoon shadow. The haem groups are shown in black sticks (PDB-entry 2PCC).13 |
These three attachment positions ring the stereo-specific binding interface with Cc and have also previously been used to produce significant inter-molecular PRE effects on Cc.14,18 The spin-label mobility at these positions for MTSL and pyMTSL was compared using EPR measured in solution at room temperature (Fig. 3).
The EPR spectra of the spin labels at the three positions investigated show that the lineshapes of the nitroxides are under conditions of not fully averaged anisotropy, typical for spin-labels attached to proteins. In all cases, the spectrum of the protein labelled with MTSL at a given position has narrower lines than that of the protein labelled with pyMTSL, showing that the rotation of the nitroxide group in pyMTSL is slower than that of MTSL. More details can be seen from the parameters of the spectral simulation (Fig. S1, ESI† and Table 1).
Component 1 | Component 2 | Free MTSLa | |||||||
---|---|---|---|---|---|---|---|---|---|
τ radicalr (ns) | Fraction (%) | A ZZ (MHz) | τ radicalr (ns) | Fraction (%) | A ZZ (MHz) | τ radicalr (ns) | Fraction (%) | A ZZ (MHz) | |
a Residual free spin-label fraction, contribution shows almost complete removal of free spin label from all samples. | |||||||||
C38 MTSL | 0.89 | 100.0 | 110.0 | n/a | n/a | n/a | n/a | n/a | n/a |
C38 pyMTSL | 0.63 | 20.0 | 108.5 | 2.20 | 79.5 | 102 | 0.05 | 0.5 | 108.0 |
C200 MTSL | 0.89 | 99.7 | 115.0 | n/a | n/a | n/a | 0.02 | 0.3 | 109.8 |
C200 pyMTSL | 0.81 | 17.0 | 109.8 | 8.70 | 82.7 | 94 | 0.05 | 0.3 | 108.0 |
C288 MTSL | 2.23 | 42.0 | 100.0 | 10.00 | 57.7 | 97 | 0.02 | 0.3 | 109.8 |
C288 pyMTSL | 3.20 | 25.0 | 101.0 | 10.00 | 74.9 | 101 | 0.05 | 0.1 | 108.0 |
For CcP positions 38 and 200, MTSL has a rotation correlation time (τradicalr) of 0.9 ns, which is the single component contributing to these spectra. When using the pyMTSL tag, the radical has a majority contribution with a 2.5 fold (position 38C) and a more than ten-fold increased rotation correlation time (position 200C) (see Table 1). For the 288 position the MTSL label is rotating more slowly than at the other positions: two contributions are observed with almost equal weight and correlation times of 2 ns respectively 10 ns, which is significantly more immobilized than the MTSL at the other positions. Replacing MTSL by pyMTSL increases the rotation correlation time of the faster fraction to 3 ns and shifts the population to 75% of slower fraction, showing that also at this position, the pyridine substituent leads to a slowing down of the rotation.
The rotation correlation times of pyMTSL vary per position, showing that local interactions contribute to the mobility of pyMTSL and that the mobility of pyMTSL is not exclusively determined by its side-chain structure. Furthermore, none of the rotation correlation times reach the value of the protein, showing that the spin label is not completely anchored to the protein, and demonstrating that the linker connecting the nitroxide-ring to the protein backbone possesses local degrees of freedom. Note that EPR reports on the correlation time of the radical, whereas PREs reflect the rotational correlation time of the radical-nuclear vector (τr).
When the SL orientations were fit to the data using only a single conformer, the position of the nitroxide radical for the twenty lowest energy structures converged to a fairly precise position at all three attachment sites (Fig. 4-i). Also, little difference was seen between MTSL and pyMTSL. When using an ensemble of four conformers, however, the most favourable orientations were more dispersed and two or three distinct populations appeared (Fig. 4-ii). Table 2 reports the Q-values and average violations (see Materials and methods for the definition of these measures) of the various fits. Interestingly, the ensemble fit yields results that are only marginally better than those obtained with the single conformers. Thus, on the basis of these data it cannot be decided which is the better description of SL conformations.
Position | MTSL | pyMTSL | |||
---|---|---|---|---|---|
1 conformer | 4 conformers | 1 conformer | 4 conformers | ||
C38 | AV (Å) | 2.94 | 2.44 | 2.83 | 2.57 |
Q-factor | 0.20 | 0.17 | 0.18 | 0.17 | |
C200 | AV (Å) | 1.65 | 1.61 | 1.82 | 1.80 |
Q-factor | 0.15 | 0.14 | 0.15 | 0.14 | |
C288 | AV (Å) | 2.35 | 2.28 | 2.16 | 2.13 |
Q-factor | 0.18 | 0.17 | 0.16 | 0.15 |
Essentially the same position was found for the nitroxide radical when using either MTSL or pyMTSL. For C38-MTSL (Fig. 4A and B-ii), the ensemble showed two distinct populations, one with a very well-defined location of the nitroxide radical and one more dispersed, with ratio of approximately 25%:75%. For pyMTSL, the positions are similar but the populations have a ratio closer to 50%:50%. Furthermore, the well-defined and disperse populations are swapped when compared to those for MTSL. The EPR data (Fig. 3A) showed a marked decrease in mobility for pyMTSL compared to MTSL at position C38 but this difference was not reflected in the number of calculated SL orientations. Furthermore, the populations found by NMR were not reflected in the EPR simulations (Table 1). For example, for C38 MTSL the EPR simulations found only one highly mobile population while for C38 pyMTSL the EPR simulations found two populations but in a ratio of 20.0%:79.5% as compared to roughly 50%:50% found by NMR. Differences between the ratios of populations found by EPR and NMR were also seen for both SLs at C200 and for MTSL at C288 (see discussion below).
For position C200, the EPR data showed a similar decrease in mobility for pyMTSL compared to MTSL, which is reflected in a much more defined positioning of the nitroxide radical for pyMTSL compared to MTSL in the calculated SL ensemble orientations, although the general location of the radicals was the same in both cases (Fig. 4C and D). When using an ensemble of four conformers at this position, three distinct populations were found with a ratio of about 25%:50%:25% for both SLs. However, the EPR simulations found only one population for C200 MTSL and only two populations for C200 pyMTSL with a ratio of 17.0%:82.7% (Table 1).
The NMR spectra for C288-(py)MTSL showed chemical shift perturbations for peaks of the residues near the spin label attachment site (small shifts for residues 284 and 285; larger shifts for residues 286–294) indicating that the presence of the spin label affects the local backbone structure of the C-terminal loop. Therefore, the position of the nitroxide radical was also determined by docking pseudoatom(s) to CcP using the intra-PRE data, representing the paramagnetic centre(s) unrestrained by the covalent linkage to the CcP backbone. The resulting positions and fit to the data were very similar to those found when the SLs were attached to CcP, indicating that linking the SL to the backbone of CcP seen in the crystal structure did not interfere with determining the experimentally most favourable orientations. As observed for the other SL positions, the ensemble of conformers at C288 showed more than one population for both SLs, with a ratio of about 25%:75%. Interestingly, the EPR simulations also showed two populations for both SLs at C288 with a ratio of 42.0%:57.7% for MTSL and a ratio of 25.0%:74.9% of pyMTSL.
Overall, there seems to be little correlation between the populations of SL conformers found using NMR and those found during the EPR simulations; C288 pyMTSL is the only position for which ratio between the populations found with NMR matched those found with EPR. Furthermore, the τradicalr values found during the EPR simulations (0.63–3.20 ns for the fast components; 2.20–10.00 ns for the slow components) were much lower than the estimated τc of the protein (16 ns).14,18 For spin labels, the value of τs is large so τc is dominated by τr in the formula τc−1 = τr−1 + τs−1.4–6 The value of τr is affected by any motions that change the vector between the paramagnetic centre and the nucleus such as protein tumbling, SL mobility, and local protein dynamics6 but of these usually only protein tumbling is taken into account35 as it is generally assumed to dominate the relaxation measurements.36 Despite shorter τradicalr values for the SLs, the assumption that protein tumbling dominates τc appears to hold true here as previous tests on the same system using lower τc values of 4 or 12 ns did not improve the docking results.14 This discrepancy between τr and τradicalr can be understood if it is assumed that the short correlation of the radical is caused mainly by small movements of the SL but have little influence on the length and orientation of the radical-nuclear vector in the external field. Rearrangement of this vector is dominated by rotation of the protein.
The EPR spectra for position C288 showed that both MTSL and pyMTSL were highly immobilized (Fig. 3C), so highly defined locations for the most favourable conformer orientations may be expected. When using a single conformer, the nitroxide radical positions were indeed precisely defined (Fig. 4E and F-i) and also the radical positions for the pyMTSL ensemble are well defined (Fig. 4E-ii). However, it should be noted that the calculations give no evidence for stronger steric restrictions. The restraints used in the calculations are to the oxygen of the SL. Other atoms can sample the conformational space as far as allowed by the restrained position of the oxygen atom. This is illustrated by the pyridyl ring that occupies a wide range of orientations for the pyMTSL in each of three positions. It is not obvious that this range is more limited for C288.
To see how well the calculated structures of the most favourable orientations fit the PRE data, the distances from the paramagnetic centre to the amide protons were back-predicted using r−6 averaging over all conformer orientations present in the best twenty solutions (Fig. 4) and then compared to the experimentally observed distances (Fig. S2, ESI†). The quality of fit parameters for all data sets are given in Table 2.
Very little difference is seen between the calculated distances, and thus the quality parameters, for the single conformer and ensemble solutions. Therefore, it cannot be established whether the single conformer or the ensemble is the better description for the SL, so both have been used in the docking calculations to allow for a comparison. Also little difference is present between solutions for MTSL and pyMTSL, in accord with similarity between the input PRE data sets for both SL types. There are, however, significant differences in the quality of fit between the SL positions. The best fits are observed for C200. The reason is not evident.
During the docking, first a single SL conformer was used that was either free to rotate or fixed in the experimentally determined most favourable orientation. Docking was repeated 10000 times from random starting positions of Cc and an ensemble was generated from the twenty lowest energy solutions for MTSL (Fig. 5A–D) and pyMTSL (Fig. 5E–H).
Fig. 5 Twenty lowest energy solutions for docking Cc to CcP driven by intermolecular PRE data using a single SL conformer that was free to move during docking (MTSL A, B; pyMTSL E, F) or that was fixed in the experimentally determined most favourable orientation (MTSL C, D; pyMTSL G, H). CcP is shown in grey ribbons and Cc is shown in multi-coloured ribbons. The SLs are shown in sticks at positions 38 (teal), 200 (blue) and 288 (green), with the nitroxide oxygen atom in red and the Cc haem group is shown in multi-coloured sticks. The docking was done using the Cc and CcP structures taken from PDB entry 2PCC.13 |
When the SL is free to move during docking, the position of Cc as well as the SL orientations in the resulting ensemble of the twenty lowest energy solutions were much more dispersed than when the SL orientations were fixed; this can been seen most clearly when the positions of the Cc haem groups are compared. Similar results were observed in a previous study.15 Fixing the SL orientations provides a more precise description of the Cc position, indicating that the docking is reproducible.14,18 Note that the higher precision does not imply a result that is closer to the benchmark, in this case the position of Cc observed in the crystal structure, as is discussed below.
The distances between the Cc amide protons and the SL nitroxide radicals in the ensemble were compared to the experimental distances. Some discrepancies in the fit are to be expected since the data were obtained for the Cc–CcP complex in solution, which is known to consist of 30% encounter complex and 70% stereo-specific complex, while the structure comparison was done with the crystal structure, which consists only of the stereo-specific complex.14 Nevertheless, there was a good overall fit between the back-calculated and experimental distances (Fig. S3, ESI†). The fit was evaluated using the AV and Q-factors (Table 3).
Position | MTSL | pyMTSL | |||
---|---|---|---|---|---|
1 conformer free | 1 conformer fixed exp. | 1 conformer free | 1 conformer fixed exp. | ||
C38 | AV (Å) | 1.98 | 2.55 | 2.01 | 2.53 |
Q-factor | 0.10 | 0.15 | 0.10 | 0.14 | |
C200 | AV (Å) | 2.13 | 2.51 | 1.88 | 2.19 |
Q-factor | 0.10 | 0.12 | 0.09 | 0.12 | |
C288 | AV (Å) | 1.21 | 2.17 | 1.44 | 2.04 |
Q-factor | 0.07 | 0.12 | 0.08 | 0.11 |
From the AV and Q-factors, it is clear that a freely rotating conformer much better describes the inter-molecular PRE data than does a single fixed conformer, despite the fixed position being determined experimentally with intra-molecular PRE data. This is not surprising since there are more degrees of freedom during the docking in this case; both the SL orientation and the position of Cc are fit to the inter-molecular PRE data when the SL is free to move.
The results were also compared to the stereo-specific orientation found in the crystal structure (Fig. 6). Overall, the Cc positions were similar to that of the X-ray crystal structure, although the calculated Cc ensemble was rotated slightly around the stereo-specific binding interface in all data sets. The RMSD of the Cα atoms was calculated by first generating an average structure from all the Cc orientations, done by taking the linear average of the individual structures, and then comparing that to the stereo-specific orientation in the X-ray crystal structure. The RMSD is sensitive to differences caused by both rotation and translation, while the DRMS is mainly sensitive to translation. The DRMS is calculated by determining the Cα–Cα distance matrix of for all Cα pairs from the two structures and then taking the root-mean-square deviation (eqn (3)). The DRMS is always smaller than the RMSD and a large RMSD in combination with a small DRMS indicates that the two structures are mostly rotated relative to each other.37
Fig. 6 Comparison of the Cc positions as viewed from spin labelled CcP. The Cc position in the crystal structure (orange; PDB entry 2PCC)13 is compared to the twenty lowest energy solutions for docking Cc to CcP (grey ribbons), driven by intermolecular PRE data using a single conformer that was free to move during docking (A and C) or that was fixed in the experimentally determined most favourable orientation (B and D). The SLs (MTSL A, B; pyMTSL C, D) are shown in sticks at positions 38 (teal), 200 (blue) and 288 (green), with the nitroxide oxygen atoms in red. |
For both fixed and free MTSL and pyMTSL, the RMSD values were much higher than the DRMS values, indicating that a large proportion of the difference between the calculated Cc orientations compared to the crystal structure is due to rotation of Cc, while the binding interface is similar. For MTSL, fixing the SLs in the experimentally determined most favourable orientations improved the fit to the crystal structure and decreased the DRMS from 2.2 Å for the freely rotating SLs to 1.7 Å with the SLs fixed (Fig. 6A and B). Also, the RMSD is significantly reduced from 6.6 Å to 3.9 Å. This indicates that for MTSL, experimentally determining the most favourable SL orientations prior to protein docking improved the accuracy of the final structure. For pyMTSL, the DMRS values were similar in both cases, with a DRMS of 1.8 Å and 1.9 Å when using freely rotating and fixed SLs, respectively.
Fig. 7 Twenty lowest energy solutions for docking Cc to CcP driven by intermolecular PRE data using an ensemble of four MTSL conformers that were free to move during docking (A and B) or that were fixed in selected positions (C and D) or experimentally determined most favourable orientations (E and F). CcP is shown in grey ribbons and Cc is shown in multi-coloured ribbons. The SLs are shown in sticks at positions 38 (teal), 200 (blue) and 288 (green), with the nitroxide oxygen atom in red and the Cc haem group is shown in multi-coloured sticks. The docking was done using structures from PDB entry 2PCC.13 |
Fig. 8 Twenty lowest energy solutions for docking Cc to CcP driven by intermolecular PRE data using an ensemble of four pyMTSL conformers that were free to move during docking (A and B) or that were fixed in selected positions (C and D) or experimentally determined most favourable orientations (E and F). CcP is shown in grey ribbons and Cc is shown in multi-coloured ribbons. The SLs are shown in sticks at positions 38 (teal), 200 (blue) and 288 (green), with the nitroxide oxygen atom in red and the Cc haem group is shown in multi-coloured sticks. The docking was done using structures from PDB entry 2PCC.13 |
As observed when using a single SL, allowing the SLs to rotate during docking resulted in a large, dispersed ensemble of the twenty lowest energy solutions for both MTSL and pyMTSL, while fixing the SL conformer orientations produced a well-defined position for Cc. The back-calculated distances between the Cc amide protons and the SL nitroxide radical oxygens were similar when using either free or fixed SL conformers and there was a good overall agreement with the experimental distances (Fig. S4, ESI†). The fits were evaluated using the AV and Q-factors (Table 4).
Position | MTSL | pyMTSL | |||||
---|---|---|---|---|---|---|---|
4 conformers free | 4 conformers fixed sele. | 4 conformers fixed exp. | 4 conformers free | 4 conformers fixed sele. | 4 conformers fixed exp. | ||
C38 | AV (Å) | 2.35 | 2.53 | 2.66 | 2.00 | 2.18 | 2.40 |
Q-factor | 0.12 | 0.14 | 0.15 | 0.10 | 0.11 | 0.13 | |
C200 | AV (Å) | 1.93 | 2.96 | 2.17 | 2.00 | 2.10 | 2.03 |
Q-factor | 0.10 | 0.14 | 0.12 | 0.08 | 0.11 | 0.11 | |
C288 | AV (Å) | 1.32 | 1.51 | 2.24 | 1.60 | 1.80 | 2.32 |
Q-factor | 0.08 | 0.09 | 0.12 | 0.09 | 0.10 | 0.12 |
Again, the best fits were found when the SL orientations were free to move during dynamic docking; i.e., the SL conformer orientations were fit to the PRE data along with the position of Cc. This indicates that, although fixing the SL orientations results in a very precise determination of the Cc position, using four fixed positions for the nitroxide radical and, thereby, limiting the degrees of freedom, cannot describe the observed PRE data completely.
Surprisingly, optimizing the SL positions using experimental data did not yield a better fit to the inter-molecular PRE data than simply selecting four orientations from the sterically allowed possibilities (see Material and methods for details of how the conformers were selected). The back-calculated distances are determined using the orientation of the complex found in the X-ray crystal structure, assuming that the orientation of the complex in solution is very similar to the one observed in the crystalline state. However, it is known that the complex is in fast exchange on the NMR timescale between this stereo-specific orientation, accounting for 70% of the complex in solution, and a more dynamic the encounter state, accounting for the remaining 30%.14,18,19 The experimental PREs are a (non-linear) average of all orientations in solution, so comparing them to only the stereo-specific orientation will limit the quality of the fit. Nevertheless, the overall fit to the PRE data was reasonably good for all conditions.
The results for docking Cc to CcP were also compared to the crystal structure of the stereospecific complex (Fig. 9). As shown previously,8,9 the best results were obtained for MTSL when using multiple conformers. Furthermore, as seen when using a single SL conformer, many of the resulting Cc orientations are rotated with respect to the stereospecific binding site resulting in high RMSD values but small DRMS values for the ensembles. For MTSL, fixing the conformers in the selected orientations produced the worst fit the with crystal structure (RMSD = 6.7 Å; DRMS = 2.2 Å). The fit improved when the SLs could freely rotate (RMSD = 4.8 Å; DRMS = 1.8 Å) but the best fit was obtained when using the experimentally determined conformer orientations (RMSD = 2.5 Å; DRMS = 1.5 Å). This indicated that the most accurate description of the protein complex was achieved by predetermining the conformer orientations using intra-molecular PREs, even though the quality of fit to the experimental data was not the best for this solution.
Fig. 9 Comparison of the Cc positions as viewed from spin labelled CcP. The Cc position in the crystal structure (orange; PDB entry 2PCC)13 is compared to the twenty lowest energy solutions for docking Cc to CcP (grey ribbons), driven by intermolecular PRE data using an ensemble of four conformers that were free to move during dynamic docking (A and B) or that were fixed in selected positions (C and D) or experimentally determined most favourable orientations (E and F). The SLs are shown in sticks at positions 38 (teal), 200 (blue) and 288 (green), with the nitroxide oxygen atoms in red and the Cc haem group shown in multi-coloured sticks. |
In a previous study, PREs from MTSL at positions 38, 200 and 288 were fixed in selected positions while Cc was docked to CcP.14 The same conformers were used in this study. An RMSD with the crystal structure of 2.2 Å was found in that study, which is much smaller than the RMSD values that we observed, of 6.7 Å. This large discrepancy is likely due to small differences in the experimental PREs and the fact that normalization of the Ipara/Idia data (as described in the Materials and methods) was not done in the previous study. Although the Ipara/Idia ratio for residues unaffected by PRE is expected to be 1.0, we frequently observe values that are on average slightly higher or slightly lower. The reason for this is unclear but is likely due to slight differences between the diamagnetic and paramagnetic samples. The transverse relaxation rate is very sensitive to the exact fraction bound, because the rotational correlation time of the CcP–Cc complex is much larger than that of free Cc. If the concentrations of the proteins vary slightly, for example, due to slight aggregation, the fraction bound Cc can differ between the paramagnetic and diamagnetic samples. Since it is a global effect on all residues, and the deviation in the average Ipara/Idia ratio can be both larger and smaller than 1.0, non-specific contact between Cc and the paramagnetic protein cannot be the cause of this effect. This normalization was not done in the previous study14 resulting in shorter distance restraints than were obtained in this study, particularly for position C38, which allowed Cc to find a position much closer to the SLs than was allowed in this study. These findings demonstrate that the outcome of the docking is very sensitive to relatively small differences in the experimental data set.
For pyMTSL, the DRMS values showed that using pyMTSL resulted in a very good fit to the stereo-specific state but the RMSD values were quite high. Therefore, the differences between the final Cc positions found after docking and the stereo-specific state were mainly due to rotation and not translation of Cc. When the SL was free to rotate, the resulting fit had an RMSD of 4.2 Å and DRMS 1.4 Å. Fixing the conformers in the selected orientations resulted in greater rotation of Cc relative to the stereo-specific state (RMSD = 5.5 Å; DRMS = 1.2 Å). Fixing the SLs in the experimentally determined orientations resulted in a lower RMSD of 3.2 Å and DRMS 1.2 Å, in line with the results for MTSL. Therefore, in this case, it was necessary to represent pyMTSL using multiple conformers to get the best results.
As was shown previously,14 fixing the SL orientations during docking produces a highly defined ensemble of solutions. However, precision is of no importance in the absence of accuracy, which can be difficult to achieve when the fixed SL orientations must be selected without experimental data. The use of intra-molecular PRE to predetermine the most favourable SL orientations improved the accuracy of structure determination despite the fact that this did not necessarily improve the fit between the experimental and back-calculated data. In this case, we used the stereo-specific orientation of Cc observed in the X-ray crystal structure as the benchmark for accuracy under the assumption that this is also the main state of the complex in solution. However, as mentioned above, this stereo-specific state only accounts for 70% of the complex in solution, with the remaining 30% in the encounter state,14,16–19 resulting in discrepancies in the fit. Furthermore, under most conditions, a significant rotation of Cc around the stereo-specific binding interface was observed. This is likely due to the fact that PREs are highly sensitive to minor states38 giving them a disproportionally large influence on the final orientation of Cc determined by the docking calculations. Although fixing the SLs in the experimentally determined most favourable orientations helped to reduce this effect, we conclude that PREs do not yield very reliable restraints for determining protein orientation within a complex.
Much better results can be obtained when combining PCS or RDC with the PRE data.39–48 This was demonstrated recently by Hiruma et al. for the cytochrome P450cam–putidaredoxin complex. Paramagnetic tags were placed at two locations on cytochrome P450cam as well as on one location on putidaredoxin and intermolecular PCS, RDC and PRE were obtained. The PCS and RDC back-calculated from the final, well-defined structure matched the experimental data very well, as did the PRE data for the one of the tag positions on cytochrome P450cam. For the second position, however, the experimental PREs were stronger than expected for many residues and the authors concluded that this was due to a minor state influencing the PRE results. Nevertheless, the use of PCS and RDC in combination with PRE data allowed for the structure of the major state to be successfully resolved, as judged by the subsequently determined crystal structure of the complex, which showed a 1.7 Å RMSD with the mean of the NMR ensemble of structures.49 This work was done using different paramagnetic lanthanoid ions, which can be attached to the protein via double armed caged lanthanide NMR probes (CLaNP), having the additional benefit of being highly immobilized and therefore limiting spatial averaging of the observed paramagnetic effects.50,51
For structural modelling paramagnetic restraints can also be combined with other NMR data, such as RDCs obtained using external alignment media and NOEs.39,42–47 This was done recently by Shi et al. who used PRE data to complement both NOE and RDC data for the integral-membrane protein phospho-lamban (PLN). PLN is a small protein consisting of two helical domains linked by a flexible loop. The NMR structure of the monomeric protein in dodecylphosphocholine micelles had previously been solved by their group using NOE and solvent PRE but the resulting ensemble of conformers in the membrane was very poorly defined.52 Combining the NOE data with RDCs resulted in 100 low energy structures that were highly refined (backbone RMSD = 1.6 Å) but were grouped into four families exhibiting a four-fold degeneracy in the relative orientations of the two helices. By also incorporating PRE data into the structure refinement, the degeneracy of the RDC data was overcome and the correct family of structures was obtained with high resolution (backbone RMSD = 1.2 Å).48 This highlights the benefit of combining multiple data sets when resolving the structure of dynamic and/or multi-domain proteins, especially when together they can provide both distance and orientation information.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c5cp03781f |
This journal is © the Owner Societies 2016 |