David
Blocquel
,
Johnny
Habchi
,
Antoine
Gruet
,
Stéphanie
Blangy
and
Sonia
Longhi
*
CNRS, UMR 6098, 13288, Marseille, France. and Aix-Marseille Univ., Architecture et Fonction des Macromolécules Biologiques (AFMB), 13288, Marseille, France. E-mail: Sonia.Longhi@afmb.univ-mrs.fr; Fax: +33 4 91 26 67 20; Tel: +33 4 91 82 55 80
First published on 23rd November 2011
Henipaviruses are recently emerged severe human pathogens within the Paramyxoviridae family. Their genome is encapsidated by the nucleoprotein (N) within a helical nucleocapsid that recruits the polymerase complex via the phosphoprotein (P). We have previously shown that in Henipaviruses the N protein possesses an intrinsically disordered C-terminal domain, NTAIL, which undergoes α-helical induced folding in the presence of the C-terminal domain (PXD) of the P protein. Using computational approaches, we previously identified within NTAIL four putative molecular recognition elements (MoREs) with different structural propensities, and proposed a structural model for the NTAIL–PXD complex where the MoRE encompassing residues 473–493 adopt an α-helical conformation at the PXD surface. In this work, for each NTAIL protein, we designed four deletion constructs bearing different combinations of the predicted MoREs. Following purification of the NTAIL truncated proteins from the soluble fraction of E. coli, we characterized them in terms of their conformational, spectroscopic and binding properties. These studies provided direct experimental evidence for the structural state of the four predicted MoREs, and showed that two of them have clear α-helical propensities, with the one spanning residues 473–493 being strictly required for binding to PXD. We also showed that Henipavirus NTAIL and PXD form heterologous complexes, indicating that the PXD binding regions are functionally interchangeable between the two viruses. By combining spectroscopic and conformational analyses, we showed that the content in regular secondary structure is not a major determinant of protein compaction.
A few distinctive properties, including their much larger genome size, led to the classification of NiV and HeV within the Henipavirus genus of the Paramyxoviridae family.9 A number of Henipavirus strains have been isolated from humans, bats, horses and pigs over a wide geographic area in the last decade. Noteworthy, Henipaviruses are also found outside Australia and Asia, thus extending the endemic area of one of the most pathogenic virus genera known in humans.3 The susceptibility of humans, their high pathogenicity, the wide host range and interspecies transmission led to the classification of HeV and NiV as biosecurity level 4 (BSL-4) pathogens.10 To date, no therapeutic agents nor vaccines are available to fight against these severe pathogens.
As in all Mononegavirales members, the negative-strand, non-segmented RNA genome of Henipaviruses is encapsidated by the nucleoprotein (N) within a helical nucleocapsid. This helical N:RNA complex, and not naked RNA, is the substrate used by the polymerase complex during both transcription and replication. The helical nucleocapsid, made of genomic RNA encapsidated into a linear non-covalent polymer of the N protein, has the characteristic herringbone-like structure typically observed in other Paramyxoviridae members.11–17 When expressed in heterologous hosts, NiV N was found to form nucleocapsid-like particles that contain RNA.18–21
Minigenome replicon studies showed that in Henipaviruses, the N protein, the phosphoprotein (P) and the large protein (L) are necessary and sufficient to sustain replication of viral RNA.22 By analogy with other Paramyxoviridae members, the polymerase complex is thought to consist of the L protein and of the P protein, with the latter serving as a tether for the recruitment of L onto the nucleocapsid template. As in all Mononegavirales members, Henipavirus N and P proteins were shown to interact with each other, being able to form both homologous and heterologous N–P complexes.23,24
So far, high-resolution structural information is only available for the fusion (F) and attachment (G) glycoproteins.25–28 For the N and P proteins, the only molecular data come from our previous studies.21,29–31 Those studies showed that the N-terminal region of P (PNT, aa 1–404 or 1–406) and the C-terminal region of N (NTAIL, aa 400–532) belong to the family of intrinsically disordered proteins (IDPs), although they both contain short order-prone segments.29 The occurrence of some residual, transient secondary and/or tertiary structure within NTAIL and PNT led to their classification within the premolten globules (PMG) subfamily within the family of IDPs.32–37 IDPs are ubiquitous, functional proteins that lack highly populated secondary and tertiary structure under physiological conditions and in the absence of a partner/ligand (for recent reviews on IDPs, see ref. 38–43). In contrast, we showed that the C-terminal X domain of the Henipavirus P proteins (PXD, aa 660–709 of NiV P and aa 657–707 of HeV P) is folded, and adopts an α-helical conformation.21Henipavirus NTAIL and PXD domains were shown to form a 1:
1 stoichiometric complex that is stable up to 1 M NaCl, and whose KD is within the μM range.21 Using bioinformatics approaches, we identified within both HeV and NiV NTAIL domains four putative molecular recognition elements (MoREs), where these latter are short order-prone regions within intrinsically disordered regions that have a propensity to undergo induced folding upon binding to partner(s).44–47 Among these predicted MoREs, at least two of them (aa 408–422 and 473–493) exhibit a clear α-helical nature, and spectroscopic studies carried out in the presence of trifluoroethanol (TFE) unveiled that both HeV and NiV NTAIL do possess α-helical propensities.29 In agreement, using far-UV circular dichroism (CD) and heteronuclear NMR we showed that binding to PXD triggers α-helical folding of NTAIL, with the resonance behavior suggesting a role for the predicted α-MoRE spanning residues 473–493 in binding to PXD.21 Using fluorescence spectroscopy, we showed that PXD has no impact on the chemical environment of a Trp residue introduced at position 527 of Henipavirus NTAIL, thus arguing for the lack of stable contacts between the C-terminus of NTAIL and PXD.21 Based on all these observations, and also by analogy with the related measles virus (MeV),48,49 we proposed a tentative structural model of the Henipavirus NTAIL–PXD interaction where the α-MoRE encompassing residues 473–493 adopt an α-helical conformation and are embedded between helices α2 and α3 of PXD, leading to a relatively small interface dominated by hydrophobic contacts.21
The objective of the present study was to directly assess the role of this α-MoRE in binding to PXD and to investigate the possible contribution of other short order-prone NTAIL segments to complex formation and folding. To this endeavor, for both HeV and NiV NTAIL proteins, we designed four constructs encoding truncated NTAIL proteins bearing various combinations of the four predicted MoREs. We report their bacterial expression, purification and characterization and show that removal of the α-MoRE spanning residues 473–493 does indeed abrogate binding to PXD. In contrast, removal of the other MoREs does not significantly affect the affinity of the binding reaction, although it impacts the folding propensities to various extents. Spectroscopic studies allowed us to shed light on the structural state of the various MoREs, showing that the MoRE encompassing residues 444–464 are unlikely to transiently populate a β structure. By combining spectroscopic and conformational analyses, we showed that the content in regular secondary structure is not a major determinant of protein compaction. Finally, we estimated the cross-reactivity between the HeV and NiV NTAIL–PXD interacting pairs and show that the PXD-binding regions are functionally interchangeable between the two viruses.
![]() | ||
Fig. 1 (A) Schematic representation of NTAIL deletion proteins. (Top) Structural organization of Henipavirus N proteins according to ref. 29 showing that they are composed of an N-terminal structured region (NCORE, aa 1–399) and a C-terminal disordered region (NTAIL, aa 400–532). The two putative α-MoREs (aa 408–422 and aa 473–493), with the second one corresponding to the region modeled at the PXD surface,21 are shown. A putative irregular MoRE (I-MoRE) (aa 523–532) and a putative MoRE of dubious state (aa 444–464, see region contoured by a dashed line)29 are also shown. The NTAILΔ1, NTAILΔ1Δ2, NTAILΔ4 and NTAILΔ3Δ4 deletion proteins are devoid of Box1, Box1 plus Box2, Box4 or Box3 plus Box4, respectively. They all possess an N-terminal hexahistidine tag. (B) 15% SDS-PAGE of purified deletion proteins followed by Coomassie blue staining. |
The gene fragments encoding the different NTAIL deletion proteins were cloned into the pDEST17OI vector to yield N-terminally histidine-tagged recombinant products, the expression of which is under the control of the T7 promoter being tightly controlled by IPTG. In all cases, the NTAIL proteins were readily expressed in E. coli and their solubility was high, thus allowing their recovery from the soluble fraction of the bacterial lysates. The proteins were purified to homogeneity (>95%) by immobilized metal affinity chromatography (IMAC) followed by preparative size exclusion chromatography (SEC) (Fig. 1B). The identity of the final purified proteins was confirmed by mass spectrometry analysis of the tryptic fragments obtained after digestion of the purified proteins excised from sodium dodecyl sulfate (SDS)-polyacrylamide gels: those analyses yielded for all NTAIL proteins a high sequence coverage attesting their identity and integrity (data not shown).
All the NTAIL proteins migrate in SDS-polyacrylamide gel electrophoresis (SDS-PAGE) with an apparent molecular mass higher than expected (see Fig. 1B and Table S1, ESI‡). Indeed the apparent molecular masses of the HeV and NiV NTAILΔ1 proteins are 18 and 19 kDa, respectively, while their expected molecular mass is 14 kDa (see Fig. 1B and Table S1, ESI‡). Likewise, the apparent molecular masses of HeV and NiV NTAILΔ4 proteins are 18 and 19 kDa, respectively, while their expected molecular mass is 15.5 kDa (see Fig. 1B and Table S1, ESI‡). The HeV and NiV NTAILΔ3Δ4 proteins both have an apparent molecular mass of approximately 12 kDa, while their expected mass is 10 kDa. As for the HeV and NiV NTAILΔ1Δ2 proteins, which have an expected molecular mass of 10 kDa, they display an even more abnormal electrophoretic mobility, with apparent molecular masses of approximately 14 kDa (HeV) and 17 kDa (NiV) (Fig. 1B and Table S1, ESI‡).
This abnormal migratory behavior has already been documented for both HeV and NiV full-length NTAIL proteins, where mass spectrometry analyses gave the expected results.29 This anomalous electrophoretic mobility is frequently observed in IDPs (for a few illustrative examples see ref. 12, 21, and 50–52) and has been ascribed to a rather high content of acidic residues as compared to globular proteins.53 The high negative charge is thought to lead to a larger Stokes radius of the protein–detergent complex together with a reduction in detergent binding.54 It should be noted however that the content in acidic residues of those NTAIL proteins that display the most aberrant migration, namely NiV and HeV NTAILΔ1Δ2 (see Fig. 1B and Table S1, ESI‡), is 11.1%, a value lower than the average acidic content (11.8%) of globular proteins53 (see Table S2, ESI‡). In addition, the content in acidic residues of the Henipavirus NTAILΔ1Δ2 proteins is also lower than that of the HeV and NiV NTAILΔ3Δ4 proteins (13.6% and 11.5%, respectively), where these latter have the same expected molecular mass as the NTAILΔ1Δ2 proteins and yet migrate less abnormally than these latter (see Tables S1 and S2, ESI‡). This observation suggests that other parameters, beyond the content in acidic residues, are responsible for the abnormal electrophoretic migration of IDPs. That the occurrence of PP and/or PXP motifs may be a possible determinant of the observed aberrant migration is unlikely. Indeed, while the HeV NTAILΔ1, NTAILΔ1Δ2 and NTAILΔ4 proteins possess a PP and a PKP motif, the NiV proteins, including the NiV NTAILΔ1Δ2 that displays the most aberrant migration, are all devoid of such motifs.
![]() | ||
Fig. 2 (A) Charge–hydropathy (CH) plot of Henipavirus NTAIL proteins. The mean net charge (R) is plotted against the mean hydropathy (H). In the left part of the CH plot, a protein is predicted to be intrinsically disordered, whereas it is predicted to be structured if it falls in the right part of it (see Experimental). |
Recent analyses of IDPs by the group of Pappu indicated sequence polarity as a determinant of IDP compaction: indeed polar IDPs were found to favor collapsed ensembles in water despite the absence of hydrophobic groups.59 To evaluate sequence polarity, we therefore calculated for each NTAIL protein the net charge per residue (NCR) as described by Pappu and co-workers59 (see Table S2, ESI‡). For all NTAIL proteins, the NCR is negative and well below the well-recognized threshold of 0.2, discriminating collapsed and extended IDPs.59 This suggests that the NTAIL proteins would preferentially populate collapsed states. According to Pappu and co-workers, these proteins would belong to the category of weak polyelectrolytes/polyampholytes, where |f+ − f−| ≤ 0.2 and each of f+ and f− are small.
![]() | ||
Fig. 3 SEC-MALLS-RI analysis of HeV (A) and NiV (B) NTAIL proteins. The right y-axis represents the molecular mass and the left axis represents the differential refractive index. The horizontal traces show the molecular masses calculated from light-scattering intensity at different angles and differential refractive indexes as a function of the elution volume. The concentrations of the injected samples were as follows: HeV NTAILΔ1: 18.5 mg mL−1, HeV NTAILΔ4: 23.3 mg mL−1; HeV NTAILΔ1Δ2: 14.5 mg mL−1; HeV NTAILΔ3Δ4: 12 mg mL−1; NiV NTAILΔ1: 12.2 mg mL−1; NiV NTAILΔ4: 17 mg mL−1; NiV NTAILΔ1Δ2: 14.5 mg mL−1; NiV NTAILΔ3Δ4: 17.7 mg mL−1. |
MMObs (kDa) | MMtheo (kDa) | R ObsS | R NFS | R IDPS | R PMGS | R US | R ObsS/RPMGS | R ObsS/RUS | CI1 | CI2 | |
---|---|---|---|---|---|---|---|---|---|---|---|
a Values were derived by previously reported mass spectrometry studies.29 b Values were derived by previously reported SEC studies.29 | |||||||||||
HeV | |||||||||||
NTAIL | 15.3a | 15.2 | 28.0b ± 3 | 19.4 | 30.8 (26.6) | 28.0 | 34.0 | 1.001 | 0.826 | 0.410 | 0.475 |
NTAILΔ1 | 15.5 | 14.5 | 30.5 ± 3 | 19.1 | 29.9 (25.9) | 27.4 | 33.0 | 1.112 | 0.922 | 0.184 | 0.247 |
NTAILΔ1Δ2 | 9.8 | 10 | 22.8 ± 3 | 16.7 | 24.6 (21.2) | 23.6 | 27.2 | 0.965 | 0.837 | 0.421 | 0.485 |
NTAILΔ4 | 15.3 | 15.7 | 31.4 ± 3 | 19.7 | 31.3 (26.4) | 28.4 | 34.5 | 1.107 | 0.909 | 0.211 | 0.285 |
NTAILΔ3Δ4 | 11.1 | 10.3 | 21.3 ± 3 | 16.8 | 25.3 (21.0) | 23.9 | 27.7 | 0.891 | 0.770 | 0.590 | 0.683 |
NiV | |||||||||||
NTAIL | 14.9a | 14.9 | 28.0b ± 3 | 19.4 | 30.8 (25.7) | 27.7 | 33.6 | 1.009 | 0.834 | 0.391 | 0.475 |
NTAILΔ1 | 14.0 | 14.3 | 31.2 ± 3 | 19.0 | 29.9 (25.0) | 27.2 | 32.8 | 1.145 | 0.952 | 0.115 | 0.197 |
NTAILΔ1Δ2 | 10.0 | 9.8 | 25.3 ± 3 | 16.6 | 24.6 (20.2) | 23.5 | 27.0 | 1.078 | 0.936 | 0.166 | 0.230 |
NTAILΔ4 | 15.5 | 15.5 | 27.0 ± 3 | 19.6 | 31.3 (25.5) | 28.1 | 34.2 | 0.959 | 0.790 | 0.492 | 0.569 |
NTAILΔ3Δ4 | 10.5 | 10.1 | 21.9 ± 3 | 16.8 | 25.3 (20.7) | 23.8 | 27.5 | 0.921 | 0.797 | 0.521 | 0.625 |
On the other hand, the finding that both HeV and NiV NTAILΔ1 proteins are the least compact proteins is not in agreement with the in silico analyses, although these latter predict Henipavirus NTAILΔ1 proteins to be less compact than the parental, full-length forms (see Fig. 2).
The compaction trend of HeV NTAILΔ1Δ2 and HeV NTAILΔ4 is inversed with respect to that of the corresponding NiV proteins: indeed, while HeV NTAILΔ1Δ2 has a compaction slightly higher than that of the parental full-length form and HeV NTAILΔ4 is less compact, the opposite scenario is observed in the case of the NiV proteins (see Table 1). This indicates that while in the case of HeV NTAIL, removal of Box4 leads to a more extended form, in the case of NiV NTAIL it is the removal of Box1 plus Box2 which leads to a decrease in compaction. This suggests the occurrence of differences in these boxes and/or in the remainder of the polypeptide chain in terms of local structural propensities and/or abilities to form tertiary contacts in the HeV and NiV NTAIL proteins.
Altogether, these data support a major role in both HeV and NiV NTAIL proteins for Box1 as a determinant of protein compaction. It is conceivable that Box1 could exert this effect by establishing long-range tertiary contacts and/or by adopting a transiently populated regular conformation that would result in a PMG-like form. The role of Box2 and Box4 is more dubious, given the poor agreement between the hydrodynamic and the in silico analyses and the inconsistencies between the two viruses. By contrast, experimental and computational data for both viruses consistently indicate that removal of Box3 plus Box4 results in a more compact form. The possibility that the shortening of polypeptide chain length might be a determinant of protein compaction is ruled out by the observation that the NiV NTAILΔ1Δ2 protein, which has an even slightly shorter chain (see Table S2, ESI‡), adopts a more extended conformation with respect to both HeV and NiV NTAILΔ3Δ4 proteins.
![]() | ||
Fig. 4 Far-UV CD spectra of HeV (A) and NiV (B) NTAIL proteins in 10 mM sodium phosphate pH 7 in the absence or in the presence of either 20% TFE or 3 M TMAO. The protein concentration was 0.1 mg mL−1. Spectra were recorded at 20 °C. Due to high background noise at lower wavelength (beyond 210 nm), the spectra in the presence of TMAO are shown to the point up to which the dyna voltage was in the permissible range. Data are representative of one out of two independent acquisitions. |
Trifluoroethanol (TFE) and trimethylamine N-oxide (TMAO) are secondary structure stabilizers that can be used as probes of hidden structural propensities of peptides and proteins. TMAO and other osmolytes may fold unstructured proteins due to the osmophobic effect, a solvophobic thermodynamic force, arising from the unfavorable interaction between the osmolyte and the peptide backbone.63–66 Although both TMAO and TFE act on the peptide backbone, the molecular mechanisms underlying their effects are different. Both solutes act by mimicking the hydrophobic environment experienced by proteins in protein–protein interactions, and can thus be used as probes to unveil regions having a propensity to undergo induced folding.67–70 It has long been known that TFE increases the propensity of amino acids to form an α-helix, presumably by strengthening peptide hydrogen bonds in TFE/H2O mixtures and through favorable interactions of hydrophobic amino acid side chains with TFE.71,72 Peptide hydrogen bonds in helices are believed to be stabilized indirectly by weakening the hydrogen bonding of water molecules to the peptide backbone in the coil form.72 As a result of weakening the hydrophobic interactions within the protein interior, TFE might promote helical structure in most peptides and proteins, even though this helical structure may be non-native.73–75 In contrast, TMAO increases the driving forces for protein folding due to its solvophobic effect on the backbone, forcing thermodynamically unstable proteins to fold without altering the rules for folding to a native-like conformation.64 Furthermore, in opposition to TFE solutions, the propensities of hydrophobic groups to interact with solvent are essentially the same in water as they are in TMAO solutions.76,77 Thus, due to the weakening of hydrophobic interactions, the dominant effect of TFE on proteins is protein denaturation accompanied by the preferential formation of α-helices as a result of the strengthening of peptide hydrogen bonds. Contrarily to that, TMAO promotes folding of unfolded proteins by providing an additional force for folding that has no preference for any particular secondary structure.66 It should be pointed out however, that even if TFE is known to stabilize α-helices more than β-strands, gain of α-helicity in the presence of TFE is not a general rule: for instance, (i) the acidic activator domain of GCN4 forms little or no α-helix in TFE concentrations as high as 30% and folds mostly as β-sheets in 50% TFE,78 (ii) the intrinsically disordered dehydrin Rab18 has an α-helical content as low as 2% in the presence of 90% TFE79 and (iii) the intrinsically disordered rat seminal vesicle protein IV exclusively undergoes β transitions in the presence of TFE (P. Palladino, S. Vilasi, R. Ragone and F. Rossi, personal communication). Furthermore, in the case of measles virus NTAIL, we have recently shown that TFE promotes α-helical folding of the 488–502 region only, with the downstream region only becoming slightly less mobile while retaining an extended conformation in the presence of 20% TFE.80 These observations attest the ability of TFE to show β propensities.
In order to assess the folding potential of the truncated NTAIL proteins, and taking into account the fact that both HeV and NiV NTAIL possess a predicted MoRE of the dubious β state, namely Box2,29 we recorded the CD spectra of NTAIL proteins in the presence of either TFE or TMAO, this latter being used with the specific purpose of unveiling possible β propensities. The TFE concentration in these studies was set to 20%, this choice being dictated by previous CD studies focused on full-length Henipavirus NTAIL proteins showing that most unstructured-to-structured transitions take place at this concentration.29
TMAO experiments were initially carried out with full-length proteins and the concentration of the osmolyte was gradually increased from 0 to 5 M (with 1 M steps). At TMAO concentrations as high as 4 and 5 M, protein precipitation occurred and the spectra were extremely noisy thus precluding meaningful analyses. Because of this observation and also taking into account the fact that previous studies carried out by others showed that 3 M TMAO was found to promote most dramatic structural transitions in various IDPs (for examples see ref. 81–83), we carried out subsequent experiments at 3 M TMAO, a concentration where significant structural transitions were observed and no protein precipitation occurred. Under these conditions, the spectra show a poor signal-to-noise ratio due to the strong absorption of TMAO in the far-UV region. This high background noise prevented meaningful analyses of the spectra beyond 210 nm, as the dyna voltage above this wavelength was found to fall in the non-permissible range.
All the NTAIL proteins, including the full-length forms (see Fig. S1, ESI‡ and ref. 29), show a loss of unordered structure upon addition of 20% TFE, as indicated by the characteristic increase in ellipticity at 190 and 198 nm (Fig. 4 and Fig. S1, ESI‡). Likewise, and in spite of the noisy nature of the spectra in the presence of TMAO, this latter was found to trigger a disorder-to-order transition in all NTAIL proteins, as judged from the drop in the ellipticity in the 210–240 nm region (Fig. 4 and Fig. S1, ESI‡). For both HeV and NiV NTAIL proteins, and irrespective of whether TFE or TMAO was added, the disorder-to-order transition is more pronounced for NTAILΔ1 and NTAILΔ1Δ2 than for NTAILΔ4 and NTAILΔ3Δ4, suggesting a role for Box3 and Box4 in the gain of regular secondary structure. Interestingly, NiV NTAILΔ1Δ2 exhibits a lower ability to undergo a structural transition in the presence of the osmolytes as compared to the HeV NTAILΔ1Δ2 protein (Fig. 4). This observation suggests the existence of subtle differences in the inherent folding propensities of Box1 and Box2 between the HeV and the NiV NTAIL proteins, with these differences having been already unveiled by hydrodynamic studies.
In order to achieve a quantitative assessment of the TFE and TMAO impact, the CD spectra were deconvoluted (see Experimental), thereby allowing the percentage of the various secondary structures to be inferred (Fig. 5). Although deconvolution approaches notoriously lead to estimations that cannot be taken as fully reliable, i.e. they often significantly deviate from the actual content in secondary structure as observed in experimentally determined structures (for examples see ref. 48 and 84), the estimated α-helical content is nevertheless very useful for comparative purposes. Note that the spectra recorded in the presence of TMAO could not be deconvoluted, as the wavelength range in which the dyna voltage was in the permissible range (namely, 260–210 nm) was too small to allow deconvolution by any of the programs implemented in the DICHROWEB server.
![]() | ||
Fig. 5 Secondary structure content of HeV (A) and NiV (B) NTAIL proteins in 0 and 20% TFE. The percentages of α-helices, β-strands, turns and unordered regions were calculated using CDSSTR. Values for full-length HeV and NiV NTAIL proteins were derived from spectra shown in Fig. S1 (ESI‡). |
As shown in Fig. 5, spectral deconvolution pointed out that for both viruses, NTAILΔ3Δ4 is the NTAIL protein with the lowest α-helical content, followed by the NTAILΔ1Δ2 proteins. In both viruses, NTAILΔ1 and NTAILΔ4 were found to have an α-helical content similar to that of the full-length form, arguing for a poor, if any, ability of Box1 and Box4 to adopt an α-helical conformation in water. On the other hand, data support a role for Box3 and, to a minor extent, for Box2 in sampling, at least transiently, α-helical conformations. Consistent with the hypothesis that Box1 poorly contributes to the overall α-helical content of both HeV and NiV NTAIL proteins, the addition of 20% TFE to both HeV and NiV NTAILΔ1 proteins pointed out a gain in α-helicity similar to that of the full-length form (see Fig. 5). Conversely, and in agreement with a possible involvement of Box3 and Box2 in α-helical folding, addition of TFE to Henipavirus NTAILΔ3Δ4 and NTAILΔ1Δ2 proteins led to a dramatic or significant drop, respectively, in the ability to undergo α-helical folding as compared to the full-length form (see Fig. 5). In particular, for both viruses, NTAILΔ3Δ4 is the NTAIL protein with the lowest ability to undergo α-helical folding, as judged from the finding that the HeV and NiV NTAILΔ3Δ4 proteins have the lowest α-helical content in 20% TFE. Surprisingly however, in both viruses, NTAILΔ4 does not gain α-helicity in the presence of TFE (see Fig. 5). This behavior, which suggests a role for Box4 in α-helical folding, was quite unexpected based on the observation that this latter form has an α-helical content similar to that of the parental, full-length form in the absence of osmolytes. Notably, for all NTAIL proteins, the addition of 20% TFE triggers a decrease in the β content, arguing for the lack of strong β propensities within the HeV and NiV NTAIL proteins (see Fig. 5). This observation, together with the finding that Box2 has an α-helical propensity, suggests that Box2 likely adopts an α-helical conformation, thus shedding light on the hitherto dubious state of this Box.29
Taking into account however the fact that TFE may favor (non-native) α-helical folding, we also sought at performing a quantitative analysis of spectra recorded in the presence of TMAO. To this endeavor, since deconvolution was not feasible for spectra recorded in the presence of TMAO, we derived the α-helical content of the various NTAIL proteins from the ellipticity at 222 nm as described in ref. 85 (see Fig. S2, ESI‡). In order to allow meaningful comparisons with spectra recorded in the absence of any osmolyte, as well as with spectra recorded in TFE, we also used this method to infer the α-helical content of the spectra of the NTAIL proteins alone and in the presence of 20% TFE (see Fig. S2, ESI‡). The absolute values of the α-helical content derived by this analysis turned out to be different from those provided by the deconvolution (cf.Fig. 5 and Fig. S2, ESI‡). Comparison of data obtained in the presence of TFE with data obtained in the presence of TMAO revealed that this latter is less efficient than TFE in promoting α-helical folding, as judged from the lower α-helical content of all NTAIL proteins in the presence of TMAO as compared to TFE. This finding is in agreement with previous reports (for examples see ref. 81 and 86). It should be pointed out however that the lower folding potential of 3 M TMAO with respect to 20% TFE may also arise from the possibility that this TMAO concentration does not correspond to the condition where the midpoint transition takes place.
In spite of the numerical differences between the two calculations, an overall similar trend, with a few exceptions, was observed. In agreement with the results obtained by deconvoluting the spectra, the data shown in Fig. S2 (ESI‡) show that (i) the NTAILΔ3Δ4 proteins were found to possess the lowest α-helical content in the absence of osmolytes and to have the lowest ability to undergo α-helical folding in the presence of TFE or TMAO, (ii) the NTAILΔ1Δ2 proteins have a borderline (HeV) or significantly reduced (NiV) ability to undergo α-helical folding in both TFE and TMAO as compared to the parental full-length forms, and (iii) the NTAILΔ4 proteins were found to possess an α-helical content similar (NiV) or slightly higher (HeV) than that of the parental proteins. In contrast with the data obtained by deconvoluting the spectra, the data shown in Fig. S2 (ESI‡) support the ability of both HeV and NiV NTAILΔ4 proteins to undergo α-helical folding in the presence of osmolytes, consistent with the lack of strong α-helical propensities within Box4. Another notable difference between the data obtained from the deconvolution of the whole CD spectra or from the sole ellipticity at 222 nm concerns NTAILΔ1. Based on the ellipticity at 222 nm, both HeV and NiV NTAILΔ1 proteins were found to possess an even higher ability to undergo α-helical folding than the parental, full-length proteins, although their α-helical content in the absence of osmolytes is comparable to that of the full-length protein (see Fig. S2, ESI‡). It should be pointed out however that the values of α-helical content obtained upon deconvoluting the spectra recorded over the whole wavelength range are likely to be more reliable than the data inferred from a single ellipticity value, with this being especially true for highly noisy spectra such as those recorded in the presence of TMAO.
Having in mind the reported ability of TMAO to unveil possible β propensities, we also attempted at obtaining insights about the β content from the spectra recorded in the presence of TMAO. To this end, we checked those spectra for the presence of a possible minimum at 218 nm, with a pronounced negative peak at this wavelength being a hallmark of β structure.87 None of the NTAIL spectra in the presence of TMAO displays such a minimum, with perhaps the only exception of HeV and NiV NTAILΔ4 proteins that have a spectral shape reminiscent of that of a partly β folded protein (see Fig. 4). Since however this spectral signature was not found in the spectra of the parental, full-length proteins in the presence of TMAO (see Fig. S1, ESI‡), we concluded that the present data do not support any pronounced β propensity within Henipavirus NTAIL.
Altogether, the present data support a role for Box3, and to a minor extent, for Box2 in α-helical folding of NTAIL. By contrast, Box1 was found to be devoid of α-helical propensities and not to be involved in α-helical folding. The observation that Box2 has an α, rather than a β, inherent structural propensity provides insights into this MoRE, the structural state of which has been uncertain so far.29 The finding that Box3 plays a major role in the osmolyte-induced α-helical transition provides support to our previous hypothesis that Box3 could be the region implicated in the PXD-induced α-helical folding of Henipavirus NTAIL.21 Data are also consistent with the lack of pronounced α-helical propensity within Box4, thus providing additional support to our previous computational analyses that predicted the occurrence of an irregular MoRE (I-MoRE) within residues 523–532. We can speculate that the removal of Box4 could reduce the ability of NTAIL to undergo α-helical folding through a mechanism that is independent from the inherent folding ability of this box and that rather reflects a possible stabilization of the α-helices within Box2 and Box3 through long-range tertiary contacts.
![]() | ||
Fig. 6 (A) 222–200 ellipticity plot modified from Uversky.34 The mean residue ellipticity at 222 nm (Θ222) of a set of well-characterized unfolded, random coil-like (RC-like) or premolten globule-like (PMG-like) proteins (from ref. 34) has been plotted against the mean residue ellipticity at 200 nm (Θ200). The position in the plot of Henipavirus NTAIL proteins is highlighted. (B) Plot of the ratio between the ellipticity at 222 nm and the ellipticity at 200 nm (Θ222/Θ200) of the same set of well-characterized RC-like or PMG-like proteins shown in (A). The position in the plot of Henipavirus NTAIL proteins is highlighted. The dashed line corresponds to the boundary between RC-like and PMG-like proteins. (C) CH-ellipticity plot of Henipavirus NTAIL proteins. For each NTAIL protein, the distance from the boundary in the CH plot, referred to as CH distance, has been plotted as a function of the distance from the boundary in the Θ222/Θ200 plot shown in (B). |
We then plotted the ratio between the Θ222 and Θ200 of the NTAIL proteins and of a set of well-characterized RC-like and PMG-like proteins34 (Fig. 6B). This ratio has the advantage over the individual Θ222 and Θ200 of being less affected by errors in estimations of protein concentrations, and of providing a unified parameter reflecting the content in both unordered and α-helical structure. We then arbitrarily set a threshold of the Θ222/Θ200 (see the dashed line in Fig. 6B) that allows RC-like IDPs to be discriminated from IDPs adopting a PMG-like conformation.
We then generated a plot in which the distance of each NTAIL protein from this threshold was plotted as a function of its CH-distance (Fig. 6C). This analysis is intended to combine, and hence extend, the two methods previously introduced by Uversky.34,58 Indeed, it combines a computational and a spectroscopic analysis and is in principle meant to allow RC-like forms to be distinguished from PMG-like forms among proteins predicted to be intrinsically disordered by the hydropathy/charge method. In this plot, increasingly negative CH distances designate proteins with increasing disorder, while increasingly positive θ222/θ200 distances designate IDPs becoming progressively more collapsed, as a consequence of an increased content in regular secondary structure. Thus, the left bottom quadrant is expected to correspond to IDPs adopting a RC-like conformation, while the right bottom quadrant is supposed to designate IDPs adopting a PMG conformation.
Surprisingly however, in the case of the NTAIL proteins, this analysis revealed that the HeV and NiV NTAILΔ3Δ4 proteins, which are predicted to be the most ordered by the charge/hydropathy method and which additionally were also shown to be the most compact forms by the hydrodynamic analyses (see Table 1), are the NTAIL proteins exhibiting the lowest distance from the θ222/θ200 boundary (see Fig. 6B) and hence expected to adopt the most extended conformation (see Fig. 6C). Thus, this analysis would wrongly predict the HeV and NiV NTAILΔ3Δ4 proteins to be more extended than the parental, full-length forms by virtue of their increased content in unordered structure and decreased content in α-helical structure. CD studies allowed the lower α-helical content of the HeV and NiV NTAILΔ3Δ4 proteins to be unambiguously accounted for by the lack of Box3, with the MoRE located within Box3 having been shown to be the one with the most pronounced α-helical propensity (Fig. 5 and Fig. S2, ESI‡).
The present data suggest that the amount of regular secondary structure is not seemingly a major determinant of protein compaction, with IDPs depleted in MoREs being nevertheless able to adopt a collapsed state. The bottom line of these observations is that the distance from the θ222/θ200 boundary, while being informative in terms of the relative content in unordered and α-helical structure, is not a reliable indicator of the extent of compaction of the Henipavirus NTAIL proteins. Whether this is a unique property of the Henipavirus NTAIL proteins or rather a general rule of IDPs remains to be established.
![]() | (1) |
![]() | ||
Fig. 7 ITC studies of homologous complex formation between PXD and HeV (A) or NiV (B) NTAIL deletion proteins. Data are representative of at least two independent experiments where the initial concentrations of NTAIL proteins in the microcalorimeter cell and of the homologous PXD in the microsyringe were slightly tuned. Panels (A) and (B) show the data obtained using 150 μM NTAIL and 1.5 mM PXD. Graphs shown in the bottom of each panel correspond to integrated and corrected ITC data fit to a single set of sites model (all sites identical and equivalent). The filled squares represent the experimental data, whereas the solid line corresponds to the model. The derived equilibrium dissociation constant (KD) as well as the stoichiometric number are shown. |
Stoichiometry, n | K D (μM) | Binding enthalpy, ΔH (cal mol−1) | ΔS (cal mol−1 deg−1) | |
---|---|---|---|---|
HeV NTAILΔ1 | 0.92 ± 0.015 | 5.18 ± 0.85 | −2771 ± 69.87 | 14.7 |
HeV NTAILΔ1Δ2 | 0.93 ± 0.010 | 3.98 ± 0.48 | −4082 ± 61.41 | 10.8 |
HeV NTAILΔ4 | 0.72 ± 0.015 | 7.14 ± 1.10 | −4222 ± 119.9 | 9.14 |
NiV NTAILΔ1 | 1.31 ± 0.036 | 9.00 ± 2.10 | −2305 ± 95.52 | 15.2 |
NiV NTAILΔ1Δ2 | 0.75 ± 0.016 | 7.24 ± 1.09 | −8184 ± 241.6 | −4.5 |
NiV NTAILΔ4 | 0.91 ± 0.02 | 9.00 ± 1.63 | −6586 ± 238.4 | 0.615 |
Stoichiometry, n | K D (μM) | Binding enthalpy, ΔH (cal mol−1) | Binding entropy, ΔS (cal mol−1 deg−1) | |
---|---|---|---|---|
a Data were taken from ref. 21. | ||||
HeV NTAIL–HeV PXDa | 1.37 ± 0.01 | 8.7 ± 0.55 | −5584 ± 62 | 4.09 |
NiV NTAIL–HeV PXD | 1.02 ± 0.007 | 7.6 ± 0.46 | −6001 ± 56.38 | 2.92 |
NiV NTAIL–NiV PXDa | 0.93 ± 0.07 | 2.1 ± 0.24 | −9034 ± 108.9 | −4.84 |
HeV NTAIL–NiV PXD | 1.12 ± 0.21 | 6.9 ± 1.32 | −2494 ± 81.24 | 15.1 |
The present results support a role for Box3 as a major determinant of PXD binding. These findings validate the previously proposed involvement of Box3 in complex formation, and provide additional support to our structural models of the Henipavirus NTAIL–PXD complexes.21
Having shown that Box3 is the NTAIL region indispensable for binding to PXD, we sought at assessing whether the Box3 regions could be functionally interchanged between the two viruses. We therefore carried out ITC studies in which the parental full-length NTAIL proteins were loaded in the cell and then titrated with the heterologous PXD protein (Fig. S3, ESI‡ and Table 3). As shown in Table 3, replacing HeV NTAIL by NiV NTAIL in the binding reaction with HeV PXD led to binding parameters quite comparable (i.e. within the error bars) to those observed for the homologous pair. Substituting NiV NTAIL with HeV NTAIL in the binding reaction to NiV PXD yielded a slight, though significant, increase in the KD with a concomitant variation in the enthalpic and entropic contributions: while the enthalpic contribution is reduced for the heterologous pair, the entropic penalty is suppressed and the ΔS becomes favorable, possibly reflecting a less pronounced disorder-to-order transition. In support of the significance of the interaction, it should be noted that no interaction was found between Henipavirus NTAIL and measles virus PXD (Habchi et al., unpublished).
These data suggest that the HeV and NiV Box3 regions can be functionally replaced by each other in binding to PXD, consistent with previous studies showing that Henipavirus N–P can form heterologous complexes.23 The HeV PXD scaffold seems to be more tolerant (i.e. equally able to accommodate the homologous and the heterologous Box3 region) than the NiV PXD. We can speculate that this difference could reflect the higher buried surface area (and hence the higher surface complementarity) of the NiV NTAIL–PXD complex as compared to the HeV one.21
![]() | ||
Fig. 8 Far-UV CD studies of HeV (A) and NiV (B) NTAIL proteins either alone (black line) or in the presence of a two-fold molar excess of the homologous PXD protein (full circles). The CD spectra of PXD proteins alone (grey line) are also shown, as are the theoretical average curves (empty circles) calculated by assuming that no structural variations occur (see Experimental). Spectra were recorded at 20 °C in 10 mM Tris/HCl pH 7.5, NaCl 200 mM. Protein concentrations were 10 mg mL−1. In NTAIL/PXD mixtures the NTAIL concentration was 350 μM, while that of PXD was 700 μM. The path length was 0.01 mm. Data are representative of at least two independent experiments. |
The first interesting observation that we made concerns the anomalous electrophoretic behavior of the NTAIL proteins, which could not be exclusively accounted for by their content in acidic residues, suggesting that other additional parameters can be responsible for the abnormal electrophoretic migration of IDPs in SDS-PAGE. A possible hint in this regard is provided by the behavior of the NiV NTAILΔ1Δ2 protein. Indeed, this protein was found to display the most aberrant electrophoretic mobility. Besides, its electrophoretic behavior is more anomalous than that of the cognate HeV NTAIL protein. While the two proteins have the same content in acidic residues, the NiV NTAILΔ1Δ2 protein was found to adopt a more extended conformation than the corresponding HeV protein. This observation suggests that the degree of protein extension in solution could be a possible additional parameter affecting the electrophoretic mobility of IDPs. In further support of this hypothesis, analysis of the NTAIL proteins by native polyacrylamide gel electrophoresis (see Fig. S4, ESI‡) clearly shows that the NiV NTAILΔ1Δ2 protein migrates much more slowly than the HeV NTAILΔ1Δ2 protein, consistent with the lower compaction of the NiV NTAILΔ1Δ2 protein as compared to the HeV one. In the same vein, the NiV NTAILΔ4 protein migrates much faster than the HeV NTAILΔ4 protein, reflecting the higher extent of compaction of the NiV protein as compared to the HeV one.
By combining computational, hydrodynamic and spectroscopic analyses, we could unveil regions playing a major role in protein compaction, as well as regions sampling transiently populated α-helices (Fig. 9A). In particular, the spectroscopic studies herein described revealed the structural state of Box2. The present results also support the occurrence within Henipavirus NTAIL of two I-MoREs (e.g. Box1 and Box4) and of two α-MoREs (e.g. Box2 and Box3), thus arguing for the lack of a transiently populated β structure within Box2 (Fig. 9A). Among the two α-MoREs, Box3 was found to be a major determinant of α-helical folding and ITC studies showed that this region is strictly required for binding to PXD (Fig. 9A). These results confirm the previously postulated role of Box3 in binding to PXD and provide direct experimental support for the proposed Henipavirus NTAIL–PXD complexes.21 In addition, ITC studies in which the full-length NTAIL proteins were titrated with the heterologous PXD, showed that the two Box3 regions are functionally interchangeable between the two viruses, consistent with previous immunoprecipitation studies showing that Henipavirus N–P can form heterologous complexes.23 The HeV PXD was found to be seemingly more tolerant than the NiV PXD (i.e. equally able to accommodate the homologous and the heterologous Box3 region), with this property possibly reflecting a higher surface complementarity in the case of the NiV NTAIL–PXD complex as compared to the HeV one.21
![]() | ||
Fig. 9 (A) Sequence alignment between the HeV and NiV NTAIL proteins. Hydrophobic residues (L, I, V, F, W and M) are shown in green, acidic residues in red and basic residues in blue. Asterisks and colons below the alignment denote identity and similarity, respectively, while dots indicate weakly similar residues. Lack of symbol below the alignment denotes strongly different residues. The front numbers correspond to the amino acid position in the N sequence. Dots above the alignment indicate intervals of 10 residues. The four boxes are shown, as is the location of the transiently populated α-helices, as unveiled by CD studies. (B) Schematic diagram summarizing the data obtained by computational and hydrodynamic analyses. The red and green arrows highlight the discrepancies found between the two viruses in terms of compaction properties of NTAILΔ1Δ2 and NTAILΔ4 proteins. |
Strikingly, and in agreement with previous studies,60 the content in regular secondary structure was found not to be a major determinant of protein compaction, with NTAIL proteins depleted in MoREs being nevertheless able to adopt a collapsed state (Fig. 9B). Indeed, removal of Box3 plus Box4, where Box3 has a strong inherent α-helical propensity as judged from CD studies, led to a more compact form (see Fig. 9B). This finding suggests that the region downstream Box3 (i.e. residues 493–532) is endowed with a large conformational freedom that confers to the full-length protein an overall extended character. Thus, the occurrence of a transiently populated α-helix within Box3 would not impart strong compaction constraints to NTAIL and would be unable to compensate for the high flexibility of the downstream region. Lack of a strict relationship between the content in regular secondary structure and protein compaction is further supported by hydrodynamic and spectroscopic studies focused on NTAILΔ1, which showed that Box1 plays a role as a major determinant of compaction, with this property being quite independent from its ability to adopt a regular secondary structure: indeed CD studies unveiled that removal of Box1 does not lead to any significant reduction in the α-helical content nor does it affect the α-helical folding abilities of NTAILΔ1 as compared to the full-length form (see Fig. 9). Therefore, in the case of the NTAIL proteins, the PMG state likely mainly arises from the occurrence of either long-range or short-range tertiary contacts rather than from local constraints imposed by transiently populated regular secondary structure elements. The occurrence of long-range tertiary contacts within IDPs has been already well documented (for examples see ref. 89–97), suggesting that long-range contacts may serve as major determinants of protein compaction not only in Henipavirus NTAIL proteins but in IDPs in general. The present findings are expected to stimulate future studies in this field and hence to shed light on the conformational behavior of IDPs and on how this latter is encoded by the amino acid sequence.
We also devised a method that combines computational and spectroscopic analyses, which was meant to allow RC-like IDPs to be discriminated from PMG-like IDPs. This method, which is a combination of two methods previously introduced by Uversky,34,58 revealed once again that the content in unordered and α-helical structure is a poor predictive criterion for the prediction of protein compaction. These findings somehow challenge the well-established notion that PMG-like IDPs have more residual secondary structures than RC-like IDPs.34
Given the poor agreement between the hydrodynamic and the in silico analyses and the inconsistencies between the two viruses in the case of the NTAILΔ1Δ2 and NTAILΔ4 proteins (see Fig. 9B), no definitive conclusions could be drawn as to the role of Box2 and Box4 in determining protein compaction. We reasoned that the observed discrepancies might be accounted for by differences in the NTAIL sequence leading to a different pattern in the tertiary contacts (whether long- or short-range) in the two viruses. In order to get insights into this issue, we compared the amino acid sequences of the HeV and NiV NTAIL proteins in view of highlighting possible differences. As shown in Fig. 9A, the NTAIL sequence is highly conserved in the two viruses (76% identity and 90% similarity). As expected for disordered regions not predicted to be involved in partner recognition, the regions connecting Box1 to Box2 and Box3 to Box4 are the most variable ones. The local sequence variability between Box3 and Box4 might account for conformational differences between HeV and NiV NTAILΔ1Δ2. On the other hand, the conformational differences between the HeV and NiV NTAILΔ4 proteins are more difficult to explain. Indeed, the possibility that the differences occurring in the region between Box1 and Box2 could be responsible for the differences between HeV and NiV NTAILΔ4 can be ruled out, based on the conformational similarity between HeV and NiV NTAILΔ3Δ4. Likewise, since the variable region between Box3 and Box4 does not seemingly lead to any significant difference in the conformational behavior of the HeV and NiV NTAILΔ1 proteins, it is difficult to explain how differences in this region could be responsible for the observed conformational differences between HeV and NiV NTAILΔ4. More subtle parameters are certainly at play, which remain however elusive so far.
As for Box3, while residues 474–487 are fully conserved in the two viruses, the last five residues are not (Fig. 9A). Given the functional equivalence of Box3 between the two viruses, this observation may imply that either residues 488–493 are not as much involved in PXD binding as the upstream residues, or that the partner can equally accommodate the IKESTA and AKEAAS stretches.
In conclusion, the present studies provide the first detailed mapping of the PXD binding region within NTAIL and provide the first quantitative estimation of the ability of Henipavirus NTAIL and PXD to form heterologous complexes. These studies designate specific Henipavirus NTAIL regions as targets for future site-directed mutagenesis studies and/or for the design of inhibitors capable to block the NTAIL–PXD interaction.
The charge–hydropathy (CH) plot was generated as described in ref. 34. The CH plot is divided into two regions by a line, which corresponds to the equation H = (R + 1.151)/2.785, where R is the mean net charge and H is the mean hydrophobicity. In the left part of the diagram (where H < (R + 1.151)/2.785), a protein is predicted as disordered, whereas it is predicted as ordered in the right part.34HBoundary was computed according to ref. 34: HBoundary = (R + 1.15)/2.785. The mean net charge (R) of a protein is defined as the absolute value of the difference between the number of positively and negatively charged residues at pH 7 divided by the total number of amino acid residues. It was calculated using the program ProtParam at the EXPASY server (http://www.expasy.ch/tools). The mean hydrophobicity (H) is the sum of normalized hydrophobicities of individual residues divided by the total number of amino acid residues minus 4 residues (to take into account fringe effects in the calculation of hydrophobicity). Individual hydrophobicities were determined using the Protscale program at the EXPASY server (http://www.expasy.ch/tools), using the options “Hphob/Kyte & Doolittle”, a window size of 5, and normalizing the scale from 0 to 1.
The net charge per residue (NCR) of the various NTAIL truncated proteins was calculated as described in ref. 59 as the difference between the fraction of positively charged residues (f+) and the fraction of negatively charged residues (f−).
The pairwise sequence alignment between the HeV and NiV NTAIL proteins was obtained using ClustalW (http://www.ebi.ac.uk/Tools/clustalw2/index.html).99 Sequence similarity and identity were calculated using the Emboss program100 (http://www.ebi.ac.uk/Tools/emboss/align/index.html).
All NTAIL constructs were obtained by PCR using Phusion (Finnzymes) polymerase and synthetic N genes (GenScript) optimized for the expression in E. coli, as templates. Primers (Operon) were designed to introduce an AttB1 and AttB2 sites at the 5′ and 3′ ends, respectively, and to amplify the desired part of the NTAIL ORF. After digestion with DpnI (New England Biolabs) to remove the methylated DNA template, and purification (Nucleospin Extract II, Macherey-Nagel), the PCR products were cloned using the Gateway® recombination system (Invitrogen) into the pDest17OI expression vector, a modified pDest17® (Invitrogen) to which LacO and LacI were inserted to allow a better control of protein expression.101 The resulting constructs encode N-terminally histidine tagged proteins in which the native sequence is preceded by a vector-encoded MSYYHHHHHHLESTSLYKKAGS amino acid stretch. The pDest17OI/NTAILΔ1 and pDest17OI/NTAILΔ1Δ2 gene constructs encode residues 423–532 and 465–532 of Henipavirus N proteins, respectively. The pDest17OI/NTAILΔ4 and pDest17OI/NTAILΔ3Δ4 gene constructs encode residues 400–522 and 400–472 of Henipavirus N proteins, respectively.
Selection and amplification of DNA constructs were carried out using CaCl2-competent E. coli TAM1 cells (Active Motif). The sequence of the coding region of all expression plasmids was verified by sequencing (GATC Biotech) and found to conform to expectations.
The NTAIL truncated proteins were purified using the protocol already described for Henipavirus wt NTAIL proteins29 with minor modifications. Briefly, cellular pellets were resuspended in 5 volumes (v/w) buffer A (10 mM Tris/HCl pH 8, 300 mM NaCl, 10 mM Imidazole, 1 mM phenyl-methyl-sulfonyl-fluoride – PMSF) supplemented with lysozyme 0.1 mg mL−1, DNAse I 10 μg mL−1, 20 mM MgSO4 and protease inhibitor cocktail (Roche, one tablet per 50 mL of bacterial lysate). After a 20 min incubation with gentle agitation, the cells were disrupted by sonication (using a 750 W sonicator and 4 cycles of 30 s each at 45% power output). The lysate was clarified by centrifugation at 30000 g for 30 min. Starting from a 1 L culture, the clarified supernatant was incubated for 1 h with gentle shaking with 4 mL Chelating Sepharose Fast Flow Resin preloaded with Ni2+ ions (GE, Healthcare), previously equilibrated in buffer A. The resin was washed with buffer A supplemented with 20 mM imidazole, and the recombinant protein was eluted in buffer A supplemented with 250 mM imidazole. Eluates were analyzed by SDS-PAGE for the presence of the desired protein product. The fractions containing the recombinant protein were combined, and then loaded onto a Superdex 200 HR 16/60 column (GE, Healthcare). The elution buffer was 50 mM Tris/HCl pH 7.5, supplemented with either 300 mM (NTAILΔ1 and NTAILΔ4) or 500 mM (NTAILΔ1Δ2 and NTAILΔ3Δ4) NaCl. Purification of Henipavirus PXD and full-length NTAIL samples was carried out as described.21
The proteins were concentrated using Centricon Plus-20 (molecular cutoff of 3000 Da for PXD, and of 5000 Da for NTAIL) (Millipore). All proteins were stored at −20 °C. Dialysis was used to exchange the buffer and adjust it to the ensuing analyses. All purification steps, except for gel filtrations, were carried out at 4 °C.
Protein concentrations of NTAIL truncated forms were derived using the theoretical absorption coefficients at 280 nm, as obtained using the program ProtParam at the EXPASY server (http://www.expasy.ch/tools). In the case of PXD samples, protein concentrations were estimated using the BCA protein assay reagent (Pierce), since estimations based on the theoretical absorption coefficients at 280 nm turned out to be not fully reliable.21
The purified NTAIL proteins were also analyzed by native polyacrylamide gel electrophoresis. In those experiments, migration was carried out in the absence of SDS. In addition, the sample was not boiled before being subjected to electrophoresis and the loading buffer was devoid of both SDS and β-mercaptoethanol. Proteins were thus separated as a function of their inherent charge and compaction.
The theoretical Stokes radii (RS, in Å) expected for a natively folded (RNFS), fully unfolded random coil state in urea (RUS) and natively unfolded PMG (RPMGS) protein with an expected molecular mass (MMtheo) (in Daltons) were calculated according to ref. 61:
log(RNFS) = 0.369 × log(MMtheo) − 0.254 | (2) |
log(RUS) = 0.521 × log(MMtheo) − 0.649 | (3) |
log(RPMGS) = 0.403 × log(MMtheo) − 0.239 | (4) |
The hydrodynamic radius of a natively unfolded protein with N residues was also calculated according to ref. 60 using either the simple power-law model:
RIDPS = R0Nν | (5) |
RIDPSseq = (APPro + B) × (C|Q| + D)Shis × R0Nν | (6) |
The compaction index (CI) is expressed as:
CI = (RUS − RobsS)/(RUS − RNFS) | (7) |
Mean ellipticity values per residue ([Θ]) were calculated as [Θ] = 3300 m ΔA/(lcn), where l (path length) in cm, n = number of residues, m = molecular mass in daltons and c = protein concentration expressed in mg mL−1. Number of residues (n) are 132 for NTAILΔ1 proteins, 90 for NTAILΔ1,2 proteins, 145 for NTAILΔ4 proteins, 95 for NTAILΔ3,4 proteins, 58 for HeV PXD and 57 for NiV PXD. Mass (m) values for HeV NTAIL proteins are 14519 Da (NTAILΔ1), 10
012 Da (NTAILΔ1Δ2), 15
775 Da (NTAILΔ4), and 10
319 Da (NTAILΔ3Δ4). Mass values for NiV NTAIL proteins are 14
286 Da (NTAILΔ1), 9862 Da (NTAILΔ1Δ2), 15
482 Da (NTAILΔ4), and 10
176 Da (NTAILΔ3Δ4). Mass values for PXD proteins are 6871 Da for HeV and 6733 Da for NiV. Protein concentrations of 0.1 mg mL−1 were used when recording spectra of NTAIL proteins either in the absence or in the presence of 20% TFE or 3 M TMAO.
When 0.01 mm thick quartz cells were used, protein concentrations as high as 10 mg mL−1 could be used. Use of high protein concentrations ensured achievement of saturation in induced folding experiments in the presence of the PXD homologous partner. In the case of protein mixtures (i.e. NTAIL + PXD), mean ellipticity values per residue ([Θ]) were calculated as [Θ] = 3300 ΔA/{[(C1n1/m1) + (C2n2/m2)]l}, where l (path length) = 0.001 cm, n1 or n2 = number of residues, m1 or m2 = molecular mass in daltons and C1 or C2 = protein concentration expressed in mg mL−1 for each of the two proteins in the mixture. The theoretical average ellipticity values per residue ([Θ]Ave), assuming that neither unstructured-to-structured transitions nor secondary structure rearrangements occur, were calculated as follows: [Θ]Ave = [([Θ]1n1) + ([Θ]2n2R)]/(n1 + n2R), where [Θ]1 and [Θ]2 correspond to the measured mean ellipticity values per residue, n1 and n2 to the number of residues for each of the two proteins, and R to the excess molar ratio of protein 2.
The experimental data in the 190–260 nm range were analyzed using the DICHROWEB website (http://dichroweb.cryst.bbk.ac.uk/html/home.shtml) which was supported by grants to the BBSRC Centre for Protein and Membrane Structure and Dynamics.102,103 The CDSSTR deconvolution method was used to estimate the content in secondary structure using the reference protein set 7. For the spectra recorded from TMAO-containing mixtures, which could be recorded in the 210–260 nm range only, the α-helical content was derived from the ellipticity at 222 nm as described in ref. 85.
Heat dilution of the ligand was taken into account from peaks measured after full saturation of the protein sample contained in the microcalorimeter cell by the ligand. A theoretical titration curve was fitted to the experimental data using the ORIGIN software (Microcal). This software uses the relationship between the heat generated by each injection and ΔH° (enthalpy change in cal mole−1), KA (association binding constant in M−1), n (number of binding sites per monomer), total protein concentration and free and total ligand concentrations. The variation in the entropy (ΔS in cal mol−1 deg−1) of each binding reaction was inferred from the variation in the free energy (ΔG), where this latter was calculated from the following relation: ΔG = −RTLn 1/KA.
Footnotes |
† Published as part of a Molecular BioSystems themed issue on Intrinsically Disordered Proteins; Guest Editor: M. Madan Babu. |
‡ Electronic supplementary information (ESI) available: Fig. S1–S4 and Tables S1 and S2. See DOI: 10.1039/c1mb05401e |
This journal is © The Royal Society of Chemistry 2012 |