Open Access Article
Michael B.
Cammarata
and
Jennifer S.
Brodbelt
*
Department of Chemistry, The University of Texas at Austin, 1 University Station A5300, Austin, TX, USA 78712. E-mail: jbrodbelt@cm.utexas.edu
First published on 26th November 2014
Ultraviolet photodissociation (UVPD) mass spectrometry is employed to investigate the structure of holo-myoglobin as well as its apo form transferred to the gas phase by native electrospray. UVPD provided insight into the stability of native structural elements of holo-myoglobin. The fragmentation yields from UVPD showed the greatest overall correlation with B-factors generated from the crystal structure of apo-myoglobin, particularly for the more disordered loop regions. Solvent accessibility measurements also showed some correlation with the UVPD fragmentation of holo-myoglobin. Comparison of UVPD of holo- and apo-myoglobin revealed similarities in fragmentation yields, particularly for the lower charge states (8 and 9+). Both holo- and apo-myoglobin exhibited low fragmentation yields for the AGH helical core, whereas regions known to interact with the heme show suppressed fragmentation for holo-myoglobin. The fragment yields from HCD showed the lowest correlation with B-factor values and rather reflected preferential charge-directed backbone cleavages.
Due the nature of these limitations, a number of mass spectrometric methods have been developed for the structural characterization of proteins with high sensitivity and low sample demands. Some of these methods entail using various chemical probe methods on proteins in solution, followed by mass spectrometric detection of products. Hydrogen deuterium exchange (HDX) of the amide hydrogens and covalent labeling of specific amino acids remain at the forefront of mass spectrometric-based strategies for structural biology applications.3 In these methods, the proteins labelled in solution are typically analyzed by a bottom-up type proteomic workflow which entails digestion of the proteins by one or more proteases, separation of the resulting peptides by liquid chromatography and identification by mass spectrometry. The abundances of the modified peptides, which presumably reflect the accessibility of the HDX reagent or chemical probe to specific peptide sites, are quantified relative to the unmodified counterparts to reveal information about the tertiary structure of the protein.4 This process can be simplified by adoption of a top-down workflow which alleviates the proteolytic digestion step but requires a high-resolution/high mass accuracy mass spectrometer to map the incorporation of the covalent labels.5,6 For the top down approach, the deuterium-labelled or probe modified proteins are typically transferred to the gas phase by electrospray ionization (ESI) and fragmented by collisional induced dissociation (CID) or its analog higher energy collisional dissociation (HCD), electron-based activation (ECD or ETD), or ultraviolet photodissociation (UVPD), then the modified sites are determined based on interpretation of the abundances of resulting fragment ions.5,6 Denaturing ESI conditions are typically used to maximize sensitivity. While the top down/labelling approach offers some advantages to the bottom-up methods, the use of any type of labelling method can introduce artifacts during data analysis.
An alternative to labelling methods is to use native spray conditions to ionize proteins for direct analysis in the gas phase. Although the structures of proteins in the gas phase remain a matter of debate and such studies raise concerns about conformational integrity, investigation of intact proteins may offer new insight as well as correlations with known solution structures. It has been shown previously that tertiary intra-protein, protein–ligand and protein–protein interactions can be retained in the gas phase using gentle ESI conditions and buffered solutions.7–23 Several groups have undertaken HDX on the presumed “native” proteins and protein complexes and provided convincing evidence that the gas-phase structures mirror the known crystal or solution structures obtained by X-ray crystallography or NMR.11,12 However, hydrophobic interactions are not well maintained in the gas-phase native structures, and thus electrostatic and van der Waals type interactions play a far more important role. Ion mobility studies have also shown that ions in lower charge states may exhibit similar radii of gyration to that of their solution counterparts.13,14 These studies have also demonstrated that as the charge states increase, the proteins are more elongated, thus showing the impact of charge on denaturation in the gas phase.15–17
There have been a handful of recent studies in which unlabeled native-like proteins have been interrogated by electron-based activation methods, ETD and ECD, to obtain structural insight.18–24 The conclusions from these MS/MS-based studies have been compelling. For example, it was discovered that strong salt bridges were not disrupted by ETD, thus enabling discrimination of different types of salt bridges in the gas phase.19 It was found that specific enhanced backbone cleavage sites could be correlated with regions of the proteins that were known to have high B-factors (a measure of flexibility) from crystallographic studies, suggesting that these regions are also quite disordered (and more flexible) in the gas phase and thus more prone to cleavage.20,21,25,26 Similarly, the abundances of the fragment ions produced by ECD or ETD have been related directly to the stabilities of specific regions of the proteins.25,27–29 There has been one recent report of characterization of native protein complexes by UVPD, ETD and HCD for which UVPD revealed the greatest information about the primary sequences of the proteins in the complexes and also produced non-covalent fragment ions comprised of a portion of the protein bound to the ligand.30 In general, 193 nm UVPD has proven to be an exceptional activation technique for obtaining very high sequence coverage of the protein backbone (for both native and denatured proteins).5,31,32,33 HCD has been applied to top-down analysis of intact proteins (denatured),31,32,34 including one study that showcased the fragmentation of the transmembrane section of integral membrane proteins under denaturing conditions.35
The present study focuses on evaluating the application of 193 nm UVPD to characterize the gas-phase structures of native proteins, as demonstrated for horse heart myoglobin in both its holo and apo (heme-bound) states. Horse heart myoglobin was the first structure to be solved by X-ray crystallography and has become the most popular model protein for development and assessment of new structural methodologies both in apo and holo forms.5,36–41 However, an apo crystal structure has yet to be published. Past studies of myoglobin have been undertaken using an array of mass spectrometric strategies, such as the use of solution phase HDX for examination of structure and kinetic unfolding and ion mobility measurements for investigation of protein cross-section, as well as many computational studies on the stability of the helices in myoglobin.5,6,14,38–40,42 As with past ECD/ETD experiments in which fragment ion abundances were related to the stabilities of particular regions of proteins,9,22,27,29 the same types of correlations are developed in the present study from the examination of UVPD results. The impact of the charge state of the proteins is also examined to determine whether charge-mediated unfolding/elongation is reflected in the resulting fragment ion distributions, and HCD results are evaluated relative to the UVPD results to allow a comparison of the trends observed for a more conventional collision-based activation method (HCD) to photodissociation.
In this approach, all product ions arising from backbone cleavages that occur N-terminal to a specific amino acid (yielding an, bn, and cn ions) are summed with all the complementary C-terminal product ions arising from the same backbone cleavage site (yielding complementary xR−n, yR−n, and zR−n ions) where R is the total number of amino acids in the protein. For example, the abundances of the a70, b70 and c70 ions are summed with the abundances of the x83, y83, and z83 ions, each representing cleavage between residues 70 and 71 for myoglobin (which contains 153 amino acids). In this way, cleavages are categorized via a specific backbone position that is assigned to an amino acid local to the cleavage site. The MS/MS spectra for holo-myoglobin were additionally searched with inclusion of a +615.1695 Da shift corresponding to heme ligand adduction. The identified heme-containing fragment ions were summed with the corresponding apo-fragments from the same spectrum. The resulting fragment yields were also averaged into well-defined demarcated regions and then divided by the length (in terms of number of constituent amino acids) of the regions. These regions are as follows: N-terminus (residues 1–3), A-helix (residues 4–19), B-helix (residues 20–35), C-helix (residues 36–42), CD loop (residues 43–50), D-helix (residues 51–57), E-helix (residues 58–77), EF loop (residues 78–85), F-helix (residues 86 to 94), FG loop (residues 95 to 99), G-helix (residues 100–118), GH loop (residues 119–123), H-helix (residues 124–150), and C-terminus (residues 151–153).38,41,44 Effectively this gives a “fragmentation density” per structural region, a categorization that is beneficial for facilitating discussion about each of the 14 regions at the expense of individual residue-specific changes. The B-factors associated with each amino acid were extracted from the PDB file 1DWR and then averaged for the regions listed above to allow comparisons of B-factors to the fragmentation yields. GetArea was also used to obtain backbone and side-chain solvent accessible surface areas.45
Each MS/MS spectrum was processed to calculate the normalized fragmentation yield per amino acid (i.e. based on backbone cleavages between every pair of residues) as described in the experimental section. These values were plotted with their associated standard deviations in Fig. 2 which displays the normalized fragment yield for each native charge state (8+, 9+, 10+) for both forms of myoglobin. The results for the denatured charge state (20+) are included in the apo-myoglobin panel. The general shapes of the fragment distributions in Fig. 2 are similar for the holo- and apo-forms, with clusters of backbone sites showing elevated fragmentation yields relative to others that exhibit little or no yields. The graphs in Fig. 2 also include demarcated regions which are defined based on fourteen known structural regions of myoglobin. If the protein cleaved uniformly at each backbone position, the expected fragmentation yield would be 0.66% (e.g., 100% fragment ion current divided by 152 backbone cleavage sites). Backbone cleavages between residues that were detected to give greater than 0.66% fragmentation yield are considered to be residues exhibiting preferential (enhanced) cleavage. These specific amino acids are highlighted in the primary sequence map (bottom of Fig. 3) and are also superimposed on the myoglobin crystal structure (PDB code 1DWR), as illustrated for each native holo-myoglobin charge state in Fig. 3.
![]() | ||
| Fig. 3 Those residues that were found to have enhanced backbone cleavage frequencies upon UVPD activation are highlighted in red on the holo-myoglobin crystal structure 1DWR, as also indicated in the primary sequence below for the native charge states (8+, 9+, 10+) evaluated. The heme group is shown as a hot pink color. The 1DWR crystal structure has been rotated 180° about the x-axis between the top and bottom structures. | ||
In an effort to streamline the data interpretation in a more structurally meaningful manner, the backbone site-specific fragment yields were grouped and averaged according to the fourteen known structural regions of myoglobin and are displayed in histogram format in Fig. 4 for holo-myoglobin (with structural regions labelled in Fig. S2†) for both HCD and UVPD. The analogous results for apo-myoglobin and denatured myoglobin are shown in Fig. S3.† Essentially this treatment provides fragment yield densities which are a metric of fragmentation localized to specific structural regions of the protein, normalized for the size of the region. This provides a way to visualize the change in fragmentation yield for each activation method and as a function of charge state. To facilitate comparisons, each structural region is assigned an average B-factor value (as described in the Experimental section), which conveys the degree of flexibility in the crystal structure. The B-factors for each defined region of holo-myoglobin (1DWR) are shown in Fig. 4 and 5.
![]() | ||
| Fig. 5 Comparison of average (a) B-factor values and fragment yield densities of respective labelled regions from the (b) holo and (c) apo forms of myoglobin for the three native-like charge states. Regions with greater B-factors or fragmentation yield densities indicate less stable regions whereas those with lower values indicate more stable regions. The C- and N-termini have been removed from the figure. Standard deviations for fragment yield densities are given in Fig. 4 and S2† for holo and apo forms, respectively. | ||
The sequence or order of unfolding of myoglobin based upon molecular dynamics simulations, CD experiments, NMR, and HDX mass spectrometric data provides another perspective on the structural stability of the protein.38–41,47 As briefly summarized here based on published findings,37–40,46 the F-helix is the most unstable structural element, thus the first to unfold. This is followed by unfolding of the C, D, and E helices, prior to the unfolding of the B-helix. The last helices to unfold are the A, G, and H helices or the AGH core. Not surprisingly, this AGH core is found to be consistently the most stable and least flexible, with the exception of the N-terminus of the G-helix, which has been shown by computational modeling to fray.40B-factors from the 1DWR crystal structure were also analyzed and used to correlate the fragmentation yields in the present study to a measured and quantified flexibility value (B-factor). The averaged B-factors of the previously defined demarcated regions are summarized in Fig. 5a. Those regions with high B-factor values are flexible regions of the protein, while those with low B-factor values are more rigid and more stable. The charge state-based fragmentation yield data obtained from UVPD of both the holo and apo forms is shown in Fig. 5b and c, respectively. The UVPD results obtained for the apo-form should not be compared directly to the B-factor values because the latter includes the impact of the heme group. However, comparison of the B-factor values with the UVPD data for holo-myoglobin and comparison of the UVPD data for the holo and apo proteins provides interesting insight, as described in this study. A more detailed comparison of the UVPD results for the apo and holo forms as a function of charge state is shown in Fig. S4,† also discussed in detail later.
For the present study, our goal was to assess whether the UVPD fragmentation yields (Fig. 2 and 4) reflected the local stabilities of the tertiary and secondary structural features of myoglobin (i.e. structures in Fig. 3) as suggested previously in a number of publications utilizing electron activation and 266 nm UVPD for other proteins.23,27,29,48 In essence, these previous studies indicated that those regions of a protein that displayed high fragmentation frequencies and thus resulted in high fragment yields were presumed to be conformationally flexible and/or have few retained intramolecular interactions. Those which had low fragmentation yields were conformationally stable elements, such as those with strong intramolecular interactions in the gas phase. A number of substantial variations in relative fragmentation yields are observed in Fig. 5 and S4† as a function of charge state and apo/holo state of myoglobin, and visualized for the 9+ species in Fig. S5.†
When comparing the UVPD fragment yields obtained for the 8+ charge state to the 9+ charge state, fragmentation of the FG and GH loop regions was greatly enhanced, along with significant increases in the fragmentation of the connecting helices G and H (in all cases going from virtually no detectable fragmentation for the 8+ charge state to moderate/significant fragmentation for the 9+ charge state) (Fig. 4b and 5b). The fragmentation yields of the B and C neighboring helices also increased with charge state. The CD loop and the E helix are two regions that showed moderate decreases in fragmentation going from the 8+ to 9+ charge state, and several other structural regions, including the A, D and F helices and the EF loop, exhibited no significant change as a function of charge state. Collective grouping of these regions (those showing decreasing fragmentation or no change with charge state) reveals that all are important for heme binding.49 It has also been noted that absorption cross-sections of proteins change with charge density and conformation.50 This prior finding might explain in part some of the changes observed in the UVPD fragmentation yields as a function of charge state in the present study, namely a possible charge-density induced change in absorption cross-section in addition to a degree of protein unfolding as the charge state increases. Upon transitioning to the 10+ charge state, the most stable regions predicted by UVPD (e.g. lowest fragmentation yields) are the A, H, E and F helices, all of which increased slightly from the fragmentation yields determined for the 9+ charge state. The G helix displays a more significant increase in fragmentation yield for the 10+ charge state, an outcome attributed to N-terminal fraying as a consequence of unfolding of the FG loop that was also noted from the 8+ to 9+ charge states. This type of fraying has been noted in prior solution studies.40,47 Several of the structural regions show an increase in fragmentation for the 10+ charge state, suggestive of unfolding related to the D, G, C, and B helices along with EF and CD loops. This nearly global increase in fragmentation with charge state tracks well with the concept of electrostatic-induced elongation. Even with this alpha helical fraying and expansion in the regions from B to the D helix, the maintenance of the E and F helices as well as the AGH core of the protein allow retention of interactions with the heme group.
Solvent accessibility is another physical parameter that has shown positive correlation with B-factors of proteins51,52 as well as with ETD fragmentation trends for native proteins,18 and thus was also evaluated in the context of the UVPD data in the present study. A plot of the solvent accessible surface area (SASA) for both the backbone and side chains generated from the 1DWR crystal structure is shown in Fig. S6† (along with the B-factor values and UVPD fragmentation yields for ready comparison). There is some general correlation backbone SASA and the UVPD fragmentation yield for those regions having SASA above 50%. Those regions that have both low SASA values for the side-chain and backbone regions also have the lowest UVPD fragmentation yields, particularly for the 8+ charge state. The A-helix region shows the worst correlation between SASA and UVPD fragmentation yields, with rather prominent SASA values for the side-chains of the A-helix region of the protein but yet very low UVPD yields. However, the SASA values of the backbone in this A-helix region are very low.
The observation that the loop regions tend to fragment more readily than the helices by UVPD may also be related in part to the larger molar absorptivities of loops compared to helices at 193 nm. Tsai et al. reported that the λmax of a typical alpha-helix occurred between 189–204 nm in solution with a molar absorptivity of 4.1 cm2 mol × 10−3, whereas the molar absorptivities of coiled or loop regions were around 6.9 cm2 mol × 10−3 with a similar λmax (187–190 nm).51 Interestingly, beta-sheets were reported to have an even greater molar absorptivity;50 however, myoglobin contains no beta-sheets.53 Although the exact mechanism of the UV activation and dissociation process for large molecules like proteins remains unknown in the gas phase, it is conceivable that local absorptivities of certain structural elements of proteins that are retained in the gas phase could have an impact on energy deposition and re-distribution prior to fragmentation. More general trends about fragmentation efficiencies of helical and non-helical regions of native proteins may emerge as additional proteins are analyzed by UVPD.
One might anticipate a general across-the-board increase in UVPD fragmentation yields for apo-myoglobin relative to holo-myoglobin because the collisional cross-section of myoglobin increases, albeit slightly, upon transition from the holo to myo form.14 Although there are modest increases in UVPD yields for many of the structural elements (AGH helical core, D, E, F helices) going from the holo to myo form, a few regions (EF loop, B and C helices, CD, EF and FG loops) show more significant increases. The increases in the C, D, E and F helix fragmentation yields for the apo form may reflect the lack of stabilizing non-covalent interactions with the heme group that also allows the intervening loop regions to release and helices to unfold. For the 9+ and 10+ charge states, two regions showed a decrease in UVPD fragmentation for the apo form: the B helix and GH loop, as well as the CD loop and D helix for the 10+ charge state. The substantial amount of change in fragmentation yields for the 10+ charge state of holo- versus apo-myoglobin may reflect significant differences in secondary and possibly tertiary structures. The agreement in the UVPD trends between the holo and apo forms of myoglobin with a particular strong similarity in fragmentation behavior of the AGH core helices suggesting that this helical bundle is stabilized in both of these forms.
Footnote |
| † Electronic supplementary information (ESI) available: ESI mass spectra of holo- and apo-myoglobin, schematic representation of holo-myoglobin, histograms of HCD and UVPD fragmentation yields of apo-myoglobin, B-factors and solvent accessibilities. See DOI: 10.1039/c4sc03200d |
| This journal is © The Royal Society of Chemistry 2015 |