Alessandra
Del Giudice‡
*,
Daniele
Del Giudice§‡
,
Emanuele
Spatola¶
,
Valentina
Alemanno
,
Luciano
Galantini
and
Stefano
Di Stefano
Department of Chemistry, Sapienza University of Rome, P.le A. Moro 5, 00185 Rome, Italy. E-mail: alessandra.delgiudice@uniroma1.it
First published on 26th September 2024
Given the intimate connection between the structure and function of biological macromolecules, the ability to temporally control their unfolding–refolding process enables temporal regulation over specific functionalities, potentially applicable in innovative domains, including the construction of protein-based actuators or programmable catalysis and drug release in complex biotechnological processes. We show here how a temporally controlled protein unfolding–refolding cycle can be coupled in time with programmed pH sequences achieved through the spontaneous decomposition of an activated carboxylic acid. Specifically, we illustrate this process at the molecular level using albumin, the most prevalent protein found in plasma, for which a temporary shift from native to unfolded forms is promoted using nitroacetic acid, able to undergo base-catalysed decarboxylation when solubilized in water solution. As detected by small angle X-ray scattering and intrinsic tryptophan fluorescence, starting from the protein in its native form, the acid addition triggers unfolding to a partially denatured state and a subsequent time-tunable pH rise with gradual refolding that recapitulates the intermediate steps detected at the same pH values by static acidification, fitting within a framework of full reversibility of the structural changes as a function of the protein protonation state. At the end of the pH jump, the native structure is fully recovered, making this method a chemical tool to achieve a complete protein conformational sequence programmed in the timeframe of minutes without further intervention.
We and others have indeed shown that simple reagents can be employed to finely control the pH of a water solution over time.10,11 The consumption of such reagents allows complementary pHlow–pHhigh–pHlow12–14 or pHhigh–pHlow–pHhigh15–19 sequences, whose amplitude and duration can be regulated at will. Since the properties of many biotic and abiotic systems in water solution depend on pH, the above programmable pH sequences can be used to have time control over such properties. In other words, the above reagents allow the system under investigation to pass from the resting state A to a different state B during the first pH variation and to come back from state B to state A during the second pH variation. Such a process is affected by the reactivity of the stimulus employed and its concentration and can be often modulated with the addition of enzymes13,20,21 or other additives. Generally, the system persists in state B as long as the stimulus is present and finally comes back to its resting state A when the stimulus is exhausted.
In particular, we used nitroacetic acid (NAA, 1), an activated carboxylic acid22–24 able to undergo base-catalysed decarboxylation, as a stimulus to achieve pHhigh–pHlow–pHhigh sequences. The latter were exploited to drive the operation of host–guest systems15 and DNA based molecular devices.16 The addition of acid 1 to a water solution results in an instantaneous lowering of the pH due to the dissociation of 1 to 1− and H+. Then, the resulting carboxylate 1− slowly loses carbon dioxide, giving the nitronate anion 2− which eventually acts as a base, increasing the pH to the final value (Fig. 1a).
![]() | ||
Fig. 1 (a) Transient variation of the pH of a water solution due to the decarboxylation of acid 1. (b) Different conformations assumed by HSA in response to pH variation as a function of the protonation state (titration curve adapted from the original work of Tanford9). (c) The transient pH variation due to the addition of 1 causes programmed HSA conformational changes (unfolding–refolding). |
Among biomolecules, proteins are the most sensitive ones to pH variations due to the presence of acidic and basic functions in the lateral chains of the aminoacidic core, which affect the protonation state of the protein and hence its conformation.
It is known that the tertiary structure of human serum albumin (HSA), a protein that under physiological conditions (pH = 7.1–7.5) is found in its folded native (N) form, is highly affected by pH variations.25 According to the widely accepted Foster's model,26,27 lowering the pH to 4.3 induces the protein transition to a partially open conformation identified as the F (“fast migrating”) form, stable up to a pH value as low as 2.7. Moreover, the F form can experience a second transition to the E (“expanded”) form at even lower pH (Fig. 1b), in which the protein further loses secondary and tertiary structures and expands to a highly flexible chain of its subdomains, which locally maintain a high degree of native folding, together with the covalent disulfide bonds.
Taking advantage of the in situ production of acid due to the hydrolysis of glucono-δ-lactone, we have previously studied the unfolding process of HSA by means of fluorescence spectroscopy and small angle X-ray scattering (SAXS), enabling continuous monitoring of the protein structure in solution as a function of a smooth pH decrease.28 This approach has allowed the deconvolution of structural models for the main conformational components of the protein under acidic conditions, highlighting the intermediately unfolded conformation represented by the F-form, which never reached 100% at low ionic strength and in the absence of ligands. It has also provided interesting insights into the role of fatty acid binding within the high affinity sites in the stabilization of the native tertiary structure, so that lower pH values are required to trigger the acid transition towards partially unfolded conformations.29
Here we show a complementary approach for easily monitoring over time the conformational changes of HSA. Starting from the protein in its native form, we employed nitroacetic acid 1 to achieve a fast acidification of the solution upon dissolution, with the consequent denaturation of the protein, followed by a slow increase in the pH without further intervention, which enables full protein refolding with a predictable time dependence. This method allows for a whole protein conformational cycle to be realized by the addition of a single reagent, prompting us to study in detail the refolding process by SAXS (Fig. 1c), verifying the reversibility and identifying the main structural components.
By comparing the pH versus time profiles obtained with the same concentration of 1 and under the same buffer conditions, with and without HSA, it is evident that the presence of the protein affects the pH cycle, neutralizing part of the added 1 and consequently causing a less pronounced acid excess, resulting in an earlier pHlow–pHhigh jump. For example, the pH versus time trend obtained with 20 mM of 1 reasonably agrees with the profile obtained with 50 mM added acid and 0.055 mM of protein (Fig. 2b). Assuming that a similar pH rise over time implies a similar initial molar ratio between 1 and the base, 68 extra base equivalents per protein molecule can be estimated. This is a reasonable number considering that the HSA sequence has 98 total aspartate and glutamate residues that act as weak bases.
Since the conformation of HSA well and reversibly responds to such pH variations, a clear change of the SAXS profiles as a function of time was detected (Fig. S3† and Fig. 4a, b and c). The clarity of the collected data is due to the simple protocol allowing continuous monitoring of the same sample, not needing preparation of a series of samples at different pH values and thus removing a source of variability. The transition is more clearly observed when showing the data in the Kratky plot representation (I·q2vs. q),|| which emphasizes the difference between a fully folded state, whose scattering profile follows in the high q regime the law expected for sharp interfaces (1/q4), and flexible states, which rather tend to follow the polymer statistics (1/q2). In addition, the presence of maxima in this representation is easily detected and their position provides information about the size of folded portions of the studied macromolecules, shifting to larger q values when smaller compact entities prevail within a flexible state, over a larger fully globular structure.37,38 In this case, we used the subtended area under the Kratky plot as an index of protein refolding. This quantity, if calculated up to the isosbestic point among the curves and normalized by the native value, grows towards 1.0 when the protein refolds (Fig. 4c bottom panel). A clear indication of reversibility is also denoted by the overlap between the data collected for the refolded samples at the end of the pHlow–pHhigh jump and those of the initial stock solutions (Fig. S3†).
The radius of gyration (Rg) obtained from the initial slope of the SAXS profile, according to a Guinier fit, is an immediate indication of the overall size of the protein molecules and is observed to decrease as a function of time (Fig. 4c top panel).
Both parameters indicate the increasing presence of partially unfolded and extended states upon increasing the concentration of 1 and that the recovery to the native size and folding state occurs with different time delays, as expected from the pH measurements. Therefore, using different acid amounts allows for tuning, in the range of 1 to 20 minutes, the time after which the protein begins to refold from an initially non-native state.
The apparent molecular mass obtained by assuming a fixed partial specific volume of 0.735 cm3 g−1 (ref. 40) shows a maximum during the pH rise towards neutrality around pH 3–3.5 (Fig. 5a, middle panel), possibly indicating that upon moving towards the isoelectric point,41 when electrostatic repulsions decrease, an increasing population of weak oligomers could transiently exist.** To further test whether the protein conformation detected by SAXS measurements during the time-dependent refolding is indeed an “instantaneous” equilibrium corresponding to a certain average protonation state, we prepared control samples by static acidification of the same HSA solution via the addition of known amounts of a strong acid (HCl, as reported in ESI Table S2†). The comparison between the SAXS profiles of the control samples brought to constant acidic pH values and those accessed at the same pH values in the time-dependent pH jump show that they substantially overlap (Fig. S5†), suggesting that the 1 decarboxylation-coupled refolding recapitulates the same equilibrium average conformations that take place in static samples.
![]() | ||
Fig. 5 (a) Refolding trends as detected using SAXS parameters (Rg, I(0) and the index of refolding (Σ/Σn)) for different albumin preparations acidified with 40 mM and 60 or 80 mM (data points for pH < 2) acid 1: defatted HSA (purple), defatted HSA with double concentration of Tris HCl buffer (red), fatted BSA (cyan), and fatted HSA (black). All solutions contain NaCl 0.15 M. (b) Collection of SAXS data in Kratky plot representation for the fatted HSA sample shown as coloured dots according to the assigned pH, and data fit (black lines) as a linear combination of three components, shown in (c), weighted by the volume fractions reported in the inset. The first two components come from applying a component analysis to the dataset, while the third component was added to consider a fully expanded conformation, as measured in the absence of added NaCl and at pH < 1.6 (HCl > 40 mM). (d) Description of the selected SAXS profiles of fatted HSA (red is component 3 of panel (c), dark orange is 5 minutes after the addition of 80 mM acid 1, and light orange and blue are respectively 7.5 min and 5 h after the addition of 40 mM acid 1) according to an ensemble of models obtained by assuming that the unfolded albumin structure is composed of structured helical loops connected by random linkers. In the distribution of Rg values, the black line represents the overall pool of generated conformers, while the coloured areas show the optimized selection of conformers to fit the SAXS data as shown in a lateral panel. A representative conformer is shown for the non-native states, while for the refolded albumin, a fit of the data with the crystal structure46 is shown as a dashed line. |
We also followed time-dependent pH jumps by SAXS carried out with different albumin preparations from commercially purified products (Fig. 5, defatted HSA in Tris HCl 5 mM (HdT5), defatted HSA with double the concentration of Tris HCl buffer (HdT10), fatted bovine serum albumin BSA (BfT5), and fatted HSA (HfT5)). The general refolding trends as a function of pH are essentially unchanged for the human protein, except for a systematic shift of Rg to smaller values for the preparation not deprived of bound fatty acid molecules (“fatted”), probably due to an intrinsically smaller number of dimers and oligomers in the lyophilized starting product.†† For the HfT5 preparation indeed, the starting stock solution of native protein can be satisfyingly described by means of the monomeric albumin crystal structure (Fig. 5d). We therefore focus on this dataset to reach a more detailed structural interpretation of the refolding process, in terms of the minimum possible conformational states involved and previously proposed structural components.
The set of SAXS data collected during the 1-triggered unfolding and time-dependent refolding could be described reasonably well in terms of just two SAXS profiles, a fully folded component and a partially unfolded one (Fig. S6b†), while a third, further expanded component could marginally contribute at pH < 2 (Fig. 5b and c).‡‡ The optimized concentration profiles of these three structural components suggest that the mid-point of the transition between the partially unfolded and the native form as observed in these SAXS refolding experiments is located close to pH 3 (Fig. 5b). This should be compared to the component-based interpretation of previously collected unfolding trends with glucono-δ-lactone time-dependent acidification (in the absence of added salts), where a similar intermediately unfolded component, ascribed to the “F form” of the literature, seemed to reach a maximum around pH 4, while already coexisting with further expanded conformations clearly detected for pH lower than 3. Such a difference could be ascribed to the low ionic strength in the unfolding experiments, implying a reduced screening of the intra-chain electrostatic repulsions and an emphasized population of extended conformations, despite the nominal pH value measured with the electrode being the same.42
The overall folding state of albumin could be visualized by applying an ensemble optimization method to fit selected SAXS profiles and by considering a particular conformer found to be highly contributing and recurrent in repeated optimization experiments (Fig. 5d). The further extended component, not detected in these pH jumps due to the requirement for stronger inter-subdomain repulsions, can be described by including a relevant portion of models with Rg values above 5 nm, and shows an even higher level of flexibility (wider Rg distribution) compared to the reference random pool generated using the compact helical loop-flexible linker assumption. The SAXS profile with the lowest degree of folding accessed with the highest concentration of 1 (nominal pH 1.9) suggests an albumin conformation with a larger globular core involving the close-packed central subdomains and higher orientation flexibility for the N-terminal and C-terminal loops. Such a low-pH state substantially coincides with the main non-native component detected along the refolding path, which, together with an increasing fraction of native structures, allows for the simplest interpretation as a coexistence of the two main conformations in varying amounts (native and partially unfolded). The inspection of models that fit the SAXS data at higher pH upon refolding (e.g. at pH 3) could give an alternative view to this simplified assumption and suggest the involvement of possible conformers with a lower degree of separation of the terminal loops.
When correlating the results of SAXS and fluorescence from separate measurements using the unified pH scale (Fig. S8†), a clear asynchrony is detected between the two probes. The sharp shift of the tryptophan fluorescence spectrum back to the native position (accompanied by a more gradual change of the emission intensity) is completed only when the protein fully recovers the SAXS profile of the original native solution (pH > 4.7) and the mid-point of the transition observed through fluorescence corresponds to a global protein folding, which is already more than 90% similar to that of the native state. The discrepancy has to do with the multi-domain nature of albumin and the different sensitivity of the two techniques to global and local changes of the protein structure: while the SAXS signal mainly detects the earlier refolding and terminal domain recompaction taking place around pH 3, it is much less sensitive to the local re-adjustments within the central protein domain, where tryptophan is located. Here the fluorescent probe experiences a less polar environment compared to the native state irrespective of the variable degree of unfolding of the terminal domains, until a final local rearrangement restores the native polarity of its site in the central domain cleft characteristic of the native conformation. Such asynchronicity had already been observed more robustly with simultaneous detection of SAXS and fluorescence emission during time-dependent acid-induced unfolding (rather than refolding as realized here) of the same sample,28 and it contributes to a clear understanding of the multi-step nature of albumin's isomerisations at acidic pH.31
Starting from a native protein sample, the acid 1 addition triggers the unfolding to a partially denatured state of albumin ascribed mainly to the “F” form, and the gradual time-dependent refolding can be described as an increase of the native fraction, which reaches 100% when the pH climbs back to 5, allowing for a full protein conformational cycle. The final state at the end of the pHhigh–pHlow–pHhigh cycle is indistinguishable from the initial starting state, and the intermediate steps of the refolding kinetics recapitulate both the inverse process of unfolding upon acidification28 and the average protein conformations detected at the same pH values by static acidification with HCl, showing that neither acid 1 nor its byproduct affects the conformational transition or the stability of the protein, fitting within a framework of full reversibility of the structural changes as a function of the protein protonation state. Such reversibility, appreciated through various methods since very early works (e.g. ref. 26), has now been clearly shown in time-programmed refolding experiments of the same sample without further intervention.
A whole conformational cycle (native–unfolded–native) can therefore be driven and temporally controlled by the single addition of a simple reagent. Together with other systems used to transiently control the pH, our protocol based on the use of acid 1 can be helpful in the study of pH responsive structural transitions of macromolecules. The biological origin of proteins, their high structural sensitivity towards pH and the intimate link between their structure and functions make these results particularly interesting for the development of pioneering and sustainable applications, from protein-driven actuators in robotics to customizable catalysis and drug delivery in advanced biotechnologies.
Final data shown in this article, including the spectroscopic and SAXS data plotted in all the figures, are available at Zenodo at https://doi.org/10.5281/zenodo.13169259.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4ob01289e |
‡ Co-first authors with equal contribution. |
§ Present address: Institute for Complex Molecular Systems and the Laboratory of Macromolecular and Organic Chemistry, Eindhoven University of Technology, P.O. Box 513, 5600 MB Eindhoven, The Netherlands. |
¶ Present address: Institute of Organic Chemistry, Ulm University Albert-Einstein-Allee 11, 89081 Ulm, Germany. |
|| I is the background subtracted scattered intensity and q is the reciprocal space scattering vector modulus q = 4π![]() ![]() |
** According to the relationships ISAXS(0)(cm−1) = MM(kDa)c(mg cm−3)(Δρm(cm g−1))2/NA and Δρm(cm g−1) = (ρm(el. g−1) − ρsolv(el. cm−3) ![]() ![]() |
†† The presence of such oligomeric species in an amount usually below 20% of the overall protein mass affects the average SAXS profile especially in the lower q region, and the estimated Rg (when above 2.8 nm under native conditions) and maximum size (when above 9 nm) are particularly sensitive parameters signalling their existence. However, the overall refolding trend as a function of pH is not affected by the co-existence of a small fraction of albumin dimers,43 which can undergo comparable conformational transitions to monomers, as seen for example for the structural response of albumin upon oxidation using sodium hypochlorite.44 |
‡‡ Considering the previous knowledge of the maximum degree of albumin expansion detected by SAXS upon acidification below pH 2.7, with Rg values above 4.5 nm and Kratky plots skewed to larger q values with a lower subtended area in the q range <1 nm−1,28,29,43,45 we tested if such an expanded component could be included to describe the refolding process in combination with the native form. This was not the case (Fig. S6c†), and to reach satisfactory agreement, the second component could not be assigned to a state in which the protein is fully expanded as allowed by its local disulfide bonds but had to provide for an intermediate state with the bimodal Kratky plot and an Rg value of around 3.8 nm, which could be rather ascribed to the putative F-form. If the SAXS profile of a further expanded conformation is artificially included as an additional third component, it can just help in describing the data collected at the lowest pH, but it is not appreciably contributing, as already pointed out by the goodness of fit of a two-component model. |
This journal is © The Royal Society of Chemistry 2025 |