Miha
Kastelic
a,
Yurij V.
Kalyuzhnyi
b and
Vojko
Vlachy
*a
aFaculty of Chemistry and Chemical Technology, University of Ljubljana, Večna pot 113, 1000 Ljubljana, Slovenia. E-mail: vojko.vlachy@fkkt.uni-lj.si
bInstitute for Condensed Matter Physics, Svientsitskii 1, 79011 Lviv, Ukraine
First published on 1st August 2016
We analyze the experimentally determined phase diagram of a γD–βB1 crystallin mixture. Proteins are described as dumbbells decorated with attractive sites to allow inter-particle interaction. We use thermodynamic perturbation theory to calculate the free energy of such mixtures and, by applying equilibrium conditions, also the compositions and concentrations of the co-existing phases. Initially we fit the Tcloudversus packing fraction η measurements for a pure (x2 = 0) γD solution in 0.1 M phosphate buffer at pH = 7.0. Another piece of experimental data, used to fix the model parameters, is the isotherm x2vs. η at T = 268.5 K, at the same pH and salt content. We use the conventional Lorentz–Berthelot mixing rules to describe cross interactions. This enables us to determine: (i) model parameters for pure βB1 crystallin protein and to calculate; (ii) complete equilibrium surface (Tcloud–x2–η) for the crystallin mixtures. (iii) We present the results for several isotherms, including the tie-lines, as also the temperature-packing fraction curves. Good agreement with the available experimental data is obtained. An interesting result of these calculations is evidence of the coexistence of three phases. This domain appears for the region of temperatures just out of the experimental range studied so far. The input parameters, leading good description of experimental data, revealed a large difference between the numbers of the attractive sites for γD and βB1 proteins. This interesting result may be related to the fact that γD has a more than nine times smaller quadrupole moment than its partner in the mixture.
Proteins may self-assemble to form dense liquid phase, crystals, or amorphous aggregates, and an understanding of this process is important for many reasons.2,9–11 First, the self-assembly is a step in protein crystallization; good quality crystals are needed to determine protein structure through diffraction experiments. The latter pathway is the following: soluble protein aggregates initialize the coexistence of low- and high density liquid phases, differing in protein concentration. The so-called liquid–liquid phase separation is a meta-stable stage and may further lead to crystallization. For this reason, many experimental and theoretical efforts, were invested to improve understanding of the liquid–liquid phase separation. The pioneering work of George and Wilson,12 and the later study of Vliegenthart and Lekkerkerker13 revealed a strong correlation of the liquid–liquid phase boundary with the second virial coefficient and thus inter-protein interactions. Secondly, protein self-assembly in living cells plays a key role in diseases, such as Alzheimer's, Parkinson's, and others.1,14,15 Thirdly, a key step in developing biotech drugs16 is to formulate proteins that do not form aggregates. The challenge is to deliver formulations with high concentrations of solutes, low viscosity and maintaining the protein’s therapeutics properties for at least two years. Self-assembly decreases therapeutic efficacy and increases its viscosity.
As atomistic level molecular simulations are not practical for studying phase separations in such solutions, the traditional approach is to use the colloidal model of Derjaguin–Landau–Verwey–Overbeek (DLVO).17 In this approach, proteins are treated as spheres that interact via van der Waals and screened Coulomb potentials, acting between centers of molecules. The approach proved to be unsatisfactory in several respects.1 On the other hand, many aspects of protein aggregation may be pictured by the models that include anisotropic protein–protein interactions of short range.7,8,18–21 This strategy was successfully used to model a wide class of globular proteins and to analyze the liquid–liquid phase separation,22,23 crystallization,24,25 osmotic pressures,26 distribution of aggregates,27,28 percolation threshold,27 and other physico-chemical properties.21
Heras and co-workers29,30 extended these ideas to binary mixtures of equally-sized spherical particles. They examined an influence of pair potential characteristics on the phase behavior under isobaric conditions. A special interest for us is the work of Roldnán-Vargas31 and coworkers. These authors treated binary mixtures of spherical particles of different sizes and with different surface patterns. The fact that the shape of protein molecules affects the phase separation, has recently been confirmed by us. A distinct shift of liquid–liquid critical density toward lower values has been obtained in case of the associating dumbbells.32 Such a behavior was previously observed experimentally for the system of Y-shaped antibodies.33 For proteins in dense mixtures near the phase separation a realistic modeling of protein shape seems to be important.
In the previous studies34,35 we applied simple models with anisotropic interactions to analyze aqueous protein solutions in the presence of low-molecular-mass salts. We treated proteins as hard spheres, decorated with square-well-energy binding sites, using Wertheim's thermodynamic perturbation theory.36 The model provided good fits to the cloud-point temperature curves of lysozyme in buffer–salt mixtures as a function of the type and concentration of salt. The necessary condition required for such modeling to be realistic is that proteins in solution during the experiment remain in their compact form.37,38 Unfortunately, many experimentalists often do not provide this information. In the next contribution32 the approach above was extended to examine the thermodynamics of fluid with “molecules” represented by two fused hard spheres, again decorated by the attractive square-well sites. Interaction between these sites was of short-range and caused association between the fused-sphere particles. The model served to estimate the effects of non-spherical geometry of proteins on solution properties.
Here we continue with applications of simple models to protein systems. Going one step further, we treat a binary mixture of proteins with molecules of each of the respective species represented as dumbbells. In the first step we need to form the molecule (dumbbell) by fusing two hard spheres and further decorate it with attractive sites on the surface. These sites allow association between the same type of molecules (self-association) as also with molecules of the other protein species present in the system. Thermodynamic properties of the model mixture are obtained via Wertheim's thermodynamic perturbation theory.36 We utilized the McMillan–Mayer approach, where the principal variables are concentration, chemical potential of composed solvent, and temperature. We calculated the liquid–liquid phase diagram and present the results as the volume packing fraction–composition–temperature diagram. Under such conditions the osmotic pressure follows from the equation of state. When it comes to comparison with experimental data, we chose to model a γD–βB1 crystallin mixture, thoroughly studied by Wang and co-workers.39 Both proteins show significant deviations from the spherical shape and resemble dumbbells40 (see Fig. 1 therein, and Fig. 6 of this paper). These proteins are of special interest: an improper functioning of γD crystallin may lead to cataracts in the human eye.14 For this reason crystallins were intensively studied. The investigations include stability measurements,41 influence of mutation studies42,43 and other surface modifications,44 and the analysis of surface residues forming inter-protein salt bridges.45 The investigations revealed that a replacement of one single amino acid with another may modify physiological function and/or change the crystallization pathway.46–49 Investigation of this subtle phenomenon is beyond the scope of this work. In this initial study we focus to the liquid–liquid phase equilibrium, and how the latter is governed by the interactions between two protein species. We use a simple “dumbbell” protein model, coupled with the thermodynamic perturbation theory of Wertheim,36 to analyze the experimental data of Wang and co-workers.39
〈exp{−βu(ij)BB(zBB)} − 1〉ΩiΩj = δijK(ij)BBδ(rij − σ), | (1) |
Protein molecules i and j are interacting via the pair potential, which consists of the sum of four terms uij(rij,Ωi,Ωj). Note again that hard-spheres labeled 1 form molecules of type 1, while the hard-spheres denoted 2 form molecules of type 2. These terms describe the interaction between decorated hard spheres i and j, belonging to molecules i and j:
![]() | (2) |
Here uhs(rij) is the hard-sphere potential, zαβ is the distance between sites Aα and Aβ, which belong to the spheres of the type i and j,
![]() | (3) |
A = Aid + Ahs + Aass. | (4) |
These terms are written as:
![]() | (5) |
![]() | (6) |
![]() | (7) |
![]() | (8) |
![]() | (9) |
Here ij(r) is the orientational average of the Mayer function for the square-well site–site interaction:
![]() | (10) |
Wertheim's TPT uses the orientation average of fij(r) and accordingly placement of the sites does not enter the theory. There is computer evidence in the literature that this is not a severe approximation.54
Once the free energy A is known, the chemical potential of the protein species i, μ(p)i, and the equation of state (pressure P) can be obtained from the standard thermodynamic relations50
![]() | (11) |
![]() | (12) |
μ(p)i({ρ(p)1(1), ρ(p)2(1)}, T) = … = μ(p)i({ρ(p)1(Q), ρ(p)2(Q)}, T), | (13) |
P({ρ(p)1(1), ρ(p)2(1)}, T) = … = P({ρ(p)1(Q), ρ(p)2(Q)}, T). | (14) |
In words: the chemical potential of species i = 1,2 within phase (γ), μ(p)i(γ), must be at given temperature and pressure equal in all phases. In addition to these conditions, we need also to consider the conservation of the volume fractions of phases X(γ) = V(γ)/V(0) and the number densities of proteins {ρ(p)1(γ),ρ(p)2(γ)} over the coexisting phases (γ):
![]() | (15) |
![]() | (16) |
Eqn (13)–(16) form the set of expressions for unknowns {ρ(p)1(γ), ρ(p)2(γ)}, X(γ) (γ = 1…Q) at specified densities {ρ(p)1(0), ρ(p)2(0)} of the mother phase.
In order to determine the coexistence region we need to solve numerically the set of equations defined above. This task may be accompanied by several problems. First, the solution of the system of nonlinear equations is not unique and the exact number of solutions is not known in advance. Second, numerical methods for solving nonlinear equations are usually based on iterative algorithms, requiring a good initial approximation to reach convergence. Third, in a process of searching for the solution one can get stranded at a saddle point, away from the global free energy minimum. In general, additional restrictions55 must be considered, what makes such an approach time consuming.
To avoid these complications, we used a somewhat different approach, searching for the global minimum directly. The appropriate functional to be minimized is the free energy of the system per volume, A(0)/V(0),
![]() | (17) |
To account for the conservation conditions given by eqn (15) and (16), we insert these expressions into eqn (17), and thus reduce the dimensionality of the problem. Next we use the Boltzmann simplex simulated annealing method (BSSA), described in Chapters 10.4 and 10.9 of ref. 56, to find the global minimum. When this minimum is reached, the conditions given by eqn (13) and (14), are automatically fulfilled. Numerically, the equations are satisfied within 10−2%. More detailed description of the search for the global minimum is given as a part of the ESI.†
To study the deviations from the basic case examined above, we place unequal numbers of sites on each protein (m1 ≠ m2), while keeping other parameters unchanged. We examine the case where m1 is increased by one (m1 = 5), and m2 decreased by one (m2 = 3). The corresponding phase diagram is shown on panel B of Fig. 2. In contrast to the previous example, the x2 curves are not parallel to each other and a distinct asymmetry along x2 axis is observed. Envelopes for pure components, denoted by black solid lines at x2 = 0 and x2 = 1, are different: protein 1 has higher critical density and temperature, since it has five attractive sites (per sphere) than the protein species labeled 2. This observation is consistent with our previous study,34 as well as, with observations of other authors studying one-component systems.57,58 In addition to this, the unstable states exhibit higher complexity: as inferred from the right site of panel B, T* = 1.20, the high-packing fraction phase tends to retain the composition of the unstable state (density driven), while the low-packing fraction phase is enriched on an account of the second component (composition driven), to satisfy the conservation of particles. At higher temperature, T* = 1.40, this effect is less pronounced.
In the next example the differences between protein species are introduced on account of modifying their interaction energies εij. It is important to stress out that overall protein–protein interaction is controlled by εij and ωij, as well as, by the number of binding sites mi. Here the square well depth ε11 is increased for 10%, while keeping the number of sites m1 = 5 and m2 = 3, and ranges ωij unchanged. These results are shown on panel C of Fig. 2. On the first sight there seems to be little difference to panel B, except for the higher critical point of the envelope at x2 = 0. A closer look, however, reveals the coexistence region of three liquid phases (black shaded area), embedded into the two-phase region. Interestingly, the three-phase region disappears for the temperature equal to the critical value of pure protein 2 (x2 = 1) that is at T* = 1.225. To demonstrate the simultaneous presence of two- and three-phase regions, we on the same panel show also the isotherms for temperatures below and above T* = 1.225. The isotherm for T* = 1.20 reflects the presence of three-phase region, labeled by number “3”, and surrounded by a closed dashed-blue loop. Unstable states within this region (denoted by empty squares) decompose into the phases with packing fraction and composition as indicated on the Figure. We may call these three phases the low-, intermediate-, and high-packing fraction liquid phases. According to the Gibbs phase rule, in a two-component mixture the region where three phases are in equilibrium has only one degree of freedom, in our case this is temperature. The three-phase region separates two-phase region into three sub-regions as specified on the Figure. At higher temperature, T* = 1.40, a simpler phase diagram having only two phases in equilibrium is re-discovered.
Finally, in contrast with the previous example, we keep ε11 = 13.7ε (ε = kBT/T*) unchanged and increase the ε22 for 10%. As before, m1 = 5 and m2 = 3. In this way we suppress – in comparison with the previous case – the interaction potential difference between the two protein species. The resulting phase diagram is shown in panel D of Fig. 2. In this case the phase behavior exhibits a higher symmetry along the x2 axis than in panels B and C: the shape of the envelope at x2 = 1.0 resembles the one at x2 = 0.0. Interestingly, the three-phase region shown in panel C disappears under such conditions. Because of similarity in interactions among the two protein species, the system does not need a third phase for stabilization. Furthermore, the high packing fraction (daughter) phases, derived from unstable states (mother phases), tend to retain the composition of the mother phases, similarly as before shown on panel B.
The best-fit parameters, collected in Table 1, were determined by hand; initially we picked m1, ε11, and ω11 to fit the envelope of pure γD protein (x2 = 0.0). Experimental results for pure βB1 (x2 = 1.0) are not available and to adjust m2, ε22, and ω22 values, we fit the isotherm x2vs. η at T = 268.5 K (see Fig. 3). In this range of x2 values the Tcloud data are readily available.39 This is all the input information we use. We also assume that conventional Lorentz–Berthelot mixing rules provide a reasonable measure of the cross interactions; it is known that an excessive deviations in parameters may cause an un-physical instability of protein mixtures.59
γD | βB1 | ||
---|---|---|---|
m 1 | 8 | m 2 | 2 |
ε 11/kB [K] | 2231 | ε 22/kB [K] | 3248 |
ω 11 [nm] | 0.18 | ω 22 [nm] | 0.21 |
![]() | ||
Fig. 3 Isotherms at 268.5 K (green, bottom), and 270.5 K (orange, top). Circles show experimental data, while the solid lines represent calculations. In addition, we compare the experimentally determined tie-lines39 (dashed green/orange lines), with those calculated (black solid lines). Experimental data for the isotherm at 268.5 K (without the tie-lines) are showed together with the measurements for x2 = 0 (Fig. 4) used to fix the model parameters, collected in Table 1. Other results, including the isotherm at 270.5 K and the tie-lines for both temperatures, were than calculated using these values. |
Parameters listed in Table 1 are then used to calculate: (i) the isotherm at T = 270.5 K (including the tie-lines), for which the Tcloud measurements are available (see Fig. 3); (ii) Tcloudvs. η curves for x2 equal to 0.08, 0.16, and 0.29, shown in Fig. 4 as colored curves; (iii) the isotherms at 255 K, 265 K, and 275 K presented in Fig. 5; as also (iv) the equilibrium surface for the model γD–βB1 mixture (see Fig. 4) for a range of temperatures, x2, and η values.
![]() | ||
Fig. 4 Panel A: Phase separation of the γD–βB1 mixture in 0.1 M phosphate buffer at pH = 7.0. We show the equilibrium surface and its projections on the η–x2 plane. Experimental results are denoted by points and calculations by lines. The comparison is shown for x2 values equal to 0, 0.08, 0.16, and 0.29. The three-phase region (black shaded area) is embedded into the two-phase region for temperatures below 264 K. Panel B: For easier visualization we re-plotted the three-phase region. Coexisting phases are denoted by black lines, see also the isotherm at T = 255 K in Fig. 5. |
![]() | ||
Fig. 5 Isotherms for the temperatures 255 K, 265 K, and 275 K, extracted from Fig. 4. The color notation corresponds to the temperature code in Fig. 4. The one-, two-, and three-phase regions are denoted by the appropriate numbers. Squares denote mixtures, which phase-separate while the lines are connecting the mother phases with the coexisting (daughter) phases. |
Results for the calculated tie-lines and isotherms at 268.5 K are, as shown in Fig. 3, in fair agreement with measurements. The fact that we obtained correct slopes for the tie-lines supports the proposed model; it is known that the latter are very sensitive to the protein–protein interactions. As concluded by Dorsaz et al.;6 the difference in cross interactions at less then kBT may already invert the slope of the tie-line. Notice that uncertainties in measurements of the packing fraction and composition of the coexisting phases can be up to ±0.02.39
In Fig. 4 we show the agreement between the Tcloud measurements and theoretical calculations (lines) for x2 values equal to 0.00, 0.08, 0.16, and 0.29. In addition to these results we also show the equilibrium surface, which signals the onset of cloudiness and the coexistence of low- and high-packing fraction (η) liquid phases. A large difference between m1 (equal to eight), and m2 (equal to two) for γD and βB1, we picture this situation in Fig. 1, is needed for the model to correctly describe experimental results. This set of mi values leads to the presence of a three-phase region (black shaded area) at temperatures below 264 K, where no experimental results are available. Notice that the two-phase coexistence region is present even for packing fractions as high as η > 0.4, what was not the case when the difference between m1 and m2 was smaller (compare Fig. 2). βB1 protein exhibits a slightly longer range ω22 and also a somewhat stronger association energy ε22. The measurements of the association enthalpies,60 based on the thermodynamic analysis of protein oligomerization, reveal a much larger association enthalpy in case of βB1 crystallin. This finding is consistent with our calculations. It is worth mentioning that the potential parameters (ε11, ε22 and ω11, ω22) applied in this calculation fall in a broad range of hydrogen bond magnitudes.61 This result is consistent with our previous findings.34
Fig. 5 presents the results for the three isotherms, which are part of the full phase-diagram shown in Fig. 4. We show the η–x2 projections of the equilibrium surface for three different temperatures: 255 K, 265 K, and 275 K. The graphs show an evolution of the phase coexistence curves with temperature. For the highest one, T = 275 K, the one-phase region is dominant; the two-phase regions only exist around x2 = 0 (η between 0.1 and 0.25) and also for high η values with the composition x2 between roughly 0.4 and 0.8. For a somewhat lower temperature, T = 265 K, the two-phase region is much broader. The lines indicate how the unstable points (squares) phase separate into the phases having different x2, η values. For the lowest isotherm, T = 255 K, the graph is quite complex. Here we have a domain, denoted by “3”, where three liquid phases coexist. In addition, both one- and two-phase regions are present. The complexity is very likely due to fact that interprotein interactions gain in importance if the temperature is lowered. See also the discussion of results presented in panel C of Fig. 2.
Although the choice of the set of parameters is not unique, there seems to be a limited number of such sets providing reasonable results. In several examples we tested the sensitivity of the parameters listed in Table 1. In one of these cases we assumed for the number of sites to be equal to m1 = m2 = 8, and ranges of ω11 = ω22 = 0.18 nm, while at the same time, we increased the difference between ε11 and ε22 by 12%. With these modifications, we have not been able to reproduce the experimental isotherm at T = 268.5 K, shown in Fig. 3; the result for x2 ≈ 0.3 at η ≈ 0.14 was (in our best fitting attempt) too large by a factor of two. In yet another example we choose m1 = 8 (“fixed” by fitting the experiment at x2 = 0) and m2 = 4, keeping the same temperature and ω11 and ω22 values as above. Again, even upon considerable adjustment of ε22 (we took a 15% more-negative value) the agreement with experimental x2vs. η curve was poor, much poorer than in Fig. 3.
The authors of the experimental work provided data in the domain η × x2 ∈ [0,0.3] × [0,0.3]; the state points out of this range were not accessible to measurements due to the tendency of the system to solidify. We calculated the system properties for η and x2 values out of this range – no convergence problems were encountered. An example is the liquid–liquid envelope at x2 = 1, i.e. for pure βB1 protein. Unfortunately, this curve could not be obtained by direct measurements, because of the low critical temperature below −10 °C.60 Addition of polyethylene glycol to the protein buffer mixture increases the critical point and allows us to measure the cloud point temperatures in a wide range of protein concentrations. Such indirect measurements,60 in conjunction with the extrapolation to zero concentration of polyethylene glycol, suggest for the critical point to be around 250 K. Our theoretical model yields for x2 = 1 (see Fig. 4) a value equal to 255 K. Notice again that this prediction is based on fitting the results for x2 = 0 (absence of βB1 protein) and the isotherm for the mixture at T = 268.5 K.
An alternative (and to a certain degree complementary) presentation of the information given by phase diagram in Fig. 4 and isotherms in Fig. 5 is provided by the cloud and shadow curves.62 These curves are fundamental for studying mixtures, because they show the partitioning of protein species into low- and high-packing fraction phases. These results are shown in the ESI† as Fig. S1.
The surface distributions of charges and insights into tertiary structures of γD and βB1ΔN41 proteins are shown in Fig. 6. Since all the measurements were performed slightly below the isoelectric points of proteins (7.1), at pH equal to 7.0, the net charge of proteins is around zero (blue and red surface areas are roughly of the same size).
![]() | ||
Fig. 6 Comparison between γD (left) and βB1ΔN41 (right) crystallins. Panel a: Tertiary structures embedded into the solvent accessible area, generated by Schrödinger software.63 Panel b: Charged amino acid residues on the surfaces of proteins; positive – blue, negative – red, neutral – white. The location of residues is shown in the Hammer projection,64 having the polar and the azimuthal angles as the principle coordinate axes. An implementation of this projection was proposed by Koromyslova et al.65 and modified by us, as described in the ESI.† |
From the Fig. 6 it is difficult to draw any firm conclusions about why the numbers of sites per sphere (m1,m2), needed to adequately describe crystalline mixture, differ from one protein to another so much. In search for some clues we turned to the Protein Dipole Moments Server66 (http://dipole.weizmann.ac.il/) for information about higher multipole terms for 1hk0 and 1oki. The results are summarized in Table 2. The two proteins have dipole moments of similar magnitude, while βB1ΔN41 has a more than nine times larger quadrupole moment than γD.
Charge | Dipole | Quadrupole |
---|---|---|
γD crystalline | ||
0 | 191 | 110 |
βB1ΔN41 crystallin (chain A) | ||
0 | 260 | 1037 |
This enhances the deviation from isotropic behavior and a low number of sites per sphere (m2 = 2) is needed to describe the highly directional interactions among the βB1 proteins. There is experimental evidence showing that β crystallin proteins form dimers and low-branched oligomers.60,67
The comparison of the model calculations with experimental data for binary mixture of γD and βB1 crystallins yields the following conclusions: (i) for good agreement between theory and experiment a large difference in number of sites per sphere of protein (m1 = 8 and m2 = 2) is needed. The results are consistent with experimental findings indicating that β crystallins form small clusters with only a few proteins involved. We trace this result to the large difference in the quadrupole moments between the two proteins. (ii) Our model predicts correct slopes for the tie-lines in x2vs. η graphs. (iii) The calculation yields good agreement with experimentally determined temperature-packing fraction curves for mixtures. (iv) The model provides the results for pure protein βB1, for which no direct measurements are available, and accordingly these data were not included in the fitting procedure. The calculation for the critical temperature is in good agreement with the value suggested from the extrapolation of experimental data for the βB1 protein. (v) The “dumbbell” model predicts a region where three liquid phases are in equilibrium – the region is just out of the domain where measurements are currently available. The calculation demonstrates, in agreement with articles listed in recent review papers,7,8 that methods of the condensed matter physics may be useful in the analysis of experimental data of protein systems.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c6sm01513a |
This journal is © The Royal Society of Chemistry 2016 |