Open Access Article
Maria
Tsanai
a,
Pim W. J. M.
Frederix
a,
Carsten F. E.
Schroer
a,
Paulo C. T.
Souza
ab and
Siewert J.
Marrink
*a
aGroningen Biomolecular Sciences and Biotechnology Institute and Zernike Institute for Advanced Materials, University of Groningen, 9747AG Groningen, The Netherlands. E-mail: s.j.marrink@rug.nl
bMolecular Microbiology and Structural Biochemistry, UMR 5086 CNRS, University of Lyon, Lyon, France
First published on 18th May 2021
Complex coacervates are liquid–liquid phase separated systems, typically containing oppositely charged polyelectrolytes. They are widely studied for their functional properties as well as their potential involvement in cellular compartmentalization as biomolecular condensates. Diffusion and partitioning of solutes into a coacervate phase are important to address because their highly dynamic nature is one of their most important functional characteristics in real-world systems, but are difficult to study experimentally or even theoretically without an explicit representation of every molecule in the system. Here, we present an explicit-solvent, molecular dynamics coarse-grain model of complex coacervates, based on the Martini 3.0 force field. We demonstrate the accuracy of the model by reproducing the salt dependent coacervation of poly-lysine and poly-glutamate systems, and show the potential of the model by simulating the partitioning of ions and small nucleotides between the condensate and surrounding solvent phase. Our model paves the way for simulating coacervates and biomolecular condensates in a wide range of conditions, with near-atomic resolution.
One of the current challenges for the field is to study the dynamics of individual molecules (not only polyelectrolytes, but also solvent, ions and small organic compounds) inside the coacervate phase. Some experimental methods such as Overhauser dynamic nuclear polarization (ODNP) measurements can probe the local dynamics of polymers or water when spin labels are added,15 but explicit molecular simulation techniques such as Molecular Dynamics (MD) are arguably more versatile in addressing this challenge. All-atom MD simulations have so far mainly been restricted to simulating pairs of electrolyte molecules, as simulating systems with multiple polymer chains representative of a coacervate becomes computationally expensive and the sampling of the relevant conformations becomes problematic.16–18 A recent exception is the study by Mintis and Mavrantzas,19 who used a free energy based approach with an explicit solvent all-atom model to study the phase diagram of short acrylic polymer fragments. Neglecting the solvent degrees of freedom remedies this problem to some extent,20,21 but the importance of explicit water in simulating polyelectrolyte complex formation was demonstrated in a number of cases.22–25 Given the limitations of all-atom models, the use of coarse-grain (CG) models is gaining popularity in this field.26 In coarse-graining, groups of atoms are represented by effective interaction sites, drastically reducing the number of particles.27 CG MD allows to describe the diffusive and complexation behavior of all constituents of the coacervate phase at a length scale large enough to have phase separation. This has been demonstrated by several groups, utilizing a one bead per amino acid CG model to study a variety of condensed liquid biomolecular systems.28–34 Although these CG models offer important insight to the physicochemical driving forces of liquid–liquid phase separation, their ability to represent fine chemical details of the constituents is limited. Inherent to their limited resolution, they are more generic in their predictions.
To alleviate this problem, here we use the recently re-parameterized CG Martini force field (version 3) that offers a higher (near-atomic) resolution. Martini maps 2–4 heavy particles to 1 CG bead and is based on oil/water partitioning free energies, as well as miscibility data to describe the various hydrophobic and hydrophilic interactions.35 Previous simulations on poly(diallyldimethylammonium) and poly(styrene sulfonate) complexes with an older version of Martini have demonstrated that this CG approach can correctly deal with excluded volume effects, charge compensation and changes in the dielectric constant of water inside a coacervate.36 We demonstrate the accuracy of the model by reproducing the salt dependent coacervation of poly-L-lysine/poly-L-glutamate (pLys/pGlu) systems, which have been studied extensively by Tirrell, Perry and co-workers,17,20,30,36,37 and show the potential of the model by simulating the partitioning of ions and small nucleotides between the condensate and surrounding solvent phase. The advantage of the Martini model over other CG approaches, in addition to the chemical specificity, is the building block approach offering immediate compatibility of polyelectrolytes with other molecules parameterized using the Martini philosophy, such as lipids,38 proteins,39 sugars,40 nucleotides,41,42 ionic liquids43 and polymers.44 This provides a quick route to the simulation of extraction of valuable compounds in biotechnological applications, as well as to study prebiotic cell precursors and biomolecular condensates in current cells with realistic composition.
:
1 mixtures of these polymers with Npol = 30, coacervation occurs in sodium chloride solutions with concentrations below 0.40 M.36Fig. 2A shows snapshots of the simulated systems and Fig. 2B and C polymer–polymer radial distribution functions (RDFs) from simulations with six different salt concentrations, both below and above this threshold. Two different processes can be observed: firstly, at all salt concentrations positive and negative polymers pair to form dimers. Fig. 2B shows this clearly in the intense first peaks of the pLys–pGlu RDFs at 0.5 nm for all these six salt concentrations, revealing direct contact between the oppositely charged polymers' beads. However, a different behavior is observed for polymers with identical charges, as shown for pGlu–pGlu RDFs (Fig. 2C). As expected, the first neighbor shell (0.5 nm) is almost completely excluded because of charge repulsion, but an enrichment of polymer at larger distances (>1 nm) is clearly observed for the simulations at lower salt concentration, indicative of a large-scale phase separation. Simulations with 0.36 M NaCl concentration only show a negligible enhancement of polymer density compared to the bulk, while with 0.6 and 0.9 M the polymer chains are fully dispersed. Together with the snapshots from the simulations (Fig. 2A), these RDFs show that phase separation into a polymer-dense and a polymer-deficient phase occurs at sodium chloride concentrations of 0.17, 0.21 and 0.25 M, but not at concentrations above 0.36 M. To quantify the salt concentration at which coacervation is observed in our simulations, we calculated the maximum peptide cluster size, normalized by the volume of the system. The results are shown in Fig. 2D. The maximum cluster size seems to decrease with increasing the salt concentration and above 0.36 M it levels off. Taking the inflection point of the maximum cluster size curve as a rough estimate (Fig. 2D), we conclude that, in our simulations, the transition from coacervation to non-coacervation occurs around 0.3 M NaCl concentration. It should be noted that the coacervate formation is a dynamic and reversible process: when additional salt (1.3 M) is added to the coacervate structure formed at 0.17 M sodium chloride solution, the polymers redissolve into pairs over time, as shown in Fig. S2.† Our estimate of the transition point is consistent with the experimental threshold of 0.4 M,30 although it is important to point out that a fair comparison between the experimental and simulated critical salt concentration is difficult, due to a number of reasons. First, the polymer concentrations used in the simulations are much higher than those used in the experiments, to keep the simulations tractable (otherwise mostly water–water interactions would be simulated). Potentially, the miscibility transition depends on the polymer concentration. Unfortunately, the extent of this effect is not known for the system under investigation, but experimental data for similar systems (pHis/pGlu complex coacervate) point to a limited effect.30 Similar to the polymer length, an increase of 2 to 10 times the polymer concentration seems to promote a shift to modestly higher concentrations of salt, in the range of 0.2 to 0.5 M. Second, our simulation boxes are periodic, and limited in size, and therefore finite size effects are likely to play a role. To obtain a more accurate estimate of precise phase boundaries, Gibbs-ensemble simulations are a suitable choice, as demonstrated by several groups in the context of LLPS.19,31,34 However, such simulations are computationally rather demanding as they rely on Monte-Carlo moves, which is not trivial for polymer-based systems with near-atomic detail. Third, the exact transition point is rather sensitive to details of the force field. We noticed a shift of the transition point to higher salt concentration when using the final Martini 3.0 version instead of the beta release (Fig. S3†), which certainly improves the match with the experimental transition point. Keeping these limitations in mind, we conclude that our model captures the experimentally observed salt-dependency of the coacervation, at least at a semi-quantitative level.
![]() | ||
| Fig. 2 Salt-dependent coacervation. (A) Snapshots of the final state of 100 Lys30 and 100 Glu30 polymers after 20 μs of CG MD simulations at different salt concentrations. Color code is the same as in Fig. 1, water not shown. (B) pLys–pGlu RDFs at different salt concentrations. (C) pGlu–pGlu RDFs at different salt concentrations. (D) Maximum peptide cluster size at different salt concentrations normalized by the volume of each system (for a cut-off of 0.6 nm, which is the largest distance to be considered as a cluster). | ||
To exemplify the effect of polymer concentration to some extent, we performed additional simulations at varying polymer/water weight ratio. We found that polymer concentration did not influence the coacervation as strongly as salt concentration. Simulations with 10, 50, 100 and 150 polymer chains of each charge (0.5–8% w/w) with 0.17 M salt concentration show aggregation in each case (except for the lowest concentration), with the polymer-rich phase taking up a considerable part of the simulation box at the highest polymer concentration (Fig. S4†). Priftis et al. show the formation of coacervate droplets from 0.1 wt% total polymer concentration. With further increase of the total polymer concentration more complex is formed due to the increase of the available charged sites.36
Another interesting observation is the broad width of the interface between the condensate and the aqueous solution. The maximum bulk density of the condensate is only reached at the very center of the phase, implying an interfacial width of around 20 nm. This suggests a low surface tension (γ) between the two phases. To quantify this, we computed the surface tension for the extended system based on the pressure differences between the lateral and perpendicular directions (see Methods). Although this is a slowly converging property, we observe convergence to a value of ∼3 bar nm on a time scale of 10 μs (Fig. S6†). This low surface tension is in reasonable agreement with experimental findings performed for the same system with 0.16 M NaCl concentration which report a value of γ = 7 bar nm.51
In a heterogeneous system, however, this analysis is problematic because of the computation of the particle average which mixes the dynamics of particles in different environments, hence, the MSD only offers an average behavior. An alternative pathway to gain insight into the particle dynamics is the computation of the Incoherent Scattering Function (ISF) that has been frequently studied in the field of supercooled liquids.54,55 The ISF contains information on how long a particle is correlated to a certain distance around its initial position. In case of homogeneous diffusion, the ISF decays exponentially, where the time scale is related to the diffusion coefficient.54,56 One can observe from Fig. 4 that for a salt concentration of 0.60 and 0.90 M, the shape of the curve can be reasonably well described by a single exponential decay. For the systems with 0.17 and 0.36 M salt concentration, a single exponential function (light green) clearly fails to describe the shape of the ISF. Instead, two distinct regimes can be observed: a fast decay that corresponds to the water beads in the bulk and a slow decay that represents the water beads in the coacervate. In order to extract the diffusion coefficients from the curve, we fitted the data points in these cases with a weighted sum of two exponential functions (illustrated with the pink line for 0.17 M salt concentration in Fig. 4), considering ISF curves obtained at different wave vectors q (Fig. S7†). The resulting diffusion coefficients are shown in Table 1. The decrease in diffusion coefficients upon increasing salt concentration can be rationalized by the increase of the salt concentration. This was proven by running simulations in pure water/salt mixtures in 0.17, 0.36, 0.60 and 0.90 M NaCl concentration (Table 1), which show similar diffusion coefficients as observed in the bulk phase of the coacervate systems. Within the coacervate phase, water diffusion is observed at a significantly reduced speed, about 6 times slower than in the bulk, in qualitative agreement with the experimentally observed slow-down.52 The reduced speed of the water molecules in the coacervate phase with respect to the bulk phase was further validated by calculating the short-time MSD of selected water molecules inside the coacervate and inside the bulk phase over a period of 1 ns simulation time for the biphasic system with 0.17 M NaCl concentration (Fig. S8B–D†). From the MSD analysis we found the diffusion coefficients for the bulk and coacervate phase to be Dbulk = (1.60 ± 0.10) × 10−9 m2 s−1 and Dcoac = (0.43 ± 0.04) × 10−9 m2 s−1 respectively (Fig. S8A†), consistent with the results reported in Table 1 given the approximate nature of the MSD analysis.
| Water–NaCl–peptides | Water–NaCl | ||
|---|---|---|---|
| C NaCl | D bulk | D coac | D bulk |
| 0.17 | 1.56 ± 0.01 | 0.26 ± 0.01 | 1.71 ± 0.01 |
| 0.36 | 1.47 ± 0.01 | 0.24 ± 0.02 | 1.36 ± 0.01 |
| 0.60 | 0.86 ± 0.05 | — | 0.96 ± 0.01 |
| 0.90 | 0.56 ± 0.02 | — | 0.62 ± 0.01 |
An interesting question is to what extent polymers inside condensates still display liquid-like properties. In order to gain insight into the dynamics of the peptides in our systems, we calculate the ISF pGlu in different salt concentrations and for different wave vectors q. As mentioned earlier, the ISF should decay exponentially in the case of a homogeneous diffusion. From Fig. 5 one can observe that the systems with 0.17 and 0.90 M NaCl concentration, the shape of the curves can be reasonably well described by a single exponential decay. In these systems most of the peptides are either in the coacervate phase (system with 0.17 M) or totally dissolved (system with 0.90 M), in consequence they will show a homogeneous diffusion. However, for the systems with 0.36 and 0.60 M salt concentration, a single exponential function fails to describe the shape of the ISF. Instead, two distinct regimes can be observed: a fast decay that corresponds to the peptides that are dissolved in the bulk and a slow decay that represents the peptides that are part of transient peptide clusters. In order to extract the diffusion coefficients from the curves, we fitted the data points for 0.17 and 0.90 M NaCl concentration with an exponential function (illustrated with the green lines in Fig. 5) and the data points for 0.36, 0.60 M with a weighted sum of two exponential functions (illustrated with the pink lines in Fig. 5), considering ISF curves obtained at different wave vectors q. For q value of ∼0.64 (i.e. a length scale lower than 2.8 particle diameters), we find the diffusion coefficients to plateau at values shown in Table 2. Experimental studies employing proton pulsed field gradient NMR have reported diffusion coefficients of poly(diallyldimethylammonium chloride) to be of the order of 10−12 m2 s−1 to 10−14 m2 s−1 inside a coacervate phase of poly(diallyldimethylammonium chloride) and the protein bovine serum albumin.57 Our peptides diffuse at the faster end of this range, and clearly are in a liquid-like state. The liquid properties and the dynamics of the polypeptides in our systems can be also seen in the Movie S1.†Table 2 shows that poly-Glu peptides diffuse less when they are in the coacervate phase, which is the system with 0.17 M NaCl concentration. By increasing the salt concentration the peptides start to dissolve, therefore the diffusion coefficient of pGlu increases. A much faster component is now also present due to peptides that diffuse as isolated entities. In case of the system with 0.9 M, which is homogeneous (Movie S2†), the dynamics is slowed down due to the formation of a gel-like network at the high peptide concentrations used in this study.
| C NaCl | D 1(pGlu) | D 2(pGlu) |
|---|---|---|
| 0.17 | 27 ± 1 | — |
| 0.36 | 39 ± 2 | 325 ± 7 |
| 0.60 | 38 ± 2 | 355 ± 6 |
| 0.90 | 33 ± 1 | — |
In order to study the partitioning of the RNA polyelectrolytes, we performed MD simulations of 10 μs of a phase separated pLys/pGlu coacervate system in the presence of single stranded RNA (ssRNA) molecules. We prepared three different systems, differing in the length of the ssRNA: 3-, 5-, or 10-mer, with 20 copies present in each case. All three systems were simulated at two different salt concentrations, 0.17 M and 0.25 M. Starting from randomized placements of the ssRNA in the solution phase, we observed the partitioning of the 20 oligonucleotides into the coacervate phase, at first on the surface and as the simulations proceeded, further inside the coacervate system. The movement of ssRNA into the coacervate phase is most evident for the system with 0.25 M salt, which can be rationalized by the closer vicinity to the phase transition point (estimated at 0.3 M, see above) implying a diffuser interface. Snapshots of the final configurations for the 3-mer ssRNA are shown in Fig. 6, similar results are obtained for the 5-mer and 10-mer ssRNA molecules (Fig. S9†), showing the preference of the ssRNA for the coacervate phase in line with the experimental observations on related systems.62 It should be noted that the systems have not reached full equilibrium. The slow dynamics of the polymers (see above) makes the diffusion of ssRNA in and out of the condensate a slow process. Experimental studies show that exchange of RNA molecules between coacervate droplets occurs on the second time scale.62 This phenomenon cannot be captured with our simulations which are in the order of microseconds.
In order to further quantify the extent to which the ssRNA molecules prefer the coacervate phase or perhaps the interface, we performed umbrella sampling simulations to obtain the free energy of transfer of an RNA molecule from the aqueous phase to the coacervate phase. The resulting profile (Fig. 6C) shows that the transfer of the RNA molecule to coacervate phase is more favorable than the molecule remaining in the aqueous phase by about 40 kJ mol−1. Furthermore, the broad interface does not show a minimum free energy demonstrating that the oligonucleotides do not remain localized at the surface of the phase but have a stronger tendency to move within its interiors.
Our results demonstrate that, using the Martini 3 force field, we can capture the experimental trends of this complex coacervate system. RDFs and cluster analysis, show that the coacervate formation is strongly affected by the ionic strength. In particular, we observe coacervation at a lower salt concentration than the experiments, but still in semi-quantitative agreement with the phase diagram.
Furthermore, we revealed that the coacervate phase remains well hydrated and ions are depleted from the coacervate phase. Remarkably, the interface between the condensate and surrounding solution is found to be very broad, of the order of 20 nm, with a low surface tension of γ = 3 bar nm consistent with experimental studies.
Additionally, our analysis showed that peptides, water and ions diffuse freely in these complex coacervate systems. A systematic analysis of the diffusion of water molecules, both inside the coacervate and the aqueous phase reveal that the diffusion coefficient for the bulk water is almost one order of magnitude larger than the diffusion coefficient of the water molecules inside the coacervate phase. This trend agrees with the reported experimental findings.
Increasing evidence suggest the localization of biomolecules, and especially nucleic acids, within coacervate droplets. In an attempt to study this phenomenon, we performed 10 μs simulations of single strand RNA molecules in biphasic systems with 0.17 M and 0.25 M NaCl concentration. Our simulations show the partitioning of all RNA molecules inside the coacervate phase. This is also evident from free energy calculations which signify a clear preference of the oligonucleotides for the interiors of the coacervate phase.
Taken together, our results demonstrate that we can capture the experimental behavior of the poly-lysine and poly-glutamate coacervate system with the Martini 3 force field. Consequently, the present explicit-solvent coarse-grain model of complex coacervates can be extended to gain physical insight on the mechanisms that drive the formation of membraneless organelles within cells and provide near-atomic resolution on the structural and dynamic organisation inside the coacervate phase.
Equal numbers of polymers (100 each unless specified otherwise, representing ∼5–7% w/w polymer) were randomly dispersed in a 30 × 30 × 30 nm3 periodic simulation box and solvated using normal Martini water beads. The required number of water beads was replaced with charged TQd and TQa ion particles to represent Na+ and Cl−, respectively. After steepest descent minimization, systems were equilibrated for 10 ns using a 10 fs time step and the Berendsen barostat. 20 μs production runs using a 20 fs timestep were carried out in the NPT ensemble, keeping temperature at 298 K using the v-rescale thermostat69 (τT = 1.0 ps−1) and pressure at 1.0 bar using the isotropic Parrinello–Rahman pressure coupling70 (τP = 12 ps−1). In accordance with the typical settings associated with the Martini force field, electrostatic interactions were treated using a reaction field approach (εrf = 15 beyond the 1.1 nm cut-off) and a shifted van der Waals potential, cut-off at 1.1 nm with the Verlet cut-off scheme.71 The neighbor list was updated every 20 steps and coordinates were saved every 200 ps. Reported salt concentrations are based on the final number of the water molecules and are calculated as C(Na+) = C(Cl−) = (number of ions × 55 mol L−1)/(final number of water beads × 4), small changes in polymer concentration are not deemed significant for the results. In Martini model, one water bead accounts for four water molecules.
In addition, extended systems were generated to compute the surface tension and measure density profiles by constructing a biphasic system of 600 Lys30 and 600 Glu30 polyelectrolyte chains in a 24 × 24 × 176 nm3 simulation box. In particular, accumulative average surface tension was calculated every 200 ns over a 16 μs length trajectory using the GROMACS tool gmx energy.64 This system requires to be equilibrated for long time (around 9 μs) since the surface tension shows huge fluctuations in the beginning of the equilibration but also after the equilibration. Negative values for the surface tension could mean that water tunnels are formed in the coacervate phase which connect the two sides of the aqueous phase. Low positive values for the surface tension indicate that the coacervate phase consists of a large amount of water molecules which form water bubbles inside the phase. RNA partitioning simulations were performed in a 22 × 22 × 74 nm3 simulation box consisting of 581 Lys30, 581 Glu30 polyelectrolyte chains and 20 ss-RNA molecules, whose bonded parameters were based on Uusitalo et al.41 The ssRNA molecules are composed by 3, 5 or 10 monomers of uracil.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/d1sc00374g |
| This journal is © The Royal Society of Chemistry 2021 |