Yiwei
Wang‡
,
Rongrong
Zou‡
,
Yeqiang
Zhou
,
Yi
Zheng
,
Chuan
Peng
,
Yang
Liu
*,
Hong
Tan
,
Qiang
Fu
and
Mingming
Ding
*
College of Polymer Science and Engineering, State Key Laboratory of Polymer Materials Engineering, Sichuan University, Chengdu 610065, China. E-mail: liuyang_leon@scu.edu.cn; dmmshx@scu.edu.cn; dmmshx@163.com
First published on 29th July 2024
Coacervates play a pivotal role in protein-based drug delivery research, yet their drug encapsulation and release mechanisms remain poorly understood. Here, we utilized the Martini model to investigate bovine serum albumin (BSA) protein encapsulation and release within polylysine/polyglutamate (PLys/PGlu) coacervates. Our findings emphasize the importance of ingredient addition sequence in coacervate formation and encapsulation rates, attributed to preference contact between oppositely charged proteins and poly(amino acid)s. Notably, coacervates composed of β-sheet poly(amino acid)s demonstrate greater BSA encapsulation efficiency due to their reduced entropy and flexibility. Furthermore, we examined the pH responsiveness of coacervates, shedding light on the dissolution process driven by Coulomb forces. By leveraging machine learning algorithms to analyze simulation results, our research advances the understanding of coacervate-based drug delivery systems, with the ultimate goal of optimizing therapeutic outcomes.
Since proteins generally carry electrical charges, it is preferable to use coacervates containing charged components for encapsulating proteins and preserving their biological activity. Polylysine (PLys) and polyglutamate (PGlu) have the ability to self-assemble into coacervates through electrostatic interactions in aqueous environments.26–29 Owing to their excellent biocompatibility and susceptibility to enzymatic breakdown within organisms, these polymers are promising candidates as protein-based drug carriers, potentially circumventing undesirable accumulation in the body.30–34 This represents an excellent model for investigating the encapsulation of proteins by coacervates. Sarah, L. P. et al. have shown that both the location of charged patches and the overall net charge of a protein are crucial factors in facilitating its encapsulation within PLys/PGlu coacervates.35 Owing to the charges present in the proteins, the order in which various components are introduced to the system could potentially affect their initial contact with specific charged poly(amino acid)s, thereby influencing the encapsulation process. This implies that the system might undergo kinetic trapping, rendering it difficult to achieve the lowest thermodynamic state.36 In addition, the charged nature of the protein and the sequence in which components are added can also impact the secondary structure of the poly(amino acid)s,37 affecting the phase behavior of the coacervate. Matthew Tirrell et al. have shown that coacervates could be formed when at least one polypeptide in the PLys/PGlu mixture is racemic, disrupting hydrogen-bonding networks in the backbone and leading to the formation of a coil secondary structure. In contrast, pairs of purely chiral polypeptides, regardless of their chirality, adopt β-sheet conformation and aggregate into more compact precipitates.38 Therefore, we speculate that the sequence of ingredient addition and the secondary structures of coacervate components could have a significant impact on protein encapsulation.
In this study, we first investigated the P-L-Lys/P-D,L-Glu coacervate with a random coil secondary structure, referring to the research of Tsanai and Black et al.22,39 Martini 3.0 simulations were utilized to model the formation of these coacervates and investigate the mechanisms underlying the encapsulation and release of various proteins within these coacervates. While all-atom (AA) simulations face challenges in observing coacervation phenomena due to temporal and spatial scale limitations,40 the Martini 3.0 coarse-grained model41 offers detailed molecular insights and significantly speeds up simulations by 4–10 times. As a result, it has been increasingly applied to simulate various biomolecular systems in recent years, e.g. Lys/Glu-based coacervate,39,42–44 peptides/proteins,45 nucleic acid,46 polymers,47,48 coacervate/membrane,49etc. However, none of these simulations have specifically addressed the encapsulation and release of proteins in coacervates. The density-based spatial clustering of applications with noise (DBSCAN)50 and principal component analysis (PCA)51,52 were employed to analyze and characterize the formation of the coacervate, as well as to identify the dynamic variations in polymer chain movements across different systems. Notably, the DBSCAN machine learning method has been used to analyze the state of coacervates.53 Here, we reveal that the sequence in which ingredients are added significantly affects the encapsulation efficiency of bovine serum albumin (BSA) within the coacervates. This effect is primarily due to differences in the contact between the protein and the oppositely charged poly(amino acid). Introducing the poly(amino acid) component with an opposite charge to that of the proteins as the initial step during coacervation enhances the encapsulation efficiency of protein-based drugs within the coacervates. This result could be extended to other protein systems, such as lysozyme, saporin, actin and enhanced green fluorescent protein (EGFP). Additionally, we also used P-L-Lys/P-L-Glu coacervates with β-sheet secondary structure and conducted pioneering research on the impact of secondary structures on protein encapsulation mediated by coacervates. Coacervates with β-sheet secondary structures exhibited superior protein encapsulation efficiencies compared to those with coil systems, due to the lowered entropy and reduced flexibility of polymer chains in the β-sheet systems. Furthermore, through DBSCAN, we achieved protein release from coacervates under neutral to acidic conditions, which was driven by charge.
Fig. 1 (a) Martini 3.0 CG models and chemical structures of PLys and PGlu with a degree of polymerization of 30 in simulation systems. (b) Protein encapsulation percentage by coacervates formed by different ingredient adding sequence as function of BSA content. The time-changed values are presented in Fig. S2.† (c) Contact number of PLys/BSA (solid line) and PGlu/BSA (dashed line) in various systems (normalized by BSA number). Values that change over time are displayed in Fig. S4.† (d) Contact map illustrating the residue-based contacts between PLys and BSA in 0.4BSA-LG-Coil system. The results of PLys/BSA and PGlu/BSA in other systems are presented in Fig. S5 and S6.† (e) Contact number between PLys and PGlu in various systems. Values that change over time are displayed in Fig. S9.† (f) SASA of coacervates formed by different ingredient adding sequence as BSA protein content change. The time-changed values are presented in Fig. S10.† (g) The PC1 and PC2 motion of PLys in 0.4BSA-LG-Coil system. Each illustration represents a movement of PLys from blue to white to red over time, with the mean position represented in gray. For PC1, PC2 motions of PLys and PGlu in LG, GL and SIM systems, refer to Fig. S12–S14.† (h) The PC1 and PC2 corresponding principal component projection values of PLys in various systems (0.4 BSA). The specific values of PLys and PGlu in other systems are shown in Fig. S15, S16, S18 and S19.† (i) The Schilitter entropy of PLys and PGlu in various systems with different ingredient adding sequence on coacervate formation (0.4 BSA). The numerical values of PLys in other systems and PGlu are presented in Fig. S17, S21 and S22.† |
Moreover, we focused on the characteristics of the coacervates in LG, GL and SIM systems and found that the contact number of PLys/PGlu in all systems remained constant regardless of variations in BSA concentration (Fig. 1e, the specific values as functions of time are shown in Fig. S9†). This suggested that the stability of the coacervates was not influenced by changes in BSA concentration. In addition, the contact number between PLys and PGlu in the LG sequence were consistently lower than GL and SIM systems (see Fig. 1e). This could be attributed to the higher PLys/BSA contact in LG systems (Fig. 1c), which consequently reduced the PLys/PGlu contact due to steric hindrance. What's more, the solvent-accessible surface area (SASA) of the coacervates formed under the LG sequence was higher than those of the GL and SIM systems (Fig. 1f, S10a and c†). Analysis of the number of clusters also revealed that there are the highest number of coacervate clusters after equilibrium in LG systems (Fig. S49†).
Therefore, coacervates formed in LG systems were smaller and more dispersed than those in other systems.
To gain a comprehensive understanding of the behavior of PLys and PGlu chains during the coacervation and protein encapsulation processes in LG, GL and SIM systems, we applied the PCA machine learning algorithm and calculated the Schilitter entropy.54,55 These computations allowed us to examine the state of individual PLys and PGlu chain, classify their structures into various subcategories based on the covariance matrix, and evaluate the accessible configurational state space of amino acid chains. We investigated the effect of BSA concentration and ingredient addition sequence on coacervate formation on the motion pattern of peptide chains, respectively. In Fig. 1g, principal components (PC1, PC2) were ordered based on their eigenvalues (see Fig. S14†), with PC1 and PC2 representing the open/close and torsion motions of both PLys and PGlu in all systems (as detailed in Fig. S15–S17†). The BSA content had minimal impact on the PC1 and PC2 projection, indicating similar motion patterns of poly(amino acid)s (Fig. S11 and S12†), which aligned with the observed similarity in Schilitter entropy (Fig. S13†). When considering the ingredient adding sequence, PC1 projections showed similarities in LG, GL, and SIM systems, indicating that the sequence had limited impact on the PC1 motion of poly(amino acid) chains (Fig. 1h and S18–S19†). This observation was consistent with our discovery that end-to-end distances were similar in LG, GL, and SIM systems (see Fig. S20†). However, we detected differences in distributions of PC2 projections for PLys chains in LG, GL and SIM systems, although they were all Gaussian distributions (see Fig. 1h and S18†). In the LG system, the PC2 distribution of PLys was narrower than those in GL and SIM systems, indicating that its twisting was restricted. This was in agreement with the Schilitter entropy analysis, which showed that PLys in the LG system had lower entropy values due to its limited rotation, reducing the degree of disorder in the chain motion (Fig. 1i and S21†). This indicated that, compared to other adding sequence systems, the overall mobility of PLys around BSA was more strongly restricted, making it difficult to twist and tightly binding to the protein surface. As a result, the PLys/BSA contact and BSA encapsulation rate in LG systems were higher than the other two systems.
Non-reactive molecular dynamics we used is based on the ergodic hypothesis, which states that since there is a one-to-one correspondence between the initial conditions of a system and the state at some other time, averaging over a large number of initial conditions is equivalent to averaging over time-evolved states of the system. Given sufficient time, a system can explore all thermodynamic states located on this surface.56,57 Ideally, any initial configurations should converge to the global minimum on the free energy surface. However, in experiment, the procedure for adding ingredients affects the final results due to the constrained experimental time, leading to being captured in kinetic traps. Angela Valbuzzi found that Anti-TRAP(AT) inhibition of the RNA with trp RNA-binding attenuation protein interaction is determined kinetically. And whether AT first exists in the system will significantly affect the binding results of AT and RNA.58 The different addition sequences of I-catalysts significantly impact the rate of the ascorbic acid iodine clock reaction.59 The more limited simulation time makes it harder to sample all the state points on the free energy surface. This is equally effective for our systems. Despite being a metastable state, it plays a crucial role in fields such as catalysis and drug delivery, necessitating thorough investigation.
Fig. 2 (a) The snapshots of the simulations in saporin-GL, lysozyme-GL, EGFP-LG and actin-LG systems, and the simulation snapshots of all systems are shown in Fig. S23.† Orange represents PLys, green represents PGlu, red represents saporin, purple represents lysozyme, pink represents EGFP, and light blue represents actin. Water and ions are not shown for clarity. (b) Protein encapsulation percentage in coacervates formed by different ingredient adding sequence. All P values were < 0.05, so the encapsulation rates for different ingredient adding sequence are significantly different. The time-dependent values are presented in Fig. S27.† (c) Contact number of PGlu/protein and PLys/protein in various systems. Values that change over time are displayed in Fig. S28.† (d) Contact maps illustrating the residue-based contacts between PLys and saporin in LG, GL and SIM systems. The results of PLys/protein and PGlu/protein in other systems are presented in Fig. S29–S32.† |
We conducted separate calculations for the number of contacts of PLys/protein and PGlu/protein within distinct coacervate systems (Fig. S28†), accompanied by the generation of their corresponding contact maps (Fig. S29–S32†). Similar to BSA protein systems, altering the adding sequence of ingredients in coacervates resulted in variations in poly(amino acid)/protein contact numbers, consequently affecting the loading efficiency.
Significant differences were noted in number of PLys/protein contacts for the negatively charged EGFP and actin proteins when exposed to different ingredient adding sequences during coacervate formation (see Fig. S28†). Specifically, the LG sequence resulted in the highest number of contacts (Fig. 2c). In contrast, for saporin and lysozyme proteins, characterized by an overall positive charge, number of PGlu/protein contacts exhibited significant differences, with the GL sequence yielding maximum contacts (Fig. 2c and S28†). These findings were consistent with outcomes of the contact maps analysis that the LG sequence for EGFP and actin proteins displayed the darkest colors in the contact map compared to the other two sequences, while the GL sequence for saporin and lysozyme proteins exhibited the deepest colors (Fig. 2d and S29–S32†). For example, in the contact maps of saporin with PGlu, it can be observed that when saporin residues are near positions 25, 60 or 80, the contact maps under the GL sequence are notably more intense (redder) compared to those under LG and SIM sequences (Fig. 2d). Consequently, in GL sequence systems involving positively charged proteins, the initial addition of PGlu resulted in strong electrostatic interactions, leading to a higher rate of protein encapsulation. Similarly, for negatively charged proteins, the initial incorporation of PLys in the LG sequence enhanced the rate of protein encapsulation within coacervates. It was worth noting that in all protein systems, the highest contact point between proteins and poly(amino acid)s was located on the charged patches of the protein surface (Fig. S8 and S28–S32†). Therefore, the adding sequence of ingredient during coacervate formation played a crucial role in influencing protein encapsulation, which was determined by the overall charge carried by the protein itself, along with patches of charged residues on the protein surface (Fig. S8†). The experiment conducted by Sarah L. Perry, et al. supported our conclusions.35 By manipulating the sequence in which poly(amino acid)s and proteins were added into systems, we could leverage electrostatic interactions to control protein encapsulation, achieving systems with high encapsulation efficiencies to enhance the therapeutic effectiveness.
Fig. 3 (a) Protein encapsulation percentage by coacervates as function of BSA protein content. The time-changed values are presented in Fig. S33c.† (b) RDF between coacervates (including both PLys and PGlu) and protein under different BSA content. (c) Contact number between different components in the systems. Values that change over time are displayed in Fig. S34 and S37.† (d) Contact map illustrating the residue-based contacts between BSA and PLys in the 0.2BSA-LG-β-sheet system. The results of other systems are presented in Fig. S35 and S36.† (e) NCluster of coacervates as BSA protein content change. The time-changed values are presented in Fig. S33g.† (f) The highlight of residues in BSA that have the highest contact frequency with PLys in the 0.2BSA-LG-β-sheet system. (g) The PC1 and PC2 motion of PLys and its corresponding principal component projection values in various secondary structural systems (0.2BSA). The middle illustrations represent the movement of PLys from blue to white to red over time, with the mean position represented in gray. For specific values and PC1, PC2 motion of other systems with different BSA content, refer to Fig. S38–S40.† (h) The Schilitter entropy of PLys in different secondary structural systems (0.2BSA). The numerical values of PLys in other systems and PGlu are presented in Fig. S42 and S43.† (i) Difference between extreme values of PLys Gibbs free energy. The numerical values of PLys and PGlu are presented in Fig. S44 and S45.† |
In order to investigate mechanism behind the encapsulation process, we computed the contact number of PLys/BSA and PGlu/BSA in both secondary structures (see Fig. 3c and S34†). Our results indicated that contact numbers between PLys and BSA, as well as between PGlu and BSA were higher in β-sheet systems than those in coil systems. This difference was particularly pronounced in the contact number between PLys and BSA due to their opposite charges. Furthermore, we generated contact maps for PLys/BSA and PGlu/BSA based on residue interaction in Fig. 3d, S35a–e and S36a–e.† We also highlighted top 20 residue pairs with the highest contact frequency in interactions between BSA protein and PLys (exemplified by the 0.2BSA-LG-β-sheet system) in Fig. 3f, and PGlu in Fig. S36f.† Apart from higher contact frequency between PLys and BSA protein, our findings indicated that most of contacts made by PLys were with Glu and Asp residues in BSA protein, while PGlu contacts were primarily made with Lys and Arg residues. These contact pairs possessed opposite charges. Therefore, the higher engulfment efficiency of β-sheet systems was primarily attributed to the higher contact between PLys and BSA, particularly involving Glu and Asp residues in the BSA protein. Moreover, we characterized the state or configuration of coacervates themselves. The number of cluster (NCluster) analysis results indicated that β-sheet systems ultimately formed fewer coacervate clusters (Fig. 3e and S33g†). And PLys/PGlu contact numbers were higher in β-sheet systems regardless of number of BSA protein numbers presented in the system (Fig. 3c and S37†). These results indicated that coacervates formed by PLys/PGlu in the β-sheet secondary structure were larger in size compared to those formed by poly(amino acid)s in coil systems (Fig. 3a and S2a†).
The behavior of PLys and PGlu during coacervation and protein encapsulation processes were analyzed using the PCA machine learning algorithm and Schilitter entropy calculations. From PCA, we obtained main motion patterns of PLys and PGlu chains during the coacervation and protein encapsulation processes, and it could be seen that PC1 and PC2 components were ranked according to their eigenvalues (Fig. S38a and S39a†), with PC1 and PC2 representing open/close and torsion motions of both PLys and PGlu in all systems (see Fig. S40†). Distributions of PC1 and PC2 projections for both PLys and PGlu chains were observed to remain unchanged regardless of the amount of BSA present, and both PC1 and PC2 adopt Gaussian distributions regardless of the secondary structure system (Fig. 3g, S38 and S39†). Moreover, our results demonstrated that the projection value distribution in β-sheet systems was considerably wider in comparison to coil systems, irrespective of whether it was PC1 or PC2, especially for PLys. This suggested that the amino acid chain within the β-sheet system exhibited a greater level of expansion and distortion. This was consistent with calculation results of the end-to-end distance analysis, where lengths of PLys and PGlu were larger in β-sheet systems (Fig. S41†). This also provided a reason for the larger SASA of whole coacervates in β-sheet systems (Fig. S33d and f†). Additionally, we calculated the Schilitter entropy for both PLys and PGlu chains, the entropy was lower in β-sheet systems than coil systems, indicating a smaller configurational space (Fig. 3h, S42 and S43†). We also computed the 2D Gibbs free energy landscape for both PLys and PGlu chains in different secondary structures as functions of PC1 and PC2, (Fig. 3i, S44 and S45†). We observed a more pronounced difference between extreme values of Gibbs free energy for both PLys and PGlu in β-sheet systems compared to coil systems. This indicated that there was greater restriction on extreme movements of poly(amino acid)s in β-sheet systems. Based on these analyses, it could be speculated that the movement of the poly(amino acid) single chain was confined to a broader structure in β-sheet systems, resulting in the formation of larger coacervates and leading to enhanced efficiency in encapsulating BSA.
Fig. 4 (a) Up: snapshots of the final states of 0.05BSA systems after about 8 μs of CG MD simulations at different pH values. Down: the DBSCAN algorithm clustering models. The complete snapshot is shown in Fig. S45a.† (b) Chemical structures and Martini 3.0 CG models of PGlu before and after protonation. (c) The variation curve of the NCluster and SASA with changes in pH value. The dashed line represents SASA, the solid line represents NCluster. Values that change over time are displayed in Fig. S46a and b.† (d) The contact number of PLys and PGlu after equilibrium in different pH systems. Values that change over time are displayed in Fig. S46c.† (e) Linear correlation between Coulomb potential and contact number between PLys and PGlu at different pH values. (f) The change in pH affects the quantity of contact in PLys/BSA and PGlu/BSA. Values that change over time are displayed in Fig. S46d and e.† |
To quantify changes in the configuration of coacervates under different pH conditions, we employed the DBSCAN machine learning algorithm to determine the phase separation level of coacervates within the system, as illustrated in Fig. 4a, S46a and ESI Movies 1–4.† Our findings revealed that both PLys and PGlu displayed notable aggregation effects at pH 5 and higher, forming continuous and dense coacervates. However, as the pH decreased, coacervates appeared fragmented, resulting in an increased number of clusters detected by DBSCAN clustering models, as shown in Fig. S46c.† Consequently, condensed droplets began to decompose as the pH decreased. This phenomenon was also confirmed in the contact number, NCluster, and SASA of different coacervate components. As the pH value decreased, there was an elevation in the NCluster and SASA of coacervate droplets (Fig. 4c and S47a and S47b†), accompanied by a reduction in the contact number between PLys and PGlu (Fig. 4d and S47c†). Furthermore, the pH value of approximately 4 exhibited the most significant changes in contact number, number of clusters, and SASA, indicating a potential phase transition point. Notably, this pH transition point between 3 and 5 aligned with experimental observations.22 To investigate the mechanism responsible for the degradation of coacervates, we quantified the Coulomb potential between PLys and PGlu in Fig. S47f.† It was observed that an increase in the Coulomb potential resulted in a linear increase in the PGlu/PLys contact number (Fig. 4e). Thus, the disintegration of coacervates was attributed to the decrease in electrostatic interaction between PLys and PGlu.
Interestingly, despite observing the disruption of coacervates as the pH value decreased, we noticed an increase in the contact between BSA and poly(amino acid)s (Fig. 4a and S47a†). To quantify this phenomenon, we computed the contact number of PGlu/BSA and PLys/BSA at different pH levels (Fig. 4f, S47d and e†). The PGlu/BSA contact increased with the disintegration of coacervate droplets (decrease of pH value), while the PLys/BSA contact remained relatively stable. This was because at high pH levels, both PGlu (prior to protonation) and BSA carried negative charges, which led to a strong repulsion between PGlu and BSA. As the pH level decreased, the protonation degree of PGlu increased, diminishing the repulsive force between PGlu and BSA, resulting in an increased contact frequency. Furthermore, within the pH range of 2–7.4, the protonation state of PLys remained constant, resulting in minimal changes in the contact number between PLys and BSA. It should be noted that at low pH levels, the BSA protein was primarily surrounded by a single layer of poly(amino acid)s, which was no longer in a coacervate state. Therefore, even though the coacervate dissolved at low pH, BSA protein was still wrapped by some PGlu and PLys chains, which could not be captured in experiments.22
Furthermore, We investigated the pH sensitivity of coacervates and applied machine learning algorithms to analyze the dissolution process of coacervate from neutral to acidic environments. This process was found driven by Coulomb interactions between PGlu and PLys. In many disease microenvironments, such as inflammation and tumors, the pH value is lower and levels of reactive oxygen species (ROS) are higher compared to the normal cellular environment.71–73 Plys/PGlu polyelectrolytes are in a coacervate state in the cellular environment, whereas in disease microenvironments, protonated polyglutamic acid (PGlu+H) causes the dissolution of the coacervate, thereby achieving specific drug release. Therefore, our comprehensive study significantly enhances the understanding of pH sensitive coacervate mechanisms governing the encapsulation of pharmaceutical substances, and drug release within disease microenvironments. This maximizes the therapeutic effects of encapsulated agents and offers guidance for the clinical application of coacervates in drug delivery systems. However, the mechanisms by which redox-responsive coacervates form, encapsulate proteins, and penetrate the cell membrane to enter cells remain unclear, which are already included in our future research plan.
Principal Component Analysis (PCA) was applied to reduce dimensionality and extract motion characteristics of poly(amino acid) chains. We initiated the process by standardizing the dataset, ensuring that each feature has a mean of 0 and a standard deviation of 1. The covariance matrix of the standardized data was computed to assess the correlations between features. Following this, we determined the optimal directions for data transformation by solving for the eigenvalues and eigenvectors of the covariance matrix that we selected the eigenvectors corresponding to the top 35 largest eigenvalues to form the transformation matrix. We finally project the original motion data into a new coordinate system, completing the dimensionality reduction of the data.
Density-Based Spatial Clustering of Applications with Noise (DBSCAN) was applied in our work, we calculated the truncation radius between PLys and PGlu based on their position of the first valley in RDF and selected it as the neighborhood radius Epsilon ε = 0.6 nm (Fig. S3, S26, S33 and S48†). In order to ensure that each cluster contains at least one PLys and one PGlu, and since the content we feed is three-dimensional coordinates, we assigned a minimum point value (MinPts) of 4. Both ε and MinPts significantly influence the clustering results of DBSCAN. Subsequently, we fed the coordinates of all PLys and PGlu beads as data points and calculated the number of neighboring points within the ε neighborhood for each data point. If a data point had at least 3 neighboring points within its ε-neighborhood, we marked it as a core point. We then identified all points that were directly density-reachable, forming a cluster. This process continued until no new points could be added to any cluster. Finally, the number of clusters was calculated. We implemented this process through Python.
For more details, please refer to the ESI methods.†
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4sc03061c |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2024 |