Bin Wang†
a,
Tianren Zhang†
ab,
Sirui Shen
a,
Darrin J. Pochana,
Jeffery G. Saven*b and
Kristi L. Kiick
*ac
aDepartment of Materials Science and Engineering, University of Delaware, Newark, DE, USA. E-mail: kiick@udel.edu
bDepartment of Chemistry, University of Pennsylvania, Philadelphia, PA, USA. E-mail: saven@sas.upenn.edu
cDepartment of Biomedical Engineering, University of Delaware, Newark, DE, USA
First published on 25th April 2025
Intrinsically disordered proteins (IDPs) yield solutions with tunable phase transition behavior and have been widely applied in designing stimuli-responsive materials. Understanding interactions between amino acid residues of the IDP sequence is critical to designing new IDP-based materials with selective phase behavior, assembly, and mechanical properties. The lack of defined structure for this class of proteins complicates accurate prediction of their molecular-scale behavior. In this review, recent progress is presented in the development and application of simulation methods to describe the behavior of IDPs. Results for elastin-like polypeptides (ELPs) and resilin-like polypeptides (RLPs) are highlighted, focusing on studies that compare simulation results with experimental findings.
Design, System, ApplicationIntrinsically disordered proteins (IDPs) yield solutions with tunable phase transition behavior and have been widely applied in designing stimuli-responsive materials. Understanding interactions between amino acid residues of the IDP sequence is critical to designing new IDP-based materials with selective phase behavior, assembly, and mechanical properties. The lack of defined structure for this class of proteins complicates accurate prediction of their molecular-scale behavior. In this review, two IDPs, elastin-like polypeptides (ELPs) and resilin-like polypeptides (RLPs), and recent development of simulation methods for understanding and predicting important properties of these IDPs such as conformational features, polymer properties, and phase transition behavior are highlighted. The comparisons between simulation and experimental results are included to reveal the advances and current limitations in IDP simulations. Several examples of successful simulation of peptide systems integrating IDPs and ordered domains demonstrate the potential to design advanced biomaterials with desired and versatile properties through the assistance of simulation. |
On the other hand, intrinsically disordered proteins (IDPs) are characterized by the presence of intrinsically disordered regions that lack well-defined three-dimensional (3D) structures.11 The disordered nature of IDPs has been shown to be important in biological processes including transcription, translation, signal transduction, and cell-cycle control.12 The abilities of IDPs to undergo tunable, stimuli-responsive assembly as well as promote liquid–liquid phase separation (LLPS) have been successfully applied in designing stimuli-responsive materials including drug delivery carriers, hydrogels, and nanoparticles.13 IDPs can also be versatile building blocks for new materials and have been exploited to guide supramolecular self-assembly and materials properties via the architecture and sequence of fused IDPs.14
Elastin-like peptides (ELPs) are derived from the hydrophobic domain of tropoelastin, a key extracellular matrix protein responsible for the elasticity of many tissues. ELPs are one of the most widely explored and earliest-studied IDPs. ELPs exhibit unique thermoresponsive behavior, making them highly versatile in various biotechnological applications, particularly in drug delivery15,16 and tissue engineering.16,17 Derived from the repetitive amino acid sequences found in elastin (e.g., [VPGXG]n, X is guest residue which can be any amino acid except proline, n is the number of repeat units), ELPs undergo reversible phase transitions in response to changes in temperature, typically characterized by the lower critical solution temperature (LCST).18 This tunable thermoresponsiveness has supported the development of dynamic systems for targeted delivery and controlled assembly. Recent advances in the design and functionalization of ELPs have focused on tailoring their molecular architectures, such as varying chain length,19,20 incorporating guest residues,21,22 and conjugating them with other functional domains.23–25 These modifications aim to enhance ELP properties, including stability and responsiveness.26 By understanding molecular mechanisms governing ELP behavior, researchers can better design ELP-based systems with tunable properties for a broad range of applications.
Resilin is another IDP that serves energy storage functions in insects27 and has been used both as a model for understanding IDP behavior and as a candidate for biomaterials, drug delivery vehicles, and actuators.28–30 Crosslinked resilin forms a thermal stable rubber-like material with low stiffness, high extensibility, high resilience, and long fatigue times. The protein is prevalent in the mechanically active exoskeleton of insects where it serves as a high-frequency responsive elastomer.28,31 Inspired by resilin, resilin-like polypeptides (RLPs) have been designed and recombinantly synthesized in E. coli, based on the putative pro-resilin sequence from the first exon of the Drosophila melanogaster CG 15920 gene.32 More recently, resilins from various insects, including mosquitos, fleas, and buffalo flies, have been produced recombinantly and used in hydrogels, stimuli-responsive materials, and as a bioactive component in hybrid materials.28,30,33,34 By exploiting protein engineering techniques, bioactive and structural domains have been successfully fused with RLPs. RLP-based polypeptides with tunable mechanical properties, versatile assembly into nano- and micro-structures, and high biocompatibility have been produced for biomedical and materials applications.35–39
Various IDP-based materials, especially ELP/RLP-based materials, have been designed and applied as smart biomaterial platforms, leveraging their reversible phase transition behavior, tunable phase transition temperatures, and high biocompatibility.13 Opportunities remain in fine-tuning their physicochemical properties and on-demand phase separation, which require a better understanding of how sequence can be used to control these properties. Compared to other polymeric materials, however, the large numbers of potential sequences involving a wide range of amino acids provide not only diversity in tuning properties but also difficulty in predicting resulting behavior.40 In addition, the variety of short- and long-range interactions possible between amino acid residues (e.g., hydrophobic interactions, electrostatic interactions, π–π interactions, among others), coupled with the absence of well-defined structures in IDPs, hinder the precise characterization of residue interactions within IDP-based assemblies.8,41,42
To understand their association behavior and address key interactions in IDPs, a variety of computational methods have been developed to aid in the understanding of the complex assembly behavior of these polypeptide-based materials.43–45 Molecular dynamics (MD) simulations and related computational methods have been extensively utilized as a complement to experimental studies to better understand LLPS, self-assembly, and intermolecular interactions in IDP-based materials.46–49 Depending on the length and time scales of the modeled systems, both all-atom simulations and coarse-grained (CG) models have been widely employed. All-atom simulations leverage force fields, such as CHARMM,50,51 AMBER,52–54 or OPLS,22,55 and advanced sampling techniques, such as replica exchange molecular dynamics (REMD),56,57 metadynamics, or tempering-based methods,58 to overcome the limitations of standard MD in exploring rugged energy landscapes. Notably, several studies have combined advanced sampling techniques to improve the exploration of IDP conformational landscapes. For instance, hybrid methods such as replica exchange molecular dynamics integrated with metadynamics (REMD-MetaD) have been applied to overcome free energy barriers more effectively to enhance the sampling of free energy landscapes in peptide folding studies.59 Similarly, bias-exchange metadynamics, where multiple replicas, each biased along a different collective variable, periodically exchange configurations, has been successfully applied to capture both global and local conformational dynamics in protein folding.60 Those simulations and various sampling techniques provide molecular-level insights into structure, energetics, interactions, dynamics, and conformational fluctuations, and these techniques have been insightful in studying IDP-based materials, enabling high-resolution analysis of molecular features and the complex dynamics of these systems.47,53,54,61–67
While all-atom simulation provides a powerful way of understanding IDP and related materials, the study of LLPS and complex large-scale aggregates has been limited by the significant computational demands of all-atom simulation. To overcome these limitations, CG models, with their reduced numbers of degrees of freedom, have been developed for simulating systems at larger length and time scales but with lower molecular resolution. Various CG models,68–70 ranging from simplified polymer models to the carefully parameterized Martini CG models, have been successfully applied to biomolecular systems.57,71–77 Polymer models, such as bead-spring models, lattice models and DPD methods, are particularly useful for capturing the overall characteristics of biopolymers, such as chain conformation and assembly of nano- and micro-scale structures, while reducing computational cost and focusing on key interactions.55,78–89 However, the reduction in computational cost provided by these models comes at the expense of detailed molecular accuracy. For IDPs, assembly and LLPS is driven by complex noncovalent and often weak interactions between multiple amino acids, which are often too intricate for simple polymer models to accurately capture.53,90–95 To address this limitation, hydrophobicity scale (HPS) models that parameterize amino acid interactions without explicit solvent have been developed.57,96–99 These models allow for a more refined representation of the specific interactions that govern assembly and phase separation associated with IDPs. While HPS models reduce computational expense relative to all-atom simulations, they do so at the cost of lower molecular detail, potentially overlooking solvent-mediated effects and finer interaction nuances critical to IDP behavior. In addition, the Martini CG model, which employs a coarse-grained explicit solvent, has been modified to study IDP-based materials.100–103 These approaches allow for modeling of the specific interactions that govern assembly and phase separation associated with IDPs. Nonetheless, both Martini models and HPS models enhance computational feasibility and efficiency, although they inherently introduce approximations that can affect the accuracy of the simulated thermodynamics and kinetics, and thus require rigorous calibration and validation.
Furthermore, hybrid simulations combining all-atom and CG models have emerged as an effective approach to studying IDP-based materials, taking advantage of the detailed accuracy of all-atom representation and the computational efficiency of CG models. For instance, the formation and dynamics of the LLPS of IDPs has been studied at the CG level, while key interaction motifs or regions responsible for the phase-separation process can be modeled at the all-atom level for detailed structural insights.53,54,104 However, hybrid simulations present their own challenges, such as maintaining a consistent interface between all-atom and CG representations, addressing parameterization differences, and striking the right balance between computational efficiency and the required level of detail. The integration of these approaches requires careful optimization to ensure that the benefits of broader sampling and reduced computational expense do not compromise the structural and dynamical accuracy of key interaction motifs.
Although IDP sequences have been studied experimentally for decades in a range of different materials, simulation models for IDPs have only been more recently developed for characterizing their association, conformational properties, and interactions. Below, some of the recent progress in the simulation of specific IDPs and their conjugates with ordered assembly-promoting sequences (e.g., collagen-like peptides and coiled-coil peptides) is presented. The emphasis is on how these methods can aid both in interpreting experimental observations from a molecular perspective and in understanding features of IDP-based materials, including transition temperatures, physical properties, residue interactions, and assembled structures.
More recent simulations of ELPs have focused on both simulating the molecular properties near the phase transition and guest–residue interactions. Hydrophobic interactions have been regarded as the main driving force of the phase transition, and researchers have shown the difference between multi-chain and mono-chain simulations.111,112 In substantial studies by Barrat et al., the investigation focused on molecular properties near the lower critical solution temperature (LCST) of ELPs using all-atom simulations of the sequence GVG[VPGVG]3;113 here the primary objective was to understand how intra- and inter-peptide interactions evolve below, near, and above the LCST. The role of peptide concentration on the structural and dynamic properties of aqueous solutions of short ELPs was explored over a temperature range near the LCST. Utilizing the AMBER99SB-ILDN114 force field for peptides and the TIP4P-D water model,115 ELP behavior was simulated in single-chain, two-peptide, and multichain systems. Because of the limited length of the peptide sequence, only a slight reduction in polymer size at the LCST was observed at the single-chain level; nevertheless, Markov state modeling116 revealed clear two-state behavior. In the two-peptide system, consistent with LCST behavior, the potential mean force (PMF) between the chains shifted with temperature. In multichain systems, the PMF was employed to characterize the interactions; the peptides experience a weak attraction between the chains at high temperature, as shown in Fig. 1a. By analyzing the potential of the mean force of the chains, it was shown that the peptides adopted more extended conformations compared to single-chain systems, where the peptides behaved more like ideal chains. This extension in the multichain systems was attributed to the presence of central valine residues that drive inter-chain contact formation. The study suggested that substituting these valine residues with more hydrophilic or weakly interacting amino acids could potentially suppress contact formation, which provided more detailed insight into residue interaction in ELP and associated LLPS and similar interactions have been observed by the Shaw group and the Pomes group.112,114 Overall, the simulations provided valuable data for ELP design and deeper insights into the mechanisms governing ELP behavior near the LCST.
![]() | ||
Fig. 1 a) Potential of mean force (PMF) w(Δr) as a function of distance Δr for a pair of ELP chains as a function of temperature. The positive w(Δr) indicates more probable distances peptides and the negative w(Δr) indicates less probable distances between peptides. Temporal evolution of b) nonbonded interaction energy within the 27-polypeptide cluster, c) interaction energy between 27 polypeptides and water, d) total number of interchain peptide–peptide H-bonds in the system, and e) average number of intrachain H-bonds per peptide chain. a) Reprinted with permission.113 Copyright 2023, American Chemical Society. b–e) Reprinted with permission.117 Copyright 2021, American Chemical Society. |
Longer ELP sequences have also been successfully simulated by Yingling et al. in all-atom simulations to investigate the structural and aggregation behavior of ELP coacervates.111,117–119 The sequences [VGPVG]18 and [VPGVG]18 were simulated in explicit water at 350 K to capture hysteresis effects caused by differences in amino acid patterning.117 As shown in Fig. 1b and c, in the initial stages, peptide–peptide interactions intensify while peptide–water interactions diminish, leading to conformational changes, followed by aggregation of the peptides without formation of a defined hydrophobic core. While hydrogen bonds were indicated as the primary driving force in these interactions (Fig. 1d and e), factors such as peptide surface hydrophobicity, torsion angles, and secondary structural motif propensities were also shown to contribute to both the LCST and the observed hysteresis.117,119 By comparing the thermal behavior of [VGPVG]18 and [VPGVG]18, deeper insights into LCST phenomena were obtained. The consistency of the simulation results with experimental data further validated the simulation findings, bolstering confidence in the ability to understand LCST behavior based on peptide sequence and structure.
All-atom simulations were also utilized in other reports to study the relationship between LCST and the degree of polymerization in ELPs, spanning from three to over 30 repeat units in single-chain systems, while CG simulations examined chains with up to 150 repeat units.64–66,120 These all-atom and CG simulations both demonstrated that the relationship between the inverse transition temperatures of ELPs and their lengths follows power law exponents, aligning well with experimental findings. Moreover, the all-atom simulation study suggested that the transition behavior of [VPGVG]n, can be explained by a combination of the thermal disruption of the water network surrounding the polypeptide, the peptide's solvent-accessible surface area, and the peptide's hydrophobicity.64,65 Together these studies provide valuable insights into the coacervation of ELPs, particularly highlighting the influence of ELP length, which can be readily controlled in fine-tuning the behavior of ELP-based systems.
Although all atom-level MD simulations have provided molecularly detailed representations of ELP properties, these simulations become impractical with increasing ELP chain length and numbers of chains. As a result, CG models have been introduced to simulate ELPs. Muraoka et al. explored the formation of coacervates and the internal structural and dynamic properties of ELPs using CG simulations with a Martini3 CG model.121 ELPs were modeled with a variable number n of [VPGVG]n repeat units, employing an explicit, CG solvent model.122 From simulations of the self-assembly of two distinct ELPs having molecular weights of 2.1 kDa and 8.6 kDa, a 6-fold difference in the inter-chain contact surface area was observed using differences in solvent-accessible surface area,123 as shown in Fig. 2. These findings indicate that increasing both the concentration and chain length of ELPs enhances coacervate formation. Complementing the simulations, experiments conducted at higher ELP concentrations corroborated the simulation results, confirming the observed trends in concentration and length effects. While CG models offer significant computational advantages, they also introduce approximations that can compromise structural accuracy and the detailed representation of molecular interactions. For instance, the Martini CG model does not capture secondary structures well and thus requires initial definition of the secondary structures, which could introduce artifacts in the simulations. On the molecular level, due to its CG strategies, hydrogen bonds were not included.46,70 This trade-off is critical when studying systems where these fine-grain interactions can play a pivotal role. Thus, although CG simulations provide valuable insights into the overall assembly behavior of ELPs, care must be taken in interpreting results, particularly when extrapolating to molecular-level details.
![]() | ||
Fig. 2 a) Mapping of the poly[VPGVG] chemical structure to the coarse-grained model in b), where particle types used for the Martini 3.0 mapping are indicated. Different colors correspond to different particle types. Contact surface area of (c) 2.1 kDa and (d) 8.6 kDa models. The contact surface area of 8.6 kDa sample is 5–6 times larger than the 2.1 kDa sample. The low contact surface area in (c) quantifies association and indicates the absence of an enhanced local polymer density. The increasing contact surface area in (d) reveals the formation of densely associated polymers, consistent with formation of the coacervate phase. Reprinted with permission.121 Copyright 2021, American Chemical Society. |
![]() | ||
Fig. 3 a) (I): Representative rendering of the atomistic simulation of the condensed phase of [GRGDSPYS]25. (II): Snapshot from atomistic simulations of [GRGDSPYS]25, highlighting representative examples of the wide variety of contacts driving phase separation of WT. (III): Non normalized (top) and normalized (bottom) average residue pairwise contacts (top), separated into backbone–backbone (bb–bb), backbone–side chain (bb–sc) and side chain–side chain (sc–sc). The inset show the average contacts formed by Tyr:Ser Tyr:Tyr Arg:Asp Arg:Tyr Arg:Arg Arg:Ser. b) Fluorescence microscopy image of 0.5 μm yellow–green fluorescent polystyrene beads embedded in IDP coacervates. Scale bar, 5 μm. Inset: Representative trajectory from two-dimensional particle tracking showing Brownian motion of the beads (the length of the inset box is 0.02 μm). c) State diagram showing saturation concentration, Csat, at 18 °C and viscosities for the variants. Reprinted with permission.53 Copyright 2024, Springer Nature Limited. |
To design IDP-containing materials for applications, it is critical to understand materials properties such as mechanical strength and nanostructure after phase separation. While most simulations cannot quantitatively predict such experimental properties, efforts have been made to connect measured material properties with simulation to obtain qualitative insight. In the above work, Rekhi et al. correlated the viscosity from experiment with the critical concentration identified in simulation;53 the viscosity was estimated by mean squared displacement (MSD) of fluorescent tracer beads embedded in the IDP coacervates (Fig. 3b). These studies found that viscosity decreased with the saturation concentration of the peptide sequences (Fig. 3c). Detailed analysis of the different sequences indicated that changes in side-chain interactions generally correlated with the variation of viscosities of different sequences, highlighting that while a nonlinear relationship existed between propensity for phase separation and coacervate viscosity, the independent modulation of phase behavior and material properties is feasible.
All-atom simulations using parallel tempering in the well-tempered ensemble (PT-WTE) were also applied to study short resilin-like polypeptides ((RLP-X) [GGRPSDSXGAPGGGN]3) based on the full consensus repeat from D. melanogaster, with aromatic (X = Y, F, W) and aliphatic (X = A, L, V) guest residues at the X position (Fig. 4a).37 For the aromatic guest residues, π–π and cation–π interactions were observed, with RLP-W forming these interactions twice as frequently as RLP-Y or RLP-F; RLP-Y exhibited a higher number of cation–π contacts. These observations involving aromatic residues were consistent with previous experimental studies and were believed to be the primary driving forces for phase separation.90,91,126,127 Moreover, RLP-W and RLP-Y formed twice as many hydrogen bonds involving donor/acceptor side chain atoms. As a result, RLP-W showed the highest propensity for LLPS, whereas RLP-F exhibited the lowest among the aromatic residues. In contrast, the aliphatic residues displayed overall less frequent contacts than the aromatic variants, potentially making them less likely to undergo LLPS. The detailed residue pairwise-contact maps for RLP-W, as well as the hydrogen bonds, π–π, and π–cation interactions for all the variants, are provided in Fig. 4a.
![]() | ||
Fig. 4 a) (I) Amino acid composition of the 45-residue long RLP-W peptide that was simulated using an all-atom model. (II) Total number of intramolecular van der Waals contacts formed between each residue type pair reflecting the abundance of each pair type in the RLP-W sequence. (III) Contact propensity in (II) normalized to the number of possible interaction pairs of a given type in the RLP-W sequence to obtain insights into their intrinsic interaction strengths. (IV) Number of intramolecular van der Waals contacts formed by residue type X (A, L, V, F, W, or Y) further analyzed in terms of important known modes, hydrogen bond, π–π, and cation–π interactions. 2D b) NOESY and c) COSY NMR spectra of RLP-W. Only the signals from 1 to 5 ppm (Y-axis) in the NOESY spectra represent interactions through space. 2D d) NOESY and e) COSY NMR spectra of RLP-F. Only the signals from 3 to 5 ppm (Y-axis) in the NOESY spectra represent interactions through space. Reprinted with permission.37 Copyright 2024, American Chemical Society. |
Experimental verification of the different interactions in the RLP-X constructs was confirmed experimentally by Garcia et al.37 via NMR methods, double quantum-filtered correlation spectroscopy (DQF-COSY) and nuclear Overhauser effect spectroscopy (NOESY). NOESY-unique signals (which appeared in NOESY but not in COSY), captured through-space interactions of specific amino acid residues in the RLP-X chains. As shown in Fig. 4b and c, the NOESY-unique signals of RLP-W suggested high spatial proximity of Trp–Trp and Trp–Arg side chains (aromatic Trp protons: δ 7.00–7.60 ppm, Hα tryptophan: δ 4.60 ppm H arginine δ 3.23, 1.86, 1.58, and 1.21 ppm, highlighted in blue boxes), which was consistent with simulation results (Fig. 4a). However, with phenylalanine guest residues, the most common contacts observed in NOESY-unique signals (Fig. 4d and e) only involved Phe aromatic protons (δ 7.15–7.40 ppm) and Hα and Hβ from Phe (δ 3.00, 3.15, and 4.56 ppm; highlighted in blue boxes), which corroborated the prevalence of only Phe–Phe interactions, consistent with simulation (Fig. 4a). In addition, an increased UCST transition temperature of RLP-W compared to RLP-F was consistent with the increased interaction strength for RLP-W.
In other reports, Bevan and coworkers applied MD simulations to investigate the roles of the SYGAP sequence motif and its less common variants SYSAP and TYGAP in influencing the elastomeric properties of D. melanogaster resilin containing Gly-rich repetitive motifs comprising the amino acids PSSSYGAPGGGNGGR.128 Their findings revealed that the SYGAP motif forms a bent structure with greater extensibility than the other variations, primarily due to the conformational freedom of glycine and the presence of hydrogen bonding within the motif. These structural features may significantly impact the elastic behavior of resilin and potentially other elastomeric proteins.
A more recent study by Pesce et al. has revealed the possibility of designing IDPs de novo,132 starting with naturally occurring IDP sequences, including α-synuclein (αSyn), the low complexity domain from hnRNPA1 (A1-LCD), the prion-like domain of FUS (FUS-PLD), and the R-/G-rich domain of the P granule protein LAF-1 (LAF-1-RGG). New sequences based on the naturally occurring IDP sequences were identified through a Markov chain Monte Carlo scheme to sample sequence variants. The sequences were simulated by MD simulations with the CG CALVADOS force field97 to generate structural parameters such as radius of gyration and saturation concentration. Based on the simulation, 15 variants were selected, five of which could be successfully expressed recombinantly. Characterization via differential interference contrast (DIC) microscopy confirmed the formation of coacervates for the purified variants (Fig. 5a). The saturation concentration (Fig. 5b) and radius of gyration (Fig. 5c) measured experimentally aligned well for all but one variant V5, showing the possibility of the de novo design of artificial IDPs.
![]() | ||
Fig. 5 a) DIC microscopy WT A1-LCD and the five variants showing phase separation behavior of artificial IDPs. b) Comparison of experimental and calculated values of Csat at 298 K. c) Comparison of experimental and calculated values of Rg for WT A1-LCD and V2 to V5. Reprinted under a Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC).132 Copyright 2024, the authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. |
Roberts et al. showed that the inclusion of polyalanine helical sequences (An or (KAAA)n) in sequences containing in ELP sequence repeats [VPG(Ax/V1−x)G]n caused thermal hysteresis as monitored by turbidity; differences in heating/cooling transition temperatures increased with increasing polyalanine ratio and decreasing charge.135 A CG bead-spring model was employed and showed an aggregate of ELP domains above the transition temperature. However, the exchange of alanine-rich regions with neighboring clusters entangled them into a network which prevented the re-solubilization of ELP domains upon cooling. A more detailed all-atom level MD simulation of this hysteretic phase behavior was conducted by Quiroz et al.,136 in which it was found that the hysteresis can be promoted by reversing the ELP sequence, e.g. using [VPGVG]n rather than [GVGPV]n (Fig. 6a). In the simulations, minor changes in secondary-structure propensities, primarily in the unstructured motif, were observed after sequence reversal (Fig. 6b). However, the differences in radius of gyration (ΔRg) were close to 0 after sequence reversal (indicating increased backbone rigidity and a less compact conformation of reversal sequences compared to original sequences at high temperature), and in those cases the new sequence tended to show large hysteresis experimentally (Fig. 6c).
![]() | ||
Fig. 6 a) Temperature-dependent optical turbidity to probe the phase behavior of three IDP pairs, I) (VPGVG)30 and (GVGPA)30, II) (VPAGVG)25 and (GVGAPV)25, and III) (APVGLG)40 and (VPAGLG)40. Each pair consists of two IDPPs with identical number of repeats of motif sequences that are interrelated by sequence reversal. b) Fraction of unstructured motifs residues for single IDPP chains and c) differential between values at high and low temperatures of radius of gyrations (ΔRg, black) and absolute peptide–water interaction (ΔEpw, blue) energy of 18-mer, single IDPP chains at low (290 to 310 K) and high (350 to 390 K) temperatures for three IDPP pairs in a). d) Fraction of value changes (f) in interchain interaction quantities after cooling two chain “phase separated” systems to 290 K for 25 ns. f = (end value − initial value)/(initial value). e) Snapshots from two-chain simulations for an IDPP with marked hysteresis (VGAPVG)18 (shown in Fig. 6a II) and for an IDPP with negligible hysteresis (LGAPVG)18 (shown in Fig. 6a III). a–d) Reprinted under a Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC).136 Copyright 2019, the authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. e) Reprinted with permission.111 Copyright 2014, American Chemical Society. |
The significant decrease in peptide–water interaction energy (ΔEpw) was also regarded as a predictor of hysteresis behavior of [VGPVG]18 and [VGAPVG]18, indicating that minor changes in hydrophobic hydration can be a main driving force for fine tuning IDP LCST-associated phase separation (Fig. 6c).111 In a two-chain model, the hysteresis was characterized by the increase in fractional difference of interaction energy, number of interchain hydrogen bonds, number of interchain hydrophobic contacts, and (absolute) hydrophobic interaction energy (Fig. 6d); the visualization of the differences between IDPs with strong hysteresis ([VGAPVG]18) and weak hysteresis ([LGAPV]18) is shown in Fig. 6e. These studies illustrate the ability to predict the hysteretic phase separation of IDP via MD simulation, which should be applicable to a variety of related ELP- and IDP-based materials.
In collaborative studies by the Jayaraman and Kiick laboratories on short ELPs conjugated to trimer-forming collagen-like peptides (CLPs), a series of all-atom and CG simulations were employed, complemented by experimental work. In all-atom simulations, the LCST transition temperature, Tt, of ELP oligomers with the sequence [VPGFG]6 was characterized.55 The LLPS behavior of ELP oligomers anchored to fixed points was also simulated as a model of the ELP conjugation to the triple-helix-forming CLP; the results qualitatively aligned with previous experimental observations, describing the thermoresponsive assembly of ELP–CLP, by Luo and Kiick.137 In the CG simulations of ELPs conjugated to CLPs, where the CLPs were modeled as rigid blocks, it was observed that the increased local density of ELPs made them more prone to aggregation at lower polymer–polymer attraction energies compared to when the ELPs were untethered. Furthermore, similar studies examined the influence of various guest residues (X) on the Tt in short ELPs [VPGXG]4.138 ELPs containing Trp (WWWW) were more prone to form β-turn structures at lower temperatures compared to Phe variants (FFFF). The bulkier aromatic side chain of Trp likely contributes to increased local stiffness in the Trp variants, which could be another factor for the significantly lower Tt (Tt < 278 K) compared to that observed for the Phe variants (Tt < 353 K), in addition to differences in their hydrophobicity. The simulations provided results closely matching experimental findings when the effects of local stiffness and the stronger attractive interactions among Trp residues compared to Phe residues were incorporated into the CG model. These types of CG models were rooted in polymer physics and were primarily designed to capture the key interactions between ELPs and CLPs, while minimizing the influence of other interactions. This approach allowed researchers to focus on the main driving forces behind the system's behavior. However, due to the simplified nature of the model, the results were typically limited to qualitative comparisons with experimental data.
In subsequent work, the effect of Tyr substitutions and their placement in the ELP ‘parent’ sequences [VPGFG]6 was explored and was shown to influence Tt.139 Simulations revealed that the Tyr guest residues enhanced intra- and inter-peptide hydrogen bonding, which promoted the aggregation of ELPs and lowered Tt. Moreover, placing Tyr away from the tethering point to the CLPs resulted in a lower Tt compared to placements closer to the tethering point, as a greater number of π–π stacking interactions were possible at low temperatures, owing to the increased conformational flexibility of the ELP chains further from the tethering point.
CG simulations using the Martini force field have also been applied to understand the self-assembly of block copolypeptides in which RLPs [GGRPSDSWGAPGGGN]1 are conjugated to designed coiled-coil bundle-forming peptides (BFP) that form tetrameric, α-helical, antiparallel bundles.38,39,140,141 These simulations successfully captured the UCST behavior of RLP–BFP–RLP conjugates and revealed 1D self-assembly mediated by RLP interactions at low temperatures, consistent with experimental observations as shown in Fig. 7a–c.38 Further experimental studies explored the self-assembly of hybrid polypeptides with BFP on the termini (BFP–(RLP-L)6–BFP) where Trp was replaced by Leu in the RLP sequence.39 Upon fusion with BFP peptides, the RLPs retained their thermoresponsive phase behavior, and cryogenic transmission electron microscopy (Fig. 7d) along with CG simulations (Fig. 7e and f) revealed that BFP–(RLP-L)6–BFP adopted a closed-loop dimer structure (Fig. 7f), assembling into 1D nanofibers that further formed into 2D ribbon-like or sheet-like structures. These findings suggest broader opportunities for fusing stimuli-responsive IDPs with assembled peptide domains to prepare hybrid materials that adopt properties from each domain.
![]() | ||
Fig. 7 Cryo TEM reveals that genetic fusion of RLPs with 1 repeat unit a) or 3 repeat unit b) to BFP mediates nanofibrillar assembly of coiled coil bundles. c) Rendering of Martini CG of RLP–BFP–RLP assembly with alignment of BFP bundles in the nanofibrils (RLP indicated in blue and BFP indicated in orange). The length of RLP was one repeat unit in the simulations. d) CryoTEM BFP–(RLP-L)6–BFP constructs assembled into micrometer-sized individual nanofibrils, scale bar = 500 nm. e) and f) Two CG BFP–(RLP-L)6–BFP conjugates model that self-assemble into e) chain nanostructures or f) closed loop configuration. Left is the Martini CG representation of each self-assembled nanostructure, and right is a snapshot of the aggregation morphologies achieved from the simulations. a–c) Reprinted with permission.38 Copyright 2023 Wiley-VCH GmbH. d–f) Reprinted with permission.39 Copyright 2024, American Chemical Society. |
However, the precise prediction of the behavior of IDP systems, such as transition temperature, saturation concentration, and mechanical properties, still requires more investigation. A variety of factors limit the detailed quantitative comparison of simulation and experiment regarding IDP properties. These include the necessarily large systems (micrometer-size in experiment vs. nanometer-size in simulation) and long timescales (second-scale in experiment vs. nanosecond-scale in simulation) associated with liquid–liquid phase separation, the quantitative alignment of experimental and simulation conditions, appropriate parameterization of important intermolecular interactions (including accurately modeling hydrogen bonding, hydrophobic effects, electrostatics, and solvent-mediated interactions), and the subtlety of methods associated with simulating systems comprising multiple phases and their transitions. Appropriate parameterization needs to consider residue-specific interaction potentials, temperature-dependent effects, solvation dynamics, and the influence of ionic strength; new parameters could also include detailed representations of post-translational modifications, crowding effects, and explicit solvent molecules.
Most all-atom level simulations focus on single-chain systems or small oligomers due to the complexity of the model, and even with a simplified CG model, simulation sizes are still limited to nanometer dimensions while IDP protein-rich phases in experiments are micrometer-size coacervates. While CG approaches can potentially simulate the phase separation processes on a larger scale, they have to date been only widely applied in explaining general features of IDP assembly, mainly due to the lack of accuracy resulting from the necessarily simplified residue interactions in the CG models. Optimized force fields address the residue interactions in IDPs more accurately, but developing and implementing these in CG modeling still requires further investigation in the context of larger-scale simulations. Specific improvements include the incorporation of more accurate potentials that reflect hydrogen bonds, accurately capture secondary structures, and represent solvation energetics more effectively. Enhancing the resolution of side-chain representations and introducing sequence-specific interaction rules derived from experimental or high-resolution all-atom data may significantly improve CG model fidelity. Additionally, hybrid modeling approaches combining atomistic and coarse-grained representations could be explored to better capture critical interactions in larger-scale systems.
Data-based approaches such as machine learning have been extensively used in the simulation and prediction of polymeric materials properties including polymer swelling,142 polymer-surface interactions,143 and assembled membrane properties.144 Similar data-derived methods will provide interesting opportunities for the simulation and prediction of IDP properties as training datasets become more available and more extensive for this class of polypeptides. Classification of diverse IDP sequences, recognition of key parameters that influence IDP properties, and significant additional contributions to the current IDP datasets will be required for successful implementation of data-based simulation of IDPs. In summary, continued synergistic improvement in the coordinated application of simulation and experiment will be critical for to advance the prediction of materials properties and the successful de-novo design of artificial IDPs.
Footnote |
† These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2025 |