Daniel Tan†
a,
Dilimulati Aierken†
ab,
Pablo L. Garcia
a and
Jerelle A. Joseph
*ab
aDepartment of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA. E-mail: jerellejoseph@princeton.edu
bOmenn-Darling Bioengineering Institute, Princeton University, Princeton, NJ 08544, USA
First published on 23rd September 2025
Biomolecular condensates, including those formed by prion-like low complexity domains (LCDs) of proteins, are maintained by networks of molecular interactions. Such collective interactions give rise to the rich array of material behaviors underlying condensate function. Previous work has uncovered distinct LCD conformations in condensates versus dilute phases, and recently, single-component LCD condensates have been predicted to exhibit microstructures with “small-world” networks—where molecular nodes are highly clustered and connected via short pathlengths. However, a framework linking single-molecule properties, condensate microstructure, and macroscopic material properties remains elusive. Here, we combine molecular simulation and graph-theoretic analysis to reveal that small-world microstructures are inherent properties of LCD-like polymers, whose sequence features impact both molecule-scale conformations and droplet-scale material properties while maintaining a stable network structure. Using a residue-resolution coarse-grained model, we probe condensates comprising naturally-occuring LCD sequences and generalize our findings by varying composition and patterning in binary sequences of hydrophobic and polar residues. We show that non-blocky sequences, including a hydrophobic homopolymer, form condensates with small-world internal networks featuring “hubs”—molecules responsible for global connectivity—and “cliques”, molecular clusters bound by persistent short-ranged associations. Cliques localize near interfaces without a secondary phase transition, suggesting a role in mediating molecular partitioning and condensate aging by tuning interfacial material properties. Moreover, we discover that network small-worldness and droplet surface tension are consequences of sequence length and hydrophobicity. We also track single-molecule structure and dynamics inside condensates, revealing that internal heterogeneity at the single-molecule level is systematically encoded by network topology. Collectively, our work establishes multiscale structure–property relationships in LCD condensates, elucidating general organizing principles of the condensate microstructure that persist with sequence-driven changes in molecular behaviors and material properties.
Intrinsically disordered regions (IDRs) are among the key components of proteins involved in intracellular phase separation and condensate formation.34 Prion-like low-complexity domains (LCDs) are exemplary instances of IDRs in biomolecular condensation: LCD sequences contain strongly interacting “sticker” residues that drive clustering and phase separation, as well as “spacer” residues interspersed between stickers that modulate solubility and intermolecular interaction strengths.13,15,35–38 These sequence architectures enable the formation of complex networks of reversible physical crosslinks underlying condensates.9,39 Recent experimental advances and simulation approaches have begun to observe the rich internal organization and heterogeneities associated with condensate microstructures.10,12,40 Specifically, Farag et al. first leveraged lattice simulation and graphical network analysis to predict the inhomogeneous connectivity of networks underlying LCD condensates, noting a “small-world” graph structure globally connected by a small subset of highly connective “hubs”.10 These results were recently supported by experimental studies revealing the inhomogeneous, network-like internal organization of single-component LCD condensates.12 Similar network analyses to ref. 10 have been employed to probe molecular networks in two-component condensates,11 to determine the effect of temperature, length, and residue composition on networks underlying multicomponent condensates,41 and to study the effects of sequence patterning and binding site affinity on percolation and phase separation.18 Despite experimental and computational characterization, the principles that govern how single-molecule sequence and structure give rise to condensate microstructures—and how microstructures, in turn, relate to material properties—remain poorly defined.
Here, we address this gap by combining molecular dynamics simulation and graph-theoretic analysis to characterize the microstructures of LCD condensates. Importantly, we show how condensate microstructure gives rise to clustered internal organizations and interfacial properties that persist with large changes in sequence composition, single-molecule behaviors, and macroscopic material properties. We systematically study the interaction networks underlying LCD condensates using a chemically-specific residue-resolution coarse-grained model, Mpipi.42 We generalize our findings by designing binary sequences composed of tyrosine (Y) and serine (S) residues to investigate the impact of sequence composition and patterning on network topology. We consistently find that LCDs and non-blocky binary sequences, including a hydrophobic homopolymer, form condensates with small-world microstructures marked by high local clustering of molecules and short global pathlengths. We demonstrate that the small-worldness of LCD microstructures varies systematically with sequence length, while that of the phase-separating binary sequences is independent of sequence hydrophobicity. Moreover, we show that sequence length and hydrophobicity tune droplet surface tension while preserving the small-world characteristics of the network. We further reveal that biomolecules possess two distinct regimes of interactivity in the small-world network. One regime is marked by high global connectivity and expanded conformations (“hubs”), while the other is marked by elevated local crosslinking (“cliques”). By quantifying the spatial and temporal dynamics of the condensate microstructure, we find that cliques display confined local movements and exhibit long lifetimes compared to hubs, whose molecular identities are found to be highly transient. In agreement with previous experimental studies of mesoscale inhomogeneities in condensates, we find that these nanoscale clique clusters consistently form near interfaces without a secondary phase transition, suggesting roles in mediating selective molecular partitioning and condensate aging.31,32,40
Our work also demonstrates that the condensate microstructure is shaped by a heterogeneous ensemble of single-molecule conformations in the dense phase. Specifically, we predict power-law-like relationships between network connectivity and single-molecule conformational characteristics including radius of gyration and polymer shape anisotropy, indicating that condensate microstructure can be read out with single-molecule features and vice versa. By systematically varying sequence composition and patterning using the binary sequence model, we show that the small-world internal structure is generally not achieved by blocky sequences, which tend to form micelles instead of a distinct liquid phase. However, the relationships between molecular behavior and microstructure are conserved in all phase-separated condensates, spanning a wide range of sequence compositions. Taken together, our work establishes multiscale structure–property relationships of LCD condensates—linking sequence features to molecule-scale behavior and droplet-scale material properties—and provides a conceptual framework for decoding and engineering stable condensates with complex internal architectures robust to a wide range of sequence compositions and material behaviors.
We survey biologically relevant condensates by simulating four LCDs known to phase separate under physiological conditions, namely TIA1-LCD, FUS-LCD, hnRNPA1-LCD (“A1-LCD”) and EWSR1-LCD. LCD sequence features are shown graphically in Fig. 1a and exact sequences are given in the Methods. The critical temperatures Tc of each sequence are estimated using direct coexistence simulations and the data is fitted using the law of coexisting densities and rectilinear diameters.44,45 The corresponding phase diagrams are shown in Fig. 1d. Despite being the shortest sequence (n = 97 aa), TIA1-LCD is observed to have the highest critical temperature (Tc = 370 K) of all LCDs, while the longest sequence EWSR1-LCD (n = 280 aa) has the lowest predicted critical temperature (Tc = 327 K). Notably, TIA1-LCD has the highest fraction of π and aromatic residues (e.g., tyrosine); EWSR1-LCD has the lowest fraction of these residues and the highest fraction of neutral and glycine residues. Consistent with previous reports, we observe that strong π–π and cation–π interactions play outsized roles in driving macromolecular phase separation.36,39,46,47
All YS sequence variants are constructed with a length of n = 150 aa, similar to the average length of the chosen LCD sequences. To study the effect of sequence composition, we varied sequence hydrophobicity (fraction of tyrosine residues, fh) from fh = 0.10 to fh = 1.00 over six sequence variants, preserving the near uniform distribution of sticker residues noted for phase-separating prion-like domains:36 (S9Y)15, (S4Y)30, (SSY)50, (YS)75, (YYS)50, and Y150. Sequence features are shown graphically in Fig. 1b, and corresponding phase diagrams are shown in Fig. 1e. As expected, we predict systematically higher critical solution temperature as hydrophobicity increases. All simulated LCDs have a fraction of aromatic (“sticker”) residues fh ≈ 0.14, and the range of critical temperatures observed of LCDs falls roughly within the range of Tc measured for the YS variants with fh = 0.1 and fh = 0.2.
To probe the effect of sequence patterning, we design three additional YS variants at a hydrophobic fraction fh = 0.20, as the corresponding uniform sequence (S4Y)30 displayed the closest phase behavior to the LCDs with fh = 0.14. We then alter the blockiness of the sequences: (S12Y3)10, (S24Y6)5, and S120Y30. In addition, we generate a randomly scrambled sequence, (S120Y30)scr, with the same composition. Graphical representations of these sequences are shown in Fig. 1c along with their measured blockiness fB (see Methods), and corresponding phase diagrams are shown in Fig. 1f. The critical temperatures and phase boundaries of these patterning variants are all similar to those of LCDs except for those corresponding to (S24Y6)5 and S120Y30, the two blockiest sequences. These latter sequences form micelles instead of phase-separated condensates.
In addition to characterizing the phase behavior of the sequences, we investigate how molecular sequence affects condensate material properties by computing droplet surface tension. We measure the surface tension of each condensate via direct-coexistence simulations in the slab geometry48 at 0.9Tc, where Tc is the critical solution temperature. We find that surface tension increases with LCD sequence length (Fig. 1g). Longer chains likely enhance surface tension both through confinement and entanglement effects, and by virtue of their larger conformational entropy in the dense phase, with the stability of the dense phase continually reinforced by the enthalpy of abundant transient intermolecular interactions.49 On the other hand, condensates formed by uniformly patterned YS sequences show that higher sequence hydrophobicity is proportional to surface tension (Fig. 1h), indicating that hydrophobicity plays a strong role in modulating material properties. The effect of hydrophobicity is likely obscured in simulated LCD condensates due to the dominant effect of sequence length with respect to minor variations in LCD hydrophobicity. Compared with a similar simulation study conducted in ref. 43, which found that a minimum hydrophobic content of 60% is required for phase separation of binary associative polymers with n = 20 beads, we find that sequences with n = 150 aa reliably phase separate with a hydrophobic content above 10%. This suggests that longer polymers can afford to have lower concentrations of associative sites while still forming stable, phase-separated droplets. The LCD sequences studied here have fewer than ≈5% charged residues (cf. Fig. 1a), so we do not expect sequence charge distributions to produce electrostatic effects contributing substantially to droplet stability or surface tension. The patterning variants shown in Fig. 1i demonstrate that surface tension decreases with increased sequence blockiness, proceeding in order from (S4Y)30 to (S120Y30)scr and (S12Y3)10 (see Fig. 1c for sequence blockiness fB). However, the formation of micelle structures, shown in Fig. 1i, prevents us from properly calculating the surface tension for (S24Y6)5 and S120Y30. While the YS model enables a controlled study of sequence effects, the LCD trends suggest that the emergent material properties of the condensate are shaped by an interplay between sequence length and hydrophobicity. Together, these results point to nontrivial but interpretable relations between sequence composition, condensate phase behavior, and macroscopic material properties.
Using this procedure, we find that condensates formed by LCDs and non-blocky YS sequences, including a hydrophobic homopolymer, consistently display small-world network microstructures that span the dense phase (Fig. 2d and e). Small-world networks are formally characterized by high clustering coefficients and low average shortest pathlengths.50,51 We quantify network small-worldness using the graph-theoretic estimators σsw and ωsw, which measure the average clustering coefficient (C) and average shortest pathlength (L) between arbitrary nodes in the graph.52–55 The equations used to compute these graph parameters are described in the Methods. Values of 0 < σsw < 1 indicate that clustering is low or average shortest pathlengths are long compared to equivalent Erdős–Rényi (ER) random graphs, and σsw ≈ 1 indicates that the network is organized like an ER random graph. Characteristic small-world values σsw > 1 come from high clustering coefficients and average shortest pathlengths that are shorter than or comparable to those in ER random graphs. The second estimator ωsw is bounded between −1 and 1, where ωsw = −1 corresponds to a regular, lattice-like graph structure and ωsw = 1 corresponds to a random-graph structure. The small-world region ωsw ≈ 0 describes a graph structure that is both highly clustered—like regular lattices—and has short average path lengths, like ER random graphs.50,54 As seen in Fig. 2d–f, we define a “small-world” region −0.3 ≤ ωsw ≤ 0.3 as roughly the middle third of the range of ωsw values, which ideally captures small-world networks (ωsw ≈ 0) with a roughly 1-standard deviation margin of error. The balance of high clustering and short path lengths underlies the resilience and conduciveness of the small world network to efficient, high-fidelity transfer: most nodes are well-connected to local nodes in clustered “neighborhoods” (graph “cliques”), and these neighborhoods are globally linked through a small subset of highly connected “hub” nodes that act as highways mediating pairwise node relations through shortest paths.
Fig. 2d shows that all LCD condensates exhibit microstructures defined by small-world interaction networks with characteristic σsw > 1 and ωsw ≈ 0 values. These results are consistent with those from lattice simulations,10 which found that single-component A1-LCD condensates have interaction networks exhibiting small-world topologies. LCD simulations reveal that network small-worldness and droplet surface tension are both proportional to sequence length (Fig. 1g and 2d, respectively). Strikingly, simulations of non-blocky YS sequences show that microstructure small-worldness is consistent across an order-of-magnitude change in hydrophobic content (Fig. 2e). While the surface tension of the non-blocky YS variants scales proportionally with sequence hydrophobicity (Fig. 1h), the graph-theoretic measures of small-world microstructure are not significantly different across the range of sequence compositions studied. In fact, the FUS-LCD and all non-blocky YS variants appear to have small-world parameters within error of each other, and these sequences are the closest in length (n = 163 aa for FUS-LCD and n = 150 aa for the YS variants). This suggests that small-worldness may be an intrinsic feature of discretely sticky polymers forming fluctuating, transient intermolecular associations, provided that the polymers are long enough and sticky enough to phase separate. This opens the design space available to the cell, allowing cells to stably condense polymers with varying interaction modes (e.g., π–π, electrostatic, or hydrophobic attraction) and tune droplet material properties without grave concern for phase separation or interaction network stability.
The patterning variants shown in Fig. 2f suggest that increased sequence blockiness disrupts the small-world connectivity of the microstructure. Indeed, the blockiest sequences (S24Y6)5 and S120Y30 form micelles instead of phase separating. The effects of heterogeneous interactions are most dominant in these systems, and remarkably, these sequences show the greatest deviation from ideal small-worldness: their ωsw values lie outside the range of small-world-like ωsw ≈ 0 and indicate that these assemblies yield lattice-like interaction networks with high global clustering, likely due to packing in the hydrophobic micelle cores. While it is tempting to classify these networks as small worlds due to their σsw > 1 values, we note that an inherent weakness of both estimators σsw and ωsw is their tendency to overrepresent the importance of graph clustering,54 which emerges naturally in our systems due to the spatial embedding of the network. Notably, the shortest LCD TIA1-LCD and the block-copolymer S120Y30 appear to have ωsw values within error of each other (see Fig. 2d and f), though the reduced small-world character of TIA1-LCD is likely due to its short sequence length. As these quantities are widely accepted and used to describe small-worldness in the network literature, we continue to use them here to qualitatively compare the connective microstructures underlying a range of condensate compositions. Together, our results demonstrate how molecular sequence impacts phase separation and condensate material properties while preserving the small-world microstructure across a range of compositions.
We thus examine the spatial distributions of hub molecules and clique molecules in dense phases. The typical distributions of hub and clique molecules in the graph representation are depicted in Fig. 3a. Hubs are identified by high betweenness centralities CB, a measure of the extent to which a single node lies along shortest paths between arbitrary node pairs in the graph. Detailed calculations are shown in the Methods. Here, the top 10 hub molecules (i.e., highest betweenness centrality CB) are colored in red and the 10 largest cliques are shown in blue. Next, we quantify the spatial distribution of hubs and cliques in the condensates. As shown for simulations of FUS-LCD in Fig. 3b, mass density profiles are roughly uniform within simulated condensates, featuring well-defined interfaces. However, despite homogeneous density profiles, the distributions of hub molecules and clique molecules are distinctly heterogeneous: cliques are located closer to the interface than are hubs, which are distributed throughout the volume of the condensate. We further confirm that such distributions are not a special case for FUS-LCD but are generic features of condensates formed by LCDs and uniformly patterned YS variant sequences, as characterized in Fig. 3d. We also confirm that this effect is not due to finite-size effects by simulating an analogous system composed of 3375 copies of FUS-LCD, shown in Fig. S6. We find that the observed distribution of hubs and cliques is persistent in larger systems.
Additionally, we find that the spatial distributions of hubs and cliques appear similarly conserved across a range of sequence patterns. Between (S4Y)30, (S120Y30)scr, and (S12Y3)10, greater sequence blockiness leads to slightly more pronounced distinctions between the spatial distributions of hubs and cliques (Fig. 3d), but the general spatial delineation persists. The blockiest sequences (S24Y6)5 and S120Y30 do not phase separate and instead form micelles with core–shell architecture (Fig. 2c). Hubs and cliques identified in micelles are not meaningful organizing features, as molecular packing in the hydrophobic micelle core enables global connectivity and elevated local clustering simultaneously. Due to this, no conclusive relation between sequence blockiness and hub–clique mesoscale inhomogeneity can be determined.
Collectively, our findings reveal that the small-world microstructure of non-blocky LCD-like condensates predictably encodes mesoscale heterogeneities within dense phases, consistently organizing molecules into distinct spatial regimes of interactivity over a range of sequence compositions and patterns. Molecules central to dense phases act as hubs, contributing to global network connectivity, while molecules near the interface tend to form clique clusters defined by tight local associations. This spatial partitioning of network roles reflects a non-random, emergent organization even in single-component condensates, offering a mechanistic link between network topology and spatial patterning. Notably, our results recapitulate previous experimental observations of nanoscale molecular clustering at the interfaces of multicomponent condensates,40 suggesting that the emergence of spatial inhomogeneities from the microstructure is broadly conserved in phase-separated macromolecular assemblies. These insights emphasize the importance of treating the condensate interface not as a passive boundary but as a functionally and structurally distinct region, whose qualities strongly influence condensate material properties, biological function, and aging behavior.
YS pattern variants suggest that sequence blockiness may be weakly related to clique longevity: the blockiest phase-separating sequence (S12Y3)10 displays the clearest distributional distinction between hub and clique lifetimes, and hub and clique distributions overlap more as sequence blockiness decreases (Fig. 3e). Further, LCD condensates show that clique lifetimes are dependent on sequence length and diversity (Fig. 3e)—effects not captured in the binary YS sequences. For example, EWSR1, the longest and least hydrophobic LCD tested, exhibits both longer clique lifetimes and a broader distribution of clique lifetimes compared to other LCDs. TIA, the shortest, most hydrophobic, and least uniformly patterned LCD studied, has a broad and bimodal distribution of clique lifetimes. While the general temporal distinction between hubs and cliques in LCD and YS condensates remains conserved, these results imply that the properties of molecular clusters at interfaces, including their stability and local dynamics, may be fine-tuned with sequence patterning.
Taken together, these results suggest that the condensate microstructure obeys counterintuitive dynamics. The backbone of the small-world network consists of highly associative hub molecules who rapidly interchange roles, while the peripheral networks of local interactions between clique molecules are macroscopically more time-stable. This “decentralization” of hublike connectivity in the microstructure may represent a physical mechanism of resilience against network failure due to aberrant single-chain behavior. Moreover, experiments have shown that nanoscale molecular clustering is linked to reduced molecular diffusion in dense phases.40 The relative stability of clique clusters, along with their localization to condensate interfaces, suggests that clustering may play a role in selective molecular recruitment and dynamic confinement. These results provide a framework for understanding and interpreting condensate function and mesoscale inhomogeneity as emerging from interaction networks, which importantly remain robust to large variations in sequence composition and patterning.
We first compare single-molecule Rg against molecular betweenness centrality CB (normalized; see Methods). Recall that CB quantifies the importance of a molecule (i.e., node) based on how often it lies on the shortest paths between other molecules, such that a higher CB indicates a more central, globally influential position within the network. We find that Rg versus CB in log10–log10 space yields a positive linear relationship (Fig. 4c) for LCDs, suggesting a consistent power-law relationship:
Rg = aCkB → log10(Rg) = b + m![]() | (1) |
This power-law-like behavior is also observed in condensates formed by uniformly patterned binary YS sequences (Fig. 4d). In both cases, individual macromolecules become more expanded as their network centrality increases, indicating that condensate microstructures reliably encode inhomogeneities at the single-molecule scale. Among LCD sequences, Fig. 4c shows that longer chains such as EWSR1 exhibit larger radii of gyration, consistent with expected scaling effects. For uniformly patterned YS sequences of fixed length, Rg increases monotonically with CB with similar linear slopes across all variants (Fig. 4d). Comparing between these sequences reveals that the baseline expansion (intercept) increases with sequence hydrophobicity, resulting in a clear ordering of YS variants by both Rg and hydrophobic fraction fh. While higher hydrophobicity is often associated with chain compaction, the crowded dense phase environment likely leads to entropic favorability of intermolecular contacts over intramolecular contacts.49 In a densely packed “solvent” composed of fluctuating copies of a single molecule, greater sequence hydrophobicity enthalpically promotes intermolecular interaction. The translational entropy cost of an intermolecular interaction is lower than the conformational entropy cost of an intramolecular interaction, resulting in a preference for expanded configurations at higher hydrophobicity. This phenomenon has previously been reported for simulations of RNA condensates60 and in lattice simulations of flexible polymers with heterotypic binding motifs,49 underscoring its relevance to our findings. Together, these results suggest that the phase separation of discretely sticky polymers is stabilized by enthalpy gained from abundant intermolecular interactions and by the lower entropic cost of intermolecular contacts, compared to intramolecular contacts, in the dense phase. We demonstrate that the entropic mechanism underlying chain expansion in the dense phase persists over an order-of-magnitude change in sequence composition and droplet surface tension, with no perturbations to the condensate microstructure.
Rg profiles for YS sequences with fixed composition and varying patterning appear to corroborate this result (Fig. 4e), with all but the blockiest sequence (S120Y30) having Rg values collapsing onto the same curve. However, the most blocky sequence S120Y30 exhibits an almost inverse trend, where increased centrality CB is negatively correlated with molecular radii of gyration. This result may be attributed to the packing of hydrophobic poly-Y tails in the hydrophobic core of the micelle formed by S120Y30.
Similar to Rg, the molecular relative shape anisotropy κ2 also appears to follow power-law relationships with molecular CB for LCDs and non-blocky binary YS sequences, albeit with smaller coefficients of determination R2 (Fig. 4f and g). κ2 is a scale-invariant quantity, ranging from κ2 = 0 at the limit where polymers adopt a spherically isotropic conformation to κ2 = 1 at the limit where they are completely linear (Fig. 4b, right panel). LCD simulations reveal that greater slopes in the κ2(CB) relationship correlate with greater small-worldness in the network and greater surface tension (Fig. 1g, 2d and 4f), though this trend is not obvious for the YS sequences and likely arises from sequence complexity not captured in the YS variants (Fig. 4g and h). Previous simulation studies have reported the average relative shape anisotropy of individual polyampholytes in condensed phases,19 with dense-phase κ2 consistently between 0.42 and 0.44 invariant to changes in sequence. However, our results show that LCDs in dense phases exhibit a range of conformations that map onto their connective roles in the condensate microstructure. Compared to the ideal-chain κ2 = 0.39, molecules with low CB adopt slightly collapsed conformations (κ2 ≈ 0.28) while highly connected hublike molecules become slightly expanded (κ2 ≈ 0.48).
In summary, we demonstrate the potential to interpret complex single-molecule inhomogeneities in dense phases using principled analyses of condensate microstructures. We predict the presence of continuous quantitative relationships bridging single-molecule conformation and network centrality, whose parameters depend on sequence composition and length. Notably, we find that hublike character corresponds to maximal expansion among the conformations assumed by macromolecules in all condensates formed by LCD and non-blocky YS sequences. Simulations of YS variants show that greater sequence hydrophobicities promote chain expansion and increase droplet surface tension while preserving the small-world-like qualities of the microstructure. Collectively, our results demonstrate that LCD-like sequence properties influence both microscopic and macroscopic condensate features while leaving the dense-phase microstructure intact. Remarkably, molecular connectivity within the conserved small-world microstructure reliably encodes conformational and mesoscale inhomogeneities of the dense phase across all phase-separating sequences studied.
Similar to the conformational properties Rg and κ2, we find that normalized molecular displacement |r| follows an apparent power-law relationship with CB, exhibiting linear correlations in log10–log10 space (Fig. 5a–c). Given that the LCD sequences have similar hydrophobicities, the LCD data mainly show that molecular size strongly impacts displacement; the shortest sequence TIA1 exhibits the fastest dynamics and the longest sequence EWSR1 exhibits the slowest dynamics (Fig. 5a). YS sequences show that increased sequence hydrophobicity is associated with greater overall molecular displacement when compared at 0.9Tc (Fig. 5b). This trend suggests that, at a fixed relative distance from the critical temperature, more hydrophobic sequences experience a more favorable environment (i.e., better solvent quality) within the dense phase, promoting increased mobility and shaping the internal dynamics of the condensate. These results reveal that condensate microstructure is linked to single-molecule dynamics as well as conformational properties, supporting its explanatory power for both static and dynamic condensate phenomena.
To assess whether the observed dynamic and conformational features arise from polymer crowding in addition to sequence-specific interactions and networked microstructures, we compute the dimensionless density ϕrel61,62 of each LCD and YS condensate (see Methods). We find that LCDs and non-blocky YS sequences all yield effective packing densities in the range 1 < ϕrel < 1.2 (Fig. S19a and b), indicating that these condensates lie just beyond the overlap threshold, where crowding and entanglement may begin to influence molecular dynamics and microstructure. However, no particular ordering is observed for the packing density ϕrel with respect to sequence hydrophobicity fh or blockiness fB (Fig. S19b and c), unlike the clear ordering of trends in molecular conformations (Fig. 4c and d), single-molecule dynamics (Fig. 5a and b), and condensate surface tension (Fig. 1g and h) observed with respect to sequence features like fh. These results suggest that polymer crowding alone does not account for the observed phenomena, and that sequence length and hydrophobicity remain the primary drivers of condensate microstructure and material properties.
Experimental and simulation studies have characterized the diffusion of LCDs within condensates.63,64 More recent experimental studies have shown that the dense phases of FUS–RNA condensates contain “nanodomains”, densely interacting molecular clusters that decrease local diffusivity without a secondary phase separation.40 To study this effect and its relation to the small-world microstructure, we additionally analyze the local movements of hub and clique molecules in our systems. On short timescales, the displacements of clique molecules are consistently greater than those of hubs in both LCD and YS-variant simulations (Fig. 5d). This can be intuitively explained with the observed degree distribution of hub molecules and clique molecules within interaction networks: hub molecules associate with a greater number of partners, thus confining their motion to a greater extent (Fig. S11–S14). When combined with our previous analysis, which reveals that cliques have longer lifetimes (Fig. 3d and e), we conclude that the “faster” motion of clique molecules is best described as a form of local vibration. Indeed, when we trace the displacement of molecules in cliques and hubs, the motion of clique molecules is highly localized (see the 2D projections of molecular motion in Fig. 5f). Thus, while cliques experience relatively larger displacements than hubs, these displacements remain confined to microscale regions near the interface. Such confinement of clique motion is reminiscent of the nanodomains described in ref. 40, as well as of recent experiments reporting highly interactive hydrophobic “hotspots” in A1-LCD condensates.12 These results support the hypothesis that nanoscale clusters at interfaces might contribute to selective molecular recruitment and confinement through local modulation of material properties.
In this work, we leverage residue-resolution molecular dynamics simulations alongside graph theory to reveal that LCD-like sequences inherently form stable, small-world microstructures despite large, sequence-encoded variations in molecular conformation and dynamics as well as macroscopic material properties. We systematically characterize the microstructures of condensates formed by the low-complexity domains (LCDs) of key phase-separating proteins (hnRNPA1, FUS, EWSR1, and TIA1) using network-based approaches. To assess the generality of our findings, we also characterize condensate systems composed of binary sequences with varying fractions of hydrophobic Y residues and polar S residues. These YS variant systems represent associative heteropolymers with varying propensities for forming percolated networks within condensates.
In agreement with previous lattice-based simulations of A1-LCD,10 we find that condensates formed by biological LCDs and generic non-blocky YS sequences, including a hydrophobic homopolymer, consistently adopt microstructures well described as small-world networks. We discover that condensate microstructure and droplet surface tension are consequences of sequence length and hydrophobicity, and remarkably, we show that the graph-theoretic properties of the network microstructure are consistent over an order-of-magnitude change in sequence composition using the YS sequences of fixed length. While the microscopic and macroscopic characteristics of the droplet, including molecular conformations, dynamics, and surface tension, vary predictably with sequence composition and patterning, the small-world microstructure and the clustered organization of the dense phase persists.
Small-world networks contain two major topological features, “hubs” and “cliques”. Cliques are densely connected groups of nodes (here, molecules) representing fully connected subgraphs within the network. The cliques themselves are efficiently bridged through hub nodes, which are highly connective and reduce average shortest path lengths between arbitrary node pairs in the network. We consistently find that hubs are positioned closer to the condensate center, while the largest cliques are located near the interface. Such spatial organization of network features enables efficient transmission of internal stresses to clique clusters at the interface, potentially influencing the surface tension underlying droplet stability. Condensate microstructures, then, may be viewed as spatially embedded networks whose complex properties stably enable a rich array of sequence-encoded material behaviors.
Our work also elucidates how mesoscale inhomogeneities arise from internal network architectures. Hubs and cliques represent two distinct regimes of molecular interactivity within the microstructure: hub molecules are responsible for high global connectivity and are marked by expanded conformations, while clique clusters are nanoscale regions of elevated local associativity marked by interfacial localization and confined molecular motion. In addition to the spatial distinction observed between hubs and cliques, we find that hub molecules and clique molecules have distinct lifetimes: the molecular identities of connective hubs change rapidly, while members of cliques tend to remain in those cliques over longer timescales due to the formation of stable, fully-connected subnetworks. These results are highly counterintuitive. Network hubs are critical for network stability and often form many strong intermolecular associations, but their role is shown to be transient. Network cliques represent molecular clusters at the interface marked by maximal local connectivity, making them sensitive to minor perturbations, yet clique structures are demonstrably more time-stable. Such inverted behavior may be a crucial feature of resilience and function in LCD condensates: transient hubs allow decentralization of the backbone holding the network together, while stable clusters may modulate selective molecular recruitment and dynamic arrest by controlling the material characteristics of the interface.
At the single-molecule scale, we reconcile reports of conformational heterogeneity and connective heterogeneity in condensates by showing that the conformational properties of macromolecules are highly correlated with their connectivity within network microstructures. Interestingly, the radius of gyration (i.e., average size) and shape anisotropy (i.e., ranging from spherical to linear) of individual molecules are found to follow power-law-like relations to molecular betweenness centrality (i.e., connectivity) within interaction networks. This behavior is conserved across a range of sequence lengths, compositions, and patterns. We find that increasing sequence hydrophobicity does not alter the nature or the slope of the relationship. Instead, similar to previous reports, increased hydrophobicity leads to greater overall chain expansion through an entropic mechanism, which corresponds to increased droplet surface tension independent of the organization of the microstructure. Further, we find that sequence hydrophobicity does not appear to alter conformational anisotropy in dense phases but sequence length strongly impacts the degree of chain extension and linearization with increasing connectivity. Lastly, we note that the blockiest sequences (S24Y6)5 and S120Y30 form micelles instead of phase separating. Curiously, the block-copolymer sequence S120Y30 exhibits relationships that are inverted relative to all other sequences, and we attribute this behavior to packing and disorder in the hydrophobic micelle core and periphery, respectively (Fig. S19c). This behavior is not representative of phase-separated macromolecules in solution.
It is striking that molecule-scale physical quantities are strongly correlated with betweenness centrality and not with node degree, the intuitive measure of polymer associativity. Betweenness centrality and degree are naturally weakly correlated (Fig. S7–S10), as a node with a greater number of associations has an increased likelihood of lying along shortest paths, i.e., being globally connective. Indeed, on average, node centrality and degree vary similarly with increasing radial distance from the condensate center (Fig. S15–S18). However, high betweenness centralities do not require high degrees and vice versa. We find that the diverse conformational and dynamic characteristics assumed by single molecules in dense phases are most coherently viewed in a relationship with CB and not with degree or radial distance. We take these findings to mean that shortest-path centrality is a critical organizing principle of the condensate microstructure, and further, that inhomogeneities at the single-molecule scale are more strongly related to emergent properties of internal networking than directly to the number of intermolecular bonds or to spatial location within the dense phase. The observed ubiquity of the small-world topology also suggests that the design space available to the cell is vast:65 LCD-like sequences with varying inter-residue attraction modes can all be co-condensed, and their sequences can be tuned to produce certain material or dynamical characteristics without concern for phase separation propensity and droplet stability. Taken together, these results corroborate the notion that small-world microstructures contribute to both microscopic properties—through the relations uncovered here—and macroscopic material properties, by enabling efficient internal stress transmission.
In addition to structural properties of macromolecules, we explore their dynamics in the context of the microstructure. We find that molecules become less dynamic as their betweenness centrality increases. This result is intuitive, as more expanded molecules with larger betweenness centralities are subject to greater confinement through dense intermolecular interactions in the condensate environment. Interestingly, this relationship also shows power-law behavior. As expected, we find that smaller molecules exhibit faster motions in the dense phase. Increasing sequence hydrophobicity also leads to greater overall displacements across the range of the relationship, further illuminating how sequence features tune the relation between condensate microstructure and molecular properties. We also analyze the effective packing density of each condensate to confirm that the observed structural and dynamic features of individual molecules arise primarily from sequence-encoded interactions and not from molecular crowding in dense phases, though a detailed study of the relation between sequence features and molecular packing may be warranted.
To our surprise, molecules in interfacial cliques exhibit slightly faster motions than hub molecules. Explicitly tracing their trajectories, however, reveals that clique molecules are spatially confined. This suggests that their movement is primarily characterized by local vibrations, whereas hub molecules exhibit motion across larger regions. Recent work on multicomponent condensates has identified nanodomains within condensates marked by elevated local connectivity, interfacial localization, and reduced molecular diffusion,40 all of which appear to be consistent with the cliques we observe in LCD and YS condensates. Other recent experiments have also reported highly interactive hydrophobic nanoclusters in A1-LCD condensates;12 although ref. 12 refers to these regions as “hubs,” their nanoclusters align with the “cliques” found in our systems according to graph-theoretic principles. Alongside these studies, experiments report that pathological liquid-to-gel transitions in condensates originate at their interfaces.31,66 Additionally, the formation of interfacial aggregates resembling amyloid fibrils were observed in the early stages of FUS condensate aging.32,66 The arresting effects of interfacial aggregation can, in turn, arrest the dynamics of the entire condensate by propagating through the small-world network structure. Thus, the stability of locally constrained, highly interactive molecular clusters near condensate interfaces may be linked to both function and dysfunction, enabling selective molecular recruitment and pathological aggregation.
A key limitation of this work is that the probed condensates are composed solely of disordered protein sequences that engage in transient interactions. In contrast, cellular condensates often include both disordered and structured components that dictate their form and function.67 The latter can mediate long-lived, high-affinity interactions that shape the underlying interaction networks, as recent studies suggest.68,69 Exploring systems with specific binding interactions—such as those involving folded protein domains or RNA—would therefore be an important next step. Nonetheless, our findings show that even simple systems of disordered protein regions exhibit striking microstructural inhomogeneities that give rise to complex biophysical behaviors. We further note that the small-world measurements employed here, namely σsw and ωsw, are to be treated as estimators. These measurements are designed to reflect both the spatial structure (via high clustering) and efficient communication (via low average pathlengths) of small-world networks by generating ensembles of equivalent random-like or lattice-like graphs for comparison. While these measurements are commonly accepted and used in the network literature,52–54,56 they are not robust to measurement errors or effects such as elevated clustering arising from the spatial embedding of our networks. A more rigorous characterization of network structure with stricter boundaries defining “small-worldness” would be useful to quantitatively verify our claims, though to our knowledge, no such characterization exists for small-world networks beyond the methods employed here.
Collectively, our results elucidate the rules by which molecular features encode rich microscopic and material properties of condensates while maintaining a conserved, networked microstructure. We demonstrate that LCD-like condensates consistently adopt small-world network microstructures that persist with large variations in sequence composition and patterning, molecular characteristics, and macroscopic properties such as surface tension. Further, we analyze the spatial and temporal characteristics of distinct network features, hubs and cliques, to interpret experimentally observed mesoscale inhomogeneities in the context of the microstructure. We finally uncover quantitative relationships that govern the distribution of molecular conformations and dynamics in dense phases from molecular network connectivity. These findings reveal complex, multiscale structure–property relationships in LCD condensates that provide general principles for designing soft materials with stable internal architectures and diverse material characteristics. We anticipate that our results can be extended to other biomolecular systems with varying residue–residue interaction modes to inform the design of synthetic condensates with programmable, interpretable composition and properties.
A1-LCD | GSMAS | ASSSQ | RGRSG | SGNFG | GGRGG | GFGGN |
DNFGR | GGNFS | GRGGF | GGSRG | GGGYG | GSGDG | |
YNGFG | NDGSN | FGGGG | SYNDF | GNYNN | QSSNF | |
GPMKG | GNFGG | RSSGG | SGGGG | QYFAK | PRNQG | |
GYGGS | SSSSS | YGSGR | RF | |||
EWSR1-LCD | MASTD | YSTYS | QAAAQ | QGYSA | YTAQP | TQGYA |
QTTQA | YGQQS | YGTYG | QPTDV | SYTQA | QTTAT | |
YGQTA | YATSY | GQPPT | GYTTP | TAPQA | YSQPV | |
QGYGT | GAYDT | TTATV | TTTQA | SYAAQ | SAYGT | |
QPAYP | AYGQQ | PAATA | PTRPQ | DGNKP | TETSQ | |
PQSST | GGYNQ | PSLGY | GQSNY | SYPQV | PGSYP | |
MQPVT | APPSY | PPTSY | SSTQP | TSYDQ | SSYSQ | |
QNTYG | QPSSY | GQQSS | YGQQS | SYGQQ | PPTSY | |
PPQTG | SYSQA | PSQYS | QQSSS | YGQQS | SFRQD | |
HPSSM | GVYGQ | |||||
TIA1-LCD | MINPV | QQQNQ | IGYPQ | PYGQW | GQWYG | NAQQI |
GQYMP | NGWQV | PAYGM | YGQAW | NQQGF | NQTQS | |
SAPWM | GPNYG | VQPPQ | GQNGS | MLPNQ | PSGYR | |
VAGYE | TN | |||||
FUS-LCD | MASND | YTQQA | TQSYG | AYPTQ | PGQGY | SQQSS |
QPYGQ | QSYSG | YSQST | DTSGY | GQSSY | SSYGQ | |
SQNTG | YGTQS | TPQGY | GSTGG | YGSSQ | SSQSS | |
YGQQS | SYPGY | GQQPA | PSSTS | GSYGS | SSQSS | |
SYGQP | QSGSY | SQQPS | YGGQQ | QSYGQ | QQSYN | |
PPQGY | GQQNQ | YNS |
These LCDs are marked by a sequence distribution overrepresented in glutamine (Q), serine (S), glycine (G), and tyrosine (Y) residues. The polar uncharged residues Q, S, and G act as weakly interactive “spacers” along sequences, serving to segregate highly attractive “sticker” residues (particularly Y) uniformly along the sequence. TIA1-LCD also incorporates tryptophan (W) residues along the sequence that can enable “sticky” interactions with itself and tyrosine (Y) through π–π stacking of aromatic rings.
As for the binary YS sequences, we simulate chains with a constant length n = 150 to be close to the length of the LCD sequences described above. Furthermore, 11 sequence variants with different hydrophobic fractions are simulated and analyzed: (S4Y)30, (S120Y30)scr, (YS)75, Y150, (S12Y3)10, (S24Y6)5, (YYS)50, (S9Y)15, S120Y30, (SSY)50, and S150. Analogous to LCD architectures, each constructed variant distributes hydrophobic Y beads as evenly as possible along the sequence. These sequences represent a range of sequence compositions spanning an order-of-magnitude change in hydrophobicity. Among the sequences, S150 has the lowest hydrophobicity fh = 0.00 and does not phase separate at T > 104 K; we have thus excluded this sequence from the results. Y150 has the largest hydrophobicity fh = 1.00, and correspondingly, the highest critical temperature Tc = 945 K. In between, we cover a wide range of hydrophobicities: fh = 0.10 for (S9Y)15; fh = 0.20 for (S4Y)30, (S4Y)scr30, (S12Y3)10, (S24Y6)5, and S120Y30; fh ≈ 0.33 for (SSY)50; fh ≈ 0.67 for (YYS)50. Since fh = 0.20 is similar to the hydrophobicity of LCD sequences (fh ≈ 0.14), we vary sequence blockiness at this fixed composition, and we also scramble a sequence to further examine the effects of sequence patterning. This scrambled sequence is denoted (S120Y30)scr.
(S120Y30)scr | SSSSS | SSSSS | SSYSS | YSYYY | YSSSS | SSSSS |
YSSSS | SSSSY | SSSSY | SSSSS | SSSYS | SSSSS | |
SSSSS | SYSSY | YYSSS | SSSYS | YSYSS | SSSSS | |
YSSSY | SSYSS | SSSSS | YSSSY | YSSSS | SYYSY | |
SYSYS | SSSSS | SSSYS | SSSSS | SSSSS | YSSSS |
The blockiness fB is quantified by calculating the ratio of the number of actual Y–S and S–Y bonds Bact over the possible maximum number of Y–S and S–Y bonds Bmax,
![]() | (2) |
EMpipi = Ebond + Eelec + Epair. | (3) |
Specifically, beads are bonded via harmonic springs:
![]() | (4) |
Non-bonded interactions encompass long-ranged electrostatics, which are captured via a Coulomb term with Debye–Hückel screening,
![]() | (5) |
![]() | (6) |
All model parameters are discussed in detail in ref. 38 and 42 and are provided in our GitHub repository (see Data availability). In Mpipi, the solvent is modeled implicitly; the model was parameterized by combining bioinformatics data and atomistic potentials-of-mean force calculations. Previous work has demonstrated that Mpipi accurately captures both single-chain properties and collective phase behaviors of disordered proteins.38,42,71
First, NPT simulations are performed to accelerate the condensate formation process during the initial steps. A Berendsen barostat is used to apply an isotropic external pressure to the particles in each simulation cell, effectively overcoming the nucleation barrier and compressing the polymer chains into a condensed state. LCD simulations are compressed with a fixed isotropic pressure set to 100 atmospheres for 120000 timesteps (dt = 10 fs) with a pressure damping parameter of 10
000dt. YS sequence variants are subject to a time-varying pressure that increased from 50 atmospheres to 100 atmospheres over a period of 30
000 timesteps (dt = 10 fs) with a pressure damping parameter of 100
000dt.
The simulation cells are then relaxed to a volume corresponding to a constant mass density ρ = 0.05 g cm−3 while preserving the condensed polymers in the center of the simulation cell. Production runs are then performed in the NVT ensemble at 0.9Tc and 0.95Tc. The integration timestep is set to dt = 10 fs, and systems are simulated for 1 μs after condensate formation for equilibrium sampling. 1000 frames are sampled uniformly along equilibrium trajectories for each sequence. For each sampled frame in both simulation types, dense-phase centers of mass and single-molecule conformations are obtained using OVITO.73
![]() | (7) |
Interaction networks are studied for small-world-like topologies by finding node betweenness centralities CB and calculating the small-world coefficients σsw and ωsw.50,52,54 The betweenness centrality CB of a node i is found and normalized via the NetworkX betweenness_centrality() utility and is computed as follows:
![]() | (8) |
Graphs are generally considered to have small-world topologies if neighbors of any given node are highly connected to each other, if shortest pathlengths between any given pair of nodes are low, and if the graph is sparse.50 Both σsw and ωsw serve as estimators of the “small-worldness” of a given graph by comparing its average shortest pathlength L = 〈min〉 and its average clustering coefficient C to the same quantities Crand and Lrand found for a series of Erdős–Rényi random graphs, and Clatt and Llatt for equivalent lattice graphs:
![]() | (9) |
![]() | (10) |
It is commonly recognized that “small-worldness” in a graph corresponds to σsw > 1 and ωsw ≈ 0.52–54 Small-world network topologies are marked by high clustering and low average pathlengths, i.e., that individual “subcommunities” of nodes exist within the graph that are closely connected to each other and that particular nodes serve as highly-connected hubs bridging each subcommunity together in an efficient manner. The flow of information or impulses within these small-world networks are thus efficient with minimal loss in fidelity.
The betweenness centrality CB of a node i serves as a metric on its hub-like connectivity, measuring the number of shortest paths between any arbitrary node pair that flows through i. For each sampled frame, the ten graph nodes with the highest normalized CB are selected as “hubs,” and the ten largest maximal cliques are selected as the “subcommunities” of closely-connected nodes. A maximal clique for any node i is defined as the largest fully-connected subgraph containing i within the graph of interest; maximal cliques are found via the NetworkX find_cliques() utility and only reported if fewer than three nodes within the clique are members of an existing reported clique.
![]() | (11) |
![]() | (12) |
![]() | (13) |
Molecular displacements |Δr| are also normalized by the length of the corresponding polymer's linear chain conformation to obtain |r| for comparison. Graph analyses are then performed to relate single-molecule displacement to molecular connectivity and topological status.
![]() | (14) |
However, the formalism is built upon the assumption of a sharp interface, which breaks down as a condensate's critical temperature is approached. At that limit, the interface becomes more diffuse and harder to define, yielding incorrect surface tension estimations through the Kirkwood–Buff formalism. To overcome this difficulty, we employ a stress-profile method calculated from per-atom stresses using the virial force contribution equation.78 This derivation is possible because anisotropy of the diagonal components of the pressure tensors are found at the dense-dilute interface:
![]() | (15) |
![]() | (16) |
![]() | (17) |
![]() | (18) |
To measure the monomeric number density ρn inside each condensate, we construct a spherical volume with a radius equal to the distance between the dense-phase center of mass and the center of the located interface (see Methods Section 3.5, “Spatial organization within simulated condensates”, for details; alternatively, see Fig. 3b for a visualization of radial interface location). As in the preceding analyses of conformation and dynamics, entire 1 μs trajectories are sampled at intervals of 20 ns for both LCD and YS simulations. For each sampled frame, we center the constructed spherical volume on the dense-phase center of mass, record the number of monomeric units present within this volume, and then divide the monomer count by the sphere volume. Monomer number densities are averaged over all sampled frames for each simulation; this average ρn is returned for each LCD and YS sequence along with the associated standard error.
Based on this construction, an effective packing density ϕrel < 1 indicates that the condensate is dilute-like, i.e., that packing in the dense phase is below the threshold after which polymers begin to overlap. ϕrel ≈ 1 indicates that the number density of monomers in the dense phase is roughly equivalent to the number density of monomers within the excluded volume of a single chain, i.e., that the chains in the dense phase are beginning to overlap. ϕrel > 1 indicates that the system is in the semi-dilute or concentrated regime, where the substantial packing and overlapping of polymers strongly influences the macroscopic features of the dense phase.
The data supporting the findings in this study, as well as sample simulation input and output files, are available at the Joseph Group GitHub repository: https://github.com/josephresearch/LCD_Network.
Footnote |
† These authors contributed equally. |
This journal is © The Royal Society of Chemistry 2025 |