Sougata
Guha
and
Mithun K.
Mitra
*
Department of Physics, Indian Institute of Technology Bombay, Mumbai 400076, India. E-mail: mithun@phy.iitb.ac.in
First published on 9th December 2022
Collapsed conformations of chromatin have been long suspected of being mediated by interactions with multivalent binding proteins, which can bring together distant sections of the chromatin fiber. In this study, we use Langevin dynamics simulation of a coarse grained chromatin polymer to show that the role of binding proteins can be more nuanced than previously suspected. In particular, for chromatin polymer in confinement, entropic forces can drive reswelling of collapsed chromatin with increasing binder concentrations, and this reswelling transition happens at physiologically relevant binder concentrations. Both the extent of collapse, and also of reswelling depends on the strength of confinement. We also study the kinetics of collapse and reswelling and show that both processes occur in similar timescales. We characterise this reswelling of chromatin in biologically relevant regimes and discuss the non-trivial role of multivalent binding proteins in mediating the spatial organisation of the genome.
One of the important motifs of this structural organisation is the formation of long-range contacts between distant chromatin segments, mediated by DNA–binding proteins. An important characteristic of many of these proteins is multivalency, with the presence of two or more DNA binding domains.10 In general, the phenomenon of multivalency is recognised to be an widespread hallmark of interactions between chromatin and binding proteins.11 For example, HP1 and PRC1 are two proteins which brings together distal chromatin segments through gene specific interactions with chromatin.12–14 While HP1 proteins are known to vary their binding valencies through oligomerization,15 PRC1 proteins are observed to simultaneously localize at least five or more distant target sites.16 The CCCTC-binding factor (CTCF) protein has also been implicated in loop formation.17–20 CTCF contains 11 zinc-finger domains, and is known to bind to multiple DNA motifs through a combinatorial clustering of these domains, with the zinc-finger domains 1–2, 4–7, and 8–11 hypothesised to function as discrete DNA binding subdomains.21 Further, proteins of the Structural Maintenance of Chromosomes (SMC) family, such as cohesin or condensin bring together distant segments of chromatin through loop extrusion.22–25 In addition, many transcription factors are known to contain multiple chromatin binding domains, and can thus aggregate multiple nucleosomes.10 These long chromatin loops, mediated by different families of DNA binding proteins, are known to play an important role in the structural organisation of the genome,26–28 as well as in regulating gene expression.29 Modifications in the structural organisation of chromatin has also been associated with various diseases.5,30
Experiments such as Hi-C and its variants have yielded a wealth of genome-wide data about long-range contacts,31–34 and various polymer models have been proposed to integrate these experimental observations into a fundamental understanding of the physical principles of genome assembly.35–38 Different variants of self-avoiding polymer models have been used to reproduce different aspects of chromosome organisation, such as the emergence of chromosome domains and compartments,39,40 Topologically Associated Domains (TADs),41,42 interactions of chromatin with the nuclear lamina,43,44 and to understand the role of looping proteins in the formation and maintenance of chromatin loops.45
Chromatin folding in presence of diffusible binding proteins have been studied extensively. A widely used class of models for describing chromatin organisation inside nucleus is known as the “Strings and Binders Switch” (SBS) model.39,46 In this model the chromatin is assumed to be a self-avoiding polymer which consists of binding sites for diffusing binder proteins. These binders can generate different stable configurations of chromatin polymer through attachments to the chromatin backbone. The chromatin polymer is found to undergo a coil-globule like transition from an open state to folded state as the binder concentration is increased.39,47,48 The contact probability exponents in these studies are consistent with experimental observations.49 On the other hand, stiffness of chromatin polymer and switching of binders between ‘on’ state and ‘off’ state play an important role in determining the structure of the clusters of chromatin formed by binders, and their kinetics.50–52 Bridging by different multivalent transcription factors has shown separate clusters to form by different transcription factors and hence generates TADs.53 Implicit bridging through direct interactions between monomers depending on their epigenetic states has also been studied.40,41,54–57 Other than forming different chromatin structure like loops, rosettes, TADs etc., distribution of epigenetic patterns on chromatin leads to coexistence or compartmentalization of different polymer domains (polymer–polymer phase separation).
A striking feature of chromatin organisation is the presence of structural and functional compartments within the nucleus.29,58,59 At the largest scales, the nuclear volume confines the space available to chromosomes. However, confinement occurs at multiple smaller scales as well. Hi-C contact maps have shown that chromatin segregates into two types of distinct compartments – A and B – which represent euchromatin and heterochromatin regions respectively.18,31 Each chromosome has also been shown to occupy its own distinct space, known as chromosome territories.49,60 Within a chromosome, as revealed by Hi-C experiments, TADs represent megabasepair long regions enriched in internal contacts which play a key role in gene regulation.40,61,62 The boundaries of these domains have been shown to be associated with CTCF proteins.20 Confinement of chromosomes has been recognised to be a potential regulator of both chromosome structure43,63 and dynamics.64 In recent years, much research has focused on dynamic phase separated condensates in the nucleus, such as transcription factories associated with RNA polymerase II,65 and heterochromatin foci.66
While the role of DNA–binding protein complexes in driving compaction of the chromosome has been studied in detail,39,47,48,50–53 these studies neglect the role of spatial confinement. For polymers in periodic boundary systems, the box size is necessarily larger than the free polymer radius of gyration Rg, to exclude the possibility of unphysical conformations due to binder interactions with spatially distant monomers across the periodic boundary, which can arrest the polymer into a stretched conformation.67 This necessity of large box size imposes computational constraints on the binder concentrations that can be accessed in periodic boundary simulations. On the contrary, for confined polymer systems, in addition to the biological relevance of confined chromatin, one can access higher concentrations of binding proteins to systematically investigate the role of multivalency on polymer conformations. Multivalent counterions are known to drive non-monotonic behaviour of polymer sizes in polyelectrolytes,68–70 and multivalent DNA–binding proteins can play a similar role within the nucleus at higher concentrations. Compartmental organisation of nuclear material enable large fluctuations in concentration of DNA–binding proteins through local accumulation, which can in turn influence chromatin configuration. In this work, we systematically investigate the effect of multivalent DNA binder proteins on conformations of confined chromatin, and show that chromatin–binder interactions can lead to rich and non-trivial behaviour of interphase chromosomes.
![]() | (1) |
Self-avoidance between monomer beads (and also between two binder beads) is implemented by a WCA interaction72 given by,
![]() | (2) |
The attractive interaction between binders and monomers is maintained by a truncated and shifted Lennard-Jones potential, consistent with existing binder models, and is given by,
![]() | (3) |
Different DNA binding proteins have been reported to have an array of binding valencies, ranging from two up to as large a number as ten.10 In line with the original SBS model,39 we choose a binder valency of six for the binder proteins. A binder–monomer bond is considered to have formed instantaneously if they are separated by a distance less than rc and the binder is attached to less than 6 monomers (see ESI† for details).
The chromatin polymer as well as the binders are assumed to be confined within a sphere of radius R (Fig. 1). Confinement is implemented as a soft repulsive wall modeled by the WCA potential, eqn (2). The separation between the wall and the particle (monomer bead or binder) is measured from the closest point of the wall to the particle. We choose confinement radii to conform with the biologically relevant chromatin density inside nucleus, estimated to be in the range 0.01–0.25 (see ESI† for details). We choose confinement radii between R = 6.5σ − 15σ consistent with these estimates.
In our simulations, the energy and length scales are scaled by ε and σ respectively. All length scales – including confinement radii and polymer sizes – are reported in units of σ throughout the manuscript. A choice of σ corresponds to a choice of the coarse graining scale, with polymer based models for chromatin having reported a range of values from nucleosome level resolution (∼200 bp)73,74 to 100 kbp level resolution.47 We choose a coarse graining scale of 40 kbp,41 which implies a bead size of σ = 70 nm (see ESI† for details). The simulated polymer is thus around ∼10 Mb, and hence the confinement can be interpreted as that at the level of chromatin A/B compartments.31,75 For simplicity, we choose the binding protein size to be the same as this coarse-grained chromatin bead size.47,48 For the CTCF protein, recent experiments estimate an abundance in human cells of the order of ∼105 per cell,76 which suggests a physiologically relevant number density in the range ∼0.04–0.3 in our simulation scale (see ESI† for details). In our simulations, we explore binder densities consistent with both this experimentally estimated density range,76 as well as previously reported theoretical studies.39,47,48,51 Unless stated otherwise, we simulated at least 5 independent runs with each run being ∼105τ (∼2 × 108 simulation steps) long where τ denotes our simulation time unit (see ESI† for details). All equilibrium properties are calculated by averaging over 1000 independent configurations.
![]() | (4) |
While we choose a polymer of size N = 256 throughout this work, we verified that the generic nature of the reswelling is not an artefact of the finite size of the polymer (Fig. S1, ESI†), nor do the results reported here depend sensitively on the exact binder valency (Fig. S2, ESI†). Further, while we assumed that binder sizes are equal to the monomer size, the qualitative results of our manuscript do not depend on this assumption (Fig. S3, ESI†). Finally, the assumption of a flexible polymer is also not crucial to this reswelling transition, which we verify to be present for a semiflexible polymer model of chromatin as well (Fig. S4, ESI†).
This reswelling transition suggests that the polymer size (Rg) can be controlled by changing the geometric confinement radius, and hence the local binder concentration, while keeping binder numbers the same. This is shown in Fig. 2e for different values of the binder number Nb. For low number of binders, when the chromatin polymer is in the semi-open state, it is expected that the polymer is distributed over the whole available volume, and hence Rg is expected to increase with increasing volume (and equivalently R). Evidently we found this to be true in presence of less number of binders (Nb = 100). As the binder number is increased (Nb ∼ 200–300), Rg tends to saturate with increasing confinement radius. This is due to the fact that the binder number is close to the critical binder number Nminb for larger values of the confinement radii and the polymer collapse into a compact state in these cases. Further increase in binder numbers results in a non-monotonic behaviour of the polymer size (Nb = 500). For strong confinement, Rg initially increases with increasing R. However, beyond a certain confinement radius, Rg starts to reduce with increasing R. This can be explained in terms of reswelling of the chromatin. For strong confinement Nb = 500 is significantly larger than Nminb corresponding to those confinement radii and thus the polymer is in reswollen state. On the other hand for large confinement radii Nb = 500 is close to Nminb for weak confinements and therefore the polymer is still in a moderately compact state resulting in lower Rg values. Thus changing the geometrical confinement radius can result in non-trivial behaviour of the chromatin size, for the same number of binding proteins. We note that while in our model, both the polymer and the binding proteins are confined, in principle, confinement of the chromatin polymer alone is sufficient to observe reswelling at physiological concentrations (Fig. S5, ESI†).
One parameter that may affect the chromatin structure strongly is the binding affinity of binders to chromatin. Fig. 2f shows the variation of Rg for range of interaction strength (εm) between binders and monomers, for three different binder numbers. As is expected, for any fixed binder number, the polymer Rg decreases with increasing εm. This is because higher εm means the bonds are stronger and long-lived and thus collapse the chromatin. However, for all values of the interaction strength, we observe a re-entrant behaviour, with the polymer size being smaller at Nb = 200 compared to the Nb = 50 and Nb = 800 cases. Thus, re-entrant behaviour of polymer size with increasing binder number is robust over all binder–monomer interaction energy scales that we investigated.
We now turn to the contact probability distributions in order to understand the coupled effects of binder concentration and confinement for the (semi-)open, compact and reswollen conformations. Generically, the contact probability scales with genomic distance as, Pc(s) ∼ 1/sα, where the exponent α in this case depends on both the binder concentration as well as confinement. We first consider the case of weak confinement, R = 15σ, shown in Fig. 4a. In the absence of binders, the contact probability exponent goes as α ∼ 2.0, indicating the polymer conformation is similar to the open random SAW polymer.77 Close to the transition point for the collapsed conformation, Nb = Nminb = 300, the contact probability exponent is α ∼ 0.75, which is in the biological range observed in experiments. On increasing the binder concentrations (Nb = 3500), the polymer reswells, which is reflected in a corresponding increase in the exponent, α ∼ 0.9, indicating a decreased probability of contact between distant monomer segments. Note that for this case of weak confinement, biological constraints on the binder concentrations preclude a full reswelling of the polymer. In order to understand the equilibrium properties of the conformations of this partially reswollen conformation, we then compare it to the conformation at a pre-collapse binder concentration, Nb = 75 < Nminb, such that the radius of gyration of these two conformations are roughly similar (Rg(Nb = 3500) ≈ Rg(Nb = 75) ≃ 5.3σ). Firstly we note that for small genomic separations (s/σ < 10), the contact probabilities for the reswollen polymer are lower than the partially collapsed polymer, which can be understood by the presence of the binder cloud surrounding monomers in the reswollen case, as discussed above. At larger separations however, the contact probability exponent in both these cases are roughly similar, indicating that reswollen conformations are statistically similar to the open counterparts. For the case of intermediate confinement, R = 9σ, again in the absence of binders, the contact probability behaves similar to a semi-open conformation (α ∼ 2.0 for Nb = 0). At the collapsed conformation, the contact probability exponent becomes much smaller, α ∼ 0.9, reflecting the increased long-range contacts in this collapsed conformation. As the polymer reswells (Nb = 800), the contact probability itself becomes lower due to the presence of a binder cloud surrounding each monomer, although the exponent α in this case is roughly similar to the compact conformation. Similar features are observed for R = 6.5σ, although the determination of the exponent in this case is complicated due to a strong confinement induced collapse even when binders are absent, and consequently, a weak binder mediated collapse and reswelling.
〈τc(R = 15)〉 ≃ 581τ 〈τc(R = 9)〉 ≃ 98τ 〈τc(R = 6.5)〉 ≃ 3.6τ |
Local binder concentrations in cells can be sensitively controlled by changing the strength of the geometric confinement (radius of the compartment). Here, we study the effect of varying confinement radius on the collapse timescales. We first characterise how mean collapse times τc change with changing confinement. For a given value of the confinement radius (R), the extent of collapse can be identified as the difference between the binder-free radius of gyration R0g and the final compaction value Rcg both of which depends explicitly on R. While R0g depends strongly on R, the variation of Rcg with confinement radius is comparatively smaller indicating a weaker dependence of Rcg on R. Therefore we use R0g − Rcg as a proxy for variation of confinement radius to determine how compaction time depends on confinement. We find that the mean collapse times scales with the extent of compaction as τc ∼ (R0g − Rcg)μ with μ = 1.45 ± 0.06 (Fig. 5d). Previous studies,78 in the absence of confinement, have reported compaction time scaling with polymer size with a similar exponent, τc ∼ N1.5. Confinement has a non-trivial effect on the scaling of polymer Rg with polymer size,64 and hence the extent of confinement R0g − Rcg provides a more robust characterisation of the compaction timescales.
We next ask whether binder mediated reswelling also occurs within comparable timescales. To check this, we initialise our system in the compact conformation, by equilibrating the chromatin polymer with Nminb binders. We then instantaneously increase the binder concentration to c = 0.2 for all values of the confinement radii R. This induces a reswelling of chromatin, and the reswelling time τr is defined similarly to τc by noting the minimum time taken to reach the equilibrium Rg and then fluctuate around this value for 200τ steps. Similar to compaction timescales, the distributions and averages are done over 1000 independent runs. The distributions of the reswelling times are again unimodal, and well-approximated by log-normal distributions (Fig. 5e–g). Note that since the concentration of binders is kept uniform, the extent of reswelling is different for different R (Ω15 = 0.2, Ω9 = 0.3, Ω6.5 = 0.7). For the case of maximal confinement, when c = 0.2 approaches the maximum possible reswelling (Ω ∼ 1), we note that the mean reswelling time 〈τr〉 ≃ 5〈τc〉. Irrespective of the extent of reswelling, we note that the mean reswelling times are comparable to the collapse timescales, suggesting that modulating local binder concentrations can lead to collapse and expansion of polymers within biological timescales.
Similarly to the collapse, we also characterise the scaling of the mean reswelling time with changing confinement radius. For a given confinement radius, we characterise the extent of reswelling by the difference between the reswelled radius of gyration Rrg and the compact radius of gyration Rcg. Therefore, similar to compaction times, in this case we use Rrg − Rcg as the proxy for variation of confinement. The reswelling time scales with the extent of reswelling as τr ∼ (Rrg − Rcg)ν with ν = 1.71 ± 0.07 (Fig. 5h). The reswelling exponent is thus slightly higher than the compaction exponent (ν > μ), and for all biologically relevant values of (R0/rg − Rcg), and equivalently of the confinement radius, the compaction and reswelling times are expected to be similar in magnitude, and hence both processes are expected to be physiologically relevant.
The coil-globule transition of chromatin in unconfined space has been well studied in the literature.39,47–52 For unconfined polymers (in periodic boundary conditions), the periodic box is necessarily large enough that accessing large binder concentrations has remained computationally challenging and hence the phenomenon of reswelling in these systems has remained unexplored. Introducing confinement of chromatin, in addition to being biologically relevant, allows us to explore these regions of high volume fractions, where a combination of entropic forces and self-avoidance drives reswelling of chromatin. We show that confinement of the chromatin polymer, and not necessarily binder proteins, allows us to access this previously unexplored region of parameter space. Reswelling of chromatin is observed when binder concentrations are large enough to dress the entire polymer backbone with a binder cloud, which screens interactions with other binders. We also note that binding proteins themselves often act in a cooperative manner,79 and assemblies of many proteins can regulate the strength of this screening by changing the effective strength of excluded volume interactions. Future experiments may provide more clarity on such screening effects on chromatin–binder conformations. Further, we note that the extent of reswelling, in unscaled units, is often quite small, especially for strong confinement. This is concomitant with the relatively small extent of collapse as well in strongly confined systems. Thus binder mediated collapse of chromatin can be weak or strong depending on the extent of confinement. Independent of this, our current work shows that the extent of reswelling is comparable to the extent of compaction, and are both modulated by the strength of confinement. Our study suggests that the reswelling of chromatin depends heavily on the confinement radius as well as presence of multivalent binding proteins, and the onset of reswelling appears at different binder density for different confinement radii even when the binder–chromatin interaction strength is unchanged.
Recent studies has also systematically investigated the collapse of the chromatin polymer via monomer–binder bridging in confinement,67 and the formation of long-lived, phase separated compartments through binder interactions.80 However the binders in these cases are mutually attractive, leading to formation of large cluster of binders and consequently no reswelling was observed. Another recent study involving implicit binders in the absence of excluded volume interactions among binders suggests that over-expression of binders can cause reswelling for the case of bivalent binders on a polymer chain where monomers can bind only a single binder.81 The absence of explicit binders, and consequently volume exclusion among binders, and the imposition of strict limits on the binder and monomer valencies limits the applications of this study to real systems. We believe that the entropic origins of the reswelling remain the same, however, multivalency of binders and volume exclusion represent additional physical forces that non-trivially affect the reswelling transition.
Our simulations show that the chromatin sizes and compaction can be sensitively controlled by changing the confinement radius. This offers a tantalising probability for the control of formation and stability of phase-separated condensates in the nucleus.59 Recent experiments have shown that CTCF is locally concentrated inside transcriptional condensates, and CTCF mediated looping is in fact an essential architectural prerequisite for the formation of such condensates.19 This is consistent with our simulations. We find that multivalent binding proteins are locally enriched within the polymer phase (with an average concentration 10–100 times higher inside the collapsed polymer phase as compared to the surrounding fluid). Fluctuations in the size of the compartment can then push local concentrations above or below the reswelling threshold, with an accompanying change in the structure of the chromatin polymer.
In summary, using a simple self-avoiding polymer model, we show that in the presence of confinement, multivalent binding proteins can either stabilise or destabilise the collapsed polymer conformation by varying the local binder density in the confined compartment. Our model has multiple simplifying assumptions – in particular, it neglects sequence heterogeneity and the partitioning of chromatin into heterochromatin and euchromatin regions, and also the role of crowding due to presence of binder proteins. Additionally, we make simplifying assumptions regarding the monomer valencies, where a monomer can bind to any number of binders, subject only to volume exclusion. Nevertheless, we believe that our coarse-grained model simulations, captures the qualitative physical mechanism underlying collapse and reswelling of the chromatin polymer and suggests a non-trivial role for binding proteins in regulating spatial organisation of chromatin.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2sm00612j |
This journal is © The Royal Society of Chemistry 2023 |