Open Access Article
Abhik Ghosh Moulick
a,
Sylvia Erhardt
bc,
Wolfgang Wenzel
a and
Mariana Kozlowska
*a
aInstitute of Nanotechnology, Karlsruhe Institute of Technology (KIT), Kaiserstraße 12, 76131 Karlsruhe, Germany. E-mail: mariana.kozlowska@kit.edu
bMolecular Cell Biology of Animals, Karlsruhe Institute of Technology (KIT), Kaiserstraße 12, 76131 Karlsruhe, Germany
cInstitute of Biological and Chemical Systems-Functional Molecular Systems (IBACS-FMS), Karlsruhe Institute of Technology (KIT), Kaiserstraße 12, 76131 Karlsruhe, Germany
First published on 10th June 2026
Noncanonical nucleosomes at the centromere contain the histone variant CENP-A, which plays a crucial role in chromosome segregation. CENP-A is highly regulated, including that by the centromere protein N (CENP-N). Despite its importance, the protein–protein interactions within the CENP-A nucleosome–CENP-N complex remain poorly understood at the molecular level. Here, we employ SIRAH coarse-grained molecular dynamics (MD) simulations to investigate dynamic mechanisms through which CENP-N binding to CENP-A nucleosome modulates structural changes of histones, contributing to the regulation processes. Using a set of μs-long MDs, we reveal enhanced flexibility in the N-terminal region of CENP-A and the stabilization of its RG loop in the complex with CENP-N, thus reshaping the conformational structure of the CENP-A protein. We characterize these allosteric changes and analyze their influence on histone–DNA contacts. Moreover, we demonstrate that they have rather minor effects on the overall stability of the nucleosome and its compactness. A distance-based contact map analysis further elucidates key residues mediating the interaction between CENP-A and CENP-N, while umbrella sampling simulations quantify their binding free energy, which remains challenging to measure experimentally.
In total, sixteen inner kinetochore proteins are associated with a centromeric nucleosome, and are collectively known as the constitutive centromere-associated network (CCAN).8–10 The specific binding of NCPs containing CENP-A with CCAN occurs via complex protein–protein interactions (PPIs), where CENP-N and CENP-C directly recognize the CENP-A nucleosome.11–13 While CENP-N acts as a reader and locator of CENP-A, anchoring the CCAN complex14 specifically to CENP-A-containing nucleosomes, CENP-C binds CENP-A nucleosomes for recruiting and organization of other CCAN components.15,16 Moreover, they are known to facilitate the packaging of chromatin at centromeric regions.9,17,18 CENP-N has been reported to contribute to the stacking of CENP-A-containing nucleosomes and the formation of nucleosomal arrays through contacts between its α6 helix and the DNA of neighboring nucleosomes.19 The binding mode of CENP-C is more complex and multivalent, complicating mechanistic interpretation.
Cryo-EM studies and biophysical analyses further confirmed that PPIs and protein–DNA interactions between CENP-A NCP and CENP-N are key elements underlying the formation of higher-order centromeric structures.19 Hydrogen/deuterium exchange (HX) coupled to mass spectrometry experiments additionally revealed that the N-terminal domain of CENP-N adopts a folded conformation, with the first 200 residues forming the major interface with CENP-A nucleosomes.18 Consistently, structural studies have shown that human CENP-N residues E3, T4, and E7 confer binding specificity through interactions with the L1 loop of CENP-A; particularly, its exposed RG motifs (R80-G81) are further stabilized by electrostatic contacts with nucleosomal DNA. Several positively charged residues of CENP-N (R44, K45, R11, K81, K148, R169, and R170), located proximal to the DNA backbone, likely form stabilizing interactions with the DNA phosphate groups.13
Overall, the functionality of such noncanonical nucleosomes depends on their intrinsic structural flexibility14,18 and on conformational changes stimulated by CCAN protein binding and DNA interactions.13,18,20,21 Despite these structural insights, detailed dynamical studies of how specific CENP-A–CENP-N interactions guide centromere function at the level of a single nucleosome and beyond remain elusive. Above all, quantitative data on binding energetics are still lacking from both experiments and simulations, yet such information is important for understanding whether CENP-N binding is essentially constitutive or dynamic, and how it compares in strength to other nucleosome–protein interactions. These energetic insights, along with structural dynamics, help to better understand how PPIs and protein–DNA interactions regulate nucleosome function at the centromere.
The structural changes of canonical nucleosomes, for example their loop formation,22,23 DNA breathing and unwrapping,23–26 twist defects,25 and nucleosome sliding,27 as well as some structural characteristics of histone variant nucleosomes,28,29 have been studied to date through different computational methods26,30,31 with diverse structural resolutions.32–34 Still, studies related to the interaction of histone variant nucleosomes with other biological macromolecules present in the nucleus remain obscure.
In the present study, we investigate how CENP-N specifically recognizes and modulates the dynamics of the CENP-A nucleosome using coarse-grained (CG) MD simulations. Due to the large system size of NCPs and biological complexes they may form by interacting with other proteins, gaining long-timescale conformational insights and PPIs through atomistic simulations is computationally expensive. Therefore, CG models35–39 serve as a useful alternative for exploring such systems. While polymer-based CG models38,40–43 and mesoscopic models34,44–46 are often used for nucleosome and chromatin modeling, higher resolution CG force fields (FF) like SIRAH (Southamerican Initiative for a Rapid and Accurate Hamiltonian)47,48 or MARTINI49 are capable of deciphering PPI and protein–DNA interactions with higher accuracy. Since SIRAH permits unbiased simulation of the secondary structure and long-range electrostatics, enabling reproducibility of PPI50 and protein–DNA interactions51–53 in comparison with experimental observations, we used it in our studies. Furthermore, the SIRAH FF was recently applied to understand protein–DNA interactions and dynamics of canonical nucleosomes in comparison with all-atom simulations.53,54
In short, in this study we first assess CENP-A nucleosome stability in the presence and absence of CENP-N and characterize the binding interface obtained using CG-MD in comparison with that available from experiments. Using structural analysis along with the contact-based interpretation, we analyze the allosteric modulation of the conformational landscape of the CENP-A protein by complexation with CENP-N and its impact on coupled protein–protein and protein–DNA interactions associated with nucleosome recognition. We then analyze the conformational flexibility of CENP-N in both its free and bound states, providing microscopic details of its structuring upon NCP binding. Finally, we quantify the PPI binding strength through umbrella sampling (US) simulations. With this approach, we aim to gain mechanistic insights into how CENP-N contributes to CENP-A nucleosome stability, recognition, and regulation, providing a better understanding of the functionality and structural changes of centromeric NCPs.
![]() | ||
| Fig. 1 (a) The all-atom structure of the CENP-A-containing nucleosome along with the CENP-N protein obtained in cryo-EM19 and further refined using atomistic simulations (this work). The histone core and CENP-N protein are visualized using New Cartoon and four histone pairs are marked in red, green, yellow and cyan for CENP-A, H2B, H4 and H2A, respectively, while DNA is visualized using QuickSurf and is marked in grey. The CENP-N protein is marked in salmon. (b) The CG representation of the CENP-A nucleosome with bonded CENP-N as mapped from its refined all-atom structure. The pink box highlights the L1 loop of CENP-A, which is visualized in panel (c), including the RG loop of CENP-A responsible for CENP-N binding. (d) The visualization of superhelical location (SHL) sites marked over nucleosomal DNA. Each SHL typically spans approximately 10 base pairs, ranging from SHL0 to SHL ±7. | ||
Due to adequate quality of SIRAH in describing interactions between biological macromolecules and structural dynamics of biologically relevant complexes,59–62 we selected this FF for in-depth characterization of protein binding and dynamics of the histone variant nucleosome system with CENP-A towards understanding its CENP-N-driven structural dynamics and binding energetics. Two systems were considered: (i) CENP-A nucleosome with bound CENP-N and (ii) CENP-A nucleosome without CENP-N. Atomistic structures were mapped to the respective CG representation (see Fig. 1b) using the SIRAH Tools mapping protocol, and they were later solvated using the WT4 water model in SIRAH, and ionized at a 0.15 M NaCl salt concentration. After energy minimization and equilibration, production runs were carried out for 10 μs with 20 fs time step under periodic boundary conditions, with three independent replicas for each system. Backmapping to atomistic resolution was performed using SIRAH-Backmap tools,63 followed by minimization with the ff14SB force field in AmberTools.64 The detailed description of the CG setup, equilibration protocol, simulation parameters, and backmapping procedure is given in the SI (section S1.2.).
To understand PPIs and the binding free energy between CENP-A protein in an NCP and a CENP-N CCAN protein, US simulations using SIRAH FF were performed.50,65 The computational details are explained in detail in the SI section S1.3. In short, the initial structure of the complex was taken from the final structure obtained from the atomistic simulations as discussed above. To generate initial conformations for US windows, steered MD simulation for the CENP-N pulling away from its initial position in the NCP into the bulk solvent was performed along the X-axis by applying a harmonic pulling potential with a force constant of 1000 kJ mol−1 nm−2 and a pulling rate of 0.0001 nm ps−1. To prevent drifting of the system along the reaction coordinate, a harmonic restraint with a force constant of 20 kJ mol−1 nm−2 for the whole nucleosome, except CENP-N protein, was applied. The center-of-mass (COM) distance between the CENP-A protein and the COM of the ordered region of CENP-N (residues 1–200) was defined as the reaction coordinate ξ. Sixty structures, generated during pulling simulation, were used as US windows for separate MD simulations. A harmonic bias potential with a force constant of 1000 kJ mol−1 nm−2 was applied to each window. Finally, the potential of mean force (PMF) was derived over a 50 ns trajectory of each window using the weighted histogram analysis method (WHAM),66,67 as implemented in GROMACS, to eliminate the influence of the applied bias. Statistical errors were estimated by using the default Bayesian bootstrapping algorithm built into the WHAM program.67 The binding free energy (ΔG) of proteins was calculated as the difference in free energy between the bound and unbound states (see section SI S1.3).
The errors for the quantities calculated were estimated where applicable. The error of the average was calculated using
, where s is the standard deviation of the mean values over the replicas, and n represents the number of replicas. To quantify RMSD fluctuations and assess the stability of each replica, the standard deviation of the RMSD time series was calculated.
ln(P(x,y)/Pmax), where Pmax is the probability of the most probable state, kB is Boltzmann's constant and T is the temperature. PCA and FEL computations were performed using the MDAnalysis Python library69,70 by providing the input coordinates of CG beads. The direction of the PCA modes was visualized through porcupine plots using NMWiz71 in VMD.72
![]() | (1) |
The average RMSD plot (Fig. 2a) shows that considering the histone tails in the RMSD calculation, the fluctuation is higher in the system with CENP-N bound to the CENP-A system (RMSD of 7.51 ± 0.33 Å, marked in red) as compared to the unbound CENP-A system (6.94 ± 0.10 Å, in black). It is connected to the interactions induced by the presence of CENP-N explained later. The RMSD of systems without considering the histone tails is lower, indicating higher histone core stability. In addition, the RMSD of the histone core in the presence of CENP-N is even slightly smaller, i.e., 5.12 ± 0.08 Å, while in the absence of CENP-N the average RMSD is 5.34 ± 0.08 Å. This indicates a possible stabilization of the nucleosome upon CENP-N binding. However, the observed changes should be interpreted qualitatively, since they are mostly below 0.5–1.0 Å, which is statistically too small to claim a meaningful structural change given the CG resolution. The average values of RMSD of nucleosomal DNA in the presence and absence of CENP-N are 5.45 ± 0.21 Å and 5.63 ± 0.08 Å, respectively (Fig. 2a). This suggests rather small structural changes; however, smaller movements of some base pairs that are located near the CENP-N binding site may contribute to this change. Together with a slight histone core structural stabilization, it provides evidence for possible NCP plasticity changes, which we discuss in the following section.
The average radius of gyration for both systems is shown in Fig. 2b. The presence of the CENP-N protein does not alter larger structural moves and the shape of the NCP. Time evolution of Rg is shown in Fig. S4. The final structure of the NCPs after the simulation is visualized in Fig. 2c. The upper panel depicts the CENP-A nucleosome with bound CENP-N along with the L1 loop region, while the lower panel shows the unbound CENP-A NCP. Distinct orientations are displayed for each system to convey the complete molecular architecture and assembly. The figure demonstrates that the systems considered in this study retained their structural integrity throughout the microsecond-scale CG-MD simulations.
To maintain its main function, CENP-N should primarily bind to the CENP-A protein in the histone core;11,75 however, it additionally shows the interaction with the DNA as depicted in Fig. 3b. The respective contact map between the CENP-N protein and the DNA is shown in Fig. 3d. DNA base G98 exhibits strong contacts with R169 and LEU168 of CENP-N, with average contact population of 0.99 and 0.73, respectively. The high frequency of contact formation indicates the attractive character and stability of the CENP-N binding at this interface (see Table S5). Similarly, G99 exhibits stable contacts with K81 and V82, both showing maximum contact values of 1.00, along with a weaker interaction with TRP83. The adjacent base A100 also forms strong contacts, particularly with K148 (1.00), Y147 (0.85), and again with K81 (0.97), indicating a consistent role of this region in DNA recognition. The A43 base from DNA strand-2 (DNA base number 190 in Fig. 3d) strongly interacts with R44, K45, and E46, with contact values exceeding 0.75, while C44 (DNA base number 191 in Fig. 3d) forms multiple contacts with M18, N19, and K45, further supporting weak interaction with R44. T49 (DNA base number 196 in Fig. 3d) also shows a modest contact with N19. These interactions (see values listed in Table S5) highlight key regions of CENP-N that stably associate with the DNA and retain their dynamical stability during the simulated time, suggesting their stabilizing role in mediating CENP-N binding to the noncanonical CENP-A nucleosome. The structure of the CENP-A nucleosome with and without CENP-N, taken after 10 μs-long MD simulation, is visualized in Fig. S6b.
To understand the possibility of CENP-A conformational changes upon CENP-N binding, the PCA analysis, as described in the Methods section, was performed. In Fig. 4b and c, the FEL plots along two principal components (PC1 and PC2) for CENP-A in the presence and absence of CENP-N are visualized. In both cases, we observe the distinct energy basins indicating distinct stable conformational states of CENP-A. In the absence of CENP-N, the two stable conformational states of CENP-A show a high similarity in their secondary structures and are, thus, separated by the low energy barrier (see Fig. 4c). However, the number of basins upon CENP-N binding increases (Fig. 4b), representing stronger structural changes. Such conformational changes likely reflect the specific interactions between CENP-A and CENP-N, impacting conformational states and revealing a more complex free energy landscape. This change is observed since the degrees of freedom in a molecule are coupled and not independent. Local stabilization of CENP-A due to CENP-N binding redistributes structural fluctuations globally, leading to structural changes and the modulation of their correlated motions even at sites not directly involved in binding. The identified representative structures corresponding to each energy basin for both cases are depicted in adjacent panels in Fig. 4b and c. They were identified by picking conformations at the minima of the FEL spanned by PC1 and PC2 shown in Fig. 4b and c. To better visualize the conformational changes across the different basins, the backmapped all-atom structures were used.
A detailed comparison of the CENP-A minima, depicted in Fig. 4b and c, is presented in Fig. S7. We see that the stable helical regions remain largely unchanged between structures in the absence of CENP-N (Fig. S7a), while the loop regions exhibit distinct conformations in each minimum. In the CENP-N bound case (Fig. 4b), the representative structures of CENP-A also show variations in loop conformations (see Fig. S7b); however, the stabilization of the helical region is more pronounced. It can be assessed by direct comparison of secondary-structure elements (Fig. S7c), which indicates that CENP-N binding does not induce large-scale structural rearrangements of CENP-A, but subtly stabilizes specific loop conformations and increases local helical order. Consequently, the differences observed in the PC landscapes arise from a reorganization of conformational populations among similar structural states, rather than from distinct structural transitions. The dominant principal component motions (PC1 and PC2) are visualized in the porcupine plots in Fig. 4d and e. The directionality of the PCs is altered upon CENP-N binding and the magnitude of motion is greater in the presence of CENP-N. This clearly shows enhanced fluctuations of the N-terminal tail (see the arrow lengths in the porcupine plots), reflecting its increased flexibility.
To estimate residue–residue motion correlations of CENP-A from its backbone Cα fluctuations over the full trajectories, DCCM maps were generated (see Fig. 5). We found that upon CENP-N binding (Fig. 5a), the stabilization of the RG loop coincides with enhanced long-range coupling between the RG loop and the N-terminal region, accompanied by a shift toward anticorrelated motions. In contrast, the CENP-A-only system (Fig. 5c) exhibits uncorrelated fluctuations across these regions. This observation is in line with the PCA analysis explained above and indicates that distant binding of CENP-N modulates subtle structural motions of other parts of CENP-A. Thus, the change in the PC landscape originates from different local conformational states of CENP-A sampled in its bound and unbound states. The regions corresponding to the RG loop and the N-terminal tail showing these correlation changes in the DCCM map (denoted as region X) were mapped onto the final structure of CENP-A (at t = 10 μs) as shown in Fig. 5b and d for the systems with bound and unbound CENP-N, respectively.
The backmapped all-atom structures reveal that several residues from the N-terminus of CENP-N adopt loop-like conformations in both states, despite largely belonging to helical regions. In contrast, the C-terminal tail predominantly forms loop structures in both bound and free states. Hence, it was important to calculate the secondary structure (ss) elements of the CENP-N protein in both the bound and free states across MD simulation replicas. At first, we performed the ss analysis of protein in both free and bound states using the SIRAH ss tool as explained in the Methods section. Fig. S8a shows the average ss content, along with error bars from three independent replicas, for both systems. We also calculated the time evolution of the ss content for the bound and free states of CENP-N (see Fig. S8b and c) over the last 8 μs. Both results revealed no significant changes in the ss between the two states. Considering the differing flexibility of particular regions of the protein, we identified three different regions that were studied separately in more detail. The first region was located between residues 1 and 50, which showed the strongest changes in the RMSF behavior upon binding. The second region (residues 51 to 200) showed no significant RMSF changes upon binding and the third region (residue 201 to 295) remained disordered, indicating higher RMSF in both bound and free states (Fig. 6). The secondary structures of these three different regions are given in Table 1, where helix and extended regions represent stable ss, while coil represents unstructured ss. From these data, we see that the first region shows increased contribution of stable ss from 54.07% to 57.70% (especially for helix content, which changes from 49.37% to 52.67%) upon binding as compared to the free state of the protein. In the other regions, the stable ss content versus coil does not change markedly: less than 1% of stable ss decreases or remains unchanged, with overall flexibility largely unaffected upon binding with CENP-A and nucleosomal DNA.
| Category | Res 1–50 | Res 51–200 | Res 201–295 | |||
|---|---|---|---|---|---|---|
| Bound | Free | Bound | Free | Bound | Free | |
| Errors are indicated as mean ± error. | ||||||
| Stable ss: helix | 52.67 ± 3.75 | 49.37 ± 2.37 | 30.67 ± 1.01 | 32.13 ± 1.67 | 30.60 ± 0.99 | 33.07 ± 2.20 |
| Stable ss: extended | 5.03 ± 0.88 | 4.70 ± 2.36 | 23.07 ± 1.90 | 22.30 ± 1.22 | 8.20 ± 1.56 | 5.73 ± 0.95 |
| Stable ss: helix + extended | 57.70 ± 3.85 | 54.07 ± 3.45 | 53.74 ± 2.16 | 54.43 ± 2.06 | 38.80 ± 1.84 | 38.80 ± 2.40 |
| Coil | 42.30 ± 3.96 | 45.97 ± 1.60 | 46.23 ± 1.31 | 45.57 ± 1.19 | 61.17 ± 0.67 | 61.17 ± 1.80 |
Since experimental studies, such as HX exchange,18 have reported that the N-terminal domain of CENP-N adopts a folded conformation upon interaction with CENP-A, particularly involving the first 200 residues, we examined the combined dynamics of the ordered region (residues 1–200) and the disordered region (residues 201–295) in CENP-N through RMSD calculations. At first, we calculated the RMSD of whole CENP-N in its free and bound states (see Fig. S9a). The overall RMSD was significantly higher for free CENP-N compared to that for the bound CENP-N with the average RMSD of 14.32 ± 0.44 Å and 9.98 ± 0.93 Å, respectively. At the same time, ordered residues (res 1–200) show a decrease in RMSD from 11.18 ± 0.45 Å to 5.57 ± 0.22 Å upon nucleosome binding, whereas the flexibility of the disordered region remains similar in both conditions. To further assess the compactness of CENP-N considering residues in these different regions, we calculated the corresponding Rg (see Fig. S9b). Rg is slightly smaller in the bound state of CENP-N; however, the average Rg for all three cases (entire CENP-N, the ordered region, and the disordered region) indicates less differences than the average RMSD. Thus, while region-specific RMSD analysis reveals local stabilization of the N-terminal residues upon nucleosome binding, the overall Rg remains largely similar, indicating that CENP-N retains its global size and shape while undergoing local conformational adjustments. Taken together, our findings indicate that binding of CENP-N to the NCP primarily stabilizes the N-terminal region of CENP-N, while the central and C-terminal regions remain largely unaffected. The concurrent observation of ordering within CENP-N and localized stabilization in the histone CENP-A demonstrates that the RMSF analysis of NCP captures biologically meaningful dynamical changes on both sides of the interacting system.
![]() | ||
| Fig. 7 (a) The PMF curve representing the binding free energy between the CENP-A and CENP-N proteins in the centromere NCP obtained using the umbrella sampling method. The error bars calculated using the Bayesian bootstrapping algorithm are depicted in grey. The 3D representation of the binding site and the proteins involved is depicted using CG beads, along with the nucleosomal DNA depicted in white surface representation. Different structures representing the COM reaction coordinate ξ at: (b) ξ = 5.45 nm, (c) ξ = 8.0 nm, and (d) ξ = 14.45 nm, are visualized for clarity. Bead colors represent the relative RMSF fluctuations, where red beads indicate regions of CENP-A and CENP-N with the highest fluctuations, while blue beads indicate the regions with smallest fluctuations. The RMSF coloring was generated using a user-defined color scale in VMD.72 | ||
To visualize the structural changes of CENP-A and CENP-N at three different ξ values, the calculated representative structures of the system were further inspected. To do so, the CG beads of all residues in both proteins were visualized and colored according to their relative RMSF values, calculated from the restrained trajectories at the corresponding ξ value. A more intense red color indicates significant fluctuations of the molecular entity representing the bead, while a more intense blue color represents smaller fluctuations. At ξ = 5.45 nm, corresponding to the bound state between the proteins, we observe higher fluctuations in the disordered region of CENP-N that is not involved in the direct protein–protein contact (Fig. 7b). At ξ = 8.0 nm, fluctuations at both terminals of CENP-N increase (Fig. 7c), while in the unbound state (ξ = 14.45 nm), they display much stronger fluctuations (Fig. 7d). The unbound state of CENP-N essentially represents the free state of CENP-N, where a substantial portion of the protein exhibits a disordered nature. In CENP-A, fluctuations increase in the unbound state compared to those in the bound state, although the increase is less significant compared to that in CENP-N since the protein is bound inside the histone core. The higher structural stability of both proteins upon their binding is clearly visible from this analysis and complements the dependencies observed in the unbiased MD simulations.
Starting with the binding interface reported in cryo-EM,19 CG-MD simulations revealed dynamically stable residue–residue contacts between the NCP and CENP-N. PPIs between CENP-A and CENP-N demonstrated a high specificity of the interaction interface enriched in polar, charged, and aromatic side chains, suggesting diverse binding interactions. In the RG loop of CENP-A within its L1 loop, R80 and G81 emerge as central points of contact in the simulation based on the average contact values (see Table S4). They were also suggested by the cryo-EM study by Chittori et al.13 Although this cryo-EM study13 had also identified E3 and E7 as participating in the interaction interface between CENP-N and R80 in the RG loop of CENP-A, the contact-based analysis of CG-MD data in the present work did not reveal stable or significant interactions involving these residues. This discrepancy is likely attributable to the intrinsic resolution limitations of the CG representation, which may not fully capture highly directional and residue-specific hydrogen-bonding interactions. Nevertheless, residues within the CENP-A L1 (RG) loop (T79, R80, G81, V82, and D83) consistently engage with a broader region of CENP-N. In particular, stable contacts are observed with residues centered around P144, N145, Q146, and Y147, as summarized in Table S4, suggesting that the L1 loop remains a dominant interaction hot-spot in the CG model, similar to that reported by Pentakota et al.75 Notably, residues in the vicinity of the experimentally implicated region,13 including T4 and F8 from CENP-N, also show appreciable contacts with the RG loop of CENP-A, indicating that the overall experimental interface is reproduced, even though the specific E3 and E7 mediated interactions were not captured. The comparison of experimentally reported CENP-A/CENP-N contacts versus their reproduction in CG-MD simulations is further tabulated in Table S6. In addition to the PPIs upon CENP-N binding, interactions between the nucleosomal DNA and CENP-N were also analyzed. The DNA bases, primarily belonging to SHLs from 2 to 3.5, and amino acids of CENP-N forming contacts in CG-MD are listed in Table S5. They are consistent with previous experimental studies by Chittori et al.;13 however, some experimentally reported interactions, such as those involving R170 and R11, are less prominent in the CG simulations, likely reflecting the reduced resolution and smoothing of residue-level interactions inherent to the CG representation.
To monitor any allosteric changes in the histone core upon binding of CENP-N, we focused on the conformational arrangement of the CENP-A protein in the presence and absence of CENP-N. Consistent with the RMSF analysis described earlier (Fig. 4a), the L1 loop is stabilized upon CENP-N binding, whereas the N-terminal region exhibits increased flexibility. This suggests that binding of CENP-N confers structural stabilization to the L1 loop, potentially through direct interactions. The same observation was reported by Pentakota et al.75 Furthermore, we found that upon CENP-N binding (Fig. 5a), when the RG loop is stabilized, the distant N-terminal loop shows enhanced dynamic coupling with a shift toward anticorrelated motions. This was largely absent in the absence of CENP-N (Fig. 5c), where uncorrelated motions of residues were present. Thus, this indicates the emergence of long-range dynamical coupling between the RG loop and the N-terminal region of CENP-A induced by CENP-N binding, which has not yet been captured experimentally. Therefore, the PCA explained earlier demonstrates the conformational changes of CENP-A in the PC landscape (see Fig. S7), directly indicating how distant binding of CENP-N modulates subtle structural motions of other parts of CENP-A. They may propagate further structural changes within the noncanonical NCP, impacting its biological function. For example, our contact map base analysis (Fig. S10) reveals a reduction in the average number of contacts formed by residues 13–22 of CENP-A with histone H2A when CENP-N is present (Fig. S10a and S11a). Conversely, in the absence of CENP-N, this region engages in more persistent contacts with H2A (Fig. S10c) with participating residues marked in Fig. S10b and S10d. Simultaneously, contacts with nucleosomal DNA are rather rare, with slightly higher occurrence in the presence of CENP-N (see Fig. S11). However, the behavior of the CENP-A N-terminal tail significantly differs from that of the canonical H3 tail: Khatua et al.23 demonstrated prominent interactions between the H3 N-terminal tail and DNA rather than other histones. This suggests that dynamical changes in canonical and noncanonical NCPs may differ even more than presently reported; therefore, a separate study is beneficial. To further compare the time evolution of the interactions between the CENP-A N-terminal tail with DNA and H2A histone, the time-dependent average number of contacts was calculated. It is visualized in Fig. S11. Overall, this analysis indicates that the binding of CENP-N allosterically disrupts intermolecular contacts, particularly between the CENP-A N-terminal tail and histone H2A. Such a change enhances the conformational freedom of the N-terminus.
The enhanced fluctuations of the CENP-A N-terminal tail (see also Fig. 4a) suggest that this region relies on interactions with specific binding partners for protein stabilization when bound to CENP-N. This dynamic behavior is consistent with experimental observations,79 highlighting the functional importance of the N-terminus of CENP-A for centromere functionality and kinetochore assembly. For example, the N-terminus of CENP-A was shown to contribute to the stabilization of the centromere binding protein CENP-B by direct interaction.80,81 Moreover, it acts as a recruiter of key kinetochore proteins such as CENP-C and CENP-T at both ectopic sites and endogenous centromeres in Schizosaccharomyces pombe and human cells.82,83 Thus, its enhanced fluctuations, as captured by diverse analyses in our work, lead us to hypothesize that this region adopts a more interaction-capable state that may modulate interactions essential for centromere function.
Furthermore, CENP-A's unstructured N-terminal tail bears post-translational modifications.84 Several studies have demonstrated that for instance phosphorylation of serine 7, 16 or 18 within the CENP-A N-terminal domain significantly influences the centromeric chromatin structure and function.85–87 These residues lie near the region that shows altered contact behavior in CG simulations (see residues 13 to 22 in Fig. S10), suggesting that CENP-N binding may modulate the exposure or accessibility of phosphorylation sites, thus indirectly impacting downstream chromatin remodeling or signaling events (see Fig. S10a). Therefore, the observed destabilization of local contacts (Fig. S10a) and increased flexibility of the N-terminus of CENP-A (residue 13 to 22 in Fig. 4a) upon CENP-N binding are not only structurally plausible but also biologically meaningful, potentially contributing to regulating centromere function via a modulation of post-translational modification, recruitment of kinetochore proteins, or higher-order chromatin structure formation. Future experimental studies, particularly assessing the role of CENP-A SER7 phosphorylation, along with other reported N-terminal phosphorylation sites in the presence and absence of CENP-N, could further clarify this mechanism.
Finally, to understand the molecular determinants of centromere assembly, it was essential to quantify the binding energetics of the CENP-A–CENP-N complex. Although experimentally determining the binding free energy for the noncanonical nucleosome-like system remains challenging, using the US technique we provided a quantitative estimate of the interaction strength, which is equal to −7.92 ± 0.99 kcal mol−1. Such binding free energy values reflect dynamic binding of the CENP-N protein, favoring reversible and tunable binding. It should permit the regulation of the noncanonical NCP complex by local concentrations and in a multivalent, competitive fashion rather than as an irreversible complex formation. This is directly connected to the biological function of the centromere, where a micromolar intrinsic affinity is consistent with stable but dynamic association, permitting the plasticity of the centromere required for kinetochore assembly and chromosome segregation.88 Moreover, moderate binding free energy, as obtained in the present study, suggests that nucleosomes and their higher-order structuring should remain responsive to changes in ionic conditions, the presence of other binding partners or post-translational modifications. Some of these observations were reported experimentally for canonical NCPs;89–91 micromolar range binding affinities were reported by an experimental study of the HP1 chromodomain interacting with the H3K9me2/3 system.91 This correspondence supports the conclusion that the value obtained in the present study is consistent with previously reported experimental measurements and reflects a physiologically relevant interaction strength. However, a direct comparison to noncanonical NCPs is presently not possible due to a lack of experimental data.
The multitude of analyses and dependencies reported in this work supports diverse experimental observations, thus demonstrating that the SIRAH CG force field reliably captures the long microsecond-scale dynamics of this centromeric complex. However, as with other CG approaches, this comes at the cost of reduced atomic-level detail. While the use of partial charges on individual beads allows SIRAH to approximately account for hydrogen bond-like interactions, specific directional hydrogen bonds and fine side-chain interactions are not represented explicitly. Therefore, contacts such as R80 from CENP-A with E3 and E7 from CENP-N proteins could not be captured in this study. In addition, secondary structure assignment in SIRAH is inferred from the backbone torsional angle rather than from explicit hydrogen-bond patterns, as used in DSSP-based atomistic analyses. Therefore, it should be interpreted as an approximate, geometry-based description of overall conformational trends. Last but not the least, the inherent loss of accuracy by using a CG model can also affect the faithful reproduction of native contacts at protein–protein interfaces, even when the overall contact surface areas remain within experimental bounds. Therefore, the interpretations presented in our study focus on relative changes in flexibility, contact persistence, and binding energetics rather than atomistic interaction details.
Together, our findings provide a critical mechanistic understanding in the characterization of PPIs and protein–DNA interactions, deepening our perception of microscopic processes in centromeres and offering a foundation for future studies aimed at targeted control of centromere functionality. Subsequent studies may further explore experimental measurements of binding energetics to complement and validate the computational predictions presented here.
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d5nr04720j.
| This journal is © The Royal Society of Chemistry 2026 |