Open Access Article
Francesco
Porcelli
a,
Anna Rita
Casavola
a,
Alessandro
Grottesi
b,
Donatella
Schiumarini
a and
Lorenzo
Avaldi
a
aCNR-Istituto di Struttura della Materia, Area della Ricerca di Roma 1, CP 10 Monterotondo Scalo, Italy. E-mail: Francesco.Porcelli@mlib.ism.cnr.it
bCineca, Via dei Tizii, 6, Rome, Italy
First published on 15th December 2023
Argonaute (Ago) proteins mediate target recognition guiding miRNA to bind complementary mRNA primarily in the seed region. However, additional pairing can occur beyond the seed, forming a supplementary duplex that can contribute to the guide–target affinity. In order to shed light on the connection, between protein–RNA interactions and miRNA–mRNA seed and supplementary duplex mobility, we carried out molecular dynamics simulations at the microsecond time-scale using a different approach compared to the ones normally used. Until now, theoretical investigations with classical MD on Ago–RNA complexes have been focused primarily on pure water solvent, which mimics the natural environment of biological molecules. Here, we explored the conformational space of a human Ago2 (hAgo2) bound to the seed + supplementary miRNA–mRNA duplex, using the solvent environment as a molecular probe. MD simulations have been performed in a mixture of water/MeOH at a molar ratio of 70
:
30 as well as in pure water for comparison. Our findings revealed that the mixed solvent promotes protein RNA association, principally enhancing salt–linkages between basic amino acid side-chains and acidic phosphates of the sugar–phosphate backbone. The primary effect registered was the restriction of supplementary duplex flexibility and the stabilization of the miRNA 3′ terminus. Interestingly, we observed that the influence of the solvent appears to have almost no impact on the conformation of the seed duplex.
:
30, corresponding to a volume ratio of 50%.20 It is widely understood that polar organic solvents promote protein precipitation and/or refolding.21–23 For the oligonucleotide counterpart, a solvent with low polarity, can promote base pair opening24 mainly due to the repulsion of the negative charge on a sugar phosphate backbone. In general, for protein complexes, computational studies such as molecular docking and molecular dynamics simulations performed in different solvents, can provide information related to specific site interactions between a protein and small ligand as well as protein and protein, particularly in the field of drug design25–27 or to detect cryptic binding sites in proteins.28–30Considering protein RNA/DNA complexes, the role of amino-acids and nucleobases related to the strength of protein–oligonucleotide interactions, was also investigated using molecular dynamics simulations in methanol.31 Our study provides new insights into the relationship between seed and supplementary duplex mobility. Modifying the solvent environment from water to water/MeOH, the complex experiences a conformational rearrangement produced by stronger Ago and RNA interactions. Interestingly this effect appears to be strongly domain dependent.
![]() | ||
| Fig. 1 Crystallographic structure of the Human Argonaute2-miR-122 complex (PDB:6N4O), the protein domains are colored as given in the scheme at the bottom left. Guide miRNA is red colored, while target mRNA is black colored. | ||
:
30. The topology was built using Gromacs 2021.535 with the amber99sb-ildn force-field.36 The system was then inserted at the center of a dodecahedron box leaving a distance of 1.0 nm from the box wall to avoid interaction with periodic replica. The TIP3P model37 was employed for the implicit water solvent while the topology of methanol molecules was assigned using a generalized amber force field (GAFF).38 The number of methanol molecules to be added to reproduce the desired water/MeOH ratio was calculated as follows:![]() | (1) |
and nwater are the number of water molecules in the box containing only water and water/MeOH respectively. VVdWwater and VVdWmeoh are the Van-der-Waals volumes of water and methanol and nMeOH is the number of methanol molecules in the box containing the water/MeOH solution. By eqn (1), the desired number of methanol molecules in the simulation box, have been randomly inserted using gmx insert-molecules command, at a distance larger than the sum of van der Waals radii of the solute and solvent atoms.30 The solvated box has a total of 21
950 and 10
985 molecules of water and methanol respectively, corresponding to a water/MeOH molar ratio of 67% Taking VVdWwater and VVdWMeOH equal to 19.51 and 36.75 Å3 respectively, the mixed solvent box has a water/MeOH volume ratio of ∼50%. After solvation a total of 8 Na+ ions were added to set both systems electrically neutral. The systems of minimum energy were then obtained using steepest descent algorithms. Afterwards, both systems were equilibrated with a simulated annealing of 250 ps where the temperature was gradually increased from 50 to 300 K, followed by 50 ns at 300 K in a canonical ensemble (NVT). Production run consisted of two replicas of 1 μs in the isothermal–isobaric ensemble (NPT) at P = 1 atm, and T = 300 K for a total simulation time of 2 μs in water and water/MeOH. Long range electrostatic interactions were treated using Particle Mesh Ewald (PME) while a 0.9 nm cut-off was employed for short range electrostatic and van der Waals interactions. LINCS algorithm39 was used for the restraining of hydrogen atoms involved in covalent bonding.
Starting from the mean-centred trajectory matrix X, with dimensions nframes × 3N, (with N equal to the number of atoms), the related covariance matrix C = XXT is diagonalized as follows:
| C = WΛWT | (2) |
Here Λ represents the diagonal eigenvalues, and W is the matrix associated with the corresponding eigenvectors. Furthermore, the trajectory matrix X is projected along orthogonal directions (principal components) having the largest eigenvalues (representing the largest fluctuations):
| TL = XWL | (3) |
![]() | (4) |
In eqn (4)kb is the Boltzman constant, T is the temperature of the system, and Ni and Nmax are the number of trajectory points in the i-th bin and in the most populated bin respectively. PCA was performed on protein Cα atoms using the Gromacs tools gmx covar and gmx anaeig. FEL was constructed with the command gmx sham. Cumulative modes (eigenvectors) along PC1and PC2 were represented using the pymol modevectors.py script.
Clustering analysis was performed separately on RNA and protein dynamics to retrieve the representative conformations of nucleotide and protein fragments, in the two solvent environments employed. Analysis was carried out using gmx cluster with a RMSD cut-off of 0.2 nm on all atoms of RNA duplex and hAgo2.
To recover general information about base association, the RNA per-residue contact maps were calculated using gmx mdmat. Salt–bridge interactions were evaluated using gmx contact and MD analysis python tool.42 The former was used to calculate the salt–bridge occupancy displayed as red–gray–blue colored surface on O1P and O2P atoms. The latter was employed to recover residue-by-residue interaction. Base pair parameters were determined using amber-cpptraj package.43
![]() | ||
| Fig. 2 Cα-RMSD for trajectories sampled in water (a) and water/MeOH (b). Mean RMSD and standard deviation of the two independent replicas are showed inside the legend box. | ||
PC1 vs. PC2 free energy landscape (FEL), obtained from MD simulation in water solvent (Fig. 3(a and b)), shows the presence of adjacent local minima separated by a small potential barrier (<8 kJ mol−1). In contrast, the dynamic performed in water/MeOH (Fig. 3(c and d)) exhibits three distinct minima, separated by a high free energy barrier. In the representation of the reduced coordinate system, the magnitude of conformational subspace, spanned by trajectories sampled in the two solvents, is almost the same, as witnessed by the extent of PC1 and PC2 axes. However, the free energy surface computed in water solvent, suggests that protein experiences more rapid conformational rearrangements compared to water/MeOH solution. In water/MeOH, protein seems to possess a limited number of energetically favorable conformations, which do not communicate to each other. Functional domains of Ago are organized in structural scaffold forming two opposite lobes PAZ-N and MID-PIWI44 which undergo a conformational change to accommodate miRNA–mRNA duplex.45 The modes represented along PC1(Fig. 4) calculated from water system, indicate a conformational rearrangement, where the highly mobile PAZ domain19 (Fig. S1, the ESI†) moves in the direction of the MID-PIWI lobe. Along PC2, the collective coordinate motion provides evidence that PAZ and MID increase their distance. Overall, the eigenvectors analysis carried-out on water trajectories, can be contextualized within the biological behavior of human AGO2 bounded to the seed + supplementary duplex. Indeed, to locate duplex with supplementary base pairs, the gate formed by PAZ-L2 and MID-PIWI domain must create sufficient space to accommodate the base-pairs beyond the seed.8,11 Along PC1, collective modes computed in the water/MeOH system revealed a clockwise motion of the PAZ domain which moves toward MID and L2, while the N domain moves in the direction of PAZ with a closure of the PAZ-N pocket. As in water, along PC2, eigenvector analysis indicates the departure of PAZ from the MID domain.
![]() | ||
| Fig. 4 Cumulative modes from first (PC1) and second (PC2) eigenvector computed in water and water/MeOH. The colored arrows indicate the displacement of Cα atoms. | ||
The extent of PAZ motion with respect to MID, measured as distribution of the center of mass distance is reported in Fig. 5(a). Overall, this suggests a much closer contact between PAZ and MID in water/MeOH solvent compared to water system as also supported by the superposition of the middle cluster structures in Fig. 5(b).
From this observation, it is also reasonable to assume a stronger protein–RNA association promoted by the presence of methanol in the solution, compared to pure water solvent. Ago2 protein interacts with RNA mainly through hydrogen bonds and non-base specific salt–bridges.8,46–48 The latter occur via electrostatic attraction between negatively charged phosphates of RNA backbone and the positively charged side chains of LYS and ARG amino acids. Trajectories collected in water/MeOH, registered an increasing number of protein–RNA contacts as highlighted by the comparison of the numbers of hydrogen bonds (Fig. S2(a), ESI†) and salt–bridge interactions (Fig. S2(b), ESI†) in the two simulated systems. The observed differences can be interpreted on the basis of the nature of salt–bridge interactions, which include both electrostatic and hydrogen bond contributions.
Based on pure water and methanol relative dielectric constant at 25 °C (78.4 and 32.70, respectively), we can assume that the mixed solvent employed in our simulation has a dielectric constant lower than that of water alone49 thereby favoring protein RNA interactions via salt bridges and hydrogen bonds. Considering the dynamics of RNA, excluding the base pair contact in seed and supplementary duplex, inter residue distances between miRNA and mRNA in water (Fig. 6(a)) reveal local interactions occurring in the central gate region. At the same time, nucleotides gA12 and tU5, which are staggered in the starting crystallographic structure (Fig. S3(a), ESI†), adopt a paired conformation in water solution (Fig. 6(d)). In contrast, mean smallest distances calculated from trajectories sampled in water/MeOH, do not exhibit interactions in the central loop within a cut-off of 4 Å (Fig. 6(b)). As can be inferred from the middle structure displayed in Fig. 6(c), the water/MeOH environment induces the central duplex to adopt an open conformation preventing interactions between miRNA and mRNA. Interestingly, in the seed region, contact maps do not show significant variations on the basis of the solution composition, suggesting almost no correlation with supplementary duplex dynamics. Taking the base pair occupancy as a global parameter for duplex mobility, the data in Fig. 7(a) indicate a quasi-complete retention of base association in seed duplex despite the solvent used. On the other hand, supplementary duplex has experienced some conformational rearrangements due to the variation of solvent composition as witnessed by the values of base pair occupancy in Fig. 7(b). Trajectories collected in water solvent, showed an additional base pairing, involving A12 residue of guide miRNA and U5 nucleotide of target mRNA, for at least 80% of the simulation time. Again, in water, the sampled trajectories showed a base pair dissociation involving the gG16-tC1 pair, although nucleotides maintain mutual interaction as indicated by the middle cluster structure provided in Fig. S4 (ESI†). In contrast, in water/MeOH, the duplex conformation retains almost completely the initial base pairing in the supplementary region with an occupancy close to 100%. Although a full complementary exists in the middle region of the RNA duplex (g9–11 and t6–8), the base pairing in this region is hampered because steric clashes in the central gate involving L2 and PIWI loop.8 In order to allow a full guide–target pairing, the central gate has to open destabilizing the tertiary complex.10 As reported in ref. 50 and 6, a high complementarity between miRNA and its target mRNA with stable base pairing in the middle and 3′ regions leads to the unbinding and releasing of the RNA duplex from Ago protein. To estimate the flexibility of base pairing in RNA duplex, we calculated the mean oscillation of the buckle angles across the sampled trajectories, considering only paired nucleotides in the seed and supplementary regions as representative of duplex flexibility. As clearly shown in Fig. 8, all paired bases are not coplanar, displaying a typical behavior of a duplex in a bent conformation.51,52 In the seed duplex, the mean buckle angle values, as well as their average oscillations, are minimally affected by the solvent employed. In contrast, in supplementary duplex, water trajectories exhibit buckle angles values shifted by about ±10–15° from their water/MeOH counterparts. In addition, the water trend shows a quasi-coplanarity of gA13-tU4, gU14-tA3 gG15-tC2 with buckle angles ranging between 0 and 5°. The helix parameters slide and roll have been calculated providing information about translational and rotational motion of stacked bases along their long axes.53 Data in Fig. 9(a) and (c) show a broad distribution with mean values falling within the range of 9 to 10° for roll and between −1.2 and −1.1 Å for slide parameters, respectively. This findings strongly suggest that, in the transition from water environment to water/MeOH mixture, the seed duplex retains its A-form conformation.54 By the comparison of the roll angle distributions of supplementary duplex (Fig. 9(b)) a clear shift of about 6° is observable between water and water/MeOH system. Slide distributions in Fig. 9(d) show, instead, the presence of a second population located around 0.3 Å for water system. Thus, while the dynamics of the seed duplex does not seem to be significantly affected by the mixed solvent employed, the supplementary duplex in water appears to be more flexible than in water/MeOH.
Our results are consistent with those already reported by Gruttadauria et al.8 who stated that supplementary duplex is mobile in the Ago–RNA complex. Based on the information provided so far, we attempted to delve into the potential correlation between Ago–RNA association and duplex flexibility, exploring in more detail the extent of salt–bridge contacts in the two simulated systems. From preliminary data in Fig. S2(b) (ESI†), we explored the salt–bridge network between the RNA backbone and protein side chains. In Fig. 10(a) and (b), we compare the salt–bridge occupancy (the frequency of salt–bridge contacts within a cutoff of 4.5 Å) in the two simulated systems. Main differences are observable in the region located at the 3′ terminus of the miRNA and in the central duplex of the target mRNA. The blue surface on O1P and/or O2P atoms indicates that, in these regions, the water/methanol environment retains protein–RNA salt–bridge interactions for more than 50% of the simulated time (Fig. 10(b)). Conversely, the same analysis performed on the water system suggests a lower tendency of ARG and LYS to interact with phosphates located at the guide 3′ end and mRNA in the central duplex (Fig. 10(a)). The contact map in Fig. S5(a) (ESI†) clearly exhibits the region delimited by MID-PIWI and seed duplex as the most populated salt–linkage region. Conversely, in water/MeOH (Fig. S5(b), ESI†), the majority of salt linkages arise by the proximity of ARG and LYS residues located in PAZ domain with a backbone of the miRNA 3′ tail and mRNA in the central gate. A stronger binding affinity of target 3′ end with MID and PIWI domain is also to be noted. This is attributable to the solvent exposure of 3′ target terminal which, as the miRNA 3′ end, is more subject to the surrounding environment. By a magnification of the PAZ domain region in Fig. 10(c and d) specific residue-by-residue interactions involving LYS260-tU5, ARG277-gU19, ARG280-gU17 and ARG315-gU20 were recognized. The center-of-mass distances distribution between acidic O1P, O2P and basic NH1, NH2 of the selected couples underlines that, the water/MeOH solvent, promotes the association of guide 3′ tail with the PAZ domain (Fig. S6, ESI†). At the same time, U6 of miRNA is much more intimately linked with LYS 260 in the mixed solvent, contributing to the lower flexibility of the central duplex and preventing gA12-tU5 base association. Based on our previous observations, extrapolated from the PCA, we can therefore infer an association between the limited conformational freedom of protein observed in water/MeOH, with the extent of Ago–RNA binding due by salt–bridge interactions.
Moreover, our analysis suggests that the modification of solvent environment seems to have a negligible effect on seed duplex conformation. These findings can be connected with the regulatory properties of Ago2, which mediate target recognition primarily in the seed region.55–58 Indeed, to stabilize miRNA–mRNA interaction, Ago2 provides a screening effect on seed region reducing its exposure to solvent. This favors the miRNA–mRNA interaction preserving the target RNA from dissociation to guide miRNA.59
In summary, in physiological condition, supplementary base-pairing can undergo some conformational interconversion, as evidenced by the dynamical change of complementary base association (Fig. S7, ESI†). In the mixed solvent system, the main effect recorded has been the stronger 3′ tail-PAZ association with the consequent stabilization of paired bases in supplementary region, while preserving the conformation of seed duplex. Another consideration can be done about the dynamics of guide 3′ end. miRNA 3′ terminal, is anchored to PAZ domain via salt–bridges and by π-stacking interactions involving phenylalanine PHE294 with nitrogenous base on 3′ ending nucleotide.10 Simulations performed in water revealed a partial release of the 3′ end of guide nucleotides from the PAZ domain. This is evidenced by the broad peak centered around 13 Å in Fig. 11(a), which indicates the leak of the PHE294-gU21 interaction (Fig. 11(c)). Conversely, in water/methanol, the sharp distribution centered at 6 Å indicates the retention of the PHE294-gU21 interaction(Fig. 11(b)) for the entire collected trajectories. Experimental evidence reported in literature,60–62 have shown that 3′ end of miRNA can dynamically associate/reassociate with PAZ.
![]() | ||
| Fig. 11 (a) center of mass distance distribution between PHE294 and gU21. (b) and (c) Binding pocket of guide 3′ end with PAZ domain in bound and unbound state respectively. | ||
Wang et al.63 previously discovered that a binding pocket consisting of 14–15 base pairs promotes the releasing of guide 3′ terminal from the PAZ domain while still maintaining guide 5′ end anchored to MID. Our analysis also showed that, in a water solution, salt bridge interactions with 3′ guide terminal are less significant compared to those with seed duplex. Based on information provided by FEL and eigenvectors analyses, we investigated in more detail the structural variations at the different local minima positions. In water solvent (Fig. S8, ESI†) the partial dissociation of 3′ guide from PAZ is depicted by the structural transitions from point 1–2 to 3,4 and 5. These transitions occur with a very low free energy barrier suggesting a reversible interconversion between PAZ-3′ bounded and unbounded conformation. We can interpret these transients status on the basis of the conformational motion of PAZ which, moving in direction of MID can lead to a leak of PAZ guide 3′ interaction. Structures extracted from water/MeOH system (Fig. S9, ESI†) instead, clearly exhibit a stable guide 3′ PAZ association. Thus, the conformational motion of PAZ and guide 3′ is more correlated in water/MeOH system compared to water alone. Because the high mobility of PAZ domain, the transient unbinding of guide 3′ from PAZ can be explained by the missing of cooperative effects due by salt–linkage interactions, present instead in water/MeOH.
:
30% mol mol−1. Within the limitations of the simulated time range and force-field approximation, our findings revealed that the enhanced contact of the PAZ functional domain with the guide 3′ terminal, as registered in water/MeOH simulations, did not significantly impact the global conformation of the seed duplex. Conversely, the natural mobility of the supplementary duplex in water is restricted in water/MeOH. The enhancement of the attraction of the opposite charges, between basic ARG and LYS residues and acidic phosphates on the RNA backbone, contributed to maintain the stability of the starting base pairs in the supplementary duplex, while preserving the 3′ terminal association with the PAZ domain. We also find that the mobility of the supplementary duplex in water simulation does not affect the stability of the seed-duplex which retains almost completely base pair and helix parameters. Consistently with previous finding,8,46 our data highlight that the specificity of guide–target interaction arises principally in the seed-region. At the same time, the solvent effect on hAgo2–RNA association is attenuated moving from PAZ-N to MID-PIWI lobe. Our study offers a different approach for the theoretical examination of the Ago–RNA complexes and can contribute to the understanding of Ago-mediated guide–target interactions.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3cp05530b |
| This journal is © the Owner Societies 2024 |