Lev
Levintov
and
Harish
Vashisth
*
Department of Chemical Engineering, University of New Hampshire, Durham 03824, New Hampshire, USA. E-mail: harish.vashisth@unh.edu
First published on 1st April 2021
Ribonucleic acid (RNA) molecules are known to undergo conformational changes in response to various environmental stimuli including temperature, pH, and ligands. In particular, viral RNA molecules are a key example of conformationally adapting molecules that have evolved to switch between many functional conformations. The transactivation response element (TAR) RNA from the type-1 human immunodeficiency virus (HIV-1) is a viral RNA molecule that is being increasingly explored as a potential therapeutic target due to its role in the viral replication process. In this work, we have studied the dynamics in TAR RNA in apo and liganded states by performing explicit-solvent molecular dynamics (MD) simulations initiated with 27 distinct structures. We determined that the TAR RNA structure is significantly stabilized on ligand binding with especially decreased fluctuations in its two helices. This rigidity is further coupled with the decreased flipping of bulge nucleotides, which were observed to flip more frequently in the absence of ligands. We found that initially-distinct structures of TAR RNA converged to similar conformations on removing ligands. We also report that conformational dynamics in unliganded TAR structures leads to the formation of binding pockets capable of accommodating ligands of various sizes.
The variability in functions of RNA is rooted in its ability to undergo conformational changes that often lead to complex three-dimensional folds and an ensemble of structures which determine the function of RNA.12,13 Conformational changes in RNA may be due to changes in physiological conditions or due to binding of ligands (proteins, small ligands, and ions).14–16 The conformational flexibility in RNA can also lead to formation of transient binding pockets that can be exploited for drug design.9,17,18 While viral genomes encode for a limited number of protein targets, often considered “undruggable”,8 conserved and structured RNA motifs in viral genomes are viable targets to discover binding pockets for small molecules.8,9
Although our understanding of RNA dynamics has increased via different experimental techniques,8,19–21 especially those of RNA structure determination, the role of dynamics of all structural motifs in RNA and their coupling to ligand binding has not been fully explored to date.22 Several experimental techniques including X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy can provide information on the dynamical properties of nucleic acids,23–25 but even these methods are often limited in probing all possible parameters that can fully describe the dynamics in RNA molecules.16,26 However, computational tools can further enhance our understanding of RNA dynamics by providing additional insights at the atomic level. Moreover, these tools can potentially assist in identifying binding pockets that can be explored in drug design.
TAR RNA (Fig. 1A) from HIV-1 is a key model system to study conformational transitions in RNA molecules due to its ability to adapt multiple states.12,25 The TAR RNA is located at the 5′ end of HIV-1 transcripts where it interacts with the viral transactivator (Tat) protein and the host cofactor cyclin T1 to promote efficient transcription of the downstream genome and is therefore considered to be an important drug target. Several previous studies have shown that TAR RNA can bind to peptide mimics,27–32 to small molecules,33–36 to proteins,37–40 and to divalent cations41 (examples are shown in Table S1, Fig. S1, ESI†). TAR RNA has been studied using NMR methods,25,27,42–51 coarse-grained MD simulations,52 electron paramagnetic resonance (EPR),53 gel mobility,54 combinations of NMR and MD methods,24,44 and combinations of NMR and structure prediction software.55 Collectively, these studies have shown that TAR RNA undergoes complex dynamics by sampling different interhelical conformations around the bulge junction, thus forming various conformational ensembles.12,43 While detailed dynamics in TAR RNA have not been characterized in the presence of all known ligands, it has been suggested that ligands can potentially induce structural transitions in TAR by stabilizing pre-existing conformers or an ensemble of states in the apo TAR RNA structure.16,49,56,57
![]() | ||
Fig. 1 Sequence and structural details of HIV-1 TAR RNA. (A) Shown is the secondary structure and a snapshot of the three-dimensional structure of HIV-1 TAR RNA (PDB code 1ANR). Various structural motifs (Bulge, Helix I, Helix II, and Loop) are uniquely colored and labeled. (B) Shown are the snapshots of the initial states of TAR RNA (cartoon representation) in unliganded simulations. The apo conformation of TAR (PDB code 1ANR) is shown at the center (black cartoon) and superimposed onto other TAR RNA initial states. The initial conformations are placed in a circle such that the RMSD of the initial state relative to the apo conformation increases counterclockwise, with the 5J2W structure having the least RMSD and the 1LVJ structure having the highest RMSD. |
Several studies have revealed that a bulge motif in TAR RNA is especially critical for its recognition by the Tat protein.27,42,58 Therefore, the bulge motif has been exploited in the design of inhibitors to disrupt the TAR/Tat interaction (Fig. S2A, ESI†).32,59 As shown in Fig. S2A (ESI†), peptide ligands mostly interact with the apical loop (orange in Fig. 1A), helix II (blue in Fig. 1A), and the bulge motif (red in Fig. 1A), while small molecules are scattered between helices I and II (cyan and blue in Fig. 1A). The ligands differ in size and the charge value, which results in an increased buried surface area (BSA) as the size of the ligand increases but the structural changes in TAR RNA are not correlated with the ligand size or BSA (Fig. S2B, ESI†).
In this work, we studied dynamics in TAR RNA by conducting long time-scale MD simulations that were initiated from 13 different initial conformations of TAR with ligands, and from 14 conformations without these ligands, 13 conformations after removing ligands and one conformation based on the experimental apo structure (Fig. 1B, Fig. S3 and Table S2, ESI†). By utilizing several different initial structures, we aimed to obtain a broader conformational mapping of TAR RNA which has not been carried out yet. Moreover, we reveal the effect of ligand binding on the dynamics in TAR RNA by comparing unliganded and liganded simulations.
![]() | ||
Fig. 2 Conformational metrics of torsional flexibility and BSA. (panel A; left) All dihedral angles are shown by an arrow and labeled on a snapshot of the polynucleotide chain. The atoms in the chain are labeled as follows: 1, P; 2, O5′; 3, C5′; 4, C4′; 5, C3′; 6, O3′; 7, C1′; and 8, N9/N1. (panel A; right, top and bottom) The normalized distributions of each RNA backbone dihedral angle (α, β, γ, δ, ε, ζ) and the glycosidic dihedral angle (χ) for unliganded (U) and liganded (L) simulations. The transparent and thicker gray lines represent expected ranges of dihedral angles based on experimentally known RNA structures. See also Fig. S4 (ESI†). (B) The histograms of mean BSA values based on liganded MD simulations (darker shades) and the initial liganded structures (lighter shades) are shown. The error bars (vertical lines marked on histograms) were computed based on each liganded simulation. The BSA histograms are organized into three groups (labeled 1, 2, and 3; marked by overbars). A red asterisk highlights a system (PDB code 1ARJ) which exhibited a partial dissociation of the ligand. See also Fig. S6 (ESI†). |
BSA = SASAR + SASAL − SASARL |
We also assessed whether the ligands remained associated with each TAR RNA structure during MD simulations. In Fig. 2B, we show the histograms of the mean values of BSA from liganded MD simulations (darker shades) along with the initial BSA values of ligands in various TAR structures (lighter shades). These BSA data are organized into three groups based on the distribution of BSA values in MD simulations in comparison to initial BSA values. We observed that most systems exhibited similar or higher ligand BSA values (e.g. group 1 systems with PDB codes 1UTS, 1UUD, 1UUI, 2KX5, 5J1O, and 5J2W) in comparison to the initial BSA due to conformational rearrangements of ligands in the binding pocket that led to a deeper burial of some ligands in the binding pocket (e.g., see Fig. S5, ESI†). However, some systems exhibited fluctuations in nucleotides which allowed ligands to conformationally rearrange in the binding pocket and partially move out of the initial pocket, as indicated by the decreased BSA values (e.g. group 2 systems with PDB codes 1LVJ, 1QD3, and 5J0M; Fig. 2B).
A larger decrease in BSA of the ligand was observed in systems organized in group 3 (e.g. PDB codes 2KDQ, 2L8H, 6D2U, and 1ARJ) indicating that the ligands in these systems exhibited increased rearrangements in the binding pocket. For example, the ligand arginine amide in one of the TAR RNA structures (PDB code 1ARJ; marked by a red asterisk in Fig. 2B) exhibited brief dissociation for about ∼18 ns before binding again in the initial binding pocket (Fig. S6, ESI†). This observation is consistent with the largest dissociation constant for this ligand27 and the smallest size of this ligand among all ligands studied (Table S1, ESI†). However, we did not observe full dissociation of any ligand during MD simulations.
![]() | ||
Fig. 3 RMSD and clustering analyses. (A) Shown are the histograms (with error bars) of mean values of RMSD for all unliganded (lighter shades) and liganded (darker shades) simulations. The RMSD histograms are organized into three groups (labeled 1, 2, and 3; marked by overbars). An orange asterisk marks a system (PDB code 1LVJ) which showed a different behavior in comparison to other systems. (B) The fraction of conformations (Fconf) from a given simulation (100![]() |
However, one system (marked by an orange asterisk in group 3; Fig. 3A) showed increased fluctuations and a higher mean RMSD in the liganded state in comparison to the unliganded state. On further probing this structure (PDB code 1LVJ), we found that the structural deviation is likely a result of ligand rearrangements in the binding pocket that conformationally altered the TAR RNA structure from an initially bent conformation to a relaxed conformation with a higher RMSD value (Fig. S7, ESI†). The fact that the presence of a ligand caused higher perturbations in the TAR RNA structure that resulted in an unexpected conformational behavior needs to be considered in designing new inhibitors that target TAR RNA.
We further report the ΔRMSF values per residue based upon unliganded and liganded simulations to understand the effect of ligand binding on the flexibility of a particular residue or a motif (Fig. S8, ESI†). In Fig. S8 (ESI†), we show the difference between the RMSF values (ΔRMSF) of the unliganded and liganded simulations where a negative value corresponds to an increased flexibility on ligand removal. We observed that in eight systems, all of the residues became more flexible when ligands were removed (e.g. systems with PDB codes 1QD3, 1UTS, 1UUI, 2KDQ, 2KX5, 5J0M, 5J1O, and 6D2U). In four systems, we observed larger flexibility in a portion of residues in the liganded simulations (e.g. systems with PDB codes 1ARJ, 1UUD, 2L8H, and 5J1O). Importantly, one of those systems (PDB code 5J1O) had only one residue (U25) with increased flexibility in the liganded state due to the flipping out of that nucleotide during the simulation. Finally, in one system (PDB code 1LVJ) all of the residues became more rigid after ligands were removed which was consistent with the aforementioned observation that the fluctuations and mean RMSD for this specific system was higher in the liganded conformation (Fig. 3A).
Due to the significance of the bulge motif in ligand recognition,27,58 we also separately computed the RMSD values for the bulge motif (Fig. S9, ESI†). Overall, the bulge motifs in unliganded and liganded systems deviated by 7.82 ± 1.83 Å and 5.6 ± 1.73 Å, respectively. In particular, we observed that the majority of systems had higher mean RMSD values in the unliganded states with the exception of the system with the PDB code 1LVJ that had a higher mean RMSD value of the bulge motif in the liganded state (Fig. S9, ESI†). This observation is also consistent with ΔRMSF data that showed increased flexibility of the bulge nucleotides in the system with the PDB code 1LVJ (Fig. S8, ESI†). The behavior of the bulge motif in this system is coupled with the ligand movement in the binding pocket and the local rearrangements of bulge nucleotides that result in the overall change in the conformation of TAR RNA (Fig. S7, ESI†). These data suggest that the TAR RNA structures and the bulge nucleotides in liganded systems deviated to a smaller extent from their initial conformations compared to in the unliganded systems.
To further assess the fluctuations and the flexibility in TAR RNA, we computed the RMSD values in the course of each simulation with respect to the average structure of the corresponding simulation (Fig. S12, ESI†). We observed that the unliganded systems diverged from their respective average structures by 3.07 ± 0.97 Å and the liganded systems by 2.31 ± 0.57 Å. The majority of the systems had a decrease in mean RMSD values and smaller magnitude of fluctuations in the liganded states compared to the unliganded states (group 1; Fig. S12, ESI†). Three systems exhibited very similar RMSD distributions in unliganded and liganded states (group 2; Fig. S12, ESI†), however, two of them still showed decreased mean RMSD values in the liganded state (PDB codes 1ARJ and 1UUD). Finally, one system (PDB code 1LVJ; Fig. S12, ESI†) showed increased fluctuations and a higher mean RMSD value in the liganded state in comparison to the unliganded state which is consistent with the observations described earlier. Overall, this metric showed that the fluctuations in the TAR RNA structures decreased in the presence of ligands.
In addition to comparing the average structures from each simulation, we performed clustering analysis to detect similarities among structures within each simulation and to understand the effect of presence of ligands on conformational variability in TAR RNA. In Fig. 3B, we present the fraction of conformations in the most populated clusters derived from each unliganded and liganded simulation along with more comprehensive details on the distributions of clusters in Fig. S13 and S14 (ESI†). We observed a larger variation in conformational clusters in unliganded simulations in comparison to liganded simulations. For example, only two unliganded systems (PDB codes 1UUD and 2KDQ) had a cluster that contained at least 75% of structures while eight liganded simulations (PDB codes 1QD3, 1UUD, 1UUI, 2KDQ, 2KX5, 5J0M, 5J2W, and 6D2U) had a cluster of this type (Fig. 3B). The only exceptions were the unliganded systems with the initial structures based on PDBs 1LVJ (orange histograms in Fig. 3B) and 2L8H (purple histograms in Fig. 3B) that contained clusters with a higher fraction of conformations in the most populated cluster than in corresponding liganded simulations. Importantly, all liganded systems with peptide ligands (PDB codes 2KDQ, 2KX5, 5J0M, 5J1O, 5J2W, and 6D2U), except for the initial structure with the PDB code 5J1O, had the most populated cluster containing the majority of conformations (Fig. 3B). This analysis further supports the observation that ligands in general rigidify the TAR RNA structure by restricting its motion within an ensemble of structures that constitute the most populated cluster.
We also performed the combined cluster analysis (CCA) to further investigate conformational clusters in TAR simulations that were initiated with distinct initial structures. In Fig. S15 (ESI†), we show Fconf for the top clusters from the datasets of unliganded and liganded simulations. The CCA of unliganded simulations revealed three clusters that contain more than 5% of the total number of configurations each and are composed of multiple systems (Fig. S15A, ESI†). The CCA of liganded simulations, that was performed at the same value of the RMSD metric for constructing clusters as for the CCA of the unliganded simulations, revealed only one cluster which contains most of the systems (Fig. S15B, ESI†). These observations show that a number of our simulations, that were initiated from distinct initial structures, have conformations that are similar to each other.
Based on the angle distributions from MD simulations, we observed that γ1 was mostly confined between −90° and 90° for the unliganded and liganded systems (Fig. 4A). Most of the liganded systems showed a decrease in the width of populated angles in comparison to the analogous systems in the unliganded form with the exception of the system with the PBD code 2L8H (depicted in purple color in Fig. 4) which exhibited a higher twisting in the Helix I with γ1 angles between 70° and 115°. The system with the PBD code 1ARJ in the liganded form showed values similar to the unliganded conformation given conformational rearrangements in the ligand and a decreased BSA (Fig. 2 and Fig. S6, ESI†). The average γ1 values for the unliganded and liganded systems were estimated to be 19 ± 40° and 20 ± 33°, respectively. Overall, the presence of ligands decreased the standard deviation in γ1 by 7°, thereby leading to narrower γ1 distributions.
We also observed that Helix II showed more flexibility compared to Helix I in both liganded and unliganded simulations. In the unliganded simulations, γ2 was mostly distributed between −90° and 105° with the exception of the structures with PDB codes 5J0M, 2KDQ, and 5J2W that spanned additional conformations between −105° and −140°, between −120° and −130°, and between −105° and −145°, respectively. The γ2 angle in the liganded systems was mostly confined between −105° and 120° but even though the width of distributions were similar, the number of states that were populated decreased. The average γ2 values for the unliganded and liganded systems were estimated to be 12 ± 51° and 11 ± 46°. Overall, the presence of ligands decreased the standard deviation in γ2 by 5°.
We observed that ϕ is mostly confined between 25° and 105° for the unliganded systems, with the exceptions of structures with the PDB codes 1QD3 and 2KDQ, which occupied states with angles between 25° and 80°, and 1UUD which occupied states between 45° and 85°. The distributions of ϕ in the liganded systems became narrower (ranging between 35° and 75°) with the exception of the structures with the PDB codes 1LVJ, 1ARJ, and 1UTS which spanned angles between 25° and 105°, 25° and 100°, and 45° and 110°, respectively. Overall, the structural bending in TAR RNA decreased in the liganded systems, except for the systems with initial states based on the PDB codes 1UUD, 1UTS, 1LVJ and 1ARJ which have distributions similar to their unliganded systems. The average ϕ for the unliganded and liganded systems was estimated to be 70 ± 14° and 60 ± 11°, respectively. While previous NMR analysis has suggested high amplitude bending and twisting motions in TAR RNA helices,24 our data further suggest that the conformations of helices in TAR RNA are altered and stabilized on ligand binding.
![]() | ||
Fig. 5 Conformational transitions in bulge nucleotides. (A) A snapshot of TAR RNA nucleotides (stick representation) used in defining the flipping angle (θ) for U23 (blue sticks and marked by an asterisk) and the traces of θ vs. simulation time (t) are shown for four systems in which a conformational transition was observed either in the unliganded state (U; lighter shade) or in the liganded state (L; darker shade) or in both. The initial value of θ for U23 in each system is marked on the y-axis by a filled circle in the same color as traces. The inward flipped state is characterized by θ values between −60° and +60° (labeled and shown by a transparent gray rectangle). All other values of θ indicate an outward flipped state. For those unliganded and liganded simulations where a transition occurred in both simulations, only those values of θ are plotted where a transition was observed. In case the transition was observed only in an unliganded simulation (or vice versa in a liganded simulation), in addition to plotting θ values in the unliganded simulation where the transition occurred, all values of θ are shown for the corresponding liganded simulation (or vice versa corresponding unliganded simulation) where the transition was not observed. (B and C) Data similar to panel A are shown for the flipping of nucleotides C24 (red sticks and marked by an asterisk; panel B) and U25 (green sticks and marked by an asterisk; panel C). The flipping angle of a nucleotide is defined by the center of mass of each of the following four groups: the nitrogenous bases of base-paired nucleotides (labeled 1) neighboring the flipping base, sugar moiety (labeled 2) attached to the base that is stacked with the flipping base, sugar moiety (labeled 3) attached to the flipping base, and the nitrogenous base (labeled 4) of the flipping nucleotide. See also Fig. S16 (ESI†). |
The first bulge-nucleotide U23 (Fig. 5A) was initially in a flipped-in state in most structures, except in two structures (PDB codes 1QD3 and 1UTS) where it was in a flipped-out state (as marked by a filled circle on the y-axis at t = 0 in time-traces of θ in Fig. 5A). We observed that U23 flipped out during five unliganded simulations (PDB codes 1ARJ, 1UUI, 2L8H, 5J2W, and 6D2U) in which U23 was initially in a flipped-in state (traces with lighter shades in Fig. 5A and Fig. S16, ESI†). In most of these systems U23 eventually returned to its initial position after briefly transitioning to a flipped-out conformation. For example, in one of the unliganded simulations (PDB code 2L8H; light purple time-traces in Fig. 5A) U23 flipped outward at t = ∼0.9 μs, maintaining the flipped out state for ∼0.5 μs, and then flipping inward, resuming its initial position.
We observed a similar conformational behavior in systems with PDB codes 6D2U (light brown time-traces in Fig. 5A) and 5J2W (light cyan time-traces in Fig. S16A, ESI†). In another unliganded system (PDB code 1UUI), U23 flipped out at t = ∼1.25 μs and remained in a flipped-out state until the end of the simulation (light yellow time-traces in Fig. 5A). However, in one system (PDB code 1UTS) we observed conformational transitions in both unliganded and liganded simulations (cyan time-traces in Fig. 5A), where U23 flipped inward from an initially outward conformation with θ = 130, retaining the inward position for almost the entirely of liganded simulation and flipping outward after ∼1 μs in the unliganded simulation. In all liganded simulations, U23 transiently flipped outward only in one system (PDB code 1ARJ; Fig. S16A, ESI†). Overall, we observed that the presence of ligands significantly decreased conformational transitions in U23. This conformational behavior is consistent with observations from experiments, where a smaller pool of ligands (arginine amide and a linear as well as a cyclic peptide) were tested.75 However, we consistently observed conformational stabilization of U23 for several ligands with different binding affinities.
The second bulge-nucleotide C24 (Fig. 5B) was initially in a flipped-out conformation in all systems, except in three systems (PDB codes 1ANR, 1LVJ, and 1QD3). We observed that C24 flipped inward in three unliganded simulations (PDB codes 1ARJ, 5J1O, and 1UTS; lighter shade time-traces shown in red, blue, and cyan in Fig. 5B). For example, C24 flipped inward in one of the unliganded systems (PDB code 1ARJ; Fig. 5B) and remained in the inward conformation until ∼1.15 μs. However, in other unliganded systems (PDB codes 5J1O and 1UTS), C24 flipped inward at the beginning of simulations where it either remained flipped inward during the entire simulation (PDB code 5J1O; Fig. 5B) or flipped outward and then flipped back inward toward the end of the simulation (PDB code 1UTS; Fig. 5B). For two unliganded systems where C24 was initially in an inward flipped conformation (PDB codes 1LVJ and 1QD3), it flipped outward toward the end of simulations (Fig. 5B and Fig. S16B, ESI†). In liganded simulations, we observed that C24 flipped inward during two simulations (PDB codes 1ARJ and 1UTS; darker red and cyan time-traces in Fig. 5B) and flipped outward during one simulation (PDB code 1LVJ; darker brown time-trace in Fig. 5B). Overall, we observed that C24 showed conformational transitions both in unliganded and liganded simulations, but less frequently in liganded simulations.
The third bulge-nucleotide U25 (Fig. 5C) was initially in a flipped-in conformation in most structures except in three structures (PDB codes 2KDQ, 5J2W, and 6D2U). We observed U25 to be significantly flexible in both unliganded and liganded simulations since it flipped inward or outward in most of the systems. For example, U25 flipped outward from an initially inward conformation in several unliganded simulations (PDB codes 1UUI, 1UTS, 2KX5, 2L8H, and 5J1O; Fig. 5C and Fig. S16C, ESI†). It also flipped inward from an initially outward conformation during unliganded simulations of several systems (PDB codes 2KDQ, 5J2W, and 6D2U) in which it remained in the inward conformation until the end of each simulation (Fig. 5C). In several liganded simulations (PDB codes 1ARJ, 1UTS, 2L8H, and 5J1O), U25 flipped outward from an initially inward conformation (Fig. S16C, ESI†), while in other systems (e.g. PDB code 6D2U) it flipped inward from an initially outward conformation (Fig. 5C). Overall, we observed that all three bulge nucleotides (U23, C24, and U25) can conformationally transition between inward and outward states although ligands decrease the frequency of these transitions.
![]() | ||
Fig. 6 Predicted binding pockets in unliganded TAR structures. (A) Predicted binding pockets (cyan surfaces) are shown overlaid on each TAR RNA structure (transparent white cartoon). (B–D) Snapshots of the overlay of each ligand (orange sticks) on the predicted binding pocket where the ligand is known to bind in each structure. See also Fig. S17 and S18 (ESI†). |
We also assessed whether the density of pockets observed in specific regions in a structure could accommodate ligands in conformations observed in liganded TAR RNA structures. We found that the observed pockets were sufficiently large in size to encapsulate the ligand known to bind to that specific TAR RNA structure. For example, TAR RNA is known to bind to small-molecule ligands (acetylpromazine and RBT158) in the bulge region, where we observed binding pockets (labeled B for PDBs 1ANR and 5J1O in Fig. 6A) large enough to accommodate each ligand (Fig. 6B). Furthermore, TAR RNA is also known to bind other ligands (neomycin B and JB181) in the Helix I and the apical-loop/bulge regions, where we observed binding pockets (labeled H1 for PDB 6D2U and L/B for PDB 1UUD in Fig. 6A) large enough to accommodate respective ligands (Fig. 6C and D). We observed that unliganded simulations with different initial structures showed several similar binding pockets as well as previously unknown binding pockets (in the apical loop or Helices I and II) that accommodated ligands known to experimentally bind to other conformations (Fig. S18, ESI†).
We initially assessed the overall torsional flexibility of TAR RNA by computing distributions of all backbone dihedral angles from unliganded and liganded simulations (Fig. 2A and Fig. S4, ESI†) and found these distributions to be consistent with the range of values from experimentally known structures of nucleic acids. This observation supports the ability of the interatomic potential in adequately capturing the dynamics in TAR RNA structures. By computing the buried surface area (BSA) of each ligand, we also assessed the stability of ligands during long time-scale MD simulations and found that in most TAR RNA structures ligands remained bound throughout simulations except in a few cases where ligands conformationally rearranged and/or partially dissociated.
We then probed the global dynamics in TAR RNA by comparing all unliganded and liganded conformations from MD simulations using global RMSD, ΔRMSF, and clustering analyses (Fig. 3 and Fig. S8, S12, ESI†). The primary observation from the RMSD analysis was that the mean RMSD of unliganded conformations was higher than the mean RMSD of liganded conformations, thereby suggesting decreased conformational fluctuations in TAR RNA on ligand binding. The analysis of ΔRMSF further supported this observation since the magnitude of fluctuations in nucleotides was smaller in the liganded simulations than in the unliganded simulations. These observations are consistent with the notion that RNA molecules are stabilized by binding of ligands because liganded TAR RNA structures were conformationally more rigid compared to unliganded structures.
However, one of the small molecules, acetylpromazine, resulted in distinct perturbations in the overall structure of the liganded TAR RNA in comparison to other liganded systems and in comparison to the corresponding unliganded simulation. In the presence of acetylpromazine, the TAR RNA structure transitioned between two distinct (bent and stretched) conformations (Fig. S7, ESI†). We have previously also shown that the (un)binding process of acetylpromazine is associated with the flipping of nucleotides in the binding pocket.76 The main structural difference between acetylpromazine and other small molecules is the presence of a sulfur moiety in one of the benzoic rings (Fig. S1, ESI†), which could be an important design feature for future development of inhibitory compounds.
The bulge motif which connects two helices in the TAR RNA structure (Fig. 1A) also became more rigid in the presence of ligands, which decreased the twisting and bending fluctuations in TAR RNA helices. The clustering analysis further supported these observations by showing a higher fraction of similar conformations in liganded structures in comparison to unliganded structures. In fact some liganded structures with peptide ligands exhibited a single cluster containing more than 90% of the RNA conformations, implying that these liganded simulations exhibited small conformational variability in the presence of peptides since the TAR RNA conformations were similar to each other (e.g. PDB code 2KX5; Fig. 3B and Fig. S14, ESI†). We also observed that various simulations have similar conformations that form combined clusters by performing combined cluster analysis (Fig. S15, ESI†) and by comparing average structures from each simulation (Fig. S10 and S11, ESI†). These observations further support the previously proposed hypothesis12,26,43 that despite being a highly flexible molecule, TAR RNA potentially adopts a set of conformations forming an ensemble of structures that can recognize various ligands.
We further probed the local dynamics in bulge nucleotides (U23, C24, and U25), the conformational flipping motions in which facilitate ligand binding27,42 as well as alter global dynamics in TAR RNA. As opposed to the notion that the binding of ligands may prevent conformational transitions in nucleotides, we observed that the bulge nucleotides can transition between inward and outward conformations in both unliganded and liganded states although the frequency of transitions significantly decreases in the presence of ligands. Overall, we found bulge nucleotides C24 and U25 to be more flexible than U23.
Importantly, as a result of conformational heterogeneity in TAR RNA structures and coupling between local and global dynamics, we observed the formation of ligand binding pockets near several structural motifs (bulge region and helices I/II). This observation is consistent with the suggestion that TAR RNA may adopt conformations with pre-existing binding pockets where ligands can fit.12 We observed that these binding pockets form consistently in all unliganded simulations with enough volume to accommodate different ligands (Fig. 6 and Fig. S17, S18, ESI†), including larger ligands (e.g. peptides). Moreover, we observed the formation of binding pockets in other structural motifs (e.g. the apical loop) in TAR RNA which are potentially useful to future inhibitor design. As an example, Patwardhan et al.59 showed that the amiloride ligands can bind to nucleotides in the apical loop where we observed several binding pockets.
Footnote |
† Electronic supplementary information (ESI) available: Additional analysis and snapshots from simulations are provided in Tables S1, S2 and Fig. S1–S18. See DOI: 10.1039/d1cp00679g |
This journal is © the Owner Societies 2021 |