Open Access Article
Deepraj Negi†
a,
Zijue Huang†
b,
Ahyun Son†
b,
Bharathwaj Sathyamoorthy
*a and
Scott Horowitz
*b
aDepartment of Chemistry, Indian Institute of Science Education and Research Bhopal, Madhya Pradesh 462066, India. E-mail: bharathwaj@iiserb.ac.in
bDepartment of Chemistry & Biochemistry and the Knoebel Institute for Healthy Aging, University of Denver, Denver, CO 80208, USA. E-mail: scott.horowitz@du.edu
First published on 22nd December 2025
G-quadruplexes (G4s) are increasingly recognized to chaperone proteins, warranting studies of structure–function relationships. In this study, we apply solution NMR methods to determine the topology and base-level resolution structure of a G-rich DNA sequence with protein chaperoning activity (referred to as Seq576) without chemical shift assignments. Seq576 samples two conformations, in the slow exchange timescale, arising from a G-register shift. Using the structural insights of Seq576, we then perform structure–function studies via mutation and chaperone assays to investigate the G4 properties essential for chaperoning protein aggregation and folding. These studies highlight the possibility of using a construct design to perform in-depth nucleic acid structural biology investigation using inexpensive and fast NMR experiments to obtain and analyze function, such as the residue-level investigation of chaperone activity.
G4s were previously found to be orders of magnitude more efficient at preventing aggregation than other chaperones in vitro. Furthermore, they were found to be general chaperones, able to prevent the aggregation of a wide variety of different proteins.6,9 Tests of whether G4s could additionally directly affect protein folding found that they rescue partially folded protein intermediates and promote their folding to the native state.10 Further investigation showed that G4s could catalyze protein folding. In these experiments, the protein folded through an on-pathway intermediate while continuously bound to the G4, and with increased rate constants for both folding and unfolding.11 In cells, G4s were found to improve the folding environment to a similar degree to many chaperone proteins and to directly aid the quality of protein folding as their primary effect.9,10
However, our knowledge of how G4s chaperone proteins is far less developed. A previous investigation suggested the roles of overall structural topology, accessibility to the G4 tetrad core, G4 oligomerization state, and structural dynamics in chaperoning proteins.12 Characterizing specific contributions with structure–function studies would identify molecular interactions that facilitate G4s in chaperoning proteins, which are currently not clear.
Performing comprehensive structure–function analyses requires detailed structural information about the binding partners. While solving the structures of any macromolecule is a time-intensive process, it tends to be even more so for nucleic acids than for proteins. Obtaining conformational preferences of nucleic acids with NMR spectroscopy is challenging due to the low proton density and reliance on time and labor-intensive nuclear Overhauser effect (NOE)-based analysis for chemical shift assignments.13 Identification of the G4 topology is further complicated by the range of structural polymorphs, calling for effective and economic methods that provide an alternative to the conventional methodology.14 Recent developments in 13C/1H chemical-shift-based methodologies for DNA G4s enable the rapid and reliable identification of backbone topology and characterization of loop conformations, without the need for expensive isotopic enrichment or complicated NMR experiments/analysis.15,16
In this work, we apply a 13C/1H chemical shifts-based solution-state NMR spectroscopy methodology to analyze the conformations of a G4-forming DNA sequence (referred to as Seq576) that is known to perform protein chaperoning activity, and is found in several organisms, especially fish species. Seq576 is observed to sample two conformations that exchange on the slow timescale (greater than milliseconds).
13C/1H NMR analysis provides topological characterization from a natural isotopic abundance sample without chemical shift assignments, enabling mutational analysis to unravel the conformational dynamics of register shifts. Significantly, the NMR-based topology assignment predictions facilitate structure–function analysis of Seq576, paving the way for understanding atomistic details in the analysis of protein chaperoning performed by nucleic acids.
Subsequently, 1D NMR data acquired for Seq576 displayed approximately 24 resonances in the imino region (Fig. 1B), indicating the formation of more than one G4 conformation given a sequence of 20 bases. Such a scenario could arise from two topologies comprising three tetrad planes, each resulting in a doubling of imino resonances in the characteristic Hoogsteen region (10–12 ppm) of the 1H NMR spectrum. This would not be surprising, as Seq576 has fifteen guanine nucleotides, while a three-plane G4 would require a minimum of twelve.
Generally, extensive polymorphism is reported for G-register shifts with broad and overlapping imino resonances. Seq576 displays sharp and reasonably resolved imino resonances at an almost 1
:
1 ratio. The well-resolved spectral feature of Seq576 is rendered as an amenable target for high-resolution characterization with NMR. It also becomes interesting to determine the two conformational states and to determine whether one or both states contribute to chaperoning activity.
To address this question, we devised a strategy to stabilize each potential G4 conformation of Seq576 individually. If a G-register shift is involved in the case of Seq576, then such a scenario would involve G9–G12 and/or G14–G20 stretches. As a first step, we tested the mutants of the G9–G12 stretch, such that the first G was mutated to thymine (Seq576G9T, Table S1) or inosine (Seq576G9I, Table S1). If G9 were indeed involved in tetrad formation, then one set of resonances belonging to that state would be deleted in the NMR spectrum. Similarly, another set was prepared wherein G12 in Seq576 was mutated to thymine (Seq576G12T, Table S1) or inosine (Seq576G12I, Table S1). In line with our speculations, both Seq576G9T/I and Seq576G12T/I mutations resulted in a single set of imino resonances in the 1H 1D NMR spectrum (Fig. 1B and Fig. S2) that were different from one another. Small variations in the chemical shifts of Seq576 mutants from the wild-type are plausibly due to subtle variations in the structure and/or stabilizing interactions (such as a change in π–π interaction of loop nucleotides proximal to the G-tetrad plane). This experiment effectively separated the two conformations of Seq576.
Now that two sets of NMR resonances of the Seq576 sequence accounted for the parallel topology suggested by CD, we looked to use another mutant to confirm the two conformations. Within the G14–G20 tract, only substitution in G17 may likely allow for two sets of G-stretches (G14–G16 and G18–G20) to be available for three-plane G4 formation. Falling in line, the Seq576G17T mutant displays 24 imino resonances in the 1H NMR spectrum that are almost indistinguishable from those of Seq576 (Fig. 1B), unambiguously showing that G17 is not involved in tetrad formation in either state. Therefore, to allow base-level resolution characterization of Seq576 states, Seq576G9T and Seq576G12T sequences were obtained using 2D NMR spectroscopy.
:
syn guanines, counting the number in the distinct cluster provides a rapid and reliable way of determining backbone topology before venturing into detailed NOESY analysis.15,16 To confirm the formation of a parallel topology in both states of Seq576, the peaks picked in the 13C–1H HSQC were overlaid on the C8–H8 chemical shift correlations documented previously.15,16 The C8–H8 spectral analysis of Seq576 indicates 24 resonances in the anti region (Fig. S2), confirming that both states (with 12
:
0 anti
:
syn ratio) sample parallel G4 topology.
The next step was to obtain sequence-specific assignments of Seq576; however, the NOESY analysis is complicated by cross-peaks arising from both states. The single mutants Seq576G9T and Seq576G12T proved helpful, as their 2D HSQC and NOESY spectra arise from a single homogeneous state and are therefore better resolved. Similar to the 1D imino 1H spectra, the 2D spectra of Seq576G9T/G12T agree well with the 2D spectrum of wild-type Seq576, with each mutant displaying 12
:
0 anti
:
syn resonances in the characteristic C8–H8 aromatic and C1′–H1′ sugar regions of the HSQC. Intranucleotide guanine spin-system 1H resonances (H1′, H2′/H2″ and H8) picked from the NOESY were then subjected to an optimized machine-learning-based methodology, facilitating plane-specific assignments (Table S2).15 The assignments allowed for a rapid and reliable sequence-specific assignment for Seq576G9T/G12T mutants. Resonance assignment obtained for Seq576G9T/G12T (Fig. S3–S5 and Tables S3 and S4) mutants was subsequently transferred to the Seq576 NMR data, allowing complete assignment of chemical shifts (Fig. 1B and 2B and C).
![]() | ||
| Fig. 2 Schematic representation of Seq576 topology and 2D NMR assignments for the two states. (A) Seq576 exchanges across two parallel conformations wherein either G9 (top, G9L) or G12 (bottom, G12L) is involved in G-register shift. All tetrad-forming guanosines sample anti-glycosyl dihedral angle and are represented as blue rectangles. Propeller loops connecting parallel strands are depicted in orange, with labels indicating the nucleotide type and position in the primary sequence. Potassium ions that stabilize tetrad planes are shown as yellow spheres with a positive charge. (B and C) 13C–1H heteronuclear single quantum coherence correlation (HSQC) spectrum obtained for Seq576 observed for (B) aromatic C6/C8–H6/H8 and (C) sugar C1′–H1′ correlation with sequence-specific resonance assignment. NOESY connectivity for sequential walk and 15N–1H imino assignments for Seq576G9T and Seq576G12T are provided in Fig. S3–S5. Nucleotide labels for the state involving G12 in the propeller loop are marked with an asterisk to distinguish it from the state where G9 is involved in loop formation (without an asterisk) to maintain consistency with Fig. 1B. Resonance marked with “$” (adenine C2–H2 for aromatic and cytosine C5–H5 in sugar) indicates aliased peaks arising from folding of the 13C dimension and ambiguous assignment is labeled with “#”. | ||
Combined with the information from single mutant analysis of Seq576, we constructed base-resolution conformations of the two states of Seq576 (Fig. 2A). The sequence adopts two parallel G4 conformations wherein either G9 or G12 is involved in tetrad formation, allowing for G-register shift in the second strand (Fig. 2A), with G10 and G11 involved in tetrad formation in both instances. In both states, as G17 is present in a single-nucleotide propeller loop conformation between strands 3 and 4, only a single resonance is observed in either state (Fig. 2B). In contrast, other states tend to show two resonances.
Furanose 13C–C1' chemical shifts and one-bond 13C–1H scalar coupling (1JC1′–H1′) of tetrad nucleotides shed light on conformational fluctuations observed in parallel topology. Guanines involved in tetrad formation following a single-nucleotide propeller loop present in plane-1 (P1) tend to exhibit a downfield-shifted resonance in the carbon dimension compared to other guanines in plane-2 (P2) and plane-3 (P3).21 For Seq576, when G9/G12 is involved in the loop formation, G14/G9 and G18 are preceded by a single-nucleotide propeller loop that exhibits downfield-shifted 13C–C1′ resonance (Fig. 2B).
Additionally, in both states, the average of 1JC1'-H1' for P1 guanines displays an increased magnitude of ∼4 Hz compared to nucleotides present in plane-2 and plane-3 (P2 and P3) (Fig. S6), in line with the previous results, further confirming common features of parallel topology.21
000 bp) showed moderate holdase activity, resulting in approximately 60% aggregation overall, as anticipated.22 In contrast, as previously shown, the G4-forming wild-type Seq576 shows excellent holdase activity, with aggregation below 10% (Fig. 3B).9
To dissect the contribution of each conformer to chaperone activity, we introduced inosine and thymine mutations at positions G9 and G12. While structurally analogous to guanine, inosine cannot form the stable Hoogsteen hydrogen bonds required for G-quartet formation. Thymine substitutions at these positions were also intended to prevent specific guanines from engaging in quartet formation, thereby favoring one conformer over the other. Our results showed that inosine (Seq576G9I and Seq576G12I, Table S1) consistently outperformed thymine (Seq576G9T and Seq576G12T, Table S1) at both positions in preventing aggregation, leading to a very modest decrease (Fig. 3C and D). Comparatively, locking the conformation through mutation at G9 (Seq576G9T/I) or G12 (Seq576G12T/I) results in similar activity, with G9 mutants performing slightly better as chaperones at preventing protein aggregation.
Next, we probed how G17 mutants of Seq576 perform in the aggregation assay. NMR and CD data show that the thymine mutation at G17 does not alter tetrad formation or the overall G4 fold (Fig. 1B and Fig. S7, and S8). Interestingly, despite preserving the G4 structure, Seq576G17T substantially reduced holdase activity (∼60% aggregation, Fig. 3E). Yet again, inosine at G17 (Seq576G17I) was comparatively more effective, resulting in ∼30% aggregation (Fig. 3E). These results suggest that the G17 position in the loop is important for activity, meriting additional G17 mutants to explore its role further. Among the mutants at G17, Seq576G17C showed a substantial loss of function, driving the fastest aggregation. Seq576G17T and the abasic mutant (Seq576G17AP) followed nearly identical kinetics (Fig. 3E and F). However, aggregation with G17T continued beyond that of G17AP, leading to higher overall aggregation (Fig. S9), indicating that substituting guanine with thymine is more disruptive than nucleobase removal to chaperone activity. Seq576G17A delayed the aggregation process and marginally decreased aggregation compared to Seq576G17AP (Fig. 3F), with Seq576G17isoG maintaining activity similar to the wild-type (<10% aggregation). Collectively, these results emphasize the importance of G17 present in the propeller conformation in loop-3 modulating the holdase activity.
While aggregation assays demonstrate that G4s can inhibit misfolding, refolding assays are equally essential to determine whether they can promote the restoration of properly folded functional protein. Next, we evaluated the refolding capability of the mutants using the folding biosensor TagRFP675. Chemically denatured TagRFP675 remains unfolded and non-fluorescent at its native excitation/emission wavelength of 598/675 nm. When diluted into refolding conditions, the recovery of native fluorescence over time allows us to assess the effect of the “foldase” activity of Seq576 and its mutants on their ability to assist protein folding (Fig. 4A).
In the presence of non-G4-forming DNA Seq42, TagRFP675 displays no recovery of fluorescence signal, indicating that the control sequence does not aid in refolding (Fig. 4B). Next, G9 and G12 mutants of Seq576 were investigated for foldase activity. Similar to the aggregation activity, inosine mutants at both positions performed slightly better than the thymine mutants in the refolding assays. However, neither position showed a significant increase in fluorescence compared to the wild-type (Fig. 4C and D). The initial folding rate between inosine and thymine was comparable, suggesting that substitutions at these positions do not substantially impair the foldase function (Fig. S10). These findings indicate that the difference in foldase activity between the conformers involving G9 or G12 in the propeller loop is not pronounced.
Given that G17 mutations of Seq576 exhibited strong effects on aggregation despite not disturbing the parallel G4 topology, we investigated its foldase role closely. At G17, consistent with the aggregation results, substitution by inosine continued to outperform thymine (Fig. 4E). The Seq576G17AP mutant shows the worst foldase activity, with lower fluorescence and a prolonged lag phase, indicating a slower and less efficient refolding process. Similarly, Seq576G17T and Seq576G17C also show poor foldase activity, with slower refolding kinetics than the wild-type. Conversely, Seq576G17A and Seq576G17I display nearly comparable refolding activity to the wild-type, suggesting that purine mutations did not substantially impair foldase function (Fig. S10). Seq576G17isoG exhibits rapid folding kinetics, with an exponential phase nearly identical to the wild-type Seq576 (Fig. 4F).
Comparing the activity that reduces aggregation, the G17 mutations can be rank ordered as follows: G17C = G17T < G17AP < G17A < G17I < G17isoG (Fig. 3). While comparing the activity that promotes folding, the mutations can be ranked as follows: G17AP < G17T < G17C < G17A < G17I < G17isoG (Fig. 4). These trends are consistent across the two assays, yet they suggest that the aggregation and folding functions are not entirely dependent on the same features. While pyrimidines were more effective in the G17 position than the absence of a base for folding, this was reversed for the prevention of aggregation. However, purines were more effective in position 17 for chaperone activity in all cases.
In the predicted TagRFP675–Seq576G9T and TagRFP675–Seq576G12T complexes, we observed that the loop G17 burrowed into a binding interface formed between the β-sheet (residues 214–217) and a random coil region (residues 150 and 152) (Fig. 5A and B). However, for Seq576G17T/G17AP mutants, the G4 structures move away from the protein surface and no longer form these contacts, indicating a loss of interaction that could contribute to their reduced chaperone function (Fig. 5C and D). These predictions suggest the structural positioning of G17 is essential and mediates binding and chaperone activity.
We further quantified the gel to assess the relative oligomerization. Consistent with previous findings, mutants capable of forming higher-order structures largely exhibit stronger holdase activity (Fig. 6B, Pearson's R = −0.90, Spearman's R = −0.92, both p < 0.0001).12 As the relationship between G4 oligomerization and foldase function is unknown, we performed the same comparison for the protein refolding activity.
We observe that mutants with reduced oligomerization showed a slower initial folding phase for foldase function. Based on the ranked foldase performance, Spearman's correlation yielded R = 0.74, indicating that G4 oligomerization is also associated with enhanced foldase activity, but less strongly than aggregation prevention (Fig. 6C and D). Overall, these results suggest that oligomerization is a mechanistic feature that prevents aggregation and facilitates proper protein folding.
The method applied here for characterizing G4 topologies offers a considerably faster approach than traditionally used in NMR. The ability to use unassigned resonances on an unlabeled sample reduces the cost and time to solve an NMR structure considerably. While this technique was used here for G4 structure, a very similar strategy could presumably be employed with other nucleic acid structures.
In terms of chaperone activity, it was interesting that mutations to G17 affected G4 oligomerization so strongly, while it was also crucial to the predicted binding site primarily used for chaperone activity. While unclear how at present, it is possible that these two characteristics are physically linked, and that the ability for G4s to oligomerize uses the same chemical properties as its chaperone function. Regardless, this case emphasizes the importance of considering that chaperoning is not merely binding and that other variables are important to consider in their function.
Footnote |
| † Authors contributed equally to this work. |
| This journal is © the Owner Societies 2026 |