Open Access Article
Marlies Derden†
,
Sten Reynders†,
Jérôme Rihon,
Xinlu Li,
Frédérick Coosemans,
Jef Rozenski,
Piet Herdewijn
,
Elisabetta Groaz†
* and
Eveline Lescrinier†
*
KU Leuven, Rega Institute for Medical Research, Medicinal Chemistry, Herestraat 49-Box 1041, 3000 Leuven, Belgium. E-mail: elisabetta.groaz@kuleuven.be; eveline.lescrinier@kuleuven.be
First published on 7th October 2025
i-Motifs (iMs) are non-canonical four-stranded DNA structures stabilized by hemi-protonated (C:H+C) cytosine base pairs. Under non-acidic pH conditions, cytosine deprotonation disrupts these interactions, resulting in structural unfolding. To broaden the environmental conditions that support iM formation for potential biomedical applications, we investigated the effect of substituting 2′-deoxycytidine residues with either the (R) or (S) isomers of an acyclic phosphonate-linked nucleotide (ZNA) bearing a cytosine (C) or 5-fluorocytosine (5F-C) nucleobase within the 22-mer vertebrate telomeric iM. UV melting and NMR spectroscopy analyses revealed that the presence of ZNA-C in the core of the iM decreased its stability, while 5F-C altered the iM stability across different pH ranges. Molecular dynamics (MD) provided a first glimpse of the quasi-symmetrical loops 1 and 3 in the iM formed by the native vertebrate telomeric sequence. The changes in the dynamics of the loop region in the modified sequences could be correlated with their decreased stability, despite stable base pairing occurring for both (R) and (S)-ZNA modifications.
In living cells, iMs are mainly present at promoters of actively transcribing genes and within open chromatin regions, often overlapping with R-loops, with their abundance and distribution varying in a cell-type specific manner.4 They are believed to play key roles in various DNA processes, including the regulation of genome stability, gene transcription, mRNA translation, DNA replication, telomere and centromere functions, as well as vertebrate diseases.5 In vitro, the formation of iM depends not only on experimental conditions such as pH and temperature but also on the length of cytosine stretches as well as the number and types of nucleotides that link the cytosine containing residues.6 Despite extensive research, the precise factors governing iM formation remain not fully understood. To gain deeper insight and potentially regulate iM stability and folding, various chemical modifications have been explored, including variations to the backbone and sugar moieties such as peptide nucleic acid (PNA),7 phosphorothioate (PS),8 2′-fluoroarabino nucleic acid (FANA),9 locked nucleic acid (LNA),10 hexitol nucleic acid (HNA)11 as well as acyclic nucleic acid analogues.12–14 These modifications have produced varying outcomes; for instance, PNA and PS did not significantly affect iM stability, whereas LNA enhanced stability only when the modified residues were positioned at specific sites within an oligomer. Other modifications either prevented folding or led to destabilization. This variability highlights the complexity of how sugar moieties influence iM stability and emphasizes the need for a systematic investigation.
The 22 nt vertebrate telomeric fragment with the sequence 5′-d[(CCCTAA)3CCCT] has been well studied in vitro.15 It exhibited pH-dependent iM formation under mildly acidic conditions, dissociating at neutral pH, and demonstrated optimal stability around pH 6.0. Previous experimental studies using specific sugar modifications highlighted the importance of sugar–sugar contacts for iM stability. Depending on their position within the iM tetrameric core or loops formed by the vertebrate telomeric sequence, acyclic unlocked nucleic acids (UNA) could destabilize or stabilize the corresponding folded structure, respectively.12 Similar effects were observed for cytidine containing acyclic threoninol nucleic acid (aTNA) residues.13 Furthermore, the replacement of unmodified residues at selected C-core positions with aTNA allowed to modulate iM stability over different pH ranges. The introduction of a fluorine atom at the 2′-β position of 2′-deoxycytidine (dC) was found to enhance iM stability, supporting formation even at higher pH values approaching ∼7.0.16
The pH-dependence of iM formation is an attractive feature for nanotechnology applications, particularly in the design of DNA nanodevices that use iM-forming sequences as pH-switches or components of DNA-nanomotors.17 The pH-responsiveness of iM forming oligonucleotides holds also potential for applications in bioanalysis and targeted therapy. A pH-responsive aptamer bearing an unnatural nucleobase was designed to specifically fold through C:H+C base pairing and bind target proteins in the extracellular acidic microenvironment of cancer cells to achieve selective inhibition of cancer cell migration.18
Understanding how different chemical modifications influence iM formation and stability is crucial for designing nucleic acid-based tools with specific functional properties. In this study, the effect of an acyclic ZNA-C modification with standard cytosine nucleobase on the iM formation ability of the vertebrate telomeric fragment 5′-d[(CCCTAA)3CCCT] was investigated (Fig. 1). ZNA can exist in two enantiomeric forms, i.e., (S)- and (R)-ZNA. Both isomers were considered in this work, however only the (R)-isomer was previously shown to be accommodated in a DNA duplex.19 The 5-fluoro (S)-ZNA-C analogue was also included since a previous X-ray crystallography study suggested that formation of C:H+C base pairs can be affected by a decrease of pKa of cytosine upon halogenation.20 Moreover, the 19F in the studied modification can be exploited to probe the secondary structure of the core of the telomeric fragment, as its chemical shift is sensitive to multiple factors affecting its electronic microenvironment, including stacking, H-bonding interactions with adjacent bases, and the protonation state of the nucleobase.21 The thermodynamic stability of the modified and unmodified sequences was studied by the UV melting method. Measurements were performed at different pH values to assess the pH-dependent stability characteristic of an iM structure. Nuclear magnetic resonance (NMR) was used to evaluate C:H+C base pairing in the iM core for the modified and unmodified congeners under different pH conditions. To monitor the effect on mobility of individual atoms or residues induced by a single ZNA-C modification in the core of the iM, a molecular dynamics (MD) study was also performed.
| ON1 | d(CCCTAACCCTAACCCTAACCCT) |
| ON2 | d(CCCTAAC)-(R)xC-d(CTAACCCTAACCCT) |
| ON3 | d(CCCTAAC)-(S)xC-d(CTAACCCTAACCCT) |
| ON4 | d(CCCTAAC)-(S)fC-d(CTAACCCTAACCCT) |
| ON5 | d(CCCTAAC)-(S)fC-d(CTAACCCTAAC)-(S)fC-d(CT) |
060; T, ε = 8560; C, ε = 7100). For all experiments, the oligonucleotide concentration was 4 μM. A Varian Cary 100 BIO spectrophotometer was used to determine the melting curves. Cuvettes were maintained at a constant temperature by water circulation through the cuvette holder. The temperature of the solution was measured with a thermistor that was directly immersed in the cuvette. Temperature control and data acquisition were carried out automatically with an IBM-compatible computer by using the Cary WinUV thermal application software. A quick heating and cooling cycle was carried out to allow for proper annealing. Denaturation curves were acquired at 260 and 270 nm (SI). The samples were then heated from 10 to 80 °C at a rate of 0.2 °C per min and were cooled again at the same speed. Experiments at different pH values ranging from pH 4.6 to 6.0 were performed. Melting temperatures (Tm) were calculated as the midpoint of the transition using the first derivative of the experimental data; reported values are the average of two runs.
:
1. Concentrations of 0.4 and 0.1 mM were used for ON1, while ON2–ON5 were analyzed at 0.1 mM. NMR spectra were acquired on a Bruker Neo spectrometer operating at 600 Mhz with a quadruple cryoprobe (1H, 31P, 15N, 13C) and processed with the Topspin software. TOCSY spectra were recorded with a standard DIPSI24 spinlock sequence and 64 ms mixing time. NOESY spectra were acquired at 200 and 300 ms mixing times. Zero-quantum interference in two-dimensional NMR spectra was eliminated by gradients.25 Excitation sculpting26 was used to suppress the water signal prior to acquisition. One- and two-dimensional experiments were carried out at temperatures ranging from 0 to 40 °C.
![]() | (1) |
Ultimately, two functions were necessary to form a linear combination of the resulting profile of χα at χβ = 70°. The following parameters (Kv, γ, n) were used for the parametrization of χα: [Kv: 2.8, γ: 355, n: 2.05] and [Kv: 4.25, γ: 230, n: 0.57]. A graph (Fig. S4B, SI) was provided that represents the linear combination of both analytic functions to model the behavior of this specific angle.
000 cycles, with the first 22
500 cycles employing the steepest descent method and the last 7500 the conjugate gradient method. The SHAKE algorithm46 was employed to allow a time step of 2 fs. An initial heating was performed for 50 ps, from 0 to 100 K with vlimit set to 15. The rest of the heating, from 100 to 300 K, ran for an additional 50 ps. Density and equilibration ran for 100 ps each, with density set at 1 g mL−1. The Langevin thermostat47 and the Berendsen barostat48 were used to keep the temperature and density at a constant value. The production simulations were run for 1000 ns, with a restart every 1 ns to vary the random seed of the simulation. MD simulations were run on an NVIDIA GeForce RTX 2070 GPU through the cuda accelerated simulation engine.49 Cpptraj was used to extract the modelled data.50 Visualization was performed using UCSF Chimera.51 Graphs were made using the Matplotlib python module.| 4.60 | 5.10 | 5.25 | 5.50 | 5.75 | 6.00 | |
|---|---|---|---|---|---|---|
| a na: no observable Tm. | ||||||
| ON1 | 47.0 | 44.2 | 45.1 | 38.8 | 34.4 | 30.6 |
| ON2 | 36.4 | 33.0 | 27.9 | 26.7 | na | na |
| ON3 | 38.5 | 34.1 | 29.7 | 28.0 | na | na |
| ON4 | 32.9 | 29.3 | na | na | na | na |
| ON5 | na | na | na | na | na | na |
![]() | ||
| Fig. 2 (A) Imino proton signals in the 1D proton spectra of ON1–5 at pH = 5.2 (0 °C). (B) Proposed equilibrium of the folded single-stranded iM and dimeric structure containing A5::T16 base pairs. | ||
An equilibrium between 5′E and 3′E conformers of the monomeric iM was also observed for a sequence that lacked T22. The additional T10::T22 in ON1 is thought to stabilize the 5′E conformation of the iM, while the presence of a Watson–Crick A::T base pair rules out a monomeric 3′E iM as the minor conformer in the studied sequence.2 The disappearance of the minor form at lower concentration further supports its dimeric nature.
In the pH range between 5.20 and 5.50, the 1H NMR spectra of ON2–3 at 0 °C contained one sharp signal at 11.0 ppm and four signals between 15 and 16 ppm due to C:H+C base pairs (Fig. 2A). The signal at 13.3 ppm, which was previously attributed to A::T base pairing in the dimeric structure of the full DNA sequence ON1, was absent in the spectra of the modified sequences at 0.1 mM. No imino signals were observable for ON2 and ON3 above 30 and 40 °C at pH 5.5, while for ON1 imino signals were still detectable at 40 °C throughout the full pH range up to 6.3 (Fig. S6, SI).
A significant loss of iM stability was also previously observed in circular dichroism (CD) spectroscopy experiments upon introduction of aTNA13 and UNA12 monomers into the tetrameric region of an identical DNA sequence. For ON4 containing a single modified 5-F-cytosine (S)-ZNA nucleoside at position 8, four sharp C:H+C imino signals were only visible at 0 °C (Fig. 2A) and at pH 5.20 or lower. The fluorine substituent in 5-fluorocytidine is known to substantially lower the apparent pKa of the N3 nitrogen by 1.7–1.8 units compared to cytidine,54 leading to a lower pH needed for stable iM formation in solution. A computational study by Yang et al. predicted that 5F-dC can form iMs, as the base-pairing energies (BPEs) of proton-bound heterodimers between cytosine and 5F-dC surpass those of canonical G:C base pairs, despite being lower than the BPEs of unmodified C:H+C base pairs.55
A second 5F-C (S)-ZNA modification (ON5) was introduced to study the effect of a modified base pair within the iM structure, however imino signals were absent, even at low pH and temperature, suggesting that no stable iM was formed in contrast to what is reported for aTNA.13
Subsequently, 31P NMR was also performed to examine potential backbone changes upon introduction of ZNA modifications. The 31P signals arising from the phosphonate linkage in ON2–ON5 were shifted downfield compared to those in the traditional phosphodiester linkages of ON1, reflecting a significantly different electronic environment (Fig. 3A). A phosphonate signal at 16.04 ppm was observed for (R)-ZNA-C in ON2, which was close to the reported values of 16.83 ppm for (R)-ZNA-C in a duplex and 16.75 ppm for (S)-ZNA-C in a hairpin.19 In the folded iMs of ON3 and ON4 at 10 °C, the 31P resonance of (S)-ZNA-C and (S)-ZNA-5F-C were shifted downfield to 14.45 and 14.42 ppm, respectively (10 °C, pH 5.2). Upon increasing the temperature, a second signal arose in the 31P spectra of ON3 and ON4 at 16.36 and 16.26 ppm, respectively, while the original signal gradually decreased (Fig. 3B). This is characteristic for the co-existence of two conformational states that exchange slowly on the NMR timescale. Since the rising signal above 16 ppm matched the phosphonate chemical shift observed at pH 5.5 in the absence of iM formation, it was assigned to the unfolded sequence. For ON2, broadening of the phosphonate signal, rather than appearance of separate signals, was observed upon increasing the temperature to 40 °C, indicating that, for this oligonucleotide, the exchange rate between folded and unfolded structures approached the difference in chemical shifts between both states, which is considerably smaller than in ON3 and ON4.
In contrast to the phosphonate signal in the iM backbone of ON4, the 19F signal of its modified nucleobase in the iM core at pH 5.2 exhibited a progressive chemical shift change with increasing temperatures (Fig. 3C). At 0 °C, ON4 displayed a signal at −166.32 ppm corresponding to an iM-like structure, based on the 1H NMR spectrum showing characteristic imino proton signals at 14–16 ppm. Increasing the temperature led to a significant chemical shift change between 30 and 35 °C towards −167.20 ppm at 40 °C, approaching the 19F chemical shift of −167.26 reported in the literature for 5F-dC. This chemical shift change was comparable to the approximately 3.0 ppm downfield deshielding effect reported for 5F-U in a Watson–Crick geometry relative to a random coil context.56
Next, strong intraresidue anomeric to aromatic NOE interactions allowed to assign the non-exchangeable nucleobase protons in cytosine residues, and subsequently their amino protons involved in hydrogen bonding within the iM core. From the latter assignment, the imino signals could also be identified in the NOESY spectrum. For UNA-C, destabilization of the iM was attributed to disruption of interactions within tetrameric core.12 In contrast, ON2 adopted a stable iM at low temperature with characteristic NOE interactions across its narrow grooves, suggesting that the ZNA monomer did not cause substantial alterations in the tetrameric core of the iM structure. As described for ON1,43 loop signals in loops 1 and 3 were broadened due to conformational exchange and showed overlap due to symmetry within the folded structure, preventing the assignment of loop 1 (T4-A6) and loop 3 (T16-A18).
A similar assignment procedure was also carried out for (S)-ZNA C containing ON3 (Fig. S7, SI). The four sharp imino signals above 15 ppm could be assigned to the four C:H+C base pairs in the central core of the iM, including (S)-ZNA-C8::C20. In contrast to ON2, the interresidue interactions of C15:H1′ to xC8:H1′ across the narrow groove in the iM were not detected. Unfortunately, a strong overlap of imino signals for C9::C21 and C2::C14 base pairs prevented their unambiguous assignment. Broad signals in the imino proton region did not yield observable NOE contacts; this could arise from the closing base pairs C7::C19 and C1::C13, which were absent in the spectrum of ON2.
| Atoms | ON1 | ON2 | ON3 |
|---|---|---|---|
| C8:O2-C+20:H41 | 1.812 (±0.098) | 1.850 (±0.107) | 1.848 (±0.104) |
| C8:N3-C+20:H3 | 1.940 (±0.092) | 1.933 (±0.091) | 1.916 (±0.085) |
| C8:H41-C+20:O2 | 1.980 (±0.159) | 1.958 (±0.150) | 1.929 (±0.135) |
| A6:N7-A18:H62 | 2.042 (±0.143) | 3.188 (±1.714) | 5.631 (±0.193) |
| A6:H62-A18:N7 | 2.035 (±0.139) | 2.947 (±1.566) | 3.625 (±0.186) |
| T10:O4-T22:H3 | 2.046 (±0.551) | 8.843 (±0.374) | 2.335 (±1.480) |
| T10:H3-T22:O4 | 2.045 (±0.516) | 8.933 (±0.386) | 2.253 (±1.274) |
| A12:N7-T22:H3 | 15.243 (±2.159) | 2.035 (±0.162) | 15.438 (±2.022) |
| A12:H62-T22:O2 | 19.848 (±2.748) | 1.964 (±0.170) | 20.051 (±2.488) |
The mobility of individual atoms or residues is reflected by the root-mean-square fluctuation (RMSF) of atomic positions relative to the reference structure. From the RMSF plot for individual atoms throughout the full simulation relative to the starting NMR structure of ON1 (Fig. 6, right), it is clear that the core of the iM contains the least mobile elements with C:H+C pairs exhibiting comparable low fluctuations from the reference state. Their limited mobility reflects a stable structure for the stacked hemi-protonated cytosine pairs, including the pair with the ZNA modification at position 8 in ON2 and ON3.
Most TAA loop segments exhibited significant repositioning throughout the entire MD trajectory with strong peaks in the RMSF plot, which were due to the flapping motions of nucleobases in the loop residues. The largest differences in fluctuations between simulations were observed throughout the last 100 ns in the equilibrated phase MD trajectory.
As previously reported for the iM of ON1,36,57 T10 and adenines A11 and A12 in loop 2 displayed strong flexibility, while the position of residues in loops 1 and 3 was much more stable. This phenomenon was attributed to the stabilizing effect of mutual interactions between loops 1 and 3 in the DNA iM. The RMSF plots of the modified sequences over the last 100 ns in the MD trajectory revealed a different pattern for ON2. Specifically, residue A12 in loop 2 demonstrated significantly reduced RMSF, while flapping motions for the nucleobases of T4 and A5 in loop 1 persisted in the last 100 ns of the simulation. Previous conformational dynamics unraveled that the stability of the studied iM is mainly associated with motion in interacting loops 1 and 3, where temperature and loop sequence can affect the dynamics and stability of the iM.57 Increased RMSFs over the last 100 ns of the MD trajectory for the atoms of T4 and A5 reflected their increasing mobility within loop 1 according to the order ON1 ≪ ON3 < ON2 (Fig. 6). The analysis of hydrogen bond distances in the last 200 ns of MD simulations for the of A6::A18 base pair flanking the core of the iM (Table 3) revealed increased average distances between atoms involved in hydrogen bonding for the modified oligomers compared to ON1, especially for ON3, indicative of increased fraying of this base pair within the interacting loops 1 and 3. Increased dynamics in loops 1 and 3 of ON2 and ON3 was supported by the disappearance of the imino proton for the flanking C7::C19 base pair in the NMR spectrum of ON2, while it was observable for the reported d(CCCTAA5mCCCTAACCCUAACCCT) sequence.43
Despite low RMSF values for atoms in hemi-protonated cytosine pairs, the incorporation of a ZNA-C residue had a clear impact on the puckering behavior of the sugar ring in its complement (residue 20) as well as in residues 2 and 3 with nucleobases stacking with the ZNA-nucleobase. For each of these residues, fluctuation between multiple ribose ring conformations was observed during the last 100 ns of the MD simulation (Fig. S8–S10, SI), which corresponded to a slightly increased RMSF at the sugar phosphate backbone for these residues in ON2 and ON3 relative to ON1. The effect on sugar puckering in residue 3 was passed on to the first residue in loop 1 (T4) through the backbone linkage. In residue 9, immediately downstream of ZNA-C, the dominant sugar puckering shifted to the C2′-endo (South) conformation, while in DNA iM structures, the deoxyribose sugars primarily adopt the atypical C3′-endo (North) conformation, which is also found in A-type double stranded helices. The altered sugar conformation of residue 9 led to repositioning of loop residue T10, resulting in the loss of contact with T22. Instead of the sheared T10::T22 seen in ON1 and ON3, T22 moved within hydrogen bonding distance of A12 to form a reverse Hoogsteen base pair in the structure of ON2, resulting in reduced RMSF for A12 upon incorporation of the ZNA-C modification (Fig. 6). Unfortunately, the imino signal at 11 ppm did not exhibit any NOE interactions. Therefore, NMR could not be used to obtain data on the base pairing pattern of T22.
At the other end of the iM, an A6::A18 Hoogsteen:Hoogsteen base pair was consistently observed between interacting loops 1 and 3 in all simulations. Nucleobases in the A6::A18 base pair were sandwiched between the C:H+C base pair formed by C7 and C19 on one side and A17 on the other side. The increased RMSF for atoms in the nucleobase of A17 is caused by syn/anti flipping during the stabilized phase of the MD trajectories with transient loss of stacking interactions (Fig. 6). A similar RMSF pattern in loops 1 and 3 was also reported for comparative MD simulations on ON1 using different forcefields.36 However, these results contrast with reported NMR data for d(CCCTAA5mCCCTAACCCUAACCCT),43 which describe syn/anti flipping for A18 in an A6::A18 base pair as an explanation for the observed conformational exchange between two conformers of interacting loops 1 and 3. In this NMR study, two base substitutions (italicized) were introduced in the vertebrate telomere sequence to break the quasi-symmetry between loops 1 and 3 that prevented full assignment in the NMR spectra of the natural sequence ON1. It is unclear to what extent these modifications affect the structure and dynamics of both loops, however authors suggested that tight packing of loops 1 and 3 might favor the configuration with the non-methylated U16 inside and T4 outside. In the equilibrated phase of all MD simulations, both T4 and T16 were pointing outside (Fig. 7). In agreement with NMR data, this configuration has a quasi-symmetrical structure for loops 1 and 3 with residues C7/C19, A6/A18, and T4/T16 that are replaced by their correspondent upon 180° rotation around the quasi-symmetry axis.
![]() | ||
| Fig. 7 (A) Comparison of loop 1 (T4-A6) and loop 3 (T16-A18) contacts in the solution NMR structure of d(CCCTAA5mCCCTAACCCUAACCCT)43 (1el2.pdb). (D) Simulated ON1 (frame at 1000 ns MD). (E) Simulated ON2 (frame at 1000 ns MD). (F) Simulated ON3 (frame at 1000 ns MD). Residues are color coded by type (C: green, T: yellow, U: orange, A: blue). Across all three simulated models (ON1–ON3), nucleobase A17 occurs in both syn- and anti-conformation. Representative frames from model ON3 in top view on the A6::A18, sandwiched between C7::C19 and A17 are shown with A17 in syn at 910 ns (B) and anti at 995 ns (C). Nucleobase A17 is colored dark blue. | ||
The presence of a 5-fluorine atom at the nucleobase moiety of (S)-ZNA-C further lowered the pH required to form a stable iM, while preserving the NOE interactions identified for the (S)-ZNA-C containing iM. This observation suggests the possibility of modulating the pH stability of iMs by introducing substituents at the C5-position of cytidine, enabling the design of iM structures suited for different pH ranges. Furthermore, the 5F-C nucleobase allowed to readily monitor the temperature-dependent folding of the corresponding iM by 19F NMR. The ability of 5F-C to provide distinct 19F spectral signatures in vitro is noteworthy, as this could be exploited for future applications such as in-cell NMR analysis of iM structures, especially since fluorine is nearly absent in biological systems, resulting in minimal background fluorine signal.58
The observed decrease in thermal stability upon ZNA incorporation could provide a useful means to finely tune iM behavior, potentially allowing to control their structural dynamics under diverse in vitro conditions; however further studies such as a systematic investigation of sequence context are needed to elucidate the broader applicability of this approach. Interestingly, since ZNA was demonstrated to be highly stable against enzymatic degradation,23 it would be worthwhile to investigate whether this modification could enable the design of nuclease resistant iM structures.
Supplementary information: experimental procedures, NMR spectra, and other data supporting the results or analyses. See DOI: https://doi.org/10.1039/d5ra05934h.
Footnote |
| † Equal contribution. |
| This journal is © The Royal Society of Chemistry 2025 |