Open Access Article
Robert
Dörrenhaus
,
Philip K.
Wagner
and
Stephanie
Kath-Schorr
*
Department of Chemistry and Biochemistry, Institute of Organic Chemistry, 50939 Cologne, Germany. E-mail: skathsch@uni-koeln.de
First published on 13th June 2025
Artificial base-pairs expand the genetic alphabet and allow for site-specific labeling, for example with fluorophores or spin labels, thereby enhancing the capabilities of biomolecular sensing. We developed CTPT3, a stable C-nucleoside pairing with NaM to form a hydrophobic base pair, and demonstrate its enzymatic incorporation into DNA/RNA and facile modification for labeling.
Recent studies have explored a wide range of metal ions, C1′ modifications, and reductive pathways for C1′ activation, often via the C1′ glycal radical and using bench-stable glycal donors. Accordingly, we investigated such CC reactions to synthesize the novel C-glycosides nucleobase CTPT3 and its ribonucleoside rCTPT3 (Fig. 1).
To assess whether the design of CTPT3 can replicate the electrostatic characteristics of TPT3, we computed the electrostatic surface potentials of both TPT3 and CTPT3. N → C and C → N substitutions in CTPT3 shift the electrostatic potential compared to TPT3. However, these changes occur on the face of the nucleobase oriented away from NaM, unlikely to impact base-pairing interactions significantly. On the NaM-facing side, the carbonyl oxygen of CTPT3 exhibits a higher surface electrostatic potential compared to the thiocarbonyl sulfur in TPT3, with this elevated potential also influencing adjacent carbon atoms. Overall, their electrostatic profiles are similar, suggesting, that CTPT3 binds to NaM similarly to TPT3.
This study optimized Ni-catalyzed CCs to enhance rNaM (Scheme 1 VI) and enable rCTPT3 synthesis (Scheme 1 II), applied a Heck-type reaction for dCTPT3 (Scheme 1 III) and evaluated both in enzymatic triphosphate (TP) synthesis and polymerase assays. We thus present a hydrolysis-resistant C-glycoside complementary to dNaM to expand nucleic acid chemical tools.
CTPT3 nucleobase 3a was first synthesized from thieno[2,3-b]pyridin-7-ol by acid catalysed bromination with NBS (Scheme 1 I). The intermediate was either N-methylated using iodomethane or functionalized with a (CH2)4NBoc linker via its iodinated substrate, yielding 3a and 3b in near-quantitative yields. N-Functionalisation stabilizes the ketone in the keto-tautomeric form. Moreover, as demonstrated in our experiments, the following cross-coupling reactions do not proceed effectively without N-methylation. Coupling of the CTPT3 nucleobase to ribose was challenging as classical methods, such as using aryl lithium with protected lactones, did not yield the desired product, regardless of whether the methylated or non-methylated nucleobase analogue was used. Building on our previous progress with Ni-catalysed CC14 with the rAc ribofuranose15 for rNaM synthesis (VI), we tested this approach with the iodinated CTPT3 nucleobase, but only obtained side product formation induced by Zn.16
The route was changed to diastereoselective photo-reductive Ni-catalysed CC with the rDHP sugar moiety (Scheme 1 II).17 The rDHP sugar was synthesized from ribose by acid catalysed acetonide protection of the 2′ and 3′ OH, followed by benzoyl protection of the 5′ OH. Dihydropyridine was synthesized from ethyl-3-aminocrotonate and glyoxylic acid and coupled to the ribose 1′-OH by amide coupling with N,N′-diisopropylcarbodiimide (see ESI,† Section 2). The Ni-catalysed CC follows a photoreductive mechanism, where the glycosyl ester is oxidized by an activated photosensitizer (4CzIPN), forming a radical after deprotonation. Through a series of single-electron transfers, a Hantzsch pyridine and an alcoxycarbonyl radical are generated. An unstable C1′ radical forms via CO2 loss and participates in a Ni-catalysed CC with a Ni(II) species formed by oxidative addition of the Br-CTPT3 nucleobase 3a,17 producing the protected β-rCTPT3 nucleoside 4a in 17% yield. Deprotection using p-toluenesulfonic acid to remove the acetonide group, followed by nucleophilic cleavage of the benzoyl group with NaOMe, gave the free nucleoside in 95% yield over two steps. The overall yield from thieno[2,3-b]pyridin-7-ol to the rCTPT3 nucleoside was 14% with the CC step being yield-limiting. Compared to rTPT3, which has an overall nucleoside yield of ∼3%,6 this represents a significant improvement, making the route promising for further research and application. Further optimization of the CC step may improve yields. To assess enzymatic compatibility, the nucleoside was subjected to TP synthesis using enzymes typically applied to natural or modified nucleosides, aiming to simplify the triphosphorylation process and avoid low-yielding chemical routes. With support by BioNukleo (Berlin), an enzymatic synthesis of rCTPT3 TP 5 was achieved.
The synthesis of DNA nucleotide dTPT3 TP 7 (Scheme 1 III) originated from the Br-CTPT3 nucleobase 3a used for rCTPT3 and a TMS-protected glycal (Scheme 1 III). The dgly was synthesized in one step from thymidine by silylation of the hydroxy groups and nucleobase elimination in 85% yield.18 A subsequent Heck-reaction19 and selective C3-ketone reduction with with NaBH(OAc)320 afforded the β-nucleoside in 6% yield over two steps. The dCTPT3 nucleoside was enzymatically converted to TP 7 in 7.5% yield. Thus, both rCTPT3 and dCTPT3 are compatible with enzymatic TP synthesis—an encouraging result, as enzymatic acceptance offers here a more efficient and higher-yielding alternative to traditional chemical methods.
rCTPT3 was incorporated into RNA via in vitro transcription (Fig. 2a and c) using T7 RNA polymerase and dNaM-containing DNA template (DNA1_dNaM). The resulting RNA was purified and analysed by LC-MS and denaturing PAGE confirming the presence of full-length products (see ESI,† Section S7 and Fig. 2a). Full-length RNA with rCTPT3 incorporated opposite dNaM with an additional non-templated rCTPT3 at the end of the sequence (sodium adduct: 10
321 g mol−1; calculated 10
321 g mol−1; double sodium adduct 10
343 g mol−1; calculated 10
344 g mol−1) was observed. The addition of non-templated nucleotides by T7 RNA polymerase is well-documented,21 and was also found in control samples with unmodified templates and classical rTPT3 (ESI,† Section S7). With rTPT3 also a tendency of adding non-templated rTPT3 at the 3′ end of the product has been observed previously.22In vitro transcriptions of Spinach_dNaM DNA template were conducted to produce a functional Spinach RNA aptamer (ESI,† Section S7.3) with one rCTPT3 modification. In gel staining with DFHBI confirms correct folding of the full length rCTPT3 modified Spinach aptamer (ESI,† S7.3).
![]() | ||
Fig. 2 Enzymatic incorporation of rCTPT3 TP and dCTPT3 TP. (a) Denaturing PAGE of in vitro transcription by T7 RNA Pol with rCTPT3-TP. Lanes left to right: template; NaM template + rCTPT3-TP; NaM template without rCTPT3-TP; natural template + NTPs. (b) Denaturing PAGE of DNA primer extension by KlenTaq. Lanes left to right: DNA primer; NaM template + dCTPT3-TP; NaM template without dCTPT3-TP; (4) unmodified template + dNTPs. (c) Deconvoluted ESI mass spectrum of purified in IVT using DNA1_dNaM template and T7 RNA polymerase giving rCTPT3-full length product with additional non-templated rCTPT3 addition at the 3′-end (10 321 g mol−1; calculated 10 321 g mol−1) and the 2× sodium adduct of this sequence (10 343 g mol−1; calculated 10 344 g mol−1). (d) Deconvoluted ESI mass spectrum of purified primer extension assay using template DNA2_dNaM and KlenTaq DNA polymerase resulting in dCTPT3-full length product (13 190 g mol−1; calculated 13 191 g mol−1). Further, mismatches of dNaM/dA plus non-templated dA (13 472 g mol−1; calculated 13 474 g mol−1), dNaM/dA plus non-templated dG (13 490 g mol−1; calculated 13 490) and the sodium adduct of full-length product with non-templated dT (13 516 g mol−1; calculated 13 518 g mol−1) are observed. Raw data and supplementary data are available in the ESI† (Section S6). | ||
We also investigated DNA primer extension with dCTPT3 TP, dNaM-containing template (DNA2_dNaM) and KlenTaq DNA polymerase (Fig. 2b and d). Equivalent primer extensions were additionally performed with classical dTPT3 TP (see ESI† Section S7.2). Analysis in PAGE showed full length products as well as minor amounts of termination products (Fig. 2b). LC-MS analysis confirmed the full-length product with incorporated dCTPT3 (13
190 g mol−1; calculated 13
191 g mol−1) and its sodium adduct with a non-templated dT (13
516 g mol−1; calculated 13
518 g mol−1). Additional products included dNaM extensions with natural nucleosides dA mismatches and non-templated dA (13
472 g mol−1; calculated 13
474 g mol−1), as well as full-length fragments of mismatch dNaM/dA plus non-templated dG (13
490 g mol−1; calculated 13
490) were found (see ESI† Section S7.1). Similar mismatches between natural nucleobases and unnatural bases have been previously reported.23 These findings align with the in vitro transcription data and confirm that standard DNA polymerases can incorporate CTPT3 TPs into full-length DNA strands (DNA template, primer and promoter sequences: ESI† Section S5).
To evaluate the enhanced acid stability conferred by the C-glycosidic bond, nucleotide rCTPT3 TP was compared with rTPT3 TP under acidic conditions (pH 5, 80 °C, 5 h) (Fig. 3). Prior studies by us have shown that TPT3 derivatives are prone to base loss during dPAGE purification and are susceptible to acid cleavage due to their N-glycosidic linkage.10a–e
At pH 5, both TPs hydrolysed to the monophosphate as expected (Fig. 3). However, for rTPT3 nucleobase fragments were found, indicating cleavage of the N-glycosidic bond. These fragments are not observed in the MS fragmentation pattern of the triphosphate prior to acidic incubation, which confirms that they are generated by degradation under acidic conditions (see ESI† Section S7.4). Quantitative analysis of the UV chromatogram revealed that 11% of TPT3 underwent hydrolysis under the applied conditions (see ESI† Section S7.4). In contrast, rCTPT3 remained intact, confirming its superior stability due to the C-glycosidic linkage.
In summary, we developed efficient synthetic routes to the stable, non-hydrolysable C-glycosides dCTPT3 and rCTPT3 via Heck and Ni-catalysed CC reactions. CTPT3 nucleobase was functionalized and demonstrated compatibility with CC conditions. Both nucleosides were successfully converted into their TPs via enzymatic synthesis, offering moderate to high yields and a simplified alternative to traditional chemical methods. The TPs were enzymatically incorporated into full-length products using in vitro transcription (rCTPT3) and primer extension (dCTPT3) with dNaM-containing templates, confirming polymerase acceptance. While the C-glycosidic linkage provides enhanced stability and modifiability, challenges such as the achievement of a nature-like fidelity for enzymatic amplification of the UBP remain. Our work expands the nucleic acid chemistry toolbox by introducing a robust and versatile C-glycosidic artificial nucleobase as part of the unnatural base pair CTPT3-NaM.
The authors would like to thank Dr Christina Wartmann, Dr Niklas Hermes and Lukas Neu for fruitful discussions. Further, the authors want to thank Sawar Aziz, Moritz Steiner and Jessica Kubis for experimental support. A special thanks to Dr Hannah Depmeier for advice in biochemical methods.
Footnote |
| † Electronic supplementary information (ESI) available: Synthesis routes and analytical data to dCTPT3 TP and rCTPT3 TP, details of biochemical studies, supplementary LC-MS results. See DOI: https://doi.org/10.1039/d5cc02749g |
| This journal is © The Royal Society of Chemistry 2025 |