TransG4: an interpretable deep-learning approach for sequence-based G-quadruplex prediction

Yongna Yuan; Yaojie Tian; Zhenyu Liu; Jun Ma

doi:10.1039/D6CP00173D

TransG4: an interpretable deep-learning approach for sequence-based G-quadruplex prediction

Yongna Yuan,

*^a Yaojie Tian,^a Zhenyu Liu^a and Jun Ma^a

Author affiliations

* Corresponding authors

^a School of Information Science & Engineering, Lanzhou University, South Tianshui Road, Lanzhou 730000, Gansu, China
E-mail: yuanyn@lzu.edu.cn

Abstract

G-quadruplexes (G4) are non-canonical nucleic acid secondary structures formed in guanine-rich regions and have been shown to regulate diverse cellular processes such as gene expression, DNA replication, and telomere maintenance, with increasing evidence linking G4 to cancer and other human diseases. G4 predominantly emerge in guanine-rich regions and are implicated in a spectrum of molecular interactions and disease phenotypes, thus researchers are interested in the formation of G4. However, predicting the formation of G4 from nucleotide sequences is a persistent problem. Existing computational tools for G4 prediction are either rule-based on domain knowledge or rely on a single neural network model like a convolutional neural network (CNN), which lacks interpretability and struggles to capture long-range dependencies among bases. Here, we introduce TransG4, a novel neural network architecture that integrates a CNN, a transformer, and bidirectional gated recurrent units (BiGRUs) to identify potential G4 structures. TransG4 demonstrates strong predictive performance on both G4-seq and rG4-seq datasets, accurately predicting DNA mismatch scores and consistently outperforming existing methods in RNA RSR-ratio prediction. Attention-based interpretations further show that TransG4 captures biologically meaningful motifs consistent with canonical G4 structures, providing an interpretable and generalizable framework and representing a novel and impactful contribution to sequence-based G4 propensity prediction.

Supplementary files

Article information

DOI: https://doi.org/10.1039/D6CP00173D
Article type: Paper
Submitted: 18 Jan 2026
Accepted: 03 Mar 2026
First published: 25 Mar 2026

Download Citation

Phys. Chem. Chem. Phys., 2026, Advance Article

Permissions

Request permissions

TransG4: an interpretable deep-learning approach for sequence-based G-quadruplex prediction

Y. Yuan, Y. Tian, Z. Liu and J. Ma, Phys. Chem. Chem. Phys., 2026, Advance Article , DOI: 10.1039/D6CP00173D

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Physical Chemistry Chemical Physics

TransG4: an interpretable deep-learning approach for sequence-based G-quadruplex prediction

Abstract

Supplementary files

Article information

Download Citation

Permissions

TransG4: an interpretable deep-learning approach for sequence-based G-quadruplex prediction

Social activity

Search articles by author

Spotlight

Advertisements