Issue 44, 2023

Deciphering DNA nucleotide sequences and their rotation dynamics with interpretable machine learning integrated C3N nanopores

Abstract

A solid-state nanopore combined with the quantum transport method has garnered substantial attention and intrigue for DNA sequencing due to its potential for providing rapid and accurate sequencing results, which could have numerous applications in disease diagnosis and personalized medicine. However, the intricate and multifaceted nature of the experimental protocol poses a formidable challenge in attaining precise single nucleotide analysis. Here, we report a machine learning (ML) framework combined with the quantum transport method to accelerate high-throughput single nucleotide recognition with C3N nanopores. The optimized eXtreme Gradient Boosting Regression (XGBR) algorithm has predicted the fingerprint transmission of each unknown nucleotide and their rotation dynamics with root mean square error scores as low as 0.07. Interpretability of ML black box models with the game theory-based SHapley Additive exPlanation method has provided a quasi-explanation for the model working principle and the complex relationship between electrode–nucleotide coupling and transmission. Moreover, a comprehensive ML classification of nucleotides based on binary, ternary, and quaternary combinations shows maximum accuracy and F1 scores of 100%. The results suggest that ML in tandem with a nanopore device can potentially alleviate the experimental hurdles associated with quantum tunneling and facilitate fast and high-precision DNA sequencing.

Graphical abstract: Deciphering DNA nucleotide sequences and their rotation dynamics with interpretable machine learning integrated C3N nanopores

Supplementary files

Article information

Article type
Paper
Submitted
31 Jul 2023
Accepted
19 Oct 2023
First published
19 Oct 2023

Nanoscale, 2023,15, 18080-18092

Deciphering DNA nucleotide sequences and their rotation dynamics with interpretable machine learning integrated C3N nanopores

M. K. Jena, S. Mittal, S. S. Manna and B. Pathak, Nanoscale, 2023, 15, 18080 DOI: 10.1039/D3NR03771A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements