Challenges for error-correction coding in DNA data storage: photolithographic synthesis and DNA decay

Andreas L. Gimpel; Wendelin J. Stark; Reinhard Heckel; Robert N. Grass

doi:10.1039/D4DD00220B

Andreas L. Gimpel,^a Wendelin J. Stark,^a Reinhard Heckel^b and Robert N. Grass*^a

Author affiliations

* Corresponding authors

^a Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 1-5, Zürich, Switzerland
E-mail: robert.grass@chem.ethz.ch

^b TUM School of Computation, Information and Technology, Technical University of Munich, Arcisstrasse 21, Munich 80333, Germany

Abstract

Efficient error-correction codes are crucial for realizing DNA's potential as a long-lasting, high-density storage medium for digital data. At the same time, new workflows promising low-cost, resilient DNA data storage are challenging their design and error-correcting capabilities. This study characterizes the errors and biases in two new additions to the state-of-the-art workflow in DNA data storage: photolithographic synthesis and DNA decay. Photolithographic synthesis offers low-cost, scalable oligonucleotide synthesis but suffers from high error rates, necessitating sophisticated error-correction schemes, for example codes introducing within-sequence redundancy combined with clustering and alignment techniques for retrieval. On the other hand, the decoding of oligo fragments after DNA decay promises unprecedented storage densities, but complicates data recovery by requiring the reassembly of full-length sequences or the use of partial sequences for decoding. Our analysis provides a detailed account of the error patterns and biases present in photolithographic synthesis and DNA decay, and identifies considerable bias stemming from sequencing workflows. We implement our findings into a digital twin of the two workflows, offering a tool for developing error-correction codes and providing benchmarks for the evaluation of codec performance.
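As a rough illustration of the two error processes named in the abstract, the following Python sketch models error-prone synthesis as independent per-base substitutions, deletions, and insertions, and DNA decay as independent strand breaks. It is not the authors' digital twin: the function names, the independence assumption, and all rates are hypothetical placeholders chosen for illustration, not values reported in the paper.

```python
import random

BASES = "ACGT"


def noisy_synthesis(seq, p_sub, p_del, p_ins, rng):
    """Toy synthesis channel (assumption: errors are independent per base).

    Each base is deleted with probability p_del, substituted with
    probability p_sub, and followed by a random inserted base with
    probability p_ins.
    """
    out = []
    for base in seq:
        r = rng.random()
        if r < p_del:
            continue  # base is missing from the synthesized strand
        if r < p_del + p_sub:
            # replace with one of the three other bases
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
        if rng.random() < p_ins:
            out.append(rng.choice(BASES))
    return "".join(out)


def decay_fragments(seq, p_break, rng):
    """Toy decay model (assumption: each bond breaks independently).

    Returns the list of fragments left after cutting the strand
    between adjacent bases with probability p_break per position.
    """
    fragments, start = [], 0
    for i in range(1, len(seq)):
        if rng.random() < p_break:
            fragments.append(seq[start:i])
            start = i
    fragments.append(seq[start:])
    return fragments


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed for repeatability
    design = "".join(rng.choice(BASES) for _ in range(60))
    # Placeholder rates, not measurements from the paper.
    read = noisy_synthesis(design, p_sub=0.02, p_del=0.05, p_ins=0.01, rng=rng)
    print(len(design), len(read))
    print(decay_fragments(design, p_break=0.05, rng=rng))
```

A codec for these workflows has to cope with both effects: reads whose length differs from the design because of insertions and deletions, and fragments that must be reassembled or decoded as partial sequences.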

Supplementary files

Supplementary information PDF (1117K)

Article information

DOI: https://doi.org/10.1039/D4DD00220B
Article type: Paper
Submitted: 05 Jul 2024
Accepted: 17 Oct 2024
First published: 18 Oct 2024
This article is Open Access

Digital Discovery, 2024, 3, 2497-2508

A. L. Gimpel, W. J. Stark, R. Heckel and R. N. Grass, Digital Discovery, 2024, 3, 2497 DOI: 10.1039/D4DD00220B

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.
