Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence

Elsa Góngora-Castillo; C. Robin Buell

doi:10.1039/C3NP20099J

Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence

Elsa Góngora-Castillo^a and C. Robin Buell*^a

* Corresponding authors

^a Department of Plant Biology and Department of Energy Great Lakes Bioenergy Research Center, Michigan State University, Department of Plant Biology, East Lansing, MI 48824, USA
E-mail: buell@msu.edu

Abstract

Plant natural product research can be facilitated through genome and transcriptome sequencing approaches that generate informative sequence and expression datasets that enable characterization of biochemical pathways of interest. As the overwhelming majority of plant-derived natural products are derived from species with little, if any, sequence and/or genomic resources, the ability to perform whole genome shotgun sequencing and assembly has been and will continue to be transformative as access to a genome sequence provides molecular resources and a context for discovery and characterization of biosynthetic pathways. Due to the reduced size and complexity of the transcriptome relative to the genome, transcriptome sequencing provides a rapid, inexpensive approach to access gene sequences, gene expression abundances, and gene expression patterns in any species, including those that lack a reference genome sequence. To date, successful applications of RNA sequencing in conjunction with de novo transcriptome assembly has enabled identification of new genes in an array of biochemical pathways in plants. While sequencing technologies are well developed, challenges remain in the handling and analysis of transcriptome sequences. In this Highlight article, we provide an overview of the bioinformatics challenges associated with transcriptome analyses using short read sequences and how to address these issues in plant species that lack a reference genome.

This article is part of the themed collection: Modern Methods in Plant Natural Products

Article information

https://doi.org/10.1039/C3NP20099J

Article type

Highlight

Submitted

02 Oct 2012

First published

01 Feb 2013

Download Citation

Nat. Prod. Rep., 2013,30, 490-500

Permissions

Request permissions

Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence

E. Góngora-Castillo and C. R. Buell, Nat. Prod. Rep., 2013, 30, 490 DOI: 10.1039/C3NP20099J

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Natural Product Reports

Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence

Abstract

Article information

Download Citation

Permissions

Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence

Social activity

Search articles by author

Spotlight

Advertisements