Issue 12, 2014

piRNA identification based on motif discovery

Abstract

Piwi-interacting RNA (piRNA) is a class of small non-coding RNAs about 24 to 32 nucleotides long, associated with PIWI proteins, which are involved in germline development, transposon silencing, and epigenetic regulation. Identification of piRNA loci on the genome is very useful for further studies in the biogenesis and function of piRNAs. To accomplish this, we applied the computational biology tool Teiresias to identify motifs of variable length appearing frequently in mouse piRNA and non-piRNA sequences, respectively, and then proposed an algorithm for piRNA identification based on motif discovery, termed “Pibomd” by using these sequence motifs as features in the Support Vector Machine (SVM) algorithm, a sensitivity of 91.48% and a specificity of 89.76% on a mouse test dataset could be achieved, much better results than those reported in previously published algorithms. We also trained an unbalanced SVM classifier (named as “Asym-Pibomd”) that provided a higher specificity (96.2%) and a lower sensitivity (72.68%) than Pibomd. Inspite of the predicted ACC being less than that of Pibomd, the predicted ACC (84.44%) of Asym-Pibomd is about ten percent more than that obtained using the k-mer method. Further analysis of the motif positions on the piRNA sequences showed that the piRNA sequences may contain information at the 5′- and/or 3′-end recognized by the piRNA processing apparatus of actual piRNA precursors. Furthermore, this prediction method can be found on a user-friendly web server found at http://app.aporc.org/Pibomd/.

Graphical abstract: piRNA identification based on motif discovery

Supplementary files

Article information

Article type
Method
Submitted
29 Jul 2014
Accepted
29 Aug 2014
First published
29 Aug 2014

Mol. BioSyst., 2014,10, 3075-3080

Author version available

piRNA identification based on motif discovery

X. Liu, J. Ding and F. Gong, Mol. BioSyst., 2014, 10, 3075 DOI: 10.1039/C4MB00447G

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Spotlight

Advertisements