
Issue 1, 2013

De novo prediction of RNA–protein interactions from sequence information


Protein–RNA interactions are fundamental to understanding cellular processes. In particular, non-coding RNA–protein interactions play an important role in facilitating biological functions in signalling, transcriptional regulation, and even the progression of complex diseases. However, experimental determination of protein–RNA interactions remains time-consuming and labour-intensive. Here, we develop a novel extended naïve Bayes classifier for de novo prediction of protein–RNA interactions using only protein and RNA sequence information. Specifically, we first collect a set of known protein–RNA interactions as gold-standard positives and extract sequence-based features to represent each protein–RNA pair. To bridge the gap between the high-dimensional feature space and the scarcity of gold-standard positives, we select effective features by applying a cutoff to a likelihood ratio score, which not only reduces the computational complexity but also allows transparent feature integration during prediction. An extended naïve Bayes classifier is then constructed using these effective features to train a protein–RNA interaction prediction model. Numerical experiments show that our method achieves a prediction accuracy of 0.77 even though only a small amount of protein–RNA interaction data is available. In particular, we demonstrate that the extended naïve Bayes classifier outperforms the naïve Bayes classifier because it fully considers the dependences among features. Importantly, we conduct ncRNA pull-down experiments to validate the predicted novel protein–RNA interactions and identify the interacting proteins of sbRNA CeN72 in C. elegans, which further demonstrates the effectiveness of our method.
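
The feature-selection and scoring idea summarised in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes binary (present/absent) features per protein–RNA pair, add-one smoothing, and a plain naïve Bayes score that sums log likelihood ratios. The extended classifier's handling of dependences among features is not reproduced here, and the feature names, the toy data, and the cutoff value are all hypothetical.

```python
import math
from collections import Counter


def likelihood_ratios(positives, negatives, pseudocount=1.0):
    """Return {feature: LR}, with LR = P(feature | positive) / P(feature | negative).

    Each example is a set of feature names present in one protein-RNA pair.
    The pseudocount (an assumption of this sketch) avoids division by zero.
    """
    pos_counts = Counter(f for pair in positives for f in set(pair))
    neg_counts = Counter(f for pair in negatives for f in set(pair))
    ratios = {}
    for f in set(pos_counts) | set(neg_counts):
        p_pos = (pos_counts[f] + pseudocount) / (len(positives) + 2 * pseudocount)
        p_neg = (neg_counts[f] + pseudocount) / (len(negatives) + 2 * pseudocount)
        ratios[f] = p_pos / p_neg
    return ratios


def select_features(ratios, cutoff):
    """Keep only features whose likelihood ratio reaches the cutoff."""
    return {f: lr for f, lr in ratios.items() if lr >= cutoff}


def naive_bayes_score(pair_features, selected):
    """Sum of log likelihood ratios over selected features present in the pair.

    Assumes conditional independence of features (plain naive Bayes).
    """
    return sum(math.log(selected[f]) for f in set(pair_features) if f in selected)


if __name__ == "__main__":
    # Hypothetical toy features, e.g. short sequence motifs from each molecule.
    positives = [{"prot:KR", "rna:GGA"}, {"prot:KR", "rna:UUU"}, {"prot:KR"}]
    negatives = [{"rna:UUU"}, {"rna:GGA", "rna:UUU"}, {"rna:UUU"}]
    selected = select_features(likelihood_ratios(positives, negatives), cutoff=2.0)
    print(sorted(selected))
    print(naive_bayes_score({"prot:KR", "rna:UUU"}, selected))
```

Because each selected feature contributes its own log likelihood ratio to the total, the contribution of every feature to a prediction can be read off directly, which is one way to interpret the "transparent feature integration" mentioned above.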

Publication details

The article was received on 21 Jul 2012, accepted on 10 Oct 2012 and first published on 11 Oct 2012

Article type: Paper
DOI: 10.1039/C2MB25292A
Mol. BioSyst., 2013, 9, 133-142

    De novo prediction of RNA–protein interactions from sequence information

    Y. Wang, X. Chen, Z. Liu, Q. Huang, Y. Wang, D. Xu, X. Zhang, R. Chen and L. Chen, Mol. BioSyst., 2013, 9, 133
    DOI: 10.1039/C2MB25292A