Issue 12, 2014

Predicting protein–protein interactions between human and hepatitis C virus via an ensemble learning method

Abstract

An estimated 170 million people, approximately 3% of the world population, are chronically infected with the hepatitis C virus (HCV). More than 350 000 deaths are reported annually, which are caused by HCV. HCV, similar to a variety of viruses, causes disease in humans by altering protein–protein interactions within the host cells. Experimental approaches for the detection of host–virus PPIs have many inherent limitations. Computational approaches to predict these interactions are therefore of significant importance. While many studies have been developed to predict intra-species PPIs in the last decade, predictions on inter-species PPIs such as human–HCV PPIs are rare. In this study, we developed an ensemble learning method to predict PPIs between human and HCV proteins. Our model utilises four well-established diverse learners as base classifiers including random forest (RF), Naïve Bayes (NB), support vector machine (SVM) and multilayer perceptron (MLP). In addition, an MLP was used as a meta-learner to combine base learners' predictions to provide the final prediction. To encode human and HCV proteins as feature vectors, we used six different descriptors as follows: amino acid composition (ACC), pseudo amino acid composition (PAC), evolutionary information feature, network centrality measures, tissue information and post-translational modification information. To assess the prediction power of the proposed method, we assembled a benchmark dataset composed of confident positive and negative PPIs. In a 10-fold cross-validation experiment, our prediction method achieved accuracy and specificity as high as 83% and 94%, respectively. Furthermore, in an independent test set the proposed method achieved an accuracy of 84% and a specificity of 92%. When compared with the existing method, our method showed a better performance. These results revealed that our method is suitable for performing PPI prediction in a host–pathogen context.

Graphical abstract: Predicting protein–protein interactions between human and hepatitis C virus via an ensemble learning method

Supplementary files

Article information

Article type
Paper
Submitted
13 Jul 2014
Accepted
02 Sep 2014
First published
12 Sep 2014

Mol. BioSyst., 2014,10, 3147-3154

Author version available

Predicting protein–protein interactions between human and hepatitis C virus via an ensemble learning method

A. Emamjomeh, B. Goliaei, J. Zahiri and R. Ebrahimpour, Mol. BioSyst., 2014, 10, 3147 DOI: 10.1039/C4MB00410H

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Spotlight

Advertisements