Issue 2, 2015

Predicting the subcellular localization of mycobacterial proteins by incorporating the optimal tripeptides into the general form of pseudo amino acid composition

Abstract

Mycobacterium tuberculosis is a bacterium that causes tuberculosis, one of the most prevalent infectious diseases. Predicting the subcellular localization of mycobacterial proteins in this bacterium may provide vital clues for the prediction of protein function as well as for drug discovery and design. Therefore, a computational method that can predict the subcellular localization of mycobacterial proteins with high precision is highly desirable. We propose a computational method to predict the subcellular localization of mycobacterial proteins. An objective and strict benchmark dataset was constructed after collecting 272 non-redundant proteins from the universal protein resource (the UniProt database). Subsequently, a novel feature selection strategy based on binomial distribution was used to optimize the feature vector. Finally, a subset containing 219 chosen tripeptide features was imported into a support vector machine-based method to estimate the performance of the dataset in accurately and sensitively identifying these proteins. We found that the proposed method gave a maximum overall accuracy of 89.71% with an average accuracy of 81.12% in the jackknife cross-validation. The results indicate that our prediction method gave an efficient and powerful performance when compared with other published methods. We made the proposed method available on a purpose built Web server called MycoSub that is freely accessible at http://lin.uestc.edu.cn/server/MycoSub. We anticipate that MycoSub will become a useful tool for studying the functions of mycobacterial proteins and for designing and developing anti-mycobacterium drugs.

Graphical abstract: Predicting the subcellular localization of mycobacterial proteins by incorporating the optimal tripeptides into the general form of pseudo amino acid composition

Article information

Article type
Paper
Submitted
31 Oct 2014
Accepted
18 Nov 2014
First published
18 Nov 2014

Mol. BioSyst., 2015,11, 558-563

Author version available

Predicting the subcellular localization of mycobacterial proteins by incorporating the optimal tripeptides into the general form of pseudo amino acid composition

P. Zhu, W. Li, Z. Zhong, E. Deng, H. Ding, W. Chen and H. Lin, Mol. BioSyst., 2015, 11, 558 DOI: 10.1039/C4MB00645C

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Spotlight

Advertisements