Issue 1, 2013

Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection

Abstract

Identification of catalytic residues plays a key role in understanding how enzymes work. Although numerous computational methods have been developed to predict catalytic residues and active sites, the prediction accuracy remains relatively low with high false positives. In this work, we developed a novel predictor based on the Random Forest algorithm (RF) aided by the maximum relevance minimum redundancy (mRMR) method and incremental feature selection (IFS). We incorporated features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure and solvent accessibility to predict active sites of enzymes and achieved an overall accuracy of 0.885687 and MCC of 0.689226 on an independent test dataset. Feature analysis showed that every category of the features except disorder contributed to the identification of active sites. It was also shown via the site-specific feature analysis that the features derived from the active site itself contributed most to the active site determination. Our prediction method may become a useful tool for identifying the active sites and the key features identified by the paper may provide valuable insights into the mechanism of catalysis.

Graphical abstract: Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection

Supplementary files

Article information

Article type
Paper
Submitted
10 Aug 2012
Accepted
02 Oct 2012
First published
15 Oct 2012

Mol. BioSyst., 2013,9, 61-69

Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection

Y. Gao, B. Li, Y. Cai, K. Feng, Z. Li and Y. Jiang, Mol. BioSyst., 2013, 9, 61 DOI: 10.1039/C2MB25327E

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Spotlight

Advertisements