Issue 2, 2016

HydPred: a novel method for the identification of protein hydroxylation sites that reveals new insights into human inherited disease

Abstract

The disruption of protein hydroxylation is highly associated with several serious diseases and consequently the identification of protein hydroxylation sites has attracted significant attention recently. Here, we report the development of an improved method, called HydPred, to identify protein hydroxylation sites (hydroxyproline and hydroxylysine) based on the synthetic minority over-sampling technique (SMOTE), the random forest (RF) algorithm and four blocks of newly composed features that are derived from the protein primary sequence. The HydPred method achieved the best prediction performance reported until now with Matthew's correlation coefficient values of 0.770 and 0.857 for hydroxyproline and hydroxylysine, respectively, according to jack-knife cross-validation. This represents an improvement of 8% for hydroxyproline and 19% for hydroxylysine compared to the best results of available predictors. The prediction performance of HydPred for the external validation of hydroxyproline and hydroxylysine was also improved compared with other published methods. We subsequently applied HydPred to study the association of disruption of hydroxylation sites with human inherited disease. The analyses suggested that the loss of hydroxylation sites is more likely to cause disease instead of the gain of hydroxylation sites and 52 different human inherited diseases were found to be highly associated with the loss of hydroxylation sites. Therefore, HydPred represents a new strategy to discover the molecular basis of pathogenesis associated with abnormal hydroxylation. HydPred is now available online as a user-friendly web server at http://lishuyan.lzu.edu.cn/hydpred/.

Graphical abstract: HydPred: a novel method for the identification of protein hydroxylation sites that reveals new insights into human inherited disease

Supplementary files

Article information

Article type
Paper
Submitted
10 Oct 2015
Accepted
26 Nov 2015
First published
26 Nov 2015

Mol. BioSyst., 2016,12, 490-498

Author version available

HydPred: a novel method for the identification of protein hydroxylation sites that reveals new insights into human inherited disease

S. Li, J. Lu, J. Li, X. Chen, X. Yao and L. Xi, Mol. BioSyst., 2016, 12, 490 DOI: 10.1039/C5MB00681C

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Spotlight

Advertisements