Jump to main content
Jump to site search
Access to RSC content Close the message box

Continue to access RSC content when you are not at your institution. Follow our step-by-step guide.


Issue 1, 2012
Previous Article Next Article

Uncertainty analysis in protein disorder prediction

Author affiliations

Abstract

A grand challenge in the proteomics and structural genomics era is the prediction of protein structure, including identification of those proteins that are partially or wholly unstructured. A number of predictors for identification of intrinsically disordered proteins (IDPs) have been developed over the last decade, but none can be taken as a fully reliable on its own. Using a single model for prediction is typically inadequate because prediction based on only the most accurate model ignores model uncertainty. In this paper, we present an empirical method to specify and measure uncertainty associated with disorder predictions. In particular, we analyze the uncertainty in the reference model itself and the uncertainty in data. This is achieved by training a set of models and developing several meta predictors on top of them. The best meta predictor achieved comparable or better results than any other single model, suggesting that incorporating different aspects of protein disorder prediction is important for the disorder prediction task. In addition, the best meta-predictor had more balanced sensitivity and specificity than any individual model. We also assessed the effects of changes in disorder prediction as a function of changes in the protein sequence. For collections of homologous sequences, we found that mutations caused many of the predicted disordered residues to be flipped to be predicted as ordered residues, while the reverse was observed much less frequently. These results suggest that disorder tendencies are more sensitive to allowed mutations than structure tendencies and the conservation of disorder is indeed less stable than conservation of structure. Availability: five meta-predictors and four single models developed for this study will be publicly freely accessible for non-commercial use.

Graphical abstract: Uncertainty analysis in protein disorder prediction

Back to tab navigation

Article information


Submitted
16 Sep 2011
Accepted
14 Oct 2011
First published
21 Nov 2011

Mol. BioSyst., 2012,8, 381-391
Article type
Paper

Uncertainty analysis in protein disorder prediction

M. F. Ghalwash, A. K. Dunker and Z. Obradović, Mol. BioSyst., 2012, 8, 381
DOI: 10.1039/C1MB05373F

Search articles by author

Spotlight

Advertisements