
Issue 1, 2012

Uncertainty analysis in protein disorder prediction

Abstract

A grand challenge in the proteomics and structural genomics era is the prediction of protein structure, including the identification of proteins that are partially or wholly unstructured. A number of predictors for identifying intrinsically disordered proteins (IDPs) have been developed over the last decade, but none can be taken as fully reliable on its own. Using a single model for prediction is typically inadequate because prediction based on only the most accurate model ignores model uncertainty. In this paper, we present an empirical method to specify and measure the uncertainty associated with disorder predictions. In particular, we analyze the uncertainty in the reference model itself and the uncertainty in the data. This is achieved by training a set of models and developing several meta-predictors on top of them. The best meta-predictor achieved results comparable to or better than those of any single model, suggesting that incorporating different aspects of protein disorder prediction is important for the disorder prediction task. In addition, the best meta-predictor had more balanced sensitivity and specificity than any individual model. We also assessed how disorder predictions change as the protein sequence changes. For collections of homologous sequences, we found that mutations caused many residues predicted as disordered to be predicted as ordered, while the reverse was observed much less frequently. These results suggest that disorder tendencies are more sensitive to allowed mutations than structure tendencies and that the conservation of disorder is indeed less stable than the conservation of structure. Availability: five meta-predictors and four single models developed for this study will be freely and publicly accessible for non-commercial use.
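The abstract does not say how the meta-predictors combine the underlying models, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes each model emits a binary per-residue call (1 = disordered, 0 = ordered), substitutes simple majority voting for the meta-predictor, uses the fraction of dissenting models as a rough stand-in for model uncertainty, and counts prediction flips between two position-aligned homologs. All function names and the toy data are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple


def majority_vote(predictions: Sequence[Sequence[int]]) -> List[int]:
    """Combine per-residue binary calls from several models by majority vote.

    Assumption: 1 = disordered, 0 = ordered. Ties resolve to ordered (0).
    """
    n_models = len(predictions)
    return [1 if 2 * sum(column) > n_models else 0 for column in zip(*predictions)]


def disagreement(predictions: Sequence[Sequence[int]]) -> List[float]:
    """Fraction of models in the minority at each residue (0.0 = unanimous)."""
    n_models = len(predictions)
    scores = []
    for column in zip(*predictions):
        votes = sum(column)
        scores.append(min(votes, n_models - votes) / n_models)
    return scores


def sensitivity_specificity(predicted: Sequence[int],
                            actual: Sequence[int]) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p == 1 and a == 1)
    fn = sum(1 for p, a in pairs if p == 0 and a == 1)
    tn = sum(1 for p, a in pairs if p == 0 and a == 0)
    fp = sum(1 for p, a in pairs if p == 1 and a == 0)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


def count_flips(reference: Sequence[int], homolog: Sequence[int]) -> Dict[str, int]:
    """Count predicted-state changes between two position-aligned sequences."""
    pairs = list(zip(reference, homolog))
    return {
        "disorder_to_order": sum(1 for r, h in pairs if r == 1 and h == 0),
        "order_to_disorder": sum(1 for r, h in pairs if r == 0 and h == 1),
    }


# Toy data (made up for illustration): three models, five residues.
model_outputs = [
    [1, 1, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 1, 0, 0],
]
labels = [1, 1, 0, 0, 0]

consensus = majority_vote(model_outputs)
uncertainty = disagreement(model_outputs)
sens, spec = sensitivity_specificity(consensus, labels)
flips = count_flips(consensus, [1, 0, 0, 0, 0])
```

Residues where `uncertainty` is high are those on which the individual models split, which is one simple way to flag low-confidence positions. Comparing `sens` and `spec` for the consensus against each single model is the kind of balance check the abstract refers to, and the two counts in `flips` correspond to the disorder-to-order versus order-to-disorder asymmetry it reports.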

Publication details

The article was received on 16 Sep 2011, accepted on 14 Oct 2011 and first published on 21 Nov 2011


Article type: Paper
DOI: 10.1039/C1MB05373F
Citation: Mol. BioSyst., 2012, 8, 381-391

    Uncertainty analysis in protein disorder prediction

    M. F. Ghalwash, A. K. Dunker and Z. Obradović, Mol. BioSyst., 2012, 8, 381
    DOI: 10.1039/C1MB05373F
