Accurate prediction of chemical shifts for aqueous protein structure on “Real World” data

Jie Li; Kochise C. Bennett; Yuchen Liu; Michael V. Martin; Teresa Head-Gordon

doi:10.1039/C9SC06561J

You do not have JavaScript enabled. Please enable JavaScript to access the full features of the site or access our non-JavaScript page.

Accurate prediction of chemical shifts for aqueous protein structure on “Real World” data†

Jie Li,

^ab Kochise C. Bennett,^ab Yuchen Liu,^ab Michael V. Martin^c and Teresa Head-Gordon

*^abcd

Author affiliations

* Corresponding authors

^a Pitzer Center for Theoretical Chemistry, University of California, Berkeley, CA 94720, USA
E-mail: thg@berkeley.edu

^b Department of Chemistry, University of California, Berkeley, CA 94720, USA

^c Department of Bioengineering, University of California, Berkeley, CA 94720, USA

^d Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA 94720, USA

Abstract

Here we report a new machine learning algorithm for protein chemical shift prediction that outperforms existing chemical shift calculators on realistic data that is not heavily curated, nor eliminates test predictions ad hoc. Our UCBShift predictor implements two modules: a transfer prediction module that employs both sequence and structural alignment to select reference candidates for experimental chemical shift replication, and a redesigned machine learning module based on random forest regression which utilizes more, and more carefully curated, feature extracted data. When combined together, this new predictor achieves state-of-the-art accuracy for predicting chemical shifts on a randomly selected dataset without careful curation, with root-mean-square errors of 0.31 ppm for amide hydrogens, 0.19 ppm for Hα, 0.84 ppm for C′, 0.81 ppm for Cα, 1.00 ppm for Cβ, and 1.81 ppm for N. When similar sequences or structurally related proteins are available, UCBShift shows superior native state selection from misfolded decoy sets compared to SPARTA+ and SHIFTX2, and even without homology we exceed current prediction accuracy of all other popular chemical shift predictors.

This article is part of the themed collections: 2020 Chemical Science HOT Article Collection and Accelerating Chemistry Symposium Collection

Download options Please wait...

Supplementary files

Supplementary information PDF (1459K)

Article information

DOI: https://doi.org/10.1039/C9SC06561J
Article type: Edge Article
Submitted: 29 Ker. 2019
Accepted: 02 Meur. 2020
First published: 03 Meur. 2020
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry

Download Citation

Chem. Sci., 2020,11, 3180-3191

Permissions

Request permissions

Accurate prediction of chemical shifts for aqueous protein structure on “Real World” data

J. Li, K. C. Bennett, Y. Liu, M. V. Martin and T. Head-Gordon, Chem. Sci., 2020, 11, 3180 DOI: 10.1039/C9SC06561J

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Social activity

Fetching data from CrossRef.
This may take some time to load.

Chemical Science

Accurate prediction of chemical shifts for aqueous protein structure on “Real World” data†

Abstract

Supplementary files

Article information

Download Citation

Permissions

Accurate prediction of chemical shifts for aqueous protein structure on “Real World” data

Social activity

Search articles by author

Spotlight

Advertisements