Predicting the binding affinities of compound–protein interactions by random forest using network topology features

Zhanchao Li; Yang Wang; Yun Xie; Li Zhang; Zong Dai; Xiaoyong Zou

doi:10.1039/C8AY01396A

Predicting the binding affinities of compound–protein interactions by random forest using network topology features†

Zhanchao Li,

*^ac Yang Wang,^b Yun Xie,^a Li Zhang,^b Zong Dai^b and Xiaoyong Zou

*^b

Author affiliations

* Corresponding authors

^a School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou, People's Republic of China
E-mail: zhanchao8052@gdpu.edu.cn
Fax: +86 760 88207939
Tel: +86 760 88207939

^b School of Chemistry, Sun Yat-Sen University, Guangzhou, People's Republic of China
E-mail: ceszxy@mail.sysu.edu.cn
Fax: +86 20 84112245
Tel: +86 20 84114919

^c Key Laboratory of Digital Quality Evaluation of Chinese Materia Medica of State Administration of Traditional Chinese Medicine, Guangzhou, People's Republic of China

Abstract

The identification of the binding affinity between a compound and a protein is of extraordinary significance to modern pharmacology and drug discovery. Despite the advances in experimental technology, the determination of binding affinity at the proteome scale is still expensive, laborious and time-consuming. Therefore, there is a strong desire for the development of a novel theoretical method for identifying the binding affinity of a compound and protein. A comprehensive node- and edge-weighted network is constructed comprising three subnetworks, namely compound–compound similarity, protein–protein interactions and compound–protein interactions. Based on the graph theory, some novel network topological features are proposed to characterize compound–protein interactions, and random forest is utilized to construct a model for predicting the binding affinity of each interaction. The Spearman and Pearson correlation coefficients of 0.8547 and 0.8779 as well as the root mean square error of 0.8638 are obtained, indicating the effectiveness of the developed method. A total of 2102 potential chemical–protein interactions are identified associated with diseases, such as aromatase excess syndrome and immunodeficiency autosomal recessive. It is anticipated that the proposed method may become a powerful high-throughput virtual screening tool for drug research and development.

Supplementary files

Article information

DOI: https://doi.org/10.1039/C8AY01396A
Article type: Paper
Submitted: 23 Jun 2018
Accepted: 05 Aug 2018
First published: 06 Aug 2018

Download Citation

Anal. Methods, 2018,10, 4152-4161

Permissions

Request permissions

Predicting the binding affinities of compound–protein interactions by random forest using network topology features

Z. Li, Y. Wang, Y. Xie, L. Zhang, Z. Dai and X. Zou, Anal. Methods, 2018, 10, 4152 DOI: 10.1039/C8AY01396A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Analytical Methods

Predicting the binding affinities of compound–protein interactions by random forest using network topology features†

Abstract

Supplementary files

Article information

Download Citation

Permissions

Predicting the binding affinities of compound–protein interactions by random forest using network topology features

Social activity

Search articles by author

Spotlight

Advertisements