DEEPScreen: high performance drug–target interaction prediction with convolutional neural networks using 2-D structural compound representations

Ahmet Sureyya Rifaioglu; Esra Nalbat; Volkan Atalay; Maria Jesus Martin; Rengul Cetin-Atalay; Tunca Doğan

doi:10.1039/C9SC03414E

DEEPScreen: high performance drug–target interaction prediction with convolutional neural networks using 2-D structural compound representations†

Ahmet Sureyya Rifaioglu,

^abc Esra Nalbat,

^c Volkan Atalay,

*^ac Maria Jesus Martin,

^d Rengul Cetin-Atalay

^ce and Tunca Doğan

*^fg

Author affiliations

* Corresponding authors

^a Department of Computer Engineering, METU, Ankara, Turkey
E-mail: vatalay@metu.edu.tr
Tel: +903122105576

^b Department of Computer Engineering, İskenderun Technical University, Hatay, Turkey

^c KanSiL, Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey

^d European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK

^e Section of Pulmonary and Critical Care Medicine, The University of Chicago, Chicago, IL 60637, USA

^f Department of Computer Engineering, Hacettepe University, Ankara, Turkey
E-mail: tuncadogan@hacettepe.edu.tr
Tel: +903122977193/117

^g Institute of Informatics, Hacettepe University, Ankara, Turkey

Abstract

The identification of physical interactions between drug candidate compounds and target biomolecules is an important process in drug discovery. Since conventional screening procedures are expensive and time consuming, computational approaches are employed to provide aid by automatically predicting novel drug–target interactions (DTIs). In this study, we propose a large-scale DTI prediction system, DEEPScreen, for early stage drug discovery, using deep convolutional neural networks. One of the main advantages of DEEPScreen is employing readily available 2-D structural representations of compounds at the input level instead of conventional descriptors that display limited performance. DEEPScreen learns complex features inherently from the 2-D representations, thus producing highly accurate predictions. The DEEPScreen system was trained for 704 target proteins (using curated bioactivity data) and finalized with rigorous hyper-parameter optimization tests. We compared the performance of DEEPScreen against the state-of-the-art on multiple benchmark datasets to indicate the effectiveness of the proposed approach and verified selected novel predictions through molecular docking analysis and literature-based validation. Finally, JAK proteins that were predicted by DEEPScreen as new targets of a well-known drug cladribine were experimentally demonstrated in vitro on cancer cells through STAT3 phosphorylation, which is the downstream effector protein. The DEEPScreen system can be exploited in the fields of drug discovery and repurposing for in silico screening of the chemogenomic space, to provide novel DTIs which can be experimentally pursued. The source code, trained "ready-to-use" prediction models, all datasets and the results of this study are available at https://github.com/cansyl/DEEPscreen.

This article is part of the themed collection: Computational protein design and structure prediction: Celebrating the 2024 Nobel Prize in Chemistry

Chemical Science

DEEPScreen: high performance drug–target interaction prediction with convolutional neural networks using 2-D structural compound representations†

Abstract

Supplementary files

Article information

Download Citation

Permissions

DEEPScreen: high performance drug–target interaction prediction with convolutional neural networks using 2-D structural compound representations

Social activity

Search articles by author

Spotlight

Advertisements