SLAB: simultaneous labeling and binding affinity prediction for protein–ligand structures

Abstract

Machine learning models are often used as scoring functions to predict the binding affinity of a protein–ligand complex. These models are trained with limited amounts of data with experimentally measured binding affinity values. A large number of compounds are labeled inactive through single-concentration screens without measuring binding affinities. These inactive compounds, along with the active ones, can be used to train binary classification models, while regression models are trained using compounds with binding affinities only. However, the classification and regression tasks are often handled separately, without sharing the learned feature representations. In this paper, we propose a novel model architecture that jointly performs regression and classification objectives, aiming to maximize data utilization and improve predictive performance by leveraging two complementary tasks. In our setup, the regression yields the binding affinity, whereas the classification task yields the label as active or inactive. We demonstrate our method using PDBbind, the standard 3D structure database, as well as a dataset of flavivirus protease compounds with binding affinity data. Our experiments show that the new joint training strategy improves the accuracy of the model, increasing applicability in various practical drug screening scenarios.

Graphical abstract: SLAB: simultaneous labeling and binding affinity prediction for protein–ligand structures

Supplementary files

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Paper
Submitted
03 Jun 2025
Accepted
13 Nov 2025
First published
03 Dec 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2026, Advance Article

SLAB: simultaneous labeling and binding affinity prediction for protein–ligand structures

A. Ranganath, H. Kim, H. Shim and J. E. Allen, Digital Discovery, 2026, Advance Article , DOI: 10.1039/D5DD00248F

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements