Synthesis and Machine Learning Techniques to Enable Data-Driven Investigation of Supramolecular Host-Guest Interactions

Abstract

The availability of large datasets such as the Protein DataBank and ChEMBL have allowed for rapid progress in developing machine learning tools for predicting the biological activity of organic small molecules. The binding between supramolecular hosts and their desired guests are governed by the same forces that drive protein-small molecule interactions, and yet this field has seen dramatically less application of machine learning. In this contribution, we demonstrate that the production of easily diversified building blocks can allow a single laboratory to generate a dataset that is sufficient to engage with modern machine learning approaches. A range of methods were evaluated against our single-laboratory dataset, with a graph neural network featuring an attention mechanism providing meaningful performance in this data-sparse arena.

Supplementary files

Article information

Article type
Edge Article
Submitted
16 Jan 2026
Accepted
04 May 2026
First published
05 May 2026
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY-NC license

Chem. Sci., 2026, Accepted Manuscript

Synthesis and Machine Learning Techniques to Enable Data-Driven Investigation of Supramolecular Host-Guest Interactions

A. Shaurya, A. H. Begherzadeh Mostaghimi, D. Turnbull, F. Hof and J. F. Van Humbeck, Chem. Sci., 2026, Accepted Manuscript , DOI: 10.1039/D6SC00479B

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements