Issue 12, 2025

Navigating materials design spaces with efficient Bayesian optimization: a case study in functionalized nanoporous materials

Abstract

Machine learning (ML) has the potential to accelerate the discovery of high-performance materials by learning complex structure–property relationships and prioritizing candidates for costly experiments or simulations. However, ML efficiency is often offset by the need for large, high-quality training datasets, motivating strategies that intelligently select the most informative samples. Here, we formulate the search for top-performing functionalized nanoporous materials (metal–organic and covalent–organic frameworks) as a global optimization problem and apply Bayesian Optimization (BO) to identify regions of interest and rank candidates with minimal evaluations. We highlight the importance of a proper and efficient initialization scheme of the BO process, and we demonstrate how BO-acquired samples can also be used to train an XGBoost regression predictive model that can further enrich the efficient mapping of the region of high performing instances of the design space. Across multiple literature-derived adsorption and diffusion datasets containing thousands of structures, our BO framework identifies 2×- to 3×-more materials within a top-100 or top-10 ranking list, than random-sampling-based ML pipelines, and it achieves significantly higher ranking quality. Moreover, the surrogate enrichment strategy further boosts top-N recovery while maintaining high ranking fidelity. By shifting the evaluation focus from average predictive metrics (e.g., R2, MSE) to task-specific criteria (e.g., recall@N and nDCG), our approach offers a practical, data-efficient, and computationally accessible route to guide experimental and computational campaigns toward the most promising materials.

Graphical abstract: Navigating materials design spaces with efficient Bayesian optimization: a case study in functionalized nanoporous materials

Supplementary files

Article information

Article type
Paper
Submitted
30 May 2025
Accepted
03 Nov 2025
First published
03 Nov 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2025,4, 3753-3763

Navigating materials design spaces with efficient Bayesian optimization: a case study in functionalized nanoporous materials

P. Krokidas, V. Gkatsis, J. Theocharis and G. Giannakopoulos, Digital Discovery, 2025, 4, 3753 DOI: 10.1039/D5DD00237K

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements