A Critical Examination of Active Learning Workflows in Materials Science

Abstract

Active learning (AL) is an increasingly important approach for data-efficient machine learning (ML) in materials science. It is widely used, from building training datasets to guiding autonomous materials discovery platforms. However, the performance of AL workflows depends on a number of often implicit design choices that are rarely examined systematically. Here, we critically analyze commonly used AL strategies in materials science, highlighting overlooked assumptions, hidden biases, and methodological limitations across different applications. Based on this, we provide practical guidelines to enhance the efficiency and reliability of AL workflows for materials science applications.

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Perspective
Submitted
18 Feb 2026
Accepted
22 May 2026
First published
25 May 2026
This article is Open Access
Creative Commons BY license

Digital Discovery, 2026, Accepted Manuscript

A Critical Examination of Active Learning Workflows in Materials Science

A. S. Nair and L. Foppa, Digital Discovery, 2026, Accepted Manuscript , DOI: 10.1039/D6DD00081A

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements