Black-Box Data: A new paradigm for biomedicine in the AI era

Abstract

As Artificial Intelligence cements its role as a cornerstone of scientific discovery, the field is undergoing a fundamental shift beyond the current transition from "white-box" first-principles models to "black-box" deep learning. We argue that a parallel, necessary transformation is emerging in data generation: the rise of "black-box data." These data sources are intentionally optimized for machine consumption rather than human intuition—a trade-off we contend is essential to achieving the scale required for high-capacity biological foundation models. This article defines the "black-box data" paradigm, explores the necessity of this shift for the future of AI-driven science, and provides a unifying taxonomy illustrated by both historical precedents and contemporary breakthroughs.

Article information

Article type
Perspective
Submitted
10 Feb 2026
Accepted
31 Mar 2026
First published
08 Apr 2026
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY-NC license

Chem. Sci., 2026, Accepted Manuscript

Black-Box Data: A new paradigm for biomedicine in the AI era

L. Naef and M. Bronstein, Chem. Sci., 2026, Accepted Manuscript , DOI: 10.1039/D6SC01189F

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements