Issue 12, 2025

A FAIR research data infrastructure for high-throughput digital chemistry

Abstract

The growing demand for reproducible, high-throughput chemical experimentation calls for scalable digital infrastructures that support automation, traceability, and AI-readiness. A dedicated research data infrastructure (RDI) developed within Swiss Cat+ is presented, integrating automated synthesis, multi-stage analytics, and semantic modeling. It captures each experimental step in a structured, machine-interpretable format, forming a scalable, and interoperable data backbone. By systematically recording both successful and failed experiments, the RDI ensures data completeness, strengthens traceability, and enables the creation of bias-resilient datasets essential for robust AI model development. Built on Kubernetes and Argo Workflows and aligned with FAIR principles, the RDI transforms experimental metadata into validated Resource Description Framework (RDF) graphs using an ontology-driven semantic model. These graphs are accessible through a web interface and SPARQL endpoint, facilitating integration with downstream AI and analysis pipelines. Key features include a modular RDF converter and ‘Matryoshka files’, which encapsulate complete experiments with raw data and metadata in a portable, standardized ZIP format. This approach supports scalable querying and sets the stage for standardized data sharing and autonomous experimentation.

Graphical abstract: A FAIR research data infrastructure for high-throughput digital chemistry

Supplementary files

Article information

Article type
Paper
Submitted
07 Jul 2025
Accepted
21 Oct 2025
First published
03 Nov 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2025,4, 3502-3514

A FAIR research data infrastructure for high-throughput digital chemistry

A. Gauthier, L. Vancauwenberghe, J. Cousty, C. Matthey-Doret, R. Franken, S. Maennel, P. Miéville and O. Riba Grognuz, Digital Discovery, 2025, 4, 3502 DOI: 10.1039/D5DD00297D

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements