Can We Automate Scientific Reasoning in Closed-Loop Experiments using Large Language Models?
Abstract
We present a detailed study of our hybrid optimisation framework, BORA, which integrates large language model (LLM) reasoning with Bayesian optimisation (BO) to accelerate scientific discovery in closed-loop experiments. We compare five modern LLMs (o4-mini, o3, gpt-5-mini, gpt-5, and gemini-2.5-flash) as optimisers on two benchmark problems: a 10-dimensional photocatalytic hydrogen-evolution experiment and a 7-dimensional physics-based pétanque simulation. The results show that LLM/BO hybrids outperform BO-only approaches, particularly in early-stage exploration where the search is warm-started by LLM-driven hypotheses. Among the models tested, o3 delivered the strongest and most consistent optimisation performance after 150 experiments. LLM-only optimisations, without the BO component, matched or surpassed the hybrid methods in some settings, locating global optima with high repeatability. We demonstrate that appending human hypotheses, prior literature, or experimental datasets can improve convergence, and that LLM reasoning can in some cases recover from deliberately misleading prompts. We also explore outlier runs to understand the limitations and failure modes of these methods, and consider the energy implications of the LLM queries. The strongest LLM-only performance was observed with a batch size of one, suggesting that experiment-by-experiment machine reasoning is a viable strategy for certain scientific optimisation tasks.