Spectroscopy-Assisted Bayesian Optimization for Efficient Refolding of Inclusion Body Proteins

Abstract

The production of recombinant proteins in Escherichia coli often yields insoluble inclusion bodies, which require denaturation and refolding to obtain the native product. The pro- tein refolding step usually represents a major bottleneck. Conventional development and optimization typically rely on sequential Design of Experiments with high-performance liquid chromatography readouts. This approach is slow, labor-intensive, and requires an established chromatographic method as well as purified protein standards. At the beginning of process development, these prerequisites may not be met - especially for proteins that can only be expressed as inclusion bodies. We introduce a more efficient, data-driven workflow that pairs Bayesian optimization with a rapid, in-line readout from intrinsic tryptophan fluorescence. Using a disulfide-bonded single-chain variable fragment, we explored a five-dimensional design space of refolding buffer composition (dithiothreitol, oxidized glutathione, dilution factor, pH, and final urea concen- tration) guided by two spectroscopy-derived objectives. We showed that the spectral shift correlates with chromatographic yields, supporting its use as a fit-for-purpose sensor to guide process development. With 25 experiments, Bayesian optimization identified conditions that delivered a refolded protein concentration of 1.29 ± 0.06 g L−1 at 58.7 ± 1.3 % refolding yield with a dilution factor of 3.14, whereas a three-stage Design of Experiments with more than 60 experiments concluded at 0.37 ± 0.02 g L−1 and 61.4 ± 3.1 % with a dilution factor of 11.39. Thus, the presented workflow achieved roughly 3.5-fold higher product concentration at comparable yield, while operating at substantially higher protein concentrations. Therefore, spectroscopy- assisted Bayesian optimization was found to be a practical, sample-efficient tool for refolding optimization that is especially valuable in early development stages.

Supplementary files

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Paper
Submitted
22 Jan 2026
Accepted
14 May 2026
First published
18 May 2026
This article is Open Access
Creative Commons BY-NC license

Digital Discovery, 2026, Accepted Manuscript

Spectroscopy-Assisted Bayesian Optimization for Efficient Refolding of Inclusion Body Proteins

F. Gisperg, R. Klausser, M. Kierein, M. Elshazly, J. Kopp, E. Prada Brichtova and O. Spadiut, Digital Discovery, 2026, Accepted Manuscript , DOI: 10.1039/D6DD00035E

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements