Machine learning-guided protein engineering to improve the catalytic activity of transaminases under neutral pH conditions†
Abstract
Biocatalysis provides an eco-friendly and efficient method for the synthesis of fine chemicals, pharmaceuticals, and biofuels. However, the catalytic performance of enzymes is greatly reduced when they react under non-optimal pH conditions. Despite efforts in protein engineering to improve enzymatic pH dependence, predicting and optimizing catalytic activity under various pH conditions remain challenging. Recent advancements in machine learning (ML) provide a promising solution by enabling data-driven predictions of enzyme properties, thus reducing the need for labor-intensive experiments. This study aims to enhance enzyme activity and regulate pH dependence through ML-guided protein engineering. Using a transaminase from Ruegeria sp. TM1040 as the mutation template, we first collected high-quality experimental data from a series of variants by measuring their activity under a series of pH conditions. Based on these data, we developed an ML model to predict catalytic activity under different pH conditions. This work provides a powerful ML tool to guide the rational design of variants, which showed improved activity (up to 3.7-fold compared to the starting variant) at pH 7.5, and offers a data-driven protein engineering strategy to achieve co-optimization of enzyme activity and pH dependence, demonstrating the potential of ML to accelerate enzyme engineering for industrial applications.