Decoding Cryptic Defluorinases through a Latent Generative Sequence Landscape

Abstract

The special nature of the fluorine atom imparts remarkable strength and unique physical properties to chemical bonds. Unlike man-made fluorochemicals, fluorinated natural products remain rare due to low bioavailability and toxicity of fluoride. Despite this, defluorinases have evolved in nature to cleave carbon-fluorine bonds, with the hydrolytic fluoroacetate dehalogenase being one of the most wellcharacterized examples. These enzymes are of fundamental interest and hold unrealized biotechnological potential, yet the scope of this unique chemistry remains underexplored in the biosphere. Here, we trained and applied a machine learning-based framework, termed Latent Generative Landscapes (LGLs), to map the functional sequence space of the α/β-hydrolase superfamily. This approach identified 3,014 putative defluorinases that were previously not annotated or plausibly misannotated. Experimental validation of selected candidates led to the reclassification of five novel defluorinases, all exhibiting high thermal stability (Tm > 70 ˚C) and diverse catalytic efficiencies with conserved enantioselectivity on the model substrate 2-fluoro-2-phenylacetate. Notably, the enzyme A0A4Z0BVY8 exhibited 2.7-fold greater defluorination activity than the current state-of-the-art enzyme Q6NAM1. Our results establish that LGL modeling is a powerful strategy to decode cryptic carbon-fluorine bond chemistry in nature, enabling the future discovery and engineering of defluorination biocatalysts.

Supplementary files

Article information

Article type
Edge Article
Submitted
31 Mar 2026
Accepted
23 May 2026
First published
26 May 2026
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY-NC license

Chem. Sci., 2026, Accepted Manuscript

Decoding Cryptic Defluorinases through a Latent Generative Sequence Landscape

K. Ji, S. S. Barnes, C. Ziegler, M. Nikpey, A. de la Paz, E. Pack, N. Kvasovs, F. Morcos and S. Dodani, Chem. Sci., 2026, Accepted Manuscript , DOI: 10.1039/D6SC02671K

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements