Open Access Article
Ravi Kumar Verma‡a, Wan Lin Yeo‡b, Elaine Tiong‡c, Ee Lui Ang*be, Yee Hwee Lim*de, Fong Tian Wong*cd and Hao Fan*aef
aBioinformatics Institute, Agency for Science, Technology and Research, 30 Biopolis Street, #07-01 Matrix Building, Singapore 138671, Republic of Singapore. E-mail: ravikumarv@bii.a-star.edu.sg; hfan2006@gmail.com; fanh@bii.a-star.edu.sg
bSingapore Institute of Food and Biotechnology Innovation, Agency for Science, Technology and Research, 31 Biopolis Way, #04-01 Nanos, Singapore 138669, Republic of Singapore. E-mail: yeo_wan_lin@sifbi.a-star.edu.sg; ang_ee_lui@sifbi.a-star.edu.sg
cInstitute of Molecular and Cell Biology, Agency for Science, Technology and Research, 61 Biopolis Drive, Proteos, Singapore 138673, Republic of Singapore. E-mail: elaine_tiong@imcb.a-star.edu.sg; wongft@imcb.a-star.edu.sg
dInstitute of Sustainability for Chemicals, Energy and Environment, Agency for Science, Technology and Research, 8 Biomedical Grove, #07-01 Neuros Building, Singapore 138665, Republic of Singapore. E-mail: lim_yee_hwee@isce2.a-star.edu.sg
eSynthetic Biology Translational Research Program, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore 117597, Republic of Singapore
fDepartment of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117596, Republic of Singapore
First published on 5th May 2025
S-Adenosylmethionine (SAM)-dependent fluorinases have emerged as environmentally friendly enzymatic alternatives for organofluorine chemical synthesis. However, their use remains limited by their rarity; only 16 fluorinases have been found in nature so far. Here we report two new fluorinases, FLASbac from Streptosporangiales bacterium and a modified FLAAdig_Nter from Actinoplanes digitatis. Through molecular dynamics (MD) simulations, we have identified the crucial roles of the SAM-binding site and an ion-egress site (IES) in the fluorination reaction, particularly in the enzyme's preference for fluoride ions. We have validated these findings by testing mutants of the two new fluorinases and the known fluorinase from Streptomyces sp. MA37 (FLAMA37). Through these targeted mutations, we identified, for the first time, specific sites at which substitutions in certain variants significantly enhance the enzyme's specificity for fluorination over chlorination while maintaining its fluorination activity. In these particular variants, the preference for fluorine increased from approximately 10-fold to over 200-fold. Overall, this research advances our fundamental understanding of enzymatic fluorination, providing a basis for further exploration of fluorinase optimization. In turn, these advancements could open new opportunities for the pharmaceutical industry in the development of organofluorine drugs and other fluorine-reliant biotechnologies.
Fig. 1 (A) Biological unit of “SAM Hydrolases/SAM-Dependent Halogenases” superfamily members (PDB 1RQP4). (B) Diversity of the reaction mechanisms in the “SAM Hydrolases/SAM-Dependent Halogenases” superfamily. (C) Identical structural fold in the “SAM Hydrolases/SAM-Dependent Halogenases” superfamily. The fluorinase (PDB 1RQP4), chlorinase (PDB 6RYZ5), and hydrolase (PDB 2WR8 6) are shown in magenta, yellow, and grey cartoons, respectively, with bound SAM (orange) and the co-crystallized chloride ion (green) in chlorinase. Inset highlights different IBS residues in chlorinase and fluorinase, with the modelled fluoride ion shown as a cyan sphere. (D) Schematic illustration of results from SSN analysis. Four subnetworks containing putative fluorinase sequences are represented in different shapes and colours. The dashed line indicates a sequence identity threshold of 60%, meaning the maximum sequence identity between any two sequences from different subnetworks is below 60%. Subnetwork-1 includes the 16 known fluorinases and the four newly discovered putative fluorinases (FLASbac, FLAAdig, FLASmor, and FLAGbac), which are highlighted in bold font.
Despite the promising prospects, applications of fluorinases have been restricted by their scarcity in nature: to date only 16 fluorinases have been identified,10 of which two, FLAScat from Streptomyces cattleya4 and FLAMA37 from Streptomyces sp. MA37,11 have crystal structures. Furthermore, these enzymes exhibit a degree of promiscuity, catalysing not only fluorination but also chlorination reactions.10,12 Previous studies comparing fluorinases and chlorinases have established that the halogen specificity is influenced by the architecture of the ion-binding site (IBS). This site has previously been referred to by various names, such as “halide binding pocket”13 or “anion binding site”.14 Here, we refer to it as the IBS to distinguish it from the ion-egress site (IES). A key difference in our definition is the treatment of residue T80. While previous studies grouped T80 and S158 together, we classify S158 under the IBS and T80 under the IES. The IBS is composed of four consecutive and conserved residues: a threonine, a phenylalanine/tryptophan, a tyrosine/phenylalanine, and a serine (Fig. 1C and Table S1†).13 Mutating the fourth residue, serine S158, in the IBS of FLAScat, specifically to S158G and S158A,15 led to reduced fluorinase activity, emphasizing the role of the S158 hydroxyl group in fluoride ion binding and catalysis. However, introducing a serine at the fourth IBS position of chlorinases did not produce fluorinases,9,16 indicating that other residues are also crucial for halogenation specificity. Interestingly, although the significance of the IBS is understood, previous attempts to modify it often had negative effects, even when they increased specificity for fluorination.15,17
In this study, we first expanded the known sequence space of fluorinases by identifying two new fluorinases, FLASbac from Streptosporangiales bacterium and FLAAdig_Nter from Actinoplanes digitatis, using Sequence Similarity Network (SSN) analysis and experimental testing. Guided by MD simulations, we analysed FLASbac, FLAAdig_Nter, and a known fluorinase from Streptomyces sp. MA37 (FLAMA37).18 This yielded additional insights into the known SAM-binding site and ion-binding site, along with the discovery of an ion-egress site (IES), which serves as a critical gateway to the IBS. In contrast to previous studies, our targeted mutations at the SAM-binding site and the IES enabled us, for the first time, to engineer fluorinase variants with increased specificity for fluorination over chlorination while maintaining their fluorination activity. Collectively, these findings provide a comprehensive new perspective on the molecular mechanisms governing fluorinase specificity and ion-binding preferences.
21 and a cut-off of 60% sequence identity was employed to remove edges and divide SSN into smaller subnetworks. Further, using fluorinase enzyme classification number (EC: 2.5.1.63) as a filtering criterion, we identified four distinct subnetworks (subnetwork 1–4), each containing at least one sequence annotated as a fluorinase.
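The edge-removal step can be illustrated with a minimal sketch. It assumes pairwise sequence identities are already available as a dictionary; the sequence names and identity values below are invented for illustration, and the union-find grouping is a generic stand-in, not the SSN software used in the study.

```python
def split_into_subnetworks(nodes, identities, cutoff=60.0):
    """Group nodes into connected components, keeping only edges
    whose pairwise sequence identity (in %) is >= cutoff."""
    parent = {n: n for n in nodes}

    def find(n):
        # Follow parent links to the component root, compressing the path.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for (a, b), pct in identities.items():
        if pct >= cutoff:
            parent[find(a)] = find(b)

    groups = {}
    for n in nodes:
        groups.setdefault(find(n), set()).add(n)
    # Largest subnetwork first; ties broken by member names.
    return sorted(groups.values(), key=lambda g: (-len(g), sorted(g)))


# Hypothetical sequences and identities (illustration only).
nodes = ["seqA", "seqB", "seqC", "seqD", "seqE"]
identities = {
    ("seqA", "seqB"): 78.8,
    ("seqB", "seqC"): 68.9,
    ("seqC", "seqD"): 31.4,
    ("seqD", "seqE"): 93.7,
    ("seqA", "seqE"): 24.2,
}
subnetworks = split_into_subnetworks(nodes, identities)
for i, members in enumerate(subnetworks, start=1):
    print(f"subnetwork {i}: {sorted(members)}")
```

With this grouping, any two sequences that end up in different subnetworks share less than 60% identity, which matches the threshold described for the SSN.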
23 and the force field OPLS4.24 To elucidate why fluorinases prefer fluorination reactions over chlorination, additional systems with chloride ion in the IBS were also prepared for molecular dynamics simulations.
000 atoms. The initial MD-simulation systems were relaxed in multiple short MD steps. Briefly, the systems were first relaxed using Brownian dynamics at 10 K with restraints on the solute atoms, followed by another relaxation step at 100 K with restraints on the solute heavy atoms under the NVT ensemble. Next, the system temperature was gradually raised to the desired simulation temperature using simulated annealing with restraints on the solute heavy atoms under the NPT ensemble. This was followed by a relaxation step with restraints applied only to the protein backbone and ligand heavy atoms. The system was then equilibrated for 12 nanoseconds (ns) with restraints only on the ligand heavy atoms. Finally, the equilibrated system was subjected to an unrestrained 300 ns production run. For both equilibration and the subsequent unrestrained production runs, we used the Langevin constant pressure and temperature dynamical system27 to maintain pressure at 1 atm, with a time step of 2 femtoseconds under the NPT ensemble. Long-range electrostatic interactions were calculated using the particle mesh Ewald method.28 A radius of 12.0 Å was used for Coulomb interactions. The systems were simulated at 47 °C. To improve conformational sampling, two independent trajectories with different random initial velocities were computed for each system. The data analysis was carried out with the MDTraj program.29
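The run lengths quoted above translate into integration steps as in the short sketch below. Only the 2 fs time step, the 12 ns and 300 ns durations, the two trajectories per system, and the 47 °C temperature come from the text; the function and variable names are illustrative.

```python
FS_PER_NS = 1_000_000  # femtoseconds per nanosecond


def n_steps(duration_ns: int, timestep_fs: int = 2) -> int:
    """Number of integration steps needed to cover duration_ns."""
    return duration_ns * FS_PER_NS // timestep_fs


def celsius_to_kelvin(t_celsius: float) -> float:
    return t_celsius + 273.15


equilibration_steps = n_steps(12)        # restrained equilibration
production_steps = n_steps(300)          # unrestrained production run
temperature_k = celsius_to_kelvin(47.0)  # simulation temperature
sampling_per_system_ns = 2 * 300         # two independent trajectories

print(equilibration_steps, production_steps, temperature_k, sampling_per_system_ns)
```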
:
1 volume of methanol. The quenched reaction mixtures were centrifuged (2182 × g for 20 min) and 10 μL was used for HPLC-UV analysis.
All reactions using purified proteins were carried out in triplicate in 1.5 mL tubes in a shaking incubator at 250 rpm. Reactions were stopped by heating the samples at 95 °C for 1 min (using a PCR machine). The precipitated protein was then removed by centrifugation (20 238 × g for 2 min). 10 μL of the reaction mixture was used for HPLC-UV analysis.
Tube enzymatic reaction conditions (activity): SAM (1 mM), NaF or NaCl (200 mM), and fluorinase purified proteins (20 μM) in a final volume of 100 μL at 37 °C for 1.5 h and 24 h.
Tube enzymatic reaction conditions (thermal stability): SAM (1 mM), NaF or NaCl (200 mM), and fluorinase purified proteins (20 μM) in a final volume of 100 μL at 37 °C for 1.5 h at either 47 °C or 60 °C.
Kinetic assay reaction conditions: various concentrations of SAM (10–1000 μM), NaF (200 mM), and fluorinase purified proteins (20 μM) in a final volume of 100 μL at 37 °C. Reactions were carried out at various time points. All reactions were carried out in triplicate in 200 μL PCR tubes using the PCR machine. Reactions were stopped by heating the samples at 95 °C for 1 min (using a PCR machine). The precipitated protein was then removed by centrifugation (20 238 × g for 2 min). 10 μL of the reaction mixture was used for HPLC-UV analysis. Kinetic parameters were obtained from the best-fit model of initial velocity against substrate concentration based on the Michaelis–Menten equation using GraphPad Prism 9 (GraphPad Software).
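For readers who want to see what such a fit involves, the sketch below performs a least-squares Michaelis–Menten fit using only the Python standard library. It is not the GraphPad Prism procedure used in the study: the golden-section search, the search bracket, and the noise-free synthetic data (Vmax = 2.0, KM = 150 μM) are all assumptions chosen for illustration.

```python
def michaelis_menten(s, vmax, km):
    """Initial velocity predicted for substrate concentration s."""
    return vmax * s / (km + s)


def best_vmax(km, s_vals, v_vals):
    """For a fixed KM the model is linear in Vmax, so the
    least-squares Vmax has a closed form."""
    f = [s / (km + s) for s in s_vals]
    return sum(v * fi for v, fi in zip(v_vals, f)) / sum(fi * fi for fi in f)


def sse(km, s_vals, v_vals):
    """Sum of squared residuals at a given KM (Vmax optimized out)."""
    vmax = best_vmax(km, s_vals, v_vals)
    return sum((v - michaelis_menten(s, vmax, km)) ** 2
               for s, v in zip(s_vals, v_vals))


def fit(s_vals, v_vals, km_lo=1.0, km_hi=5000.0, iters=200):
    """Golden-section search over KM, assuming the error has a
    single minimum inside [km_lo, km_hi]."""
    g = (5 ** 0.5 - 1) / 2
    a, b = km_lo, km_hi
    for _ in range(iters):
        c, d = b - g * (b - a), a + g * (b - a)
        if sse(c, s_vals, v_vals) < sse(d, s_vals, v_vals):
            b = d
        else:
            a = c
    km = (a + b) / 2
    return best_vmax(km, s_vals, v_vals), km


# Synthetic, noise-free data spanning the 10-1000 uM SAM range (illustration only).
s_vals = [10, 25, 50, 100, 250, 500, 1000]
v_vals = [michaelis_menten(s, 2.0, 150.0) for s in s_vals]
vmax_fit, km_fit = fit(s_vals, v_vals)
print(f"Vmax = {vmax_fit:.3f}, KM = {km_fit:.1f} uM")
```

Because the model is linear in Vmax once KM is fixed, the two-parameter problem reduces to a one-dimensional search, which keeps the example free of external dependencies.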
Mobile phase A: 0.1% formic acid in water; mobile phase B: 0.1% formic acid in methanol. Column: Phenomenex Kinetex® 2.6 μm biphenyl 100 Å, LC column 150 × 4.6 mm. Flow rate: 0.6 mL min−1, isocratic elution at 23% B for 15 min. The retention time of 5′-FDA is approximately 3.95 min and that of 5′-ClDA is approximately 5.93 min.
Subnetwork-1 includes 20 unique sequences, 16 of which are known fluorinases, including FLAScat and FLAMA37 (Table S1 and Fig. S1†). The newly identified members FLAGbac, FLASbac, FLAAdig, and FLASmor exhibit maximal sequence identities of 68.9%, 78.8%, 93.7%, and 97.9%, respectively, with known fluorinases. FLASmor was not screened due to its high sequence identity to known fluorinases. FLAGbac could not be expressed in any soluble form, hence no further experiments were performed. FLASbac and FLAAdig were expressed and assessed for fluorination and chlorination activity with SAM as substrate, as well as for thermal stability (Fig. 2). FLASbac exhibited fluorination activity in its native sequence, whilst activity for FLAAdig had to be rescued through an N-terminal addition of a β-strand-forming peptide from FLAMA37, yielding FLAAdig_Nter (Fig. 2A and Table S2†). Further discussion on these findings is provided in Section 3.2.
The remaining three subnetworks (subnetworks 2–4) consist of 370, 10, and 2 unique sequences, respectively, with experimentally uncharacterized functions. These highly divergent sequences show low sequence identity to known fluorinases, with maximum identities of 24.2–31.4%, 26.9–29.3%, and 26.7–27.5% for subnetworks 2, 3, and 4, respectively. Furthermore, members of these subnetworks show notable differences in the IBS residues (Fig. S2†). We selected 20 sequences from subnetworks 2–4 (Table S3†) for testing, but none of these sequences exhibited detectable fluorination or chlorination activity. Importantly, none of the sequences in these subnetworks, or any other sequences in the SSN, exhibited the key IBS feature of known fluorinases: a threonine at the first position, aromatic residues at the second and third positions, and a serine at the last position. This underscores the importance of these specific residues for fluorinase function.
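The IBS signature described above (threonine, phenylalanine/tryptophan, tyrosine/phenylalanine, serine at four consecutive positions) can be expressed as a simple pattern search. The sketch below is only an illustration: the peptide fragments are invented, and a plain regular expression is an assumption that ignores alignment context.

```python
import re

# Thr, then Phe/Trp, then Tyr/Phe, then Ser at four consecutive positions.
IBS_MOTIF = re.compile(r"T[FW][YF]S")


def find_ibs_motif(sequence: str):
    """Return the 1-based start position of every IBS-like motif."""
    return [m.start() + 1 for m in IBS_MOTIF.finditer(sequence.upper())]


# Invented peptide fragments (not real fluorinase sequences).
examples = {
    "frag_TFYS": "GAVTFYSLD",   # matches the signature
    "frag_TWFS": "GAVTWFSLD",   # allowed alternative aromatics
    "frag_TFYG": "GAVTFYGLD",   # serine replaced, no match
    "frag_TLYS": "MKTLYSAAD",   # non-aromatic second position, no match
}
for name, seq in examples.items():
    print(name, find_ibs_motif(seq))
```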
Further analysis of sequence features revealed critical variations within the β3–β4 loop, located between the β3 and β4 strands, among members of subnetworks 1–4 (Fig. S3†). Subnetwork-1 sequences contain two conserved threonine residues at positions 80 (T80) and 82 (T82). Of these, T80 is conserved only in subnetwork-1, while T82 is also conserved in subnetwork-3 and subnetwork-4. In contrast, subnetwork-2 sequences lack threonine at both of these positions. Additionally, subnetwork-1 contains a single conserved positively charged arginine at position 85 (R85), whereas subnetworks 2–4 exhibit an overabundance of positively charged residues at multiple positions in the β3–β4 loop. Prior mutagenesis studies on FLAScat highlighted the role of T80 in fluorination activity, with the T80A substitution reducing the catalytic activity to 15% of the wild-type.30 We hypothesize that, in addition to the IBS residues, the distinct sequence features in the β3–β4 loop, particularly the conserved threonine and arginine residues in subnetwork-1 and the divergences in subnetworks 2–4, have functional implications. This hypothesis is explored in detail in Section 3.4, where we experimentally investigate the structural and functional implications of these variations.
We compared the F− and Cl− binding free energies for the IBS residues from the MD-relaxed initial conformations of FLAMA37, FLASbac, and FLAAdig_Nter using the MMGBSA method.22 The fluoride ion showed stabilizing negative binding free energy values for S158 (−2.87, −0.33, and −3.21 kcal mol−1 in FLAMA37, FLASbac, and FLAAdig_Nter, respectively), which are consistent with the experimental mutational data on S158, where S158G and S158A variants led to reduced fluorinase activity.15 In contrast, the chloride ion showed destabilizing positive values (2.09, 2.79, and 2.19 kcal mol−1), likely due to its larger van der Waals radius and lower electronegativity. Additionally, the other three IBS residues, T155, F156, and Y157, also contributed to the relative preference for binding F− over Cl− by the three enzymes (Table S5†), except for T155 in FLASbac. Although useful, the MMGBSA method cannot fully capture the breadth of molecular interactions, as it relies on approximate models for solvation and often treats entropic contributions at a simplified level.32 Therefore, we further explored the stability of the F− and Cl− ions in the IBS with MD simulations. Specifically, we computed the minimum distance between the fluoride/chloride ion and the polar hydrogen atoms of S158 in the IBS (Fig. S4A†). Our analysis revealed that F− shows much higher occupancy than Cl− in the IBS of all three fluorinases (Fig. S4B†). In FLAMA37 and the newly identified FLASbac, the minimum distance between F− and the sidechain/backbone polar hydrogen atoms remained less than 2.5 Å throughout the entire MD simulation trajectories, while the engineered FLAAdig_Nter showed a 4.77% loss of this critical interaction, occurring exclusively in one of the two 300 ns simulation trajectories. Unlike the fluoride ion, the chloride ion demonstrated a tendency to exit the IBS during the MD simulations. Specifically, in every trimer simulation conducted, at least one monomer exhibited a clear egress of the chloride ion from the IBS. This behaviour highlights a key difference in the interaction dynamics of fluoride and chloride ions within the fluorinase enzyme. To rule out potential force field artifacts, we also conducted additional MD simulations of F− and Cl− bound to the related chlorinase, SalL. Interestingly, these MD simulations revealed an opposite trend: F− displayed a greater tendency to dissociate compared to Cl− ions (Fig. S5†). We observed that in three of the six monomers, specifically monomers M1 and M2 in replicate-1 and M2 in replicate-2, the F− exited the IES (distance > 8 Å) in both F−-bound SalL trajectories. In contrast, no complete egress event was observed in the Cl−-bound SalL trajectories.
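The distance-based occupancy analysis can be sketched as follows. The study used MDTraj on full trajectories; this standard-library version works on a handful of invented coordinates and is only meant to show how the 2.5 Å contact criterion and the 8 Å egress criterion from the text could be applied per frame. All names and coordinates are hypothetical.

```python
import math

CONTACT_CUTOFF = 2.5  # Å, ion to S158 polar hydrogen contact (from the text)
EGRESS_CUTOFF = 8.0   # Å, ion considered to have left the site (from the text)


def min_distances(ion_xyz, hydrogen_xyz):
    """Per-frame minimum distance between the ion and any polar hydrogen.

    ion_xyz: one (x, y, z) tuple per frame.
    hydrogen_xyz: one list of (x, y, z) tuples per frame.
    """
    return [min(math.dist(ion, h) for h in hydrogens)
            for ion, hydrogens in zip(ion_xyz, hydrogen_xyz)]


def contact_loss_percent(dmin):
    """Percentage of frames in which the ion-hydrogen contact is lost."""
    return 100.0 * sum(d >= CONTACT_CUTOFF for d in dmin) / len(dmin)


def has_egress(dmin):
    """True if the ion moves beyond the egress cutoff in any frame."""
    return any(d > EGRESS_CUTOFF for d in dmin)


# Invented four-frame trajectories (coordinates in Å, illustration only).
hydrogens = [[(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]] * 4
bound_ion = [(0.0, 0.0, 2.0)] * 4
leaving_ion = [(0.0, 0.0, 2.0), (0.0, 0.0, 2.2), (0.0, 0.0, 3.0), (0.0, 0.0, 9.0)]

bound_d = min_distances(bound_ion, hydrogens)
leaving_d = min_distances(leaving_ion, hydrogens)
print(contact_loss_percent(bound_d), has_egress(bound_d))
print(contact_loss_percent(leaving_d), has_egress(leaving_d))
```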
Analysis of the minimum distance between F156 and S158 in the IBS revealed smaller median distances in Cl−-bound trajectories compared to F−-bound fluorinase trajectories. The differences in these distances were 0.5 Å, 0.3 Å, and 0.2 Å for FLAMA37, FLASbac, and FLAAdig_Nter, respectively (Fig. S6†). The closer proximity observed in Cl−-bound trajectories likely destabilizes Cl− binding by facilitating its displacement from the IBS, consistent with the enzymes' specificity for fluorination. Previously, it was proposed that in FLAScat, F− binds to the active site in solvated form and subsequently exchanges the bound water molecules by forming interactions with the polar hydrogens of the S158 residue in the IBS.15 Building on this model, we hypothesize that the reverse process occurs during the release of the F−/Cl− ion from the IBS, wherein water molecules re-enter and reoccupy the IBS as the ion exits. To test this hypothesis, we estimated water penetration events within the IBS using a minimum distance criterion of 3.0 Å between the sidechain and backbone polar hydrogens of S158 in the IBS and water molecules, considering only frames where F− or Cl− remained within 8.0 Å of the IBS. Water penetration was minimal in F−-bound trajectories, occurring in 2.5%, 12.6%, and 2.7% of frames for FLAMA37, FLASbac, and FLAAdig_Nter, respectively, but was significantly higher in Cl−-bound trajectories at 26.5%, 36.9%, and 28.8% (Fig. S4C†). These results further suggest that chloride ions exhibit weaker binding affinity within the IBS compared to fluoride. Furthermore, fluorinases with higher fluorination activity, FLAMA37 (kcat/KM: 13.7 ± 2.2 mM−1 min−1) and FLAAdig_Nter (kcat/KM: 12.9 ± 1.9 mM−1 min−1), showed less water penetration than FLASbac (kcat/KM: 10.0 ± 1.9 mM−1 min−1), which has lower fluorinase activity (Table S6†). MD simulations of wildtype FLAAdig, lacking the N-terminal β-strand (Fig. 2A), showed higher incidences of water penetration in both F−- and Cl−-bound trajectories (22.3% and 34.1%, respectively) compared to FLAAdig_Nter, highlighting the critical role of the N-terminal β-strand in fluorinase function.
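The water-penetration criterion can be written compactly as below. This is a sketch under stated assumptions: the per-frame distances are taken as precomputed inputs, the values are invented, and treating both cutoffs as inclusive (<=) is a guess, since the text does not say whether the thresholds are strict.

```python
def water_penetration_percent(water_dmin, ion_dist,
                              water_cutoff=3.0, ion_cutoff=8.0):
    """Percentage of frames with water within water_cutoff (Å) of the
    S158 polar hydrogens, counting only frames in which the ion is
    still within ion_cutoff (Å) of the IBS."""
    considered = [w for w, d in zip(water_dmin, ion_dist) if d <= ion_cutoff]
    if not considered:
        return float("nan")
    penetrated = sum(w <= water_cutoff for w in considered)
    return 100.0 * penetrated / len(considered)


# Invented per-frame distances in Å (illustration only).
water_dmin = [4.1, 2.8, 3.5, 2.9, 2.7]  # nearest water to S158 polar hydrogens
ion_dist = [2.0, 2.1, 2.3, 2.2, 9.5]    # ion distance from the IBS

pct = water_penetration_percent(water_dmin, ion_dist)
print(f"water penetration: {pct:.1f}% of considered frames")
```

The last frame is excluded because the ion has already moved beyond 8.0 Å, so water entering the empty site there is not counted as penetration.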
Fig. 3 Structural and functional analysis of the SAM-binding site. (A) 2D ligand interaction plot, generated with LigPlot+,31 illustrating the types of molecular interactions between SAM (depicted in purple) and SAM-binding residues in FLASbac. The FLASbac homology model was constructed using FLAScat (PDB 1RQP4) as the template. The location of H211, which does not interact directly with SAM, is marked with an ellipse. (B) Interaction (magenta) between the SAM-binding site residue D210 (green) and SAM (grey) within FLASbac, with adjacent H211 (pink) and fluoride ion (cyan). (C) Interaction between R211 and E54 in the FLASbac_H211R mutant (green) compared to wild-type FLASbac (grey). (D) Residue conservation at positions 210–211 in the 16 known fluorinases. (E) Effect of mutations on halogenation activity.
Building on insights from the D210A mutation results, we also targeted the adjacent H211, which is less conserved than D210 in known fluorinases (Fig. 3C, D and Fig. S1†). This residue is hypothesized to play a role in maintaining the structural integrity of the SAM-binding site. Analysis of our wild-type MD simulation trajectories revealed that H211 forms inter-monomer hydrogen bond interactions with the sidechain or backbone of the nearby residue T20 (Movie S1†). Mutating H211 to arginine (H211R) could alter these inter-monomer interactions, affecting the dimer interface where SAM binds. Our MD simulations of the H211R mutant supported this hypothesis. R211 maintained interactions with the backbone of T20 but showed reduced interactions with the T20 sidechain. Additionally, the extended sidechain of R211 allowed the formation of new intra-monomer interactions with E54 (Movie S2†). E54 is positioned near F50 (W50 in FLAMA37 and FLAAdig_Nter), which stabilizes the adenosine moiety of SAM through π–π interactions (Movie S3†). Thus, the R211–E54 interaction could potentially influence intra-monomer SAM binding as well. Experimentally, FLAMA37_H211R, FLASbac_H211R, and FLAAdig_Nter_H212R largely retained fluorination activities, with values of 0.96-, 0.87-, and 0.88-fold of their respective wild-types (Fig. 3E). FLASbac_H211R and FLAMA37_H211R exhibited increased F−/Cl− specificities of 28.8- and 16.6-fold, respectively, compared to wild-type FLASbac (10.4-fold) and FLAMA37 (11.5-fold). In contrast, FLAAdig_Nter_H212R showed a slightly reduced F−/Cl− specificity of 8.3-fold (Table S4†) compared to the 8.65-fold specificity of FLAAdig_Nter.
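To put the reported specificities side by side, the sketch below divides each mutant's F−/Cl− specificity by that of its wild-type. The specificity values are those quoted in the text; the dictionary layout and the "relative change" measure are illustrative choices, not quantities reported by the authors.

```python
# F-/Cl- specificity (fold) as quoted in the text: (wild-type, H211R/H212R mutant).
specificity = {
    "FLASbac": (10.4, 28.8),
    "FLAMA37": (11.5, 16.6),
    "FLAAdig_Nter": (8.65, 8.3),
}


def relative_change(wild_type: float, mutant: float) -> float:
    """Mutant specificity expressed as a multiple of the wild-type value."""
    return mutant / wild_type


changes = {name: relative_change(wt, mut) for name, (wt, mut) in specificity.items()}
for name, ratio in changes.items():
    direction = "increase" if ratio > 1 else "decrease"
    print(f"{name}: {ratio:.2f}x of wild-type specificity ({direction})")
```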
Within the β3–β4 loop, threonine residues are conserved at positions 80 and 82, and are often observed at positions 83 and 84 in subnetwork-1 members (Fig. S3†). In contrast, threonine residues are absent from these positions (except for 82) in members of subnetworks 2 to 4 (Fig. S3†). Prior studies on FLAScat suggested that T80 could stabilize the fluoride ion.30 However, our MD simulation results indicate that T80, positioned between the IBS and IES, can also play a crucial role in facilitating ion transport between these regions (Movie S4†). Hence the absence of T80 in subnetworks 2–4 may contribute to the lack of detectable fluorination or chlorination activity in those sequences. In addition, as prior dehydration of F− and Cl− ions is required for their accommodation in the IBS,17 we hypothesized that other threonine residues within the β3–β4 loop, although not directly next to the IBS, might also help dehydrate and guide ions into the IBS. Therefore, perturbing the IES could also affect the enzyme's F−/Cl− specificity. To test this hypothesis, the highly conserved threonine at position 82 was mutated to alanine in FLAMA37, FLASbac, and FLAAdig_Nter. FLASbac_T82A showed 1.12-fold fluorination and 1.11-fold chlorination activity (Fig. 4D). In contrast, FLAMA37_T82A and FLAAdig_Nter_T82A retained their fluorination activity but displayed only 0.6- and 0.7-fold chlorination activity, respectively, compared to their wild-types (Fig. 4D). This underscores the potential significance of polar residues within the β3–β4 loop in governing ion egress dynamics and, in turn, affecting ion specificities.
Footnotes
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5sc00081e
‡ These authors contributed equally to the manuscript.
This journal is © The Royal Society of Chemistry 2025