Xingming
Pan†
,
Haixin
Li†
and
Liao-Bin
Dong
*
State Key Laboratory of Natural Medicines, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China. E-mail: ldong@cpu.edu.cn
First published on 10th December 2025
Covering: up to August 2025
Terpenoids constitute nature's largest and most structurally diverse class of natural products, with extensive applications in medicine, agriculture, and fragrance industries. Class I terpene synthases (TSs) create this remarkable diversity by converting linear isoprenoid diphosphates into complex, often polycyclic frameworks through intricate carbocation cascades. This review examines strategies for engineering TSs to generate diverse terpene skeletons—a key objective in synthetic biology. We summarize four core approaches: structure-guided design targeting active sites, water networks, and conserved motifs; evolutionary methods leveraging natural variation and phylogenetic insights; mechanism-focused engineering controlling specific carbocation intermediates; and techniques extending beyond the active site through second-shell modifications and contact mapping. These approaches are complemented by semi-rational and random methods including alanine scanning, saturation mutagenesis, and directed evolution, often enhanced by computational modeling and high-throughput screening. While the complexity of TS catalysis and often weak sequence-function correlations create significant engineering challenges, integration of structural biology, computational simulations, diverse engineering techniques, and advanced screening methods is steadily improving outcomes. Future advances in machine learning, mechanistic understanding, screening technologies, and metabolic engineering integration will further expand access to novel terpenoid chemical space for biotechnological exploration.
000 identified compounds.1–4 These molecules, derived from the universal five-carbon precursors isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP),5,6 exhibit a vast array of biological functions and are invaluable resources for human applications, including pharmaceuticals (e.g., paclitaxel,7,8 artemisinin9), agrochemicals,10,11 fragrances,12 and biofuels.13,14 The remarkable chemical diversity of terpenoids originates primarily from the activity of terpene synthases (TSs), enzymes that catalyze the transformation of linear isoprenoid diphosphate substrates (e.g., geranyl diphosphate (GPP, C10),15 farnesyl diphosphate (FPP, C15),16–18 geranylgeranyl diphosphate (GGPP, C20),19,20 geranylfarnesyl diphosphate (GFPP, C25))21 and C30 substrate squalene or epoxysqualene22,23 into a multitude of often complex cyclic and polycyclic hydrocarbon or alcohol skeletons.
TSs are broadly classified into two main types based on their reaction mechanisms and structural folds.2,24 Class II TSs typically possess a βγ-domain structure and initiate catalysis by protonating the substrate double bond or epoxide group, often cyclizing larger substrates like triterpenes (e.g., squalene).23,25–27 In contrast, class I TSs, the focus of this review, generally feature an α-helical fold (α-domain) and catalyze the ionization of the allylic diphosphate substrate, generating an initial carbocation.28,29 This highly reactive intermediate then undergoes a series of intricate and precisely controlled intramolecular cyclizations, rearrangements (including hydride and methyl shifts), and quenching steps (typically deprotonation or water capture), often referred to as a carbocation cascade.30,31 This complex cascade, guided by the enzyme's active site architecture, is the source of the vast skeletal diversity observed in mono-, sesqui-, di-, and sesterterpenes (C10–C25).32
The characteristic α-helical fold of class I TSs creates a hydrophobic active site cavity lined with specific residues that stabilize transient carbocation intermediates, guide the substrate through the reaction coordinate, and ultimately dictate the final product structure.28,29,33 Engineering class I TSs offers multiple advantages that majorly include (1) access to novel, non-natural terpene skeletons with unique bioactivities; (2) generation of analogs to optimize existing therapeutic compounds; (3) enabling sustainable bioproduction of valuable terpenoids that are otherwise obtained through inefficient extraction from natural sources or complex chemical synthesis; and (4) addressing limitations of wild-type (WT) enzymes in heterologous expression systems.34–36 Among these advantages, the generation of novel terpenoid skeletons through enzyme engineering has emerged as a particularly compelling objective, as new scaffolds can lead to compounds with unique biological activities or improved properties.37–39 This skeletal diversification capability is especially valuable, as chemical synthesis of complex terpenes is often challenging,40,41 whereas engineering class I TSs offers a powerful biosynthetic alternative for accessing both known and novel terpenoid structures.2,42 The inherent catalytic promiscuity and evolutionary plasticity of these enzymes make them attractive targets for protein engineering aimed at altering their product specificity and generating skeletal diversity.
Despite these opportunities, engineering TSs presents significant challenges: product specificity control is difficult due to the intrinsic promiscuity of many TSs; weak correlation exists between enzyme sequence and functional output; the complex, dynamic reaction mechanism involves multiple transient carbocation intermediates; and there is limited availability of high-resolution structures capturing catalytically relevant conformations. These factors constitute a “predictability barrier” that necessitates integrating rational approaches with empirical methods. This review focuses on the strategies employed to engineer class I TSs for the generation of diverse and novel terpenoid skeletons. We explore several core approaches, including: (i) structure-guided rational design targeting specific active site residues, water networks, or conserved motifs; (ii) evolutionary and comparative methods that leverage natural variation, phylogenetic insights, and functional switches identified between related enzymes; (iii) mechanism-focused engineering aimed at controlling specific carbocation intermediates or reaction branch points (BPs); (iv) techniques extending beyond the immediate active site, such as second-shell engineering and contact mapping; (v) protein segment engineering including domain swapping and chimeric approaches for functional conversion; and (vi) semi-rational and random approaches including alanine scanning, saturation mutagenesis, and directed evolution (Fig. 1). We also discuss emerging computational and data-driven methods incorporating machine learning and high-throughput screening technologies that show promise for future applications. These diverse strategies, often used in combination, navigate the complex catalytic landscape of these fascinating enzymes. Understanding and applying these strategies is crucial for unlocking the full biosynthetic potential of class I TSs and expanding the accessible chemical space of terpenoids for biotechnological applications.
Altering steric bulk within the active site can effectively interrupt or redirect the cyclization cascade.43 EIZS WT converts FPP into the complex tricyclic product epi-isozizaene with high fidelity (79% of the product mixture) (Fig. 2).43 Its active site is lined with aromatic residues that stabilize the series of carbocation intermediates required for the intricate cyclization. Crystal structures (e.g., EIZS WT, PDB ID: 4LTV) revealed that F96A causes minimal structural perturbations but significantly alters the active site contour (Fig. 2B). This modification removes a key steric barrier and cation-π stabilizing interaction, leading to the premature termination of the cascade. As a result, the product profile is substantially inverted, with the acyclic (E)-β-farnesene becoming the major product (70%), demonstrating a complete skeletal simplification from tricyclic to linear. Further mutations targeting other aromatic residues, such as F95H and F198L, similarly redirected the cascade to yield alternative monocyclic (β-curcumene) and tricyclic (β-cedrene) skeletons, respectively. Notably, the F198L mutant redirected the cascade to produce 61% β-cedrene, of which 80% was (−)-β-cedrene and 20% was (+)-β-cedrene, highlighting a significant loss of stereocontrol and the plasticity of the catalytic pocket (Fig. 2A).43
In parallel, modulating the pocket's polarity provides an orthogonal handle to steer carbocation intermediates toward alternative reaction trajectories. The introduction of polar residues, such as in the EIZS F96S or F96Q mutants, fundamentally alters the electronic environment of the active site.46 Crystallographic analysis (the unliganded F96S mutant, PDB ID: 6AXO) confirmed that these substitutions preserve the enzyme's architecture while creating localized electronic effects. These changes critically alter the stabilization of the homobisabolyl+, redirecting its cyclization pathway from the native tricyclic scaffold to a bicyclic framework, sesquisabinene A. The F96Q variant performs this conversion with near-perfect selectivity (97%).47 Building on these findings, a synergistic double mutant (F95S/F198S) was designed; the core hypothesis was that this mutation would synergistically expand the active site to accommodate both the bisabolyl+ and a water molecule—an effect neither single mutant could achieve alone. This non-additive expansion was confirmed by active site volume measurements (WT: 279 Å3vs. double mutant: 391 Å3). Structural analysis of this variant at 1.47 Å resolution (PDB ID: 8V3K) revealed that the elongated active site not only redirects the product skeleton from tricyclic to monocyclic but also positions a water molecule for nucleophilic attack, thereby converting the enzyme into a specific α-bisabolol synthase with high yield (74% at 4 °C). Crucially, the engineered active site shares key structural features with a natural α-bisabolol synthase (AaBOS from Artemisia annua),48 providing evolutionary validation for the design strategy.46 These studies collectively exemplify how precise, structure-guided sculpting of the active site can effectively intercept and reroute the carbocation flux to unlock novel terpenoid scaffolds.
Taxadiene synthase (TXS) engineering exemplifies the integration of multiple computational approaches to achieve precision skeletal reprogramming in the biosynthesis of paclitaxel (Taxol®), one of the most important anticancer agents.57,58 The initial three-dimensional crystal structures (PDB ID: 3P5P) of TXS revealed its active site cavity, identifying a group of polar residues (S587, Q609, Y688, C719, and C830) as potential candidates for catalysis. Subsequent engineering efforts altered the product profile by manipulating carbocation intermediates. One common outcome was the enhanced production of taxa-4(20),11(12)-diene, a more favorable isomer for paclitaxel biosynthesis. The Y688L59 or V584I56 reorients the final taxen-4-yl+, making its C20 methyl protons more accessible for deprotonation and favoring the exo-cyclic double bond formation.58,61 More reprogramming involves intercepting the catalytic cascade to yield entirely different carbon skeletons. Cembrene A, a monocyclic diterpene, is formed by halting the reaction after the first cyclization. This was achieved after molecular mechanic (MM)-based modeling predicted that the W753H mutation introduces an active base that prematurely terminates the reaction.55 Similarly, V610F sterically blocks the path to the second cyclization, also yielding cembrene A (Fig. 3).56 The reaction can also be terminated at the bicyclic stage to produce verticillene-type compounds.56 This strategy was guided by computational modeling, which indicated that altering the active site's shape could interrupt the process. The models predicted that V584M/L56 or Q609A/G,56,59 the bicyclic intermediate is physically pushed closer to the diphosphate moiety, triggering its premature deprotonation.
In summary, these residues function as sequential gatekeepers that can be engineered to intercept the native tricyclic cascade. Y688 exclusively directs the final deprotonation, controlling the double bond's regioselectivity on the intact skeleton. W753 and V610 control the first cyclization; their mutation introduces either a catalytic base (W753H) or a steric block (V610F), halting the reaction to release a monocyclic product. V584 and Q609 shape the active site for the second cyclization; mutating them alters this geometry, forcing the premature release of a bicyclic intermediate before the final ring can form.
The bacterial diterpene synthase (DTS) VenA demonstrates how crystal structure-guided manipulation of aromatic cage architecture can achieve skeletal rearrangements through systematic residue targeting.34 Crystal structures (PDB IDs, 7Y9H and 7Y9G) revealed seven critical aromatic residues (Y88, W107, Y108, F185, F215, F219, and F338) positioned within 5 Å of the GGPP, each contributing stage-specific stabilization energies ranging from −6.8 to −9.6 kcal mol−1.34 Quantum Mechanics/Molecular Mechanics (QM/MM) simulations demonstrated that Y88 and F215 jointly stabilize intermediate C during early cyclization stages after intermediate B undergoes a 1,3-hydride shift, while F185 and W107 control later reaction trajectories, particularly stabilization of non-classical carbocation intermediate F, which arises from the rearrangement and subsequent cyclization of the four-membered ring in intermediate E. The cascade involves a critical BP at the initial carbocation intermediate A, which in certain mutants (e.g., Y88A) can undergo direct 1,14-cyclization to form 14-membered monocyclic products like cembrene A. In the WT, the pathway proceeds linearly through a separate 1,14-cyclization (C → D), followed by a 2,15-cyclization toward intermediate E and ultimately the tetracyclic venezuelaene scaffold (Fig. 4). Systematic alanine scanning exploited these mechanistic insights to redirect the native 5/5/6/7 tetracyclic venezuelaene A pathway toward alternative skeletal frameworks. The Y88A mutation, disrupting early-stage intermediate stabilization, transformed products to acyclic β-springene and monocyclic cembrene A while retaining only 21.7% of WT activity. The W107A variant, affecting late-stage carbocation control, generated 3,7,18-dolabellatriene (5/11 bicyclic system) with a titer of 275.5 ± 9.4 mg L−1 through an alternative 1,11-cyclization rather than native 1,10-cyclization. The F215A mutation produced dictytriene C, representing a 5/7 bicyclic framework transformation. Additionally, a systematic combination of six mutations (Y88A/W107A/V111A/T112A/F215A/F219A) expanded the active site volume from 516.1 Å3 to 641.5 Å3, enabling substrate scope expansion to accommodate GFPP and produce the 14-membered monocyclic sesterterpene (S)-cericerene at 92.6% selectivity.
TXS and VenA engineering collectively demonstrate that aromatic residue networks function as programmable molecular switches for skeletal reprogramming.34,55–59 The success of this approach relies on detailed mechanistic understanding of stage-specific carbocation stabilization, enabling rational design of mutations that predictably redirect cyclization cascades toward desired skeletal outcomes. The integration of structural biology, quantum mechanical modeling, and systematic mutagenesis provides a robust framework for expanding the accessible chemical space of TSs beyond their native product profiles.
The engineering of BcBOT2, which naturally produces the tricyclic alcohol presilphiperfolan-8β-ol, demonstrates how single mutations can redirect complex cyclization cascades (Fig. 5B). The W94A mutation eliminated both steric bulk and aromatic stabilization, causing premature termination at the humulyl+ intermediate to yield the 11-membered macrocyclic α-humulene and its hydroxylated bicyclic derivative. The F99A variant produced the bicyclic β-caryophyllene and a related bicyclic alcohol through early termination at different carbocation intermediates. Most remarkably, the N325A mutant generated presilphiperfolene, which represents a ring-cleaved rearrangement product involving C-12 methyl migration—a skeletal reorganization from the native tricyclic framework. Additionally, this mutant produced germacrene A through alternative 1,10-cyclization. Similarly, modifications to DbPROS, which yields the tricyclic Δ6-protoilludene, led to significant skeletal simplification. The I193A mutation altered the active site geometry, shortening the C1–C10 distance from 3.6 Å to 2.6 Å, which promoted alternative 1,10-cyclization to form the germacrene A instead of proceeding through the normal tricyclic pathway. The F93A mutation removed both aromatic stabilization and steric constraints, leading to water attack and formation of a hydroxylated bicyclic product. CLM1 engineering revealed the most extensive skeletal diversification. While the WT produces the tricyclic alcohol longiborneol, site-directed mutagenesis generated diverse alternative frameworks. The T111A variant produced a mixture of mono- and bicyclic sesquiterpenes by disrupting dipole–cation interactions with early intermediates. The V83A and Y87A mutations yielded another set of products from the same intermediate through different termination mechanisms: a tricyclic hydroxylated alcohol and its corresponding tricyclic deprotonated olefin were formed, while another product resulted from an additional ring closure to form a distinct new tricyclic framework (Fig. 5B). These findings on STSs reveal how minimal alterations of active site properties can effectively redirect cyclization cascades toward a wide array of diverse carbon frameworks.35
Careful manipulation of aromatic residues can intercept carbocation intermediates. The DTS Bnd4, from Streptomyces sp. (CL12-4), produces the cis-fused 6/10 bicyclic diterpene benditerpe-2,6,15-triene through a 1,10-cyclization followed by 1,14-cyclization of GGPP.70,71 A computer model of Bnd4 identified an active site residue, W316. W316A/H altered the reaction pathway from the native 1,10-cyclization to a 1,14-cyclization, yielding cembrenes with a 14-membered monocyclic skeleton as the major products. A similar approach was used for AlbS from Streptomyces albireticuli, which synthesizes the trans-fused 6/10 bicyclic diterpene albireticulene via initial 1,10-cyclization, followed by a critical 1,3-hydride shift, and subsequent 1,14-cyclization (Fig. 6A).72 Density functional theory (DFT) calculations on the WT mechanism identified a key monocyclic carbocation intermediate formed after the initial 1,10-cyclization. The calculations showed this intermediate must overcome an 11.1 kcal mol−1 energy barrier for the subsequent 1,3-hydride shift to occur. This theoretical model provides a direct explanation for the experimental outcome of the Y214F mutation, a mutational target selected due to its homology with the critical Y197 residue in Bnd4.71 By altering a key residue, the mutant is unable to stabilize the transition state and facilitate the reaction over this energy barrier, thus intercepting the reaction at the monocyclic intermediate. As a result, the mutant's activity to form the native bicyclic product was reduced to 4% of WT, with the primary product being prenylgermacrene A, a 10-membered monocyclic germacrane.
Direct crystallographic visualization provides unparalleled precision for systematic scaffold diversification. The coral-derived synthase PcTS1, resolved at 3.5 Å resolution (PDB ID: 8ZWZ), exemplifies how structural data guides comprehensive active site engineering. The WT enzyme catalyzes the formation of elisabethatriene, a diterpenoid featuring a 6/6 bicyclic biflorane skeleton through multiple cyclizations and hydride shifts (Fig. 6B).42,73 The crystal structure reveals a distinctive class I TS α-helical fold characterized by the absence of helix K, elongation of helices H and M, and the presence of an additional helix O.69 Systematic mutation of crystallographically defined positions produced notable scaffold diversity. The I254A mutant generated three distinct products: a 10-membered monocyclic intermediate from the initial 1,10-cyclization, and two constitutional isomers of the native biflorane skeleton resulting from deprotonation at alternative sites. This substitution enlarges the active site cavity, which provides the final carbocation intermediate with greater conformational freedom, leading to these varied quenching patterns. Similarly, the S338I mutation yielded a novel cage-like tricyclic structure by altering the stability of an earlier bicyclic intermediate formed after the 2,6-cyclization step, a change proposed to promote an additional, non-native cyclization event. Other designed mutations successfully intercepted specific intermediates along the catalytic cascade to produce a cyclopropane-containing product (I59L), a constitutional isomer of the native scaffold (F93Y), and an alternative tricyclic scaffold (F221V).69
The first high-resolution eunicellane cyclase structure, AriE at 1.87 Å (PDB ID: 9LPB), revealed precise aromatic residue contributions to product specificity.65 The crystal structure provides clear mechanistic explanations through its detailed active site architecture (volume 588.80 Å3, surface area 481.47 Å2, depth 16.18 Å), where W73 and neighboring residues form a critical hydrophobic pocket. Molecular docking identified W73 as a critical regulatory hotspot, and saturation mutagenesis at this position altered product distributions. Modifications to the W73 disrupt the stabilization of specific carbocation intermediates and the trajectory of subsequent hydride shifts, preventing the formation of the native cis-6/10-bicyclic benditerpetriene and instead yielding diverse scaffolds including monocyclic cembrenes and products with inverted stereochemistry.65 Conversely, the L96V mutation converted AriE into a selective cis-6/10-bicyclic eunicellane synthase by subtly refining the active site contour, which enhances stabilization for the bicyclic pathway while restricting the formation of cembrene byproducts.65 These studies collectively demonstrate how computational frameworks can systematically unlock vast chemical diversity through rational manipulation of aromatic residue networks, water positioning, and stereochemical control elements across diverse TS families.
The engineering of Sorangium cellulosum cubebol synthase (ScCubS) exemplifies how multiple aromatic residues function in concert to control tricyclic cubebane formation (Fig. 7).78 High-resolution crystal structure determination (1.80 Å resolution, PDB ID: 7ZRN) revealed that F104 stabilizes the C7 position of cation C through cation-π interactions, while F211 serves as a barrier against premature water quenching of reaction intermediates, with S206 positioning F211 correctly through a hydrogen-bonding network.78 Substitutions at these critical positions substantially redirected cyclization pathways from the native tricyclic 10-epi-cubebol to alternative scaffolds. The F104Y mutation disrupted cation-π stabilization at C7, favoring water quenching of an earlier intermediate and effectively transforming the enzyme into a germacradien-4-ol synthase with ∼80% selectivity. This mutation shifted the major product to germacradien-4-ol at 7.59 mg L−1, while 10-epi-cubebol production decreased from ∼45 mg L−1 in the WT to 1.23 mg L−1. The F211A variant removed the steric and aromatic barrier preventing water access, allowing deprotonation at intermediate C to produce γ-cadinene or water addition to form 1,10-di-epi-cubenol (bicyclic cadalane scaffold), while S206G disrupted the hydrogen-bonding network positioning F211, preventing the third cyclization and resulting in predominantly cadalane-type compounds.78
Conserved tyrosines function as conformational gatekeepers that dictate cyclization trajectories through well-defined hydrogen bonding networks.79–81 The Y573 position in Mentha spicata S-limonene synthase (LS) represents a paradigmatic example, as crystal structure (PDB IDs: 2ONG and 2ONH) analysis identified this residue as uniquely conserved in cyclic MTSs but variable in acyclic ones.82 Native enzyme converts GPP primarily to limonene (monocyclic, 96.69%) with minor bicyclic products through initial ionization to geranyl+, isomerization to linalyl diphosphate, re-ionization to linalyl+, 1,6-cyclization to terpinyl+, and final deprotonation (Fig. 8). The Y573F substitution, preserving aromaticity while eliminating the hydroxyl group, altered this distribution despite reducing activity to 2.79%—significantly increasing bicyclic compounds including α-pinene (2.76%), β-pinene (2.15%), and sabinene (4.77%) by allowing the terpinyl+ to undergo 2,7-cyclization to pinyl+ or alternative rearrangements. MD simulations revealed a critical hydrogen bond between Y573 and D496, with closer proximity between Y573 and carbons C7–C9 of the linalyl diphosphate intermediate in the cyclization-prone helical conformation. Unlike residues that primarily stabilize carbocations, Y573 functions earlier in the cascade by directing intermediate conformation and initial cyclization events.79
Aromatic residues, particularly phenylalanine, can serve as molecular switches controlling carbocation cyclization through a combination of steric effects and potential cation-π interactions, with the relative contribution depending on spatial positioning. Structure-guided mutagenesis of bacterial cineole synthase (bCinS) coupled with QM/MM calculations identified F77 as the critical determinant of cyclization specificity. The F77A substitution fundamentally redirected product formation from 96% 1,8-cineole (bicyclic) to primarily linalool (acyclic), with QM/MM calculations revealing that F77 stabilizes the α-terpinyl+ intermediate with exceptional efficiency (−5.7 kcal mol−1 interaction energy) compared to other phenylalanine (−1.2 to −2.8 kcal mol−1). This stabilization enables water capture by the terpinyl+ to form the cyclic ether of 1,8-cineole (Fig. 8). The aromatic nature proved essential—F77W restored cineole production, while F77H redirected cyclization toward monocyclic products like limonene and α-terpineol through alternative deprotonation or hydroxylation pathways. Complementary engineering at position A301 demonstrated architectural control mechanisms, where A301V/L/F mutations eliminated cineole production by creating steric clashes with W58 that disrupted active site geometry necessary for productive cyclization.83
This principle is well-demonstrated by the engineered conversion of δ-cadinene synthase (DCS), which naturally produces a bicyclic hydrocarbon, into a germacradien-4-ol synthase (GdolS) that selectively produces a monocyclic alcohol.85,86 This transformation was achieved through two distinct but complementary approaches (Fig. 9). A highly effective “single-residue gatekeeper” strategy involved mutating W279, a residue that forms a hydrophobic barrier at the active site entrance. The W279A opened a water channel and inverted the product profile, shifting from 90% bicyclic δ-cadinene to 89% monocyclic germacradien-4-ol by allowing water capture at the monocyclic intermediate before the second cyclization, while maintaining high catalytic efficiency.87 Alternatively, restructuring the more distant G-helix (e.g., with an N403P/L405H double mutant) also successfully opened a path for water to quench the monocyclic intermediate, yielding 93% germacrene D-4-ol, although the catalytic activity of this mutant was severely compromised relative to the WT.88–90
Beyond controlling general water access, more precise engineering strategies have emerged that target specific residues involved in water activation for nucleophilic attack on carbocation intermediates.91 The case of bCinS demonstrates how identifying water-activating residues precisely enables controlled redirection of product formation through minimal mutations. Structural analysis and MD simulations identified N305 as the key residue coordinating a water molecule for nucleophilic attack on the α-terpinyl+ intermediate in bCinS (Fig. 9). The WT enzyme exhibits exceptionally high fidelity, producing bicyclic 1,8-cineole with 95% specificity and minor amounts of monocyclic α-terpineol (2%). N305A completely eliminated 1,8-cineole production, redirecting catalysis toward non-hydroxylated terpenes including monocyclic β-phellandrene and bicyclic β-pinene and sabinene. Despite this shift in product profile, the variant maintained substantial activity, although its total terpene production was markedly reduced compared to the high levels of the WT. MD simulations revealed that these mutations fundamentally disrupted the water coordination network, with quantum mechanics calculations suggesting that N305 likely functions as a transient proton acceptor in the final deprotonation step of 1,8-cineole formation. This skeletal transformation occurs because disrupting water activation prevents hydroxylation at the α-terpinyl+ stage, allowing alternative cyclization pathways and direct deprotonation to predominate.91
In selinadiene synthase (SdS), engineering of the K-helix successfully converted the enzyme from a bicyclic selinadiene producer to a producer of the monocyclic intermediate, germacrene B.92 The native pathway proceeds through germacrene B formation followed by a second cyclization to yield the bicyclic selina-4(15),7(11)-diene. Notably, the double mutant A301Y/G305E resulted in the exclusive production of germacrene B, completely halting the reaction cascade before the second cyclization. This high degree of product specificity, however, came at the cost of a 162.8-fold reduction in catalytic efficiency.92
The STS 7-epi-α-eudesmol synthase (SvES) from Streptomyces viridochromogenes exemplifies how systematic cavity expansion can transform both substrate scope and product architecture (Fig. 10).96 Structure-guided comparison with SdS identified key hydrophobic residues defining active site boundaries, particularly positions A76 and F77 located directly in front of the Asp-rich motif. SvES WT normally converts FPP to bicyclic 7-epi-α-eudesmol (66%) and monocyclic products, displaying modest substrate scope limited to C15 precursors. The A76G variant showed slightly reduced sesquiterpene production while gaining the ability to convert the larger GGPP (C20) into macrocyclic diterpenes cembrene A and nephthenol (28 ± 2%). This transformation represents not merely substrate expansion but a redirection from bi/monocyclic to macrocyclic scaffolds. More substantial results emerged from F77 substitutions, where systematic mutations (F77G/A/V/L/I/M/S/T/C) all enabled GGPP acceptance with yields ranging from 6.8 ± 0.4% (F77C) to 85 ± 6% (F77I). The substrate range was expanded further by smaller substitutions (F77G/A/V/S/T), which enabled the enzyme to accept GFPP (C25), yielding macrocyclic sesterterpenes with yields up to 11 ± 2% (F77A). The mechanism involves creating additional space to accommodate larger substrates while maintaining critical catalytic interactions, with products showing substrate-dependent stereoselectivity (14R for GGPP products, 14S for GFPP products).96
Complementing this expansion strategy, the fungal linalool synthase (Ap.LS) from Agrocybe pediades demonstrates how active site reshaping can simultaneously transform substrate preference and reaction mechanism. Resolution of the first crystal structure of Ap.LS at 1.99 Å (PDB ID: 8GY0) revealed a key constriction in the substrate tunnel formed by the Y299-S184-M77 trio, explaining the enzyme's native specificity for GPP and exclusive production of linear linalool. Notably, this enzyme exhibits high efficiency—44-fold and 287-fold higher than bacterial and plant counterparts (Fig. 11).97
Saturation mutagenesis at position Y299 produced mechanistic transformation. Several variants (Y299A/C/G/Q/S) gained the ability to accept FPP as substrate and altered cyclization pathways, producing bicyclic α-selinene, bicyclic β-selinene, and monocyclic β-elemene. Notably, β-elemene is formed via a heat-induced Cope rearrangement of the initial enzymatic product, germacrene A, during analysis. The Y299G variant additionally generated linear nerolidol.97 This represents a conversion from a highly specific monoterpene synthase (MTS) producing exclusively linear products to a promiscuous STS generating diverse cyclic scaffolds. Computational analysis revealed that the Y299A mutation significantly reduced FPP torsion strain energy (30.42 vs. 45.36 kcal mol−1 in WT), creating space for the longer C15 chain while altering carbocation stabilization patterns. The biological relevance was confirmed in homologous enzymes Ap.LNS and Hf.LS, where equivalent mutations produced similar transformations.97
Unlike traditional engineering that modifies residues directly interacting with intermediates, this approach physically reconfigures the catalytic environment to enable entirely new substrate capabilities and reaction modes. The ability to transform cyclization patterns from bicyclic to macrocyclic scaffolds (e.g., SvES) or convert linear monoterpene synthesis into cyclizing sesquiterpene production (e.g., Ap.LS) establishes active site cavity engineering as a transformative method for expanding terpene scaffold diversity.97
Systematic volume calibration exemplifies this precision approach, as demonstrated in the polytrichastrene synthase (CpPS) from Chryseobacterium polytrichastri.101 Originally mischaracterized as a linalool synthase, this enzyme was revealed to be a DTS producing tetracyclic polytrichastrene A and tricyclic polytrichastrol A featuring an unusual ethyl group in terpenes, alongside minor products wanju-2,5-diene and thunbergol. Homology-based analysis using the SdS crystal structure (PDB ID: 4OKM) as template identified key active site positions that control product specificity (Fig. 12).74 The I66F substitution, designed to introduce an aromatic residue common in other DTSs, dramatically transformed the product profile. This single mutation generated five new compounds, including three with entirely new skeletons—the tetracyclic polytrichastrol B and polytrichastrene B, and the tricyclic bonnadiene-type bonn-2-en-11α-ol—along with wanju scaffold variants wanju-2,6-diene and wanju-2-en-6α-ol. The mutation achieved 195 ± 43% of WT activity, demonstrating that product diversity can be enhanced without sacrificing catalytic efficiency. Other adjustments—A87T, A192V, and their combination—shifted production toward the tricyclic wanju-2,5-diene scaffold while maintaining high activity levels (108–176% of WT).101
Sesterterpene synthases (StTSs) exemplify how architectural sculpting enables the formation of complex polycyclic systems.102 Unlike broad pocket modifications, this approach targets the three-dimensional arrangement of stabilizing residues to direct intricate macrocyclic architectures unique to larger terpene frameworks. Crystallographic analysis of fungal StTSs, NfSS (PDB ID: 8YLA) and PbSS (PDB ID: 8YL9), identified key aromatic residues (F191, F196, Y160, W164 in NfSS) forming cation-π interactions and aliphatic residues (T88, I92) facilitating hydrophobic interactions with carbocation intermediates (Fig. 13).102 Site-directed mutagenesis at these positions transformed the original 5/8/6/5 tetracyclic system into diverse scaffolds: the NfSS T88A and I92A alternatively yielded 5/10/5 tricyclic and 5/15 bicyclic skeletons. In PbSS, residue F102 controlled initial cyclization mode, with PbSS F102A redirecting C1–C15 linkage to C1–C14 linkage, yielding 14-membered ring systems. Notably, mutations of residues not directly involved in catalysis (PbSS W169A) simultaneously produced multiple scaffolds while maintaining WT activity levels, demonstrating the high sensitivity of complex polycyclic formation to architectural perturbations.102
Supporting these engineerxing advances, structural studies of bacterial MTSs reveal critical determinants that could guide future precision modifications.103 Structure analysis of bCinS (PDB IDs: 5NX6, 5NX7) and bLinS (PDB IDs: 5NX4, 5NX5) demonstrates how specific residue pairs control substrate acceptance: aromatic residues F77/F179 in bCinS constrict the active site for GPP and bicyclic products, while non-aromatic residues T75/C177 in bLinS create larger cavities accommodating both GPP and FPP for acyclic products. These structure–function relationships establish clear predictions for rational engineering: modifying aromatic residues to smaller variants could expand substrate scope, while introducing bulky aromatics might constrain cavities to favor cyclization.103
The I644A mutation, designed to alter backbone catalytic function, completely transformed product specificity from ent-abietane to ent-pimarane skeleton, yielding exclusively ent-sandaracopimaradiene while maintaining catalytic efficiency (Fig. 14). QM/MM calculations revealed that EfTPS14 WT facilitates a concerted but asynchronous process (8.4 kcal mol−1 barrier) where the I644 carbonyl mediates deprotonation at C14 followed by reprotonation at C16 and methyl migration. The I644A variant retained only deprotonation function while losing proton transfer capability, effectively arresting the reaction at an intermediate stage to produce ent-pimarane rather than ent-abietane skeletons. Additional computational optimization through the L765F/L762S double mutant redirected deprotonation regioselectivity to exclusively produce ent-abietadiene, demonstrating how distal residue modifications can fine-tune reaction outcomes.105
By targeting protein backbone elements and aromatic transport networks through computational prediction, these strategies access control mechanisms previously overlooked in traditional protein design, enabling skeletal interconversions that would be challenging to achieve through conventional active site engineering approaches.105
The successful application of this strategy is exemplified by the rational engineering of amorpha-4,11-diene synthase (AaADS) from Artemisia annua, where phylogenetic analysis of six related STSs with distinct cyclization patterns guided site-directed mutagenesis (Fig. 15).110 Sequence alignment between ADS (bicyclic amorpha-4,11-diene producer), BOS (monocyclic α-bisabolol producer), and GAS (monocyclic germacrene A producer) identified differential residues that control first and second cyclization events. The T296V mutation, designed based on the finding that valine occupies this position in acyclic (E)-β-farnesene synthase, substantially shifted production from the native bicyclic scaffold to 93.7% linear (E)-β-farnesene. Structural modeling revealed that T296's hydroxyl group stabilizes the farnesyl+ during ionization, while valine's hydrophobic character disrupts this critical interaction, causing premature cascade termination. The systematic L374Y/L404V/L405I/G439S tetra-substitution, based on residues found in GAS, blocked the second cyclization through steric hindrance, yielding 80% monocyclic bisabolyl-type products including 61.5% cis-γ-bisabolene. Significantly, five of the seven critical residues (T296, L374, G439, T399, and T447) are located adjacent to conserved metal-binding motifs, establishing these regions as crucial leverage points for scaffold engineering.110
The evolutionary hypothesis is that modern 1,10-cyclases evolved from promiscuous 1,11-ancestors through gene duplication. This hypothesis was experimentally validated using germacrene A synthase (GAS) from Solidago canadensis.111 This enzyme naturally produces >96% monocyclic germacrene A (1,10-cyclization) and only ∼2% α-humulene (1,11-cyclization) (Fig. 16). Systematic substitution of G402 with amino acids of increasing steric bulk progressively shifted product distribution toward 1,11-cyclization products, with the G402C variant producing 62.5% α-humulene while maintaining half of WT catalytic efficiency. Parallel mutations in the homologous germacrene D synthase (GDS G404V) generated ∼20% bicyclic bicyclogermacrene, a unique cyclopropane-containing scaffold that structurally links both cyclization pathways. Mechanistic studies using [1-3H1]-10-fluoro-FPP and deuterium-labeled [12,13-2H6]-FPP revealed a germacrene-humulene rearrangement connecting the 1,10- and 1,11-pathways, likely through a bridged 1,10,11-carbocation intermediate. Kinetic isotope effects (kH/kD = 4.98 for GAS WT, 4.29–5.09 for G402 variants) indicated that G402 functions as an evolutionary ‘plasticity residue’ that can redirect cyclization preference while maintaining substantial catalytic activity.111
Comprehensive phylogenetic analysis across plant STSs revealed that a single preNSE/DTE residue—the amino acid positioned four residues upstream of the conserved NSE/DTE ion-binding motif—functions as a conserved molecular switch between 1,10- and 1,11-cyclization modes (Fig. 17).112 Statistical analysis demonstrated that 1,10-cyclases predominantly contain hydrophilic residues (C, S, G, T) at C440, while 1,11-cyclases harbor hydrophobic residues (A, L, I, V). Single-point mutations effectively reversed cyclization specificity: the C438G in SmSTPS3 shifted production from bicyclic 5-epi-eremophilene to 54.7% monocyclic elemol, while S454G in AaGAS transformed the profile from β-elemene to 74.4% elemol. Additionally, S441G mutation in AtTPS21 converted the enzyme from producing 92.9% bicyclic β-caryophyllene (9/4 bicyclic ring system) to 69.2% monocyclic germacrene A-11-ol (10-membered ring). Crystal structure analysis from tobacco epi-aristolochene synthase TEAS (PDB ID: 5EAU)28 revealed this residue's proximity to the FPP isopropyl tail, suggesting it controls substrate folding to determine whether C1 attacks in a Markovnikov (1,10-cyclization) or anti-Markovnikov (1,11-cyclization) manner.112
The integration of phylogenetic analysis with high-resolution crystallography enabled precise engineering of 1,8-cineole synthase (Sf-CinS1) from Salvia fruticosa, whose 1.95 Å (PDB ID: 2J5C) crystal structure identified N338 as a critical plasticity hotspot.113 This residue activates water for nucleophilic attack on the α-terpinyl+ intermediate, leading to ether formation in 1,8-cineole. The N338I mutation eliminated water activation, transforming the enzyme from producing primarily cyclic ether (72.4% 1,8-cineole) to bicyclic sabinene (48.3%) and monocyclic limonene (37%) (Fig. 18A). Optimization via the N338I/A339T/G447S/I449P/P450T quintuple mutation yielded an enzyme producing 86.8% sabinene. Notably, the N338A mutation enlarged the active site cavity sufficiently to accommodate FPP, converting the MTS into a STS producing 49% bicyclic trans-α-bergamotene (Fig. 18B), demonstrating how key evolutionary residues can control not only product scaffold but also substrate specificity.113
Further refinement of this system identified a secondary residue (typically leucine or valine) adjacent to the primary I/T switch that interactively affects product outcome.117 This secondary residue exerts an epistatic or additive effect, essentially “tuning” the active site cavity. For instance, in OsKSL5, pairing the primary reversal T664I with the secondary substitution L661V was required to fully restore specific tetracyclic ent-isokaurene production. This additive interactivity was further demonstrated in castor bean (Ricinus communis) KS enzymes, where double mutations significantly abbreviated cyclization cascades. The L633S/I636T substitution in RcKSL2 converted the native pentacyclic ent-trachylobane to a mixture of simplified tricyclic and bicyclic products, while the L629S/V632T substitution in RcKSL4 similarly redirected catalysis from tetracyclic ent-beyerene to simplified scaffolds (Fig. 19A). Moreover, rational combination of the secondary switch mutation L635V with the primary site mutation I638S in AtKS successfully engineered the predominant production of a novel hydroxylated diterpene, 8α-hydroxy-ent-pimar-15-ene, by likely relaxing active site confinement to better accommodate the attacking water molecule.117
Extending the I/T paradigm established by these foundational studies, the evolutionary principles governing residue switches in DTSs were found to extend beyond cyclization depth control to encompass the critical decision between hydroxylation and deprotonation pathways. Two highly homologous enzymes (98% identity) from Isodon rubescens, IrTPS2 and IrKSL3a, produce different products from the same substrate (+)-copalyl diphosphate despite their near-identical sequences. Position 522/523 emerged as important, with IrKSL3a exclusively generating isopimaradiene while IrTPS2 produces the hydroxylated nezukol. This functional divergence mirrors the I/T switch paradigm discovered in KS enzymes, wherein residues containing β-methyl groups versus hydroxyl-bearing residues determine reaction trajectory. Systematic analysis revealed that residues containing β-methyl groups control product formation through selective occlusion of water binding sites. QM/MM simulations demonstrated that when A523 in IrTPS2 is mutated to isoleucine, the β-methyl physically blocks water binding to S499/T527, preventing hydroxylation and redirecting catalysis toward deprotonation pathways (Fig. 19B). Conversely, the reciprocal mutation in IrKSL3a (I522A) alone produced minimal nezukol, but combining it with A498S to restore the water-binding dyad (IrKSL3a I522A/A498S) resulted in a 7.9-fold increase in nezukol production, demonstrating that both the absence of steric hindrance and the presence of hydrogen-bonding residues are required for hydroxylation. Notably, T527 in IrTPS2 is highly conserved across Lamiaceae DTSs, paralleling the conservation of threonine residues in the classical I/T switch and suggesting its fundamental role in water-mediated hydroxylation. This steric effect alters the reaction trajectory without disrupting overall active site architecture. Computational analysis quantified these effects through QM/MM free energy calculations: water addition in IrTPS2 is favorable (exothermic by −11.7 kcal mol−1) with virtually no energy barrier, while deprotonation in IrKSL3a requires overcoming a substantial 17.0 kcal mol−1 energy barrier. These energetic differences explain how single amino acid variations translate to different product distributions, demonstrating that the β-methyl versus hydroxyl-bearing residue principle operates not only in controlling cyclization depth (as shown in KS/KSL systems) but also in governing terminal quenching chemistry through modulation of the local chemical environment. The requirement for multiple coordinated mutations (I522A/A498S) to efficiently convert IrKSL3a function underscores the extended evolutionary process underlying the specialized production of hydroxylated diterpenes.109
A parallel example of minimal intervention engineering is found in maize TSs TPS4 and TPS5, which share 98% sequence identity yet produce distinct product distributions.119 TPS5-Del1 yields sesquithujene (28.2%, bicyclic), (S)-β-bisabolene (26.6%, monocyclic), and (E)-β-farnesene (13.2%, acyclic), while TPS4-B73 produces 7-epi-sesquithujene (24.4%, bicyclic), (S)-β-bisabolene (29.1%, monocyclic), and (E)-β-farnesene (9.2%, acyclic). Comparative analysis identified positions 407–411 as key determinants, with the single G409A mutation altering product distribution by favoring (S)-β-bisabolene (monocyclic) production while reducing bicyclic compounds to barely detectable levels (Fig. 21). Adding the A410T mutation partially restored bicyclic product formation, demonstrating the interplay between adjacent residues. Structural modeling located position 409 within a conserved helix G kink at the active site bottom, where the additional methyl group in alanine versus glycine alters the formation rates of (R)- and (S)-bisabolyl+ intermediates. This subtle change explains why TPS5 derives approximately 95% of products from the (S)-bisabolyl+ pathway, while TPS4 utilizes both pathways equally, creating their divergent product profiles despite minimal sequence differences.119
The complexity of multi-residue engineering is illustrated by maize TS paralogs TPS4 and TPS10, which exhibit substantial evolutionary divergence (amino acid similarity of 57%) compared to the minimal differences seen in allelic variants.120 TPS4 naturally converts FPP into bicyclic 7-epi-sesquithujene and monocyclic β-bisabolene, whereas TPS10 produces bicyclic (E)-α-bergamotene and acyclic (E)-β-farnesene from the same substrate. Sequence comparison combined with homology modeling identified 17 amino acid differences within the active site region between these enzymes (Fig. 22). Systematic mutagenesis revealed varying impacts on product specificity, with the Y382S substitution notably redirecting catalysis toward (E)-β-farnesene production. While the comprehensive TPS4-c17 variant incorporating all 17 active site mutations achieved only partial functional conversion (61.5% (E)-β-farnesene and 14.4% (E)-α-bergamotene), complete functional mimicry required two additional second-shell substitutions, I411F and R442K. The resulting enzyme (TPS4-c17 R442K/I411F) produced 56.5% (E)-β-farnesene and 21.0% (E)-α-bergamotene, approaching TPS10's natural distribution of 50.4% and 35.9%, respectively. Structural examination from homology modeling revealed that despite pointing away from the substrate-binding cavity, I411F at the N-terminal end of helix G2 indirectly influences active site geometry, while R442K affects the positioning of the G1 helix, demonstrating how secondary structural elements modulate active site conformation and product outcomes.120
A more refined approach to multi-residue engineering is exemplified by A. annua TSs AaBOS and AaADS. Despite sharing 92% nucleotide sequence identity, AaBOS yields monocyclic α-bisabolol (92.8%) while AaADS generates bicyclic amorpha-4,11-diene as its major product. Through systematic domain-swapping and mutagenesis, a distinctive “BOS motif” (named after BisabOlol Synthase) comprising four key residues (V373, I395, N398, L399) that collectively control cyclization pathways was identified. These spatially proximal residues—with V373 on helix F and I395/N398/L399 on helix G1—form a functional unit within the substrate-binding pocket that acts as a molecular switch for product determination.48 Engineering efforts on AaBOS produced variant AaBOS-M2 (V373N/L381A/I395V/N398I/L399T) that generated 68.8% γ-humulene, a monocyclic product with alternative ring closure pattern, representing significant improvement over the tetra-substituted AaBOS-M1 (V373N/I395V/N398I/L399T), which produced 29.3% γ-humulene (Fig. 23). Crystal structure analysis (PDB IDs: 4FJQ for AaBOS, 4GAX for AaBOS-M2) revealed specific roles for each plasticity residue: the L399T substitution proved essential for γ-humulene formation, with its hydroxyl group promoting 1,11-ring closure through altered carbocation stabilization, while L381A doubled γ-humulene production by reducing steric hindrance at the pocket rim. Reciprocal engineering in AaADS demonstrated the broader applicability of this approach, where the T399S mutation increased amorpha-4,11-diene conversion by 70% and catalytic efficiency (kcat/Km) by 71.7% through optimized active site hydrophilicity—demonstrating the potential of plasticity residue substitution for improving enzymatic performance.48
The most comprehensive multi-residue engineering strategy involves mapping epistatic networks that control fundamental catalytic transformations, as demonstrated in the evolutionary transition from linear to cyclic terpene production in A. annua.121 Structure-based Combinatorial Protein Engineering (SCOPE) generated an extensive library of ∼27
000 mutants by breeding natural variations from AaADS into (E)-BFS, enabling comprehensive exploration of the functional landscape. From this collection emerged Y402L as a dominant mutation that fundamentally transformed BFS function, producing 15 distinct cyclic terpenes (∼75% of total products) instead of linear compounds while maintaining near WT catalytic efficiency (kcat/Km = 5.9 × 10−3vs. 10.5 × 10−3 µM−1 s−1). Mechanistic analysis revealed that Y402L delays proton elimination and releases the substrate from its unproductive all-trans conformation, enabling 2,3-σ-bond rotation in the neutral nerolidyl diphosphate (NPP) intermediate for subsequent cyclization. Complex epistatic interactions formed the foundation of this functional shift: V467G acts as a second-site suppressor, reverting cyclization to linear products despite being ∼10 Å from Y402L, while Y430 adds further complexity through Y430A reactivating cyclization in the Y402L/V467G background (55% cyclic products) and enhancing cyclization when paired with Y402L alone (87% cyclic products) (Fig. 24). Quantitative fitness landscape analysis identified positions 402-467-430 as a strongly epistatic residue network (roughness/slope ratio = 0.57) collectively controlling cyclization emergence, with the Y402L/Y430A pair representing a fitness maximum within this network.121
The concept of gatekeeper residues emerges as a universal mechanism for controlling carbon skeleton formation across diverse enzyme architectures, as demonstrated in fungal haloacid dehalogenase-like terpene cyclases (HAD-TCs) from Antrodia cinnamomea.124,125 Despite representing a unique enzyme family with distinctive architecture compared to plant TSs, these fungal enzymes employ similar spatial control mechanisms. AncC and AncA share 94% sequence identity yet generate different carbon skeletons: AncC forms a bicyclic drimane-type scaffold ((+)-albicanol) while AncA produces a monocyclic cyclofarnesoid scaffold ((R)-trans-γ-monocyclofarnesol) from identical FPP. Molecular docking combined with AlphaFold2 structural analysis pinpointed just 18 residue differences between their TC domains, with two gatekeeper residues in AncA (Y283/F375) emerging as critical determinants. These residues create a narrower active site pocket that restricts conformational freedom of reaction intermediates, directing cyclization toward monocyclic products. Individual mutations (Y283F or F375L) produced minimal effects, demonstrating cooperative gatekeeper function, while the double mutant (Y283F/F375L) transformed AncA into an efficient (+)-albicanol producer, effectively recreating AncC-like activity (Fig. 25). Domain-swapping experiments confirmed that the TC domain exclusively determines scaffold identity, with kinetic analysis revealing that cyclization occurs more rapidly than subsequent dephosphorylation, establishing specific architectural constraints rather than altered reaction rates as the key determinant.124
High-resolution structural analysis provides crucial insights into conserved catalytic mechanisms, as exemplified by crystallographic studies of Coniophora puteana δ-cadinol synthases Copu5 and Copu9.126 The first crystal structure of Copu9 was determined at 1.83 Å resolution (PDB ID: 7OFL), capturing a catalytically relevant state with three Mg2+ ions and substrate analog bound in the active site. This structural snapshot illuminated universal elements of sesquiterpene cyclization: the “effector triad” comprising R190, D191, and S193 in Copu9 (R182, D183, and S185 in Copu5) sits at the C-terminal end of the G1 α-helix and plays a crucial role in substrate ionization, while residues G194 and C195 in Copu9 (G186 and C187 in Copu5) position near the substrate's phosphate moiety to control carbocation formation and stabilization. Structure-guided mutagenesis targeting these conserved elements produced variants—Copu5 C187V, N220A, and S306N, along with Copu9 N228A and S314N—that generated monocyclic germacrene D-4-ol as a notable side product while maintaining (+)-δ-cadinol as the main product (Fig. 26). These mutations affect the active site architecture at the interface between hydrophilic and hydrophobic regions, specifically influencing the reaction trajectory of the germacryl+ intermediate and demonstrating how conserved structural motifs control cyclization outcomes across fungal species.126
The conserved nature of aromatic stabilization mechanisms is further demonstrated in bacterial TSs from the thermotolerant organism Rubrobacter radiotolerans, where leucine/phenylalanine switches profoundly influence product specificity through modulation of cation-π interactions.127 Homology modeling and molecular docking guided systematic mutagenesis of nerolidol synthase (NerS) and trans-α-bergamotene synthase (BerS), revealing that the BerS L86F substitution approximately doubled (E)-β-farnesene (acyclic) production compared to WT (26.2 vs. 12.5 mg L−1) while enhancing overall terpene yields (Fig. 27). This mutation significantly altered the (E)-β-farnesene:trans-α-bergamotene ratio from ∼17
:
1 to ∼10
:
1 and enabled production of trace amounts of β-bisabolene (monocyclic), a product absent in WT. The introduced aromatic ring at position 86 provides critical cation-π stabilization of reaction intermediates, enhancing substrate isomerization required for product diversification. Conversely, the BerS S82L variant increased product titers but favored non-isomerized products, shifting the ratio from ∼17
:
1 to ∼23
:
1 through steric hindrance that blocks crucial isomerization steps. Complementary evidence from NerS engineering, where F79A mutation reduced nerolidol (acyclic) production approximately 15-fold, underscored the importance of the phenylalanine residue—through both its steric bulk and potential aromatic interactions—in stabilizing isomerized carbocation intermediates. The thermotolerant origin of these enzymes was reflected in their optimal activity at 50 °C (turnover number of 0.38 min−1, more than three times faster than at 30 °C), demonstrating how conserved aromatic stabilization mechanisms operate across diverse environmental conditions.127
Fine-tuning carbocation stabilization patterns offers even more streamlined control over diterpene cyclization, as demonstrated in conifer DTSs where a single hydroxyl-bearing residue determines the fundamental BP between bicyclic and tricyclic skeletal formation.132 Sequence alignment of conifer DTSs revealed a crucial pattern: abietadiene-producing enzymes from grand fir (AgAS) and Norway spruce (PaAS) share a conserved alanine at a key position, while the pimaradiene-specific synthase (PaPS) from Norway spruce features serine at the equivalent location. Introduction of this single modification (A723S) into AgAS fundamentally redirected product outcome from >95% tricyclic abietadienes to >95% tricyclic pimaradienes—predominantly isopimara-7,15-diene (75%) and sandaracopimara-8(14),15-diene (21%) (Fig. 28). The high efficiency of this conversion defied conventional expectations, as the A723S mutant maintained comparable catalytic rates to AgAS WT while exhibiting slightly increased specific activity (∼2-fold). Mechanistic analysis revealed that the introduced hydroxyl group selectively stabilizes the pimar-15-en-8-yl+ intermediate, creating a kinetic trap that favors direct deprotonation rather than rearrangement to the secondary pimar-8(14)-en-15 yl+ species required for progression to abietadienes. This stabilization likely occurs by modifying the electrostatic environment that would otherwise favor continued cyclization driven by ion-pairing with the diphosphate anion released during initial ionization.132
Structure-guided engineering of white spruce (Picea glauca) ent-kaurene synthase (PgKS) further demonstrates how steric effects, rather than polar interactions, can critically influence cyclization outcomes.133 Homology modeling based on the T. brevifolia TXS crystal structure combined with molecular docking of ent-CPP identified I619 as a pivotal residue for cyclization control. This residue occupies a pivotal position on helix G2 within 3 Å of C-8 of ent-CPP—precisely where the critical pimarenyl+ forms during cyclization. Comprehensive sequence analysis revealed this position as strictly conserved among tetracyclic kaurene-producing enzymes but variable in those generating tricyclic pimaradiene-type products. Introduction of the I619A mutation fundamentally altered product outcome, redirecting cyclization from predominantly ent-kaurene (tetracyclic scaffold) to ent-pimaradiene (tricyclic scaffold) with 85% selectivity for the new product, nearly eliminating native ent-kaurene production (reduced to 0.3%). Molecular docking revealed that the native I619 residue prevents premature quenching of the pimaren-8-yl+ intermediate, allowing progression to the beyeran-16 yl+ intermediate and ultimately to tetracyclic ent-kaurene (Fig. 29). When replaced with alanine, the smaller side chain creates an environment permitting premature deprotonation of the pimarenyl+, effectively trapping the reaction at the tricyclic stage.133
Computational reaction BP mapping enabled the systematic redesign of MTS HCinS, with QM/MM simulations identifying and manipulating three critical decision points within its catalytic cascade.140 Detailed computational analysis revealed the complete reaction network: BP1 governing initial cyclization of linalyl+, BP2 determining whether (R)-α-terpinyl+ undergoes hydroxylation or deprotonation, and BP3 controlling cyclization of (R)-α-terpineol (Fig. 30). Native HCinS predominantly produces 1,8-cineole (bicyclic terpene) with 87.4% specificity via a specific trajectory through these BPs. Engineering efforts at each BP generated variants with altered product profiles through precise manipulation of active site geometry and energetics. At BP3, the F236M mutation expanded the spatial distance between the side chains of F236 and F268 from 8.9 Å to 9.8 Å, increasing the cyclization energy barrier from 12.3 to 14.2 kcal mol−1 and redirecting product formation toward (R)-α-terpineol (monocyclic hydroxylated terpene) with 55.1% specificity. Intervention at BP2 employed the N135H substitution to replace a hydroxylation-promoting asparagine with histidine, facilitating deprotonation and generating (R)-limonene (monocyclic terpene) with 87.6% specificity by altering the competitive balance between hydroxylation and deprotonation pathways. The T111A mutation targeted BP1, widening the binding cavity from 9.9 Å to 11.9 Å and allowing the substrate to maintain a Z-shaped conformation rather than adopting the U-shape required for cyclization, effectively preventing ring formation and producing myrcene (acyclic terpene) with 70.8% specificity. These engineered variants retained 55–95% of native HCinS's catalytic efficiency while achieving product specificity switching. When integrated into strains with enhanced GPP supply and increased gene dosage, fed-batch fermentations yielded 6.0 g L−1 for 1,8-cineole, 4.3 g L−1 for myrcene, 4.2 g L−1 for (R)-limonene, and 3.8 g L−1 for (R)-α-terpineol in 5L bioreactors—the highest reported titers for these compounds. Furthermore, the principles of this strategy were applied to phylogenetically distant TSs, suggesting its potential for broad applicability.140
Structure-guided quantum chemical design enables even more refined control over reaction BPs, as demonstrated in the engineering of bisabolene synthase (SydA) where crystal structure analysis (PDB ID: 8YZU) combined with computational modeling revealed and manipulated novel mechanistic pathways.141 Comparative analyses of active site contours with seven other TSs guided the identification of D99, D222, and W297 as critical determinants of product specificity at distinct BPs within the cyclization cascade. The D99A variant displayed exceptional product diversity, generating six different sesquiterpenes with monocyclic, bicyclic, and tricyclic structures, while functioning as a gatekeeper residue modulating active site pocket size via Mg2+ binding (Fig. 31). The D222K mutation achieved precise control over a newly identified 1,7-hydride shift BP, predominantly producing bicyclic α-cuprenene (66.7%) with reduced formation of the original monocyclic product (22.8%) and minor amounts of bicyclic β-chamigrene (10.5%). This precise control was further refined in the D222A/T185D/F187V triple mutant (SydA-M3), which achieved 90.3% selectivity for bicyclic α-cuprenene with complete elimination of the original product, while the W297Y variant significantly increased the formation of tricyclic 7-epi-β-cedrene from 6.1% in D99A to 40.5% (a 6.64-fold increase) by affecting base positioning critical for deprotonation steps (Fig. 31). Quantum chemical calculations and molecular docking studies revealed that these mutations alter the enzyme's ability to stabilize different carbocation intermediates and guide reaction pathways, with the catalytic mechanism involving FPP cyclization to the bisabolyl+ followed by divergent reaction pathways including the unique 1,7-hydride shift confirmed through deuterium-labeled FPP experiments.141
Rational control of aromatic residue clusters (ARCs) exemplifies computational-guided engineering for carbocation transport control, representing an orthogonal approach to traditional BP targeting.142 The bifunctional Pestalotiopsis fici nigtetraene synthase (PfNS) demonstrates how MD simulations can guide systematic redesign of critical transport networks that control skeletal complexity through regulated carbocation migration. Detailed simulations identified a distinctive ARC arrangement [F89-Y113-W317] in PfNS that differs significantly from ophiobolin F synthase (AcOS) [F66-F90-W189-W305], despite both enzymes sharing common carbocation intermediates. These computational insights facilitated the design of specific modifications to redirect cyclization pathways through controlled carbocation transport mechanisms.142
The engineered PfNS W193L/T194W double mutant triggered functional transformation, completely abolishing production of the native 5/11 bicyclic nigtetraene in favor of novel 14-membered monocyclic and 5/15 bicyclic sesterterpenes.142 Progressive engineering culminated in the quadruple mutant PfNS F89A/Y113F/W193L/T194W, which achieved complete functional conversion to produce 5/8/5 tricyclic ophiobolin F at approximately 10% of AcOS WT yield, demonstrating successful skeletal reprogramming from bicyclic to tricyclic frameworks (Fig. 32). Computational analysis revealed the molecular basis: W194 forms edge-to-face interactions with F89, stabilizing C14 of GGPP at 5.2 ± 0.8 Å, while F89 creates critical C–H⋯π interactions with carbocation intermediates at 3.9 ± 0.3 Å. Theozyme calculations confirmed these interactions significantly reduce energy barriers (<0.5 kcal) for critical hydride shifts, explaining the mechanistic basis for skeleton transformation.143,144 Volumetric analysis provided quantitative parameters for directing cyclization (active site volumes ranging from 166.0 ± 17.7 Å3 to 227.0 ± 17.7 Å3), enabling rational design of different carbon skeletons.142
The value of focusing on critical intermediate stabilization is demonstrated in the bacterial DTS CotB2, where experimental identification of key residues controlling carbocation fate enabled scaffold transformations through selective mutagenesis.146 CotB2 naturally converts GGPP to cyclooctat-9-en-7-ol, a complex 5/8/5 fused tricyclic structure with six chiral centers. Site-directed mutagenesis targeting residues involved in stabilizing reaction intermediates produced two variants with fundamentally altered cyclization trajectories: the CotB2 F107A mutant generated (R)-cembrene A, while the CotB2 W288G variant produced (1R,3E,7E,11S,12S)-3,7,18-dolabellatriene (Fig. 33). These scaffold transformations resulted from altered stabilization of critical carbocation intermediates at specific BPs in the cyclization cascade. The F107 residue appears to interact with and stabilize bicyclic intermediates in the native reaction pathway, and its replacement with alanine eliminated this stabilization, causing premature termination of the cyclization cascade immediately after initial ring formation. Similarly, W288 influences substrate folding and intermediate stabilization, with its substitution by glycine redirecting the reaction trajectory toward an alternative pathway. This experimental approach efficiently identified minimal mutations that fundamentally redirect complex cyclization cascades by focusing on residues likely to participate in specific intermediate stabilization rather than mapping the entire energetic landscape.146
An even more focused approach involves exploiting polar groups' role in determining skeletal complexity, as demonstrated in rice DTS OsKSL4 where strategic hydroxyl group placement critically influences reaction termination points.147 A single T696I substitution transformed the enzyme's catalytic output from tricyclic syn-pimara-7,15-diene to complex rearranged tetracyclic aphidicol-15-ene with approximately 80% selectivity when presented with syn-CPP (Fig. 34). This mutation was positioned spatially near the C8-yl+ in the initial cyclization intermediate, where the hydroxyl group of T696 facilitates direct deprotonation of the pimar-15-en-8-yl+ intermediate in the WT, promptly terminating the reaction. Substituting the polar threonine with non-polar isoleucine removed this premature termination pathway, enabling the intermediate to undergo a complex sequence of transformations: a 1,2-hydride shift from C9 to C8, followed by secondary cyclization and Wagner–Meerwein ring rearrangement before eventual deprotonation. Notably, this substantial increase in reaction complexity occurred with minimal impact on catalytic efficiency (less than 2-fold decrease in yield), indicating the mutation primarily redirects reaction trajectory rather than disrupting overall enzyme function. The more hydrophobic environment created by the isoleucine substitution appears to better stabilize carbocation intermediates in proximity to the diphosphate co-product, enabling the enzyme to pursue more intricate reaction pathways and access complex product architectures that would otherwise be terminated by premature deprotonation.147
Modifying the Hα-1 loop has proven effective for reversing hydroxylation patterns in TSs. This structurally conserved element undergoes significant conformational changes during catalysis and serves as a critical determinant of water accessibility to carbocation intermediates. Patchoulol synthase (PTS) and germacradien-11-ol synthase (Gd11olS) served as model systems, where transplantation of specific loop sequences from non-hydroxylating SdS yielded product shifts. The conversion of 238VEDE241 to 233RRGS236 in Gd11olS and 458KKRE461 to RRGS in PTS yielded substantial changes. Native PTS primarily synthesizes tricyclic patchoulol (60%), but the engineered PTS Hα-1 variant completely abolished hydroxylation, yielding instead bicyclic α-bulnesene (46%) and monocyclic germacrene A (40%) (Fig. 35). Similarly, Gd11olS Hα-1 sharply decreased germacradien-11-ol production from >90% to merely 8.3%, redirecting catalysis toward bicyclic isolepidozene (81%).149
Complementing loop engineering, modifications of secondary structural elements have demonstrated equal promise for altering cyclization outcomes.150 The G-helix kink in Gd11olS exemplifies how subtle changes to structural features can simultaneously influence carbocation stabilization and substrate positioning. The “kink” refers to a functionally critical bend in the G-α-helix that is essential for active site closure and substrate ionization. MD simulations identified the RQH site (R228, Q313, H320) and the G-helix kink region (residues 187–190) as critical determinants. Specific substitutions produced diverse product profiles: W312A substantially altered the product distribution, yielding large amounts of the acyclic nerolidol alongside the hydroxylated germacradien-11-ol and trace quantities of other cyclic hydrocarbons, including isolepidozene, while W312F redirected catalysis toward monocyclic germacrene A (71.5%). Additionally, the G188A mutation produced 88.4% bicyclic isolepidozene with minimal hydroxylation, demonstrating complete skeletal rearrangement (Fig. 36).150
Engineering conserved metal-binding motifs has revealed another effective approach for product diversification.148 The NSE/DTE motif and adjacent residues in GdolS from Streptomyces citricolor exemplify this strategy. High-resolution structural analysis (PDB ID: 5I1U, 1.50 Å) revealed how subtle modifications to the metal coordination environment can alter product specificity. The N218Q mutation in the NSE motif converted FPP into a nearly equal mixture of hydroxylated monocyclic germacradien-4-ol (47.6%) and non-hydroxylated monocyclic germacrene A (50.7%), compared to exclusively germacradien-4-ol in the WT (Fig. 36). This substantial shift results from altered metal coordination geometry, enabling the diphosphate group to act as a base for premature carbocation quenching. Additional mutations targeting the active site periphery, including Y303F and E307M, produced smaller but significant product shifts (>2% and >4% germacrene A, respectively).148
The dynamic C-terminal region represents perhaps the most complex target for engineering novel terpene scaffolds, as demonstrated in the bacterial DTS CotB2.33 This mobile element, featuring the conserved WXXXXXRY motif, orchestrates active site closure, substrate positioning, and carbocation stabilization. Crystal structures (PDB IDs: 4OMG and 6GGI) captured both open and closed conformational states, revealing how C-terminal folding creates extensive hydrogen-bond networks connecting key catalytic motifs. Site-directed mutagenesis of aromatic residues in this region led to scaffold rearrangements: W288G transformed the natural cyclooctat-9-en-7-ol (5/8/5 fused ring system) into 3,7,18-dolabellatriene, while W186H similarly generated dolabellatriene by altering reaction trajectories (Fig. 37). Notably, F107A prevented crucial C2–C6 bond formation, yielding cembrene A through premature cyclization termination. QM/MM simulations quantified interaction energies between active site residues and carbocation intermediates (−10 to −20 kcal mol−1), demonstrating how precisely positioned amino acids guide complex reaction cascades.33
The potential of second-shell engineering is demonstrated in cattleyene synthase (CyS) from Streptomyces cattleya, where high-resolution structural analysis revealed how subtle modifications to non-substrate-contacting residues can reshape terpene cyclization landscapes.151 Three critical structures—apo-CyS (PDB ID: 7Y50, 2.00 Å), CyS-GGPP-Mg2+ complex (PDB ID: 7Y88, 1.87 Å), and the CyS C59A variant (PDB ID: 7Y87, 2.30 Å)—provided unprecedented insight into substrate binding and catalysis, with the CyS-GGPP-Mg2+ structure representing the first TS captured with its native GGPP substrate. The structure revealed an active site architecture where GGPP is surrounded by five aromatic residues (F62, W81, F86, W160, W318), four aliphatic residues (A190, A191, A229, L311), and three polar residues (C59, C82, N315). Systematic site-directed mutagenesis confirmed that the C59A substitution was particularly consequential, identifying C59 as a key second-shell residue that influences a first-shell residue without directly contacting the substrate (Fig. 38). This variant catalyzed the conversion of GGPP into six diterpenes with distinct skeletons: the original tetracyclic cattleyene, an alternative tetracyclic compound, two different tricyclic structures, another tetracyclic framework (allokutznerene), and a novel tetracyclic system with an unprecedented skeleton. The C59A variant produced approximately 6-fold higher total yields compared to F86A and W160A variants that generated similar diversity but with lower efficiency. Structural comparison between WT and C59A revealed the mechanistic basis for this functional shift: the thiol group of C59 forms a close interaction (3.4 Å) with the phenyl ring of F86 in CyS WT, orienting this residue toward the active site. The C59A mutation disrupts this interaction, prompting F86 to rotate 24° clockwise and shift slightly away from the substrate binding pocket. This structural rearrangement expands the active site cavity and alters the cation-π interactions that normally channel the reaction toward a single product, thereby reducing selectivity. The formation of the novel tetracyclic compound involves a particularly complex pathway mapped using 20 isotopomers of (13C)GGPP, revealing branching from a common intermediate through multiple steps involving ring closures, 1,2-hydride shifts, and skeletal rearrangements—a complex transformation enabled by a single second-shell substitution.151
The versatility of second-shell engineering is further exemplified by Fusarium oxysporum fusoxypene synthase (FoFS), where saturation mutagenesis at a single key non-active site position generated extensive product diversity from GFPP.152 Systematic variation at position L89 in FoFS created variants with diverse product profiles from the GFPP.152 The L89Q variant proved particularly versatile, producing not only the native pentacyclic fusoxypene A, but also three novel sesterterpenes: fusoxypene D with a 5/6/8/5 fused tetracyclic skeleton, fusoxypene E featuring a 5/6/7/3/5 fused pentacyclic arrangement, and fusoxypene F with a 5/12/5 tricyclic framework, as well as pentacyclic (−)-astellatene. Other variants showed more focused product shifts: the L89C/A/G/T/N substitutions exclusively produced bicyclic sesterterpenes including bipolapene A, bipolapene B, and preterpestacin I, completely redirecting cyclization away from the native pentacyclic pathway, while L89D/M variants retained the original catalytic specificity (Fig. 39). Computational analyses using DFT calculations and MD simulations revealed that L89 mutations reconfigure hydrogen-bonding networks with second-shell residues N70 and Y74, which indirectly reposition the first-shell aromatic residue W69 within the catalytic pocket, modifying its interactions with carbocation intermediates through cation-π, dipole-π, or C–H⋯π mechanisms. These subtle changes redirect cyclization at critical BPs, yielding diverse ring systems without directly modifying the catalytic residues themselves.152
Building upon the single-position modifications demonstrated in CyS and FoFS, systematic contact mapping reveals even greater engineering potential when applied to multiple second-shell positions, as illustrated by comparing TEAS and Hyoscyamus premnaspirodiene synthase (HPS), which share 72% sequence identity yet produce fundamentally different products: TEAS generates predominantly eudesmane-type bicyclic compounds while HPS forms vetispirane-type spirocyclic structures from identical FPP.145 Systematic tier-based mapping revealed that first-shell residues directly contacting the substrate were identical between the two enzymes, while critical determinants of product specificity resided in second-shell residues that shape the active site environment without directly contacting reaction intermediates (Fig. 40). Mutations informed by this mapping yielded catalytic redirections. The T402S/V516I double mutation in TEAS created a variant producing 4-epi-eremophilene (70% yield)—a previously undocumented bicyclic eremophilene-type compound resulting from altered regiospecificity of deprotonation at C6 rather than C8 of the eremophilenyl+ intermediate. Further refinement through a nine-residue variant (T274A/V291A/V372I/T402S/Y406L/S436N/I438T/I439L/V516I) generated 75% premnaspirodiene, effectively converting TEAS to mimic HPS function while maintaining comparable catalytic parameters (Km = 6.24 µM, kcat = 0.130). The complementary approach proved equally effective, with eight reciprocal mutations transforming HPS to produce 90% 5-epi-aristolochene, demonstrating bidirectional applicability. Mechanistic analysis revealed that these second-shell residues collectively influence catalytic trajectories by shaping active site geometry and dynamics rather than directly participating in substrate binding, with distal residues like V372I (located 12.5 Å from the active site center) influencing critical alkyl migration steps while T402S/V516I mutations determine regiospecificity of deprotonation.145
This systematic engineering approach demonstrates how subtle modifications to residues surrounding the active site can redirect complex terpene cyclization cascades, providing another framework for rational engineering of terpene structural diversity through comprehensive mapping of extended interaction networks.
Application of this approach to TEAS and HPS demonstrated its transformative potential for achieving skeletal diversification.153 Despite sharing high sequence identity (as noted above), these enzymes produce fundamentally different carbon frameworks—TEAS generates bicyclic 5-epi-aristolochene, while HPS produces spirocyclic premnaspirodiene. A comprehensive combinatorial library encompassing all 512 possible combinations (29) of nine phylogenetically guided differentiating residues revealed multiple discrete functional transitions between these distinct skeletal types. The study identified 418 active variants from 432 unique combinations, with 51.2% showing promiscuous activities, 36.8% maintaining TEAS-like specificity, 9.6% producing premnaspirodiene-like products, and only 2.4% generating 4-epi-eremophilene as the major product. The library revealed that product specificity changes varied greatly with mutational context, with average interneighbor distances ranging from 21 to 49 chemical distance units across different sequence backgrounds. Here, chemical distance represents a quantitative measure of product spectrum differences between enzyme variants, calculated as the Euclidean distance between their product distributions in four-dimensional space (representing percentages of 5-epi-aristolochene, 4-epi-eremophilene, premnaspirodiene, and minor products). Notably, an eight-mutation variant M8 (TEAS T274A/V291A/V372I/T402S/Y406L/S436N/I439L/V516I) achieved 81% premnaspirodiene production, surpassing the complete nine-mutation M9 variant (TEAS T274A/V291A/V372I/T402S/Y406L/S436N/I438T/I439L/V516I) (72% premnaspirodiene) (Fig. 40), demonstrating the non-linear nature of evolutionary trajectories toward new skeletal specificities. Furthermore, the strict dependence of 4-epi-eremophilene synthase activity on the T402S active-site mutation highlighted the critical role of active-site residues in directing skeletal reprogramming, despite the importance of distal mutations for overall catalytic optimization. DFT calculations provided mechanistic insights into these skeletal transformations, revealing higher energy barriers for methyl migration (8.4 kcal mol−1) compared to methylene migration (4.8 kcal mol−1), suggesting that thermodynamic control influences native skeletal specificities. Quantitative assessment across the entire mutational landscape revealed that approximately 7% of sequence backgrounds functioned as “hot spots” where single mutations produced functional shifts (average interneighbor distances >50 on the chemical distance scale, where larger values indicate greater divergence in product profiles), highlighting how specific mutational contexts can facilitate rapid skeletal diversification. This systematic exploration demonstrates how mutations distributed throughout the protein architecture—from direct active site positions to distal supporting residues—collectively redirect complex carbocation cyclization through subtle alterations in reaction geometry and intermediate stabilization.153
The transformation of mint (M. spicata) LS into diverse monoterpene-producing enzymes exemplifies how alanine scanning can guide systematic active site remodeling (Fig. 41).157 Initial screening identified three critical positions—N345, L423, and S454—where alanine substitution increased formation of bicyclic pinene products.157 Following established principles that postulate additive effects of plasticity residues,157 the triple mutant N345A/L423A/S454A converted a highly specific LS (96.69 ± 0.29% monocyclic limonene) into an effective pinene synthase producing primarily bicyclic products (21.68 ± 2.64% α-pinene and 47.07 ± 3.87% β-pinene, total 68.75%) with minimal residual limonene (9.19 ± 1.57%).
Focused exploration of the N345 position revealed how side chain polarity controls cyclization pathway selection. The N345I mutation created a phellandrene synthase generating predominantly monocyclic products (29.47 ± 1.18% α-phellandrene and 39.40 ± 1.61% β-phellandrene, total 68.87%). Systematic substitution at this position demonstrated a clear structure–activity relationship: polar residues (N345S: 69.84 ± 0.73% limonene; N345D: 89.97 ± 0.47% limonene) maintained native limonene-producing activity, while non-polar substitutions (N345L: 25.04 ± 1.76% limonene; N345I: 18.48 ± 0.81% limonene) shifted product profiles toward alternative skeletal frameworks.157
Mechanistic investigation revealed these mutations influence the fate of the α-terpinyl+ intermediate formed after initial cyclization of GPP. Structural analysis identified a “polar pocket” comprising multiple residues (C321, W324, N345, T349, S454, M458, H579, and Y573) that collectively stabilize the terpinyl+ and prevent carbocation migration toward the diphosphate co-product. Computational modeling confirmed the electrostatic steering effect of the diphosphate, with measurements from crystal structures showing the α-phosphorus positioned 2.72 Å, 4.70 Å, and 6.43 Å from C3, C5, and C7, respectively, supporting directional migration during bicyclic pinene formation.157
Complementary studies on Paeonia lactiflora α-pinene synthase demonstrated how alanine scanning can identify aromatic residues critical for carbocation stabilization.158 Homology modeling and computational docking identified F482 as a key position within 5 Å of the reaction center, stabilizes the pinyl+ through steric constraints and potential cation-π interactions. Single-point alanine substitution at this position, along with other aliphatic replacements (F482I, F482L, F482V, and F482T), transformed the native enzyme from producing 97.86% α-pinene (bicyclic 2,7-ring closure product) into an efficient sabinene synthase generating >90% sabinene—a bicyclic product formed via 6,7-hydride shift followed by 2,6-ring closure (Fig. 41). The F482V and F482L variants proved most effective, yielding 94.51% and 94.26% sabinene while retaining 29.50% and 49.98% of WT catalytic efficiency, respectively. MD simulations revealed that these mutations altered positioning of reaction intermediates within the active site, preventing stabilization of the pinyl+ and favoring the alternative sabinene-forming pathway.158
Successful applications highlight saturation mutagenesis's versatility across diverse TSs. Saturation mutagenesis of cotton (+)-DCS generated the N403P/L405H double mutant producing 93% germacrene D-4-ol,88 while systematic exploration of F205 in 2-methylenebornane synthase revealed substrate-dependent product profiles ranging from alcohols to novel C11 compounds.164 Studies on FoFS demonstrated how non-catalytic substitutions at position L89 could generate distinct sesterterpene architectures, from tetracyclic structures (L89Q) to exclusively bicyclic products (L89C/A/G/T/N).152 In TXS, the Y688L mutation enhanced selectivity for specific isomers while Q609G redirected cyclization toward novel skeletons.59 Smart library design strategies employing degenerate codons significantly improved screening efficiency, leading to the identification of variants such as V610F and V584M with completely altered product profiles.56
Additionally, error-prone PCR complements rational approaches by introducing random mutations throughout the gene sequence, enabling exploration of unexpected beneficial mutations that influence protein dynamics through allosteric effects. This approach proved effective in engineering cotton (+)-δ-cadinene synthase, where random mutagenesis identified the L405S mutation that redirected cyclization from bicyclic (+)-δ-cadinene toward monocyclic germacrene D-4-ol, with subsequent systematic optimization yielding the highly efficient N403P/L405H double mutant.88
These directed evolution approaches provide comprehensive functional information about critical residues, establishing their roles in determining product outcomes and creating opportunities for accessing novel terpene skeletons through systematic engineering.
The versatility of this approach extends across diverse TS classes. In fungal systems, systematic segment exchange studies between related bifunctional TSs have identified a critical cyclization control region (residues 60–69).97 Chimeric constructs between fungal HAD-TCs have demonstrated that skeleton-determining functions can be isolated to the TC domain.124 Recent structural studies of the bifunctional fusicoccadiene synthase (PaFS) revealed that cyclase domains preferentially interact with prenyltransferase domains from adjacent chains rather than their covalently linked partners, demonstrating how non-covalent interactions can mediate substrate channeling between domains.166
The functional conversion between Cinnamomum camphora monoterpene and STSs demonstrates the potential of systematic domain swapping.167 Despite sharing 97% DNA sequence identity, CiCaMS and CiCaSSy exhibit fundamentally different catalytic activities—CiCaMS produces linear monoterpenes (myrcene and linalool) from GPP, while CiCaSSy synthesizes complex polycyclic sesquiterpenes (tricyclic α-santalene, bicyclic β-santalene, and bicyclic trans-α-bergamotene) from FPP (Fig. 42A). Through systematic domain-swapping experiments, specific protein regions were exchanged between the two enzymes. Region R4 (a protein segment spanning positions 267–308 in the C-terminal domain, containing 5 amino acid differences) and Region R5 (a 12-amino acid stretch from positions 401–419 near the substrate-binding pocket, also with 5 substitutions) were individually transplanted from CiCaSSy into the CiCaMS backbone. These chimeric enzymes—MS_R4 (carrying all five R4 substitutions as a block) and MS_R5 (carrying all five R5 substitutions together)—gained STS activity, producing trans-α-bergamotene at 20.5 ± 0.7 and 28.1 ± 1.4 mg L−1, respectively. The MS_R4R5 double hybrid, combining both regions, exhibited enhanced functional characteristics by producing all three sesquiterpenes (α-santalene, β-santalene, and trans-α-bergamotene) with a combined titer of 35.6 ± 15.2 mg L−1, which approaches the production level of the WT CiCaSSy (43.8 ± 6.7 mg L−1). This wholesale region exchange proved far more effective than any individual amino acid substitution (best single mutant MS_F294M: only 8.7 ± 0.4 mg L−1).
Domain swapping revealed the modular nature of enzyme function, demonstrating that regions governing substrate recognition (GPP vs. FPP) could be functionally separated from those controlling cyclization trajectories.167 Integration with selected point mutations ultimately achieved complete skeletal reprogramming. The MS_6S variant, incorporating six critical substitutions (S267N/F294M/V401A/L403F/L404V/N442D), reproduced CiCaSSy's characteristic product profile with 60% α-santalene, 25% trans-α-bergamotene, and 15% β-santalene, effectively converting a linear monoterpene-producing enzyme into a polycyclic STS (Fig. 42B).167
Complementary studies on Sitka spruce MTSs revealed how domain swapping can identify single residues functioning as master switches for cyclization pathway control. Systematic exchanges between (+)-3-carene synthase (PsTPS-3car) and (−)-sabinene synthase (PsTPS-sab) narrowed specificity determinants to position F596 within helix J. The F596L substitution in PsTPS-sab redirected product formation from 44.7% (−)-sabinene (monocyclic) to 28.6% (+)-3-carene (bicyclic), while the reciprocal L596F mutation in PsTPS-3car2 shifted distribution from 67.5% (+)-3-carene to 37.4% (−)-sabinene. Triple mutations (A595G/F596L/L599F in PsTPS-sab) achieved 42.3% (+)-3-carene production, demonstrating amplified pathway redirection through coordinated substitutions (Fig. 43). Homology modeling revealed that F596 resides approximately 4 Å from the C4 carbon of the α-terpinyl+ intermediate, where phenylalanine likely provides steric constraints or cation-π interactions promoting sabinene formation while leucine facilitates the alternative 5,8-cyclization required for 3-carene production.
Kinetic analysis revealed that PsTPS-sab WT (producing primarily sabinene) exhibited significantly higher turnover rates (kcat = 32.68 min−1) than PsTPS-3car2 WT (producing primarily 3-carene, kcat = 5.86 min−1), demonstrating how the F596L substitution affects both product selectivity and catalytic efficiency.168
QM/MM simulations combine quantum mechanics for the reacting center and molecular mechanics for the enzyme environment to simulate entire reaction pathways and calculate energy barriers for different steps.104,172,173 These methods help understand how enzymes stabilize transition states and intermediates through electrostatic or steric effects, and can reveal mechanisms for controlling product specificity by raising energy barriers of undesired pathways.174 QM/MM MD simulations have demonstrated how mutations in LS helices G1-G2 region (S454G, C457V, M458I) increased the distance between the α-terpinyl+ and potential bases, thereby preventing deprotonation to limonene and promoting reaction flow towards bicyclic products.174
These approaches provide atomic-level details of transient species inaccessible experimentally, guide rational design, explain complex experimental outcomes, and predict binding modes even without experimental structures.175 Their accuracy depends heavily on starting structure quality, force field parameters, and quantum mechanical methods used.
Substrate consumption-based screening has been successfully implemented using engineered strains with competitive carotenoid synthesis pathways, where TS activity is inferred from colony color intensity.176 Byproduct detection relies on colorimetric methods to measure diphosphate release, reflecting overall activity but not distinguishing product profiles.177,178 Biosensor-based approaches employ genetically encoded sensors that convert product concentration into detectable signals, though developing skeleton-specific sensors remains challenging.179
Automated GC-MS workflows with liquid handling and multi-well plate autosamplers increase throughput for medium-scale library screening.180 While directed evolution can discover unexpected beneficial mutations without requiring detailed structural knowledge, the screening bottleneck remains the major obstacle, particularly for identifying specific skeletal rearrangements.
Machine learning and co-evolutionary analysis identify key residues or regions likely to influence product outcomes, guiding mutagenesis or library design. Co-evolution analysis combined with structural simulation has been used to identify hotspot regions in a 5-epi-aristolochene synthase, successfully improving enzyme activity through site-specific mutation.186
These approaches can uncover complex patterns in large datasets, integrate diverse data types, and potentially overcome the poor sequence-function correlation often observed in TSs.187,188 Co-evolutionary analysis reveals functionally important residue networks, including those distant from the active site.189 While promising, these methods require substantial high-quality training data and careful management of potential biases.190
Achieving significant skeletal remodeling in TSs typically requires integrating multiple strategies. An integrated, multidisciplinary approach combining computational modeling, ancestral reconstruction, high-throughput screening, and data-driven methods offers the effective path toward predictive engineering of TSs for novel product outcomes, including skeletal rearrangements.99
Engineering TSs for predictable skeletal diversification remains challenging due to their complex carbocation cascade mechanisms, inherent promiscuity, and the often weak correlation between sequence and function. Notably, while random mutagenesis approaches such as error-prone PCR and directed evolution have proven highly effective for optimizing enzyme properties like activity or stability, they have yielded limited success in generating fundamentally new terpene skeletons. This highlights the difficulty of navigating the vast and complex fitness landscapes required for such profound catalytic reprogramming. Consequently, progress is accelerating through the integration of structural biology, computational simulations, and diverse engineering techniques that allow for more precise interventions. Strategies combining rational design with evolutionary principles or computational predictions with experimental validation show particular promise for achieving significant skeletal changes.
Future advances will likely emerge from several directions: machine learning applications for predicting sequence–function relationships and designing novel variants; deeper mechanistic understanding through advanced spectroscopy and computational modeling; innovations in high-throughput screening using biosensors and microfluidic platforms; and integration of engineered enzymes into optimized microbial production systems. These developments will expand access to novel terpenoid structures with applications across medicine, agriculture, and materials science, demonstrating enzyme engineering's potential to reshape molecular diversity.
Footnote |
| † These authors contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2026 |