Open Access Article
Qi
Zhang
a,
Lorenzo
Catti
b,
Leonidas-Dimitrios
Syntrivanis
c and
Konrad
Tiefenbacher
*cd
aKey Laboratory of Green Chemistry and Technology of Ministry of Education, College of Chemistry, Sichuan University, China
bLaboratory for Chemistry and Life Science, Institute of Innovative Research, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226-8503, Japan
cDepartment of Chemistry, University of Basel, St. Johanns-Ring 19, CH-4056 Basel, Switzerland. E-mail: konrad.tiefenbacher@unibas.ch
dDepartment of Biosystems Science and Engineering, ETH Zürich, Mattenstrasse 26, CH-4058 Basel, Switzerland. E-mail: tkonrad@ethz.ch
First published on 25th April 2019
Covering: literature up to 2018
Terpenes are a class of natural products characterized by remarkable structural diversity. Much of this diversity arises biosynthetically from a handful of linear precursors through the so-called tail-to-head terpene cyclization reaction. This reaction is one of the most complex observed in nature, and historically attempts to replicate it with non-enzymatic means have met with little success. In recent years, however, the development of manmade binding pockets that allow such reactions to take place has been reported. This Highlight provides an overview of this nascent field, and outlines the challenges that need to be overcome moving forward.
000 members of the terpenoid family have been described so far. Nevertheless, the currently known structures constitute just a fraction of potential structures, as was recently highlighted for monoterpenes in a computational study.2 Interestingly, this whole variety is produced from just a handful of relatively simple acyclic terpenes via cyclization and oxidation processes. Geranyl pyrophosphate (GPP 1, Scheme 1a), farnesyl pyrophosphate (FPP 2), and geranylgeranyl pyrophosphate (GGPP 3) are the three most common substrates utilized for cyclizations. These linear isoprenoid diphosphates are formed in nature by prenyltransferases utilizing the simple C5 building blocks dimethylallyl diphosphate and isopentenyl diphosphate.1
![]() | ||
| Scheme 1 Structures of the linear substrates (a) utilized by class I and class II cyclases and examples of cyclization products (b) generated by such enzymes. | ||
In nature, two classes of enzymes are responsible for the subsequent cyclization processes.3 Class I cyclase enzymes initiate the reaction by cleaving the pyrophosphate residue of the molecule via a trinuclear metal cluster. Class II cyclases, on the other hand, trigger reactions via protonation of the terminal olefin or a previously installed epoxide moiety. Ryan Shenvi proposed the term “tail-to-head” cyclization for chemical reactions similar to reactions observed in class I cyclase enzymes since the reaction is initiated at the tail end of the molecule and propagates towards the head group (Scheme 1b).4 Accordingly, the term “head-to-tail” cyclization refers to a process similar to those catalyzed by class II enzymes.4 This terminology is very well suited to differentiate between these two reaction pathways in a non-enzymatic setting, the main focus of this article. Interestingly, the structural variety of products formed by class II cyclase enzymes is rather limited. Mainly six-membered rings are formed from the energetically most preferred chair conformation of the flexible substrate. As an example, the reaction product ent-copalyl diphosphate5 (4) is displayed in Scheme 1b. Chemists have had some success in mimicking the head-to-tail cyclization mode in solution,6 and even some enantioselective cyclizations using chiral Lewis acids have been reported. The reproducibility of this cyclization mode in solution using abiotic catalysts is likely due to the reliance on the energetically preferred substrate conformation, which translates into the product structure. Nevertheless, the head-to-tail cyclization mode is still far away from being generally applicable in a solution setting.
In contrast to class II cyclases, class I enzymes produce a much larger variety of products. This is due to their ability to bind a specific conformation of the flexible precursor, enabling cyclization modes not observed with the unrestrained conformations in solution. Additionally, the tail-to-head cyclization is stepwise and the cationic intermediates formed can undergo a variety of reactions like rearrangements, hydride shifts, proton transfers and eliminations. Again, the shape of the active site determines the accessible conformations of these carbocationic intermediates, thereby determining the reaction outcome. The active site of the enzyme, additionally, stabilizes cationic intermediates and transition states via a range of interactions: coulombic, cation–dipole and cation–π interactions.7 As an example of a tail-to-head cyclization product, taxadiene (5), the carbon framework of the anti-cancer drug Taxol (6),8 is depicted in Scheme 1b. This cyclization involves several cationic intermediates, an intramolecular proton transfer and an elimination reaction as the terminating step.1 Computational chemistry is increasingly utilized to learn more about the mechanisms involved in terpene cyclizations.9–14 Such studies have provided insight into how cyclase enzymes exert control over the cyclization outcome, via specific interactions between active site moieties and carbocations.9,11,14,15 The enzyme, however, not necessarily has to intervene throughout the whole cascade reaction; some steps can be driven by the intrinsic reactivity of the cationic intermediates.16 Nevertheless, in solution the tail-to-head cyclization mode of class I enzymes proved to be very challenging to reproduce.4 This may not be too surprising, since even a very large Lewis or Brønsted acid is hardly able to influence the conformation of the flexible acyclic terpene substrate. Moreover, nucleophiles in solution may intercept cationic intermediates, thereby preventing the formation of polycyclic products.
![]() | ||
| Scheme 2 Examples of (a) tail-to-head and (b) head-to-tail cyclizations of monoterpenes using traditional Lewis or Brønsted acids in solution. | ||
Although there have been reports about tail-to-head cyclizations of sesquiterpenes in solution, these reactions again suffer from the competition of elimination and/or substitution reactions and produce cyclic terpenes in low yields only.4 The Shenvi group attributed these problems to the presence of counteranions that can function as a base and/or nucleophile. Due to coulombic attraction, this negatively charged counteranion is held in close proximity to the formed cation, resulting in fast elimination and substitution reactions. Shenvi and coworkers devised a strategy that prevented the mobility of the cleaved leaving group. They synthesized a modified sesquiterpene substrate 9 (Scheme 3a), containing a vinyl epoxide, in four steps. The addition of aluminum Lewis acids activates the epoxide and allows 1,6-ring closure to form the bisabolyl cation 10. Due to the covalent bond between the substrate and the former Lewis acid, the anionic leaving group is immobilized. Therefore, the cationic charge is able to freely propagate and undergo a reaction cascade consisting of hydride shifts, cyclizations and elimination, to produce the cedrene (11, 12) and funebrene (13) skeletons rather efficiently in a combined yield of 44%.
Moreover, the Shenvi group synthesized a similar sesquiterpene substrate 14 (Scheme 3b) that features a trimethylsilyl (TMS) group at the head end, in five steps. The TMS group should increase the nucleophilicity of the terminal alkene and allow the direct reaction between the terminal double bond and the initially formed bisabolyl cation 10. Indeed, exposure of 14 to aluminum Lewis acids produced the cumacrene/dunniene skeleton 15 in 40% yield as a mixture of isomers.
With these two examples, the Shenvi group clearly demonstrated that strained and thermodynamically unstable sesquiterpenes were accessible via this strategy. Nevertheless, two main limitations remain: (1) the unnatural substrates have to be synthesized in four to five linear steps. The ideal starting material for a chemical cyclization would be the commercially available and inexpensive farnesol itself. (2) The use of a Lewis acid in solution does not allow control over the conformation of the flexible sesquiterpene substrate. To achieve such control, uptake of the substrate into a binding pocket would be required.
:
20. Regular uncatalyzed hydrolysis produced mainly the tertiary alcohol product 19 as a diastereomeric mixture. Interestingly, but maybe not too surprisingly, the cavity of such active antibodies is lined with several aromatic residues to stabilize cationic species, much like in natural cyclase enzymes.25,26 The reaction catalyzed by the antibody, however, does not correspond to a tail-to-head cyclization that is characterized by the initial formation of an allylic cation. It shares distant similarities to a head-to-tail cyclization and this strategy was later indeed used for the formation of class II cyclase-like decalin structures.27
More recently, investigations of supramolecular structures as simple enzyme mimetics have intensified.28–32 Supramolecular capsules share some basic similarities to cyclase enzymes. They feature a binding pocket that can entrap guest molecules temporarily. Most capsules reported in the literature feature aromatic walls that can potentially stabilize cationic species inside the binding pocket via cation–π interactions. A main difference to enzymes is the general absence of a pyrophosphate-binding motif, which is essential for strong substrate binding in cyclase enzymes. However, the Lledó group reported a supramolecular cavitand equipped with an anion binding motif,33 but was not able to initiate cyclizations so far. The groups of Toste, Bergman and Raymond reported monoterpene-like Prins-cyclizations inside the anionic metal–ligand cage (±)-I (Scheme 4b).34 Inside the supramolecular cavity (10 mol% of I, 60 °C, aqueous phosphate buffer, pH = 7.5), citronellal (20) was cyclized mainly to alkenol 22. In solution much more acidic conditions (pH = 3.2) were required to observe conversion. Interestingly, under these conditions diol 23 was formed preferentially. The observed differences in reactivity are likely due to the stabilization of cationic species and the hydrophobic environment inside the cavity of I. Utilizing an optically active host related to I, the authors were able to produce enantiomerically enriched products (up to 69% ee) with substrates closely related to the monoterpene 20.35
The Tiefenbacher group reported the first tail-to-head terpene cyclization inside a supramolecular container,19 utilizing the hexameric resorcinarene capsule II36–38 as the catalyst. This capsule self-assembles from six subunits 24 and eight water molecules in apolar solvents via the formation of a complex hydrogen bond network (Scheme 5a). Despite being completely closed, guest uptake into II is usually facile and occurs most likely via the dissociation of one subunit.39 Optimized reaction conditions turned out to be 10 mol% capsule II and 3 mol% HCl in chloroform at 30 °C.40 Under these conditions, commercially available nerol (25) was cyclized to eucalyptol in 40% yield (Scheme 5b). The reaction was initiated by the protonation of the alcohol, followed by cyclization to produce the α-terpinyl cation 26. By utilizing an 18O-labeled substrate, it was shown that the cleaved water molecule, not the water in the hydrogen bond network of II, was the preferred nucleophile for the introduction of the alcohol functionality of 27. Subsequently, the alkene is activated by protonation, to facilitate the cyclization leading to the formation of the bicyclic monoterpene eucalyptol (28). Interestingly, this report marked the first direct access to eucalyptol from an acyclic terpene precursor utilizing a man-made catalyst. Further studies revealed that the interception of cation 26 can be avoided by using less nucleophilic leaving groups. For instance, the conversion of geranyl acetate (29) under otherwise identical conditions produced α-terpinene (30) as the main product (35%, Scheme 5c). A series of control experiments indicated that the reaction indeed took place inside the cavity of II. For instance, without capsule, 3 mol% HCl did not initiate any conversion. The same was true when the cavity of II was blocked by a high affinity guest molecule (nBu4NBr). In addition, a very high size selectivity was observed, indicating that substrates that are encapsulated well react much faster than more bulky substrates.
The reaction mechanism for the formation of α-terpinene (30) from geranyl acetate (29) must involve the α-terpinyl cation 26, the same intermediate as in the formation of eucalyptol. How is this cation generated from 29? A direct cyclization like in the reaction of nerol (25) is prevented by the trans geometry of the C2–C3 alkene in geranyl acetate (29). In the proposed biosynthesis (Scheme 5d), the leaving group of geranyl pyrophosphate (1) is first cleaved to generate the transoid allylic cation 31a. The recombination of the leaving group at the more hindered allylic position of the cation then produces linalyl-PP 32a. The free rotation about the C2–C3 sigma bond allows for the facile conversion to the cisoid isomer 32b, required for the cyclization to 26. After another cleavage of the leaving group, the cisoid cation is generated, which is believed to produce the key intermediate 26via cyclization. This isomerization mechanism is generally accepted in the literature1 for terpene cyclizations and is in very good agreement with experimental evidence. Nevertheless, the release of the formed tertiary pyrophosphate (32a/b in case of monoterpenes) has not been observed so far.41 The direct isomerization of the initially formed transoid allylic cation 31a was excluded due to gas phase calculations which indicated that the energy barrier for such a process is relatively high (approx. 55 kJ mol−1).42,43 However, in the capsule-catalyzed cyclization of geranyl acetate (29), a stepwise isomerization mechanism involving linalyl derivatives can be excluded for two reasons: (1) the reaction profiles for geranyl acetate and linalyl acetate do not match, indicating that geranyl acetate predominantly reacts via an alternative pathway that does not involve linalyl acetate as intermediate. (2) More importantly, in the cyclization of geranyl acetate, no intermediates – like for instance linalyl acetate – can be detected. In contrast to cyclase enzymes that bind formed intermediates tightly, capsule II does not exhibit a strong affinity to intermediates. Therefore, intermediates – for instance α-terpineol (Scheme 5b) – can be readily detected via GC and NMR. In light of these observations, a direct isomerization of the transoid cation 31a to the cisoid cation 31b was considered highly likely. Although this result does not necessarily imply that a direct isomerization is also operational in the terpene biosynthesis, the direct isomerization should not be categorically excluded anymore.
The reaction mechanism for the conversion of geranyl acetate (29) was investigated in detail.40 Kinetic investigations indicated that the cleavage of the protonated leaving group was the rate-determining step of the cyclization cascade, indicating that substrate uptake into capsule II is not the slowest step.
Recently, the Tiefenbacher group also reported the first examples of sesquiterpene cyclizations inside the capsule II.44,45 Utilizing the optimized conditions for monoterpene cyclizations, all alkene isomers of farnesol and farnesol acetate were investigated (Scheme 6a). Compared to monoterpenes, formation of more complex product mixtures was observed. This was not too surprising, since the additional prenyl unit and the higher flexibility of the farnesol precursor enable a much greater diversity in cyclization outcomes. Five main products were identified in these experiments (Scheme 6a and b): δ-selinene (35), 10-epi-zonarene (37), α-cedrene (38), 2-epi-α-cedrene (39), and ε-patchoulene (40). Interestingly, δ-selinene (35) was only observed with substrates containing a (2E)-alkene. Its formation is initiated by a 1,10-ring connection to form cation 34 (Scheme 6a).46 This ring closure seems to be only operational when the 1,6-ring formation (Scheme 6b) is attenuated by the (2E)-alkene geometry. The most selective conversion to δ-selinene (35) was observed in the case of (2E,6Z)-farnesyl acetate (18% yield). It is likely that the (6Z)-alkene favors substrate conformations suitable for the required 1,10-ring closure. This difficile control over the substrate conformation is not possible with capsule II yet, since its cavity is too large and too symmetric. Therefore, it was decided to further restrict the conformational freedom of the sesquiterpene substrate by incorporating a six-membered ring. The cyclofarnesyl acetate 41 (Scheme 6c) is readily available in three steps from an inexpensive starting material. Cyclization of this conformationally restricted substrate led to a relatively selective formation of isolongifolene (42) in 24% yield (GC analysis). Purification at this step was difficult and required repeated chromatography on silica, which reduced the yield to 11% (85% purity according to GC analysis). Analytically pure material was obtained by direct allylic oxidation of the crude cyclization mixture of 42 (after the removal of resorcinarene), to deliver the natural product isolongifolenone (43). It was isolated in 20% yield from starting material 41 over two steps. This route (total of four steps for isolongifolene; five steps for isolongifolenone) is much shorter than previously reported total synthesis approaches which required ten or more linear steps.44 This highlights the great potential of terpene cyclizations inside supramolecular catalysts. Nevertheless, it is only a starting point. Several advances are required to transform this proof of principle study to a generally useful terpene cyclase mimic.
One could ask why mimicking enzymes that nature has already optimized close to perfection would be a worthwhile endeavor. The power and utility of the biomachinery for the production of tail-to-head terpene products is undisputed, as for instance demonstrated by the efficient production of artemisinic acid, the precursor to the anti-malaria artemisinin.48 Nevertheless, there are several (potential) advantages for man-made catalysts: (1) there are cases, especially for natural products from marine sources, where the producer organism is unknown and, therefore, the biomachinery is not available to us. (2) Learning how to construct artificial cyclase enzymes may allow chemists to access completely novel terpene structures, yet unknown to nature. Modified cyclization substrates might be especially of interest, since natural enzymes do not tolerate large variations of the substrate structure.49 (3) Such studies also may help chemists to better understand the working principles of natural enzymes.
In addition, the intrinsic challenge of performing one of the most complex reaction of nature in a man-made catalyst is very motivating in itself. Therefore, we believe that this field will continue to receive an increasing amount of attention over the next decade.
Polyolefin Carbocyclizations to Terpenes and Steroids, Chem. Rev., 2005, 105, 4730–4756 CrossRef CAS PubMed.| This journal is © The Royal Society of Chemistry 2019 |