Michael J.
Stephenson
,
Robert A.
Field
and
Anne
Osbourn
*
John Innes Centre, Norwich Research Park, Norwich, NR47UH, UK. E-mail: anne.osbourn@jic.ac.uk
First published on 20th February 2019
Covering: 1948 up to the end of 2018.
The triterpene alcohols represent an important and diverse class of natural products. This diversity is believed to originate from the differential enzymatically controlled cyclisation of 2,3-oxidosqualene. It is now a well-established presumption that all naturally occurring tetra- and penta-cyclic triterpene alcohols can be rationalised by the resolution of one of two intermediary tetracyclic cations, termed the protosteryl and dammarenyl cations. Here, a discussion of typical key triterpene structures and their proposed derivation from either of these progenitors is followed by comparison with a recently reported novel pentacyclic triterpene orysatinol which appears to correspond to an unprecedented divergence from this dichotomous protosteryl/dammarenyl view of triterpene biogenesis. Not only does this discovery widen the potential scope of triterpene scaffolds that could exist in nature, it could call into question the reliability of stereochemical assignments of some existing triterpene structures that are supported by only limited spectroscopic evidence. The discovery of orysatinol provides direct experimental evidence to support considering more flexibility in the stereochemical interpretation of the biogenic isoprene rule.
One of the richest sources of triterpene diversity is the plant kingdom, where this family of natural products serve as precursors to many important and structurally diverse secondary metabolites.5,6 However, triterpenes derived from 2,3-oxidosqualene (1) are also essential primary metabolic precursors in seemingly all eukaryotic kingdoms. A terminological distinction is often made (particularly in phytology) between triterpenes implicated in secondary metabolic pathways and those originating from primary metabolism, the latter being commonly referred to as sterols.4 This distinction also has chemical significance, since sterols fundamentally differ in the stereochemical configurations of their ring architectures. This is exemplified by comparison of the two constitutionally identical tetracyclic compounds lanosterol (4)7–9 and euphol (5).10 These two compounds differ only in the configuration of three (out of seven) stereogenic centres. This is most strikingly evident in the opposite relative configuration of the attachment of the methyl groups at the C/D trans ring junctions (Fig. 1).
The structural comparison between lanosterol (4) and euphol (5) is of historical significance, as it was attempts to rationalise the differing stereochemical outcome cemented in the final scaffolds of this pair of natural products that would eventually result in the 1950s postulation of two distinct and fundamental cyclisation routes.1–3,11,12 It was proposed that in the case of lanosterol (4), the series of initial electrophilic additions resulting in the first three ring-forming annulations proceed via a chair–boat–chair conformational folding. Conversely, the equivalent steps in the formation of euphol (5) proceeds via the less sterically demanding chair–chair–chair conformation. Subsequent formation of the D rings results in two distinct tetracyclic carbocations, now termed the protosteryl (2) and dammarenyl (3) cations respectively (Fig. 1).1,4 All naturally occurring triterpenes alcohols known at that time were subsequently reasoned to originate from one of these two key intermediates.
Over the proceeding decades, many more triterpene and sterol scaffolds presumed to arise from the cyclisation of 2,3-oxidosqualene (1) have been isolated from nature, and consistently their formation has been rationalised on the basis of this dichotomous pair of cyclisation pathways. The precise intricacies of these cyclisation mechanisms, and their enzymatic control, is still an active area of research.13–15 However, it is now a well-established and perhaps dogmatic presumption that to date all naturally occurring tetra- and penta-cyclic triterpenes derived from 2,3-oxidosqualene (1) can be rationalised by, or predicted to arise from, the eventual resolution of either the protosteryl cation (2) or the dammarenyl cation (3). The differential resolutions of these two cations, are typically proposed to proceed via further 1,2-hydride and alkyl shifts, with or without additional ring-forming or -expanding rearrangements. The reactions are typically terminated by vicinal hydrogen elimination to form an alkene, although water capture (resulting in diols) and more complex fragmentations are also observed.1,16
This interpretation of triterpene biosynthesis is now so well-established that it is generally considered a prerequisite during structural characterisation that novel triterpenes should comply with this stereochemical conformity,3 a view which is presented in many authoritative reviews. Indeed, this compliance was exhaustively presented for all structures known in 2004 by Xu et al.1 A historical perspective on the formulation of the biogenic isoprene rule, and its important impact over the proceeding 50 years was given by two of the original authors, Eschenmoser and Arigoni, in 2005.3 Some of the more unusual structures rationalised as post-cyclisation fragmentation products of enzymes exploiting the protosteryl (2) or dammarenyl (3) cations were included in a 2008 review by Domingo et al.16 To illustrate these points, we present here a small selection of key structures and their relation to either the protosteryl (2) or dammarenyl (3) cation, before going on to highlight the fundamental significance of a novel OSC product reported in 2018 which appears to be the first example of a triterpene which clearly breaks this dichotomy. This discovery raises the possibility that nature utilises a wider range of chemical space in the biosynthesis of polycyclic triterpenes than is currently presumed. It also provides direct experimental evidence to support consideration of more flexibility in the stereochemical interpretation of the biogenic isoprene rule.
Lanosterol (4) (whose derivation from the protosteryl cation (2) has already been discussed in the Introduction and Fig. 1) is one of the best studied sterols, being the first committed step in the biosynthesis of cholesterol22 and the steroid hormones in mammals.23 It is one of the few structures for which the proposed cyclisation mechanism has been well studied, and that is extensively supported by direct experimental and computational evidence spanning many decades.24,25 This includes confirming the existence of the protosteryl cation (2) through trapping of its ring structure in lanosterol synthase feeding experiments with substrate mimics.26 This work also supported a β-configuration of the protosteryl cation (2) sidechain contrary to the originally proposed structure which predicted this to be in the α-configuration.2
Lanosterol (4) is also produced in some higher plants,27 but the main protosteryl-derived tetracycles in this kingdom are cycloartenol (7)28,29 and parkeol (8).30 Both compounds can be derived from the same C9 tetracyclic cation (6), which results after a series of 1,2-hydride and alkyl shifts from the protosteryl cation (2) (Fig. 2a). In the case of parkeol (8), a C11 hydrogen elimination gives the C9–C11 alkene. Whereas, for cycloartenol (7), a hydrogen elimination at the neighbouring methyl group can be invoked to afford the characteristic cyclopropane ring (Fig. 2a).
![]() | ||
Fig. 2 (a) Key protosteryl-derived compounds. (b) Key dammarenyl-derived compounds. *Deprotonation steps may be spontaneous in same cases. |
Pentacycles are also known to originate from the protosteryl cation (2), with isoarborinol (11)31 being one such example. D-ring expansion of the protosteryl cation (2) through C13–C17 bond migration gives a 6,6,6,6 tetracyclic cation (9). Subsequent ring closure affords the 6,6,6,6,5 pentacyclic C22 cation (10). A series of 1,2-hydride and alkyl shifts, terminating in C11 hydrogen elimination gives the alkene (Fig. 2a).
Lupeol (14)34 and β-amyrin (16)35,36 are perhaps the archetypal pentacyclic triterpenes derived from the dammarenyl cation (3). They share a common origin following D-ring expansion via C16–C17 bond migration. Annulation by nucleophilic attack of the side-chain alkene forms the E-ring giving the 6,6,6,6,5 C20 cation (13). In the case of lupeol (14), vicinal hydrogen elimination from one of the geminal methyl groups affords the characteristic terminal alkene of lupeol (14). For β-amyrin (16), E-ring expansion, followed by 1,2-hydride shifts terminating in C12 hydrogen elimination gives the 6,6,6,6,6 oleanane scaffold (so named due to the most common oxidised form of β-amyrin (16), the C28 acid, oleanolic acid35) (Fig. 2b).
C13–C17 bond migration analogous to that seen in the formation of the isoarborinol-type compounds from the protosteryl cation (2) is also seen with the dammarenyl cation (3). Motiol (19)37,38 is an example of a 6,6,6,6,5 triterpene derived from this route. Again E-ring formation is followed by a series of 1,2-hydride and alkyl shifts, terminated in C7 hydrogen elimination to give the alkene (Fig. 2b).
This unprecedented discovery could represent a transformative divergence from the long-established choice between the expected protosteryl (2) and dammarenyl (3) cation cyclisation routes, as production of orysatinol (48) is not easily rationalisable from either of these cations. Conventional reasoning dictates that the methyl groups at the C13 and C14 positions of orysatinol (48) originate from the C14 and C8 positions of the progenitor tetracyclic C20 cation (44) (Fig. 4a). This is the result of a series of suprafacial 1,2-shifts (retaining the stereochemical configuration of the migrating bond) from the pentacyclic C22 cation (46); a rearrangement that is analogous in sequence to the previously described formation of the dammarenyl-derived motiol (19) (Fig. 2b), and protosteryl-derived isoarborinol (11) (Fig. 2a). As both of these methyl groups are in the β-configuration, the progenitor tetracyclic C20 cation (44) cannot be either the protosteryl (2) or dammarenyl (3) cation without invoking a stereochemical conversion of the C8 originating methyl group in proceeding steps (Fig. 4a). Therefore, we propose that orysatinol (48) originates from a novel hitherto unpredicted tetracyclic cation, the orysatinyl cation (44), distinct from the protosteryl (2) and dammarenyl (3) cations (Fig. 4a).
Mechanistic reasoning would suggest that the orysatinyl cation (44) should be a C8 epimer of the protosteryl cation (2) (Fig. 4a). This is supported by site-directed mutagenesis work52 which report the functional conversion of a committed parkeol synthase to a variant producing orysatinol (48). Further support comes from phylogenetic analysis suggesting that the wild type orysatinol synthase shares a common (parkeol (8) producing) ancestor with the committed parkeol synthase. Formation of the orysatinyl cation (44) would be expected to occur during initial formation of the C-ring. It is now believed that the 6,6,6 tricyclic precursor to the protosteryl cation (51), proceeds via a 6,6,5 intermediate (50) by a Markovnikov-type addition followed by C-ring expansion (Fig. 4b).54 Attack of the alkene at different faces of the bicyclic C8 cation (49) would result in differential production of either the classical protosteryl cation (2) or the orysatinyl cation (44) (Fig. 4b). In the proposed orysatinyl cation (44), both the C8 and C14 methyl groups are in the β-configuration, explaining the resulting cis C/D ring junction of orysatinol (48). This would also imply that the 1,2-shifts of the C8, and C14 methyl groups are not concerted, as both methyl groups must migrate across the same face of the compound. This is not an unreasonable assumption as orysatinol synthase (like many OSCs) is a multifunctional enzyme producing several side-products, and this would suggest intermediately cations must be available.
If wide-spread in nature, this novel third cyclisation route would potentially widen the theoretical scope of natural triterpenoid diversity. This is exemplified by the interesting structure of a novel side product of the wild-type orysatinol synthase enzyme, orysaspirol (55). NMR analysis suggests that orysaspirol (55) possesses a highly unusual spirocyclised B/C ring junction, and a seco-A-ring of the type observed in the partially cyclised triterpene marneral.55 Generation of orysaspirol (55) can be rationalised from the same pentacyclic C8 parent cation (54) as orysatinol itself, where instead of a C7 elimination to give the alkene, an alternative ring contraction, proceeded by Grob fragmentation gives the spiro-ring junction and seco-A-ring (Fig. 5). It is only plausible to postulate such a mechanism due to the novel stereochemistry of the orysatinol C8 cation (54). This stereochemistry results in the C9–C11 sigma bond being appropriately orientated relative to the empty p orbital of the C8 carbocation, such that it is reasonable to propose that a ring contraction might be possible (Fig. 5) since the migrating C9–C11 sigma bond takes up the same stereochemical orientation as the C–H sigma bond in the well-established rearrangement of the dammarenyl cation (3) to form motiol (19). Furthermore, this route might be expected to be entropically favourable because it initiates the fragmentation cascade which opens the A-ring, and possibly thermodynamically favourable because it could relieve steric strain imposed by the cis C/D ring junction. In the case of the dammarenyl-derived motiol (19), it is not reasonable to propose an equivalent ring contraction, and this is evident by the existence of isomotiol (57)56 which represents an alternative resolution of the motiol parent C8 cation (56) through a C9 hydrogen elimination instead of the C7 hydrogen elimination observed in the formation of motiol (19) itself (Fig. 5).
The isolation of novel triterpene prototypes from nature has been a major contributor in expanding the understanding of the cyclisation process of 2,3-oxidosqualene (1). Orysaspirol (55) is a very minor side product of the wild-type orysatinol synthase enzyme, which was isolated as a result of synthetic biological efforts. It seems unlikely that this product would ever have be isolated though speculative prospecting of the rice plant through traditional natural product extraction workflows. The isolation of the orysaspirol (55), therefore highlights the potential and likely future impact of synthetic biology in the field of natural product chemistry. The stereochemical assignment of orysatinol (48), proven by X-ray crystallography, clearly provides direct experimental evidence to justify more flexibility in the stereochemical interpretation of the biogenic isoprene rule; a rule that has served as a powerful tool to aid structural determination of new triterpene natural products for decades.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c8np00096d |
This journal is © The Royal Society of Chemistry 2019 |