Lauren J.
Rajakovich
and
Emily P.
Balskus
*
Department of Chemistry and Chemical Biology, Harvard University, USA. E-mail: balskus@chemistry.harvard.edu
First published on 19th November 2018
Covering: up to the end of 2017
The human body is composed of an equal number of human and microbial cells. While the microbial community inhabiting the human gastrointestinal tract plays an essential role in host health, these organisms have also been connected to various diseases. Yet, the gut microbial functions that modulate host biology are not well established. In this review, we describe metabolic functions of the human gut microbiota that involve metalloenzymes. These activities enable gut microbial colonization, mediate interactions with the host, and impact human health and disease. We highlight cases in which enzyme characterization has advanced our understanding of the gut microbiota and examples that illustrate the diverse ways in which metalloenzymes facilitate both essential and unique functions of this community. Finally, we analyze Human Microbiome Project sequencing datasets to assess the distribution of a prominent family of metalloenzymes in human-associated microbial communities, guiding future enzyme characterization efforts.
The intriguing relationship between the gut microbiota and its host elicits fundamental questions about the composition of this community in both health and disease states, the gut microbial functions that influence host biology, and ultimately, the ways that this knowledge can be leveraged to improve human health. Over the last decade and a half, the application of next-generation sequencing has significantly advanced the field toward addressing the first point.11 Taxonomic profiling of gut microbial communities has shown that only two bacterial phyla, Bacteroidetes and Firmicutes, typically dominate this habitat in healthy western populations.12,13 However, the bacteria within these two phyla that constitute the community have been found to vary greatly in healthy populations depending on geography, age, and environmental factors.14,15 These discoveries have dispelled the concept of a core community of gut bacteria that are characteristic of a healthy person. Instead, the focus has shifted toward defining conserved microbial functions that promote host health or affect disease susceptibility.16
The metabolic potential of the gut microbiota greatly surpasses that of the host as its genetic content exceeds that of the human genome 150-fold.17,18 Consequently, gut microbes produce a vast set of small molecules that are often chemically distinct from those generated by host metabolism.19 The nature of these metabolites and their abundance can vary substantially between individuals depending on the composition of their gut microbiota and dietary intake.20 In many cases, metabolites produced by the host and the gut microbiota are exchanged and can be co-metabolized to generate molecules that can have unique consequences in the context of the human body.21 Indeed, many microbially-produced metabolites have been strongly correlated with human disease.22 Thus, the chemical output of microbial metabolism is becoming recognized as a critical component of human health and disease. However, we currently lack a molecular understanding of gut microbial metabolic activities. Bacteria from this habitat are often difficult to cultivate in the laboratory. In addition, bacteria of the same genus and even from the same species can have substantially divergent metabolic capabilities. Therefore, taxonomic profiling, which at present is limited to genus level assignments, is insufficient to provide an accurate assessment or prediction of the metabolic functions of the gut microbiota.
The identification of specific organisms, genes, and enzymes responsible for metabolic functions of interest will improve our understanding of the metabolic potential of the gut microbiota and its role in host health and disease. Metagenomic, metatranscriptomic and metaproteomic approaches are currently used to profile the gut microbiota on a community level. Ideally, specific genes, transcripts, or proteins could be used as indicators in complex meta'omic data to predict the presence of a metabolic function and potentially serve as a personalized read-out of health status. Microbial enzymes could even be direct targets for therapeutics.23,24 However, the present challenge to leverage this type of analysis or manipulate gut microbial functions is that only ∼50% of genes from human stool metagenomes can be assigned a broad functional annotation and less than half of that 50% can be given more descriptive annotations.25 This gap in knowledge limits our understanding of the metabolic functions of the gut microbiota and the molecular basis of the diseases linked to this community. This vast area of unexplored biochemical space creates a need, as well as great opportunity, for biochemists and chemical biologists to contribute to this exciting field.
Metalloenzymes are an important class of enzymes for all domains of life. In enzyme catalysis, metallocofactors enable unique molecular rearrangements and transformations of relatively chemically inert molecules. In particular, anaerobic microbes often use metalloenzymes to promote radical catalysis, which presents challenges in aerobic environments due to the reactivity of dioxygen with organic radical intermediates.26 In the anoxic environment of the human colon, anaerobic bacteria deploy metalloenzymes to perform many interesting metabolic functions. These metal-dependent transformations can enable gut microbes to access nutrients or energy from alternative substrates or to generate molecules that can have important consequences for the host. This review describes unique metabolic functions of the human gut microbiota that involve metalloenzymes. Rather than providing a comprehensive account of all extant metalloenzymes employed by gut microbes for housekeeping functions, we will focus on enzymatic transformations that enable gut microbes to interact with their environment, pathogens, ingested compounds (dietary molecules and xenobiotics), and the human host. Finally, we evaluate the prevalence and distribution of the major families of metalloenzymes across the healthy human microbiota, with the goal of inspiring further investigation into characterized and uncharacterized microbial enzymes that play important roles in human biology.
Gut microbes produce specialized enzymes for reducing both host and dietary polysaccharides into smaller units for energy.34 For example, Bacteroides thetaiotaomicron (B. theta) is one of the most prevalent and prominent members of the gut microbiota in average healthy adult humans (found in 46% of individuals from the HMP cohort at a mean abundance of 0.6%)13,35 and excels at carbohydrate metabolism.36–38 Nearly 20% of its genome is dedicated to degradation of these macromolecules,37 allowing B. theta to access carbohydrates from various sources depending on their availability. Transcriptional responses to available carbohydrates in mice colonized with B. theta revealed the ability of this bacterium to prioritize transient dietary resources when available and forage alternative resources (e.g., mucin) when ingested polysaccharides have been depleted.39 This expanded metabolic repertoire may explain why B. theta is a prominent colonizer of the healthy human gut. Indeed, deletion of only a small subset of the carbohydrate metabolism genes impaired the ability of B. theta to persist in the gut of a mouse model and to be transmitted vertically to offspring.37
The carbohydrate active enzymes of gut microbes act upon polysaccharides that are diverse in monosaccharide composition, linkages, and modifications.30,32,40 Metalloenzymes in particular help with the degradation of polysaccharides containing two common modifications: (i) sulfation and (ii) fucosylation. These modifications are especially abundant on glycans that make up the mucus layer of the GI tract.
Fig. 1 Maturation of gut microbial sulfatases by the radical SAM anSMEs. (a) Sulfate removal from an N-acetyl-glucosamine-6-sulfate substrate by sulfatases harboring an formylglycine (fGly) cofactor. (b) Post-translational modification of a sulfatase L-serine or L-cysteine residue to an fGly residue that is hydrated to the gem-diol active form. (c) Depiction of the anaerobic sulfatase maturating enzyme (PDB accession code: 4K38) with SAM coordinating the radical SAM cluster and a peptide substrate analogue (dark grey sticks) bound in the active site. (d) Reductive cleavage of S-adenosylmethionine (SAM) by the radical SAM [4Fe–4S]1+ cluster to generate 5′-deoxyadenosine radical (5′dA˙) and L-methionine. (e) Proposed mechanism for fGly generation by anSMEs. |
Sulfatases are ubiquitous throughout all domains of life and can be delineated into three main groups: the Zn-dependent alkylsulfatases, the Fe-dependent dioxygenase sulfatases, and the formylglycine-dependent sulfatases.54,55 The latter class constitutes the largest group, and its members are frequently encoded in the genomes of gut microbes.48,55 These sulfatases employ a unique post-translational Cα-formylglycine (fGly) modification that is critical for activity (Fig. 1b).56 The hydrated gem-diol form of the fGly cofactor (Fig. 1b) has been observed in crystal structures of both eukaryotic and prokaryotic homologs,57,58 implicating it as the catalytically relevant form. The proposed mechanism involves nucleophilic attack of a deprotonated alcohol of the gem-diol onto the sulfur atom of the sulfate monoester, eliminating the attached sugar and forming a sulfated-enzyme intermediate.59,60 Deprotonation of the second hydroxyl group would then promote release of sulfate from this tetrahedral intermediate, regenerating the aldehyde cofactor.
The catalytic fGly residue originates from either an active site cysteine (in eukaryotes56,61 and prokaryotes62–64) or serine (in prokaryotes63,65,66) residue that is post-translationally modified by a separate maturase enzyme (Fig. 1b). The type of maturase found in eukaryotes and some prokaryotes, termed formylglycine-generating enzyme (FGE), is dioxygen-dependent and operates through an acid–base mechanism without the use of a cofactor.61,67,68 In prokaryotes, two other types of maturases have been identified,64,66,69,70 the best studied of which are the bacterial anaerobic sulfatase-maturating enzymes (anSMEs). The anSMEs are commonly found in the anaerobic bacteria that inhabit the human gut. This class of maturase is oxygen-sensitive and uses radical chemistry to generate the fGly cofactor.64,69,71 AnSMEs belong to the large superfamily of radical S-adenosylmethionine (SAM) enzymes, which harbor an essential [4Fe–4S] cluster within a β-barrel structural domain.72 Three iron ions of the cluster are ligated by cysteine residues in a strictly conserved CX3CX2C sequence motif, and the fourth iron ion is coordinated by the enzyme co-substrate, SAM (Fig. 1c).72 The canonical function of this metallocofactor is to reductively cleave the 5′-C–S+ bond of SAM (Fig. 1d) to generate the potent one-electron oxidant, 5′-deoxyadenosyl radical (5′-dA˙), which then initiates subsequent radical chemistry.72–75 In the anSMEs, the 5′-dA˙ abstracts the pro-S-Cβ-hydrogen76,77 of the target sulfatase Cys/Ser residue located in close proximity to the radical SAM cluster (Fig. 1c), generating a Cβ-centered radical (Fig. 1e). Deprotonation of the residue's thiol or alcohol group is proposed to result in a transient radical anion intermediate that is oxidized to the (thio)aldehyde (Fig. 1e). In the case of cysteine modification, hydrolysis of the thioaldehyde would yield the final fGly functional group.
The anSMEs are members of a subclass of the radical SAM superfamily comprised of proteins that have a C-terminal “SPASM” (subtilosin, PQQ, anSME, mycofactocin) domain.78 SPASM domains possess additional cysteine-rich sequence motifs, which coordinate at least one auxiliary [2Fe–2S] or [4Fe–4S] cluster.78 The postulated roles of these clusters differ from the canonical function of the radical SAM iron–sulfur cluster because they are not known to catalyze cleavage of SAM. Instead they are purported to mediate electron transfer during catalysis and/or aid in substrate positioning and activation. In the case of the anSMEs, Mössbauer-spectroscopic characterization and crystal structures of both Cys- and Ser-type anSMEs determined that the SPASM domain harbors two additional [4Fe–4S] clusters.77,79,80 Their positions in the protein structure, one close to the radical SAM cluster (Fig. 1c) and the other near the protein surface,80 suggest that they constitute an electron transfer relay pathway to mediate the final oxidation of the radical anion intermediate to the product (Fig. 1e). This relay ultimately shuttles the electron to acceptors in solution that can reduce the oxidized radical SAM [4Fe–4S]2+ cluster to enable multiple turnovers, as is observed in vitro.77,79
The crystal structure of the anSME AtsB in the presence of a peptide substrate analogue (Fig. 1c) demonstrated the mode of binding of the sulfatase polypeptide,80 and suggested a possible rationale for substrate promiscuity. The majority of substrate peptide contacts to the anSME protein are established through the peptide backbone, instead of the residue side chains.80 Though the sequence immediately following the target Cys/Ser has a distinct pattern [(C/S)X(P/A)XR],81 this mode of interaction would enable recognition of diverse peptide sequences for maturation. This promiscuity appears to be biologically important as analysis of bacterial genomes often reveals numerous sulfatases, but only a few or just a single anSME.82 Indeed, in B. theta, mutation of the single anSME present in its genome dramatically reduced the ability of the organism to utilize sulfated mucin and glycosaminoglycan carbohydrates as energy sources for growth in vitro.82 This deficiency manifested in vivo as reduced fitness to persist or colonize in the gut of a germ-free mouse when introduced in competition with or after colonization of the wild-type strain, respectively.82 Thus, disruption of this single, metallocofactor-containing maturase enzyme dramatically altered the metabolic capabilities of this organism that allow it to thrive in the gut environment.
The L-fucose sugars released from polysaccharides can have multiple fates depending on the degrading organism and the environmental conditions. One of the pathways, termed the “fucose utilization” (fuc) pathway, starts with steps similar to those of glycolysis, involving aldol cleavage to yield lactaldehyde and dihydroxyacetone phosphate.93 The lactaldehyde product is then reduced to (S)-1,2-propanediol (Fig. 2a),93 which can be further metabolized either oxidatively or reductively. In the oxidative pathway, known as the “propanediol utilization” (pdu) pathway, (S)-1,2-propanediol is converted via the intermediate propionaldehyde to the short-chain fatty acid propionate (Fig. 2a).94 Short-chain fatty acids are common products of gut microbial fermentation of carbohydrates and amino acids and have been noted for their beneficial effects on host biology.95–98 Most notably, they can serve as an energy source for colonic epithelial cells, contributing ∼10% of the human daily caloric requirement.99 In addition, short-chain fatty acids can serve as health-promoting signaling molecules in host cells. Propionate, in particular, has been connected with beneficial health effects, such as lowering lipid biosynthesis and serum cholesterol levels, and reducing carcinogenesis.100
The penultimate step in the metabolism of L-fucose to propionate is the dehydration of (S)-1,2-propanediol to propionaldehyde (Fig. 2a). This reaction is catalyzed by two different microbial enzymes that each requires a metallocofactor for either enzyme activation or catalysis. The B12-dependent (S)-1,2-propanediol dehydratase (PduC) has been extensively characterized since its identification over 50 years ago.101–104 PduC exists as a dimer of heterotrimeric subunits in an α2β2γ2 stoichiometry with the vitamin B12 (cobalamin) cofactor located at the αβ interface of each monomer.105 The cobalamin cofactor consists of a tetrapyrrole corrin ring structure, which provides four equatorial nitrogen ligands to coordinate a cobalt ion at the center (Fig. 2b).103,106–108 Spectroscopic109,110 and structural105,111,112 characterization determined that a 5,6-dimethylbenzimidazole (DMB) moiety extending from the corrin ring coordinates the cobalt ion in the lower axial position in “base-on” form and a 5′-deoxyadenosine molecule occupies the upper axial position (Fig. 2b). In the resting state, the cobalt ion of the cofactor is in the 3+ oxidation state with a Co–C coordination bond to the 5′-deoxyadenosine (adenosylcobalamin).
The conserved first step in the mechanism of adensoylcobalamin enzymes is homolytic cleavage of this Co–C bond to generate a Co2+ center and 5′-dA˙ (Fig. 2c),104 the same radical oxidant generated by radical SAM enzymes. Many factors trigger activation of the Co–C bond, including cofactor and substrate binding to the enzyme active site, as well as kinetic coupling to the subsequent hydrogen atom abstraction step.113,114 In (S)-1,2-propanediol dehydratase, two different metal ions, a K+ ion and a Ca2+ ion, are thought to contribute to activation of the Co–C bond. An essential K+ ion binds in the protein active site near the adenine ring, inducing a protein conformational change and resulting in Co–C bond cleavage even in the absence of substrate.111,112,115 The substrate coordinates a Ca2+ ion via its two hydroxyl groups, thereby increasing its effective size and resulting in a larger energetic release upon binding,111,112,116 which could balance the energetic cost of Co–C bond cleavage.117
When the Co–C bond cleavage event occurs in the presence of substrate, the resultant 5′-dA˙ abstracts a hydrogen atom from the C1 position of (S)-1,2-propanediol to generate an α-hydroxyalkyl substrate radical (Fig. 2c).118 Based on the crystal structure, this C–H bond cleavage would necessitate rotation about the glycosidic bond to bring the 5′-dA˙ radical in close proximity to the target carbon.104,112,119 The substrate-based radical then undergoes hydroxyl group migration from the C2 position to generate a C1 gem-diol intermediate with a C2-centered radical (Fig. 2c).120 This migration is mediated by acidic and basic residues in the active site, as well as the aforementioned Ca2+ ion that interacts with the two hydroxyl groups of the substrate.111,112,116 This mechanism is consistent with the observed retention of the 18O-label in the product when using a C2-18O-isotopically labeled substrate.120,121 The C2-centered substrate radical then abstracts a hydrogen atom from the same 5′-deoxyadenosine molecule that initiated chemistry,122,123 regenerating the 5′-dA˙ that can reform the Co–C bond with concomitant oxidation of Co2+ to Co3+ (Fig. 2c). Finally, dehydration of the C1 gem-diol intermediate yields the final propionaldehyde product.
Recently, another enzyme from a different superfamily was discovered to catalyze the same (S)-1,2-propanediol dehydration reaction.124,125 Transcriptional analysis of the gut microbe Roseburia inulinivorans grown on L-fucose as a carbon source noted the absence of a B12-dependent (S)-1,2-propanediol dehydratase, PduC.126 Instead, a member of the glycyl radical enzyme (GRE) superfamily was observed in a gene cluster that was up-regulated during L-fucose growth.126 GREs utilize an active site glycyl radical to initiate radical chemistry.127 This radical is installed post-translationally by a dedicated partner activating enzyme (GRE-AE) that belongs to the radical SAM superfamily.128 The canonical [4Fe–4S]1+ cluster in the activating enzyme reductively cleaves SAM to generate the 5′-dA˙ oxidant, which abstracts a hydrogen atom from the active site glycine in the partner GRE protein (Fig. 2d).127,128 The glycyl radical is then thought to react with a nearby conserved and essential cysteine residue to generate a thiyl radical that acts on the substrate.127,129 In the (S)-1,2-propanediol dehydration reaction, the thiyl radical is proposed to abstract a hydrogen atom from the C1 position of the substrate (Fig. 2e),124 in a step analogous to the C–H bond cleavage by the 5′-dA˙ in the B12-dependent enzyme. However, the mechanisms of the GRE and the B12-dependent dehydratases are thought to diverge from this point. Instead of hydroxyl group migration, the resultant C1-based radical intermediate in the GRE mechanism is postulated to undergo a spin center shift to the C2 position. In this mechanism (Fig. 2e) supported by computational studies,126,127 deprotonation of the C1 hydroxyl group initiates elimination of the C2 hydroxyl group with concomitant formation of the C1 aldehyde and radical migration (Fig. 2e).130,131 This elimination mechanism is consistent with the observed loss of the 18O-label in the product when using a C2-18O-isotopically labeled substrate.121 Hydrogen atom abstraction by the C2 alkyl radical from the catalytic cysteine thiol regenerates the thiyl radical and forms the propionaldehyde product. At the end of each cycle, the radical migrates back to the initial conserved glycine residue.
These two enzymes represent a striking example of convergent enzyme evolution. They carry out identical dehydration reactions of (S)-1,2-propanediol, yet have different mechanisms to generate the initial radical and stabilize different substrate-based radical intermediates during catalysis. One feature they share is the ability to recycle or store the radical oxidant that activates the substrate, enabling catalytic turnover. However, the protein-based radicals of GREs and the metallocofactor of their activating enzymes are extremely sensitive to oxidation and enzyme inactivation by dioxygen or reactive oxygen species.122,128,129 Consequently, expression of the GRE (S)-1,2-propanediol dehydratase and fermentation of (S)-1,2-propanediol are induced in low-oxygenic or anaerobic conditions in the gut environment.126 In contrast, the B12-dependent enzyme is less vulnerable to oxidation,94 which may allow for gut microbes possessing this enzyme to utilize L-fucose under conditions of host intestinal inflammation when oxygen levels are higher.132,133 Interestingly, the B12-dependent dehydratases are primarily found in opportunistic enteric pathogens, including Salmonella and Klebsiella, perhaps enabling their expansion in the inflamed gut.134 Indeed, the expression of the B12-dependent enzyme in Salmonella enterica serovars Typhimurium confers a fitness advantage and is considered a genetic determinant of pathogenicity.135 Conversely, the GREs have been identified more often in commensal organisms of the Clostridia class.124,125 This observation suggests the possibility of targeting the pathogen-associated B12-dependent dehydratases to impair their ability to utilize L-fucose, without disrupting L-fucose metabolism by commensal organisms.
One large class of antibiotics produced by gut microbes is the bacteriocins, which are ribosomally synthesized and post-translationally modified peptides (RiPPs).144–147 These natural products are encoded by a precursor gene that is ribosomally translated to yield a short peptide, typically consisting of 20–100 amino acids.144 This precursor peptide consists of a core sequence and a short leader sequence that is cleaved following post-translational modifications of the core peptide to yield the final mature natural product.146 These modifications result in unique structural features that serve to classify the bacteriocin RiPPs into smaller subgroups, which have been comprehensively reviewed.144 Bioinformatic analyses of human-associated microbial genomes and metagenomes showed that they encode an array of bacteriocins representing all structural subclasses.148–152 While many putative RiPP gene clusters mapped to bacteria in the GI tract,151,152 an even higher abundance were identified from microbes residing in other sites of the human body, with the majority in the oral and vaginal cavities.152 Collectively, these analyses underscore the competitive microbial ecological niches on and in the human body and suggest that they could be untapped reservoirs of natural product antibiotics.
Sactipeptides (sacti- = sulfur-to-alpha-carbon), also known as sactibiotics, are a growing family of bacteriocins with a defining thioether peptide linkage that is installed by a metalloenzyme.144 Inhabitants of the human gut produce sactipeptides that exhibit antibiotic activity against enteric pathogens. These peptides are thought to act by creating pores in the bacterial membrane, causing an influx of ions that leads to membrane depolarization and cell death.153,154 Subtilosin A, produced by strains of Bacillus subtilis, was the first sactipeptide to be identified and characterized (Fig. 3a).155–157 Its biosynthetic gene cluster has been detected in stool metagenomic samples, as well as the genomes of microbes from various habitats.151,152 Subtilosin A demonstrates relatively broad bacteriocidal activity against Gram-positive pathogenic bacteria implicated in bacterial vaginosis, food poisoning, and hospital infections, including Gardnerella vaginalis, Listeria monocytogenes, Enterococcus faecium, and Staphylococcus aureus.153,158,159 Other sactipeptides have been isolated exclusively from gut microbes, such as thuricin CD154,160–162 (Fig. 3a) and thurincin H163,164 (Fig. 3a) that are produced by strains of Bacillus thuringiensis, and ruminococcin C made by the commensal bacterium Ruminococcus gnavus.165–167 The latter demonstrates narrow antibiotic activity toward Clostridium perfringes,168 a bacterium that causes food-borne illnesses. The thuricin CD product is a peptide heterodimer (Trnαβ) that displays antibiotic activity against the infectious gut pathogen C. difficile.160,161 Importantly, whereas other broad spectrum antibiotics commonly used in C. difficile treatment cause major disruption to the rest of the gut microflora, thuricin CD does not alter the community composition due to its narrow spectrum activity, yet is equally as effective against C. difficile.161 The use of these narrow-spectrum antibiotics against enteric pathogens could improve outcomes and mitigate adverse effects, including infection reoccurrence, caused by destroying the resident commensal gut microbiota.
Fig. 3 Sactipeptide RiPP natural products synthesized by commensal gut microbes contain metalloenzyme-installed thioether linkages. (a) Structures of sactipeptides produced by human-associated bacteria [PDB accession codes: subtilosin A, 1PXQ; thurincin H, 2LBZ; thuricin CD, 2L9X and 2LA0]. Leader sequences are shown in blue and brackets indicate thioether linkages. (b) Mechanistic proposals for thioether bond formation invoking an iron–sulfur cluster activated thiol (radical scheme) or a ketoimine intermediate (polar scheme). (c) The CteB (PDB accession code: 5WGG) active site depicting one of the SPASM domain auxiliary clusters located in close proximity to the SAM substrate bound to the radical SAM cluster. |
The unique thioether structural motif common to all sactipeptides arises from a C–S bond-forming reaction between the thiol group of a cysteine residue and the α-carbon of a variable amino acid residue in the peptide.144 While thioether bonds are also found in lantibiotics (another subclass of bacteriocin RiPPs), those linkages are formed from conjugate addition between a cysteine thiol and a dehydroalanine or dehydrobutyrine residue, which is mediated by enzymes that employ acid–base chemistry.144 In contrast, generation of the thioether linkages in sactipeptides requires radical chemistry to enable hydrogen atom abstraction from the amino acid α-carbon. Sactipeptide biosynthetic gene clusters invariantly include a gene encoding for a radical SAM enzyme, leading to its suspected involvement in thioether-bond formation.169 The first thioether bond forming radical SAM maturase to be characterized was AlbA, the enzyme that modifies the sactipeptide subtilosin A.170 This single enzyme catalyzes the crosslinking of three different cysteine residues with the respective α-carbons of one threonine and two phenylalanine residues in the subtilosin A precursor peptide (Fig. 3a). Sactipeptides commonly have multiple thioether crosslinks with either stereochemical configuration. Thus, the mechanism of C–S bond formation by radical SAM sactipeptide maturases and the basis for their regio- and stereoselectivity have been points of intrigue.
AlbA and the other identified sactipeptide maturases are members of the aforementioned SPASM-domain subgroup of radical SAM enzymes. AlbA harbors one extra [4Fe–4S] cluster in its SPASM domain,170 whereas other recently described maturases harbor two auxiliary [4Fe–4S] clusters.171,172 As expected, the radical SAM [4Fe–4S] cluster alone is competent and sufficient for reductive SAM cleavage to generate the 5′-dA˙ intermediate.170,171 This oxidant is proposed to perform the initial hydrogen atom abstraction from the α-carbon of the target amino acid. Conversely, the SPASM [4Fe–4S] cluster is not essential for SAM cleavage, but is required for thioether bond formation.170 A recent crystal structure of the sactipeptide radical SAM maturase CteB revealed that, unexpectedly, only three of the four iron ions in the SPASM cluster were ligated with cysteine residues, while the fourth site remained open.172 This unsaturated coordination sphere contrasts with the full cysteine coordination observed for the SPASM domain clusters in the anSMEs, which act as electron transfer mediators, perhaps suggesting another role for this cluster.
An additional structure of CteB solved in the presence of a truncated peptide substrate provided potential insights into the role of the SPASM cluster and its open coordination site in thioether-bond formation.172 In this structure, a cysteine from the peptide substrate (albeit not the thioether bond-forming cysteine) coordinates the open site of the SPASM auxiliary cluster, which is located <11 Å from the 5′-position of SAM (Fig. 2c).172 Prior to this structure, such a direct interaction had been postulated based on perturbations in the UV/visible absorption spectrum of the SPASM [4Fe–4S] cluster in AlbA in the presence of the peptide substrate.170 The authors proposed that the SPASM cluster could act as an electron acceptor and a Lewis acid to activate the coordinated thiol for oxidative coupling to the carbon-centered radical (Fig. 3b).170 There are mechanistic precedents for iron ion activation of a coordinated sulfur atom for oxidation and C–S bond formation. In the enzyme isopenicillin-N-synthase (IPNS), the cysteine thiol of the substrate tripeptide directly coordinates a mononuclear non-heme Fe3+ ion.173 This sulfur undergoes attack by a carbon-centered radical to form a C–S bond with concomitant one-electron reduction of the iron ion.173,174 In the case of the sulfur-inserting radical SAM enzyme BioB (described in detail in a later section), the activated sulfur atom is a sulfide bridge of an auxiliary [2Fe–2S] cluster. In this mechanism, a substrate-based alkyl radical attacks the bridging sulfide and reduces one of the Fe3+ ions through inner-sphere electron transfer (Fig. 5b).175,176 Extending this rationale to the sactipeptide maturases, radical coupling of the α-carbon radical to the iron-coordinated cysteine thiol would yield the thioether linkage with concomitant loss of an electron through inner-sphere electron transfer to the [4Fe–4S]2+ SPASM cluster (Fig. 3b).
Fig. 5 Microbial biosynthesis of the alternative tRNA base, queuosine, involves two metalloenzymes. (a) Complete biosynthetic pathway for queuosine production with the reactions catalyzed by metalloenzymes highlighted in green and blue boxes. (b) Mechanistic proposal for the final step in queuosine biosynthesis catalyzed by the Cbl-dependent epoxyqueuosine reductase, QueG. (c) Active site depiction of product-bound QueG (PDB accession code 5D0B) showing the open-form cobalamin (grey sticks) with the central cobalt ion (pink sphere), the two proximal [4Fe–4S] clusters (orange and yellow spheres), and the queuosine base (yellow sticks) of the bound tRNA mimic. (d) Active site depiction of substrate-bound QueE (PDB accession code 4NJH) showing the radical SAM cluster (orange and yellow spheres) with the co-substrate SAM (light grey sticks) bound and the substrate (yellow sticks) coordinating the essential Mg2+ ion (purple sphere). (e) Proposed mechanism for 7-deazapurine synthesis by the radical SAM enzyme, QueE. |
An alternative mechanism has been put forth that invokes a distinct oxidation event preceding C–S bond formation via polar chemistry.171 In this mechanism (Fig. 3b), the α-carbon-centered radical first undergoes one-electron oxidation, presumably via electron transfer to the SPASM [4Fe–4S]2+ cluster, to form a ketoimine intermediate. This electrophilic intermediate is then attacked by the deprotonated cysteine thiol to form the C–S bond.171 In this case, the observed cysteine coordination to the SPASM domain auxiliary cluster could be rationalized by a role in substrate positioning. The core sequences of sactipeptides often contain many cysteine residues, which either form crosslinks in a processive fashion or remain unmodified. Coordination of the additional, unmodified cysteines to the SPASM domain cluster could be serving to lock the substrate into the proper configuration such that both the target carbon and cysteine are positioned close to the radical SAM cluster.
Importantly, both of these mechanistic hypotheses satisfy the observation of both D- and L-stereochemistry for the thioether bonds in sactipeptide natural products. In some cases, a single peptide has both D- and L-linkages that are all installed by the action of the same maturase, as observed for subtilosin A.156,157,170 As the precursor peptides contain only L-amino acids, the lack of stereoretention at the α-carbon could support the formation of a ketoimine intermediate, which could be attacked on either planar face by the deprotonated thiol. Alternatively, this outcome could also arise from the planar character of the carbon-centered radical, which is stabilized by the captodative effect. Indeed, epimerization of amino acids in RiPP peptides is known to be promoted by radical SAM enzymes.177 These epimerases are proposed to generate a similar Cα-radical substrate intermediate that is quenched by a hydrogen atom from a protein residue on the opposite face to invert the Cα stereocenter, yielding a D-amino acid.177 Further structural and spectroscopic characterization of the auxiliary SPASM cluster in sactipeptide maturases in the presence of native peptide substrates will help to resolve these two mechanistic proposals and the precise role(s) of the SPASM clusters.
Regardless of the operant mechanism, both hypotheses imply precise positioning of the reacting cysteine residue with respect to the activated α-carbon to ensure the stereoselective outcome of each respective crosslink. The stereochemistry and location of the thioether crosslinks are expected to greatly impact the peptide structure due to its small size and therefore influence its bioactivity. The antibiotic activity of these sactipeptides, and, in particular, the promise of thuricin CD as a therapeutic agent against C. difficile infection, warrants further investigation into their structural properties and the enzymes responsible for thioether installation.
De novo pathways for vitamin and cofactor biosynthesis in microbes involve a variety of unusual and chemically challenging transformations, many of which are catalyzed by members of the radical SAM enzyme superfamily.188 For example, the futalosine biosynthetic pathway for vitamin K2 (menaquinone) and the biosynthesis of vitamin B1 (thiamine) require the action of radical SAM enzymes that catalyze C–C bond-forming reactions (Fig. 4). We direct the reader to a recent review for more information on the mechanisms of these enzymes, including ThiC, which catalyzes the complex rearrangement of 5-aminoimidazole ribonucleotide to form the pyrimidine moiety of thiamine.189 A second thiamine biosynthetic enzyme, tyrosine lyase (ThiH), catalyzes an elimination reaction that shares mechanistic similarities with the transformations catalyzed by radical SAM enzymes HydG, CofH, and NosL.190–192 In the proposed mechanism of ThiH, the 5′-dA˙ oxidant abstracts a hydrogen atom from the amine of L-tyrosine to generate a nitrogen-centered radical.188 The Cα–Cβ bond then undergoes β-scission to yield a one-electron oxidized p-cresol intermediate that gets reduced and protonated, while the dehydroglycine co-product feeds into thiazole biosynthesis (Fig. 4b).
Thiamine acts as an enzyme cofactor in essential pathways, such as branched-chain amino acid and carbohydrate metabolism, as well as in non-coenzymatic roles.193,194 Thiamine deficiency with systemic and neurological symptoms often occurs in infants and young children from tropical and impoverished regions.194 The diet is a major source of this vitamin, but a recent study using Drosophila as a model system195 has provided evidence for the long postulated role of the gut microbiota196 in provision of this nutrient to the host. In this model, axenic offspring that were unable to develop on a thiamine-deficient diet were rescued by introduction of a microbiota and even a single organism, Acetobacter pomorum.195 Indeed, human colonocytes express a thiamine pyrophosphate-specific transport that would allow for uptake of this vitamin by the host.184,197–199 Interestingly, variants in this gene have been identified as susceptibility markers for ulcerative colitis in Northern Indian populations, which are known to exhibit thiamine deficiency.200,201 This connection between thiamine uptake and ulcerative colitis remains to be explored further.
Another example of a radical SAM enzyme that participates in vitamin biosynthesis is one of the founding superfamily members, biotin synthase (BioB). BioB is responsible for the incorporation of a sulfur atom in the thiophane ring of the cofactor biotin (Fig. 4c) and belongs to a subgroup of radical SAM enzymes that all perform sulfur insertion reactions.202,203 Each of these enzymes possesses additional auxiliary [4Fe–4S] or [2Fe–2S] cluster(s) that serve as sacrificial donors of one or two sulfur atom(s) that are incorporated into the final product.202 The auxiliary cluster in BioB is a [2Fe–2S] cluster ligated by three cysteine residues and an unusual arginine residue.204–206 The mechanism of BioB (Fig. 4d) initiates with generation of a 5′-dA˙ at the radical SAM [4Fe–4S] cluster and subsequent abstraction of a hydrogen atom from the C9 position of the desthiobiotin substrate.207 The substrate-based radical then attacks a bridging sulfide of the auxiliary [2Fe–2S]2+ cluster, forming the first C–S bond with concomitant inner-sphere electron transfer to an Fe3+ ion of the cluster.175 This reaction yields a chemically competent 9-mercaptodethiobiotin intermediate species208 that remains cross-linked to the Fe3+ ion of the [2Fe–2S]1+ cluster.176 BioB then catalyzes another SAM cleavage event, generating a second 5′-dA˙ equivalent that abstracts a hydrogen atom from the C6 position of the substrate.207 The resultant substrate alkyl radical undergoes a similar reaction with the cross-linked sulfur to cyclize the substrate, which also reduces the second Fe3+ ion of the cluster to Fe2+.
Upon reduction and loss of sulfur to the product, the auxiliary cluster degrades, resulting in an inactive enzyme in vitro.209,210 However, evidence of catalytic activity in vivo211 suggested that additional components might enable cluster reassembly to allow for multiple turnovers in the cell. A recent report reconciled the issue of cluster degradation in vitro through study of another sulfur inserting enzyme, lipoyl synthase (LipA). Scaffold proteins NfuA and IscU, associated with iron–sulfur cluster biogenesis, were demonstrated to reassemble and insert a new auxiliary cluster into lipoyl synthase, rendering it capable of multiple turnovers in vitro.212 This type of cluster repair mechanism could very well extend to other sulfur inserting enzymes, like BioB, to enable their catalytic activity in vivo.
Biotin is an essential cofactor for carboxylase enzymes in pathways such as fatty acid biosynthesis, branched-chain amino acid catabolism, and gluconeogenesis.213 Deficiency of this vitamin in humans causes alopecia and skin dermatitis.214 A recent study using murine models demonstrated the impact of the gut microbiota on systemic host biotin levels and the display of alopecia phenotypes.215 A shift in gut microbiota composition consisting of a bloom of Lactobacillus murinus, a biotin auxotroph, was identified as the cause of low biotin levels in the host.215 This example represents an interesting case in which the gut microbiota is actively depleting the vitamin pool available to the host, an aspect of vitamin homeostasis that has not been adequately explored with respect to the gut microbiota.
Although the structure of queuosine had been known for decades,228,229 the complete pathway for its de novo biosynthesis in bacteria (Fig. 5a) has only been recently established.223,230 The pathway begins with conversion of GTP to 7,8-dihydroneopterin-3′-triphosphate by a GTP cyclohydrolase, analogous to the first steps in folate and biopterin biosynthesis. The unique 7-deazaguanine ring is synthesized from this intermediate through the action of the enzymes QueD and QueE, the latter of which is a radical SAM enzyme. The ATP-dependent enzyme QueC transforms the carboxyl group of the 7-deazaguanine to a nitrile in the PreQ0 intermediate. The NADH-dependent enzyme QueF then reduces the nitrile to an aminomethyl group, generating the PreQ1 intermediate. The enzyme tRNA guanine transglycosylase (TGT) then inserts the PreQ1 intermediate into the target tRNA, replacing guanine at position 34. The final two enzymes act on the PreQ1–tRNA complex to yield the queuosine–tRNA final product. The SAM-dependent enzyme QueA catalyzes transfer and isomerization of the ribose group from SAM to the aminomethyl of PreQ1 to form epoxyqueuosine–tRNA. Lastly, the enzyme QueG reduces the epoxide to generate the cyclopentene ring of queuosine.
The enzyme responsible for this final transformation remained elusive for many years, even following the elucidation of the rest of the pathway. It was finally discovered by screening a strain library of E. coli single gene knockout mutants for the accumulation of epoxyqueuosine and the absence of queuosine in isolated tRNA nucleotides.231 The protein identified, QueG, shares sequence homology to reductive dehalogenases, which utilize a cobalamin (vitamin B12) cofactor and multiple iron–sulfur clusters for catalysis. These enzymes belong to the class III group of cobalamin-dependent enzymes, which to date remain grossly under-characterized despite the amazing chemistry attributed to the few known members.232 Cofactor analysis of QueG by EPR spectroscopy and structural characterization confirmed the presence of a cobalamin cofactor with square-pyramidal geometry, existing in the “free-base” configuration (i.e., no lower axial ligand) with an upper axial water ligand.233–235 However, in the presence of substrate, this water ligand is displaced (Fig. 5c).235 The lack of an upper ligand is unique to enzymes of this class, in contrast to the other classes of cobalamin-dependent enzymes which have either a 5′-deoxyadenosyl or methyl axial ligand, and is proposed to have a key function in their mode of action.232,236
In the case of QueG, the “open-Cbl” form is predicted to enable formation of a covalent substrate adduct intermediate during catalysis.224,225 The proposed mechanism for QueG234,235 (Fig. 5b)230,231 begins with nucleophilic attack by the reduced Co1+ on the substrate epoxide to form a Co3+–C adduct and open the epoxide ring, which is facilitated by protonation of the oxygen atom. Single electron reduction of the alkyl–Co3+ species induces homolytic cleavage of the metal–carbon bond, formation of the alkene, and concomitant elimination of the hydroxyl group. Reduction of the resultant Co2+ center to regenerate the Co1+ state for another turnover is mediated by two [4Fe–4S] clusters positioned between the protein surface and the cobalamin cofactor (Fig. 5c).234,235 These clusters were determined to have sufficiently low reduction potential to reduce the low redox potential Co2+/1+ couple.235
Further insight into the mechanisms of substrate recognition and binding, as well as support for the proposed catalytic mechanism, were obtained from structural comparison of substrate-free QueG and a co-crystal structure with a short oligonucleotide tRNA mimic containing the queuosine product.235 QueG is composed of three structural domains: the N-terminal Cbl-binding domain, a ferredoxin-like fold that harbors the iron–sulfur clusters, and a triple HEAT-repeat domain that interacts with the tRNA substrate. The cyclopentenediol ring of the queuosine product is observed in the structure directly above the cobalamin cofactor with the target carbon at a distance of ∼4 Å (Fig. 5c),235 supporting the proposal of a metal–carbon adduct intermediate at the open axial position. In addition, a conserved aspartate residue is positioned in the active site approximately 3 Å from the double bound of the product cyclopentene (Fig. 5c), suggesting it could aid in substrate positioning through interaction with the epoxide oxygen and could serve as a catalytic acid. The unique mechanism of QueG formulated based on structural, mechanistic, and spectroscopic studies highlights the key role of the potent nucleophilic Co1+ reactant species. This reactivity could be exploited in the design of inhibitors for use as experimental tools to probe the role of queuosine in host biology.
Another metalloenzyme involved in the biosynthesis of this alternative nucleobase is the radical SAM enzyme QueE. This enzyme and its homologs catalyze an unusual pterin ring contraction/rearrangement reaction that generates the 7-deazapurine ring found in the tRNA bases queuosine and archeosine (found exclusively in archaea), as well as a number of natural products.230,237 The structure of QueE consists of a minimal TIM barrel fold that harbors the radical SAM [4Fe–4S] cluster via either the traditional CX3CX2C or an atypical CX14CX2C sequence motif.238 The 6-carboxy-5,6,7,8-tetrahydropterin substrate is bound in close proximity to the radical SAM cluster and is stabilized through coordination of an essential Mg2+ ion via its 4- and 6-carboxylate oxygen atoms (Fig. 5d).238,239 In the proposed mechanism (Fig. 5e), the 5′-dA˙ oxidant generated from reductive SAM cleavage initiates the reaction through abstraction of the substrate C6 hydrogen atom.239 The resulting substrate radical then rearranges to form a strained aziridine intermediate or transition state, which is supported by computational studies.240 This mechanism is reminiscent of the migration reactions catalyzed by radical SAM aminomutases. Alternatively, the pyrazine ring could open through β-scission of the N7–C4a bond to form an imine intermediate. In the next step, formation of the 5-membered deazapyrrole ring is proposed to yield an exocyclic amine with a nitrogen-centered radical. This amine-based radical is quenched by hydrogen abstraction from the original 5′-dA molecule to regenerate the 5′-dA˙, which rationalizes the observed catalytic nature of SAM in QueE catalysis.239 Deprotonation of either the C2 pyrimidine exocyclic amine or the pyrrole nitrogen by a basic amino acid residue238 promotes elimination of ammonia from the C7-gem-aminocarboxylate pyrrole intermediate. Finally, a basic residue abstracts the pro-R-proton from the activated C8 position to rearomatize the final pyrrolopyrimide structure.238,239
Although the physiological role of queuine remains enigmatic, preliminary findings suggest it may regulate cell proliferation.220,241 Queuosine has been reported to influence mitotic signaling pathways that rely on protein phosphorylation patterns.242 In cancer cells, protein tyrosine phosphorylation levels are abnormally high and tRNA molecules are hypomodified with queuosine.243 Exogenous queuine administration reverses both of these phenotypes in cancer cells,242,243 suggesting that queuine plays a role in regulation of tyrosine kinases that are critical in cell proliferation processes. However, these studies have not been able to differentiate the potential effects of the free queuine base from the queuosine-containing tRNA. The connection between queuine, cell proliferation, and cancer malignancy warrants further investigation.
The acrylate and propionate derivatives of tryptophan are produced via a reductive fermentation pathway consisting of a promiscuous set of enzymes encoded by the aromatic amino acid metabolism gene operon, fldAIBC.249 In this pathway (Fig. 6), the aromatic amino acid first undergoes transamination to form the 2-oxo-acid, which is reduced via an NADH-dependent enzyme to the corresponding 2-hydroxy-acid. A CoA-ligase (FldA) appends CoA via a thioester linkage to activate the 2-hydroxy-acid, which is a prerequisite for the next enzymatic transformation.250 The 2-hydroxyacyl-CoA molecule is then dehydrated (FldIBC) to the acrylate-CoA and reduced in the final step to the fully saturated derivative.249 While most of these steps can be achieved through acid–base chemistry, the dehydration of a 2-hydroxy-acid is chemically challenging due to the high pKa of the protons at the β-carbon.251,252 Instead of acid–base chemistry, 2-hydroxyacyl-CoA dehydratases utilize reductive radical chemistry to perform this reaction, with the CoA thioester playing a key role in stabilizing reaction intermediates. The proposed mechanism of the 2-hydroxyacyl-CoA dehydratases (Fig. 6)251,252 initiates with a one-electron reduction of the substrate to generate a ketyl radical anion. Next, the carbanion eliminates the hydroxyl group to form an enoxy radical intermediate. The pKa (∼14) of the β-proton of this intermediate is markedly lower than that of the 2-hydroxy-acid (by >25 pK units),253 facilitating its deprotonation to yield another ketyl radical anion that undergoes one-electron oxidation to form the final enoyl-CoA product.
Fig. 6 Production of immune-modulatory indole derivatives by gut-associated Clostridia. (a) Reductive amino acid fermentation pathway involving dehydration of a 2-hydroxy-acid. (b) Homodimer interface of the dehydratase activator (2-hydroxyisocaproyl-CoA dehydratase activator, PDB accession code: 4EHT) highlighting the helix–cluster–helix motif in the ADP-bound 105° angle conformation. (c) Heterodimer interface of the dehydratase component (2-hydroxyisocaproyl-CoA dehydratase, PDB accession code: 3O3N) depicting the [4Fe–4S] clusters in each subunit and the direct substrate coordination to the α-cluster. (d) Umpolung charge reversal mechanism of 2-hydroxyacyl-CoA dehydration. (e) Proposed ATP-dependent electron transfer mechanism of dehydratase activation. |
The critical single electron transfer that initiates dehydratase catalysis is promoted by an essential partner activating enzyme, a [4Fe–4S] cluster-dependent electron transfer protein with ATPase activity.249,254 Structural characterization of homologs that activate other 2-hydroxyacyl-CoA dehydratases has provided insight into the conserved mechanism of activation (Fig. 6e). The activating partner exists as a homodimer with a single ATP/ADP bound per monomeric subunit and a [4Fe–4S] cluster at the interface of the dimer (Fig. 6b).255,256 The cluster is highly solvent exposed and thus extremely sensitive to oxidation;249 it is ligated by two cysteine residues contributed by a helix of each monomer to form an interesting helix–cluster–helix structural motif (Fig. 6b).255,256 In the ADP-bound form, the helix–cluster–helix has an angle of 105° (Fig. 6b),255,256 which inspired the epithet for these “Archerase” enzymes.252 Reduction of the cluster and substitution of the ADP molecules for ATP is proposed to induce a conformational change to form a 180° angle at the helix–cluster–helix dimer juncture that promotes complex formation with the dehydratase (Fig. 6e).256,257 ATP hydrolysis then drives the electron transfer from the [4Fe–4S]1+ cluster of the activating partner to an oxidized [4Fe–4S]2+ cluster in the dehydratase (Fig. 6e).254,256,257 This dehydratase cluster then transfers the electron via a second [4Fe–4S] cluster to ultimately reduce the substrate (Fig. 6e).
This mechanism of enzyme activation has been largely devised based on analogy to the activation of the nitrogen-fixation metalloenzyme, nitrogenase.254,258 Although the activating partners of 2-hydroxyacyl-CoA dehydratases and nitrogenases both have the helix–cluster–helix structural motif, their structures do not otherwise share apparent homology.259 The nitrogenase activator is most similar to the family of G-proteins, while the dehydratase activator resembles the ASKHA (acetate and sugar kinases, heat shock protein 70 and actin) proteins,259 implying that they independently evolved similar ATP-dependent strategies to promote one-electron reduction. As expected from their analogous roles in electron transfer, the [4Fe–4S] clusters of the dehydratase and the nitrogenase activator proteins share distinctive electronic properties. Spectroscopic characterization of the [4Fe–4S]1+ cluster in the phenyllactyl-CoA dehydratase activator (FldI) revealed that it has an unusual S = 3/2 ground state.249 Studies of the homologous 2-hydroxyglutaryl-CoA dehydratase activator demonstrated that, when treated with the strong reductant titanium(III) citrate, the [4Fe–4S]1+ cluster can be further reduced to the superreduced [4Fe–4S]0 oxidation state.260 To our knowledge, the nitrogenase and dehydratase activators are the only examples of superreduced clusters in biology. However, at least in the case of the dehydratase activator protein, only the more traditional [4Fe–4S]2+/1+ redox couple is likely to be biologically relevant for electron transfer, as the superreduced cluster cannot activate the 2-hydroxyglutaryl-CoA dehydratase in vitro.260
After one-electron reduction by the activating protein, the dehydratase is catalytically active and capable of thousands of turnovers through storage of the reducing equivalent in its own iron–sulfur clusters.249,261 The crystal structure of the 2-hydroxyisocaproyl-CoA dehydratase provided insight into the electron transfer cycle during catalysis. The dehydratase is a heterodimer composed of two structurally similar subunits that share low sequence homology.262 Each subunit harbors a [4Fe–4S] cluster positioned at the interface of the αβ dimer (Fig. 6c).262 The β-subunit cluster is ligated by three protein cysteine residues and another thiolate ligand and is believed to be the initial acceptor of the electron provided by the activating protein.262 The α-subunit cluster has three cysteine ligands and an open coordination site that is occupied by a water molecule in the absence of substrate.262 Upon substrate binding, this water ligand is displaced by the carbonyl oxygen of the substrate thioester (Fig. 6c),262 indicating that the α-subunit harbors the catalytic active site. The direct, monodentate coordination of the substrate to the iron–sulfur cluster enables facile inner-sphere electron transfer and stabilization of the ketyl radical anion intermediate. Interestingly, neither dehydratase cluster can be reduced by any tested chemical reductants,263 suggesting that they have very low reduction potentials. Either the reduction potential of the activator cluster is lowered upon complex formation due to the large postulated 105° to 180° conformational change and likely desolvation of the cluster256 or the reduction potentials of the dehydratase clusters are altered upon complexation and/or as a result of desolvation observed upon substrate binding.262 Once the dehydratase is reduced, the electron can continuously cycle between the two dehydratase clusters and the substrate. Overall, this elaborate and elegant mechanism of ATP-dependent electron transfer to a CoA-activated substrate is used to achieve a charge reversal (Umpolung effect) for water elimination from an unactivated 2-hydroxy-acid substrate.
Reductive aromatic amino acid metabolism has only been demonstrated for a handful of gut bacteria, all belonging to the phylum Firmicutes, but it has important implications for human health and disease. The fldAIBC gene operon responsible for this metabolism was shown to be less prevalent and less abundant in metagenomes of patients with inflammatory bowel diseases (both Crohn's and ulcerative colitis) compared to healthy populations.244,248 As expected, the abundance of the gene operon correlated with observed levels of tryptophan metabolites in biological samples of these patients,248 suggesting that the lack of this operon could be a good genetic marker for inflammatory disease. Interestingly, reduction of tryptophan metabolism is correlated with reduced mucin degradation, specifically of L-fucosylated glycans.248 In the healthy gut, the host presents fucosylated mucins, which promote colonization of fucose-degrading microbes that are known to produce tryptophan-derived metabolites with immune suppression activity. The host immune response to these microbial metabolites then results in increased mucin fucosylation. This co-metabolism thus perpetuates a positive feedback cycle between the gut microbiota and the host to suppress inflammation. Conversely in IBD, the observed reduction of these beneficial microbes could reflect disruption of this cycle as either a cause or a consequence of host inflammation and could present a target for microbiota-based therapeutics.
The enzyme responsible for anaerobic microbial choline metabolism, choline TMA-lyase (CutC), was recently identified through a genome mining approach and is widely distributed in gut microbial genomes and human gut metagenomes.275–277 CutC is another member of the GRE family, but prior to its characterization, C–N bond cleavage was not a known transformation for these enzymes. As with all GREs, a partner radical SAM activating enzyme generates the catalytic glycyl radical on CutC.275 The mechanism of C–N bond cleavage is proposed to initiate with hydrogen atom abstraction by the thiyl radical from the C1 position of choline to generate an α-hydroxyalkyl radical intermediate. Deprotonation of the hydroxyl group and formation of a transient ketyl radical intermediate is then proposed to promote heterolytic cleavage of the C–N bond to directly eliminate TMA. This proposal is supported by the substrate-bound crystal structure of CutC.278 The substrate is positioned in the active site pocket with a gauche conformation that is imposed by CH–O hydrogen bonding interactions between the partial positively charged, polarized N-methyl groups and the oxygen atoms of active site residues.278 This conformation allows for hyperconjugation between the p-orbital of the carbon-centered radical and the σ* anti-bonding orbital of the C–N bond and is expected to facilitate elimination of TMA.278 Following this step, the resultant acetaldehyde radical can abstract the hydrogen atom from the active site cysteine to regenerate the thiyl radical.
Instead of uptake and oxidation of TMA by the host, other gut microbes may further metabolize this molecule. Microbes from different environments have been shown to demethylate methylamines, such as TMA, by the action of corrinoid-dependent methyltransferases. In the first step, the donor amine forms a covalent bond with a post-translationally modified pyrrolysine residue in the active site (Fig. 7b).279 Once activated, the supernucleophilic Co1+ state of the vitamin B12 cofactor attacks the donor methyl group in an SN2-like reaction (Fig. 7b), cleaving the C–N bond of TMA.279 The resultant CH3–Co3+ cofactor can then be utilized as a methyl donor in one-carbon metabolism. Certain archaea possess N-methyltransferases that are able to initiate methanogenesis through the specific demethylation of TMA, dimethylamine, and monomethylamine.280 Interestingly, this activity has been demonstrated in human-associated methanogenic archaea and has been proposed as a route to lowering TMA levels and consequently TMAO production by the host.281,282 This “archaebiotic” proposal is an intriguing example of targeting a microbially-produced metabolite to treat its associated human diseases, such as trimethylaminuria and cardiovascular disease.
Fig. 8 Generation of p-cresol by a gut microbial glycyl radical enzyme. (a) Oxidative metabolic pathway in Clostridia for L-tyrosine conversion to p-cresol via a 4-hydroxyphenylacetate intermediate. (b) Hetero-octameric structure of the glycyl radical enzyme 4-hydroxyphenylacetate decarboxylase (PDB accession code: 2YAJ) with the β-subunit shown in yellow and the γ-subunit shown in dark green. Insets show the two iron–sulfur clusters harbored in the γ-subunit. |
The microbial enzyme that catalyzes production of p-cresol, p-hydroxyphenylacetate decarboxylase (HpdB), is a member of the GRE family.290 While it shares the common mechanistic features of protein-based radical-mediated chemistry that define this family, HpdB belongs to a separate subclass of the GRE family together with the enzyme benzylsuccinate synthase.291 These GREs are unique in their requirement for an accessory scaffold protein (γ). This small γ-subunit is composed of two homologous domains that each harbor a [4Fe–4S] cluster (Fig. 8b).291,292 The C-terminal domain cluster has complete cysteine ligation, whereas the N-terminal domain contains a HiPIP (high-potential iron–sulfur proteins)-like cluster coordinated by one histidine and three cysteine ligands (Fig. 8b).292 Redox titrations revealed only a single reduction potential of −287 mV, suggesting that only one cluster, likely that with full cysteine ligation, is able to redox cycle, whereas the HiPIP cluster might play a structural role.293 Although the precise roles of these iron–sulfur clusters in GRE activity remain unclear, the γ-subunit is essential for decarboxylase activity.294 The active form of the HpdB decarboxylase protein (β) is a β4γ4 hetero-octamer that associates only upon phosphorylation of a serine residue in the β-subunit.291 Conversely, when the serine is dephosphorylated, the γ-subunits dissociate and the β-subunits form inactive homodimers.294 The active hetero-octamer only has a single glycyl radical equivalent at any given time,294 and it is unusually short-lived in the absence of substrate relative to the lifetimes of glycyl radicals in other GREs.291 It has been postulated that dissipation of the glycyl radical is facilitated by one-electron reduction via the iron–sulfur cluster(s) in the γ-subunit.291 Glycyl radical quenching could be a regulatory mechanism to limit enzymatic production of the toxic molecule, p-cresol.
The radical SAM activating enzyme of the HpdB protein possesses, in addition to the canonical N-terminal radical SAM [4Fe–4S] cluster, extra [4Fe–4S] cluster(s) that are located in a C-terminal ferredoxin-like domain.291 With the exceptions of the archetypal GRE-AEs of pyruvate formate-lyase and anaerobic Class III ribonucleotide reductase, many other GRE-AEs possess a homologous ferredoxin-like C-terminal domain with conserved cysteine sequence motifs, suggesting that they also harbor one or more additional iron–sulfur clusters. However, the purpose of these clusters is not well established. In the case of the HpdB activating enzyme, elimination of the domain harboring the auxiliary [4Fe–4S] cluster did not abolish 5′-dA˙ production and the enzyme was still able to activate the decarboxylase.295 Yet this truncated variant reduced the stability of the glycyl radical in the decarboxylase enzyme, decreasing its half-life 10-fold.295 The observed radical lability has led to speculation that this domain could protect the glycyl radical from solvent during dissociation of the decarboxylase from the activating enzyme.295 Yet structural and spectroscopic characterization of GREs in complex with their activating enzymes is lacking to support this hypothesis.
Once activated, the mechanism of HpdB diverges from that of traditional glycyl radical enzymes. In the canonical mechanism, the first step in substrate activation is hydrogen atom abstraction by the active site cysteine-based thiyl radical. This type of mechanism was originally postulated for 4-hydroxyphenylacetate decarboxylation. In this proposal, the phenolic O–H bond would be homolytically cleaved, generating an intermediate that acts as an electron sink for subsequent decarboxylation.290 However, the structure of the decarboxylase solved in the presence of substrate did not support that hypothesis, as it showed the carboxylate end of 4-hydroxyphenylacetate positioned near the cysteine.292 Thus, a mechanism was proposed in which the thiyl radical initiates a concerted proton-coupled electron transfer, abstracting an electron from the substrate carboxylate and a proton from a nearby glutamate residue. The resultant substrate radical undergoes a Kolbe-type decarboxylation, facilitated by parallel orbital alignment of the carboxylate-methylene C–C bond with the benzene ring, to generate CO2 and a 4-benzoquinone methide radical anion upon deprotonation of the phenolic hydroxyl. Hydrogen atom abstraction from the catalytic cysteine would yield the p-cresol product and regenerate the thiyl radical. Quantum mechanical calculations support this proposed mechanism,296 postulating a transition state that is already induced by the protein in substrate binding to promote the Kolbe-type decarboxylation.292
p-Cresol is a metabolite biomarker for autism spectrum disorder.297 High urinary levels of p-cresol, its sulfated-derivative, and a similar molecule, 4-ethylphenylsulfate, have been detected in young children with autism and are correlated with disorder severity.298 A higher abundance of C. difficile, in addition to other community shifts in the gut microflora, has been noted in children with autism,298 which could explain the increased levels of p-cresol. However, this metabolic activity has also been attributed to Clostridium sporogenes as well as other gut microbes299,300 and has not been extensively profiled across other commensal microbes. Despite the observed accumulation of p-cresol in this patient population, its causative effect in autism, if any, has not yet been elucidated. Conversely, administration of 4-ethylphenylsulfate to conventional mice has been shown to induce anxiety phenotypes associated with the disorder.301 Interestingly, a single health-promoting bacterium, Bacteroides fragilis, reduced levels of 4-ethylphenylsulfate and other elevated metabolites in a mouse model of autism spectrum disorder.301 Both p-cresol-sulfate and 4-ethylphenylsulfate are thought to derive from tyrosine and are speculated to have similar bioactivities based on their chemical structures; however, it is worth noting that their enzymatic production likely differs. The Kolbe-type decarboxylation that results in p-cresol production relies on the orbital overlap between the C–C bond to be cleaved and the π-system of the benzene ring, as well as resonance stabilization of the 4-benzoquinone methide radical anion intermediate. These molecular features are not conserved when the carbon chain is extended by a methylene group in the putative 4-ethylphenylsulfate precursor, and thus, an alternative enzymatic mechanism is likely used to achieve production of this metabolite. The enzyme(s) responsible for 4-ethylphenylsulfate production have not yet been identified.
The decomposition of urea by urease occurs in two steps. The first is enzymatic hydrolysis to produce ammonia and carbamate, followed by non-enzymatic hydrolysis of carbamate in water to yield the second molecule of ammonia and carbonic acid (Fig. 9a).312 The enzymatic reaction carried out by urease is facilitated by a rare dinickel cofactor, which acts as a Lewis acid to activate the substrate and water. The dinickel cofactor enhances the electrophilicity of the urea carbonyl and lowers the pKa of water, generating the more nucleophilic hydroxide ion.305 Urease was the first enzyme to be crystallized, an experiment which confirmed that enzymes are proteins and garnered a Nobel Prize.305 Numerous crystal structures have been solved of this enzyme from bacterial and plant sources in its resting state and in the presence of small molecule inhibitors that illustrate key features of the proposed mechanism for urea hydrolysis.305 The protein exists as a trimer of trimeric subunits (αβγ)3 with the dinickel cofactor in the α subunit, giving three active sites per trimeric complex.313,314 The two Ni2+ ions in each active site are located within ∼3 Å from one another in a weakly antiferromagnetically-coupled cluster.305,313,314 The coupling is achieved through two bridging ligands – a hydroxide and a post-translationally modified carbamylated lysine residue (Fig. 9b).314 The former acts as the nucleophile in urea hydrolysis and the latter is essential for nickel ion binding.315 The dinickel cofactor is asymmetrically coordinated with two histidine ligands at the Ni2+ ion in site 1 and two histidine residues and a monodentate aspartate residue ligating the Ni2+ ion in site 2 (Fig. 9b).313,314 In the resting state, the first coordination sphere of both sites includes a water ligand, resulting in a six-coordinate metal at site 2 and a five-coordinate metal at site 1 (Fig. 9b).314
Fig. 9 Microbial urea metabolism catalyzed by the dinickel enzyme urease. (a) Two-step breakdown of urea. (b) Proposed catalytic cycle of urease mediated by its dinickel (green spheres) cofactor. Active site depictions of inhibitor-bound structures that support the mechanism are shown as insets (PDB accession codes: H2O = 2UBP, acetohydroxamic acid, AHA = 4UBP, BO33− = 1S3T, diamidophosphate, DAP = 3UBP). |
In the presence of substrate, both water ligands are displaced and urea is proposed to bind directly via its carboxyl oxygen to the open coordination site of the nickel ion in site 1. This coordination is supported by the structure solved in the presence of the urease inhibitor, acetohydroxamic acid (AHA),316 which effectively mimics the bridging hydroxyl group and the urea structure (Fig. 9b). A conformational change is then proposed to induce coordination of the urea amine to the Ni2+ at site 2, poising the molecule as a μ-1,3-bridging ligand for attack by the bridging hydroxide. This bridging mode is analogous to that observed in the borate-inhibited structure (Fig. 9b).317 The structure with the inhibitor diamidophosphate (DAP) (Fig. 9b)314 has led to the proposal of a tetrahedral intermediate (or transition state) upon nucleophilic attack of the bridging hydroxide that is stabilized through an extensive hydrogen-bonding network. Finally, protonation of the coordinated amine by a nearby histidine residue is postulated to facilitate its elimination.305
Urease is one of just a handful of enzymes that use a nickel-containing metallocofactor.305,318,319 Most metal-dependent hydrolases utilize the non-redox active Zn2+ ion in mono-, di- and tri-nuclear clusters for catalysis. However, urease is highly specific for its dinuclear Ni2+ cofactor, as it is inactive with Zn2+ and other divalent metals.320,321 Both Ni2+ and Zn2+ have large positive charge density that can facilitate polarization of the hydroxide nucleophile and the substrate. However, the protein active site could favor intrinsic stereoelectronic properties of Ni2+ over Zn2+ ions.305,322 The d8 electronic configuration of Ni2+ favors an octahedral coordination sphere, whereas Zn2+ ion complexes in biology typically have tetrahedral geometry. Thus, the active site geometry could rearrange with Zn2+ ions to potentially preclude bridging of the water nucleophile and of the substrate. Conversely, the preference for an octahedral coordination sphere results in open Ni2+ coordination sites occupied by labile water molecules that can be displaced for proper positioning of the activated substrate with respect to the bridging hydroxide ligand to facilitate ureolysis.
The importance of the dinickel cofactor is also underscored by the suite of accessory proteins dedicated to its incorporation into the urease apo-protein.323 Four accessory proteins, UreDEFG, are often encoded in the same operon with the urease gene and are minimally essential for urease activity in most bacteria.323,324 UreD is postulated to act as a protein scaffold for a multicomponent complex of the apo-urease enzyme with the other accessory proteins.325–327 UreF binding to the apo-urease–UreD complex induces a conformational change that exposes the urease active site, providing access for the nickel ions and CO2 for lysine carbamylation.325,328 Lastly, UreG catalyzes GTP hydrolysis that is proposed to drive Ni2+ transfer from the metallochaperone UreE to the urease active site.329 Nickel binding sites have been identified in the UreE metallochaperone,330–332 as well as UreG and UreF proteins,333,334 suggesting a metal shuttling mechanism from UreE via UreG and UreF to the final destination in the urease active site.327 The UreDEFG proteins are sufficient to activate urease that is heterologously expressed in E. coli.335 However, in some native bacteria, additional proteins can play essential roles in urease nickel-activation, including nickel ion permeases, transporters, and binding proteins.324,336 Furthermore, there is an interesting, yet not fully understood, connection between the proteins responsible for biogenesis of the [NiFe]-hydrogenase cofactor (HydA and HydB) and the UreG and UreE proteins that is essential for urease activation in H. pylori.336–338
In addition to the urease enzyme itself, the importance of these accessory proteins in urease activity renders them each unique targets for inhibition of this metabolism. Inhibitors of urease activity have been studied for more than 50 years,339,340 but primarily have been applied to combat H. pylori gastric and Proteus urinary infections.341 More recently, efforts to minimize gut microbial urease activity via microbiota transplant have been directed toward the treatment of hyperammonemia.342 Introduction of a model gut community with minimal urease activity into mice promoted development of a more complex microflora with reduced urease activity342,343 and was shown to improve phenotypes associated with hyperammonemia.342 In a similar fashion, manipulation of the gut microbiota, or specific targeting of urease and its cofactor biosynthetic machinery, could be applied to the treatment of Crohn's disease, which is exacerbated by high urease activity.306
The genus name of this organism means “bile-loving”, hinting at the underlying cause of its expansion. Bile acids are produced exclusively by vertebrates to emulsify dietary lipids and fat-soluble vitamins, which facilitates their absorption in the intestines.348 They are also signaling molecules, acting as ligands to nuclear and G-protein coupled receptors that regulate lipid, glucose, and energy metabolism.348,349 These biomolecules are synthesized from cholesterol in the liver and conjugated with glycine and taurine to form salts that can then enter the digestive system.350 While some microbes are sensitive to the moderate antibacterial activity of bile acids,351 other gut microbes and pathogens transform these molecules into secondary bile acids through deconjugation, oxidation, and dehydroxylation reactions.352 Some of these secondary bile acids can be reabsorbed through enterohepatic circulation to accumulate in the liver, whereas others are excreted.352 High levels of these microbially-derived molecules in bile, blood, and feces have been correlated with gallstone disease and colon cancer.352 Although the purposes of bile acid oxidation and dehydroxylation are not entirely understood, the deconjugation of bile salts liberates free glycine and taurine, which can be sources of carbon, nitrogen, and sulfur, as well as substrates for energy production.
In the case of B. wadsworthia, the released taurine is metabolized to generate sulfite (Fig. 10a), which can be used as an electron acceptor for anaerobic respiration.353 Indeed, only taurine-conjugated bile salts induced a bloom of B. wadsworthia in IL10−/− deficient mice.345 This expansion can be recapitulated by a milk, saturated fat-based diet, as it stimulates increased production of taurine-conjugated bile salts.345 Notably, this diet and the associated expansion of B. wadsworthia have been connected with development of colitis in IL10−/− deficient mice.345 Although genetic susceptibility is essential in this model, the colonization of this pathobiont and its metabolic functions are causative in development of the observed disease phenotypes.
Fig. 10 Microbial taurine metabolism linked to human disease. (a) Deconjugation of taurine-bile salts by microbes releases free taurine, followed by reduction of the downstream metabolite sulfite to hydrogen sulfide as a means of anaerobic respiration. (b) Quaternary structure of the dissimilatory sulfite reductase (PDB accession code: 2V4J), composed of a dimer of trimeric subunits: DsrA (light green), DsrB (blue), and DsrC (red). Iron–sulfur clusters are depicted as spheres. (c) DsrB active site located at the interface with the DsrA subunit, depicting the siroheme cofactor bridged to one iron–sulfur cluster and a sulfite ligand. The C-terminal tail of the DsrC subunit is shown in red. (d) Proposed mechanism for the first two two-electron reduction steps mediated by the DsrABC complex that yields the protein-based trisulfide substrate for anaerobic respiration. |
The final product of taurine metabolism and sulfite respiration, hydrogen sulfide (Fig. 10a), has been detected at higher levels in patients with ulcerative colitis.354,355 While this observation might suggest a role of this gaseous molecule in inducing the disease, contradictory properties of hydrogen sulfide have complicated efforts to establish its causality in ulcerative colitis.356 Hydrogen sulfide is an inhibitor of host β-oxidation of microbially-produced short-chain fatty acids in colonocytes,357 preventing energy acquisition and compromising the epithelial barrier. It has also been shown to stimulate T-cell activation,358 which could induce antigen production against commensal microbes. Conversely, various reports have shown hydrogen sulfide to be anti-inflammatory and ameliorate colitis in animal models.359–361 Thus, the role of hydrogen sulfide in the development or mediation of inflammatory bowel disease remains to be definitively established.
Hydrogen sulfide is generated from the six-electron reduction of sulfite. In B. wadsworthia, this reaction is catalyzed by the metalloenzyme dissimilatory sulfite reductase (dSiR).362 Its structure consists of three subunits with α2β2γ2 stoichiometry (Fig. 10b), encoded by the genes dsrA, dsrB and dsrC, respectively. The DsrA and DsrB proteins are evolutionarily related as paralogs that likely resulted from gene duplication and divergent evolution to only maintain ∼20% sequence identity in contemporary bacteria. Thus, these two subunits form a structurally symmetric heterodimer, but only the DsrB subunit harbors a functional active site capable of sulfite reduction.363 The active site in the DsrB subunit, situated at the interface of the αβ dimer, contains a siroheme cofactor that is bridged via the axial cysteine ligand to a [4Fe–4S] cluster (Fig. 10c). In contrast, the corresponding site in the DsrA subunit contains a demetallated siroheme (termed sirohydrochlorin), lacks critical conserved residues, and has an occluded substrate channel. The role of the third subunit DsrC (γ) was contentious for some time; however, recent biochemical characterization of this protein, described below, revealed its essential role in sulfite reduction.364
The overall six-electron reduction of sulfite to hydrogen sulfide was originally proposed to occur by three consecutive two-electron reduction steps.365–367 However, at present the data suggest that the three steps occur by different mechanisms and with different electron sources. In the current proposal, the siroheme cofactor in DsrB activates the sulfite substrate and mediates the first two-electron reduction (Fig. 10d). This type of cofactor is found in all types of sulfite reductases (assimilatory and dissimilatory), as well as some nitrite reductases, underscoring its ability to mediate multi-electron reduction of inorganic anions.367 The two inorganic components, the iron ion of the siroheme and the [4Fe–4S] cluster, are electronically coupled through the bridging cysteine ligand,368 which enables the cofactor to effectively store two electron equivalents. The two reducing equivalents are provided by an unknown exogenous source and are thought to be transferred via an additional [4Fe–4S] cluster that is located close to the protein surface in a ferredoxin-like domain of the DsrB subunit.363 In the reduced state, the coupled iron–sulfur cluster is in the 1+ oxidation state and the siroheme has a high-spin Fe2+ ion with an open axial coordination site. The structure of the dSiR complex solved in the presence of substrate demonstrated direct sulfite ligation via the sulfur atom to the open coordination site of the DsrB siroheme with tetrahedral geometry (Fig. 10c).363 The substrate oxygen atoms are stabilized in the active site by positively-charged lysine and arginine residues. Coordination of the π-acceptor sulfite substrate to the electron-rich siroheme-Fe2+ weakens the S–O bond through π-backbonding, facilitating its cleavage upon reduction.365,366 This bond breaking step is likely also facilitated by protonation of the departing oxygen atom.
The next step in sulfite reduction is primarily mediated by the DsrC subunit. DsrC possesses two conserved cysteine residues at the C-terminal tail that are critical for conversion of sulfite to sulfide with fidelity.364 This tail is disordered in solution, but the structure of DsrAB in complex with DsrC showed the C-terminal tail positioned in a crevice between the αβ dimer in close proximity to the siroheme cofactor (Fig. 10c).363In vitro characterization revealed that this cysteine thiol forms a trisulfide with the sulfur atom derived from sulfite and the other conserved cysteine residue.369 This intermediate was rationalized to arise from nucleophilic attack of the deprotonated cysteine thiol on the two-electron reduced sulfite intermediate ligated to the siroheme (Fig. 10d). This step would result in dehydration and reduction of the intermediate to generate sulfur in the 1+ oxidation state, which would weaken the S–Fe3+ bond and induce dissociation. Attack by the other cysteine thiol would eliminate the final hydroxyl group to form the trisulfide intermediate (Fig. 10d).
The elucidation of this intermediate illuminated the key to sulfite respiration. The neutral oxidation state of the sulfite-derived sulfur atom in the trisulfide intermediate implies that the dissimilatory sulfite reductase system (DsrABC) only reduces sulfite by four electrons. The final two electrons needed to reduce the trisulfide are predicted to derive from the membrane-bound DsrMKJOP complex369 that had been previously implicated in sulfite reduction.364 The DsrK subunit is homologous to the heterodisulfide reductase HdrD in methanogens and, like HdrD, possesses a [4Fe–4S] cluster that could reduce the trisulfide intermediate.370,371 Involvement of this complex enables coupling of electron transfer and subsequent substrate reduction to the transfer of protons across the membrane, generating a proton motive force to drive energy production.369,371 This concept could not be rationalized previously invoking a six-electron reduction by the cytoplasmic DsrABC components alone because no membrane components were involved. In this model, DsrABC are responsible for the four-electron reduction of sulfite to the protein-based trisulfide, which then serves as the terminal electron acceptor for anaerobic respiration by the DsrMKJOP complex.
The complexity of this multi-component protein system is interesting in light of the observation that siroheme itself is competent for sulfite reduction to hydrogen sulfide in isolation with exogenous reductants.372,373 In this instance, Nature has evolved a mechanism to limit the involvement of the siroheme cofactor in order to generate energy from the final two-electron reduction step. Ultimately, the ability to anaerobically respire on taurine-derived sulfite gives B. wadsworthia a competitive advantage in the gut. However, this microbial adaptation can have inadvertent negative consequences for the host and provides a clear example of the complex interactions between external factors (i.e., diet), host biology, microbial metabolism, and human disease.
Fig. 11 Reductive transformations of the xenobiotics (a) sulfasalazine, (b) digoxin, and (c) daidzein by gut microbes. |
The nature of reductive chemistry necessitates electron transfer, a common function of metallocofactors. Digoxin is a cardiac drug that is metabolized by strains of the gut microbe Eggerthella lenta, rendering it inactive.381,382 The reduction of digoxin to (20R)-dihydrodigoxin (Fig. 11b) is catalyzed by two proteins, termed Cgr1 and Cgr2.381,382 Cgr1 shares sequence homology with membrane-associated cytochrome proteins and is predicted to harbor multiple cytochrome c-type hemes.382 This protein is postulated to participate in electron delivery to Cgr2, which catalyzes the reduction of the α,β-unsaturated lactone moiety in digoxin.382 Cgr2 shares homology with flavin-dependent reductases and requires FAD for activity, but also harbors a [4Fe–4S] cluster that can potentially mediate electron transfer from the Cgr1 partner to the flavin cofactor for digoxin reduction.383 The presence of a [4Fe–4S] cluster in Cgr2 was not predicted bioinformatically from the protein sequence, and thus Cgr2 likely uses a novel fold to accommodate this metallocofactor. Interestingly, the reduction activity of Cgr2 is specific for substrates of the cardenolide structural family, which originate from plant sources.383 The lack of enzymatic activity toward additional endogenous host and dietary substrates suggests that this activity may not have arisen from promiscuous enzyme activity. One possible explanation could be that Cgr2 evolved to protect the host from the toxicity of these plant-derived compounds.383 This discovery portends that there may be other novel enzymes to uncover from the gut environment that have evolved specifically in response to xenobiotic exposure.
Similar reductive strategies are predicted in the transformation of dietary molecules, such as phytochemicals that are associated with altered disease risk. The soy-derived isoflavonoid, daidzein, is sequentially reduced by gut microbes to the estrogenic molecule, equol (Fig. 11c).384 Equol has been linked to reduced risk for breast and prostate cancers, likely due to its ability to act as a ligand for the human estrogen receptor.384 However, there is substantial variation among individuals in the ability of their gut microbiota to metabolize daidzein to equol.385 Equol-producing organisms possess a gene cluster that encodes for three essential enzymes that convert daidzein to equol.386–389 The first enzyme, daidzein reductase, catalyzes the initial flavin-dependent reduction to generate dihydrodaidzein.387 In addition, this enzyme is predicted to harbor a [4Fe–4S] cluster based on conservation of a cysteine-rich sequence motif.387 Although this hypothesis remains to be experimentally confirmed, an electron transfer role for this metallocofactor is easily envisioned in isoflavanoid reduction.
These examples are some of the few enzymes that have been connected to xenobiotic metabolism. Discovery of the organisms and enzymes responsible for other transformations will enable the identification of gene markers for metabolism that could be applied to predict an individual's response to different diets or drug therapies. It could also lead to development of approaches to enhance production of beneficial bioactive compounds and mitigate negative effects that influence drug bioavailability and efficacy. The metabolic capabilities of the gut microbiota, together with the variability in drug-metabolizing enzymes in the human genome, can have dramatic impacts on inter-individual response to medications and on development of precision medicine.
Pfam | Pfam description | Mean representation in stool metagenomes (hits/1000 genes) |
---|---|---|
PF04055 | Radical SAM superfamily | 2.6 |
PF00149 | Calcineurin-like phosphoesterase | 1.5 |
PF01609 | Transposase DDE domain | 1.4 |
PF00293 | NUDIX domain | 0.76 |
PF01546 | Peptidase family M20/M25/M40 | 0.66 |
PF13408 | Recombinase Zn β-ribbon domain | 0.65 |
PF00383 | CMP/dCMP deaminase Zn-binding domain | 0.63 |
PF01807 | CHC2-type Zn finger domain | 0.61 |
PF01966 | HD domain | 0.61 |
PF00557 | Metallopeptidase family M24 | 0.60 |
Using the Pfam classification analysis, we found that the radical SAM enzyme superfamily is the most prevalent metalloenzyme family in the human gut microbiome. This observation is perhaps unsurprising, as this family is highly represented in anaerobic bacteria, which dominate the healthy human GI tract. Radical SAM enzymes are found in every subject sampled with a mean representation of 2.6 hits per 1000 genes and a range of 1.5–3.5 hits per 1000 genes in metagenome assemblies (Fig. 12c and d). This high representation of radical SAM enzymes is not universal, as similar analyses of additional HMP metagenomes revealed the skin and vaginal microbiotas have a very low prevalence of radical SAM enzymes (Fig. 12d). While this observation might be expected because of the oxygen present in these habitats, the oral cavity, which includes both aerobic and anaerobic sites, has a high representation of radical SAM enzymes. Interestingly, this trend was also noted when assessing the abundance of genes encoding GREs, which are typically co-localized with radical SAM activating enzymes, in these same data sets.125 The prevalence of GREs in both the oral and gut cavities led the authors to posit that the oral cavity could be a reservoir for bacteria that can colonize the colon and have unique impacts in that niche.125 As demonstrated throughout this review, radical SAM enzymes are already known to play a wide range of roles in gut microbial metabolism. However, our analysis reveals that the majority of the radical SAM genes present in the stool metagenomes remain uncharacterized, as less than 40% have an assigned EC number. While this likely reflects the overall lack of characterization of this extensive superfamily (>300000 members), the biological context of this particular analysis could offer leads for uncharacterized enzymes to prioritize for further study.
Inclusive of and beyond metalloenzyme families, the vast amount of genetic data from individual organisms and microbial communities creates a challenge for identifying and prioritizing enzymes for study. With respect to the gut microbiota, the enzymes and metabolic functions that have an impact on human health and disease are obvious candidates of interest. One approach to uncover potentially disease-associated activities is to compare metagenomic, metatranscriptomic, metaproteomic, and metabolomic data sets from healthy and patient populations. This type of analysis could identify differentially regulated genes, expressed proteins, and microbial metabolites that may contribute to disease risk or development as well as provide support for proposed associations between gut microbial metabolism and host health and disease. Other experimental approaches such as activity-based functional profiling can be impactful tools for the discovery of novel enzymes by leveraging their chemical properties. Although not widely applied to gut microbial communities as of yet, this technique was used to enhance a comparative metaproteomic study of a mouse model of IBD using a cysteine nucleophile-reactive probe to assess host and microbial peptidase profiles.391 The quantification of gene abundance may be a critical component for assessing relevance in the gut microbiota. An approach termed chemically-guided functional profiling, which uses the computational tool ShortBRED392 in conjunction with Sequence Similarity Network analysis,393 has been applied to assess the abundance of GREs in the HMP metagenomes.125 This analysis identified a highly abundant, previously uncharacterized enzyme that was found to catalyze the dehydration of 4-hydroxy-L-proline.125 Another avenue is to apply well-established computational tools for identifying biosynthetic gene clusters to discover novel secondary metabolites produced by gut microbes. For example, the bioinformatics tool ClusterFinder394 has been successfully deployed to analyze the HMP metagenomes for multiple types of biosynthetic gene clusters.149 This work identified a diverse set of over 3000 putative biosynthetic gene clusters that are highly represented in the gut and oral microbiotas and are largely uncharacterized. Further studies of these cryptic gene clusters could reveal novel natural products with potential therapeutic applications, as well as improve our fundamental understanding of host–microbe or microbe–microbe interactions.
This journal is © The Royal Society of Chemistry 2019 |