Open Access Article
Yeo Jin Lee†
a,
Hyeon-Jeong Hwang†
a,
Jungro Lee†b,
Hyeon Seung Parkc,
Juwan Sonb,
Mohammad Seyedsayamdost
de,
Munhyung Bae*b,
Seoung Rak Lee*a and
Yun Kwon
*c
aCollege of Pharmacy and Research Institute for Drug Development, Pusan National University, Busan 46241, Republic of Korea. E-mail: srlee17@pusan.ac.kr
bCollege of Pharmacy, Gachon University, Incheon 21936, Republic of Korea. E-mail: baemoon89@gachon.ac.kr
cBK21 FOUR Community-Based Intelligence Novel Drug Discovery Education Unit, College of Pharmacy and Research Institute of Pharmaceutical Sciences, Kyungpook National University, Daegu 41566, Republic of Korea. E-mail: yunkwon@knu.ac.kr
dDepartment of Chemistry, Princeton University, Princeton, NJ 08544, USA
eDepartment of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
First published on 4th June 2026
Covering: 1944–2025
Nature's biosynthetic repertoire extends far beyond conventional CHON(S) chemistry and encompasses a rare but diverse array of natural products that incorporate atypical elements such as arsenic, selenium, fluorine, iodine, boron, and vanadium. These metabolites reveal how living systems have evolved to harness atypical atoms through both enzyme-mediated and spontaneous chemical strategies. Biological C–F and Se–C bond formation, SAM-dependent arsenic methylation, and non-enzymatic boron complexation exemplify nature's ingenuity in overcoming extreme energetic or coordination constraints. Despite their scarcity, these compounds play critical ecological and physiological roles in detoxification and redox regulation (As, Se), defense, and signaling (F, I, B), and in some cases, sustain global biogeochemical cycles (Mo, V). Structurally, they exhibit exceptional chemical stability, redox versatility, and metal–ligand diversity. Functionally, these findings expand our understanding of enzyme evolution, chemical defense strategies, and symbiotic metabolism in both marine and terrestrial ecosystems. Recent genomic and biochemical advances have uncovered new families of atypical natural products and the specialized enzymes responsible for their formation. Taken together, these discoveries define the limits of biogenic chemistry and highlight promising avenues for sustainable biocatalysis and drug discovery, particularly in the fluorination, selenation, and boronation pathways that bridge biological and synthetic chemistry.
The presence of atypical elements in natural products challenges conventional biosynthetic paradigms. In most microorganisms and plants, the intracellular concentrations of these elements are exceedingly low.6–8 Furthermore, their distinct chemical properties, such as selenium's redox sensitivity, fluorine's high electronegativity, and boron's Lewis acidity, pose significant barriers to enzymatic processing. Nevertheless, nature has evolved enzymatic machinery capable of selectively incorporating these elements into complex organic scaffolds. The resulting compounds exhibit unique physicochemical characteristics, including altered lipophilicity, enhanced metabolic stability, and increased redox activity.3,9,10 These features not only provide compounds with unusual biological effects but also broaden the diversity of natural products.
Although natural products containing chlorine or bromine have been thoroughly investigated, particularly in marine organisms, the biosynthesis and biological roles of those containing rare elements, such as iodine or fluorine, are far less understood. Elements such as selenium, arsenic, and boron, are often linked to basic or inorganic metabolism. However, recent studies have revealed their roles in the biosynthesis of complex secondary metabolites, endowing these compounds with unique structures and potent pharmacological activities.2–5 These discoveries highlight that nature's biosynthetic capacity extends far beyond conventional CHNO-based chemistry. They also suggest that incorporating atypical elements into compounds may be a key evolutionary strategy for optimizing biological activities and ecological functions. From the perspective of natural product chemistry and drug discovery, secondary metabolites that incorporate atypical elements are a particularly fascinating class of compounds. These compounds expand the accessible chemical space beyond that typically achieved using conventional synthetic methods. Their unique structures have motivated synthetic chemists to replicate or adapt the underlying biosynthetic principles to generate halogenated, selenylated, or boron-containing scaffolds with promising pharmacological potential.11–14 Furthermore, recent advances in genome mining, metagenomics, and enzymology have begun to reveal the genetic and enzymatic foundations of these unusual biosynthetic pathways in microbes. These advances have enabled the identification of halogenases, selenoenzymes, and boron-utilizing enzymes that mediate rare element transformations under remarkably mild biological conditions.15–17
Despite these significant advances, an integrated understand-ing of the chemistry, biosynthesis, and biological functions of natural products containing atypical elements remains incomplete. Previous studies have focused on individual elemental classes, such as halogenated metabolites or selenium-containing compounds, without providing a unified perspective comparing their biosynthetic logic and structural diversity.3,11 In light of the rapid expansion of this research field, a comprehensive analysis of how nature utilizes atypical elements such as selenium, fluorine, iodine, arsenic, and boron to expand its chemical diversity is both timely and necessary. In this review, we provide a comprehensive overview of the natural products that incorporate atypical elements. We discuss the structural diversity, elucidate the biochemical mechanisms by which these elements are selectively introduced, and examine their biological properties. Furthermore, we highlight the recent mechanistic and genomic studies that have unveiled the enzymatic foundations of these distinctive transformations. Finally, we offer perspectives on future research directions, emphasizing the potential of atypical element biosynthesis as a promising avenue for discovering novel bioactive compounds.
Based on genomic analysis and structural homology studies, the core framework of tartrolon compounds is synthesized via a trans-acyltransferase (trans-AT) type I polyketide synthase (PKS) system.29 Unlike conventional modular PKS systems, trans-AT PKSs lack integrated AT domains within each module; instead, discrete, trans-acting AT enzymes supply the appropriate extender units to the individual modules in a non-collinear fashion. The biosynthetic process is initiated from a D-lactate-derived starter unit, followed by the iterative incorporation of acetate and malonate building blocks to elongate the polyketide chain. During this process, tailoring domains such as ketoreductase (KR), dehydratase (DH), and enoyl reductase (ER) operate in a module-specific manner to control the oxidation state and stereochemistry of the intermediates. Together, these programmed reductive steps establish an oxygenation pattern and a stereochemical architecture that ultimately enables boron coordination in the mature scaffold.
Upon completion of the chain elongation, the nascent polyketide is released by a thioesterase (TE) domain, during which two monomeric units are coupled via ester bonds to yield a macrodiolide ring. This TE-associated dimerization and macrodiolide formation is a defining step; it generates a symmetric (or pseudosymmetric) oxygen-rich macrocycle that contains multiple hydroxyl groups in a preorganized arrangement. All known members of the tartrolon family, including 1–3, share this symmetrical or pseudosymmetrical macrodiolide scaffold.
Following macrodiolide ring formation, boron incorporation is believed to occur via a non-enzymatic chemical process. Environmental sources of boron, such as boric acid, spontaneously react with the four hydroxyl groups within the macrodiolide ring, leading to the formation of a stable central boronate complex (often described as a Böeseken-type borate/boronate complex, in which borate is held in tetrahedral coordination by four alcohol or α-hydroxy oxygen donors). Notably, the natural conversion of tartrolon D to its boron-complexed analog 3 suggests that boron incorporation is not enzyme-mediated but may proceed irreversibly under appropriate environmental boron conditions.26 This non-enzymatic boronation is interpreted as a selective late-stage modification that proceeds without perturb-ing the established stereochemical architecture of the macrodiolide. In the case of 2, the structural diversity arises from an additional oxidation event at a specific position on the macrodiolide framework.29 This modification is presumed to be mediated by a post-PKS tailoring enzyme (i.e., a post-modification introduced after PKS-mediated scaffold assembly and release, such as site-specific oxidation), further expanding the chemical diversity within the tartrolon family.
Collectively, tartrolon biosynthesis is a multistep process that involves polyketide chain assembly, macrodiolide formation via TE domain–mediated dimerization (esterification), site-specific oxidation (when applicable), and late-stage boron complexation to yield boron-coordinated macrodiolide scaffolds. Importantly, this overall sequence—construction of a polyoxygenated polyketide scaffold, macrodiolide formation, and late-stage boronate complexation—reflects a recurring biosynthetic logic observed across several microbial boron-containing macrodiolides discussed in subsequent sections, with family-level differences primarily arising from the PKS type and domain/module organization, starter unit selection, and late tailoring reactions. Accordingly, compound 3 is among the most biosynthetically resolved examples of this class, and Scheme 1 is provided as a representative biosynthetic pathway illustrating the key steps of tartrolon E biosynthesis, including polyketide assembly, macrodiolide formation, and late-stage boron complexation.
![]() | ||
| Scheme 1 Proposed biosynthetic pathway of tartrolon E (3) as a representative example of boron-containing microbial macrodiolides. | ||
Tartrolon B (1) exhibits antibacterial activity against gram-positive bacteria, including Bacillus subtilis, with minimum inhibitory concentrations (MICs) ranging from 0.3 to 1.56 µg mL−1.24 Tartrolon C (2)exhibits insecticidal activity, with minimum emergent larvicide concentrations (MELCs) of 125 ppm against both Spodoptera exigua (beet armyworm) and Heliothis virescens (tobacco budworm).25 This represented the first reported insecticidal chemotype within the tartrolon family. Tartrolon E (3) exhibits nanomolar-level antiparasitic activity against Plasmodium falciparum, including both asexual and sexual blood stages.30,31 Although it also shows cytotoxicity toward mammalian cells such as HepG2 (hepatocellular carcinoma) and L1210, its selectivity index exceeds 400, indicating significantly lower toxicity to host cells than to parasites.32,33 Notably, compound 3 was also isolated from a symbiotic bacterium residing in shipworm gill tissues, suggesting a potential ecological role in protecting the host from apicomplexan parasites such as gregarines, highlighting its broader significance beyond pharmacological activity.29
The polyketide backbone of 4 was synthesized via a Type I modular PKS system, incorporating a total of 14 acetate/malonate extender units.38,39 Six methyl groups are introduced by methionine-derived post-elongation methylation rather than by methylmalonyl-CoA extender incorporation.27 Uniquely, the polyketide chain initiation is achieved through a three-carbon starter unit derived from glycerol, rather than the more typical propionate-based precursors.40 Feeding experiments have confirmed D-valine as the direct biosynthetic precursor, and the retention of the α-stereocenter indicated that the amino acid was incorporated without epimerization at the stereogenic center.27 Upon completion of chain elongation, the polyketide intermediate is released by a TE domain and undergoes macrocyclization to yield the macrolactone ring. Although this process follows a canonical modular PKS mechanism, post-tailoring addition of the D-valine moiety contributes to the structural uniqueness of the final molecule. Subsequent non-enzymatic reactions with boric acid have been proposed to generate characteristic boron–oxygen coordination sites (see Section 2.1).34,35,39
Following the structural elucidation of boromycin (4), additional derivatives, N-formylboromycin (5) and N-acetylboromycin (6), were identified, arising from formylation and acetylation of the terminal amino group, respectively.36 Desvalinylboromycinate (7) is a variant that lacks the D-valine moiety in its chemical structure.37 These derivatives are supposed to arise from late-stage biosynthetic modifications and retain the characteristic boronate complex and polyketide backbone of 4. Collectively, these structural variants highlight the diversity introduced by terminal tailoring modifications within the biosynthetic framework of 4.
Compound 4 exhibits antibacterial activity against gram-positive bacteria, including Mycobacterium tuberculosis, with MICs ranging from 0.08 to 0.15 µM.41 Its mode of action involves functioning as a potassium ionophore, thereby dissipating the membrane potential. In addition, compound 4 displays nanomolar-level antiparasitic activity against protozoan pathogens such as P. falciparum and Toxoplasma gondii, and antiviral effects against HIV-1.42–44 Although the biological activities of 5–7 have not yet been reported, they are likely to exhibit similar pharmacological properties due to their close structural resemblance to 4.
Compound 8 represents the prototypical member of its class and features a symmetrical macrodiolide backbone composed of 40 carbon atoms, in which a central boron atom forms a highly stable boronate complex through coordination with four hydroxyl groups.47 Although the overall framework resembles that of conventional polyether macrolides, compound 8 lacks the peptidic side chain commonly found in boromycin, distinguishing it structurally and biosynthetically.39 Compounds 9 and 10 are acetylated derivatives of 8, arising through post-modifications.
The assembly of the polyketide-derived macrodiolide scaffold and the general late-stage boron complexation logic are consistent with the recurring framework summarized in Section 2.1. Notably, aplasmomycin biosynthesis is distinguished by the use of a glycerol-derived three-carbon starter unit (also observed for boromycins).48 Experimental interconversion of deboraaplasmomycin to aplasmomycin in the presence of boric acid further supports the idea that boronation occurs after macrodiolide formation via a non-enzymatic or weakly enzyme-assisted process.49 Compounds 9 and 10 are believed to arise through late-stage acetylation of hydroxyl groups following boronate complex formation, which may be enzyme-mediated via specific acetyltransferases or occur under culture conditions.
Aplasmomycin A (8), a boron-containing ionophore, exhibits antibacterial activity against gram-positive bacteria such as Staphylococcus aureus and B. subtilis, with MICs ranging from 0.78 to 3.12 µg mL−1.45 It also shows in vivo antimalarial efficacy in Plasmodium berghei–infected mice, where oral administration significantly reduces parasitemia and improves survival. Aplasmomycin B (9), a monoacetylated derivative, shows lower antibacterial activity than 8; this change has been discussed in relation to the altered hydrogen-bonding capacity and steric effects around the boron coordination site,46 with MICs against S. aureus and B. subtilis ranging from 4.2 to 125 µg mL−1. In the membrane-based assays, 9 retains its ionophoric activity for potassium transport at levels comparable to those of the parent compound. By contrast, aplasmomycin C (10), a diacetylated analog, shows reduced antibacterial activity in an agar disk diffusion assay, with relative activities of <5–6.3 (relative to aplasmomycin at 500 µg mL−1), and low activity in potassium ion transport assays. This loss of bioactivity is likely due to the disruption of the boron coordination environment and reduced flexibility of the macrodiolide scaffold caused by dual acetylation. Further elucidation of the biosynthetic steps and their impact on structure–activity relationships is essential to advance the chemical and biological understanding of this unique family of boron-containing macrodiolides.
The biosynthetic pathway of 11 is inferred to follow the polyketide route. Feeding experiments indicate that its carbon skeleton is assembled from acetate-derived C2 units with methyl groups derived from methionine50 and that the C3 starter unit appears to originate from an acetate–methionine combination rather than propionate. After the chain release, dimerization is proposed to form the macrodiolide scaffold, followed by late-stage non-enzymatic boron complexation (see Section 2.1).
In terms of bioactivity, compound 11 exhibits cytotoxic activity against cancer cell lines.50 It inhibits the growth of LoVo colon carcinoma and KB oral carcinoma cells with MICs of 0.066 and 3.3 µg mL−1, respectively. Compound 11 also shows antiviral activity in cell-based assays, inhibiting plaque formation by Herpes Simplex Virus type 2 (HSV-2) by 98% at 0.1 mg mL−1.
It comprises two chemically distinct polyketide-derived units l-inked by ester bonds, forming a macrodiolide framework. Each monomeric unit features a complex array of functional groups, including methylene, olefinic methine, ketone, ketal, and epoxide moieties. In particular, the presence of an epoxide be-tween C-14 and C-15 and a six-membered ketal ring highlight the stereochemical complexity of the molecule. Although the biosynthetic pathway of 12 has not yet been fully elucidated, its assembly is presumed to follow the recurring framework summarized in Section 2.1, with a PKS-derived scaffold undergoing macrodiolide formation and late-stage boron complexation.
Compound 12 exhibits biological activities across multiple assays, including immunomodulatory, antimicrobial, cytotoxic, and antiparasitic effects.51 In terms of its mode of action, compound 12 promotes the maturation and secretion of interleukin-1β (IL-1β) in both human peripheral blood mononuclear cells (PBMCs) and murine bone marrow-derived macrophages (BMDMs). This response is mediated by the activation of the NLRP3 inflammasome complex, triggered by intracellular potassium efflux. Compound 12 acts as a potassium ionophore, disrupting K+ homeostasis, thereby activating the cytosolic NLRP3 sensor protein and facilitating inflammasome assembly, culminating in caspase-1 activation and pro-IL-1β processing. In addition to its immunological effects, compound 12 shows antimicrobial activity against selected Gram-positive bacteria and fungi/yeasts. It inhibited B. subtilis DSM-10, Micrococcus luteus DSM-1790, and S. aureus DSM-346 with MIC values of 0.2, 0.4, and 0.8 µg mL−1, respectively, and showed activity against Mucor hiemalis DSM-2656, Pichia anomala DSM-6766, Rhodotorula glutinis DSM-10134, and Schizosaccharomyces pombe DSM-70572 with MIC values ranging from 0.8 to 8 µg mL−1.51 It also exhibited antiparasitic activity at 1 µM, inhibiting the growth of Trypanosoma brucei rhodesiense, Trypanosoma cruzi, Leishmania donovani, and P. falciparum by 100, 63, 95, and 100%, respectively.51 Furthermore, compound 12 shows nanomolar-level cytotoxicity against several human cancer cell lines, with the strongest activity observed against A-431 (epidermoid carcinoma), A-549 (lung carcinoma), and KB-3.1 (cervix carcinoma) cells (IC50 = 13 nM), among the cell lines tested.51 Taken together, these reports support 12 as a bioactive boron-containing scaffold, although further studies are required to establish its therapeutic potential and scope. Overall, the reported activities have been discussed in relation to the ionophoric modulation of intracellular ionic homeostasis, providing a mechanistic basis for future investigations.
The biosynthetic pathway of 13 proceeds through a series of enzymatic reactions derived from the methionine salvage pathway, originating from S-adenosylmethionine (SAM).54 This pathway initiates with the conversion of SAM into S-adenosylhomocysteine (SAH), which is subsequently hydrolyzed by 5′-methylthioadenosine/SAH nucleosidase (Pfs) to yield S-rib- osylhomocysteine (SRH). The key enzyme LuxS cleaves the C1–C2 bond of SRH, generating the sulfur-containing byproducts homocysteine and 4,5-dihydroxy-2,3-pentanedione (DPD). DPD serves as the immediate precursor of 13 and exists in aqueous solution as an equilibrium mixture of interconverting isomers (Scheme 2).
Some marine bacteria, such as V. harveyi, produce DPD, which reacts with boric acid when boron is readily available in the environment.53 This reaction forms a structurally distinct furanosyl borate diester that functions as a signaling molecule in QS pathways. The boron-coordinated form of 13 is recognized by the LuxP receptor and is biologically active. The conversion of DPD to boron-bound 13 occurs through a non-enzymatic chemical reaction and proceeds efficiently under marine or boron-rich environmental conditions. The crystal structure of the LuxP-13 complex, in which boron is coordinated within the signal molecule, provides direct structural evidence for its role in AI-2 recognition. This structure demonstrates that borate complexation is essential for high-affinity binding to the LuxP receptor, thereby conferring receptor specificity and enabling precise inter-species QS.
Compound 13 functions as a key regulatory signal in bacterial QS networks. In V. harveyi, the boron-bound form of 13 specifically binds to the LuxP receptor, initiating downstream signaling cascades that modulate population-level behaviors such as bioluminescence, biofilm formation, and virulence factor expression.52 In contrast, in Escherichia coli, 13 signaling occurs independently of boron complexation. The uncomplexed form of 13 is recognized by the LsrB receptor, which mediates transcriptional responses related to a wide range of physiological functions.54 These differences in recognition indicate that 13 can be sensed through species-dependent pathways that vary with its boron-binding state. Accordingly, compound 13 was investigated as a target for developing quorum-sensing inhibitors (QSIs) with proposed applications to modulate microbial behaviors relevant to infection and ecological interactions. Further studies are needed to clarify the biological roles of 13 in QS-regulated communication and microbial community dynamics.
In addition to its established role in microbial QS networks, compound 13 has been investigated for its potential physiological effects on hosts. Although compound 13 does not appear to exhibit direct cytotoxic, anti-inflammatory, or antioxidant activity through conventional pharmacological mechanisms, recent studies suggest that it may influence host immu-ne-related responses indirectly through host–microbiota interactions.55–57 For example, elevated levels of 13 were observed in colorectalcancer (CRC) tissues, where they were associated with TNFSF9 signaling in tumor-associated macrophages and changes in CD3+ T cell infiltration, consistent with a potential immunomodulatory association in the tumor microenvironment.55 Furthermore, exogenously administered 13 influences antibiotic-associated intestinal dysbiosis and markers of gut mucosal immunity, including epithelial barrier integrity and inflammatory cytokine levels.56 Related effects have also been reported in experimental models of pulmonary inflammation, consistent with gut–lung axis crosstalk.57 Overall, these reports suggest that 13 may contribute to host–microbe interactions and aspects of immune homeostasis, although the underlying mechanisms and physiological relevance remain to be established.
At the heart of this puzzle lies the chemical intransigence of the C–F bond and bioinorganic properties of fluoride ions.60 Fluorides are small, hard, and strongly hydrated, making them a poor nucleophile in aqueous solution; at the same time, the C–F bond is among the strongest single bonds in organic chemistry.60,61 From the perspective of enzyme catalysis, both facts are problematic: the initial formation of C–F bonds is kinetically disfavored, and once formed, those bonds are difficult to remodel or cleave. It is, therefore, perhaps unsurprising that only a few specialized lineages have evolved the machinery to handle fluorine safely, developing not only fluorination enzymes but also resistance and detoxification strategies to avoid self-poisoning.60,61
Nevertheless, when biological systems engage with fluorine, the resulting metabolites are almost invariably highly potent and often exert pronounced ecological or pharmacological effects.60,61 In this section, we review the known biosynthetic pathways, enzymatic machineries, and biological functions of naturally occurring organofluorine natural products, highlighting both their rarity and their disproportionate impact.
Compound 15 is one of the most toxic natural plant metabolites. Synthetic sodium fluoroacetate (compound 1080) has been widely used as a rodenticide and predator control agent, exploiting the same toxicity mechanism as plants' natural defense mechanism with LD50 ranging from 0.07 to 500 mg kg−1.68,69
Upon ingestion by herbivores, 15 is metabolized (via acetyl-CoA) to (2R,3R)-fluorocitrate, a suicide inhibitor of aconitase in the TCA cycle.68 This leads to citrate accumulation, halting cellular respiration and causing convulsions and cardiac failure in grazing animals. The extreme potency of this toxin provides a clear evolutionary advantage to 15-producing plants by deterring grazing by mammals and insects.
One historical anomaly is fluoroacetone (16), a volatile fluorinated ketone detected in an Australian plant (Acacia georginae) in the 1960s.70 It was hypothesized as a byproduct of fluoroacetate metabolism (via malonyl-CoA condensation), but later investigations cast doubt on this finding.71 Instead, 16 may have been misidentified; fluoroacetaldehyde (the hydrate of which can form 16 analogs in derivatization assays) is considered a more plausible intermediate in plant metabolism.72,73
As such, 16 is no longer considered a bona fide natural product.74
The enzymology of plant fluorometabolites remains unclear. No analog of the microbial C–F bond-forming enzyme (fluorinase) has been found in plants.75
Nonetheless, the widespread (if low-level) ability of plants to form organofluorines suggests a basic biosynthetic capacity in the plant kingdom. Intriguingly, trace amounts of 15 and fluorocitrate can be detected in some common crops and forage plants, such as tea, alfalfa, and clover, when grown in fluoride-rich conditions.76 This observation suggests a potential, yet unconfirmed, capacity of plants to convert inorganic fluoride into organofluorine, although only certain species have evolved to accumulate high concentrations. In D. cymosum, 15 is predominantly synthesized in young leaves (which can contain >1000 ppm) and then translocated to other parts of the plant, such as older leaves or developing seeds.76 Compound 15 may be temporarily stored or compartmentalized in leaf tissue adjacent to flowers and later mobilized—after fertilization—into seeds, where it is incorporated as fluoroacetyl-CoA into fatty acids.77 Such spatial and temporal separation likely helps the plant avoid self-toxicity while still arming its seeds with a potent chemical defense for the next generation. It has also been speculated that when fluoroacetate is not required, plants may detoxify it by converting a fraction into volatile fluorinated compounds. For instance, seasonal declines in fluoroacetate levels correlate with increased emissions of volatile organofluorines, such as fluoroacetaldehyde, suggesting a possible detoxification pathway that vents fluorine as a gas.75 However, direct in vivo evidence for this volatilization mechanism is lacking, and it remains a hypothesis.
Uncovering the plant enzymes responsible for fluoroacetate biosynthesis remains a key question. Given that no fluorinase homologs are present in these species, plants have likely evolved a completely different catalytic strategy for C–F bond formation, a tantalizing prospect for future discovery.75
Plant-derived fluoroacetate is frequently sequestered as sodium or potassium salts but may also be incorporated into complex lipids.62,68 Early studies of D. toxicarium revealed a series of long-chain ω-fluorinated fatty acids in seed oils, including 16-fluoropalmitic acid, 18-fluorooleic acid, 14-fluoromyristic acid, and 10-fluorocapric acid, as well as polyunsaturated analogs such as 18-fluorolinoleic acid.78 A rare dihydroxy derivative, threo-18-fluoro-9,10-dihydroxystearic acid, likely arises from epoxy metabolism of 18-fluorooleate.79 Subsequent studies identified additional ω-fluorofatty acids, including 20-fluoroarachidic and 20-fluoroeicosenoic acids, highlighting the metabolic versatility of these plants.78,80
These fluorinated lipids are biosynthesized using fluoroacetyl-CoA as a starter unit in fatty acid synthase, enabling chain extension from the two-carbon fluorinated precursor.77 The rest-riction of such compounds to Dichapetalum species suggests a highly specialized evolutionary adaptation.
Genome mining and enrichment approaches have only identified a limited number of additional producers. Streptomyces sp. MA37, isolated from soil, harbors a fluorinase gene and synthesizes 15, 17, and (2R,3S,4S)-5-fluoro-2,3,4-trihydroxypentanoic acid, a shunt metabolite in the fluorothreonine pathway.83 Bioinformatic surveys have also detected fluorinase homologs in Actinoplanes sp. N902-109 and Nocardia brasiliensis, exhibiting 80–87% sequence identity with the S. cattleya enzyme.59 However, these organisms produce little or no fluorometabolites under standard laboratory conditions, suggesting pathway silencing or context-dependent regulation.59
Nucleocidin exhibits potent activity against trypanosomes and bacteria, likely by inhibiting protein synthesis through interactions with tRNA synthetases or translation factors.85 Despite early reports, S. calvus strains lost the ability to produce 18 for decades, creating a cryptic phenotype.86 This mystery was resolved in 2015 when production was shown to depend on a bldA-encoded Leu-tRNAUUA. Restoration of this tRNA reactivates nucleocidin biosynthesis and enables detailed biochemical analyses.87
Microbial fluorometabolites function as chemical weapons in competitive environments. S. cattleya secretes 15 and 17, which inhibit susceptible microorganisms by disrupting central metabolism or amino acid utilization.81,83,88 Resistance mechanisms protect the producer, conferring a strong ecological advantage.87 Nucleocidin similarly serves as a potent antagonist, possibly deterring protozoan predators or competing microbes in soil habitats, and exhibits antibacterial activity against gram-positive bacteria, including B. subtilis with MICs around 200 µg mL−1.87,89
The extreme rarity of microbial fluorinated metabolites reflects both the scarcity of bioavailable fluoride and metabolic cost of fluorine chemistry.74 Production typically requires millimolar fluoride concentrations that are rarely encountered in natural environments.75 Nevertheless, fluoride-rich niches may support localized fluorine cycling. For example, fluoroacetate-degrading bacteria that possess dehalogenases coexist with producing organisms and use 15 as a carbon source.90 This establishes a localized ecological loop involving toxin production and detoxification.
In summary, fluorinated natural products function as highly potent bioactive agents that enhance microbial fitness in specialized niches. Their scarcity reflects environmental constraints and biochemical complexity, yet their existence illustrates the remarkable evolutionary capacity of select microorganisms to exploit elements largely ignored by most life forms.74,75
In biological systems, arsenic incorporation typically proceeds via methylation pathways from inorganic arsenate and arsenite, often mediated by SAM methyltransferases, followed by oxidative and reductive transformations.93 Together, these processes generate a wide array of organoarsenic compounds. Importantly, arsenic speciation strongly dictates biological activity. Some organoarsenicals, such as arsenobetaine, are essentially inert, and inorganic arsenicals and certain reactive methylated species exhibit pronounced toxicity.
This section reviews the principal classes of naturally occurring organoarsenicals, emphasizing their chemical structures, bios- ynthetic pathways, and biological activities, with a focus on the comparative structure–activity relationships that underlie their diverse bioactivities.
Biosynthesis of arsenobetaine proceeds via the oxidative demethylation of arsenocholine, which is a degradation product of algal arsenosugars.98,99 Marine algae biosynthesize arsenosugars, which are subsequently metabolized by animals and associated microbiota to yield arsenocholine.100 Oxidative demethylation of arsenocholine in host tissues then produces 19. Isotope tracer experiments support this pathway and feeding studies have demonstrated the efficient conversion of arsenocholine into arsenobetaine in fish tissues.97
Thus, the formation of 19 represents the terminal step of a multistage biotransformation pathway involving the methylation, redistribution, and oxidative processing of inorganic arsenic. Through this sequence, reactive arsenic species are ultimately converted into chemically stable, biologically inert end products. Consistent with this role, arsenobetaine exhibits extremely low toxicity compared with that of inorganic arsenic species.101 Its acute toxicity in mammals is orders of magnitude lower than that of arsenite (LD50 in mice >10
000 mg kg−1 for 19 versus ∼34 mg kg−1 for arsenite), and it is rapidly excreted unchanged without further bioactivation.93
In addition to its detoxification function, 19 acts as a compatible solute in microorganisms, enhancing tolerance to osmotic and thermal stress.100 This dual role highlights arsenobetaine as both a metabolic sink for arsenic and a physiologically beneficial osmolyte.
Biosynthesis begins with the uptake of inorganic arsenate by algae, followed by SAM-dependent methylation mediated by arsenic methyltransferases to generate dimethylarsinoyl intermediates.93 These intermediates are subsequently enzymatically conjugated to ribose-derived backbones, yielding mature arsenosugars. Arsenosugars serve as important metabolic precursors for other organoarsenicals, including arsenocholine and arsenobetaine, upon degradation in the marine food chains.97,98
Arsenosugars generally exhibit low acute toxicity in mammals but are not biologically inert. In humans, ingested arsenosugars undergo partial biotransformation to more toxic metabolites, including dimethylarsinous acid (DMA(III), 20), which is associated with chromosomal aberrations and is detected as a transient metabolic intermediate (Fig. 6).104 Nevertheless, arsenosugars rarely cause acute poisoning, and seafood containing these compounds is typically safe for consumption.
Ecologically, arsenosugars appear to function primarily in arsenic detoxification and storage within algae.105 Algae can reduce arsenic-induced physiological stress by converting inorganic arsenate, which can interfere with phosphate-dependent metabolism, into stable organoarsenic conjugates. Consequently, the pentavalent arsenic center embedded within the sugar framework renders arsenosugars a relatively benign reservoir of arsenic. However, their subsequent degradation in higher organisms can regenerate toxic species depending on their metabolic capacity.103 This dual behavior highlights arsenosugars as central intermediates in marine arsenic cycling.
Arsenocholine primarily originates from arsenosugar degradation. When algae containing arsenosugars are consumed by marine animals or decomposed by microorganisms, the cleavage of the ribose moiety liberates arsenocholine.97 This process is analogous to the generation of choline-containing metabolites from the breakdown of glycerylphosphorylcholine. Once formed, arsenocholine undergoes further metabolism; in particular, the oxidative removal of the terminal hydroxymethyl group from the ethanol side chain converts 21 into arsenobetaine (trimethylarsinioacetic acid).100
Consistent with its role as a metabolic intermediate, arsenocholine exhibits low acute toxicity comparable to that of arsenobetaine (19). It is highly water-soluble and is readily excreted by animals. In marine organisms, 21 likely functions as a transient storage or transport form of arsenic en route to more stable end products. Its structural similarity to choline prompted investigations into its potential interactions with choline-related metabolic pathways.104
Notably, arsenic-containing phospholipids (arsenolipids) with an arsenocholine head group have been identified, suggesting partial utilization of 21 in lipid metabolism.105 However, no essential biological function has been established, and arsenocholine appears to function primarily as a transitory metabolite. Therefore, its presence in seafood is relevant for understanding arsenic speciation rather than its toxicological risk.
Overall, the quaternary arsonium structure of arsenocholine, comprising three methyl groups and one ethanol substituent, confers high chemical stability and low reactivity. Consequently, 21 does not readily interact with protein thiols or other toxicologically relevant targets, accounting for its minimal toxicity, similar to that observed for 19.
These arsenolipids exhibit substantial structural diversity. For example, AsHCs isolated from fish oils include species with molecular formulas such as C17H39AsO, representing arsenic-containing analogs of heptadecane, whereas arsenofatty acids span chain lengths from medium (C7–C12) to long (C14–C18) ranges.106 A unifying structural feature is the presence of a dimethylarsinoyl moiety, which is typically connected to the lipid backbone through a carbon–oxygen linkage.
The biosynthesis of arsenolipids remains incompletely unders-tood but is thought to involve the conjugation of dimethylarsinoyl units to lipid precursors. One proposed pathway involves the initial methylation of inorganic arsenic to form dimethylarsinate, followed by enzymatic attachment to preexisting lipid scaffolds. Arsenosugars or arsenocholines produced in algae may serve as intermediates in this process. Garcia-Salgado et al. demonstrated that marine algae synthesize AsFAs, suggesting biochemical coupling of arsenic to fatty acid biosynthesis.110 In the case of arsenophospholipids, an arsenocholine moiety likely replaces the native choline head group during phospholipid assembly.104,108 Collectively, these observations suggest that organisms “mix and match” arsenic-containing units with conventional lipid frameworks, generating molecul-es that closely resemble endogenous lipids while incorporating arsenic.
The pronounced lipophilicity of arsenolipids distinguishes them from more common water-soluble arsenicals. Because of their solubility in fats and oils, arsenolipids readily cross cellular membranes and accumulate in adipose tissues more efficiently than do arsenate or arsenosugars.109,110 Several AsHCs and arsenofatty acids exhibit cytotoxicity to UROtsa and HepG2 cells, with IC50 values of 8–13.5 µM.109
Arsenolipids also perturb lipid metabolism and have been suggested to contribute to dysregulation of the gastrointestinal microbiome when consumed in diets rich in fish oil.111 From a structure–activity perspective, the incorporation of a dimethylarsinoyl group within a hydrophobic chain enables arsenolipids to function as molecular “Trojan horses,” transporting a reactive arsenic center into cellular membranes under the guise of a lipid. Partial metabolic processing can release inorganic arsenic or methylated oxo-arsenicals, thereby contributing to systemic arsenic exposure.111
Ecologically, arsenolipids may undergo biomagnification in marine food webs because of their preferential partitioning into fatty tissues. Therefore, regulatory agencies have noted that the presence of arsenolipids in seafood and fish oil supplements may pose distinct health risks, given their bioaccessibility and toxicity profiles.111 In summary, the enhanced hydrophobicity and membrane permeability of arsenolipids confer elevated toxicological potential relative to water-soluble arsenicals, illustrating how the molecular context critically modulates the biological impact of arsenic.
This compact As–O cage architecture is unprecedented among marine natural products and highlights nature's capacity to construct complex arsenic–carbon frameworks.112 In 22, each arsenic atom resides in a high oxidation state, commonly described as a mixed As(III)/As(V) system distributed across the cluster. The structure may be viewed conceptually as deriving from arsenic trioxide (As4O6), with three oxygen atoms replaced by methylene groups.113 Owing to its asymmetric arrangement of arsenic and oxygen atoms, arsenicin A is chiral and has been resolved into enantiomers, although it is isolated from natural sources as a racemate.
However, the biosynthetic origin of 22 remains poorly understood. Current hypotheses suggest that it arises through coupling reactions involving inorganic arsenic species within the sponge or its associated microbiota.113 One proposal suggests gradual assembly from smaller arsenic units, such as arsenite or methylarsenite, via oxidative coupling, potentially mediated by specialized enzymes or templating proteins.92,93,112,114 Alternatively, the highly symmetric and dense As–O connectivity suggests that partially non-enzymatic processes, such as autocatalytic assembly in localized oxic microenvironments, may contribute.113 Michelet et al. proposed that 22 could form through oxidative dimerization of arsenite to generate an As4 intermediate, followed by structural rearrangement into the adamantane cage.113 The involvement of unusual arsenic-related enzymes, including atypical arsenate reductases or peroxidases, remains an area of active speculation.114,115 Collectively, these observations suggest that the production of such a complex arsenical reflects a highly specialized evolutionary adaptation potentially linked to chemical defense.114
Despite its inorganic-like characteristics, arsenicin A exhibits pronounced biological activity. It shows strong antibacterial activity against gram-positive bacteria, including S. aureus and B. subtilis, with potency in the low micromolar to submicromolar range.110,113 Structurally related analogs, such as arsenicins B and C, in which one or two oxygen atoms are replaced by sulfur, exhibit activities comparable to those of standard antibiotics, including gentamicin.113 The cage structure may also facilitate membrane intercalation, allowing the polar arsenic core to perturb membrane-associated proteins and promote oxidative damage.113 Furthermore, redox cycling between arsenic oxidation states may contribute to the generation of reactive oxygen species (ROS).
In summary, the exceptional architecture of arsenicin A is closely associated with its potent antimicrobial activity and cytotoxicity. Its rigid cage structure and multivalent arsenic content confer substantial pharmacological activity while simultaneously requiring strict biological control of its production. Consequently, 22 likely functions as a highly specialized chemical weapon or deterrent, synthesized in limited quantities, to balance ecological advantage with self-protection.110,113
In addition, 22 shows moderate cytotoxicity against leukemia and solid tumor cell lines, including NB4, BxPC-3, MiaPaca-2, AsPC-1 and U87 cells, with IC50 values in the submicromolar range (0.2–0.7 µM).112 Notably, one study reported activity against promyelocytic leukemia cells at concentrations lower than those required for arsenic trioxide (Trisenox), highlighting its potential medicinal relevance.112 Although the precise mechanism of action remains unclear, the dense As–O cluster is likely central to its bioactivity. The proposed mechanisms include the induction of oxidative stress, strong binding to protein thiols and selenols, and the disruption of essential enzymatic pathways.113
In anaerobic or low-redox environments, such as animal gastrointestinal tracts, sewage sludge, and anoxic soils, microbial ArsM enzymes (As(III) SAM-dependent methyltransferases) convert inorganic arsenite to MMA(III) and subsequently to DMA(III).114 Under oxic conditions, these trivalent methylarsenicals are rapidly oxidized to their pentavalent counterparts, dimethylarsinate (24) and methylarsonic acid (MMA(V), 25) (Fig. 7).101 Consequently, detectable levels of MMA(III) and DMA(III) in natural samples are typically low and short-lived.114
Despite their ephemeral nature, certain microorganisms actively exploit trivalent methyl arsenicals as competitive weapons.114 Several soil and gut bacteria, including Streptomyces species, can produce and excrete methylarsenite and related compounds to inhibit neighboring organisms.114 These trivalent species are substantially more toxic than arsenite or arsenate, enabling producers to suppress competitors unless resistance mechanisms are present. Known defense systems include detoxification enzymes and efflux pumps encoded by genes such as arsH, arsP, and arsN.114 This phenomenon has been widely described as microbial “chemical warfare.” In arsenic-rich environments, methylating microbes frequently outcompete non-methylators, supporting the view that ArsM evolved not only for detoxification but also for offensive ecological functions.114,115
In addition to simple methylarsenites, a limited number of specialized organoarsenicals have been identified in this category. A prominent example is arsinothricin (26), a broad-spectrum antibiotic discovered in Burkholderia species (Fig. 7).115 Arsinothricin is a methylarsenical analog of glutamate that potently inhibits bacterial glutamine synthetase.114,115 Although 26 is formally a pentavalent arsenic compound (2-amino-4-(hydroxymethylarsinoyl)butyrate), it is biosynthesized via a methylarsenite intermediate and similarly exploits the affinity of arsenic for biological ligands to exert antimicrobial activity.115 These discoveries demonstrate that microbes can elaborate complex arsenic-based weapons beyond simple oxyanionic species.
Trivalent methylarsenicals are among the most toxic arsenic species in biological systems. Their toxicity arises primarily from their strong reactivity toward thiols and other nucleophilic functional groups in proteins.114 MMA(III) and DMA(III) readily coordinate with cysteine residues, often forming stable arsenic–thiolate rings that inactivate enzymes.109 They inhibit key metabolic pathways, including pyruvate dehydrogenase and other thiol-dependent systems, and disrupt cellular redox homeostasis.109 DMA(III) displays markedly higher cytotoxicity than inorganic arsenite in human cells; in A549 cells, DMA(III) showed IC50 values of approximately 14.1 µM, inorganic arsenite exhibited considerably higher IC50 values of approximately 76.6 µM.116 Comparative studies indicate a general potency order of MAs(III) > DMAs(III) ≈ As(III) ≫ As(V), with pentavalent species being far less toxic.117
This extreme reactivity explains why trivalent methylarsenicals rarely accumulate in vivo; the organisms that generate them must rapidly export or oxidize them into less harmful forms.114 From an ecological perspective, the production of MMA(III) and DMA(III) strongly influences microbial community structure.114 Only organisms possessing resistance mechanisms, such as ArsO oxidases, which convert MMA(III) to MMA(V), or ArsP/ArsK efflux systems, can persist under arsenic-selective pressure.101 In rice paddies and other anaerobic habitats, microbial methylation and demethylation processes establish a dynamic equilibrium that prevents the long-term accumulation of these toxic intermediates.101
However, even transient exposure to low concentrations can exert disproportionate ecological effects, suppress sensitive species, and alter nutrient cycling. In summary, trivalent methylarsenicals exhibit a striking structure–activity relationship: adding one or two methyl groups to arsenite, while keeping arsenic in the +3 oxidation state, transforms a detoxification intermediate into a potent biological weapon.113,114
Arsenic-containing natural products exhibit an exceptionally broad spectrum of biological activities, ranging from virtually inert detoxification to highly potent antimicrobial and cytotoxic activities.103 Arsenobetaine, arsenosugars, and arsenocholine (Sections 4.1–4.3) display minimal acute toxicity and primarily function as storage, transport, and excretory forms that mitigate arsenic-induced stress.92,97,99
In contrast, arsenolipids (Section 4.4) exhibit enhanced toxicological potential owing to their pronounced lipophilicity, which facilitates membrane permeation, cellular accumulation, and metabolic activation. These compounds disrupt mitochondrial function and lipid homeostasis, contributing to their cytotoxic and microbiome-altering effects.109
The most potent arsenic-derived bioactivities are observed for arsenicin A and trivalent methylarsenicals (Sections 4.5 and 4.6). Arsenicin A exhibits strong antibacterial and anticancer properties that are likely mediated by oxidative stress induction and thiol-binding mechanisms.112,113 Trivalent methylarsenicals, including MMA(III) and DMA(III), are among the most toxic arsenic species in biological systems and function as chemical weapons in microbial competition by irreversibly inactivating enzymes.114
Collectively, these activities illustrate how biological systems exploit arsenic chemistry to generate metabolites spanning a continuum from detoxification reservoirs to highly specialized ecological and pharmacological weapons.
Selenium-containing natural products have attracted attention in drug discovery because of their diverse biological activities. Organoselenium compounds exhibit antioxidant, anti-inflammatory, anticancer, and antimicrobial properties. For example, methaneseleninic acid has anticancer activity and may enhance the efficacy of conventional chemotherapeutic agents. Given their structural diversity and potent bioactivity, selenium-containing natural products represent a valuable class of compounds for developing novel therapeutics. Ongoing research explores their biosynthetic pathways and molecular mechanisms, highlighting the chemical ingenuity of nature and potential for innovative drug development.122
The biosynthetic pathway begins with the activation of biologically inert selenide (HSe−) into a reactive selenium donor. In this initial step, SenC catalyzes the ATP-dependent conversion of selenide to selenophosphate (SeP), a reaction analogous to that of SeP synthetase SelD. The resulting SeP serves as a selenium donor in the downstream biosynthetic steps. In the second step, SenB catalyzes the coupling of SeP with UDP-sugar to generate 1-seleno-β-D-glucose (SeGlc), a rare selenosugar containing a carbon–selenium (Se–C) bond. This reaction represents a previously uncharacterized Se–C bond-forming enzymatic process, establishing SenB as the first selenosugar synthase to be identified. SenB accepts a range of UDP-sugars (e.g., UDP-glucose, UDP-N-acetylglucosamine (UDP-GlcNAc), and UDP-GalNAc) as substrates, among which 1-seleno-N-acetylglucosamine (SeGlcNAc) serves as the primary intermediate involved in the biosynthesis of 27 (Scheme 3). In the final step, SenA, an oxidative enzyme homologous to EgtB in the ergothioneine biosynthetic pathway, catalyzes the coupling of SeGlcNAc and hercynine to produce hercynyl-SeGlcNAc selenoxide, a key intermediate en route to selenoneine. This intermediate is converted to 27 either under reducing conditions in the presence of dithiothreitol (DTT) or via spontaneous Cope elimination (Scheme 4). Notably, this transformation proceeds without the involvement of additional enzymes (such as EgtC or EgtE), thus providing a relatively simple route to the final product. This biosynthetic pathway is distinct from previously known selenium metabolic routes and represents a non-proteinaceous mechanism for selenium incorporation. This serves as an illustrative example of the biosynthesis of natural products bearing biologically rare Se–C bonds. Genomic analyses have revealed the widespread distribution of this pathway in diverse marine and terrestrial bacteria. This pathway may provide foundational information for developing selenium-based antioxidants, metal detoxifiers, and antimicrobials, thereby contributing to the exploration of the biosynthesis and bioactivity of natural products. In a human 3D epidermal model of functional melanocytes, selenoneine significantly suppressed melanin synthesis in a dose-dependent manner. At concentrations of 1.0 and 5.0 µM, melanin levels were reduced to 39.7% and 23.0% of the control, respectively, demonstrating a more potent inhibitory effect than the positive control, arbutin.126
![]() | ||
| Scheme 3 SenC generates selenophosphate (SeP) from selenide, which is then used by SenB to synthesize selenosugars (SeGlc, SeGlcNAc, and SeGalNAc) from UDP-sugars. | ||
![]() | ||
| Scheme 4 SenA couples hercynine with SeGlcNAc to form hercynyl-SeGlcNAc selenoxide (GlcNAc-SEN = O), which spontaneously converts to selenoneine or is reduced to the corresponding selenoether. | ||
These findings highlight the strong anti-melanogenic potential of 27. In the antioxidant evaluation, 27 shows stronger radical-scavenging activity than that of ergothioneine and Trolox in DPPH assays. Furthermore, treatment with 27 significantly reduces oxidative stress in Caenorhabditis elegans after 48 h, indicating that its antioxidant effects may involve modulating cellular signaling pathways beyond direct radical neutralization. In a zebrafish embryo model, 27 reduced methylmercury (MeHg) toxicity and is involved in the conversion of MeHg to its inert form, mercury selenide (HgSe). Additionally, 27 competitively inhibits zinc-containing angiotensin-converting enzymes and suppresses melanin production in a human 3D epidermal model, indicating that it may exert skin-whitening effects by inhibiting tyrosinase activity.
In a colon cancer mouse model, the administration of tuna dark muscle extract containing 27 reduces both tumor number and size. In a non-alcoholic fatty liver disease (NAFLD) model, treatment with 27 decreases hepatic damage markers and attenuates lipid accumulation. These results suggested that 27 may provide physiological improvements in various diseases through its antioxidant and metabolic regulatory effects.126,127
In bacteria, selenocysteine biosynthesis is a specialized process involving a unique set of genes and molecular machinery. The biosynthesis of 28 is unique and does not occur through standard codon-mediated amino acid incorporation. Although the UGA codon normally serves as a stop signal in protein translation, in the specialized mechanism for 28, it is reinterpreted as a selenocysteine insertion codon. This process requires several components, including seryl-tRNA synthetase (SerS), which produces Ser-tRNASec; SelD, which synthesizes SeP; and SelA, which generates selenocysteyl-tRNASec. The resulting Ser-tRNASec is delivered to the ribosome by a dedicated factor, SelB, and this process is directed by a specific mRNA structure known as the selenocysteine insertion sequence (SECIS) element. Interestingly, certain microorganisms have evolved alternative mechanisms for selenocysteine biosynthesis. For instance, Enterococcus faecium ABMC-05, a lactic acid bacterium isolated from a traditional Mexican beverage, produces selenocysteine through a pathway independent of the canonical selA and selD genes.129 Instead, this strain utilizes the cysK gene, which encodes cysteine synthase, suggesting a novel route for selenocysteine formation. This alternative mechanism underscores the metabolic diversity among microorganisms regarding selenium incorporation. These findings highlight the diversity of selenocysteine biosynthetic pathways in microorganisms and the importance of selenocysteine-containing proteins in microbial physiology. Understanding these pathways can provide insights into selenium metabolism and its applications in biotechnology and medicine. In particular, studies on human glioma cells have shown that 28 induces apoptosis by targeting TrxR1, leading to excessive production of ROS, mitochondrial dysfunction, an imbalance in Bcl-2 family protein expression (upregulation of Bax and downregulation of Bcl-2), and activation of caspase-3 and cleavage of PARP1.130 This apoptotic response was further linked to the disruption of the AKT and MAPK signaling pathways. Notably, antioxidant pre-treatment suppresses these effects, emphasizing the pivotal role of ROS in 28-induced apoptosis. In vivo experiments using U251 glioma xenografts in mice confirmed that administering 28 significantly inhibited tumor growth, consistent with TrxR1-targeted oxidative damage.130
In C. elegans, 28 mimicked the effects of dietary restriction (DR), with lifespan extension requiring SKN-1 and accompanied by nuclear translocation of DAF-16 (the ortholog of mammalian FoxO). Consequently, treatment with 28 increased the mean lifespan from ∼17.9 to 22.6 days.131 Selenocysteine also alleviates stress induced by a high-glucose diet and delays amyloid-β- induced paralysis, suggesting its potential neuroprotective effect in Alzheimer's disease models. Taken together, these findings indicate that selenocysteine functions not merely as a sulfur substitute but also as a regulator of redox homeostasis and apoptosis and may therefore contribute to processes relevant to cancer and neurodegenerative diseases.132
The biosynthesis of 29 begins with histidine and proceeds through a sequence of enzymatic reactions that yield a TSI antioxidant featuring direct selenium incorporation (Scheme 5). The first step involves the formation of N-acetyl-1-seleno-β-D-glucosamine (SeGlcNAc) by reacting selenium, activated via SeP, with sugar. This step is catalyzed by the enzymes SenC and SenB, following a pathway similar to that of the previously known selenosugar synthesis. Subsequently, the key enzyme, OvsA, a non-heme iron-dependent oxidase, introduces selenium from SeGlcNAc into the C-5 position of the imidazole ring of histidine. This leads to the formation of the intermediate 5-selenoxide, which is rapidly deglycosylated and reduced to 5-selenohistidine (5-SeHis). In the final step, the SAM-dependent methyltransferase OvsM methylates the imidazole ring of 5-SeHis to yield the final product, 29. Although this pathway is mechanistically analogous to ovothiol biosynthesis, the formation of the C–Se bond in 29 represents a unique biosynthetic feature, highlighting the biochemical diversity of selenium utilization in nature.133
In addition to its unique biosynthetic pathway, 29 is a newly identified microbial selenometabolite that features a C–Se bond. This structural feature contributes to its redox activity. Electrochemical analysis revealed that 29 exhibits a reduction peak potential of −1.02 V, which is more positive than that of 28 (−1.28 V), suggesting its comparatively enhanced electron-donating capacity and possible relevance as an unusual redox-active selenium compound. Although compound 29 exhibits promising redox properties, its precise physiological function remains to be elucidated. However, further studies are required to investigate its potential involvement in redox regulation, transcriptional control, and metal stress responses. The identification of 29 adds to the growing chemical diversity observed in microbial selenium metabolism and underscores the need to further explore the functional roles of selenometabolites in nature.133
Given their structural novelty and potent bioactivity, iodine-containing natural products represent an important frontier in marine natural product research, offering therapeutic potential for multiple diseases, including cancer, infectious diseases, and metabolic disorders. Continued exploration and characterization of these compounds will not only elucidate the functional roles of iodine in marine biosystems but also expand the repertoire of bioactive scaffolds for drug discovery.
Methyl 2-iodobenzoate (31) is a naturally occurring halogenated aromatic ester first isolated from the actinomycete Streptomyces chartreusis (Fig. 9).139 It represents the earliest known example of a naturally produced iodinated volatile organic compound. As a volatile compound, 31 may play a role in microbial communication or chemical defense; however, its ecological and physiological functions remain unclear. Cultivation of S. chartreusis in GYM medium enriched with 2-iodobenzoic acid resulted in the detection of methyl 2-iodobenzoate. This finding indicates that S. chartreusis likely converts the supplemented carboxylic acid precursor into a methyl ester through an efficient in situ methylation process.
Miuraenamide B (32) is a natural cyclic depsipeptide product isolated from the myxobacterium Paraliomyxa miuraensis (strain SMH-27-4) (Fig. 9).140 This strain was isolated from a soil sample collected near the coast of the Miura Peninsula in Kanagawa Prefecture, Japan. Structurally, 32 belongs to a unique class of secondary metabolites featuring a macrocyclic ring composed of both peptide and ester linkages, which contributes to its structural rigidity and bioactivity. Miuraenamide B shares an identical core structure with miuraenamide A, with a structural difference in the substitution of bromine with iodine at the tyrosine residue.141 Miuraenamide A is biosynthesized through a biosynthetic gene cluster consisting of two type I PKS genes (miuA and miuB), one non-ribosomal peptide synthetase (NRPS) gene (miuC), and four tailoring enzyme genes (miuD–G). The biosynthesis of miuraenamide A is initiated using acetyl-CoA as a starter unit, followed by iterative extensions with malonyl-CoA and methylmalonyl-CoA to form the polyketide backbone, after which L-alanine, brominated tyrosine, and methoxylated phenylalanine are sequentially incorporated via the NRPS modules. The resulting intermediate is then cyclized and released by the TE MiuF, and the FAD-dependent halogenase MiuG catalyzes the bromination of free L-tyrosine to introduce the specific bromotyrosine moiety in miuraenamide A. MiuG catalyzes the iodination of the L-tyrosine moiety during the biosynthetic pathway of miuraenamide B. In an anti-Phytophthora screening experiment using the paper disk diffusion method, 32 exhibited strong antimicrobial activity, inhibiting the growth of the phytopathogen Phytophthora capsici at the very low concentration of 0.025 µg per disc.142
Compounds 33 and 34 are derived from cembrane diterpenes, and their biosynthesis primarily involves halogenation and cyclization reactions. In the initial step, the cembrane diterpenes react with halogens (iodine or bromine) and undergo halogenation. After the halogen attachment, the compound forms an additional ring structure via an electrophilic reaction. This process begins with oxygenated cembrane diterpenes and is significantly mediated by haloperoxidase, an enzyme derived from red algae that promotes ring formation using halogens. The resulting ring structures yield unique iodinated diterpenes 33 and 34. It is more likely that 33 and 34 originate from red algae rather than cyanobacteria. Marine cyanobacteria predominantly produce bioactive compounds via the polyketide and non-ribosomal peptide pathways, often yielding modified amino acid structures. By contrast, peptide- or acetogenin-like structures have rarely been reported in cyanobacteria. In particular, diterpene compounds are scarcely found in cyanobacteria but are frequently detected in red algae, particularly those of the genus Laurencia. Considering these biosynthetic distribution characteristics, 33 and 34 are more likely to originate from red algae.143
Calicheamicins are composed of three key components: an enediyne core, a trisulfide-containing trigger moiety, and a unique oligosaccharide unit.149 These structural elements are biosynthesized through a series of highly orchestrated enzymatic steps in M. echinospora ssp. calichensis. The central enediyne core, which confers a potent DNA-cleaving activity, is biosynthesized by an iterative type I PKS. This enzyme modulates the formation of a highly strained macrocyclic structure by incorporating conjugated triple and double bonds, which are further modified by associated tailoring enzymes to afford a characteristic 10-membered enediyne framework. Following core formation, multiple rare sugar residues are appended by glycosyltransferases in a stepwise manner, facilitating specific recognition and binding to the minor groove of the DNA. The final biosynthetic step involves the incorporation of a trisulfide moiety derived from cysteine, which functions as a molecular “trigger” that activates the enediyne for DNA strand scission. In addition, calicheamicins contain an iodinated orsellinic acid moiety; however, the timing of iodine incorporation during biosynthesis has not yet been cl-early established. This modular biosynthetic strategy yields calicheamicin analogs with remarkable cytotoxicity and therapeutic potential.150 Although the calicheamicin derivatives exhibit strain-dependent variability, they show potent antibacterial activity against a wide range of bacteria. Compounds 37, 38, 39, and 41 showed MICs ranging from 0.06 to 0.5 µg mL−1 against key pathogenic strains, including E. coli, Klebsiella pneumoniae, Enterobacter sp., and Citrobacter sp.148 Notably, compounds 37–41 exhibited exceptional antibacterial potency against S. aureus, S. epidermidis, S. faecalis, M. luteus, and B. subtilis, with MICs as low as 0.0005 µg mL−1, effectively inhibiting bacterial growth at concentrations markedly lower than those of conventional antibiotics.148
Iodine-containing isomers show enhanced activity and are expected to play a significant role in various therapeutic applications, including cancer treatment. Compound 40, in particular, induces double-stranded DNA breaks, leading to cell death and antitumor efficacy. Through mechanisms involving cytotoxicity and DNA damage induction, it exerts a potent antineoplastic effect, notably causing strong cytotoxicity and showing antitumor activity through mechanisms such as NAD depletion and poly (ADP-ribosylation) activation.150
Vanadium-containing natural products (Fig. 12) include a single non-protein small molecule, amavadin (42),158 along with two broad classes of vanadium-binding macromolecules: vanabin (43), which are cytoplasmic storage and carrier proteins identified in ascidian blood cells,159 and functional enzymes such as vanadium-dependent haloperoxidases (44) and vanadium-dependent nitrogenases (45).160 Given their structural diversity and potent bioactivities, vanadium-containing natural products represent promising leads for new drug development and therapeutic applications. The following section summarizes known vanadium-containing natural products, with emphasis on their origins, structural features, and biological functions.
Amavadin (42), originally isolated from fly agaric (Amanita muscaria),162 is the first and only non-proteinaceous small-molecule natural product known to incorporate vanadium. Electron paramagnetic resonance (EPR) spectroscopy established vanadium's +4 oxidation state in amavadin, and the full structure was subsequently reported a year later.161 Subsequently, studies employing EPR, infrared (IR) spectroscopy, and other analyt-ical techniques led to further revisions of the proposed structures.162–164 Definitive elucidation of amavadin's structure was achieved in 1987 through X-ray crystallography and NMR spectroscopy.165,166 Its structure features a vanadium(IV) center in a rare non-oxo, octacoordinate environment chelated by two fully deprotonated (S,S)-N-hydroxyimino-(2,2′)-dipropionate (HIDPA) ligands. Each tetradentate HIDPA ligand binds via one oximate N–O donor and two carboxylate O donors, yielding an overall [V(HIDPA)2]2− complex anion with C2 symmetry.167 Amavadin (42) exists in two helical isomeric forms, Δ-amavadin and Λ-amavadin, depending on the right- or left-handed orientation of the C2-symmetric HIDPA ligands around the vanadium(IV) center. NMR and CD spectroscopy have shown that natural product 42 exists in solution as an approximately 1
:
1 mixture of the Δ- and Λ-isomers.168 Until now, no structurally related derivatives of 42 have been reported, and its biosynthetic pathway in A. muscaria remains uncharacterized.
Although no pharmacologically relevant biological activity has been demonstrated for 42, several studies have reported its redox-mediated catalytic properties in vitro. Amavadin promotes the oxidation of thiols such as cysteine and glutathione, both in the absence and presence of hydrogen peroxide, exhibiting peroxidase- and catalase-like behavior.165,169 It also facilitates per-oxidative halogenation and hydroxylation of organic substrates under oxidative conditions.169 Electrochemical studies revealed that 42 catalyzes thiol oxidation via a mechanism consistent with Michaelis–Menten kinetics.170 Additionally, 42 catalyzes the oxidation of methane to acetic acid under mild conditions in the presence of peroxodisulfate and trifluoroacetic acid.171 Although the physiological relevance of this transformation remains unknown, such reactivity supports its potential role as a redox mediator in fungal metabolism.
Although 42 remains the only known vanadium-containing small molecule, vanadium more commonly occurs in living organisms as part of macromolecular complexes. Among these are vanabins (43) identified in the ascidian Ascidia sydneiensis samea172 and fan worm Pseudopotamilla occelata.173 Vanabins with molecular masses of 12.5, 15, and 16 kDa were found in the cytoplasm of vanadocytes using immunodetection with specific antibodies.172,174 Subsequent studies advanced the molecu lar characterization of vanabins. Biochemical analyses demonstrated their high selectivity for V(IV) over other metals,162 and molecular cloning revealed two isoforms, vanabin 1 and vanabin 2, capable of binding 10 and 20 vanadyl ions, respectively.175 Structural studies using NMR spectroscopy provided the first three-dimensional model176 and later uncovered a cysteine-rich folding pattern, unlike that of any known metalloprotein.177 Despite extensive structural and biochemical characterization, the physiological function of vanabins appears to be limited to intracellular vanadium storage and transport within vanadocytes.162,175,176,178
In addition to serving as a storage protein, vanadium functions as a catalytic enzyme cofactor. The best-studied examples are vanadium-dependent haloperoxidases (44), which employ vanadate covalently bound to a histidine residue to catalyze the oxidation of halides by hydrogen peroxide.177,179 These enzymes have been identified in marine algae, such as Ascophyllum nodosum and Corallina officinalis,177,180 as well as in terrestrial fungi and lichens,181 where they participate in the biosynthesis of diverse halogenated metabolites. Vanadium-dependent nitrogenases (45) represent another class of catalytic enzymes found in certain diazotrophic bacteria such as Azotobacter vinelandii.182 Unlike canonical molybdenum nitrogenases, the vanadium-dependent form contains a V–Fe cofactor and is capable of reducing dinitrogen to ammonia under molybdenum-limited conditions182,183 while also exhibiting higher activity toward alternative substrates such as CO and CN−.184 Together, these vanadium enzymes illustrate how organisms exploit the unique redox versatility of vanadium to catalyze key oxidative and reductive transformations in nature.
For catalytic activity, molybdenum is incorporated into cofactors that serve as integral components of diverse metalloenzymes.189
Molybdenum-containing natural products (Fig. 13) are exclusively represented by protein-bound cofactors rather than small molecules. The molybdenum cofactor (Moco, 46) is widely distributed in nature190 and supports the catalytic activity of enzymes such as xanthine oxidase,191 sulfite oxidase,192 and nitrate reductase.193,194 In contrast, the structurally distinct iron–molybdenum cofactor (FeMoco, 48) serves as the catalytic core of nitrogenase, mediating the biological reduction of dinitrogen to ammonia.195 Given their essential catalytic roles, molybdenum-containing natural products represent important targets for biomedical research and the development of novel therapeutics.189 This section highlights current knowledge of their occurrence, structural organization, and catalytic functions across diverse biological systems. The Moco (46), a pterin-based metallocofactor, was first identified in the early 1970s through studies of molybdoenzyme-deficient mutants in organisms such as Neurospora crassa, which revealed that multiple enzymes require a shared molybdenum-containing cofactor.186,196 Structurally, 46 is constructed on a conserved molybdopterin (47) scaffold, featuring a tricyclic pyranopterin core that coordinates the molybdenum center through an enedithiolate ligand. Its structure was initially elucidated by chemical degradation studies, which established the pterin-based framework and sulfur ligation pattern, and was later corroborated by crystallographic analyses of molybdenum-dependent enzymes.197–199 Through this coordination environment, 46 enables a broad spectrum of redox transformations, most prominently hydroxylation and oxo-transfer reactions. Moco-dependent enzymes mediate essential transformations in carbon, nitrogen, and sulfur metabolism, including purine catabolism, sulfite detoxification, and nitrate reduction.187 In higher eukaryotes, these catalytic functions are indispensable for maintaining metabolic homeostasis, whereas deficiencies in 46 biosynthesis result in severe and often lethal metabolic disorders.200–202
The FeMoco (48) represents a structurally and functionally distinct form of molybdenum used in biology. Unlike the universally distributed pterin-based Moco, 48 is exclusively associated with nitrogenase enzymes, where it serves as the catalytic core for the biological reduction of dinitrogen to ammonia.203,204 Crystallographic analyses revealed its unique cluster architecture, [(MoFe7S9C)-homocitrate], consisting of a molybdenum center coordinated by homocitrate and a carbide-bridged Fe–S cluster.205,206 This elaborate metallocluster exemplifies one of the most complex cofactors known in nature and confers the exceptional reducing power of nitrogenase.207 Beyond dinitrogen reduction, 48 also displays reactivity toward alternative substrates such as CO and CN−, highlighting its remarkable catalytic versatility.183,184 Together, Moco (46) and FeMoco (48) exemplify the diversity of molybdenum-containing natural products, highlighting how fundamentally distinct architectures enable molybdenum to serve versatile catalytic functions in biology.
From a biosynthesis perspective, these unusual atoms are introduced via remarkably diverse mechanisms. Arsenic and selenium are incorporated through SAM-dependent methylation and SeP-mediated routes, respectively, whereas fluorine enters the metabolism via a fluorinase-catalyzed nucleophilic substitution reaction—the only known biological C–F bond-forming process. In contrast, boron and iodine are frequently installed via non-enzymatic or halogenase-assisted processes, and transition metals such as molybdenum and vanadium are incorporated into sophisticated cofactors that mediate essential redox transformations. These contrasting strategies, ranging from enzymatic precision to spontaneous coordination, underscore the biochemical ingenuity of nature in overcoming the physicochemical constraints of rare elements.
Despite their rarity, atypical natural products have pivotal ecological and physiological roles. Arsenobetaine and arsenosugars act as detoxification and storage forms in marine food webs, whereas selenium metabolites such as selenoneine and selenocysteine provide antioxidant protection and redox regulation across organisms. Fluorinated metabolites, such as fluoroacetate and 4-fluoro-L-threonine, function as potent antifeedants and enzyme inhibitors, reflecting the use of fluorine for chemical defense. Iodinated compounds, particularly marine metabolites, such as tasihalides and miuraenamides, exhibit pronounced antimicrobial and cytotoxic effects, highlighting the ecological and pharmacological significance of halometabolites. Boron macrolides such as boromycin and tartrolon E function as ionophores with activity against M. tuberculosis and P. falciparum, whereas molybdenum and vanadium cofactors underpin central metabolic and nitrogen-fixing reactions that are fundamental to global biogeochemical cycles.
These examples reveal convergent evolutionary strategies: (i) chemical stabilization of reactive atoms through coordination (B, Mo, and V); (ii) controlled redox modulation enabling detoxification or signaling (Se and As); and (iii) electrophilic halogenation as a means of defense or metabolic diversification (F and I). These patterns reflect the adaptive value of element-specific chemistry for ecological survival and inter-species competition. Importantly, several of these pathways, particularly those involving halogenases, methyltransferases, and selenium- or boron-handling enzymes, demonstrate that life can exploit even the most energetically demanding reactions by evolving specialized enzymatic architectures and cofactor systems.
In addition to their natural context, these metabolites hold substantial promise for biotechnological applications and drug discovery. Fluorinase from S. cattleya has inspired biofluorination platforms for the synthesis of fluorinated pharmaceuticals. Boron-containing macrolides have served as leads for next-generation antibiotics, and the Sen pathway of selenoneine biosynthesis provides a model for engineering Se–C bond-forming enzymes. Similarly, the structural logic of iodine- and arsenic-containing metabolites provides new scaffolds for developing anticancer and antimicrobial agents. Together, these systems exemplify how knowledge of element-specific biosynthetic logic can guide the rational design of bioinspired catalysts and therapeutics.
Integrating genomics, metalloproteomics, and metabolomics will likely reveal additional families of rare element natural products hidden in unexplored microbiomes and marine symbioses. Coupled with advances in cryo-EM, isotope-enabled tracing, and machine learning–based pathway prediction, future studies can elucidate how these elements are selected, activated, and integrated by living systems. Ultimately, exploring atypical element biosynthesis illuminates the outer limits of biological chemistry, where the inorganic and organic worlds converge, and paves the way toward sustainable biotechnologies that harness the same molecular ingenuity that has been refined for millions of years.
Footnote |
| † These authors contributed equally to this review. |
| This journal is © The Royal Society of Chemistry 2026 |