Open Access Article
Ying Chen†
ab,
Yunsheng Chen†b,
Hao Xiang†
bf,
Changqi Luob,
Jiaqi Duanb,
Kun Hucf,
Xiaohong Zhengd,
Jing Liu
e,
Yongbo Xue
*a and
Yi-Ming Shi
*bf
aSchool of Pharmaceutical Sciences (Shenzhen), Sun Yat-sen University, Shenzhen, China. E-mail: xueyb@mail.sysu.edu.cn
bState Key Laboratory of Quantitative Synthetic Biology, Center for Synthetic Biochemistry, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China. E-mail: ym.shi@siat.ac.cn
cState Key Laboratory of Phytochemistry and Natural Medicines, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
dEquipment Public Service Center, South China Sea Institute of Oceanology, Guangzhou, China
eDepartment of Natural Products in Organismic Interactions, Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
fUniversity of Chinese Academy of Sciences, Beijing, China
First published on 27th February 2026
Non-ribosomal lipopeptide antibiotics derive much of their antibacterial performance from the structure of their lipid tail, yet genome mining remains largely peptide-centric and rarely prioritizes lipid donor formation as an entry point. Herein, we establish a lipid-donor-anchored discovery strategy using pfa-type polyunsaturated fatty acid synthase (PUFAS) core genes as the genomic anchor for long polyunsaturated acyl donors, followed by filtering for co-localization of non-ribosomal peptide synthetase (NRPS) assembly lines. This approach delineated 60 curated PUFAS–NRPS biosynthetic gene clusters, including a previously unassigned dxp family, which is distributed across Proteobacteria. Cultivation of Chitinimonas koreensis DSM 17726 enabled the isolation of dioxanopeptins, a new set of non-ribosomal lipopeptides featuring a long polyunsaturated lipid tail further functionalized by a pyruvyl-derived 1,3-dioxane ketal motif. Recombinant DxpH catalyzed the in vitro conversion of dedioxanopeptin to dioxanopeptin using phosphoenolpyruvate as the pyruvyl donor, establishing pyruvylation as a post-assembly tailoring step. Dioxanopeptins displayed selective activity against methicillin-resistant Staphylococcus aureus, and their antibacterial phenotype correlated with disruption of proton motive force homeostasis and ATP production. Removal of the pyruvyl-derived ketal markedly diminished potency, highlighting its contribution to antibacterial activity. Together, these results expand the chemical space of lipopeptides and illustrate how lipid-donor-anchored genome mining can complement NRPS-focused strategies to access functionally differentiated antibiotic scaffolds.
As membrane insertion is frequently the entry point to activity, small structural changes in the lipid moiety, including chain length, branching, saturation, and functionalization, can dramatically influence the potency, spectrum, selectivity, and mode of action. As a trend, shorter lipid tails can favor efficient insertion into bacterial membranes, whereas longer and more hydrophobic tails can enhance stable membrane binding and oligomerization, often shifting antibacterial potency and selectivity. Branched-chain fatty acyl groups alter how the molecule packs into bilayers: even a methyl branch can introduce packing defects and modulate membrane disorder and pore-forming propensity.4 Unsaturated bonds likewise introduce conformational constraints that reshape insertion geometry and self-assembly, thereby affecting biological outputs.
Although lipid-tail features strongly modulate lipopeptide bioactivity, genome mining is still largely peptide-centric, prioritizing biosynthetic gene clusters (BGCs) primarily by non-ribosomal peptide synthetase (NRPS) domain architecture5–8 and therefore providing limited leverage over the lipid-tail chemical space. A key bottleneck is that, for many canonical lipopeptides, the acyl donor is supplied by ubiquitous housekeeping type II fatty acid synthase (FAS II),9 so lipid donor formation often lacks a distinctive, cluster-encoded genomic beacon for prioritizing unusual lipid tails. We frame lipopeptide genome mining around four biosynthetic checkpoints: lipid donor formation, lipoinitiation, assembly, and tailoring (stages 1–4, Fig. 1). Currently, most established genome mining workflows tend to focus on BGC signatures in stages 2–4. Lipoinitiation is frequently exploited, leveraging fatty acyl-AMP ligase/ACP systems10 and starter condensation (C) domains11 that couple an acyl group to the first amino acid, but is less likely to indicate which non-canonical lipid donors are supplied. Non-ribosomal peptide synthetase–polyketide synthase (NRPS-PKS) assembly is then prioritized via module architecture and A-domain specificity prediction,6 which is powerful for proposing peptide cores. Finally, tailoring11 (e.g., thioesterase domain- & terminal reductase domain-mediated macrocyclization or reductive offloading; modification mediated by halogenases and glycosyltransferases) can guide diversification. Consequently, the lipid-tail dimension, despite its outsized functional importance, remains comparatively underexploited as a genome-mining entry point.
![]() | ||
| Fig. 1 Lipid-donor-anchored genome mining for lipopeptide discovery. Schematic comparison of genome-mining entry points mapped onto four stages of lipopeptide biosynthesis. | ||
Given that lipid-tail features can drive membrane engagement and bioactivity, we reasoned that a discovery strategy that explicitly prioritizes lipid-donor formation might better enrich for functionally differentiated lipopeptides. Here, we shifted non-ribosomal lipopeptide genome mining from a peptide-centric to a lipid-donor-anchored strategy, using cluster-encoded lipid-donor formation genes as the primary entry point and NRPS co-localization as a secondary filter. As a proof-of-concept, we focused on polyunsaturated fatty acid synthase (PUFAS) core genes (pfaA-D) as the lipid-donor machinery, which is largely BGC-encoded and thus provides a genomic signature for mining new classes of lipopeptides. This led to the discovery of the dxp (dioxanopeptin) BGC family, predominantly found in Proteobacteria. The Ck-dxp BGC from Chitinimonas koreensis DSM 17726 was expressed under standard laboratory conditions and produced unusual lipopeptides, termed dioxanopeptins (1–6), bearing a long unsaturated lipid tail with a 1,3-dioxane ring, putatively installed by a pyruvyltransferase. Dioxanopeptins exhibited selective activity against methicillin-resistant Staphylococcus aureus (MRSA) by dissipating the proton motive force (PMF), thereby compromising ATP production.
By doing so, we identified 60 candidate BGCs in which a pfa-type PUFAS occurs adjacent to an NRPS assembly line. We then used BiG-SCAPE14,15 to construct a BGC similarity network against the reference BGCs in the MIBiG database,16 which classified these BGCs into four major families (Fig. 3a). This includes three previously described families, fcl (fabclavine biosynthesis),17 zmn (prezeamine biosynthesis),18 and mgp (megapolipeptin biosynthesis),19 as well as a previously unrecognized family without linkage to any MIBiG entries, which we term dxp. We then performed phylogenetic and synteny analyses to delineate and visualize the family-classified BGCs (Fig. 3b and S1). Our analysis was consistent with earlier reports showing that fcl and zmn clusters were primarily associated with genera such as Xenorhabdus, Serratia, and Dickeya,20–22 whereas the mgp family appears largely restricted to Paraburkholderia.19 In contrast, synteny analysis revealed the dxp family as a distinct and comparatively restricted lineage, represented by Chitinimonas, Chromobacterium, Tistlia, Azospirillum, and Collimonas within Proteobacterial genomes.
In the fcl17 and zmn18 pathways (Fig. 4a), the PUFAS-like system is repurposed to build a long polyamino alcohol chain (Fig. 4b) via recruitment of a modular aminotransferase, together with a stand-alone KR and a thioester reductase. In parallel, a PKS–NRPS assembly line produces a mature peptidyl thioester tethered to a carrier protein, which is subsequently condensed with the polyamino alcohol chain to yield fabclavines and prezeamines. The specialized polyamino alcohol moiety is appended to the C-terminus of the peptide scaffold. The mgp family has distinct auxiliary functions (Fig. 4a), including an unusual stand-alone cupin family enzyme and PfaD homologs with an additional ACP domain.19 Notably, the mgp family encodes a pyruvyltransferase, proposed to act together with a thiamine pyrophosphate-dependent lyase to assemble the 4-oxoheptanedioic moiety that decorates the polyunsaturated lipid chain. The specialized lipid thus constitutes the N-terminal starter unit of the megapolipeptin scaffold (Fig. 4b).
By contrast, the dxp family encodes a compact PUFAS–NRPS core (Fig. 4a), in which DxpD (PfaA homolog) carries four tandem ACP domains and DxpF (PfaD homolog) carries a single ACP. This feature matches the mgp family but differs from the fcl and zmn families. In addition, the dxp family encodes fewer accessory genes flanking the core assembly line, yet retains a putative pyruvyltransferase, suggesting a distinct lipid-tail functionalization.
To link genotype to chemotype, we selected three publicly accessible DSMZ strains harboring representative dxp BGCs, including C. koreensis DSM 17726, Chromobacterium subtsuga DSM 17043, and Tistlia consotensis DSM 21585, for cultivation and LC-HRMS screening. To facilitate targeted detection and interpret structural variation, we profiled NRPS A-domain substrate specificity for the prioritized BGCs using three complementary predictors, NRPStransformer23 and PARAS/PARASECT,24 in parallel. We recorded the top-three candidates for each A domain (Table S1). Among the three tested strains, extracts of C. koreensis DSM 17726 reproducibly showed a series of peaks with late retention times (1–10, Fig. 4c), consistent with a highly hydrophobic scaffold. Also, MS/MS spectra displaying neutral losses correspond to valine, leucine, and glutamine residues (Fig. S4–S17). Isotope feeding experiments of d8-L-valine and 13C/15N-labeled media further supported a molecular composition of six nitrogens and 43–49 carbons. Five out of six nitrogens are attributable to four A-domain-incorporated amino acid residues (four backbone amide nitrogens plus one side-chain amide nitrogen from an amide-containing residue, e.g., glutamine), and the remaining one is attributable to an amino transferase domain-installed amino group. The 43–49 carbon count exceeds what would be expected from the peptide alone. Collectively, these data suggest a lipopeptide family bearing an extended polyunsaturated lipid tail, as anticipated from the Ck-dxp BGC. This motivated scale-up fermentation and subsequent isolation and structural elucidation of compounds 1–10.
The stereochemistry of C-4, C-7, and C-41 was tentatively proposed to be 4R, 7S, and 41R, respectively, based on the analysis of the catalytic subdomain of KR (KRC) sequences.27 In the resulting phylogeny, the Ck-DxpD KRC clustered within the B-type KR clade, whereas the Ck-DxpB KRC grouped with the A-type KR clade, supporting type-specific stereoselectivity assignment (Fig. S3). In particular, to provide NMR evidence for the C-41 stereochemical assignment, we recorded a PSYCHEDELIC spectrum28 on dioxanopeptin B (higher abundance) to resolve the congested multiplet and extract 3JH39–H41 = 7.5 Hz (Fig. S2), which indicates an anti-relationship between H-39 and H-41 and is in agreement with the C-41 assignment inferred from the KRC phylogeny. To determine the configuration at C-25, we performed DFT-GIAO NMR calculations29 on truncated structural models. The computed shifts for the 25R isomer (1T-a) showed better agreement with the experimental 13C and 1H data than those for the 25S isomer (1T-b), as reflected by the higher R2 and lower mean absolute error/corrected mean absolute error values. Therefore, DP4+ analysis30 strongly supported the 25R isomer over 25S, giving a DP4+ probability of 100.00% (Fig. S2).
The structures of dioxanopeptins B (2) and C (3) were confirmed by comparative NMR analysis with dioxanopeptin A (1). Dioxanopeptins E (5) and F (6), with shorter retention times in the HPLC chromatogram, showed MS/MS fragmentation patterns identical to 2 and 1, respectively, but had a +18 Da mass shift (Fig. S12 and S13). Therefore, 5 and 6 were assigned as corresponding hydrolytic linear congeners of 2 and 1.
We observed a series of compounds (7–10) that eluted earlier in the HPLC chromatogram than dioxanopeptins A–C (1–3). The MS/MS spectra of 1–3 and 7–10 showed conserved fragmentation for the peptide moiety, while diagnostic fragments of the lipid portion differed by −70 Da (C3H2O2, Fig. S14–S17). This mass deficit is consistent with the loss of a pyruvyl-like moiety, indicating that compounds 7–10 lack the pyruvyl-derived ketal.
Canonical PUFASs involve DHFabA dehydratase/isomerase chemistry for cis double-bond installation.31 To contextualize this unusual geometry biosynthetically, we further analyzed the PUFA-associated DH domains. Ck-DxpE contains two tandem hotdog-fold DH domains. Based on their order, we refer to these regions as Ck-DxpE DH1 (N-terminal) and Ck-DxpE DH2 (C-terminal). Sequence comparison (Fig. S21) indicates that DH1 is degenerate, as it lacks an eight-residue segment within the conserved catalytic region (corresponding to positions 70–88 in FabA and FabZ) and does not retain the canonical His–acidic residue dyad, whereas DH2 preserves an intact catalytic segment and the His–acidic dyad, consistent with a catalytically competent FabZ-like DH domain. We therefore focused our phylogenetic and active-site analyses on Ck-DxpE DH2 as the likely functional dehydratase. Similarly, MgpF contains two DH domains, of which DH1 appears to degenerate, whereas DH2 retains the canonical catalytic active sites. Phylogenetically, Ck-DxpE DH2 falls within the FabA clade and clusters with the DH2 from the mgp family. However, targeted sequence comparison against UniProt-reviewed FabA and FabZ references shows that Ck-DxpE DH2 lacks residues associated with FabA-type isomerization and instead carries a FabZ-like catalytic signature at the His–acidic-residue pair (FabA: His/Asp; FabZ: His/Glu),32 which is consistent with DH activity that yields trans-2-enoyl intermediates without Δ2 to Δ3 cis isomerization. Taken together, these analyses are consistent with the trans assignment by NMR and provide a plausible mechanistic rationale for why the dxp PUFAS–NRPS system may deviate from the more typical cis-PUFA outcome.
In the subsequent full-reduction cycle, the newly formed β-ketoacyl intermediate undergoes KR (DxpD)- and DH (DxpE)-mediated processing followed by enoyl reduction by ER (DxpF) to complete a fully reduced two-carbon extension. Three additional repetitions of this alternation account for the observed polyunsaturated lipid chain.
The 22-carbon lipid is then passed to the downstream PKS–NRPS machinery. A PKS extension associated with DxpA is proposed to generate a β-keto group, which is then converted to a β-amino functionality by the embedded aminotransferase domain, yielding a 24-carbon polyunsaturated amino lipid. DxpA and DxpB catalyze the sequential incorporation of L-leucine/L-valine, L-asparagine, and L-alanine/L-serine/glycine, and the PKS module in DxpB installs a β-hydroxyacyl extender unit. The terminal residue is incorporated by the first module of DxpC and varies among valine/isoleucine, followed by epimerization to the D-configuration. Finally, macrocyclization or hydrolytic off-loading is proposed to be catalyzed either by the TE domain of DxpC or by the stand-alone TE DxpG, yielding the cyclic dedioxanopeptins (7–10), in which the amino group on the lipid chain is involved in ring closure, as well as in the formation of linear products.
The presence of a 1,3-dioxane ring suggests the formation of the dihydroxyl group on the polyunsaturated fatty acyl chain. We therefore propose that DxpH, a putative pyruvyltransferase, installs a pyruvyl ketal onto dedioxanopeptins (7–10) to furnish the dioxane motif of dioxanopeptins (1–6). To test this conversion, we first chemically removed the pyruvyl-like moiety from dioxanopeptin B (2) under acidic conditions to afford dedioxanopeptin B (8). We then performed in vitro assays using a recombinant Ck-DxpH protein (Fig. S23). Incubation of Ck-DxpH with 8 and potassium phosphoenolpyruvate showed the formation of 2, as confirmed by LC-HRMS and MS/MS comparison with authentic 2 (Fig. 5b and c). This indicates that pyruvylation occurs as a post-assembly modification. These results indicate that pyruvylation is a post-assembly tailoring modification in dioxanopeptin biosynthesis.
| Organism | 1 | 2 | 3 | 4 | 8 | ||||
|---|---|---|---|---|---|---|---|---|---|
| MIC | MBC | MIC | MBC | MIC | MBC | MIC | MBC | MIC | |
| a MIC and MBC, minimal inhibitory concentration and minimal bactericidal concentration, µg mL−1; MRSA, methicillin-resistant S. aureus; n.a., not applicable. | |||||||||
| S. aureus ATCC 29213 | 4 | 4 | 16 | 16 | 4 | 4 | >64 | >64 | >64 |
| S. aureus T144 (MRSA) | 4 | 4 | 8 | 8 | 8 | 8 | 64 | 64 | >64 |
| E. faecium BM1405 | 8 | 8 | 8 | 8 | 8 | 8 | 16 | 32 | n.a. |
| E. coli ATCC 25922 | >64 | >64 | >64 | >64 | >64 | >64 | >64 | >64 | >64 |
| E. coli B2 | >64 | >64 | >64 | >64 | >64 | >64 | >64 | >64 | n.a. |
| A. baumannii 7-2 | >64 | >64 | >64 | >64 | 32 | >64 | 64 | >64 | n.a. |
| A. baumannii 7-2 LPS deficiency | 2 | 8 | 1 | 8 | 4 | 32 | 16 | 32 | n.a. |
| K. pneumoniae ATCC 43816 | >64 | >64 | >64 | >64 | >64 | >64 | 64 | >64 | n.a. |
| K. pneumoniae ATCC 43816 ΔwaaC | 64 | >64 | 64 | >64 | 16 | >64 | 64 | >64 | n.a. |
| P. aeruginosa PAO1 | >64 | >64 | >64 | >64 | 64 | >64 | 64 | >64 | n.a. |
To decipher the underlying mechanism of dioxanopeptins, we first evaluated the time-killing dynamics of dioxanopeptin A (1) against S. aureus at metabolically static and active states. The compound showed no bactericidal activity against S. aureus under 0 °C conditions, but was activated when the cultivation temperature was increased to 37 °C, implying that the bactericidal pattern of dioxanopeptin is highly related to bacterial metabolism, which is similar to the mode of action of vancomycin rather than daptomycin35 (Fig. 6a). Furthermore, we determined the metabolic activity of S. aureus with the indicator dye resazurin.36 Resazurin can be transformed by NADH to the fluorescent product resorufin in metabolically active cells and can therefore serve as an indicator of the cellular redox state, as the reaction depends on the intracellular NAD+/NADH level. Our results demonstrated that dioxanopeptin at sublethal concentrations did not significantly alter the overall growth phase of S. aureus. However, a reduced growth rate was observed during the late logarithmic growth phase, accompanied by increased metabolic activity (Fig. 6b). Besides, the exogenous addition of dioxanopeptin did not impair the membrane permeability even when treated with the concentration of 4× MIC against S. aureus for 2 h (Fig. 6c). The fluidity of membrane and the accumulation of ROS were not affected in the presence of dioxanopeptin (Fig. 6d and e). Hence, we evaluated the other aspects that might be responsible for antibacterial activity of dioxanopeptin. Energy generation is necessary for bacterial survival when faced with antibiotic challenge.34 We found that ATP production was decreased in a dose-dependent manner after being treated with dioxanopeptin (Fig. 6f). PMF is vital for ATP production and is modulated by a delicate compensation mechanism, in which dissipation of either the cytoplasmic pH gradient (ΔpH) or membrane potential (Δψ) is compensated by a counteractive increment in the other.37 To understand the role of dioxanopeptin in PMF change, we measured the Δψ and ΔpH using fluorescent dyes DiSC3(5) and BCECF-AM, respectively. The fluorescent dye DiSC3(5) exhibits increased fluorescence intensity with a higher membrane potential, while BCECF-AM shows enhanced fluorescence as the intracellular environment becomes more basic. The results showed that the ΔpH significantly decreased to counteract the up-regulation of Δψ in the presence of dioxanopeptin (Fig. 6g and h), and both components were changed in a dose-dependent manner, indicating that dioxanopeptin disrupted the homeostasis of PMF, leading to the reduction of ATP production. Correspondingly, we found that dioxanopeptin displayed enhanced antibacterial activity under acidic conditions, consistent with the ability of dioxanopeptin to lower the cytoplasmic pH (Fig. 6g and i).
To examine whether the pyruvyl-like moiety is responsible for the antibacterial activity, we compared the bioactivity of dedioxanopeptin B (8) to dioxanopeptin B (2). Compound 8 displayed markedly reduced activity relative to 2, with MICs increasing from 8–16 µg mL−1 to >64 µg mL−1 (Table 1). We propose that the pyruvyl-like carboxylate in an amphiphilic scaffold of dioxanopeptins may facilitate interfacial proton exchange at the membrane–water interface, thereby biasing PMF disruption toward ΔpH dissipation. A decrease in ΔpH can be accompanied by a compensatory increase in Δψ. However, the overall PMF remains perturbed, consistent with the observed ATP decrease. Collectively, these data indicate that the pyruvyl-like moiety is required for antibacterial potency of dioxanopeptins and are consistent with a model in which dioxanopeptins primarily collapse ΔpH, thereby perturbing PMF and lowering cellular ATP levels.
In antifungal assays against a panel of plant-pathogenic fungi, dioxanopeptins exhibited no significant inhibitory activity against Fusarium spp., Rhizoctonia solani, or Colletotrichum fructicola (Table S8).
Biosynthetically, the presence of the 1,3-dioxane motif suggested a pyruvyl ketal installed onto a diol-bearing lipid chain. This hypothesis was evidenced by in vitro assays, where recombinant Ck-DxpH catalyzed the conversion of dedioxanopeptin B (8) to dioxanopeptin B (2) using potassium phosphoenolpyruvate as the pyruvyl donor. This indicates pyruvylation as a post-assembly tailoring step.
Methodologically, our genome mining outcome also highlights the limitation of lipid-donor markers. pfa-type PUFAS modules are not exclusive to NRPS lipopeptides, as reflected by the recovery of the zmn and fcl families in our dataset. At the same time, this rediscovery serves as an internal positive control that the workflow retrieves established PUFAS-associated pathways while still enabling prioritization of previously untapped families such as dxp. Collectively, these results indicate that lipid-donor-anchored mining will benefit from stricter product-class constraints, for example, by coupling lipid-donor signatures with NRPS features indicative of N-acylation (e.g., fatty acyl-AMP ligase/ACP or starter C domains) and with clear termination logic (e.g., terminal thioesterase or reductase domains), to prioritize BGCs most likely encoding NRPS lipopeptides.
Footnote |
| † These authors contributed equally to the work. |
| This journal is © The Royal Society of Chemistry 2026 |