Marianna Iorio*a,
Andrea Gentilea,
Cristina Brunatiab,
Arianna Tocchettia,
Paolo Landinic,
Sonia Ilaria Maffiolia,
Stefano Donadioa and
Margherita Sosioab
aNAICONS Srl, Milan, Italy. E-mail: miorio@naicons.com
bKtedogen Srl, Milan, Italy
cBioscience Dept., University of Milan, Italy
First published on 6th June 2022
In the search for structurally novel metabolites with antibacterial activity, innovative approaches must be implemented to increase the probability of discovering novel chemistry from microbial sources. Here we report on the application of metabolomic tools to the genus Actinoallomurus, a poorly explored member of the Actinobacteria. From examining extracts derived from 88 isolates belonging to this genus, we identified a family of cyclodepsipeptides acylated with a C20 polyketide chain, which we named allopeptimicins. These molecules possess unusual structural features, including several double bonds in the amino-polyketide chain and four non-proteinogenic amino acids in the octapeptide. Remarkably, allopeptimicins are produced as a complex of active and inactive congeners, the latter carrying a sulfate group on the polyketide amine. This modification is also a mechanism of self-protection in the producer strain. The structural uniqueness of allopeptimicins is reflected in a biosynthetic gene cluster showing a mosaic structure, with dedicated gene cassettes devoted to formation of specialized precursors and modular assembly lines related to those from different pathways.
It is generally believed that the known metabolites represent the tip of the iceberg of molecules produced by microorganisms and that alternative approaches can reduce the biases imposed by bioassay-based screens.1,3,4 An attractive approach is represented by employing mass-spectrometry (MS)-based metabolomic tools to navigate the chemical complexity observed in microbial fermentation extracts. Recent advancements in this area are represented by molecular networking tools5 and by machine learning that, combined with publicly accessible databases, have greatly expedited metabolite annotation and prioritization for further investigations.6,7 Particularly promising is the combination of metabolomic tools to poorly explored bacterial taxa. In recent work on bacterial strains belonging to the actinobacterial genus Planomonospora, we uncovered novel chemistry, which led to a family of unexpected biosynthetic gene clusters (BGCs),8,9 which in turn helped uncover further novel chemistry.10
We previously reported that strains belonging to another actinobacterial genus, Actinoallomurus, produce at high frequency molecules with antibacterial activity,11 with several new variants of known scaffolds identified through bioactivity-based screening.12–16 Here we report that metabolomic analysis of Actinoallomurus spp. unveiled an unprecedented acylated cyclodepsipeptide with unusual features and potent antibacterial activity.
Among all the observed clusters, one molecular family attracted our attention: it was associated with an extract generated from strain ID145808 only and was absent in extracts from the remaining 87 strains. This molecular family consisted of a group of metabolites characterized by a parent mass of around 1200 amu (Fig. S1†), whose fragmentation spectra showed no matches to the 11600 MassIVe datasets present in the GNPS repository (https://gnps.ucsd.edu/) and to our private spectral library of metabolic profiles from about 14000 extracts. Interestingly, the microbial extract coming from strain ID145808 possessed activity against Staphylococcus aureus.
HPLC fractionation of the ID145808 extract followed by bioassay of the resulting fractions enabled signal deconvolution: a group of related molecules with m/z of 1205, 1207, 1219 and 1221 [M + H]+ and elution times of 6.76, 6.79, 6.90 and 7.07 min, respectively (Fig. 1). Thus, the four related molecules could be accounted for by mass differences of 2 and 14 amu. Inspection of the LC-MS profile of the entire extract revealed a second group of four related molecules with mass distances of 2 and 14 amu. This second group of molecules, eluting between 7.53 and 8.05 min, was present in HPLC fractions devoid of antibacterial activity (Fig. 1) and showed m/z of 1285, 1287, 1299 and 1301 [M + H]+. Thus, molecules from one group differed by 80 amu from molecules of the other group. All molecules from both groups shared the same UV spectrum with four maxima: the first at 239 nm (ε/dm3 mol−1 cm−1 23700) and a distinctive tricuspid absorption at 260 (25100), 270 (31000) and 280 nm (24200), suggestive of three conjugated double bonds. High resolution mass spectrometry (HR-MS) analysis resolved the distances among the congeners as two hydrogens, a methylene group and a sulphate moiety, as indicated by the molecular formulas reported in Table 1. The active and inactive congeners were named allopeptimicin A1 through A4 and B1 through B4, respectively. The most abundant congener in each group were A3 and B4.
Congener | Molecular formula | m/z found [M + H]+ | m/z calculated [M + H]+ | Number of added acetyls |
---|---|---|---|---|
Allopeptimicin A1 | C62H96N10O14 | 1205.7182 | 1205.7180 | 3 |
Allopeptimicin A2 | C62H98N10O14 | 1207.7348 | 1207.7337 | 4 |
Allopeptimicin A3 | C63H98N10O14 | 1219.7341 | 1219.7337 | 3 |
Allopeptimicin A4 | C63H100N10O14 | 1221.7493 | 1221.7493 | 4 |
Allopeptimicin B1 | C62H96N10O17S | 1285.6755 | 1285.6748 | 2 |
Allopeptimicin B2 | C62H98N10O17S | 1287.6911 | 1287.6905 | 3 |
Allopeptimicin B3 | C63H98N10O17S | 1299.6909 | 1299.6905 | 2 |
Allopeptimicin B4 | C63H100N10O17S | 1301.7063 | 1301.7061 | 3 |
Hydrolytic fragment | C17H31N2O8 | 391.2046 | 391.2075 | Not done |
These properties of allopeptimicins guided interpretation of extensive 1D- and 2D-NMR experiments and HR-tandem MS analyses. A set of NMR experiments (Table 2 and Fig. S3–S8†) in CD3CN/D2O 8:2 was performed on a sample containing the A congeners in the following proportion: A1:A2:A3:A4 15:10:45:30.
Unit | Position | Allopeptimicin A3 | Allopeptimicin B3 | |||
---|---|---|---|---|---|---|
δH (multiplicity, J coupling when visible (Hz)) | δC | NOE correlations | δH (multiplicity, J coupling when visible (Hz)) | δC | ||
Ile_1 | NH | 7.01 | 1.76 (methyl in C2–acyl chain) | 6.97 | ||
CO | 172.3 | 172.3 | ||||
α | 4.34 (t, 7.0) | 57.4 | 4.34 (t, 7.0) | 57.4 | ||
β | 1.86 | 36.7 | 1.86 | 36.7 | ||
γa | 1.32 | 25.9 | 1.33 | 25.9 | ||
γb | 1.10 | 1.08 | ||||
δ | 0.85 (d, 7.0) | 11.0 | 0.86 (d, 7.0) | 11.0 | ||
ε | 0.84 | 14.1 | 0.84 | 14.1 | ||
Val_2 | NH | 7.58 | 4.34 (C2–Ile_1) | 7.56 | ||
CO | 171.4 | 171.4 | ||||
α | 4.16 | 58.9 | 4.16 | 58.9 | ||
β | 2.00 | 30.8 | 2.00 | 30.6 | ||
γ | 0.86 (d, 7.0) | 17.6 | 0.85 (d, 7.0) | 18.8 | ||
δ | 0.87 (d, 7.0) | 18.6 | 0.90 (d, 7.0) | 18.6 | ||
Thr_3 | NH | 7.76 | 4.16 (C2–Val_2) | 7.80 | ||
CO | 169.7 | 169.7 | ||||
α | 4.81 | 57.1 | 4.86 | 56.7 | ||
β | 5.33 | 70.9 | 5.30 | 70.8 | ||
γ | 1.22 | 17.7 | 1.20 | 17.3 | ||
Piz_4 | CO | 172.4 | 172.4 | |||
α | 5.46 | 50.3 | 5.46 | 50.7 | ||
βa | 2.09 | 25.4 | 2.10 | 25.4 | ||
βb | 1.84 | 1.78 | ||||
γa | 1.53 | 20.7 | 1.53 | 20.7 | ||
γb | 1.49 | 1.46 | ||||
δa | 2.99 | 46.3 | 3.00 | 46.1 | ||
δb | 2.69 | 2.73 | ||||
Piz_5 | CO | 170.8 | 170.8 | |||
α | 5.15 | 50.6 | 5.17 | 50.7 | ||
βa | 2.31 | 18.1 | 2.31 | 18.1 | ||
βb | 1.99 | 1.97 | ||||
γa | 2.25 | 20.1 | 2.22 | 20.2 | ||
γb | 1.98 | 2.07 | ||||
δ | 6.96 | 145.6 | 6.98 | 145.5 | ||
Hmg_6 | CO | 169.7 | 169.7 | |||
α | 4.99 | 73.3 | 5.09 | 72.9 | ||
βa | 2.01 | 34.1 | 2.00 | 34.2 | ||
βb | 1.99 | 1.99 | ||||
γ | 2.59 | 35.3 | 2.60 | 35.2 | ||
δ | 178.0 | 178.0 | ||||
γ-Me | 1.15 (d, 6.8) | 16.8 | 1.15 (d, 6.8) | 16.8 | ||
Thr_7 | NH | 7.73 | 4.99 (C2–Hmg_6) | 7.91 | ||
CO | 171.6 | 171.6 | ||||
α | 4.70 | 54.5 | 4.72 | 54.5 | ||
β | 3.89 | 67.5 | 3.88 | 67.7 | ||
γ | 1.09 (d, 6.0) | 18.9 | 1.13 (d, 6.0) | 19.1 | ||
N-Me-Leu_8 | CO | 171.5 | 171.5 | |||
α | 5.15 | 53.2 | 5.16 | 53.2 | ||
βa | 1.69 | 37.2 | 3.02 (N-Me Leu_8) | 1.70 (m) | 37.3 | |
βb | 1.52 | 1.54 (m) | ||||
γ | 1.45 | 24.3 | 1.46 (m) | 24.2 | ||
δ | 0.81 (d, 6.8) | 20.6 | 0.81 (d, 6.8) | 20.5 | ||
ε | 0.87 (d, 6.8) | 22.7 | 0.89 (d, 6.8) | 18.8 | ||
N-Me | 3.02 (s) | 30.8 | 3.02 (s) | 30.7 | ||
Acyl chain | 1 | 170.4 | 170.4 | |||
2 | 130.9 | 130.9 | ||||
3 | 6.25 (d, 9.0) | 136.2 | 6.25 (d, 9.0) | 136.2 | ||
4a/b | 2.21 | 27.7 | 2.21 | 27.7 | ||
5a/b | 2.15 | 31.2 | 2.15 | 31.2 | ||
6 | 5.51 (dd, 14.8, 9.0) | 131.7 | 5.51 (dd, 14.8, 9.0) | 131.7 | ||
7 | 6.02 (d, 14.8) | 130.7 | 6.02 (d, 14.8) | 130.7 | ||
8 | 6.04 (d, 14.8) | 130.7 | 6.04 (d, 14.8) | 130.7 | ||
9 | 5.70 (dd, 14.8, 9.0) | 134.9 | 5.70 (dd, 14.8, 9.0) | 134.9 | ||
10a/b | 2.14 | 32.0 | 2.14 (m) | 32.0 | ||
11a/b | 2.16 | 31.2 | 2.16 (m) | 31.2 | ||
12 | 5.55 (dd, 14.8, 9.0) | 131.7 | 5.55 (dd, 14.8, 9.0) | 131.7 | ||
13 | 6.03 (d, 14.8) | 130.7 | 6.03 (d, 14.8) | 130.7 | ||
14 | 6.14 (d, 10.9) | 130.2 | 6.14 (d, 10.9) | 130.2 | ||
15 | 6.14 (d, 10.9) | 132.7 | 6.14 (d, 10.9) | 132.7 | ||
16 | 6.16 (d, 14.8) | 134.6 | 6.16 (d, 14.8) | 134.6 | ||
17 | 5.57 (dd, 14.8, 9.0) | 126.8 | 3.30 (C19–acyl chain) | 5.57 (dd, 14.8, 9.0) | 126.8 | |
18a | 2.34 | 37.4 | 2.60 (m) | 37.4 | ||
18b | 2.34 (m) | |||||
19 | 3.30 | 47.9 | 3.54 | 52.7 | ||
20 | 1.22 (d, 6.8) | 17.3 | 1.27 (d, 6.8) | 17.3 | ||
2-Me | 1.76 (s) | 12.1 | 2.21 (C4–acyl chain); 7.01 (NH–Ile_1) | 1.76 (s) | 12.0 |
Heteronuclear single quantum coherence spectroscopy (HSQC) experiments revealed several cross peaks between 5.5 and 6.5 ppm (H) and 126 and 136 ppm (C). With the help of homonuclear Correlation Spectroscopy (COSY) and Total Correlated Spectroscopy (TOCSY) experiments, these signals were associated with a C20 polyketide chain containing six double bonds: starting from the acyl group (C1), we could identify a methylated double bond (C2–C3 methylated in C2), two methylenes (C4–C5), two conjugated double bonds (C6 to C9), two further methylenes (C10–C11), a triene system (C12 to C17) and a methylene (C18) (Fig. S8†). The proton at 3.30 ppm, belonging to the C19 methine, showed COSY correlation with the terminal methyl group (C20) at 1.22 ppm and with the C18 methylene group at 2.34 ppm (Fig. S7B†). The chemical shift of the corresponding carbon (47.9 ppm, C19) was compatible with an amine functionalization. We also observed eight 3JH,H coupling constants of about 15 Hz and two of 11 Hz (Table 2). This indicated an E stereochemistry at all double bonds except for the one at C14–C15, which has a Z configuration. This was confirmed by a J-Resolved Spectroscopy (JRES) NMR experiment (Fig. S9†). The methyl at C2 shows a strong NOE cross peak with the Ile amide at 7.01 (Fig. S8A and B†), which suggests an E configuration of the C2–C3 double bond, with the methyl at C2 and the proton at C3 on the opposite side of the double bond. The configuration of C19 could not be assigned from spectroscopic data. However, biosynthetic considerations (see below) suggest an S configuration. Overall, these analyses established the structure of the polyketide portion of allopeptimicins as a C20 chain carrying an amino group at C19 and three unsaturation systems (Fig. 2).
Fig. 2 Structure of allopeptimicin congeners. In red stereocenters configuration inferred by bioinformatic analysis are reported. |
Elucidating the structure of the peptide portion required substantially more effort. In the 0–5.5 ppm portion of the proton spectrum, we could recognize spin systems belonging to one isoleucine (or allo-Ile; see below), one valine, two threonines, an N-methyl-leucine and two piperazic acids (Piz). A spin system containing a Cα carbon at 73.3 ppm (δC) was consistent with an α-hydroxy acid, and COSY and TOCSY correlations demonstrated that it was part of a 2-hydroxy,4-methyl-glutaric acid (Hmg) moiety. The Cβ carbon of one threonine was at 70.9 ppm instead of the standard 67 ppm, suggesting that this carbon was constrained and probably involved in peptide cyclization. An olefinic CH signal at 6.96/145.6 ppm, which was not associated with the polyketide chain, was included in one of the Piz spin systems, indicating an unsaturation between Cδ and Nδ.
After adding H2O to the NMR sample, amide signals appeared and, by using Nuclear Overhauser Effect Spectroscopy (NOESY) experiments, we were able to connect few amino acids (Fig. S8†). In particular, the amide signal at 7.00 ppm in the Ile spin system shows a correlation with both the methyl signal at 1.76 ppm (at C2 of the acyl chain) and the olefinic proton at 6.25 ppm (at C3 of the acyl chain), suggesting that Ile (or allo-Ile) is amidated with the C20 acyl chain. Moreover, we observed NOE correlations between the Val amide at 7.58 ppm and the proton at 4.34 ppm (CH at Cα of Ile or allo-Ile) and between the amide at 7.76 ppm in the threonine with a constrained Cβ and the signal at 4.16 ppm (CH at Cα of Val). An additional correlation was seen between the amide signal at 7.73 ppm of the other threonine and the signal at 4.99 ppm (CH at Cα of Hmg). These observations established the partial sequences acyl chain–Ile(allo-Ile)–Val–Thr and Hmg–Thr in the peptide.
As mentioned above, when allopeptimicins were exposed to alkaline conditions, all congeners released a fragment having m/z 391 [M + H]+, together with a variable larger portion (Fig. S2A and B†). When a 4 mg sample of the A + B complex was exposed to those conditions, the larger hydrolytic fragments showed the tricuspid UV absorption and thus carried the triene moiety, while the small hydrolytic product showed little UV absorption. The latter was purified and analyzed by HR-MSMS and NMR (Fig. S10 and S11†), providing a calculated molecular formula for m/z 391.2046 [M + H]+ of C17H31N2O8 (Table 1). 1D- and 2D-NMR experiments demonstrated the presence of five methyl groups (one on a nitrogen), two methylene groups and six methines (two on oxygens). It was possible to recognize the spin systems of Hmg, Thr and N-MeLeu, in accordance with the molecular formula and with the fragments observed by HR-MSMS (Fig. S10†). In particular, the fragment at m/z 146.1158 [M + H]+ corresponding to a C7H16NO2+ is consistent with a hydrated N-methyl leucine, suggesting that this amino acid is at the C-terminal position of the 391 amu fragment. Overall, these data are consistent with the hydrolytic fragment having the sequence Hmg–Thr–MeLeu and being released by hydrolysis of two ester bonds. The only way to fit the smaller hydrolytic fragment is by connecting the Hmg α-hydroxyl and C-terminus to a carboxyl group and a threonine hydroxyl, respectively, in the larger hydrolytic fragments, establishing the peptide sequence as Ile(allo-Ile)–Val–Thr–Piz–Piz–Hmg–Thr–MeLeu.
By deeply analyzing the NMR data acquired for the A congeners mixture, we observed that the protons related to the unsaturated Piz spin system integrated to lower values than those of the other Piz residue, suggesting that the 2 amu difference and one lesser acetylation site of congeners A2 and A4 with respect to A1 and A3 are due to the unsaturation at Cδ–Nδ of one of the Piz moieties. Similarly, the isoleucine-related signals appeared to be weaker than those of the other amino acids, while the valine spin system seemed to be splitted in two. This phenomenon could be reasonably associated with the A3 and A4 congeners having an isoleucine as the first amino acid followed by a valine, while two consecutive valines are present in A1 and A2.
With insights from the hydrolytic fragment and the NMR analysis of the A congeners, we were able to interpret the HR-MSMS data (Fig. S12 and S13†). In particular, the most intense fragments m/z 613.3173 [M + H]+ and m/z 847.5410 [M + H]+ are consistent with the two consecutive Piz located between acyl chain–Ile(allo-Ile)–Val–Thr and Hmg–Thr–MeLeu moieties. Moreover, comparison between the fragments originating from A1 and A3 shows that the extra methylene difference must be in the N terminal amino acid, compatible with the first amino acid being a Val in A1 and an isoleucine in A3. In addition, the fragmentations of A1 and A2 indicates that the unsaturation is present on Piz5.
For establishing the stereochemistry of the amino-acid building blocks, we resorted to Marfey's method.17 Both threonine residues were found to be D-allo-Thr while valine was in L-configuration. Unfortunately, we were not able to distinguish L-Ile from L-allo-Ile (Fig. S14†) and, due to the lack of adequate standards, we could not assign the stereochemistry to the remaining stereocenters: one in each of the piperazic acid moieties and two in Hmg moiety. However, analysis of the allopeptimicin BGC suggested the possible stereochemistry of the remaining α-carbons and pointed to allo-Ile as the first amino acid residue.
Extensive 1D- and 2D-NMR experiments were also carried out on a sample containing a mixture of allopeptimicin B congeners, with minor amounts of the A congeners (Table 2). The resulting spectra were almost identical to those obtained from the A congeners, except for a shift in few HSQC cross signals: the C19 proton and carbon signals shifted from 3.30 and 47.9 ppm to 3.54 and 52.7 ppm, respectively; the two C18 methylene protons (both at 2.34 ppm) were split in two different signals at 2.34 and 2.60 ppm, while the C18 carbon at 37.4 ppm remained unchanged (Fig. S15A†); and the protons of the C20 methyl group (at 1.22 ppm in the A congeners) were shifted to 1.27 (see Fig. S15B† which reports the overlap of TOCSY experiments from the two datasets and highlights the C18–C20 spin system in B). These changes, together with the mass spectrometry evidence of the presence of a –SO3H moiety and the absence of one acetylatable group in the B congeners, are compatible with the installation of a sulfamic acid on the primary amine at C19 (Fig. 2). Unfortunately, the fragmentation of the B congeners was not very informative, since the sulfamate moiety was split giving the mass of the corresponding A congener (data not shown).
In summary, the structures of allopeptimicins were established to be as reported in Fig. 2, with the B molecules arising from the cognate A congeners after installation of a sulfamate.
CDSa | Size (aa) | Homologb [function, pathway (strain, accession number)] | Identityc | Proposed function |
---|---|---|---|---|
a Suffix C denotes complementary strand.b Sequences with the highest BLAST score from the MIBIG database of validated biosynthetic gene clusters35 or from the non-redundant GenBank database (no protein name; in italics).c % identity of the best matching sequence(s). | ||||
ORF1C | 548 | None, [family 43 glycosylhydrolase, (Rugosimonospora africana, WP_203920346.1)] | 60% | Probably outside cluster |
ORF2C | 121 | None, [hypothetical protein, (Microbispora sp. H11081, WP_169946706.1)] | 42% | Probably outside cluster |
AptA | 474 | QmnB [propionyl-CoA carboxylase subunit beta (Amycolatopsis orientalis, AFI57002.1)] | 71% | Methylmalonyl-CoA supply |
AptB | 357 | CmmSu [sulfotransferase, MM4550 (Streptomyces argenteolus, AGU42411.1)] | 36% | Sulfotransfer |
AptCC | 246 | Hypothetical protein BZZ08_05504 [unknown (Streptomyces sp. MH60, PPS81433.1)] | 31% | Unknown |
AptD1 | 2743 | HmtL [NRPS, himastatin (Streptomyces himastatinicus, CBZ42146.1)] | 53% | Piz biosynthesis |
AptE | 399 | ArtE [NAD(P)/FAD-dependent oxidoreductase, aurantimycin A, (Streptomyces aurantiacus JA 4570, WP_016638468.1)] | 45% | Piz5 dehydrogenation |
AptF1 | 3608 | Cle6 [PKSI, mediomycin A (Kitasatospora mediocidica, AWC08660.1)] | 57% | Polyketide skeleton |
AptGC | 545 | HitE, [ATP-dependent_aminoacyl-ACP_synthetase, itachimycin (Streptomyces scabrisporus, BAR73011.1)] | 64% | Activates the β-amino acid starter and loads it onto AptH |
AptH | 86 | Strop_2776 [PCP, salinilactam (Salinispora tropica CNB-440, ABP55218.1)] | 61% | β amino acid starter-specific PCP |
AptIC | 318 | FlvK [ACP S-malonyltransferase, fluvirucin B2 (Actinomadura fulva subsp. indica, BBD71108.1)] | 61% | β amino acid starter-specific transferase |
AptJ | 514 | LobL [AMP-binding protein, lobosamide A (Micromonospora sp. RL09-050-HVF-A, ALA09365.1)] | 66% | Adds amino acid to AptG-bound β amino acid starter |
AptK | 430 | LobO [pyridoxal 5′-phosphate (PLP)-dependent β-glutamate-β-decarboxylase, incednine (Streptomyces sp. ML694-90F3, BAP34709.1)] | 63% | β-Glutamate decarboxylase |
AptF2 | 7611 | Cle6 [PKSI, mediomycin A, (Kitasatospora mediocidica, AWC08660.1)] | 53% | Polyketide skeleton |
AptL | 316 | VinJ [proline iminopeptidase, vicenistatin (Streptomyces halstedii, BAD08367.1)] | 66% | Removal of alanyl-protecting group of β amino acid starter |
AptM | 231 | QmnC [thioesterase, quartromicin A1 (Amycolatopsis orientalis AFI57003.1)] | 59% | Thioester hydrolysis |
AptF3 | 1778 | AzlB [PKSI, azalomycin F3a (Streptomyces sp. 211726, ARM20277.1)] | 55% | Polyketide skeleton |
AptD2 | 4587 | CDA peptide synthetase I [NRPS, CDA (Streptomyces rochei, ALV82356.1)] | 41% | Peptide skeleton |
AptD3 | 3280 | HmtL [NRPS, himastatin (Streptomyces himastatinicus ATCC 53653, CBZ42146.1)] | 44% | Peptide skeleton |
AptNC | 207 | DUF2154 domain-containing protein [unknown (Actinomadura latina, WP_168444816.1)] | 72% | Cell wall-active antibiotics response protein |
AptO | 454 | HmtM [lysine N(6)-hydroxylase/L-ornithine N(5)-oxygenase, himastatin (Streptomyces himastatinicus ATCC 53653, CBZ42147.1)] | 59% | Piz formation |
AptP | 238 | MfnJ [negative transcriptional regulator, marformycin A (Streptomyces drozdowiczii, AJV88382.1)] | 49% | Piz formation |
AptQ | 486 | MakD1 [acyl CoA synthetase, maklamicin (Micromonospora sp. GMKU326, BAQ25518.1)] | 32% | Activates glutaric acid or derivatives thereof |
AptS | 362 | None [NAD-dependent malic enzyme, unknown (Actinomadura latina, NKZ08311.1)] | 77% | Keto reduction during Hmg formation |
AptT | 261 | Sky 33 [thioesterase type II, skyllamycin A (Streptomyces sp. Acta 2897, AEA30276.1)] | 53% | Thioester hydrolysis |
AptU | 546 | Pmet [P-methyltransferase, phosphinothricintripeptide (Streptomyces viridochromogenes, CAJ14051.1)] | 42% | Methylation during Hmg formation |
AptR1 | 913 | AWR88389.1 [LuxR family transcriptional regulator, auroramycin (Streptomyces filamentosus, AWR88389.1)] | 41% | Regulatory |
AptV | 124 | DsaE [ketosteroid_isomerase, desotamide (Streptomyces scopuliridis, AJW76707.1)] | 68% | L-allo-Ile synthesis |
AptW | 360 | DsaD [branched_chain_amino_acid_aminotransferase, desotamide (Streptomyces scopuliridis, AJW76706.1)] | 71% | L-allo-Ile synthesis |
AptR2 | 139 | AsuR4 [transcriptional regulator, asukamycin (Streptomyces nodosus subsp. asukaensis, ADI58623.1)] | 50% | Regulatory |
AptXC | 77 | STRAU_RS01625 [MbtH family protein, aurantimycin A (Streptomyces aurantiacus JA 4570, WP_016638464.1)] | 73% | MbtH-like protein |
AptR3C | 342 | DtpR2 [streptomycin biosynthesis operon regulator, thiolutin (Saccharothrix algeriensis, AJI44174.1)] | 48% | Regulatory |
AptZ1 | 326 | STRAU_RS01670 [ATP-binding cassette domain-containing protein, aurantimycin A (Streptomyces aurantiacus JA 4570, WP_016638473.1)] | 63% | Transporter |
AptZ2 | 258 | AGZ15463.1 [ABC transporter, polyoxypeptin (Streptomyces sp. MK498-98F14, AGZ15463.1)] | 44% | Transporter |
AptR4C | 224 | AWW87431.1 [response regulator, reedsmycins (Streptomyces sp., AWW87431.1)] | 47% | Regulatory |
Consistent with the chemical structure, the apt BGC was found to encode three large polyketide synthases (PKS) for a total of eight modules and three non-ribosomal peptide synthetases (NRPSs) for additional eight modules. Moreover, there are the gene products needed for the synthesis of four specialized precursors and for the conversion of the A into B congeners (Table 3; Fig. 3A). In addition, one end of the BGC encodes regulators and transporters.
Preceding the aptGHIJK cassette is aptF1, encoding a PKS starting with an ACP domain and followed by three modules. Each module contains a dehydratase (DH) domain and all AT domains are predicted to be specific for malonate, based on computational sequence similarity. These features make AptF1 a likely candidate for elongating the 3-amino butyrate starter (transferred onto the N-terminal PCP by AptI) through three malonate additions, followed by double bond formation after reduction and dehydration.
The C10 intermediate polyketide chain formed by AptF1 would then be elongated by AptF2, which consists of four modules: based on sequence analysis all contain AT domains predicted to be specific for malonate, KR domains and DH domains, but only the first and last module possess ER domains, consistent with the presence of saturated bonds at C4–C5 and at C10–C11 in the final product and with two conjugated double bonds (Fig. 2 and 3C). The mono-modular AptF3 would then complete polyketide synthesis through the incorporation of a (2S)-methylmalonyl-CoA unit, as for the predicted specificity of its AT domain, and installation of a double bond by the consecutive action of a B-type KR domain22 and by a DH domain. No bioinformatic prediction could be made about the stereo-selectivity of the other KR domains, as no consensus residues for A and B type KR could be identified.22
Overall, the number of PKS modules, their domain arrangements and the predicted (stereo)specificities are consistent with collinearity between PKS modules and the structure of the C20 polyketide chain. Both AptF1 and AptF2 are mostly related to Cle6 (modules 1–3), a PKS involved in the synthesis of the giant linear polyketide mediomycin (Table 3), which carries an amino group at the ω position;23 while AptF3 is mostly related to module 1 from AzlB, a PKS involved in the formation of azalomycin.24
The octapeptide chain is likely to be terminated by AptD3, a bimodular NRPS consisting of C–A–T–E and C–A–MT–T–TE domains, with predicted specificities for Thr and Leu. This nicely fits with Thr7 epimerization at the α-carbon, N-methylation of Leu8 and TE-catalyzed ester formation between Leu8 and Thr3 to close the macrocycle (Fig. 3C).
By exclusion, the remaining three residues (Piz4, Piz5 and Hmg6) are likely to be incorporated by the fourth module of AptD2 and by the bimodular AptD1 that, overall, account for the expected three C, A and T domains, plus one E domain, but with an unusual arrangement (Fig. 3C): the second module in AptD1 lacks an A domain, which is present at the C-terminus of AptD2. The unusual domain organization of the second AptD1 module (C–T) has been observed also in HmtI and in KtzE from the himastatin and kutznerides BGCs, respectively: in both cases it has been suggested that the C domain forms an ester linkage between the amino acid loaded onto the upstream T domain and a hydroxy-acid acceptor.25,26 The same may happen during allopeptimicin biosynthesis, with the second C domain in AptD1 forming an ester linkage between Piz and Hmg. The latter could either be activated by the last A domain of AptD1 or provided in trans by the free-standing A domain AptQ (see below). While the amino acids incorporated by the two A domain in AptD1 could not be bioinformatically predicted, they share 95% identity, suggesting they recognize the same amino acid, hence Piz. In summary, we could tentatively propose the peptide segments recognized by the different modules as shown in Fig. 3C, although experimental work will be necessary to establish the role of the individual domains.
The E domains present in the third and first module of AptD2 and AptD3, respectively, directing the epimerization are consistent with the presence of two D-configured threonine residues at positions 3 and 7. If the central module of AptD1 is involved in adding the second Piz residue to the peptide chain, the associated E domain would then epimerize Piz5. Assuming that Hmg is in the L-configuration, the cyclic peptide would consist of alternating L- and D-amino acids, a feature often found in many NRPS-made peptides.27
Synthesis of the non-proteinogenic amino acids L-allo-Ile and Piz can be accounted for by the gene cassettes aptVW and aptOP, respectively (Fig. 3A and B). In the desotamide pathway, L-allo-Ile arises from L-Ile through the action of the pyridoxal 5′-phosphate-linked aminotransferase DsaD and the isomerase DsaE.28 Consistently, AptV and AptW show 64 and 70% identity with DsaD and DsaE, respectively (Table 3). In the kutznerides pathway, Piz is formed from L-ornithine through the consecutive action of the FAD-dependent N-hydroxylase KtzI and the heme-dependent dehydratase KtzT.26,29 AptO and AptP display 58 and 49% identity with KtzI and KtzT, respectively (Table 3). The unsaturation in Piz5 could result from the action of AptE, which shows 45% identity with ArtE, a FAD-dependent oxidoreductase from the aurantimycin BGC. Aurantimycin contains two Piz residues and the complex includes congeners with unsaturated Piz moieties.30 It remains to be established whether AptE acts on free Piz or after peptide synthesis.
While to our knowledge there is no description of a pathway to Hmg formation, the aptQSTU cassette, consisting of closely linked and likely translationally coupled genes, represents a likely candidate for it. AptS is a NADP-dependent oxidoreductase with 38–77% identity to NAD-dependent malic enzymes, and thus may be involved in converting 2-ketoglutarate in the corresponding hydroxy form. AptU is a radical SAM methyltransferase, likely to install the 4-methyl group. The free-standing A domain AptQ might be involved in activating 2-ketoglutarate before further modifications or it might interact in trans with AptD1 (see above). A role for the free-standing thioesterase AptT remains to be established. Further work will be necessary to establish the individual roles of enzymes for Hmg biosynthesis.
While the left-hand end of the cluster is likely represented by aptA, the other end contains several regulatory and export genes: aptR1, encoding a member of the LuxR family of transcriptional regulators, and aptR2, with high similarity to a regulator from the asukamycin BGC, are located upstream and downstream, respectively, of the aptVW L-allo-Ile synthesis cassette. After aptX there are genes encoding AptR3, another regulator of the LuxR family, and the ABC transporters AptZ1 and AptZ2 (Table 3).
Straina | MIC (μg mL−1) | ||
---|---|---|---|
Allopeptimicin A | Allopeptimicin B | VANb | |
a Strains designated with an L prefix are clinical isolates (collected in Italy or USA) from the NAICONS collection. Other strains are from public collections as indicated. American Type Culture Collection, USA.b Vancomycin (VAN) used as control antibiotic. | |||
Staphylococcus aureus ATCC6538P | 0.125 | >128 | 0.25 |
S. aureus L3864 (MRSA) | 0.125 | >128 | 0.5 |
S. epidermidis ATCC12228 | 0.125 | >128 | 1 |
Streptococcus pneumoniae L44 | ≤0.06 | >128 | 0.25 |
S. pyogenes L49 | ≤0.06 | >128 | 0.25 |
Micrococcus luteus ATCC10240 | 1 | >128 | 0.125 |
Bacillus subtilis 168 ATCC27370 | 0.125 | >128 | 0.25 |
Enterococcus faecium L568 | 8 | >128 | 1 |
E. faecalis L559 | 8 | >128 | 4 |
Clostridioides difficile 421 L4013 | ≤0.06 | Nt | 1 |
Helicobacter pylori L4219 | 16 | Nt | >128 |
Pseudomonas aeruginosa ATCC27853 | >128 | >128 | >128 |
Escherichia coli ATCC25922 | >128 | >128 | >128 |
We then investigated the effect of allopeptimicins A on macromolecular synthesis in S. aureus cells by monitoring the incorporation of labeled precursors (5-[3H] thymidine for DNA, [3H] uridine for RNA, L-[3H] tryptophan for protein and [3H] glucosamine for cell wall synthesis): we readily observed that cell wall biosynthesis was inhibited by 50% already at 1.6 μg mL−1 allopeptimicins A, while the concentrations inhibiting by 50% (IC50) were 5.7 μg mL−1 for DNA synthesis, 35 μg mL−1 for RNA synthesis and >50 μg mL−1 for protein synthesis, thus suggesting that cell wall synthesis is the primary target for this antibiotic (Fig. S16B†). Consistently, when tested against different S. aureus mutants selected through serial passages, phenotypically and genotypically related to vancomycin insensitive S. aureus (VISA) strains,36 we observed an increase in the MIC of allopeptimicins A (Fig. S16c†). Against these same mutants, a similar shift in MICs was observed for other bona fide antibiotics that bind to the essential peptidoglycan precursor lipid II such as vancomycin, ramoplanin and NAI-107.36
When ID145808 was grown in AF-AS medium, production of allopeptimicins started to be detectable at 72 h (Fig. 4A) and remained essentially constant up to 120 h. During this period, congeners A and B were produced in similar amounts. Starting at 144 h, the production rate increased, with the A congeners reaching 115 mg L−1 at 240 h, while production of allopeptimicins B increased at a lower rate, leading to a final ratio of about 75:25 A:B (Fig. 4A). Cultivation times longer than 240 h did not lead to increased production.
Supplementing the medium with 1 g L−1 L-threonine did not affect the early phases of production in terms of total allopeptimicins and A:B ratio (Fig. 4B). However, L-threonine supplementation afforded higher titers at 240 h and extended the production phase up to 288–312 h, leading to 180 mg L−1. L-Threonine supplementation did not alter the A:B ratio, which remained 3:1 at long incubation times.
Since congeners A possess antibacterial activity while congeners B do not, the switch in congener ratio upon prolonged cultivation prompted us to analyze the sensitivity of Actinoallomurus ID145808 versus its own products (Fig. S17†). The growth of the producer strain was not affected by spotting increasing amounts of a sample of allopeptimicins B (which contained around 5% of A congeners), until a slight growth disturbance was observed when 50 μg were spotted on the plate. Conversely, 2 μg of the A congeners fully prevented growth, indicating that Actinoallomurus ID145808 is sensitive to the A congeners when growing on agar plates, a condition when visible colonies appear after 72–96 h. We can thus hypothesize that formation of allopeptimicins B is a mechanism of self-protection. Since Actinoallomurus sp. ID145808 grows as large mycelial clumps in liquid medium, it is conceivable that non-dividing portions of a mycelial unit produce the active congeners while actively duplicating sectors express aptB to protect themselves. Further experiments need to be done to understand self-resistance in Actinoallomurus sp. ID145808.
The structural uniqueness of allopeptimicins is reflected in a complex BGC, with dedicated gene cassettes devoted to formation of specialized precursors and modular assembly lines related to those from different BGCs. Thus, the apt BGC is one more example of a BGC with a mosaic structure, a feature common to many BGCs.4,45 Several aspects of the allopeptimicin NRPS remain obscure, including how the Piz–Piz–Hmg portion is assembled.
Structurally, allopeptimicins A consist of a relatively rigid macrocycle with a long acyl chain with few rotatable bonds and terminating with a positive charge. The available evidence – from macromolecular syntheses to reduced activity on VISA strains – suggest that allopeptimicins A interfere with cell wall biosynthesis. It is tempting to speculate that the macrocycle interacts with some essential cell wall components near the bacterial membrane, while the acyl chain embeds into the membrane, with further ionic interactions between the negatively charged phospholipids and the amino group at C-19. Turning the positive into a negative charge through sulfamidation would abolish this interaction with total loss of activity. Not many precedents exist of metabolites whose activity is modulated by sulfation: a notable example is represented by the paralytic shellfish saxitoxins, in which O- and/or N-sulfation leads to a reduced toxicity.46 In any case, turning a positive into a negative charge by formation of a sulfamate represents one more example of the ingenuity of soil bacteria in devising mechanisms for self-protection.
For allopeptimicin production, 2 mL of a frozen mycelium stock of strain ID145808 was inoculated into 25 mL of AF-A medium in a 300 mL Erlenmeyer flask and incubated 72 h at 30 °C in a rotary shaker at 200 rpm. Then, 6 mL was transferred into 100 mL AF-AS medium (AF-A supplemented with 10 g L−1 soluble starch) in 500 mL flasks supplied with 10 g of 5 mm glass beads. Where required, 1 g L−1 L-threonine (Sigma-Aldrich) was added from a filter-sterilized 50 mg mL−1 stock solution. The flasks were incubated at 30 °C, on a rotary shaker at 200 rpm. For allopeptimicin analysis, 0.5 mL was withdrawn from the production culture and transferred into 2 mL Eppendorf tubes containing 0.5 mL acetonitrile. Samples were kept at 40 °C under constant shaking (800 rpm) for 1 h, then centrifuged for 3 min at 13000 rpm. The supernatants were recovered and 500 μL were transferred into 1.5 mL HPLC vials for LC-MS/MS analysis.
Parameters were adapted from the GNPS documentation as follows: MS2 spectra were filtered to remove all MS/MS fragment ions within ±17 Da of the precursor m/z value; only the top six fragment ions in the ±50 Da window through the spectrum were utilized, with a minimum fragment ion intensity of 50. The MS/MS fragment ion tolerance was set to 0.9 Da and the precursor ion mass tolerance was set to 2 Da. Edges of the created molecular network were filtered to have a cosine score above 0.5 and more than 2 matched peaks between the connected nodes, with a maximum of 10 neighbor nodes for one single node. The maximum size of clusters in the network was set to 50 and the maximum shift between precursors was set at 300 Da. The MS2 spectra in the molecular network were searched against GNPS spectral libraries. Reported matches between network and library spectra were required to have a score above 0.7 and at least 3 matched peaks. The molecular network was visualized using Cytoscape 3.9.0.49
Time-kill curves were performed as previously described51 using S. aureus ATCC 6538P. Growth kinetics was performed under the same conditions as MICs and recording OD600 every hour in a Synergy 2 reader (Biotek).
For evaluating the activity against strain ID145808, serial dilutions in 60% methanol were prepared from stock solutions of allopeptimicins A (2 mg mL−1) and B (5 mg mL; this preparation contained 5% A congeners). Then, 10 μL of appropriate dilutions were spotted onto BTT agar plates, which had been previously seeded with strain ID145808. Positive and negative controls were 10 μg mL−1 apramycin and 60% methanol, respectively. Plates were incubated at 30 °C for 10 days before scoring inhibition haloes.
The nucleotide sequences have been deposited in GenBank under accession number OM714933 (16S rRNA gene).
Footnote |
† Electronic supplementary information (ESI) available. See https://doi.org/10.1039/d2ra02094g |
This journal is © The Royal Society of Chemistry 2022 |