Piera M. Marchetti, Van Kelly, Joanna P. Simpson, Mairi Ward and Dominic J. Campopiano*
EaStCHEM School of Chemistry, David Brewster Road, University of Edinburgh, Edinburgh, EH9 3FJ, UK. E-mail: Dominic.Campopiano@ed.ac.uk
First published on 22nd March 2018
The marine bacterium Pseudoalteromonas tunicata produces the bipyrrole antibiotic tambjamine YP1. This natural product is built from common amino acid and fatty acid building blocks in a biosynthetic pathway that is encoded in the tam operon which contains 19 genes. The exact role that each of these Tam proteins plays in tambjamine biosynthesis is not known. Here, we provide evidence that TamA initiates the synthesis and controls the chain length of the essential tambjamine fatty amine tail. Sequence analysis suggests the unusual TamA is comprised of an N-terminal adenylation (ANL) domain fused to a C-terminal acyl carrier protein (ACP). Mass spectrometry analysis of recombinant TamA revealed the surprising presence of bound C11 and C12 acyl-adenylate intermediates. Acylation of the ACP domain was observed upon attachment of the phosphopantetheine (4′-PP) arm to the ACP. We also show that TamA can transfer fatty acids ranging in chain length from C6–C13 to an isolated ACP domain. Thus TamA bridges the gap between primary and secondary metabolism by linking fatty acid and pyrrole biosynthetic pathways.
To date, the most hydrophobic natural tambjamine that has been characterised is tambjamine YP1 (2, Fig. 1).13 It is also the only tambjamine whose biosynthetic gene cluster from Pseudoalteromonas tunicata has been identified (Fig. S1†).14 The YP1 cluster is proposed to contain 19 open reading frames (ORFs) that encode enzymes which assemble the bipyrrole product from amino acid and fatty acid building blocks (Fig. S2, Table S1†). The majority of these enzymes have been assigned putative functions based on the biosynthesis of various natural products, including a similar class of molecules, the tri-pyrrole containing prodiginines.15,16 Biosynthesis of the YP1-specific tail was thought to culminate in a long chain amine (3, Fig. 1) which attaches to the bipyrrole ring system by an enamine linkage. However, the exact details of how this functionality is produced have not been explored.
Kjelleberg and colleagues postulated that the amine tail is derived from C12 lauric acid.14 In P. tunicata this is likely to be produced by the fatty acid synthase (FAS) machinery.17,18 The YP1 biosynthetic pathway then harnesses the FAS output and converts it to a long chain amine. This can be achieved by the commonly used mechanism of fatty acid activation, namely the adenylation of the acid using ATP, followed by reaction with coenzyme A (CoASH) to generate a thioester intermediate (Fig. S3†). Previously it was predicted that this transformation is carried out by AfaA, a fatty acid CoA ligase (FACL) from outside the Tam cluster. The resulting C12–CoA thioester intermediate would then be subjected to downstream processing by Tam cluster enzymes to eventually yield the amine (3, Fig. 1).
Recombinant P. tunicata TamA was prepared from E. coli with an N-terminal 6xHis affinity tag26 (Fig. S5†). It was purified to homogeneity by standard nickel affinity and gel filtration chromatography which showed it to be monomeric (Fig. S5†). The mass (75271 ± 2 Da) of the purified TamA was determined by denaturing liquid chromatography electrospray ionization – mass spectrometry (LC ESI-MS, Fig. 3a and Fig. S6†). This is consistent with the predicted value (75270 Da) of the apo-protein lacking the 4′-PP post-translational modification on the serine residue of the ACP domain. We employed the commonly used Bacillus subtilis Sfp/CoASH system in an effort to convert the apo-TamA ACP domain to the holo-form.27,28 However, the expected mass shift that accompanies the addition of a 4′-PP (340 Da) was not observed (Fig. 3b). Instead, the observed mass difference (Δ513 ± 3 Da) is 173 Da larger than anticipated. This mass suggested that the TamA ACP domain had undergone some form of acylation as well as 4′-PP modification. For this to occur in the absence of any fatty acid in the Sfp/CoASH reaction, we hypothesised that the recombinant TamA was isolated with a bound acyl-adenylate. Thereafter, upon 4′-PP modification with Sfp, the TamA ANL domain would catalyse transfer of the acyl chain from the already present acyl-adenylate to the newly-formed holo-ACP domain.
To determine if an acyl-adenylate was non-covalently bound in the as-purified TamA, we employed native mass spectrometry that we recently used to observe a BioW/pimeloyl-adenylate complex.29 Fig. 3c shows the native MS of TamA directly after purification, with two species observed over three charge states (18+ to 16+) with nominal masses of 75800 and 75280 Da. The average mass difference of these species (∼520 Da) is close to the calculated mass of the expected C12 lauryl-adenylate (529 Da). Although this native MS method is not accurate enough to determine the exact mass of the bound molecule,30 it does confirm the presence of a non-covalently bound species. We attempted to remove the acyl-adenylate from the protein active site by incubating it overnight with CoASH and then repeating the native MS. However, there was no change in the mass spectrum, suggesting that CoASH is not a substrate for the TamA ANL.
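The comparison between the native MS mass difference and the calculated acyl-adenylate mass can be reproduced with a short calculation. The sketch below is illustrative only: it assumes straight-chain saturated fatty acids and rounded average atomic masses, and the function names are hypothetical rather than taken from the original work.

```python
# Average atomic masses (Da), rounded to three decimals (assumed values).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "P": 30.974}

AMP = {"C": 10, "H": 14, "N": 5, "O": 7, "P": 1}   # adenosine monophosphate
WATER = {"H": 2, "O": 1}


def formula_mass(formula):
    """Average mass of a composition given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def saturated_fatty_acid(n_carbons):
    """Composition of a straight-chain saturated fatty acid, CnH2nO2."""
    return {"C": n_carbons, "H": 2 * n_carbons, "O": 2}


def acyl_adenylate_mass(n_carbons):
    """Acyl-AMP mass: fatty acid + AMP - H2O (mixed anhydride formation)."""
    return (formula_mass(saturated_fatty_acid(n_carbons))
            + formula_mass(AMP)
            - formula_mass(WATER))


# Nominal native MS masses quoted in the text (Da).
native_difference = 75800 - 75280

c11 = acyl_adenylate_mass(11)
c12 = acyl_adenylate_mass(12)
print(f"native MS mass difference: {native_difference} Da")
print(f"C11 acyl-adenylate: {c11:.1f} Da, C12 acyl-adenylate: {c12:.1f} Da")
```

Under these assumptions the C12 adenylate comes out at about 529.5 Da, in line with the 529 Da quoted above, and the ∼520 Da native MS difference is within about 10 Da of it.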
Based on the YP1 structure we expected TamA to utilise C12 fatty acids as substrates. However, the mass we observed of the 4′-PP modified and acylated full length TamA was not consistent with a C12 thioester attached to the ACP domain. Attachment of a C12 fatty acid should result in a mass increase of 522 Da (182 Da larger than 4′-PP) but the mass shift we observed was 513 Da (173 Da larger than 4′-PP, Fig. 3b). Since this does not correspond to the mass of any unsaturated fatty acids, we postulated that this species could be holo-TamA modified by a mixture of fatty acid thioesters. The small mass shifts expected from fatty acids of similar length may not resolve due to the large size and high charge state of the full length protein.
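The mass bookkeeping in this paragraph can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming straight-chain acyl groups with zero or one double bond and rounded average atomic masses; the 340 Da 4′-PP mass, the 513 Da observed shift and the ±3 Da uncertainty are the values quoted in the text, while the function names and the C10–C14 search range are hypothetical choices made for illustration.

```python
PP_MASS = 340.0          # 4'-PP addition quoted in the text (Da)
OBSERVED_SHIFT = 513.0   # observed shift on full-length TamA (Da)
TOLERANCE = 3.0          # reported uncertainty on the observed shift (Da)


def acyl_mass(n_carbons, double_bonds=0):
    """Mass added by an acyl thioester: fatty acid minus H2O.

    For a saturated chain this is CnH(2n-2)O; each double bond removes 2 H.
    """
    hydrogens = 2 * n_carbons - 2 - 2 * double_bonds
    return n_carbons * 12.011 + hydrogens * 1.008 + 15.999


def expected_shift(n_carbons, double_bonds=0):
    """Expected mass shift for 4'-PP modification plus one acyl group."""
    return PP_MASS + acyl_mass(n_carbons, double_bonds)


# Candidate acyl groups: C10 to C14, saturated or mono-unsaturated.
candidates = {
    (n, d): expected_shift(n, d)
    for n in range(10, 15)
    for d in (0, 1)
}

# Chain lengths whose expected shift agrees with the observation.
matches = [
    key for key, shift in candidates.items()
    if abs(shift - OBSERVED_SHIFT) <= TOLERANCE
]

for (n, d), shift in sorted(candidates.items()):
    print(f"C{n}:{d}  expected shift {shift:.1f} Da")
print("single-species matches:", matches)
```

With these inputs no single candidate falls within ±3 Da of 513 Da, and the observed value sits between the expected C11 and C12 shifts, which is consistent with the mixture hypothesis stated above.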
To improve the resolution of the MS analysis, we sought to capture the fatty acid moiety on a smaller ACP protein. So, guided by our domain analysis, the predicted ACP portion of the enzyme was expressed and purified (Fig. S7†). LC ESI-MS analysis revealed that the ACP domain was isolated in the apo-form (predicted mass = 10967.5 Da, observed mass = 10967.1 ± 0.2 Da, Fig. S8†). Quantitative conversion to the holo-ACP form was achieved with the Sfp/CoASH system (predicted mass = 11307.8 Da, observed mass = 11307.6 ± 0.1 Da, Fig. 4a). This holo-ACP domain was subsequently incubated with full length, as-purified apo-TamA (with acyl-adenylate(s) bound) to attempt the thiolation reaction in trans – in effect an intermolecular, trans-thioesterification. LC ESI-MS analysis of this reaction revealed the appearance of two new acyl-ACPs. These correspond to the addition of 168.1 ± 0.2 and 182.1 ± 0.4 Da respectively (Fig. 4b) and these mass differences are consistent with the attachment of a C11 and C12 fatty acid to the holo-ACP (predicted mass, 168.3 and 182.3 Da respectively). Although the presence of a C11 fatty acid is unexpected since only the C12 form of YP1 has been reported for P. tunicata, other organisms produce tambjamines with varying acyl chain lengths.2,4 In the E. coli host we presume the enzyme has picked up the C11 and C12 fatty acids from the endogenous fatty acid pool. This data also shows that the catalytic TamA ANL domain is able to transfer acyl chains to an isolated ACP.
Fig. 4 Denaturing ESI-MS analysis of the 7+ charge state of (a) holo-TamA ACP domain and (b) holo-TamA ACP after incubation with apo-TamA.
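Assigning a chain length to each new acyl-ACP amounts to matching the measured mass addition against the calculated mass of an acyl group. The sketch below illustrates this under stated assumptions: straight-chain saturated acyl groups, rounded average atomic masses, and an arbitrary 0.5 Da matching tolerance. The function names are hypothetical; the ACP masses and mass additions are those quoted in the text.

```python
APO_ACP = 10967.5    # predicted apo-ACP mass from the text (Da)
HOLO_ACP = 11307.8   # predicted holo-ACP mass from the text (Da)


def acyl_mass(n_carbons):
    """Mass added by a saturated acyl thioester, CnH(2n-2)O (acid minus H2O)."""
    return n_carbons * 12.011 + (2 * n_carbons - 2) * 1.008 + 15.999


def assign_chain_length(delta, tolerance=0.5, max_carbons=20):
    """Return the chain length whose acyl mass best matches delta, else None."""
    best = min(range(2, max_carbons + 1),
               key=lambda n: abs(acyl_mass(n) - delta))
    if abs(acyl_mass(best) - delta) <= tolerance:
        return best
    return None


pp_addition = HOLO_ACP - APO_ACP
print(f"4'-PP addition on the ACP domain: {pp_addition:.1f} Da")

# Observed mass additions quoted in the text (Da).
for delta in (168.1, 182.1):
    n = assign_chain_length(delta)
    print(f"observed +{delta} Da -> C{n} (calculated {acyl_mass(n):.1f} Da)")
```

The tolerance is a design choice: it has to be well below half the 14 Da spacing between neighbouring chain lengths, and comfortably above the reported measurement uncertainty, for the assignment to be unambiguous.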
This convenient transfer reaction was used to determine the fatty acid chain length specificity of the ANL domain. Holo-TamA produced after 4′-PP modification of the as-purified TamA was incubated with different fatty acids, Mg2+, ATP and the holo-TamA ACP domain. LC ESI-MS analysis of the ACP domain showed the extent of acylation for a range of fatty acids between C2 and C16 (Fig. 5, Table S2†). The data reveals that TamA is able to utilise fatty acids from C6–C13 (the C14 form is barely detectable); however, conversion to the acylated form is highest for C12. The efficiency of the reaction drops off substantially outside the C12–C13 substrates, suggesting a very specific substrate pocket in the ANL active site. In order to test the specificity of the ANL domain towards other ACPs, E. coli ACP (Fig. S9†) was also used in the assay in place of the standalone TamA ACP domain. However, TamA is unable to transfer the C12 fatty acid to E. coli ACP, suggesting a highly specific recognition between the ANL and ACP domains (Fig. S10†). It would be interesting to explore this specificity further with a range of ACPs from different species.22,23
Fig. 5 Denaturing ESI-MS analysis of the 7+ charge state of the holo-TamA ACP domain after incubation with holo-TamA, Mg2+, ATP and fatty acids ranging in length from C2–C16. The values of the deconvoluted masses are described in Table S2.†
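To relate the deconvoluted masses to peak positions in the 7+ charge state, the expected m/z of each acyl-ACP can be estimated from the predicted holo-ACP mass. This is a rough sketch, not a reproduction of Table S2: it assumes straight-chain saturated acyl groups, one proton per charge, rounded average atomic masses, and that every chain length from C2 to C16 is of interest. All names are hypothetical.

```python
PROTON = 1.00728     # proton mass (Da)
HOLO_ACP = 11307.8   # predicted holo-ACP mass from the text (Da)
CHARGE = 7           # charge state shown in Fig. 4 and Fig. 5


def acyl_mass(n_carbons):
    """Mass added by a saturated acyl thioester, CnH(2n-2)O (acid minus H2O)."""
    return n_carbons * 12.011 + (2 * n_carbons - 2) * 1.008 + 15.999


def mz(neutral_mass, charge):
    """m/z of an [M + zH]z+ ion in positive-mode ESI."""
    return (neutral_mass + charge * PROTON) / charge


holo_mz = mz(HOLO_ACP, CHARGE)

# Calculated m/z for each acyl-ACP, keyed by chain length (C2 to C16).
series = {n: mz(HOLO_ACP + acyl_mass(n), CHARGE) for n in range(2, 17)}

# Separation between neighbouring chain lengths at this charge state.
spacing = series[12] - series[11]

print(f"holo-ACP 7+: m/z {holo_mz:.2f}")
for n, value in series.items():
    print(f"C{n}-ACP 7+: m/z {value:.2f}")
print(f"spacing between adjacent chain lengths: {spacing:.2f} m/z")
```

At 7+ adjacent chain lengths are separated by roughly 2 m/z units, which is why the small ACP domain resolves species that differ by a single methylene group. The separation shrinks in proportion to the charge state, so it is smaller for a larger, more highly charged protein.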
With convincing data supporting acylation of TamA we wanted to clearly assign the site of 4′-PP attachment. The unusual TamA ACP domain contains two conserved DSV 4′-PP motifs22,23 but the TamA sequence analysis suggests that the second DSV motif (residues 621–623) is the most likely site of modification. Holo-TamA was denatured and digested with trypsin and the resulting peptide mixture was analysed by ESI-MS. Masses for the acylated 4′-PP peptides were not observed, potentially due to the instability of the thioester bond under the denaturation conditions. However, a peptide mass consistent with the 4′-PP on residue S622 was observed and subjected to MS/MS analysis. A combination of 4′-PP ejection31 and peptide sequencing confirmed S622 as the site of the 4′-PP modification (Fig. S11†).
Diffraction quality crystals of recombinant TamA have not been obtained, so we used homology modelling to give some insight into the TamA structure. The ANL domain displays highest similarity (∼30%) to fatty acid AMP-ligases (FAALs, Fig. S12†)32–35 which activate fatty acids for transfer to the terminal 4′-PP thiol of a separate ACP. FAALs are homologous to FACLs but are unable to use free CoASH as a substrate. CoASH is in fact blocked from binding by an insertion loop, common to all FAALs, that acts as a gatekeeper to the active site and is released upon ACP binding.35 However, the structural model could not be built using these enzymes as there are no reported structures of either a FAAL-ACP fusion or a FAAL in complex with its cognate ACP.
Nonetheless, TamA is a member of the Type I fold of the ANL enzyme superfamily which includes firefly luciferases, acyl-CoA synthetases and the adenylation domains of nonribosomal peptide synthetases (NRPS). Since it shares ∼20% sequence identity with both NRPS ANL and peptide carrier protein (PCP) domains, a model of the TamA structure was built with Phyre2 software36 using four homologous NRPS enzyme sequences (Fig. S13 and S14†).37–41 The model displays the canonical Type I ANL fold as well as the recognizable four-helical bundle of the carrier protein domain with the 4′-PP modification on S622 located.22,23 This working model allows us to identify residues potentially involved in substrate binding and catalysis.
Footnote
† Electronic supplementary information (ESI) available: Experimental section and results section containing sequence alignments, protein purification, mass spectra and structural models. See DOI: 10.1039/c8ob00441b
This journal is © The Royal Society of Chemistry 2018