Claudia
Clemente
a,
Nicholas
Johnson
a,
Xiaodan
Ouyang
b,
Rafael V.
Popin
b,
Sergio
Dall'Angelo
a,
Matti
Wahlsten
b,
Jouni
Jokela
b,
Alessandro
Colombano
a,
Brunello
Nardone
c,
David P.
Fewer
*b and
Wael E.
Houssen
*ad
aInstitute of Medical Sciences, University of Aberdeen, Ashgrove Road West, Foresterhill Aberdeen AB25 2ZD, UK. E-mail: w.houssen@abdn.ac.uk
bDepartment of Microbiology, University of Helsinki, P.O.Box 56, Viikki Biocenter, Viikinkaari 9, 00014, Finland. E-mail: david.fewer@helsinki.fi
cCEM Microwave Ltd, Buckingham Industrial Park, Buckingham MK18 1WA, UK
dDepartment of Chemistry, University of Aberdeen, Meston Walk, Aberdeen AB24 3UE, UK
First published on 28th September 2022
Cyanobactins are linear and cyclic post-translationally modified peptides. Here we show that the prenyl-D-Arg-containing autumnalamide A is a member of the cyanobactin family. Biochemical assays demonstrate that the AutF prenyltransferase targets the guanidinium moiety in arginine and homoarginine and is a useful tool for biotechnological applications.
The AgcF prenyltransferase from the argicyclamide biosynthetic pathway was recently reported to catalyse the mono- and bis-N-prenylation of the guanidine moiety of L-Arg.17 The argicyclamide A scaffold consists otherwise exclusively of L-amino acids.17 AgcF was found to have high substrate selectivity while being capable of using dimethylallyl diphosphate (DMAPP) or geranyl diphosphate (GPP) as a prenyl or geranyl donor.17
Autumnalamide A (Fig. 1) is an Nω-D-arginine mono-prenylated cyclic peptide that was isolated from the organic extract of the cyanobacterium Phormidium autumnale CCAP1446/1023 and proved to prevent IL2 production from T cells through the modulation of store operated calcium channels.24 The autumnalamide A chemical structure was determined using NMR and HRMS while the absolute configuration was determined using chiral GC MS analysis and 13C NMR.23 This analysis has revealed that autumnalamide A contains D-Pro and prenylated D-Arg residues.
Here, we identify the autumnalamide (aut) biosynthetic pathway and characterize the substrate specificity of the AutF prenyltransferase.
We extracted high-molecular-weight genomic DNA from Phormidium autumnale CCAP1446/10 and obtained a 6.74 Mb draft genome sequence. We predicted a 9.6 kb cyanobactin biosynthetic gene cluster through tBLASTn searches using biosynthetic enzymes from the anacyclamide biosynthetic pathway14 as query sequences against a standalone BLAST database. This putative autumnalamide (aut) biosynthetic gene cluster encoded seven cyanobactin biosynthetic proteins organized in a single operon (Fig. 1). Surprisingly, the aut biosynthetic gene cluster encodes two precursor proteins, AutE1 and AutE2, and a cyanobactin prenyltransferase, AutF (Fig. 1 and Table S2, ESI†). The core sequence of AutE1 matched the sequence of the previously reported autumnalamide A exactly23 (Fig. 1). LC-MS analysis of the cyanobacterial extract also showed that autumnalamide A is the most abundant cyanobactin produced while trace amount of non-prenylated autumnalamide A can be detected (Fig. S2 and Table S1, ESI†).
We used LC-MS analysis to identify the predicted product of the AutE2 precursor peptide. The major product detected was a new non-prenylated cyclic peptide that we named autumnalamide B (Fig. 1 and Fig. S2, Table S1, ESI†). Trace amount of the mono-prenylated autumnalamide B was detected (Table S1, ESI†).
The stereochemistry of autumnalamide B was determined through comparison of retention time over multiple runs in HRLC-MS analysis and MS fragmentation pattern with four synthetic stereochemical variants containing all possible combination of L- or D-Arg and L- or D-Pro 1–4 (Table 1) and with Marfey analysis. This analysis showed that autumnalamide B exclusively consists of L-amino acids (Fig. S3–S5, ESI†).
Sample | Sequencea | Prenylation/(% yield)b | Fig. | Sample | Sequencea | Prenylation/(% yield)b | Fig. |
---|---|---|---|---|---|---|---|
a The core sequences of autumnalamide A and B are “GLDFRFP” and “TLRESTAMYP” respectively. b Percentage yield was calculated by dividing the chromatographic peak area of the charged species by the sum of peak areas of prenylated product and/or the remaining substrate in the reaction mixture. | |||||||
1 | Cyclo[-TLRESTAMYP] | <1.5 | S9 | 13 | H-FDLGPFR-NH2 | <1.5 | S21 |
2 | Cyclo[-TLrESTAMYp] | <1.5 | S10 | 14 | H-FDLGPFr-NH2 | <1.5 | S22 |
3 | Cyclo[-TLRESTAMYp] | <1.5 | S11 | 15 | H-FREDLGPAYD-NH2 | <1.5 | S23 |
4 | Cyclo[-TLrESTAMYP] | <1.5 | S12 | 16 | H-FRADLGPAYD-NH2 | <1.5 | S24 |
5 | H-TLRESTAMYPFQA-NH2 | <1.5 | S13 | 17 | N-α-Boc-L-arginine | <1.5 | S25 |
6 | Cyclo[-LGPFrFD] | N | S14 | 18 | N-α-Fmoc-L-arginine | <1.5 | S26 |
7 | Cyclo[-LGPFRFD] | 4 | S15 | 19 | Z-L-arginine-OH·HCl | <1.5 | S27 |
8 | H-FrFDLGpAYD-NH2 | <1.5 | S16 | 20 | Z-D-arginine-OH·HCl | <1.5 | S28 |
9 | H-FrFDLGPAYD-NH2 | <1.5 | S17 | 21 | Fmoc-L-homoarginine-OH | 30 | S29 |
10 | H-FRFDLGPAYD-NH2 | 30 | S18 | 22 | H-FXFDLGPAYD-NH2 where X is L-homoarginine | <1.5 | S30 |
11 | H-RFDLGPF-NH2 | N | S19 | 23 | H-QYLDEKLPNG-NH2 | N | S31 |
12 | H-rFDLGPF-NH2 | N | S20 | 24 | H-VVKGALKSLV-NH2 | N | S32 |
The autF gene was then cloned in pEHISTEV-SUMO vector (Gift from Dr Haunting Liu) in frame with an N-terminal Tobacco etch virus (TEV) protease-cleavable His6SUMO tag, expressed in E. coli BL21(DE3) and purified (Fig. S8, ESI†). We used a series of 24 linear and macrocyclic peptides and protected amino acids to test the substrate specificity of AutF (Table 1). Some linear peptides were synthesised with a C-terminus three amino acid recognition signals for the macrocyclases PatGmac8a (Ala-Tyr-Asp) and PCY125 (Phe-Gln-Ala) in order to prepare cyclic peptide substrates.
Substrates 1–4 are four variants of autumnalamide B containing “L- or D-Arg” and “L or D-Pro”. AutF catalysed the mono-prenylation of these substrates with poor yield (<1.5%) and this correlates well with the fact that the majority of the naturally detected autumnalamide B is non-prenylated (Fig. S9–S12, ESI† and Table 1). AutF also prenylates the linear peptide substrate 5 containing the autumnalamide B sequence with a C-terminus PCY1 macrocyclase recognition signal but with overall poor yield <1.5% (Fig. S13, ESI† and Table 1).
AutF could not process substrate 6 which is variant of the autumnalamide A core with the only difference of containing L-Pro instead of D-Pro (Fig. S14, ESI† and Table 1). The AutF enzyme was capable of catalysing, albeit with low yield, the mono-prenylation of cyclic peptide 7 which is another autumnalamide A variant containing L-Arg and L-Pro instead of D-Arg and D-Pro respectively (Fig. S15, ESI† and Table 1). However, the yield of the prenylated product of AutF reaction with substrate 7 was 4% which does not correlate with the fact that autumnalamide A constitute the major autumnalamide in the cyanobacterial extract (Table S1, ESI†). Therefore, we tested the ability of AutF to process linear variants of autumnalamide A.
AutF prenylates linear peptide 8 containing the autumnalamide A core sequence with a C-terminus PatGmac recognition signal but with again overall poor yield <1.5% (Fig. S16, ESI† and Table 1). Similarly, the yield was very low in case of substrate 9 which is the same sequence as substrate 8 but containing L-Pro instead of D-Pro (Fig. S17, ESI† and Table 1). However, the yield of the prenylation product increased 29-fold when using substrate 10 containing the autumnalamide A sequence with both L-Arg and L-Pro instead of D-Arg and D-Pro and a C-terminus PatGmac signal (Fig. S18, ESI† and Table 1).
Based on these results, we predict that the prenylation step in autumnalamide A biosynthesis precedes the epimerization step. We could not identify an epimerase-encoding gene in the autumnalamide biosynthetic cluster or more distantly within the draft genome. This is reminiscent of patellamides and kawaguchipeptins in which the genetic origin of epimerization has not yet been discovered and it was proposed to be chemically spontaneous.26,27
Substrates 11–14 are permutations of the core peptide sequence of autumnalamide A in AutE1 (Fig. 1) but with D- or L-Arg at either the N-terminus or the C-terminus. AutF was not able to process substrates 11 and 12 containing L- and D-Arg respectively at the N-terminus (Fig. S19 and S20, ESI† and Table 1) while it was able to process, with very low yield, substrates 13 and 14 which contain L- and D-Arg at the C-terminus respectively (Fig. S21 and S22, ESI† and Table 1). This suggests that AutF is sensitive to the immediate environment surrounding the Arg residue in linear peptides with high preference for the Arg to be sandwiched between two residues. The same requirement was previously reported for the C-3 tryptophan prenyltransferase KgpF from the kawaguchipeptin biosynthetic pathway.16
The fact that the autumnalamide B core sequence was not prenylated suggested that the enzyme is sensitive to the presence of the acidic “Glu” that precedes the Arg in autumnalamide B. To test this hypothesis, we synthesized substrate 15, which is a homolog of substrate 10 but with the replacement of one of the Phe residues surrounding the Arg with Glu. AutF mono-prenylates this substrate but with very low (<1.5%) yield (Fig. S23, ESI† and Table 1).
In order to check if the enzyme would accept small amino acid instead of one of the Phe residues in substrate 10, we synthesized and tested substrate 16 and MS analysis of the reaction revealed that the enzyme mono-prenylates this substrate again with low (<1.5%) yield (Fig. S24, ESI† and Table 1). These results suggest that AutF processes peptides in which Arg is sandwiched between bulky hydrophobic residues.
AutF also showed very weak ability (<1.5% yield) to mono-prenylate N-α-Boc-L-Arg (17), N-α-Fmoc-L-Arg (18), Z-L-Arg-OH (19) and Z-D-Arg-OH (20) (Fig. S25–S28, ESI† and Table 1). However, although AutF could not process the homoarginine-containing peptide substrate 22 (Fig. S30, ESI†), it can process Fmoc-L-homoarginine-OH (21) with 30% yield (Fig. S29, ESI†). This makes AutF a useful tool to produce prenylated Fmoc-L-homoarginine that could be used in solid phase peptide synthesis of homoarginine-containing peptides.
In addition, we tested the ability of AutF to prenylate the basic amino acid, lysine in the peptide substrates 23 and 24. MS analysis of the biochemical reactions shows that AutF could not process these substrates (Fig. S31 and S32, ESI† and Table 1) and thus we can conclude that the enzyme is specifically targeting the guanidine moiety in Arg and homoarginine.
AutF steady state kinetic parameters were determined using substrates 10 (Kcat/Km = 7.18 S−1 M−1) and 21 (Kcat/Km = 10.21 S−1 M−1) due to their relatively higher conversion yield (Fig. S33, ESI†). The optimal reaction conditions for AutF were determined by calculating the turnover number (Kcat) with substrate 21 under different reaction conditions (Fig. S34, ESI†). The results indicate that the higher turnover is achieved at pH 7 in the presence of 500 mM NaCl and Mg2+ as a metal ion. These conditions were used to scale up the enzymatic reaction with Fmoc-homoarginine (21) to obtain sufficient amount of prenylated product for structural characterization by NMR. The 1H–1H COSY spectrum shows a correlation between 16-NH and 17-H which ascertains that the prenylation occurs on the Nω of the homoarginine guanidinium moiety (Fig. S35a–o and Table S5, ESI†).
Interestingly, AutF did not accept GPP as an isoprene donor because we did not detect any geranylated product from its GPP-containing reaction with substrate 10. On the other hand, AgcF, from argicyclamide pathway, was reported to accept both DMAPP and GPP and was capable of catalysing the mono-geranylation of argicyclamide C.17 AutF shares 62% sequence identity with AgcF (Fig. S36, ESI†). To gain insight into the reason of AutF inability to use GPP as a donor, we generated computational models for both AgcF and AutF using the crystal structure of PagF (PDB: 5TU6) with which they share 42% and 44% sequence similarity respectively (Fig. S37, ESI†). The residues forming the active site entrance in both AgcF (Cys218, Cys266, Leu288, Gly66, Gly132) and AutF (Cys219, Cys267, Leu288, Gly67, Ala133) are conserved with the only difference is the replacement of Gly132 in AgcF with the slightly larger Ala133 in AutF (Fig. S37, ESI†). Schmidt and Nair groups recently demonstrated that a single amino acid change (F222G or F222A) in the isoprene-binding pocket of the O-Tyr prenyl transferase, PagF, completely switched its donor specificity from DMAPP to GPP.28 We used site directed mutagenesis to generate AutF A133G variant. This protein was expressed, purified and tested for its ability to process substrate 10 with both DMAPP and GPP. Results indicate that this variant can only use DMAPP as cofactor donor. Structural studies are then required to fully understand the substrate/cofactor specificity of both AutF and AgcF. This is because the binding of the substrate may induce conformation changes in the active site that create more space to accommodate longer cofactor. This has been noticed in the bacterial indole prenyltransferases; TleC from Streptomyces blastmyceticus and MpnD from Marinactinospora thermotolerans which are known to catalyze the prenylation of (−)-indolactam V at the C-7 position of the indole ring with GPP or DMAPP, to produce lyngbyatoxin or pendolmycin, respectively. Structural data revealed that TleC-specific Trp97 rotates by about 70° when the substrates are bound to the active site. This rotation generates a bigger space to accommodate the long side chain of the C10 GPP. This space is lacking in the C5 prenyltransferase MpnD as the rotation of the corresponding residue Tyr80 is not allowed due to the steric hinderance between Tyr80 and Met159.29
In this study, we identified the autumnalamide biosynthetic cluster and identified a new product. In addition, we biochemically characterized the prenyl transferase AutF, which targets the guanidine moiety in arginine and homoarginine. Interestingly, the enzyme catalyses the linear L-Arg containing variant of autumnalamide A with much higher yield than the cyclic variants containing L- or D-Arg. We predict that prenylation precedes the epimerization. However, this requires additional studies to be confirmed.
W. E. H. and D. P. F. designed research; C. C., N. J., X. O., R. V. P., S. D., M. W., J. J., A. C., B. N., D. P. F. and W. E. H. performed experiments and analysed data; W. E. H., D. P. F., J. J. and M. W. wrote the manuscript with contribution from all the authors.
This project was supported by a fellowship grant from the EPSRC (No. EP/S027246/1, W.E.H.), a training grant from the BBSRC (BB/V509206/1, W. E. H. and S. D.), the Novo Nordisk Foundation (18OC0034838, D. P. F.) and the NordForsk NCoE program “NordAqua” (Project Number 82845, D. P. F.). C. C. is funded by a PhD studentship from University of Aberdeen. N. J. is funded by the IBioIC CTP PhD programme. R. V. P. was funded by the Doctoral Programme in Microbiology and Biotechnology of the University of Helsinki. X. O. was funded by the China Scholarship Council (Grant 201906150148). We are grateful to Professor James Naismith (University of Oxford) for sharing the construct used to express the macrocyclase PCY1. We thank Dr Huanting Liu (University of St Andrews) for sharing the pEHISTEV-SUMO vector.
Footnote |
† Electronic supplementary information (ESI) available: Experimental section, Fig. S1-S37 and Tables S1-S5. See DOI: https://doi.org/10.1039/d2cc01799g |
This journal is © The Royal Society of Chemistry 2022 |