Discovery of megapolipeptins by genome mining of a Burkholderiales bacteria collection†
Abstract
Burkholderiales bacteria have emerged as a promising source of structurally diverse natural products that are expected to play important ecological and industrial roles. This order ranks in the top three in terms of predicted natural product diversity from available genomes, warranting further genome sequencing efforts. However, a major hurdle in obtaining the predicted products is that biosynthetic genes are often ‘silent’ or poorly expressed. Here we report complementary strain isolation, genomics, metabolomics, and synthetic biology approaches to enable natural product discovery. First, we built a collection of 316 rhizosphere-derived Burkholderiales strains over the course of five years. We then selected 115 strains for sequencing using the mass spectrometry pipeline IDBac to avoid strain redundancy. After predicting and comparing the biosynthetic potential of each strain, a biosynthetic gene cluster that was silent in the native Paraburkholderia megapolitana and Paraburkholderia acidicola producers was cloned and activated by heterologous expression in a Burkholderia sp. host, yielding megapolipeptins A and B. Megapolipeptins are unusual polyketide, nonribosomal peptide, and polyunsaturated fatty acid hybrids that show low structural similarity to known natural products, highlighting the advantage of our Burkholderiales genomics-driven and synthetic biology-enabled pipeline to discover novel natural products.