Open Access Article
Amphun
Chaiboonchoe‡
a,
Lila
Ghamsari‡§
b,
Bushra
Dohai‡
a,
Patrick
Ng‡
c,
Basel
Khraiwesh
a,
Ashish
Jaiswal
a,
Kenan
Jijakli
a,
Joseph
Koussa
a,
David R.
Nelson
a,
Hong
Cai¶
a,
Xinping
Yang||
b,
Roger L.
Chang
d,
Jason
Papin
*e,
Haiyuan
Yu
*c,
Santhanam
Balaji
*abf and
Kourosh
Salehi-Ashtiani
*ab
aLaboratory of Algal, Systems, and Synthetic Biology, Division of Science and Math, New York University Abu Dhabi and Center for Genomics and Systems Biology (CGSB), New York University Abu Dhabi Institute, Abu Dhabi, UAE. E-mail: ksa3@nyu.edu
bCenter for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, and Department of Genetics, Harvard Medical School, Boston, MA, USA
cDepartment of Biological Statistics and Computational Biology and Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA. E-mail: haiyuan.yu@cornell.edu
dDepartment of Systems Biology, Harvard Medical School, Boston, MA, USA
eDepartment of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA. E-mail: papin@virginia.edu
fMRC Laboratory of Molecular Biology, Cambridge, UK. E-mail: bsanthan@mrc-lmb.cam.ac.uk
First published on 14th June 2016
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. The defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
How might the sequence spaces of enzymes be explored by evolution? The idea of adaptive landscapes in evolution was first introduced in the 1930s to conceptualize the evolvability potential of organisms.19 This concept was further developed into coevolution and the dependency of fitness landscapes was mathematically modeled, e.g., using the NK model, describing the changes in the ruggedness of the landscape with respect to dependency on the function of other genes.20–25 In broad terms, the linkage between phylogenetic affiliations and their functional groupings26,27 can be viewed as a consequence of linked fitness landscapes which may be detectable from co-conservation of genes.
In recent years, reconstruction of genome-scale metabolic networks has elucidated the fundamental aspects of metabolic network formation and evolution.1,5,6 Extensive work has been done on studying the architecture of metabolic networks, linking topology, evolution and function of metabolic enzymes. Von Mering et al.28 have shown that a large portion of metabolic enzymes cluster together in a modular fashion within metabolic networks. Such findings have been further corroborated by Zhao et al.29 where they identified a core–periphery modular organization of the network within which the peripheral modules show a more cohesive coevolution as compared to the core pathways. Kanehisa et al.30 have further ascertained the latter finding where they suggested that the core metabolic pathways might have evolved in an individualized fashion, whereas the peripheral or extensions were driven by modular sets of enzymes and reactions.
The evolutionary dynamics of metabolic genes are not characterized in C. reinhardtii and still not fully resolved in any eukaryote, particularly with respect to the relationship with distant lineages. We addressed this gap here by extending the information content of a genome-scale metabolic network that we recently reconstructed.31 We defined evolutionary affinities of the network with 13 major eukaryotic lineages representing most if not all major eukaryotic lineages, some of which reside very distant to C. reinhardtii. We looked at the evolutionary dynamics of gene pairs by distinguishing highly conserved pairs with those that are conserved in a subset of lineages. This information was then integrated with topological and metabolic analyses in conjunction with gene expression data to determine if there is concordance between evolutionary affinities, expression, and functional constraints within the C. reinhardtii network. Furthermore, we carried out interolog analysis to assess the rewiring of the metabolic networks in yeast and Arabidopsis.
000 pairs) was predicted using COBRA Toolbox v.2 under two different conditions of dark and autotrophic light growth. The COBRA toolbox (COBRA = Constraint-Based Reconstruction and Analysis) is a comprehensive collection of tools developed for in silico model-based analysis and reconstruction of metabolic networks.33,34 To simulate growth under dark with acetate (or “DA”), light flux was set to zero and an acetate uptake of up to 10 mmol gDW−1 h−1 was permitted to provide a source of energy and carbon. The wild-type maximum growth rate was 0.7 mmol gDW−1 h−1. Simulation of growth under light with no acetate (or “LNA”) was achieved by setting the acetate intake flux to zero and light flux to 646 mmol gDW−1 h−1. These parameters resulted in a biomass productivity of 0.3 mmol gDW−1 h−1 for wild-type autotrophic metabolism. The value obtained for each double-gene deletion was divided by the wild-type growth rate under both conditions to get the growth rate ratio for each in silico deletion mutant. We only considered the deletion pairs in which the decrease in the metabolic output was greater in the double deletion compared to the sum of the respective single deletions. We classified double deletions that result in zero growth as synthetic lethal and those that reduce the metabolic output as “synthetic sick”. We note that the number of synthetic sick interactions under DA conditions was more than those under LNA in some categories including (100–80)%, (40–20)% and (20–0)%. We did not look at positive synthetic interactions. We note that although there may be limitations in the predictive capabilities of this type of modelling,35 the generated predictions have been validated experimentally in many different cases.36
The raw reads for each library were deposited in the NCBI BioSample database and they are accessible through Sequence Read Archive (SRA) accession number SRP065253.
094 edges (connections) between 1086 metabolic gene products, with an average connectivity of ∼21 and a clustering coefficient of 0.57. The network has 14 connected components; 1040 of the 1081 nodes reside in its largest component. We note that the average degree and clustering coefficient of the network are higher than a typical protein–protein interaction network, alluding to a high interconnectivity of metabolic genes and pathways in the network.
We extended the information content of iRC1080 by defining the evolutionary affinities (i.e., sequence similarity) of genes in the C. reinhardtii network (Table S1, ESI†) with protein-coding genes of major eukaryotic lineages. We interrogated over 250 annotated genomes spanning 13 eukaryotic lineages (Table S2, ESI†) with BLAST and clustered the obtained high scoring hits to assign the affinities (Fig. 1 and Table S3, ESI†). The highest number of affinities is assigned to Viridiplantae (green plants) with Stramenopiles (or heterokonts, which include diatoms, golden, and brown algae) and Metazoa (animals) occupying the next two largest groups. Members of Diplomonadida, which do not possess true mitochondria, have the lowest number of affinities assigned. Interestingly, Choanoflagellida, a group of flagellates closely related to animals have a significantly lower number of assigned affinities compared to animals.43 We note that approximately 200 genes in the network remain unassigned to any eukaryotic lineage (other than C. reinhardtii or potentially to other green alga), as their affinities fall below our set threshold of P < 0.001. These genes are likely to have homology to cyanobacteria and other prokaryotes, while a subset may be Chlamydomonas-specific.
![]() | ||
| Fig. 1 Evolutionary affinities and co-conservation of genes in the network. (A) Phylogenetic affinities of genes in the network are shown as the fraction of total for each of the 13 eukaryotic lineages explored. The list of the genes and the lineages are provided in Tables S1 and S2 (ESI†), respectively. (B) Comparison of distribution of mutually informative pairs between the real and randomized networks. Mutual information of neighboring genes (nodes) in the network is a measure of dynamic co-conservation (see Methods). The y-axis is in the natural log scale; plots in green and red represent the real and random networks, respectively. At the mutual information index value of 0.3, the difference between gene pairs in the random networks and the real network becomes statistically significant (P = 0.001); this value (i.e., 0.3) was used to identify mutually informative pairs. Four hundred and fifty five genes form 908 gene pairs in the network with index values of 0.3 or higher. (C) Comparison of distribution of number of pairs and evolutionary profile distances between the real and randomized networks (see Methods). Evolutionary profile distances are a measure of co-conservation; only the gene pairs that are conserved in at least 50% of the lineages were included. The plots in green and red represent evolutionary profile distance values of gene pairs in the real and random networks, respectively. The y-axis is in the natural log scale. Profile distances of 0.1 or less (dotted vertical line) display statistically significant differences between the real and random networks, these gene pairs are referred to as statically co-conserved pairs. Two hundred and twenty three genes form 775 such pairs in the real network. All values have been normalized to the maximum value and represented in the graph. (D and E) Mutual information and evolutionary profile distance in randomized networks. Based on randomization of the network threshold, the values for the mutual information index and evolutionary profile distance were set. For every pair of genes in the network, if the mutual information index values were higher than or the profile distance values were less than the set thresholds, those pairs were considered dynamically or statistically co-conserved gene pairs, respectively. (D) The real network (red square) has the highest number of high MI pairs compared to all randomized network (P ≤ 0.001). (E) The number of gene pairs with low PD values in the real network (red square) is significantly higher than the randomized network (P < 0.001). | ||
Universally conserved gene pairs have low MI values and cannot be detected as statistically significant pairs (P < 0.001) while they are clearly evolutionarily constrained. We examined the co-occurrence of highly conserved gene pairs in the network by calculating evolutionary profile distances for each neighboring pair in the network and compared them to randomized network distances. At the normalized distance threshold value of 0.1, occurrences of pairs with 0.1 or lower profile distances become statistically significant relative to random networks (P < 0.001) (Fig. 1C). With this threshold, 775 pairs comprised of 223 gene products (21% of genes in the network) can be detected (Table S5, ESI†). Because these gene pairs have similar profiles that are conserved across most or all of the 13 lineages, we refer to these as statically co-conserved pairs.
We further corroborated the mutual information and evolutionary profile distance analyses by randomizing the network structure (while maintaining the affinity vectors intact) and investigated how many dynamic or static pairs occur in the random networks. The randomized networks in both cases show a statistically significant lower number (P ≤ 0.001) of dynamic and static co-conserved pairs relative to the real network (Fig. 1D and E) indicating that the occurrences of these pairs (at the threshold value used) are not random.
Two sub-networks were reconstructed to examine connectivities within the dynamic and static groups (Fig. 2A and B); the gene pairs not assigned as being dynamic or static are not included in these sub-networks. GO term enrichment of the static (low evolutionary profile distance pairs) and dynamic (high MI) pairs showed a number of overlapping terms; however, most terms were enriched uniquely in the dynamic and static networks; “calcium ion binding” was the only term that was shared between the two sub-networks (Fig. 3A and B and Fig. S3, ESI†). GO terms that were exclusively enriched within the static pairs included nucleotide kinase activity and oxidoreductase activity, acting on sulphur group of donors. On the other hand, galactosidase activity and intramolecular oxidoreductase activity (transposing C
C bonds) were enriched within the dynamic pairs (Fig. S3, ESI†). These results demonstrate that there is considerable segregation between the two sub-networks both topologically and functionally.
![]() | ||
| Fig. 2 Dynamically and statically co-conserved pairs in the network. (A) A subnetwork based on the identified dynamic pairs was reconstructed to highlight the connectivity between dynamically co-conserved pairs; non-dynamic nodes were not included in this network. The indicated numbers designate regions of the network described in the text and Fig. S6 (ESI†). The color of the nodes represents their degree (blue highest, dark-red lowest); the size of the nodes corresponds to the clustering coefficient of the nodes. (B) A subnetwork based on the statically co-conserved gene pairs was reconstructed to highlight the connectivity of these genes; non-static genes are not included in this network. | ||
![]() | ||
| Fig. 3 Gene ontology (GO) term analysis of dynamically and statically co-conserved pairs and their hubs in the network. Uniqueness of GO terms for the dynamic and static sub-networks and their associated enrichment P-values are shown for biological process (A) and molecular function (B) ontologies. For each set, over representation probabilities were determined using the C. reinhardtii metabolic network as a reference (see also Fig. S3, ESI†). No overlap is observed between the dynamic and static GO terms at any significance level in biological process ontologies, while an overlap of a single term is detected for molecular function in (B). (C) The hubs in the network (defined as nodes forming the top 20% of highly connected genes) that show evidence of dynamic and static co-conservation are shown in the network (linked respectively with green and red edges) and reported in the Venn diagram. The blue circle marks five of the most connected hubs in the network. | ||
The dynamic network is fragmented and displays more varied conservation. It consists of 89 connected components, many of which consist of isolated bi- or tri-gene groups; its largest connected component consists of 171 genes. The static sub-network is smaller (223 genes) but less fragmented compared to the dynamic sub-network – it encompasses 14 connected components in contrast to 89 components of the dynamic sub-network and its nodes have a higher average degree (6.95 vs. 3.99). The static sub-network is nearly universally conserved.
Hubs, or highly connected nodes in biological networks, often carry important or essential functions.46 To investigate if the hubs in the transformed network show segregation with respect to their co-conservation, we identified highly connected nodes (Table S6, ESI†) and then classified them as dynamic or static on the basis of their interaction with their partnering nodes. We found that hubs with dynamically evolving partners have little overlap with statically evolving hubs (Fig. 3B), which suggests a functional distinction between the two types of hubs. Indeed, the distinction between the two hub types can be observed in the metabolic processes they are involved in; many of the dynamic hubs are involved in photosynthesis or lipid metabolism, whereas the low evolutionary profile distance hubs are involved in central metabolism but not photosynthesis (Tables S7 and S8, ESI†).
Taking the five most connected hubs as examples, four of the five are exclusively dynamic and one is a dual static and dynamic hub (Fig. 3C and Fig. S4, ESI†). The four dynamic hubs encode ferredoxins and are involved in photosynthesis or other metabolic processes such as lipid metabolism. These four hubs have distinct affinities including the following lineages: fungi, Alveolata, Rhodophyta, Stramenopiles, and Viridiplantae. Several distinct ferredoxins are known to be differentially expressed under a variety of specialized conditions.47 For example, in C. reinhardtii, FDX3 has been shown to be involved in nitrogen assimilation, FD4 in glycolysis and response to reactive oxygen species, and FDX5 in hydrogenase maturation under anoxic conditions. Both FDX1 and FDX2 serve as the primary electron donor for NADPH and H2 production, however the electron transfer speed of FDX2 is less than half as fast as that of FDX2,47 so C. reinhardtii is capable of modulating the speed of NADPH and H2 production by differentially expressing these FDXs. Environmental condition variability would have a strong impact on the differential expression and most likely evolutionary maintenance of C. reinhardtii's ferredoxins. The fifth hub in this group encodes an acyl-carrier protein (ACP2), which is involved in lipid metabolism. The encoding gene is conserved across all lineages except for Diplomonadida. There are only three other dual hubs in the network; these encode CYC1 (cytochrome c), a CYC1 paralog, and EamA transporter. Overall, our results support the hypothesis that the dynamic hubs have emerged to fulfill the metabolic fitness of the species under specialized or specific conditions with shared constraints. On the other hand, static hubs are not determinants of specialized metabolic functions, rather they perform universally shared functions.
To examine if the dynamic and static interologs are distinguishable with respect to function, we carried out GO enrichment analyses for the identified interologs. The interolog analysis (Fig. 4 and Fig. S5, ESI†) showed that for the static pairs, many enriched GO terms overlap between C. reinhardtii/A. thaliana and C. reinhardtii/S. cerevisiae; and for dynamic pairs, none of the significantly enriched GO terms overlap. For example, a GO term uniquely enriched in the dynamic interologs of C. reinhardtii/A. thaliana but not in C. reinhardtii/S. cerevisiae is the cGMP biosynthetic process. In Chlamydomonas, nitric oxide (NO)-dependent guanylate cyclases (GCs) mediate nitrogen-assimilatory signalling by forming cGMP from GTP in the presence of extracellular ammonium.50 The presence of these interologs in C. reinhardtii/A. thaliana but not in C. reinhardtii/S. cerevisiae indicates dynamic evolution of these components of the nitrogen assimilation signalling pathway in plants but not in yeast50 which is consistent with dynamic pairs being involved in specialized functions. Altogether, these results indicate that while some rewiring of metabolic functions have occurred during evolution, a significant level of conservation has persisted, which in turn attests to a persistence of selective pressure in the course of evolution. As expected, less rewiring is observed in static pairs, particularly in yeast, which is consistent with the centrality of static pairs in the network.
For statically co-conserved gene pairs, MCODE lists 21 sub-networks with the highest score of 12.211 (20 nodes and 116 edges) and the lowest score of 2.667 (4 nodes and 4 edges). The top GO terms (biological process) found were lipid glycosylation, glycoside metabolic process and cofactor metabolic process.
There was only one significantly enriched GO term (lipid modification) with overlap between the top five modules of dynamically and statically co-conserved pairs. This indicates that (1) the dynamic and static networks are modular, (2) the largest modules have distinct and non-overlapping functions, and (3) the largest dynamic modules are enriched in specialized functions, while the static modules are involved in more general metabolic functions.
000 double deletions under each condition, Fig. 5A, Tables S12 and S13, ESI†) and predicted the resulting biomass yields accordingly. We further binned the interactions according to the resulting level of biomass reduction (Fig. S10, ESI†) and calculated their pairwise evolutionary profile distances. The pairwise profile distances (Euclidean distances) between synthetically interacting genes showed a range of values and in many cases values of above 1, indicating that the genes that are involved in the interactions have distinct evolutionary affinities.
We carried out the Kolmogorov–Smirnov (KS) test for measuring the maximum absolute difference between our data and the standard normal distribution with the null hypothesis being that the distances between the interacting pairs follow that of random interactions in the network. The standard normal distribution was obtained from evolutionary profile distances between all 1086 genes in the network. As illustrated in Table S14 (ESI†), the KS test revealed that the synthetic interaction distribution is not a standard normal distribution. A significant difference was observed between the random pairs in the network and the synthetic interaction profile distances under both light and dark simulated biomass production. These results show that the evolutionary affinities of the genes involved in the synthetic interactions under both conditions of growth in light with no acetate (LNA) and in the dark with acetate (DA) differ from the overall pairwise distance distributions of the network.
To test if the interacting pairs are enriched for short or long evolutionary profile distances as compared to random pairs in the network, we performed hypergeometric tests for enrichment of distances greater than or equal to 1, 2 and 3 for both light with no acetate or LNA and dark with acetate or DA conditions. We observed that under LNA conditions, synthetic interactions with values greater than 2 are significantly enriched; in contrast, under DA conditions, interactions with profile distances of less than 1 and 2 show a significant enrichment (Fig. 6B and Table S15, ESI†). GO term enrichment analysis was carried out on each of the different bins under both growth conditions and major results are found in Table S16 (ESI†). The lists of double-gene knockouts under two different conditions; DA and LNA, were used to create the gene interaction networks using Cytoscape and compare the selected KEGG pathways for each condition (Fig. S11A and B, ESI†). The KEGG pathway enrichment between two conditions (light and dark) for synthetic lethal conditions shows the enrichment of a number of pathways common between the two conditions (Fig. S12, ESI†). As an example, synthetic interactions in the KEGG pathway are shown in Fig. S13–S17 (ESI†).
We identified the genes associated with the reactions in the co-sets and calculated the evolutionary profile distances of all possible gene pairs between reactions (we note that some reactions are associated with multiple genes and some reactions have no associated genes) (Fig. 6C). As in synthetic pairs (described in the previous section), we observed many of the distances to be greater than 1, indicating different phylogenetic profiles among the genes. A hypergeometric test was carried out in relation to the random evolutionary profile distances of the whole network (all possible gene combinations in the network). The co-sets with only one pair of genes were not considered in this analysis. The test revealed the statistical significance of distance values of less than 2 and values of 3 or greater (Tables S19 and S20, ESI†). The enrichment probabilities become more significant with distances of less than one, or 3 or greater. These analyses indicate that the enrichments of co-sets are bipartite relative to random network distances, with over-representation of both short and long distances within the sets.
The presence of non co-conserved pairs in the network, which constitute the majority of pairs, implies that functional constraints for these genes are not shared between C. reinhardtii and the explored lineages in the context of the studied metabolic network, at least not in the context of neighboring gene-pairs in the network in a consistent manner. This fluidity in co-conservation, which we also observe in functional analyses of the network, in turn suggests that rewiring of metabolic pathways may be a significant contributing force behind evolutionary adaptations as recent data have suggested being the case in genetic interaction and transcription networks.59,60
Our analyses identify most network hubs as either dynamic or static with very few having characteristics of both. This is a consequence of the topological segregation of dynamic and static pairs. As we have demonstrated, this segregation is also manifested with respect to both regulation, as judged on the basis of enrichment under light and dark growth conditions (Fig. 5C), and with respect to function as indicated on the basis of differential enrichments of GO terms. With respect to the latter, we note that this differential enrichment is observable at the level of the entire subnetwork (Fig. 3A and B and Fig. S3, ESI†) as well as at the module level (Fig. S6 and Table S9, ESI†). Taken together, the observed topological, temporal, and functional segregation of the static and dynamic pairs and hubs suggests that these segregated organizations may provide adaptive values in varying evolutionary niches. In biological terms, the different ferredoxins that form major hubs in the network are expected not to be interchangeable as they may have different redox potentials.61 As we have illustrated, these proteins have different evolutionary affinities and mostly demonstrate dynamic co-conservation, which likely reflect different biochemical requirements in species belonging to different lineages. Taking into account the crucial functions of ferredoxins and their involvements in large sets of reactions and pathways,62 selective pressures in maintaining the optimal redox potential can be expected for a specific set of ferredoxins in each lineage. This and other similar hypotheses fall in line with what Fang et al., 2013 set forth in terms of gene co-expression and evolution.63 They concluded that selective pressure acts on the relationship between genes rather than on individual genes, which may further explain the maintaining of a set of ferredoxins within the C. reinhardtii metabolic network.
Our analyses show a range of evolutionary profile distances for genes in coupled reaction sets as well as those with predicted synthetic interactions, which as in our topological analyses, point to fluctuations in co-conservation within the network despite a shared or related function. Synthetic pairs identified under “dark” metabolism are enriched for pairs with (Euclidean) distances of 1 or less in their phylogenetic profiles, indicating that these gene pairs have very similar phylogenetic profiles. Gene pairs showing synthetic interactions under light growth are enriched in distant values of 2 or greater and less than 1, the former indicates distant evolutionary profile distances despite a related function in the network. Genes in the co-sets show a similar bimodal enrichment with some extremes observable, that is, some co-sets are enriched with less than 1 and some are enriched for values of equal or greater than 3.
Notably, co-sets under both dark and light conditions, and with a range of profile distances are shown to be involved in purine catabolism, N-glycan biosynthesis, and fatty acid biosynthesis. Importantly, the N-glycan biosynthesis pathway involves an intersection of light and dark relevant co-sets with long and short profile distances, respectively (Note S1, ESI†). Furthermore, synthetic lethal interactions link N-glycan metabolism and fructose and mannose metabolism (profile distance of 3.6), and the pentose-phosphate pathway with the biosynthesis of steroids (profile distance of 0) under light conditions. As for dark condition interactions, amino acid synthesis and nitrogen metabolism are observed to interact with a profile distance of 3.4 (Note S2, ESI†).
It is to be noted that correlated reactions as well as synthetic interactions can be distant in the network topologically (a few examples are shown in Fig. S13–S17, ESI†). Therefore, enrichment for large evolutionary profile distances may coincide with distant placements in the network. As such, the C. reinhardtii network can be hypothesized to have assembled by evolutionary adaptive processes in such a way that evolutionary rigidity (exemplified as statically co-conserved pairs and short distances in synthetic and co-set pairs) and plasticity (exemplified by dynamically co-conserved pairs and long distances in synthetic and co-set pairs) are segregated. The need for such plasticity may be evident at the physiological level with the recent observation that a wide range of metabolites can be utilized by C. reinhardtii as nitrogen sources, including di- and tripeptides as well as a number of D-amino acids.64 Moreover, when buffering of pathways is required, the network architecture makes use of genes with dissimilar phylogenetic profiles. These findings provide an alternative and a wider perspective on metabolic network architecture and evolution.
Footnotes |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c6mb00237d |
| ‡ These authors contributed equally to this work. |
| § Present address: Genocea Biosciences, 100 Acorn Park Drive, Cambridge, MA, USA. |
| ¶ Present address: BGI-Shenzhen, Shenzhen 518083, China. |
| || Present address: Departments of Obstetrics & Gynecology, Nanfang Hospital, Southern Medical University, Guanzhou, 510515, China. |
| This journal is © The Royal Society of Chemistry 2016 |