Michaël Méret‡
a,
Daniel Kopetzki‡b,
Thomas Degenkolbe‡a,
Sabrina Kleessena,
Zoran Nikoloskia,
Verena Tellstroemc,
Aiko Barschc,
Joachim Kopkaa,
Markus Antonietti*b and
Lothar Willmitzera
aMax Planck Institute of Molecular Plant Physiology, Potsdam-Golm, D-14476, Germany
bMax Planck Institute of Colloids and Interfaces, Potsdam-Golm, D-14476, Germany. E-mail: markus.antonietti@mpikg.mpg.de
cBruker Daltonik GmbH, Bremen, D-28359, Germany
First published on 6th February 2014
Metabolomics comprises of the monitoring of small molecules present in a biological system as a function of time and space. Coupled with emerging modeling approaches, it facilitates predictions of reaction sequences. Here, we explore the potential of metabolomic tools for analyzing the complex chemical systems in a model reaction, the hydrothermal reforming (HTR) of glycine. The profiles for more than 20 monitored compounds were used to reconstruct the glycine reaction network. The mechanism of glycine conversion into serine and alanine was validated, where new carbon–carbon (C–C) bonds are formed from the C2-position of glycine. We thus demonstrated that metabolomic methods are useful for the analysis of complex combinatorial problems in chemistry.
For the illustration of the potential advantages and restrictions, we apply here these tools to the hydrothermal reforming (HTR) of glycine, which is known to result in a large number of biologically relevant intermediates and platform chemicals. Glycine is a simple, abundant and multifunctional chemical which was likely relevant to pre-biotic chemistry.1–3 In evolutionary terms, HTR reactions are discussed as palaeo-chemical reactions that preceded life and that may still be active on Earth, for example, at deep sea hot vents.2,4–7 New insights into glycine HTR may lead to the use of glycine as a reaction modulator of the hydrothermal carbonization of carbohydrates.8–11 HTR, once appropriately understood, is however also expected to enable the green synthesis of valuable chemicals in de novo biorefining schemes.
Fig. 1 HTR products of glycine. The chemicals were identified by paired GC-EI-TOF-MS and GC-APCI-TOF-MS. GC-APCI-TOF-MS enabled the unambiguous elucidation of molecular formulas from complex mixtures (Table S1†). Structure hypotheses were deduced from molecular formulas and the paired EI-TOF-MS fragmentation spectra. |
Our GC-MS analysis resulted in no less than 21 glycine HTR products (Fig. 1, Table S1†), exceeding the previously reported set of products.12,13 These encompassed highly diverse structures (Fig. 1) ranging from aromatic and non-aromatic heterocycles (16–22), to amino acids, such as alanine (13) and serine (14), to carboxylic acids, such as glycolic acid (3) and glyoxylic acid (4). In addition, condensation products, such as iminodiacetic acid (7), N-glycylglycine (6), and glycine-N-methylamide (12) could be identified. As described in the ESI,†the chemical identity of the HTR products was determined by fast scanning GC-EI-TOF-MS and high mass resolution GC-APCI-TOF-MS. Moreover, glycine isotopomers differentially labeled with stable 13C, 15N or 2D isotopes were used as the starting materials to cross-check the structural analyses (Fig. 1, Table S1, and ESI†).
The first statistical method, hierarchical cluster analysis, separated the HTR products into five distinct clusters. Glycine, as the substrate, was the sole member of cluster 5 (Fig. 2B). Glycolic acid, glyoxylic acid, oxamic acid (3–5), and N-glycylglycine (6) appeared early (cluster 1). Cluster 2 contained the subsequently occurring amino acids alanine (13) and serine (14), as well as the heterocycles hydantoin (16) and four hydroxylated pyrazines (17–20). According to GC-MS and confirmed by NMR, the main glycine HTR product is 3,6-dihydropyrazine-2,5-diol (17).
The time pattern of (17) was in agreement with a previously proposed reaction path via the early intermediate N-glycylglycine (6).13 3,6-Dihydropyrazine-2,3,5-triol (19) represents a mixed dimer of glycine and either glyoxylic (4) or oxamic acid (5), both members of the early cluster 1. Cluster 3 contained a third dihydropyrazine, 3-methyl-3,6-dihydropyrazine-2,5-diol (21), which can be interpreted as a mixed dimer of glycine (1) and alanine (13), which appeared after the preceding accumulation of alanine. The presence of pairs of pyrazines/dihydropyrazines, namely (17)/(18), (19)/(20), and possibly (21)/(22) (Fig. 1, Table S1†) indicated the existence of a dehydrogenation reaction which irreversibly transformed the dihydro-dimers into the more stable aromatic structures.
It is interesting to note that the observed pyrazines are positional isomers of pyrimidines, the biologically relevant nucleobases. The presence of uracil, 5,6-dihydrouracil, thymine or 5,6-dihydrothymine, can however be ruled out. Cluster 3 included in addition, iminodiacetic acid (7), sarcosine (8), and glycineamide (11) which represented potential alternative condensation and decarboxylation products of glycine. Cluster 4 contained glycine-N-methylamide (12), a decarboxylation product of N-glycylglycine (6). The remaining constituents of cluster 4 indicated the gradual increase of CO2/H2CO3 (2) which is generated by decarboxylation reactions. The N-carboxyamines (9), (10), (15) represented reversible products of amines with CO2/H2CO3 (2).
The second statistical method applied to complex data sets is “principal component analysis” (PCA), a method for dimension reduction. Subjecting the joined kinetic data sets (i.e. the reactions taking place at 180 °C and 250 °C) to a PCA, we find that the reactions performed at 180 °C and 250 °C follow the same trajectories, with the reaction at 180 °C just trailing behind the reaction at 250 °C. Therefore, HTR at the two temperatures runs very similarly, with the 250 °C experiments only being faster. The principal components PC1 and PC2 explain 77% and 7% of the total variance, respectively. The decomposition of glycine (1) and the accumulation of oxamic acid (5) explain the major variance of the early HTR process (Fig. 2D).
Applying a simple Spearman correlation to the time-resolved profiles did not allow us to identify associations between the products (Fig. S2A and B†). Thus, in a second step, the first-order partial correlations for the network construction were applied. This approach is applied to remove spurious secondary correlations16,17 and can thus identify the concerted appearance or consumption of compounds. In chemistry, the statistical associations found in this network imply that the respective compounds are common products of a single reaction or are linked by sequential reactions. For instance, 3,6-dihydropyrazine-2,5-diol (17) and pyrazine-2,5-diol (18), and 3,6-dihydropyrazine-2,3,5-triol (19) and pyrazine-2,3,5-triol (20) are correlated (ESI Results, Fig. S2C and D†), which means that they form simultaneously or in a direct cascade.
Usually, simple correlation-based approaches consider the association of compounds across the full monitored time interval. Here, we found at least two different phases of the glycine HTR system (Fig. 2), i.e., early processes, predominantly seen in the 180 °C kinetic series, and late processes, more active in the 250 °C time series (Fig. 2C). The resulting two networks for the glycine HTR were strikingly different. Only a single edge, namely between N-glycylglycine (6) and glycolic acid (3), is shared between the two networks, indicating that this product pair is relevant in both the early and late phase of the glycine HTR system. While the 180 °C network was composed of single paths, the 250 °C HTR network contains locally dense regions (ESI Results, Fig. S2A–D†). This means that the complexity found is created in the later stages of the reaction.
Scheme 1 Reconstructed reaction networks from the data at (A) t2 = 0.60 min and (B) t7 = 3.58 min of the 180 °C HTR together with the interpretation in terms of a chemical reaction network. The reconstructed networks were comprised of reaction nodes (gray) and compound nodes (orange), which were linked by edges that indicated the compounds participation on the same side of the reaction. The positional label information indicated by red asterisks was in agreement with the positional labeling obtained from HTR of 1-13C-glycine and 2-13C-glycine (ESI Results, Fig. S3†). |
At t4, the reconstructed reaction network indicated that iminodiacetic acid (7) was formed from two molecules of glycine (1) by the loss of NH3 (ESI Results, Scheme S1†). Sarcosine (8) and glycine-N-methylamide (12) resulted from the condensation of glycine (1) and methylamine. The equilibrium between the previously formed (Scheme 1A, t2) N-carboxyglycine (9), and glycine (1) was displaced in favor of the dimerization of two glycine molecules to 3,6-dihydropyrazine-2,5 diol (17). This compound was then dehydrogenated to the thermodynamically stable aromatic compound pyrazine-2,5-diol (18).
Systems chemistry, as we suggest calling this approach in analogy to systems biology, can improve our insights into reaction cascades or just detrimental side reactions causing yield limitations. However, the observation of (at a given set of conditions) spurious side products and the analysis of the path of their formation can point to new possible reaction schemes, and in the case of food chemistry, these side products can define taste and value. Most importantly, systems chemistry approaches might enable the analysis and finally, the understanding of highly complex chemistry involving reactions of multiple substrates and mechanisms occurring at the same time.
Footnotes |
† Electronic supplementary information (ESI) available: Methods, supplementary results. See DOI: 10.1039/c3ra42384k |
‡ Contributed equally. |
This journal is © The Royal Society of Chemistry 2014 |