Open Access Article
Anxo
Lema-Saavedra
a,
Antonio
Fernández-Ramos
ab and
Emilio
Martínez-Núñez
*b
aCentro Singular de Investigación en Química Biológica y Materiales Moleculares (CIQUS), Universidade de Santiago de Compostela, C/Jenaro de la Fuente s/n, 15782, Santiago de Compostela, Spain
bDepartamento de Química Física, Facultade de Química, Universidade de Santiago de Compostela, Avda. das Ciencias s/n 15782, Santiago de Compostela, Spain. E-mail: emilio.nunez@usc.es
First published on 17th September 2025
The formation of complex organic molecules (COMs) in the interstellar medium (ISM) is central to astrochemistry and prebiotic chemistry, as these species may act as precursors to biomolecules essential for life. Among COMs, glyceraldehyde (HOCH2CH(OH)C(O)H, GCA) has attracted attention as a potential building block in early biochemical pathways. Although GCA has not yet been detected in the ISM, the presence of structurally related compounds in various astronomical environments suggests that it may form under interstellar conditions. In this study, we employed the automated reaction discovery tool AutoMeKin to systematically explore the gas-phase chemical reaction networks (CRNs) of C3H6O3 (GCA), C3H7O3 (a hydrogenated analog), and C2H5O2. Reaction pathways were characterized at the ωB97XD/Def2-TZVPP level of theory, and rate coefficients for key processes were computed using the competitive canonical unified statistical (CCUS) model, which accounts for multiple dynamic bottlenecks. Our analysis revealed several barrierless pathways leading to GCA or to GCA and a leaving group. Notably, the reaction between glyoxal (HCOHCO) and the HOCHCH2OH radical, though neither has yet been detected in the ISM, was found to efficiently produce GCA and a formyl radical, with rate coefficients on the order of 5.4–7.9 × 10−10 cm3 molecule−1 s−1 across the 10–100 K temperature range. However, aside from the aforementioned exception, most GCA formation channels result in highly vibrationally excited intermediates that are more likely to undergo rapid unimolecular decomposition than to be stabilized by radiative emission under typical ISM conditions. These results suggest that while gas-phase GCA formation is chemically feasible, it is likely transient and difficult to detect directly. In contrast, alternative products such as formaldehyde, glycolaldehyde, and (Z)-ethene-1,2-diol dominate many pathways and align better with current astronomical observations. This work provides detailed mechanistic and kinetic insights that enhance astrochemical modeling and advance our understanding of molecular complexity in star-forming environments. Furthermore, it highlights the utility of automated CRN exploration for uncovering viable synthetic routes to prebiotic molecules in space.
The gas-phase decomposition of GCA has been investigated experimentally through pyrolysis,2 and the early steps of the Formose mechanism have been studied computationally using density functional theory (DFT).1 Under pyrolytic conditions, GCA starts decomposing at 400 °C, undergoing retro-aldol fragmentation and dehydration. The retro-aldol reaction predominantly yields glycolaldehyde and formaldehyde.3,4
Although GCA has not yet been detected in the ISM despite dedicated observational searches,5 its structural analogs, such as glycolaldehyde and ethylene glycol, have been observed in star-forming regions and comets,6–9 suggesting that it may also form under astrochemical conditions. Laboratory experiments and computational studies indicate that GCA may originate from formaldehyde-based reactions in interstellar ice analogues, driven by ultraviolet irradiation, radical recombination, or thermal processes,2 or from the reaction of (Z)-ethene-1,2-ediol with hydroxycarbene (HOCH).10 Understanding the formation pathways of GCA and related species in the ISM provides valuable insights into the chemical processes that drive molecular complexity and, ultimately, the emergence of life's building blocks in space.
This study explores potential gas-phase formation pathways for GCA, as well as other key reactions relevant to astrophysical environments, evaluating their viability under interstellar conditions. The formation of COMs in the ISM can proceed through various reaction mechanisms. Given the extremely low temperatures of interstellar clouds (10–150 K), gas-phase reactions must generally proceed without significant activation barriers.11 In other words, such reactions are expected to follow energetically favorable pathways with submerged barriers,12,13 often involving at least one radical or ion species. Additionally, due to the extremely low densities of interstellar clouds, product stabilization is inefficient unless radiative cooling provides a competitive mechanism.11
In particular, we investigate two types of GCA gas-phase formation reactions:
| R1˙ + R2˙ → GCA + hν | (1) |
| R3˙ + M → GCA + H˙ | (2) |
To explore these reaction mechanisms, we employ AutoMeKin,14–16 an automated tool for discovering chemical reaction networks (CRNs). AutoMeKin has been successfully used to propose gas-phase formation routes of various interstellar molecules, including benzene,17 cyanoketene,18 cyanamide,19 formaldehyde,20 glycolonitrile,21 hydrogen isocyanide,22 indole,13 malononitrile,23 vinyl alcohol,24 and (Z)-1,2-ethenediol.25
In this study, we employ AutoMeKin to construct detailed reaction pathways for species with the chemical formulas C3H6O3 and C3H7O3, corresponding to reactions (1) and (2), respectively. C3H6O3 represents GCA, while C3H7O3 corresponds to GCA with an additional hydrogen (GCA–H). Additionally, we investigate the C2H5O2 CRN using AutoMeKin due to its close relationship with the other CRNs. Through the analysis of these networks, we identify key intermediates and reaction channels, and we perform kinetic calculations to determine rate coefficients for the most relevant reactions. The computed rates provide valuable inputs for astrochemical models.
Our findings have the potential to bridge gaps in current astrochemical models by providing accurate rate coefficients and detailed reaction mechanisms. As observational techniques continue to advance, the insights gained from this work will contribute to a deeper understanding of the chemical evolution of star-forming regions and the origins of molecular complexity in space.
Additionally, AutoMeKin now also incorporates an improved algorithm for identifying barrierless reaction pathways.13 Since this algorithm relies on level 1 calculations, we further supported the barrierless character of selected pathways by performing relaxed scan optimizations at level 2, corrected with single-point B2PLYP/Def2-TZVPP calculations. As shown in the Supplementary Information (SI), the energy profiles of the selected pathways confirm the absence of barriers.
For a subset of key reactions, energies were further refined using CCSD(T)/CBS calculations (level 3). The CBS limit was estimated via extrapolation using the jun-cc-pVXZ basis set (with X = D, T).28 level 2 and level 3 calculations were performed using Gaussian09.29
Beyond its core capabilities in MD simulations and graph-theoretic analysis, AutoMeKin has recently been extended with a Python-based library, amk_tools, designed for parsing, processing, and analyzing CRNs.13 Custom Python scripts further allow for the adaptation of energy profiles generated by amk_tools and facilitate the comparison of the predicted decomposition products with known interstellar molecules.
For the C3H6O3 and C3H7O3 molecular systems, the AutoMeKin workflow was executed iteratively 30 times, with each cycle comprising MD simulations, optimization of minima and transition states (TSs), and CRN generation. Each MD simulation set included 500 trajectories, with a maximum simulation time of 0.5 ps. For the smaller C2H5O2 system, the workflow was run for 20 iterations with 480 trajectories per cycle.
The transition-state filtering criteria from our previous study on indole decomposition13 were also applied here. These criteria served to remove structures with very low imaginary frequencies and to eliminate redundant TSs. Additional details on the methodological approach are provided in ref. 13.
To evaluate the convergence of the CRNs obtained through iterative refinement, kinetic Monte Carlo (KMC) simulations were conducted to solve the chemical master equation.30 Specifically, the abundances of decomposition products were determined by exciting 103 molecules of GCA and GCA–H at 250 kcal mol−1. The reaction rate coefficients were estimated using RRKM theory,31 given by:
![]() | (3) |
While our study primarily focuses on the barrierless formation pathways of GCA, the complete CRNs are available in Zenodo.33 The analysis presented here is based on lumped networks, in which conformers are grouped together; however, the full networks, including all individual conformers, are also provided in Zenodo.33 In these networks, minima, transition states, and products are labeled as MIN, TS, and PROD (or PR), respectively, with numerical identifiers distinguishing different species. The labels for each network are assigned independently. Molecular visualizations and representations were performed using CYLview.34
![]() | (4) |
![]() | (5) |
![]() | (6) |
| kf = Cμ−1/2(dAdB)2/3T−1/6 | (7) |
The rate coefficients k1 and k2 are obtained using canonical variational transition state theory (CVT),38 with A and B as the reacting species. For processes involving hydrogen transfer, tunneling corrections are included via the small curvature tunneling (SCT)39 model. The rate coefficients were calculated under the assumption of zero pressure.
When there is no competition between pathways and S* can undergo only a single unimolecular reaction (k1), the applied model is referred to as the canonical unified statistical (CUS) model, and the rate coefficient is given by kCUS = kfk1/(kf + k1).
The vibrationally excited species S* may also undergo a radiative stabilization process:
![]() | (8) |
![]() | (9) |
![]() | (10) |
In the calculations above, all vibrational frequencies were scaled by the recommended factor of 0.975.41
1. Purple and violet-blue pathways (left of Fig. 1). These routes involve hydroxycarbene (HOCH) reacting with either (Z)-ethene-1,2-diol (HOCHCHOH, violet-blue) or (E)-ethene-1,2-diol (purple). Among these three species, only (Z)-1,2-ethenediol has been detected in the ISM.10 Notably, the violet-blue route corresponds to the reaction mechanism hypothesized by Rivilla et al.10 These are the only pathways involving a reaction between two singlet species. Both routes lead to GCA in five elementary steps, with the resulting GCA* possessing an excess energy of approximately 80 kcal mol−1. The first step in both routes involves the association of reactants to form stable van der Waals (vdW) complexes, which then rearrange to produce cyclopropane-1,2,3-triol (at 21.6 kcal mol−1). The rearrangement in the violet-blue route is more favorable than in the purple route.
2. Dark blue pathway (center of Fig. 1). This route begins with the barrierless association of two radical species at 104.4 kcal mol−1: hydroxymethyl (CH2OH) and HOCCHOH radicals. This pathway converges with the purple one at the (Z)-prop-1-ene-1,2,3-triol intermediate (at 6.5 kcal mol−1), leading to GCA* in just two steps, but with a significant excess energy of 104 kcal mol−1. Additionally, while none of the reactants have been detected in the ISM, protonated formaldehyde (CH2OH+) has been observed.42
From the (Z)-prop-1-ene-1,2,3-triol intermediate, the convergence point of the three aforementioned pathways, the most favorable route proceeds through TS98 to yield 2-hydroxyprop-2-enal (CH2COHCHO) and water, rather than forming GCA.
3. Light green pathway (center of Fig. 1). This route shares one reactant with the dark blue pathway: hydroxymethyl radical. The other reactant, the hydroxymethylcarbonyl radical (HOCH2CO), has not been detected in the ISM, but it was identified as a product of the reaction between atomic hydrogen and glycolaldehyde in solid para-hydrogen at 3.2 K.43 As in the previous case, this route leads to GCA in two steps.
4. Red route (right of Fig. 1). In this pathway, the formyl radical (HCO) undergoes a barrierless radical–radical association with HOCHCH2OH, leading to GCA in a single step. Among the two reagents, only the formyl radical has been detected in the ISM.44
5. Light blue route (right of Fig. 1). This pathway involves two reactants: the hydroxymethyl radical and the 2-oxoethylideneoxidanium radical (HOCHCHO), the latter of which has been proposed as a precursor to GCA formation in dark regions of the ISM.43 Similar to the previous route, GCA forms through the association of the reactants in a single step.
Although some of the reactions described above appear plausible due to their barrierless nature, GCA* forms with a significant excess energy (62.9 kcal mol−1 in the most favorable case, specifically in the light blue route). In this scenario, GCA* can only dissipate energy through radiative emission of IR light or undergo further unimolecular reactions. An RRKM calculation for the unimolecular decomposition of GCA at 62.9 kcal mol−1 predicts 100% abundance of formaldehyde and ethene-1,2-diol, after surmounting an energy barrier of 36.8 kcal mol−1 (see Fig. 2). The estimated radiative emission rate is 111 s−1, which is orders of magnitude lower than the unimolecular decomposition rate of GCA*, leading to formaldehyde and 1,2-ethenediol, which is approximately 3 × 106 s−1. Given the large disparity between the rates of radiative emission and unimolecular decomposition, we have chosen to neglect radiative stabilization in all kinetic calculations, as the excess energy above the decomposition barrier is consistently high and this process is not expected to compete effectively.
The following reaction represents the outcome of the radical–radical association described in the light blue route:
| CH2OH + HOCHCHO → CH2O + HOCHCHOH | (11) |
The rate coefficients for reaction (11) evaluated using the CCUS model and presented in Table S1 of the SI, indicate that the abundance of the Z isomer is orders of magnitude higher than that of the E isomer. Notably, only the Z isomer has been detected in the ISM.10 However, since the reactants involved in reaction (11) have not yet been observed in the interstellar medium (ISM), this reaction cannot be considered a viable formation pathway for 1,2-ethenediol in the ISM.
The rate coefficients for the formation of formaldehyde and (Z)-1,2-ethenediol from reaction (11) in the 10–100 K temperature range vary from 8.7 × 10−10 and 6.0 × 10−10 cm3 molecule−1 s−1.
Fig. 2, along with our kinetic calculations, shows that the predominant decomposition pathway of GCA leads to the formation of formaldehyde and 1,2-ethenediol. Therefore, the light green, red and light blue pathways depicted in Fig. 1 exclusively yield these two products, with GCA serving as a key intermediate. In contrast, the purple, violet-blue, and dark blue pathways result in the formation of water and 2-hydroxyprop-2-enal.
As in the CRN described in the previous section, the screening process was similar, i.e., reactions where atomic hydrogen is released and that involved barriers whose energy were below the energy of reactants were sought. Additionally, among the possible reactants, those that could rearrange through an exothermic hydrogen transfer were discarded, as this process would potentially be much faster than reaction (2).
We found that the reactions between the reactant pairs shown in Fig. 3 are particularly relevant.
The reaction between the fragments depicted in Fig. 3a, although leading to the formation of GCA–H without a barrier in a single step, does not yield GCA and H, as their energy is higher than that of the reactants. However, this reaction is of particular interest because no lower-energy pathway exists for these reactants other than its dissociation back into the original fragments. Consequently, the only possible outcomes of a reactive encounter between these two species are either immediate dissociation or stabilization of the complex via spontaneous radiative emission. As mentioned above, formaldehyde is a compound detected in the ISM,45 whereas the larger fragment, the HOCHCH2OH radical, which also appears in the red route of Fig. 1, has not been observed. However, the reaction between hydroxyacetaldehyde, which has been detected,46 with atomic hydrogen to form HOCHCH2OH has been reported to be exothermic by 24.4 kcal mol−1, albeit with a significant barrier.43
The reactants depicted in Fig. 3b would lead to the formation of GCA + H˙ in four steps, with the reaction being exothermic by 17.5 kcal mol−1. However, neither of these two compounds has been found in the ISM. The two-carbon molecule is the radical derived from ethanol (detected in the ISM)47 lacking a hydrogen atom. The other reactant in the high-energy isomer of formic acid (HOCOH), which can undergo hydrogen transfer to form the more stable acid.
The reaction between the reactants depicted in Fig. 3c is the most promising and is discussed in more detail below.
Finally, the reaction between the fragments shown in Fig. 3d is the most energetically favorable of all, with an energy difference of 60.6 kcal mol−1 between the reactants and GCA + H˙, leading to the formation of GCA in five steps. Additionally, the small fragment, methylene (CH2), has been detected in space.48 However, the larger fragment is relatively complex, with three oxygen atoms. Nevertheless, the existence of this compound would not be unexpected, as it is structurally similar to ethene-1,2-diol with an additional OH group. Various hypotheses could be proposed regarding its formation.
As shown in Fig. 4, the reaction between hydroxycarbene and HOCHCH2OH (at 43.5 kcal mol−1), the reactants of Fig. 3c, is barrierless and highly energetically favorable, leading to the formation of glyceraldehyde and atomic hydrogen after three steps. However, the energy profile of Fig. 4 clearly indicates that the most favorable pathway results in the formation of the fragments displayed in Fig. 3a, formaldehyde and the HOCHCH2OH radical, or to the formation of water and HOCH2CHCHO. The former represents a peculiar and less efficient pathway for the isomerization of hydroxycarbene into formaldehyde.
As mentioned above, hydroxycarbene has not been detected in space. One possible reason for this could be that hydrogen transfer between oxygen and carbon, leading to the more stable formaldehyde conformer, occurs relatively easily. This reaction has been previously investigated using higher levels of theory than those employed in the present study.49,50 Nevertheless, this compound frequently appears in our CRNs and seems to be a highly reactive species, driving several barrierless and highly exothermic reactions. The numerous reaction pathways that the presence of this compound in the ISM would enable motivated us to study its unimolecular kinetics in greater detail.
The HCOH molecule can adopt two possible conformations: syn and anti. The anti conformation can undergo hydrogen transfer via tunneling, leading to the formation of formaldehyde, as shown in Fig. 5. Using both the CCUS and the CVT/SCT models, the rate coefficient for the formation of formaldehyde from hydroxycarbene in the temperature range 10–50 K temperature range varies between 1.3 × 10−4 and 7.5 × 10−3 (see Table S2 in the SI for detailed values). On the other hand, the rate coefficients for the reaction of hydroxycarbene with HOCHCH2OH radicals are on the order of 7.2–9.4 × 10−10 cm3 molecule−1 s−1. Therefore, the rearrangement of hydroxycarbene depicted in Fig. 5 may potentially compete with its reaction with HOCHCH2OH, depending on the abundance of these radicals.
Finally, we also explored other interesting reactions identified within this CRN, beyond those leading to GCA. In this context, we identified a highly efficient formation route for both hydroxymethyl radical and glycolaldehyde (HOCH2CHO) from hydroxycarbene and the HOCH2CH2O radical:
| HOCH + HOCH2CH2O → CH2OH + HOCH2CHO | (12) |
| R3˙ + M → GCA + X˙ | (13) |
| HCOHCO + HOCHCH2OH → GCA + HCO | (14) |
![]() | ||
| Fig. 6 DFT-computed reaction energies for various reactions of the type R3˙ + M → GCA + X˙ (reaction (13), see text). In each case, the zero-energy reference corresponds to the products, with the respective reactant pairs indicated in the figure. | ||
Glyoxal can exist in two structural forms: E and Z. Although it has not been detected in the ISM yet, its presence in space has been hypothesized based on the dimerization of HCO radicals.51 One possible reason for its elusive detection is that the lowest-energy isomer (E) has a vanishing dipole moment, making it undetectable by rotational emission.10 While we provide rate coefficients for reactions involving glyoxal, we emphasize that these results are intended to support astrochemical modeling efforts and do not constitute evidence that such reactions are responsible for the formation of GCA in the ISM.
The reaction pathways connecting the reactants and products of reaction (14), computed at level 2, are shown in Fig. 7. The corresponding level 3-corrected pathways are presented in Fig. S5.
The rate coefficients for the formation of GCA and HCO viareaction (14) were computed in this study. The reaction is limited by the initial association step, and the corresponding rate coefficients are provided in Table S4 of the SI. In the 10–100 K temperature range, the rate coefficients for the Z isomer are higher than those for the E isomer and range from 5.4 × 10−10 to 7.9 × 10−10 cm3 molecule−1 s−1.
In the C2H5O2 CRN of Fig. 8, we can distinguish two large regions separated by a high barrier (TS93). On the left side, we observe both the E and Z isomers of ethene-1,2-diol and atomic hydrogen (at 35.2 and 39.0 kcal mol−1). The reaction between any of the two isomers and hydrogen has a barrier, which rules it out as a possible pathway for the formation of other molecules. However, ethene-1,2-diol can be formed from the reaction between oxirane and hydroxyl radical (located in the center of the figure at 46.6 kcal mol−1). However, we can assume that this reaction would exclusively lead to the most favorable route, resulting in formaldehyde and the hydroxymethyl radical (at 19.6 kcal mol−1).
Another interesting aspect of the profile is the reaction between ethenol (CH2CHOH) and hydroxyl radical (at 30.3 kcal mol−1). These two species, both detected in the ISM, react without a barrier to form the HOCHCH2OH radical. Once this radical is formed, the only two possibilities are either decomposition back into the initial reactants or stabilization via spontaneous radiation. This result supports the consideration of the HOCHCH2OH radical as a potential reactant, since, even if it has not been detected, its existence in the ISM would be expected.
On the right side of the energy profile, the other isomer of acetic acid, methyl formate (CH3OCHO), appears along with atomic hydrogen at an energy of 16.2 kcal mol−1. Although its formation would be energetically viable from methoxy radical (CH3O) and hydroxycarbene (at 78.8 kcal mol−1), it would be expected that formic acid (HCOOH) and methyl radical (at −5.5 kcal mol−1) would be the predominant products.
It should be noted here that a more comprehensive study by Barone's group on the same reaction network indicates that H-abstraction in the reaction between the OH radical and vinyl alcohol primarily yields the CH2CHO radical and water.25 Our results cannot confirm this conclusion, as they are based on the analysis of a lumped network in which reactions of the type A + B → C + D are not considered.
Our analysis of the C3H6O3 CRN reveals several plausible formation routes for GCA, including radical–radical and radical–molecule association reactions. Notably, pathways involving hydroxycarbene, HOCHCH2OH, formyl and hydroxymethyl radicals proceed without barriers and exhibit significant exothermicity. Despite their kinetic feasibility, these reactions typically yield highly energized GCA* molecules, which are prone to rapid unimolecular decomposition before they can be stabilized via radiative cooling. Predicted decomposition products include water, 2-hydroxyprop-2-enal, glycolaldehyde, formaldehyde, and ethene-1,2-diol; all species that have either been observed in the ISM or are considered highly probable.
The C3H7O3 CRN further supports the hypothesis that GCA functions as a transient intermediate. Even within this network, the most accessible pathways favor decomposition over the accumulation of stable GCA. We also explored alternative reactions analogous to GCA formation that involve the departure of larger radicals instead of atomic hydrogen. Among these, the reaction between glyoxal and the HOCHCH2OH radical is particularly notable for its barrierless and exothermic profile. However, the detection of glyoxal in the ISM remains challenging due to its near-zero dipole moment.
Our complementary analysis of the C2H5O2 network underscores the chemical significance of the HOCHCH2OH radical, which emerges repeatedly as an intermediate in multiple reaction schemes. Although this species has not yet been detected in the ISM, its formation is supported by barrierless reactions involving known interstellar molecules such as ethenol and hydroxyl radicals.
Taken together, these results suggest that while the gas-phase formation of GCA is chemically feasible under ISM conditions, it is likely outcompeted by pathways leading to other, more stable complex organic molecules (COMs). GCA may exist only fleetingly in interstellar environments, serving as a reactive intermediate en route to more persistent species. The data generated in this study, including structural information, reaction mechanisms, and rate coefficients, provide essential input for astrochemical models and may inform future observational strategies.
Overall, this work highlights the power of automated CRN exploration in astrochemistry and reinforces the notion that molecular complexity in space emerges not only through stepwise buildup but also through dynamic networks of reactive intermediates, shaped by the unique physical conditions of the interstellar medium.
Supplementary information: convergence of product abundances with iteration number, computed rate coefficients and plots of selected barrierless pathways. See DOI: https://doi.org/10.1039/d5cp01690h.
| This journal is © the Owner Societies 2025 |