Halis Seuretab,
Ailish D. Sullivanb,
Cercis Morera-Boado*g,
Tina A. Harriott*bc,
Daniel Majaess
*bc,
Lou Massad and
Chérif F. Matta
*bef
aCentro de Investigaciones Químicas, IICBA, Universidad Autónoma del Estado de Morelos, Cuernavaca, 62209, Morelos, México
bDepartment of Chemistry and Physics, Mount Saint Vincent University, Halifax, Nova Scotia B3M2J6, Canada. E-mail: daniel.majaess@gmail.com; daniel.majaess@msvu.ca; cherif.matta@msvu.ca
cDepartment of Mathematics and Statistics, Mount Saint Vincent University, Halifax, Nova Scotia B3M2J6, Canada. E-mail: Tina.Harriott@msvu.ca
dHunter College & the PhD Program of the Graduate Center, City University of New York, New York, NY 10065, USA
eDepartment of Chemistry, Saint Mary's University, Halifax, Nova Scotia B3H3C3, Canada
fDépartement de Chimie, Université Laval, Québec, Québec G1V 0A6, Canada
gDepartamento de Físicoquímica Teórica, Secihti-Centro de Investigaciones Químicas, IICBA, Universidad Autónoma del Estado de Morelos, Cuernavaca, 62209, Morelos, México. E-mail: cercis.morerab@uaem.edu.mx
First published on 6th May 2025
Diffuse interstellar bands (DIBs) comprise over 550 celestial absorption features whose molecular carriers remain largely unidentified or contested. In this study, we present a statistical analysis that identifies two previously overlooked families of strongly correlated lines associated with the original Heger features at 5780 and 5797 Å. Comprehensive UV-vis spectra were computed at several levels of theory (mainly TD-PBE0 and EOM-CCSD with an aug-cc-pVTZ basis set) for the following candidates posited as diffuse interstellar band carriers (in both their neutral and cationic forms): 2-cyclopenten-1-one, 3(2H)-thiophenone, 2(5H)-furanone, 3(2H)-selenophenone, 3-hydroxypropanamide, oxamic acid, lactamide, and glycolamide. Glycolamide is of particular interest since it has recently been detected in microwave (rotational) spectra of the comparatively dense molecular cloud G+0.693–0027. Importantly, the computations reveal that the anions exhibit marginal electron affinities despite producing improbable lines (i.e., with excitations to levels above the ionization threshold) overlapping DIBs, whereas the neutral molecules yield lines shortward of DIBs and possibly linked to the broad 220 nm interstellar feature, and their cations produced too few lines in the DIB domain inspected. Further vetting of candidates awaits the construction of an expansive optical-infrared molecular ion database, which will facilitate concurrent matching to DIBs in the optical (electronic) and their energy differences in the mid-infrared (vibrational), thereby narrowing the parameter space.
Diffuse interstellar bands are absorption features observed in spectra of diverse celestial targets. When observed in binary star spectra these lines do not undergo the expected periodic Doppler shifts due to the stars' motion, and thus these features arise in the intervening interstellar medium.6–13 These absorptions are molecular in origin, since they are generally broader than atomic lines. The broadening of molecular lines in DIBs spectra arises from several factors including (but not necessarily limited to) overlapping rotational transitions (especially for large molecules with dense rotational energy levels), thermal Doppler broadening due to random molecular motions, turbulence (non-thermal) Doppler broadening due to the collective currents of large masses within a given cloud in different directions, the short lifetimes of the excited states, the presence of molecules with different isotopic compositions (isotopologues), and many more effects. These factors contribute significantly in making the identification of DIB carriers difficult.9–13
A century ago, Mary Lea Heger identified the first DIBs at 5780 and 5797 Å.6,7,14 Since this seminal discovery more than 550 DIBs have been identified.15 There is yet to exist an undisputed molecular carrier for any DIB, though a consensus is continuing to coalesce around the connection between C60+ and two DIBs (9577 and 9632 Å).16–19
Identifying groups of correlated DIBs is a crucial first step in detecting a molecular carrier.20–25 The “correlation” referred-to here is that between the ratios of the equivalent widths (defined below) of pairs of DIBs within a group (also known as “family”) regardless of the observational sightline. In other words, while the individual equivalent widths of a set of “correlated lines” may vary from one sightline to the next, the pairwise ratios of these equivalent widths are constant – within a high statistical confidence – irrespective of the dust cloud being observed. Diverse groups of interrelated lines (e.g., via Pearson correlations), and the sheer number of DIBs, imply there are numerous molecules responsible for these lines.
The purpose of this effort is to identify families of correlated lines, and propose a procedure to identify possible candidate carriers. The following procedure was adopted, and could be pertinent for similar endeavors:
1. Identification of correlated DIBs based on equivalent widths. The equivalent width (EW) of a DIB is the area of the absorption feature in a normalized spectrum. It is related to the “extinction” or strength (loosely speaking, the “intensity”) of the absorption line.
2. Bolstering the existence of these families by determining the correlations between the EWs of DIBs and interstellar reddening. Optical reddening arises from blue light being preferentially scattered when traversing dust clouds.
3. Using energy differences between correlated DIBs (a family) as a probe of the vibrational level spacings of the source ((non-)linear vibronic progression).
4. Using a molecular database to examine functional groups exhibiting infrared vibrations that agree with those implied by the wavelength differences between DIBs within a family.
5. Completing quantum chemistry computations of neutral molecules identified from comparisons to molecular databases, to assess if there are lines which directly overlap with observed DIBs. Evaluating their ion counterparts.
6. Benchmarking of the quantum chemical levels of theory, and assess their accuracy, precision, and systematic offsets.
Two families of DIBs, identified in steps (1) and (2), are of particular interest. These are the separate families tied to the first DIBs discovered by Mary Lea Heger at 5780 and 5797 Å. The identified 5780 Å family consists of 11 lines: 5236.27, 5487.64, 5609.82, 5705.12, 5779.59, 6195.99, 6203.58, 6269.89, 6284.05, 6993.12, and 7224.16 Å. DIBs associated with the 5797 Å family are 8 lines at: 5494.10, 5545.08, 5797.18, 5849.82, 6108.06, 6113.22, 6439.51, and 6449.27 Å.26 (Fig. 1).
Common correlations within a family should emerge relative to the EWs and reddening.27–31 For example, Merrill and Wilson29 demonstrate that the EWs of the DIB 6283 Å showed an increasing linear dependence on reddening. Optical reddenings for the stars in the APO catalog were adopted,15 and a comparatively common correlation was found between optical reddening and the EWs of the lines in each proposed family.32 This increases confidence in those two families tied to the Heger lines (5780 and 5797 Å), prompting further analysis.
The energy difference between highly correlated DIBs have been used to probe vibrational transition quanta.21 That approach is adopted here to search for possible molecular sources behind the DIB families. Energy differences between DIBs within each family were compared with (approximate) IR molecular spectra. Accurate spectra are only known for a small percentage of molecules.11,13,33 To circumvent this limitation, approximate molecular spectra will be used in this work. The Rapid Approximate Spectral Calculations for ALL (RASCALL 1.0)3 database contains approximate IR spectra of more than 16000 neutral molecules. To create this database Sousa-Silva et al.3 analysed 120 functional groups associated with 19 different bond types, and found the properties for their spectrally-active rovibrational modes. The spectrum of each molecule was then estimated by considering the contributions of these functional groups to the overall molecular spectrum.
The energy offsets between DIBs within a given family were converted to the corresponding wavelengths (μm), denoted as λoffset. These wavelengths were then compared to those featured in RASCALL (λRASCALL). Molecules that featured lines matching at least three predicted vibrational energies to within a threshold of |λoffset − λRASCALL| = 0.013 μm were saved. The selected molecules suggest certain functional groups that the bona fide family may contain, rather than identifying the broader specific carrier for that family. The veracity of the results was then tested using Monte Carlo simulations.
The Monte Carlo simulations were performed using an in-house Python code whereby 10000 independent random families of DIBs within the range of 4000–9000 Å (where DIBs are primarily observed) were generated, each family containing one of the two Heger lines. This was done separately for each of the real families tied to the lines 5780 Å and 5797 Å. The random families contained the same number of lines as in the real family associated with each of the lines 5780 Å and 5797 Å. That is, the random families created for 5780 Å had 11 lines, and that for 5797 Å had 8 lines. These 10
000 random families were analyzed using the same methods as the families associated with 5780 Å and 5797 Å and compared to molecules within the RASCALL 1.0 database using the same threshold |λoffset − λRASCALL| = 0.013 μm. The emergence of the candidate molecules associated with 5780 Å and 5797 Å from these random families were monitored. It is found that the candidate molecules are observed in less than 10% of the results for random families, suggesting that the results for the real families of DIBs are not coincidental.
The candidate neutral molecules were subsequently subjected to quantum chemical calculations to obtain their optimized molecular geometries, along with their UV-vis spectra at different levels of theory.
All calculations were conducted using the Gaussian16 suite of programs.34 The molecular geometries of all species were optimized in their lowest electronic state, followed by a harmonic frequency calculation to ensure that the optimized geometries are minima on the potential energy hypersurfaces (PES). The minima of the family consistent with the 5780 Å molecules have been previously determined (Ref. 59 and 60), while the family consistent with the 5797 Å molecules corresponds to rigid structures. Therefore, an extensive exploration of the PES was not considered necessary to locate the respective global minima.
None of the identified molecules in their neutral ground electronic states exhibit absorptions in the optical, only in the UV. Since ionization by electron loss or capture may shift their electronic spectra sufficiently to the Vis range, both cations and anions were considered. These ionized species were modelled by subtracting or adding one or two electron(s) (and not, for example by adding or removing protons). Thus, all singly-charged ionic species in this work are open-shell doublets, while the doubly-charged and the neutral species are singlet closed-shell systems.
Both vertical and adiabatic ionizations were considered. The vertical ionization is one in which the ion is frozen at the geometry optimized for the neutral species, and the adiabatic is one where the geometry of the ion is optimized before the final electronic structure is generated. The calculated adiabatic energy shifts include the zero-point vibrational energies (ZPEs). The ionization potential (IP), whether vertical or adiabatic, is defined as:
IP = Ecation − Eneutral, | (1) |
EA = Eanion − Eneutral. | (2) |
Since total energies are negative, when EA > 0, this implies that the energy of the neutral species is lower (i.e., more stable) than the anion.
All molecules were optimized using density functional theory (DFT)'s Perdew–Burke–Ernzerhof (PBE)35,36 GGA-XC functional. Additionally, optical spectra were computed using time-dependent density functional theory37–40 (TD-DFT),41,42 equation-of-motion coupled cluster with single and double excitations (EOM-CCSD)43–46 and state averaged complete-active-space-self-consistent-field (SA-CASSCF)47,48 with a (6,6) active space for neutral glycolamide (M0). Only for CASSCF calculations was the MOLPRO v2012.149,50 used. The hybrid PBE0 (Perdew–Burke–Ernzerhof exchange & 25% Hartree–Fock exchange plus full PBE energy correlation),51,52 was used for the TD-DFT calculations. All calculations were performed using Dunning's aug-cc-pVTZ correlation-consistent polarized valence triple-zeta basis set, augmented with diffuse function.53,54
![]() | ||
Fig. 2 The chemical structures of the molecules of interest for the 5780 Å (left column) and 5797 Å families (right column). |
The molecules for the 5780 Å family revealed some commonalities amongst the functional groups they may possibly contain. They all appear to possibly contain a primary amide and a hydroxyl group. The identification of certain types of chemical bonds and functional groups in interstellar medium is another piece in the jigsaw that can shed light on the nature of the carriers (see discussions in ref. 55). For the 5797 Å family, the identified molecules are similar in structure to 2-cyclopenten-1-one, each differing by one atom within the five-membered ring.
The strongest absorption band of most neutral molecules is found between 187–217 nm (Fig. 3a, 4 and Fig. S1, ESI†). That hints that the prominent 220 nm interstellar absorption feature may arise in part from the superposition of the UV spectra of these molecules.58
Different minimum energy conformers are possible for the neutral molecules consistent with the 5780 Å family. Glycolamide, 3-hidroxypropanamide, and oxamic acid show the lowest energy as –syn conformations with an intramolecular OH⋯O hydrogen bond59 (Fig. 2). Conversely, lactamide’s minimum energy corresponds to a trans conformer, forming an NH⋯O intramolecular hydrogen bond.60 The neutral molecules consistent with the 5780 Å family show the most intense bands in the far and near UV spectrum. For glycolamide, several bands near 200 nm reflect n–π* transitions, typical of the amide moiety.61 Fig. 3a depicts the n–π* HOMO–LUMO+5 transition of neutral glycolamide, and Fig. 4 shows other n–π* transitions for the rest of the 5780 Å family.
Table 1 shows the results obtained for the transitions to the first two excited states (S1: n–π* and S2: π–π*) with the three methods used in this work, namely, TD-PBE0, EOM-CCSD, and CASSCF, for neutral gycolamide.
Neutral glycolamide (M0) | ||||
---|---|---|---|---|
State | CASSCF E (eV), f | EOM-CCSD E (eV), f | TD-PBE0 E (eV), f | Exp.62 E (eV) |
S1 | 6.51, 0.0005 (ΔE = 0.86) | 6.02, 0.0001 (ΔE = 0.37) | 5.93, 0.0002 (ΔE = 0.28) | 5.65 |
S2 | 7.41, 0.0112 (ΔE = 0.09) | 6.90, 0.0116 (ΔE = 0.42) | 6.42, 0.0006 (ΔE = 0.90) | 7.32 |
ΔES2–S1 | 0.90 | 0.88 | 0.49 | 1.67 |
As we could not find experimental gas-phase absorption spectra of glycolamide, we compared the first two excited states of neutral glycolamide with the experimental UV-vis data for the simplest amide, i.e., formamide, which should be similar given the similarity of the electronic environments in the two amides and that the transition being compared occur between corresponding energy levels.
Hirst et al.62 reported the experimental spectra of NH2CHO and identified the lowest energy n–π* transition at 5.65 eV and the most intense π–π* transition at 7.32 eV. The EOM-CCSD for glycolamide shows a better agreement with the corresponding experimental transitions in formamide than CASSCF (6,6) and TD-PBE0. The three methods agree with the experiment in that the S2 π–π* transition is the most intense. For glycolamide, agreement with experiment is in the following decreasing order: EOM-CCSD > SA-CASSCF (6,6) > TD-PBE0.
For the neutral molecules matching the 5797 Å family, the intense bands in the UV correspond to a transition from the HOMO or HOMO−1 orbitals to the LUMO, which corresponds to a n–π* transition (Fig. S1, ESI†). M. Christianson et al. obtained the experimental UV-vis spectra of 2-cyclopenten-1-one with the n–π* band at 203 nm, which is comparable with the TD-PBE0/aug-cc-pVTZ predicted value (208 nm, Fig. S1a, ESI†). In addition to n–π* type bands, this family also presents π–π* transitions.
Fig. 3b displays the TD-DFT spectra of glycolamide in its +1 cation state. The calculated spectra of the remaining cation species (M = +1, +2) of the 5780 and 5797 Å families can be found in Fig. 4 and Fig. S1 (ESI†). Often, ionized molecules can be less stable than their neutral parents.67 For the 5780 Å family, all closed-shell singlet dicationic species dissociate into the alcohol radical (˙CH2OH for glycolamide), and the carbamoyl radical (˙CONH2). These two radicals have structural similarity to formaldehyde and formamide, respectively. Formaldehyde is present in the interstellar medium as Goldanskii demonstrated, especially relative to the tunneling of the formaldehyde monomer to produce polyoxymethylene (POM).68–71 Formamide is also believed to be present in the interstellar medium (ISM),72–77 and so are aminomethanol (NH2CH2OH) and the iminium ion (CH2NH2+), in addition to two open-shell (radical) molecules, the carbamoyl radical as a fragmentation product of di-ionization of glycolamide, in tandem with its isomer HCONH˙. Moreover, glycolamide is an isomer of glycine, which has also been found in the ISM. Glycine dications have been studied in gas-phase, and its fragmentation into two singly-charged cations is possible.67 Therefore, the possible fragmentation of the glycolamide dication in the ISM may likewise be feasible, or glycolamide-glycine isomerization could lead to new ISM species.
For the 5780 Å family, Fig. 4 shows that although for the M+1 species the transitions shift toward the visible region, there is no match with the DIBs observed for this family. For the 5797 Å family, two ionized states could be obtained. Fig. S1 (ESI†) shows the TD-DFT spectra obtained with a PBE0/aug-cc-pvtz level of theory. The radical (doublet) singly-ionized and the (singlet) doubly-ionized cations of the four molecules exhibit bands with oscillator strength f > 0.0002 below 500 nm. The most intense bands of the cation are located in the UV (Fig. S1, ESI†).
The M−1 and M−2 anions of the studied species are such that the added electron is either marginally bound or marginally unbound (Table 2 lists the IPs and EAs of the molecules of interest in their neutral closed-shell states). Hence, the absorption of an energetic photon necessary to induce an optical transition is most likely to eject that excess electron conferring the rest of the energy to the departing fragments' kinetic energy. The anions are, therefore, not pursued further in this paper.
Fig. 4 and Fig. S1 (ESI†) show that the single and double cationic forms of these two families do not exhibit bands above 500 nm, which again invalidates the cationic species as DIBs carriers.
Moleculea | IP (eV) | EA (eV) |
---|---|---|
Adiab./ZPE/Vert. | Adiab./ZPE/Vert. | |
a Adiabatic (Adiab.): energy difference between the ground state of the neutral system and the ground state of the ionized system, including structural relaxation. Vertical (Vert.): energy difference at the frozen geometry of the neutral system, excluding relaxation of the ion. ZPE: short for “adiabatic energy differences including zero-point vibrational corrections for both neutral and ionized states”. | ||
Glycolamide | 9.01/9.01/9.68 | 0.06/0.03/0.07 |
Lactamide | 8.68/8.66/9.20 | 0.11/0.06/0.15 |
3-Hydroxypropanamide | 8.73/8.73/9.08 | 0.03/0.01/0.06 |
Oxamic acid | 9.62/9.58/9.88 | −0.45/−0.54/−0.16 |
2-Cyclopenten-1-one | 8.95/8.89/9.02 | −0.06/−0.13/0.07 |
2(5H)-Furanone | 9.90/9.84/10.07 | −0.06/−0.15/0.02 |
3(2H)-Thiophenone | 8.69/8.67/8.79 | −0.22/−0.32/0.01 |
3(2H)-Selenophenone | 8.49/8.47/8.56 | −0.31/−0.41/−0.10 |
It is pertinent here that we quote Herbst,78 who, in turn, quotes Bill Klemperer—who proposed Sn- (n = 2, 3) in oxide glasses as DIBs carriers—that “there is no better way to lose a scientific reputation than to speculate on the carrier of the diffuse bands”. While no definitive carriers emerged, the work, nevertheless, suggests a roadmap combining quantum chemistry and observational astronomical spectroscopy in the quest for this century-old open problem. This study also underscores the importance of compiling electronic and vibrational spectra of, not only neutral molecules, but equally importantly of their ionized states. The difficulty of obtaining extensive tabulations of such spectra especially for ions bring to the fore the role of quantum chemistry in this problem especially with the “astronomical” advances in computer power, algorithms, and machine learning. An analysis of rovibronic spectra of charged species could perhaps yield more definite confirmations of DIB correlations, especially in the light of additional information gathered from the rotational substructure of each absorption peak.
We report two new families of highly correlated lines tied to the two Heger lines (5780 and 5797 Å). The correlated families are then analyzed, as described above, to posit a number of small organic molecules that may be related to these DIBs. Thus, electronic spectra were calculated for: glycolamide, 2-cyclopenten-1-one, 3(2H)-thiophenone, 2(5H)-furanone, 3(2H)-selenophenone, lactamide, 3-hydroxypropanamide, and oxamic acid. These particular molecules were selected on the basis of a comparison of the spacings between the DIBs within each family to a molecular database (RASCALL). The calculated spectra for the neutral and the cationic states of these molecules show that they are not DIB carriers. However, the same calculations suggest that these species absorb in the UV range within the broad 220 nm shoulder to which they may conceivably be among its carriers. Notably, glycolamide has been recently detected in the interstellar medium using high sensitivity Yebes 40 m and IRAM 30 m telescopes, which observed microwave radiation from the molecular cloud designated G+0.693–0027.82
Rivilla et al. report a column density for glycolamide of 7.4 ± 0.7 × 1012 cm−2, more than 11 orders of magnitudes less concentrated than molecular hydrogen (H2).82 Despite its comparative rarity with respect to the most abundant element in the universe (hydrogen), the presence of glycolamide within interstellar clouds is in tandem with the work conveyed here and suggests the that this molecule could be responsible—perhaps as a cation—for a subsample of DIB absorptions. Moreover, the detection of carbamoyl radicals in ISM,72 which are products of the Coulomb explosion of glycolamide after losing two electrons, bolsters the case for the glycolamide's presence in ISM and also for the propensity of this molecule to lose electrons in interstellar conditions.
Recently, an artificial intelligence reverse-engineering approach has been used in the context of drug-design, regressing from known properties to an actual compound with that set of properties.83 The amount of data required to obtain a high level of certainty for a one-to-one match of properties to a specific molecule requires a massive amount of properties data. A key objective moving forward is to reduce the expansive parameter space, thereby mitigating the number of candidate molecules. One pathway requires that a carrier emerge from simultaneously matching its ions in the electronic and vibrational (mid-IR) domains. Even stricter constraints could likewise be explored, whereby the selection is restricted to those molecules whose rotational signatures were observed in the microwave/radio (e.g., glycolamide).
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cp04023f |
This journal is © the Owner Societies 2025 |