Open Access Article
Laura Estévez
* and
Nicolás Otero
Universidade de Vigo, Departamento de Química Física, Facultade de Química, Vigo, 36310, Spain. E-mail: lestevez@uvigo.es
First published on 19th March 2026
Understanding the photoluminescent (PL) behavior of citric acid (CA)-derived carbon dots (CACDs) requires accounting for the contribution of molecular fluorophores (MFs) formed in situ during synthesis. Among these, citrazinic acid (CZA), CAGly – derived from CA and glycine, and 4-hydroxy-1H-pyrrolo[3,4-c]pyridine-1,3,6(2H,5H)-trione (HPPT) exhibit donor–acceptor groups capable of tautomerization and protonation–deprotonation equilibria, making their PL properties highly sensitive to pH and solvent effects. In this work, we combine CREST screening with DFT/TD-DFT calculations to explore the tautomeric landscape of neutral and ionized forms of these MFs in both ground (S0) and excited (S1) states. Our results reveal that the keto tautomer (DPR moiety) predominates in S0 for CZA and CAGly, whereas the enol tautomer (recovering the 2-pyridone ring) becomes the main phototautomer in S1 through an excited-state intramolecular proton transfer (ESIPT) process, accurately reproducing previously reported experimental emission wavelengths. This mechanism is strongly influenced by pH and is facilitated by water molecules that lower the ESIPT barrier, revealing the effect of solvent. For HPPT, PL arises from competing pathways involving ESIPT and excited-state deprotonation (ESDP), accounting for a large Stokes shift and pH-independent emission up to pH 10. Overall, these findings evidence that excitation-dependent fluorescence in CACDs may be better explained by tautomeric equilibria and proton-transfer dynamics rather than anti-Kasha behavior, providing a molecular-level framework for tuning optical properties in carbon-based luminescent materials.
The understanding of the reactions occurring during thermal treatment of CA with different nitrogen sources has improved markedly, though the correlation of photoluminescence of CACDs with their structure still lacks a comprehensive model. In particular, the core–shell model does not always fit with the experimental results.10 The mechanism of the molecular photoluminescence of CACDs has gained much attention recently, and several studies describing the formation of small fluorophore species at reaction conditions applied in CD synthesis are particularly noteworthy.11,12 Thus, a significant number of research works indicate the high importance of PL-emitting low-molecular weight compounds formed at an incipient stage or concurrently with the formation of CDs, as they significantly contribute to photoluminescence properties of CACDs.13,14 Indeed, great effort has been made to identify the structure of these molecular fluorophores (MFs) and nowadays the structures of many of these MFs are well-documented.8,11–15
Typically, these MFs contain donor and acceptor groups capable of intramolecular proton transfer, including keto–enol tautomerization, which may imply the coexistence of tautomers in dynamic equilibrium. Since structural changes between tautomeric forms are accompanied by variations in physicochemical properties,16–18 identifying the dominant tautomer – or a set of them – is essential for understanding their optical behaviour.19,20 The presence of these functional groups in the molecular structure suggests that the surrounding pH will govern the equilibrium between neutral and ionized species, potentially resulting in distinct photoluminescent (PL) properties.21 Thus, CACDs and MFs have been explored as pH sensing probes.21–24 Furthermore, it has been shown that tautomerism and aggregation could cooperate in shaping the PL output and that under extreme pH, protonation or deprotonation-driven aggregation further generated new, weakly emissive species.25–27 Aggregation into H- or J-type assemblies is known to generate new excitonic states and red- or blue-shifted emission, as reported for citrazinic-acid-based systems and other carbon-derived fluorophores. Moreover, pH-dependent protonation/deprotonation can promote or suppress such aggregation, thereby influencing the balance between emissive pathways. While aggregation is not the focus of the present study, these effects are acknowledged here as they may conceptually coexist with tautomerism in related systems. Therefore, characterization of the dominant tautomeric and ionized forms will afford critical guidance for evaluating the aggregation structures most likely to occur.
The blue-emitting citrazinic acid (CZA) is the most studied compound of this class obtained as the product of thermal condensation of CA and ammonia in 1893 by Sell and Easterfield.28 Later,15 it was found that upon heating of mixtures of urea and citric or citrazinic acid under anhydrous conditions, a green-emitting fluorophore can be readily obtained, identified as (4-hydroxy-1H-pyrrolo[3,4-c]pyridine-1,3,6(2H,5H)-trione – HPPT), featuring as well proton donor and acceptor groups leading to possible coexistence of tautomers.29 This investigation15 also found that depending on the conditions, allowing water evaporation or not, HPPT or CZA, respectively, can be obtained exclusively.
Scheme 1 outlines a stepwise mechanism for the formation of CZA and HPPT, guided by the mechanistic considerations discussed in ref. 15. Notably, the reaction between CA and NH3 proceeds analogously to the pathway proposed for MFs derived from CA and amino acids, including glycine (CAGly).8 According to this mechanistic scheme, the structure obtained for CZA differs from the commonly depicted representation. Instead, it resembles a dioxo-pyridine-ring (DPR)-containing MF, like CAGly,8 but optical properties of CZA have never been associated with those exhibited by DPR-type MFs.8 The structure most frequently used to represent CZA corresponds to what we labelled as the enol tautomer (featuring the 2-pyridone moiety), rather than the keto tautomer (comprising the DPR ring).
![]() | ||
| Scheme 1 Stepwise formation of CZA and HPPT. In red, atoms belong to CA; in black, atoms belong to urea. | ||
Compared to CA-based MFs containing the 2-pyridone ring, which typically exhibit high PL QY and well-defined emission governed by Kasha's rule, DPR-type fluorophores display markedly different photophysical behaviour: lower QY and excitation dependent fluorescence, often explained by invoking anti-Kasha's emission.8,30 However, this phenomenon may also be attributed to emitters undergoing tautomerization, each with a S1 emitting state, or explained by the coexistence of multiple tautomers in the S0 ground state, each characterized by distinct excitation and emission wavelengths.18,31 Several studies32–36 have shown that many molecular systems and nanomaterials exhibit dual emission and large Stokes shifts arising from Excited-State Intramolecular Proton Transfer (ESIPT). These studies demonstrate that proton donors/acceptors and hydrogen-bond networks strongly modulate the efficiency of ESIPT and the associated activation barriers.
In this work, we demonstrate that the most stable tautomer of CZA in the S0 state is the keto form (Scheme 1). However, this is no longer the case in the S1 state, where an ESIPT process leads to the formation of the CZA enol phototautomer. We also investigate the effect of pH on the tautomers.
We expand the study to two other MF's: HPPT and CAGly, the latter serving as a representative DPR-type MF. While its structural characterization was reported in 2017 (ref. 8) and has not been further expanded upon, this experimental assignment remains the only available reference and has not been challenged in subsequent research.
Our results demonstrate that the coexistence of several tautomers of neutral and their ionized counterparts, each with unique photoluminescent (PL) properties, highlights the complexity of the system and its sensitivity to the environment, providing a plausible alternative explanation for the excitation-dependent emission that does not require invoking anti-Kasha behavior.37,38
We use computational chemistry methods as they offer a reliable means to achieve these objectives, particularly in cases where experimental techniques may lack sensitivity—especially for detecting multiple tautomers arising from rapid intramolecular proton transfer processes39,40—or, as demonstrated below, when the tautomer responsible for emission only appears in the excited state.
Neutral and charged CZA and HPPT tautomers have been considered in previous computational investigations.14,41–44 These studies, however, have not been carried out exhaustively as to the best of our knowledge the CZA keto tautomer has not been previously claimed as the most stable tautomer, as we will demonstrate later in this work. In addition, the CZA keto tautomer comes up by following the formation mechanism (Scheme 1).
We started our investigation by searching for all tautomers of neutral CZA, HPPT and CAGly by using CREST (see Methods sections for further details). From these results, the keto-tautomer of CZA shows up as the most stable, later confirmed at a quantum chemistry level. CREST was also employed to locate the most probable tautomers of the ionized forms of the three evaluated MFs: those resulting from removing one, two or three protons (when possible) as simulating an increase of pH or those resulting from protonation as simulating extreme acidic conditions where neutral forms may be protonated to some extent.17 Optimization with DFT and TD-DFT of selected tautomers (those obtained with CREST within an energy window of 20 kcal mol−1) at both S0 and S1 states, respectively, let us establish the tautomeric ratios for the different pH-dependent forms.
As far as neutral CZA, our results (Scheme 2) reveal the almost exclusive predominance of the keto-tautomer, with a keto (CZA_N1)–enol (CZA_N2) ratio of ca. 99.7/0.2 (ΔG = +3.8 kcal mol−1), in S0 state, with a contribution of 0.1% of CZA_N3 (ΔG = +4.1 kcal mol−1).45 To assess whether microsolvation could influence the relative stability of the tautomers, we performed additional calculations including up to seven explicit water molecules in representative microsolvated clusters. In all cases, microsolvation preserved the preference for the keto tautomer (), which remained more stable than CZA_N2 or CZA_N3 (Fig. S1). These results indicate that specific hydrogen-bonding interactions do not overturn the stability trend predicted with the implicit solvation model. Furthermore, the CZA_N1 → CZA_N2 tautomerization pathway was characterized, yielding a transition-state barrier of 17.2 kcal mol−1, confirming that the interconversion is limited by both thermodynamic and kinetic factors. Overall, these analyses demonstrate that even when explicit microsolvation is considered, the population of the enol tautomer is expected to remain low under the studied conditions.
We recall that the keto form, CZA_N1, featuring the DPR moiety, is the structure obtained following the mechanistic path described in Scheme 1, while the CZA_N2 or CZA_N3 are both the commonly ones used to depict it. To substantiate our computational results, we conducted a literature search,27,41,44 to collect NMR data reported for CZA. Nevertheless, NMR spectra of CZA have been acquired either in D2O–NaOD or DMSO-d6, conditions under which strong solvent effects (double deprotonation or specific intermolecular hydrogen bond,46,47 IHB) significantly alter the tautomeric equilibrium. Consequently, these data cannot be directly compared to the neutral keto tautomer (CZA_N1). Nevertheless, the comparative analysis in DMSO-d6 is consistent with the presence of CZA_N1, though CZA_N2 and CZA_N3 may be majority as both can established up to three IHB's with DMSO. To address this point, we performed a full computational analysis including explicit-solvent DFT models and simulated NMR spectra, which are discussed in detail in the SI.
A similar keto–enol tautomeric ratio (NN1/NN2 = 99.8/0.2) is obtained for neutral CAGly (Scheme 3). It is worth mentioning that the keto tautomer, CAGly_N1, is indeed the structure used to represent it.8 To further support this assignment, we conducted a computational analysis to compare the calculated chemical shifts with the DMSO-d6 NMR experimental data reported in ref. 8. The results are consistent with the proposed structure, suggesting that the experimental spectra can be explained by a mixture of CAGly_N1 and CAGly_N2 whose relative abundances are influenced by IHBs with DMSO, as also observed for CZA.
Interestingly, the tautomeric ratio is reversed in the S1 state, where the enol tautomer becomes the only species with a non-zero population for both CZA (CZA_N2*) and CAGly (CAGly_N2*). In the excited state, the enol form is favored by a Gibbs energy difference of approximately 10 kcal mol−1, leading to its exclusive population at the S1 state.
When analyzing the anionic species, we found a similar behavior: although the most stable anion in the S0 state corresponds to the keto tautomer, this preference no longer holds in the S1 state.
Four anionic structures were identified for CZA (see SI), although only three of them are significantly representative of the tautomeric distribution in the ground state (Scheme 2). The most stable anion, CZA_A1, which retains the DPR moiety, lies 3.2 kcal mol−1 below the enol-like tautomer CZA_A2. In contrast, and in close analogy to the behavior observed for the neutral species, CZA_A2 becomes the most stable form in the S1 state, accounting for 99.9% of the excited-state population. The keto-type anion CZA_A1* is disfavored in S1 (ΔG = 4.2 kcal mol−1) and contributes only a negligible 0.1% to the population.
A similar situation is observed for the anionic species (A) of CAGly. Because CAGly contains two carboxylic groups, first deprotonation occurs preferentially at the most acidic one—derived from the citric acid fragment (Scheme 1)—leading to CAGly_A1 as the most stable anion. The tautomer CAGly_A2 lies close in energy (ΔG = 1.4 kcal mol−1), while CAGly_A3 is higher in energy (ΔG = +2.8 kcal mol−1). These relative stabilities result in ground-state (S0) populations of 90.9%, 8.3%, and 0.8% for CAGly_A1, CAGly_A2, and CAGly_A3, respectively. In the S1 state, however, the situation reverses: CAGly_A3* becomes by far the most stable anion and thus the dominant phototautomer, with CAGly_A1* contributing only marginally (ΔG = +4.1 kcal mol−1).
Upon second and third deprotonation of CZA, only the enol-like tautomers—CZA_dA and CZA_tA, respectively—which possess the 2-pyridone moiety, retain a non-zero population in both the S0 and S1 states. This behavior contrasts with that of CAGly, where the dianion CAGly_dA still features the DPR moiety and remains the most stable species in S0, while the enol form becomes overwhelmingly favored in the S1 state, reaching a population of 99.7%. Upon third deprotonation, CAGly yields CAGly_tA, which adopts an enol-like structure and is the sole species present in both S0 and S1.
Concerning HPPT, it has two fused rings and a larger number of carbonyl and amino groups, increasing the number of possible tautomers (Scheme 4), making different its PL properties: the keto/enol tautomerization is not playing any role but more than one phototautomer can coexist in the ground or in the excited state.
The most stable neutral tautomer of HPPT is HPPT_N1 (94.9%) at S0 which features the most common molecular representation, and it agrees with other findings,29 yet we found HPPT_N2 with non-zero probability (NN2 = 4.9%). Notably, HPPT_N2* reaches a population of 40.1% at S1 and would be coexisting with HPPT_N4* (NN4* = 59.8%), being the population of HPPT_N1* close to zero. It is also noted that HPPT_N3, the keto tautomer, has almost negligible population at both S0 and S1 states.
Despite the larger number of carbonyl and amino groups, tautomers of the anionic form limits to one, HPPT_A1. Two tautomers for the dianionic form were located, HPPT_dA1 and HPPT_dA2, but the latter with only a minor contribution at S1 state (NdA2* = 1.7%). Similarly, the cationic form in the ground state is limited to one tautomer, HPPT_C1, but it vanishes at S1, being replaced by two tautomers HPPT_C2 and HPPT_C3, with populations of NC2* = 98.6% and NC3* = 1.4%, respectively. However, HPPT has been described as a superacid29 therefore, cationic species are predicted to be present only under extremely acidic conditions. In addition, it has been found that hydrolysis of HPPT_A1 occurs before deprotonation, leading to a non-emissive compound.29 Nevertheless, we have included these forms in the SI for completeness.
As noted above, the most stable tautomer in the ground state does not necessarily remain so in the excited state and more worthy, it may turn out that it is neither the phototautomer. This is the case for both CZA and CAGly. Our results indicate that the absorption maxima of CZA_N1 and CAGly_N1 are located at 311.1 nm and 308.2 nm, respectively, but emission from S1 occurs within the UV spectrum region. In contrast, the excited-state tautomers CZA_N2* and CAGly_N2*, which are the sole tautomeric species present in the S1 state, shows fluorescence at 443 nm and 448 nm, respectively, in good agreement with previously reported experimental data.8,27 These findings lead us to consider an excited state tautomerization before emission could happen; a process usually claimed to explain PL, specifically, when large Stokes shifts are observed.48 Accordingly, photoexcitation converts the neutral CZA tautomer CZA_N1 into CZA_N1*, which subsequently undergoes keto to enol tautomerization leading to CZA_N2*. This transformation is strongly thermodynamically favored (ΔGN1*→N2* = −24.0 kcal mol−1) and is associated with a low kinetic barrier (TSN1*→N2* = 2.9 kcal mol−1) making it fully accessible. The resulting CZA_N2* is then responsible for the observed fluorescence emission (Fig. 1). Looking at the molecular orbitals involved in the N1 → N1* electronic transition (Fig. 1), it corresponds to a weak n → π* excitation27 (f ≈ 0; as expected in symmetry-forbidden electronic transitions)49 originating from the C
O lone pair while N2 ← N2* results from a more intense transition of π ← π* character (f = 0.12).
![]() | ||
| Fig. 1 ESIPT from keto (CZA_N1*) to enol phototautomer (CZA_N2*) explaining the PL of CZA (a similar process has found for neutral CAGly with similar values). | ||
Cationic species of CZA and CAGly may become predominant over their neutral counterparts under sufficiently acidic conditions (pH < 1).22,27 Only one cationic tautomer (Schemes 2 and 3) may exist with population different from zero but with emission in the UV region. When the pH is increased, the equilibrium progressively shifts toward the deprotonated species, becoming noticeable above pH ≈ 4 according to our pKa estimations (see SI). As commented above, the most stable anion for CZA and CAGly in the ground state no longer holds in the S1 state yet, upon photoexcitation CZA_A1* and CAGly_A1* may eventually experience ESIPT leading, respectively, to the corresponding phototautomer CZA_A2* and CAGly_A3* with fluorescence emission at 521 nm (f = 0.17) and 525 nm (f = 0.16), respectively. Upon further deprotonation, CZA and CAGly yield CZA_dA (pH > 11) and CAGly_dA1 (4 < pH < 7), respectively. In the case of CZA, both CZA_dA and the trianionic CZA_tA are predicted to be the most stable tautomers in both the ground (S0) and excited (S1) states. These species display fluorescence emissions at 433 nm (f = 0.25) and 450 nm (f = 0.18). For dianionic CAGly, an ESIPT pathway must be considered—consistent with the behavior observed for the other protonation states—from CAGly_dA1* (λem = 615, f ≈ 0) to CAGly_dA2* (λem = 530 nm, f = 0.15) to rationalize the presence of intense fluorescence.
Regarding the optical properties of HPPT, previously reported experimental data29 show absorption maxima at approximately 365 nm and 405 nm, corresponding to the neutral and anionic forms, respectively, which agree well with the computed values for HPPT_N1 (NN1 = 96.4%; λabs = 357 nm; f = 0.18) and HPPT_A (NA = 100%; λabs = 406 nm; f = 0.18), respectively (Scheme 4). Our computational results indicate that diverse radiative pathways may occur after photoexcitation of neutral HTTP_N1 at 357 nm yielding HPPT_N1* (Fig. 2). These include: (i) ESIPT leading to the two stable tautomers in the S1 state, HPPT_N2* and HPPT_N4*; both readily accessible in protic solvents. Their population-weighted emission maximum is predicted near 536 nm (Fig. S8), although ESIPT may preferentially produce the most stable tautomer, HPPT_N4*, which emits at λem = 562 nm, in line with literature experimental data;14 and (ii) excited-state deprotonation (ESDP) of HPPT_N1*, generating HPPT_A*, which emits at 535 nm (f = 0.19). This emission is also obtained when HPPT_A is the sole species present in solution, in agreement with the experimentally recorded data (in the pH range 1–10).29
![]() | ||
| Fig. 2 Schematic representation of the radiative pathways accessible to HPPT_N1: (A) ESIPT to phototautomers HPPT_N2* and HPPT_N4* or (B) ESDP to HPPT_A1* phototautomer. | ||
C– or N–H⋯N
C– moieties. In these kinds of systems, it has been noted that polar protic solvents reduce ESIPT as solvent molecules could limit this incipient ESIPT transition state toward the enol tautomer –N⋯H–O/N–C–.31,50,51
This is not the case for CZA and CAGly systems, instead we claim water assisted keto → enol (pseudo)48,50 ESIPT to rationalize the observed PL properties of CZA and CAGly as model systems for DPR MFs that may help to explain optical features of CACDs containing these MF's.
ESIPT may occur either reversibly or irreversibly and can operate under kinetic or thermodynamic control. Each regime exhibits characteristic photophysical features, as demonstrated in numerous experimental studies, although the interpretation of ESIPT signatures can be challenging due to the influence of multiple competing factors.32
Computational studies aimed at quantifying ESIPT energy barriers are still limited, and those available rely on constrained scans of the S1 and S0 potential-energy surfaces (PES) rather than on the explicit identification of transition states.52,53 Therefore, no literature data exist for direct benchmarking of our computed ESIPT transition states and their associated barriers. Likewise, virtually no studies have examined water-assisted ESIPT in the excited state, despite the well-established role of water in lowering proton-transfer barriers in the ground state.54 In agreement with these ground-state findings, our calculations show a substantial reduction in the keto-to-enol barrier when explicit water molecules are included (see above). This behavior provided a strong motivation for applying the same strategy in the excited state to locate the corresponding S1 transition states.
We have located the transition state for the keto to enol tautomerization at S1, for CZA. Notably, the activation energy significantly reduces on going from none up to three water molecules. We have confirmed this asynchronous TS54 connects to the local minimum corresponding to the reactant (local minimum on the S1 PES after some steps of relaxation from the initially excited Franck–Condon (F–C*) state32,55) from which the barrier is computed (ΔG#3 W CZA_N1*→CZA_N2* = 2.8 kcal mol−1, Fig. 3).56 The thermodynamics is also favoured (ΔG = −24.0 kcal mol−1 from local minima connected to the TS). This large energy difference suggests an irreversible regime governing the ESIPT process, leading to emission exclusively from CZA_N2*.32
At pH values above 4, where the anionic species of CZA would be present—or even constitute the predominant form—ESIPT must occur for bright fluorescence to be observed (Table S1). Thus, we computed a slightly lower barrier for the CZA_A1* → CZA_A2* conversion (ΔG#3 W CZA_A1*→CZA_A2* = 2.2 kcal mol−1) than for neutral form (see above). However, the thermodynamic driving force is weaker (ΔG = −5.1 kcal mol−1), suggesting that the process may proceed in a reversible manner. In this scenario, thermodynamic rather than kinetic factors may determine the presence of the CZA_A1* form in the emission (Table S1). As CZA_dA and CZA_tA correspond to enol tautomers in the ground state, ESIPT is not needed to explain their ∼427–450 nm emission (being emission at 446 nm from the CZA_N2*). Overall, CZA will feature near pH independent emission spectrum. This could be experimentally seen as having no pH effects on the emission wavelength.29,30
An analogous analysis with CAGly lead us to find a low barrier for CAGly_N1* to CAGly_N2* tautomerization at 2.5 kcal mol−1 and favourable thermodynamics ΔG = −21.6 kcal mol−1. For the anion form a similar outcome as with CZA is expected. However, as CAGly anion, dianion and neutral form may coexist in neutral pH (see SI), more complex emission spectrum may be expected.8
Our results evidence that understanding tautomeric dynamics and their pH dependence provides a molecular-level basis for tuning the optical properties of CACDs. This study substantiates that tautomerism and pH are key factors determining PL properties of CA-derived MFs, which may also serve as models for understanding CA-based biomaterials optical behaviour.
Although additional experimental data would certainly strengthen the present conclusions, computational chemistry can still provide valuable insight into photophysical processes that remain only partially explored—particularly the detailed ESIPT mechanisms in these MFs. Future work should therefore integrate quantum molecular dynamics (QMD) simulations to better capture the role of the solvent, including the conditions under which dimers or higher aggregates may form and whether the most stable dimer in the ground state remains preferred in the S1 state. Moreover, classical or hybrid MD simulations could help describe the interactions among the different tautomeric and ionic forms in the presence of model carbon dots, thereby establishing a more complete picture of how these species behave in realistic environments. Such studies would significantly advance our understanding of the photophysics of these MFs and of the CDs derived from them.
![]() | (1) |
When applying the Boltzmann distribution to obtain relative tautomer populations in S1 is assumed that vibrational relaxation and tautomer interconversion in the excited state occur much faster than the S1 lifetime. This allows S1 to reach a quasi-thermal equilibrium, so ΔG(S1) reliably determines the population distribution.
| This journal is © The Royal Society of Chemistry 2026 |