Open Access Article
Andrea
Canal-Martín‡
a,
Claudio D.
Navo‡
b,
Elena
Sáez
c,
Dolores
Molero
c,
Gonzalo
Jiménez-Osés
*bd and
Ruth
Pérez-Fernández
*a
aStructural and Chemical Biology Department, Centro de Investigaciones Biológicas Margarita Salas, CIB-CSIC, Madrid 28040, Spain. E-mail: ruth.perez@csic.es
bCenter for Cooperative Research in Biosciences (CIC bioGUNE), Basque Research and Technology Alliance (BRTA), Bizkaia Technology Park, Building 800, 48160 Derio, Spain. E-mail: gjoses@cicbiogune.es
cCAI de RMN, Universidad Complutense de Madrid, 28040 Madrid, Spain
dIkerbasque, Basque Foundation for Science, 48013 Bilbao, Spain
First published on 5th August 2021
Hydrazone bond formation is a versatile reaction employed in several research fields. It is one of the most popular reversible reactions in dynamic combinatorial chemistry. Under physiological conditions, hydrazone exchange benefits from the addition of a nucleophilic catalyst. We report a mechanistic study and superior performance of electron-rich p-substituted aniline derivatives as catalysts for efficient hydrazone formation and exchange in both protic and aprotic solvents. Rigorous kinetic analyses demonstrate that imine formation with 3-hydroxy-4-nitrobenzaldehyde and aniline derivatives proceeds with unprecedented third-order kinetics in which the aldehyde consistently shows a partial order of two. Computational investigations provide insights into the mechanisms of these transformations.
N bond, (ii) the stability to hydrolysis of the C
N bond under neutral conditions due to its mesomeric effect, (iii) an imine carbon that has both electrophilic and nucleophilic characters; and (iv) an acidic NH proton used in anion sensing, metal coordination or intramolecular H-bonding.13
However, despite the appealing hydrazone bond properties, the relatively slow rate of its formation under physiological conditions has limited its applications (Fig. 1).
![]() | ||
| Fig. 1 Acylhydrazone formation for DCC at physiological pH (a) in the absence and (b) in the presence of aniline as a catalyst. | ||
Over the last fifteen years, several groups have used nucleophilic catalysts to speed up this reaction, including aniline as a catalyst.14–16
Early comparative polarimetric studies of kinetics and mechanisms regarding hydrazone, semicarbazone and oxime formation were reported by Stempel et al. in the 1940s.17 Mechanistic studies on semicarbazone and oxime formation provided by Jencks in the 1960s18 showed that the rate-limiting reaction was the condensation of the substrate with the catalyst to form the activated imine, prior to the transimination reaction. Studies showing how the structural features of aldehydes and hydrazines influence the hydrazone reaction19,20 and the hydrolytic stability of the hydrazone bond21 have been reported.
On the other hand, hydrazone exchange has been extensively used in dynamic combinatorial chemistry (DCC), being one of the most popular reversible chemistries.22,23 DCC is based on adaptive molecular networks, named dynamic combinatorial libraries (DCLs), which under thermodynamic control and through reversible bonds respond to external stimuli. Protein-directed DCC has been proven to be an excellent tool to speed up the drug discovery process allowing the protein target to find its best ligand in situ.24 We are interested in the application of hydrazone exchange under physiological conditions to protein-directed dynamic combinatorial systems. Acidic conditions were used to speed up bond formation and cleavage until the addition of aniline as a nucleophilic catalyst for acylhydrazone DCC at physiological pH was reported.25,26 Mechanistically, aniline reacts with the aldehyde as a nucleophilic catalyst to form an aromatic Schiff base intermediate (Fig. 1b). This intermediate acts as an electrophile. The higher reactivity of Schiff bases compared to that of their parent aldehydes toward nitrogen nucleophiles enables the transimination step to achieve the hydrazone/acylhydrazone exchange. Nevertheless, the requirement of a high concentration of aniline and its toxicity reported in cells prompted the search for alternative catalysts.27–30 Note that in catalyzed DCC, unlike in conventional catalysis, the amount of catalyst used is not limited to substoichiometric concentrations.31
Recently, modelling studies on hydrazone exchange in the absence of a catalyst supported the hypothesis that the rate-limiting step in hydrazone exchange is the nucleophilic attack on the protonated hydrazone,32,33 whereas in hydrazone formation the rate-limiting step in water at neutral pH was the cleavage of the carbinolamine intermediate to eliminate water.18c
Studies regarding the effectiveness of nucleophilic catalysts in aqueous solvents have been reported and the reaction rate of the global process is described, assuming that the imine formation is always the rate-limiting step.18c,34 Nonetheless, the acylhydrazone formation mechanism has been rarely studied as a multistep process.
Here we present a detailed study of three different p-substituted aniline derivatives compared to aniline in acylhydrazone formation. We performed kinetic studies by NMR spectroscopy on imine and acylhydrazone formation as separate reactions in a polar aprotic solvent (acetonitrile) in analogy to previous reports.35 Furthermore, the acylhydrazone reaction was performed in water in the absence and presence of the catalysts, and in a dynamic combinatorial chemistry system which confirmed the efficiency of p-toluidine and p-anisidine in acylhydrazone exchange at physiological pH and low temperatures.
A dynamic equilibrium between both reagents and the corresponding hemiaminal intermediate (HA) was reached before the first experiment was performed, yielding a ratio of around 4
:
2
:
1 of compounds 1
:
2
:
HA, respectively. This ratio was maintained, and no new signals appeared after more than 5 h, indicating that the subsequent dehydration reaction to form the final acylhydrazone 3 did not take place at a measurable rate. From these data, the equilibrium constant for the first step of the imine formation reaction and, thus the ratio between the forward (k1) and reverse (k−1) reaction rates was found to be: Keq = k1/k−1 = 12.7 M−1.
The same reaction between aldehyde 1 and acylhydrazide 2 was then analyzed in the presence of different aniline derivatives 4a–d as catalysts, namely 4a: aniline, 4b: p-toluidine, 4c: p-anisidine and 4d: p-bromoaniline. For a better understanding of the global process the two consecutive reactions, namely imine formation between aldehyde 1 and aniline derivatives 4a–d and subsequent imine exchange with acylhydrazide 2 to yield acylhydrazone 3, were studied separately (Fig. 3). The estimated reaction rate constants for all the studied reactions are summarized in Table 1.
| Aniline | R | Reaction 1 + 4a–d | Reaction 5a–d + 2 |
|---|---|---|---|
| k obs (M−2 s−1) | k obs (M−2 s−1) | ||
| 4a | H | 0.0030 ± 0.0026 | 0.5795 ± 0.3887 |
| 4b | Me | 0.0107 ± 0.0020 | 0.0686 ± 0.0476 |
| 4c | OMe | 0.0365 ± 0.0093 | 0.0521 ± 0.0252 |
| 4d | Br | 0.0006 ± 0.0013 | n.d. |
For the first reaction (i.e. imine formation), compounds 5a–d formed smoothly at measurable reaction rates, and no signals corresponding to the hemiaminal intermediates (HA) were observed in the NMR spectra. This suggests that a fast-dynamic equilibrium between reagents 1 and 4a–d and the corresponding hemiaminal intermediates is reached in which the reverse nucleophilic addition reaction is much faster than the forward process, and that elimination of water from intermediates is also fast.
By performing the reactions using different sets of initial concentrations for the reagents and using the initial rate method, an unexpected partial order of 2 for aldehyde 1 and an expected order of 1 for anilines 4a–d were determined. To the best of our knowledge, such a second-order character in aldehydes has been only reported for processes involving nucleophilically activated aromatic aldehydes such as the Cannizaro36 and benzoin condensation37 reactions. Possible mechanistic explanations for this behavior, including catalysis by a benzoic derivative of aldehyde 1 (labelled as 1′, see below) are summarized in the ESI.† Third-order integrated rate laws for reactions in which the initial concentrations were either different or equal were derived, which allowed the estimation of the observed reaction rate constants; kobs (see the ESI†). As expected, the reaction with p-anisidine 4c was observed to be the fastest process due to its higher nucleophilic character (kobs = 0.0365 M−2 s−1), while the reactions with the other aniline derivatives were 3 to 61 times slower (Table 1).
Identical analyses were performed for the second reaction in the global aniline-catalyzed process, namely the reaction between acylhydrazide 2 and freshly synthesized and purified imines 5a–c. Imine 5d was discarded due to its poor solubility in CD3CN at the concentrations required for NMR analysis, which precluded obtaining reliable and reproducible initial concentration values. Before the first experiment was performed, a fast-dynamic equilibrium was reached between both reagents and the corresponding aminal intermediates (AI-a–c), which could be detected by 1H NMR in nearly all cases (Fig. 3 and the ESI†). The final acylhydrazone 3 was then steadily formed as a mixture of Z/E isomers from the corresponding intermediates. This reaction mechanism was also found to be second-order in the imine and first-order in the acylhydrazide, although the deviation from integer partial orders was quite significant in some cases such as with imine 5a. As a consequence, the resulting third-order integrated rate laws provided quite different kobs values depending on the initial concentrations of the reagents and thus such values must be taken with caution. As expected, the reaction with the imine derived from p-anisidine 5c was observed to be the slowest one due to its poor electrophilic character (kobs = 0.0521 M−2 s−1) (Table 1). Nevertheless, as can be seen in Fig. 3, when nearly equimolecular concentrations of reactants were used, it was clear that this second reaction (i.e. acylhydrazone formation) was always faster than the first reaction (i.e. imine formation), which is rate-limiting in all cases.
Anilines para-substituted with electron-donating groups 4b (R = Me) and 4c (R = OMe) were determined to be the most effective catalysts for the global reaction between aldehyde 1 and acylhydrazide 2 in acetonitrile solution. In fact, predicting the consequences of varying the electronic properties of the substituents at the para position of aniline catalysts is not trivial in this type of multi-step reaction. On one hand, for the first reaction (imine formation) electron-donating substituents (p-OMe and p-Me) are expected to lower the activation barriers for the initial nucleophilic attack on the aldehyde and, to a lower extent, for water elimination (i.e. the lone electron pairs of the nitrogen atom are more localized and thus more available for forming the double C
N bond). On the other hand, for the second reaction (hydrazone formation), electron-donating substituents (p-OMe and p-Me) are expected to increase the activation barriers for the nucleophilic attack of the acylhydrazide on the imine (i.e. imines are less electrophilic) and aniline elimination (i.e. the anilines are worse leaving groups). Thus, since so many equilibria and microkinetic steps are involved in the global reaction, the effects of the aniline substituents on the global reaction rate will be determined by the rate-limiting step (i.e. the step with the highest activation barrier). The observation that electron-rich anilines (p-OMe and p-Me) accelerate the global reaction indicates that the rate-limiting steps occur at the first reaction (i.e. formation of the imine), either in the initial nucleophilic attack on the aldehyde or water elimination steps. Performing the reaction in different solvents (i.e. aqueous or not aqueous) can affect the relative rates of each step, water elimination being expected to be unfavored in aqueous solvents as previously reported. Our results stress the importance of studying the accelerating properties of a given catalyst in the different sequential reaction steps, and are in good agreement with the results observed in aqueous solvents with similar reagents (vide infra).
The mechanism of hydrazone exchange in the presence of aniline catalysts was further studied through quantum mechanical calculations in an implicit solvent (acetonitrile) at the SMD(CH3CN)/M06-2X/6-311++G(2d,p)//PCM(CH3CN)/M06-2X/6-31+G(d,p) level (see Computational details and the ESI†). For the initial imine formation reaction (i.e. condensation of aldehyde 1 with anilines) no transition structures describing the direct nucleophilic addition to the aldehyde carbonyl group – either in its protonated or neutral forms – could be located. However, assisting the nucleophilic attack with protic molecules acting as proton shuttles, such as water, the ortho-nitrophenol moiety of aldehyde 1 or a benzoic acid derivative generated from or present as an impurity of aldehyde 1 not detected by 1H NMR (labelled as 1′),38–40 did allow locating feasible transition structures for both the nucleophilic addition (ts1 a–d) and water elimination (ts2 a–d) elementary steps (see the ESI†). Hence, catalysis by the benzoic acid derivative 1′ yielded the lowest activation barriers (ΔG‡) for these processes and, more importantly, suggested that water elimination is the rate-limiting step to yield imines 5a–d in agreement with previous reports17,18 and reproduced the experimental reactivity trends: ΔG‡ (ts2c) < ΔG‡ (ts2 b) < ΔG‡ (ts2 a) < ΔG‡ (ts2 d) (Fig. 4).
Such a catalytic effect of acid 1′ would provide a plausible explanation for the experimentally observed partial order of 2 for aldehyde 1. Assuming that the concentration of acid 1′ at a given reaction time is a fraction (χ) of the concentration of aldehyde 1 ([1′] = χ [1]), third-order reaction rate equations could be written as: rate = k [1′][1][a–d] = k·χ [1]2[a–d].
Thermodynamics is also well represented by this model, indicating that (a) the hemiaminal intermediate formation is an endergonic process (and thus not experimentally detectable) and (b) the formation of the stable imine products (the E isomers being much more stable) is either thermoneutral or slightly endergonic, suggesting microscopic reversibility.
Stationary points (i.e. transition structures and energy minima) could also be located for the second reaction (i.e. nucleophilic attack of an abbreviated model of acylhydrazide (2′) followed by aniline cleavage) starting from protonated iminium intermediates (5+ a–d) (Fig. 5 and the ESI†). It is noteworthy that the transition states of none of the elementary steps were involved in this reaction (i.e. nucleophilic attack of acylhydrazide and aniline cleavage could be computed unless the C
N bonds of the reacting imines were protonated as iminium ions (C
NH+)). Given the lack of experimental pKb values for imines 5a–d or similar compounds, the energy penalty associated with the formation of such iminium species (5+ a–d) in acetonitrile was estimated indirectly using combinations of machine learning approaches and experimental data41–44 (see the ESI†). However, considering the insufficient accuracy of these estimations, we were more interested in the intrinsic activation energies involved in the process. Although the transition states for the nucleophilic addition (ts3a–d) and aniline cleavage (ts4a–d) were calculated to be very close in energy, the activation barriers for the latter step were rate-limiting in all cases, with values following the trend observed experimentally: ΔG‡ (ts4c) > ΔG‡ (ts4 b) > ΔG‡ (ts4 a) > ΔG‡ (ts4 d). In agreement with the NMR experiments described above that allowed the detection of transient species, stable protonated aminal intermediates (AI+ a–d) after the addition of a model acylhydrazide were calculated for all studied cases involving derivatives from aniline, p-toluidine, p-anisidine and p-bromoaniline. Also in agreement with the experimental observations, both the Z and E isomers of the protonated hydrazone (3+) were calculated to have similar energies, the latter being slightly more stable. As in the imine formation reaction, hydrazone formation is also either thermoneutral or slightly endergonic, suggesting microscopic reversibility. Calculations in implicit water yielded virtually identical results, with differences in the relative energies of the calculated stationary points smaller than 1 kcal mol−1 (see the ESI†). As a referee noted, water and protic solvents could change the reaction mechanism acting as a proton shuttle and/or favoring the elimination of hydroxide ions from carbinolamine intermediates,45 although such theoretical predictions have not been confirmed experimentally.46,47
To study the performance of the catalysts at physiological pH, the reaction between aldehyde 1 (0.09 mM) and acylhydrazide 6 (0.9 mM) to yield acylhydrazone 7 was performed according to our previous work in a Tris buffer solution, using an excess of acylhydrazide to achieve fast conversion at reasonable reaction times.30 To monitor the reaction progress by HPLC-MS, an acylhydrazide with a chromophore group, like compound 6, was selected (Fig. 6).
The performance of amines 4a–d as catalysts was monitored for 1 week and compared to the uncatalyzed reaction (Fig. 6a). The time-course of the reaction during the first 6 h is shown in Fig. 6b. The reaction in the presence of the catalyst was completed between 3 hours and 4 days depending on the catalyst (5.5 h for 4a, 4 h for 4b, 3 h for 4c, and 4 days for 4d). It is noteworthy that the two best catalysts in aqueous buffer solutions, namely, 4b (R = Me) and 4c (R = OMe), also exhibited the fastest kinetics in acetonitrile, thus validating the initial predictions and mechanistic studies despite the different reaction conditions used.
In a dynamic combinatorial library, a particular challenge of hydrazone chemistry at physiological pH lies in the efficiency of achieving the exchange process. After establishing p-toluidine (4b) and p-anisidine (4c) as the most efficient catalysts for acylhydrazone formation in water, the DCC approach was attempted by mixing aldehyde 1 (Fig. 7) with six acylhydrazides (2, 6, and 8a–d). Structurally diverse building blocks were chosen for the DCL composition. The equilibration was completed in the presence of 4b (2 h 30 min) and 4c (3 h), whereas the same system in the absence of the catalyst took more than 2 days to equilibrate (>16-fold acceleration). Aldehyde 1 could not be detected in the reaction mixture, indicating that it was continuously being sequestered as an acylhydrazone component. The acylhydrazones were identified by HPLC-MS (see the ESI†). The reversibility of the DCL was confirmed by setting up consecutive DCLs where a new acylhydrazide was added once the DCL reached its equilibration point. The DCL started with mixing aldehyde 1 and four acylhydrazides 6, 8a, 8c and 8d. This was followed by the addition of acylhydrazide 2 and finally, acylhydrazide 8b was added, showing an identical distribution to that in Fig. 7, either in the presence of catalyst 4b or 4c (see the ESI†).
Furthermore, a dynamic combinatorial library with the best performing aniline derivatives, p-toluidine (4b) and p-anisidine (4c), was set up at physiological pH and low temperatures. The DCL achieved the required equilibrium composition in 3 h and 2.5 h, respectively, considerably improving the reported results in the absence of both catalysts. Additionally, taking into account the performance and properties of p-anisidine such as water solubility, affordability, and low toxicity, it proves to be an interesting alternative to aniline in systems where biological targets are present.
These efficient catalysts could expand the application of hydrazone-based chemistry, contributing to the development of DCC in biological environments and its use as a hit identification tool in drug discovery.
:
Z = 40
:
60). M.p: 179–180 °C. 1H-NMR (300 MHz, DMSO-d6) δ 11.59 (s, 1H, E), 11.44 (s, 1H, Z), 11.14 (s, 1H, E/Z), 8.70 (s, 1H, E), 8.13 (s, 1H, Z), 7.93–7.86 (m, 1H, E/Z), 7.39 (s, 1H, E/Z), 7.27–7.24 (m, 1H, E/Z), 2.65 (q, J = 7.5 Hz, 2H, Z), 2.23 (q, J = 7.5 Hz, 2H, E), 1.08 (t, J = 7.5 Hz, 3H, E/Z). 13C-NMR {1H} (75 MHz, DMSO-d6): δ 176.0, 152.8, 143.7, 141.4, 140.3, 126.4, 117.8, 116.7, 25.7, 9.1. Anal. calcd for C10H11N3O4: C, 50.63%; H, 4.67%; N, 17.71%. Found: C, 50.36%; H, 4.60%; N, 17.59%. HPLC-MS: tR: 7.06 min, [M + H]+ = 238 m/z.
48 using the M06-2X hybrid functional49 and 6-31+G(d,p) basis set with ultrafine integration grids. Bulk solvent effects in acetonitrile and water were considered implicitly using the IEF-PCM polarizable continuum model.50 The possibility of different conformations was taken into account for all structures. All stationary points were characterized by a frequency analysis performed at the same level used in the geometry optimizations from which thermal corrections were obtained at 298.15 K. The quasiharmonic approximation reported by Truhlar et al. was used to replace the harmonic oscillator approximation for the calculation of the vibrational contribution to enthalpy and entropy.51 Scaled frequencies were not considered. Mass-weighted intrinsic reaction coordinate (IRC) calculations were carried out using the Gonzalez and Schlegel scheme52,53 in order to ensure that the TSs indeed connected the appropriate reactants and products. Single point energies were calculated at the M06-2X/6-311++G(2d,p) level using the SMD polarizable continuum model.54 Zero-point energy, enthalpy and Truhlar's entropy corrections, obtained from the frequency calculation at the PCM(CH3CN)/M06-2X/6-31G(d,p) level, were added to the single point energy calculated at the SMD(CH3CN/H2O)/M06-2X/6-311++G(2d,p) level to give the final reported enthalpy and Gibbs free energy. Relative Gibbs free energies (ΔG) were used for the discussion on the stabilities of the considered structures. The lowest energy conformer for each calculated stationary point was considered in the discussion; all the computed structures can be obtained from authors upon request. Cartesian coordinates, electronic energies, entropies, enthalpies, Gibbs free energies, and lowest frequencies of the calculated structures are summarized in the ESI.†
Footnotes |
| † Electronic supplementary information (ESI) available: NMR spectra, MS, computational data, kinetic experiments and equations. See DOI: 10.1039/d1ob00871d |
| ‡ These authors contributed equally to the work. |
| This journal is © The Royal Society of Chemistry 2021 |