Luis
Rincón
*a,
Wendy M.
Rodríguez
a,
Jose R.
Mora
a,
Cesar
Zambrano
a,
Luis E.
Seijas
b,
Andres
Reyes
c and
F. Javier
Torres
*a
aGrupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Colegio Politecnico de Ciencias e Ingeniería, Diego de Robles y Vía Interoceanica, Universidad San Francisco de Quito, Quito 17-1200-841, Ecuador. E-mail: lrincon@usfq.edu.ec; jtorres@usfq.edu.ec
bEscuela de Ingeniería Ciencia y Tecnología, Universidad del Rosario, Bogota 111711, Colombia
cDepartamento de Química, Universidad Nacional de Colombia, Bogota 111321, Colombia
First published on 28th March 2025
In the present work, a new definition of the conceptual density functional theory reactivity indexes is proposed, based on a cubic interpolation of the energy as function of number of electrons as well as a generalization of the net electrophilicity index. This new proposal takes into account both the influence of hyperhardness on the reactivity and a weighted average of the electrodonating and electroacepting powers. Thus, the presented redefinition incorporates corrections and additional degrees of freedom to the prior CDFT indexes. Numerical support for global descriptors is presented for 30 benzhydrylium ions (i.e., charged electrophiles) and 15 alkyl and aryl nucleophiles taken as reference cases from the Mayr Database of Reactivity Parameters. In the best-case scenario, the descriptors correlated better with the electrophilicity parameter (r2 = 0.981) than with the nucleophilicity parameter (r2 = 0.827).
In view of the previous, extensive research has been devoted to establish appropriate electrophilicity and nucleophilicity scales. Among the most notable cases, the early proposals of Swain and Scott,3 Edwards,4–6 and Ritchie7 are to be mentioned. These pioneering works ultimately led to the Mayr–Patz equation,8 which is recognized as the workhorse of reactivity scales in organic chemistry and is defined as follows:
log10(k) = sN(E + N). | (1) |
In eqn (1), k is the second-order rate constant for a combination of an electrophile with a nucleophile at 20 °C, sN is a nucleophile-specific sensitivity factor, and E and N are the electrophilicity and nucleophilicity scales, respectively. Concerning the parameter sN, it is equal to 1 when 2-methyl-1-pentene acts as the nucleophile. Although the Mayr–Patz equation was created considering a set of specific experimental data obtained for π-nucleophiles (i.e., alkenes, arenes and allylsilanes) reacting with benzhydrylium ions as electrophiles,9–12 the applicability of eqn (1) has been confirmed for a wide diversity of nucleophiles, including amines, enamines, alcohols, ethers, ylides, phosphines, hydride donors and even organometallic compounds, as well as for many types of electrophiles such as carbocations and Michael acceptors (i.e., compounds with α, β unsaturated carbonyl groups).9–12 Noteworthy, Mayr's scale has also been used to critically examine the HSAB principle.13 Moreover, a particularity of Mayr's scale is that electrophilicity is exclusively related to electronic effects, whereas nucleophilicity can bear, in some cases, contributions of solvent14 and steric effects.15 In spite of its success, a complete theoretical understanding of the applicability of eqn (1) is still lacking, and the existence of a unique and universal nucleophilicity scale is still doubtful.16 Nonetheless, the Mayr's Database of Reactivity Parameters17 is considered the main source of numerical information on the reactivity of organic species. In July 2024, this data bank held numerous entries: 355 for electrophiles (E) and 1300 for nucleophiles (N, sN). In Mayr's database, a star rating system qualitatively accounts for the experimental uncertainty in the correlations used for the determination of each reactivity scale. In this vein, each compound, belonging to the database, is rated from one to five stars, depending on the experimental conditions used to determine the molecule's parameters and the reliability of the results. Consequently, the larger the number of stars, the greater the confidence in the correlation-related reactivity parameter. In this way, for reactions of reference electrophiles with reference nucleophiles, the most accurate predictions of the rate constants resulting from eqn (1) can be expected. Finally, although rate constants for reactions involving reference electrophiles are commonly reproduced with variances of less than a factor of two, errors of 10 to 100 must be expected when both reactants are not from the reference set.15,18 For this reason, in this work, only reference species from the Mayr's Database of Reactivity Parameters17 are considered.
Beyond reactivity scales, such as Mayr's one, theoretical and computational methodologies have been developed with the purpose of quantifying nucleophilicities and electrophilicities by employing electron-based descriptors. In fact, one of the main objectives in computational organic chemistry is to reach accurate predictions of the Mayr's reactivity scale,18 and at the same time, use these theoretical descriptors to gain a deeper knowledge of the factors that control the reactivity of organic compounds in general. As chemical reactivity is not an observable in a strict sense (something that happens with many basic chemical concepts19), global indexes intend to capture in a single number the 'propensity' of a molecule to donate an electron (nucleophile) to an electron-deficient molecule (electrophile). In this context, we cannot expect experimental scales and theoretical indexes to be identical because, in most cases, these two approaches consider different effects (or the same effect in a different way). Experimental scales are clearly based on rate constants; however, theoretical indexes depend on the energy change as a result of electron transfer in a molecule. Although in the latter case, we believe that a solid theoretical index should reflect, at least, the main trends observed in eqn (1). At the end of the day, the reactivity scales and reactivity indexes must be complementary rather than similar.
In the framework of the conceptual density functional theory, (CDFT)20–26 attempts have been made towards the definition of global and local electrophilic indexes.27–35 In the next section, we critically review some classical global CDFT electrophilicity indexes as they provide an appropriate framework for the present work, which focuses on defining global quantities designed to characterize reactivity by assigning a unique value to each molecule in a similar way to the E and N of eqn (1) originate electrophilicity and nucleophilicity scales. It is important to underline that, classical CDFT indexes are based on the quadratic expansion of the energy, and, although such approximation results are adequate for the context, an important flaw emerges from it as the leading term results to be directly proportional to the ionization potential rather than the electron affinity, as expected for an electrophilicity index. A solution to this problem is proposed at the end of the next section as part of the present contribution. In contrast to the proliferation of electrophilic indices for CDFT, the definition of a nucleophilicity index is undoubtedly a more difficult task. One reason is the withdrawal of electrons from a molecule always increases its energy. In these regards, many authors have conducted research on nucleophilicity models in the CDFT framework, but a definitive recipe that accounts for all important phenomena remains elusive.36–38 To take one step further, we define the electrophilicity index through a cubic expansion of the energy (Section 3). Through our approach, we consider the effect of incorporating the hyperhardness in the expansion (the third derivative of the energy with respect to the number of electrons).39–46 This idea has been recently explored by Hoffmann, Chermette and Morell,47 and also by Figueredo and Quintero.48 In this work, we complement previous studies by demostrating that the cubic expansion introduces some corrections to the classical CDFT electrophilicity indexes. In addition to these corrections, these cubic indexes can incorporate new degrees of freedom that are simple to interpret. This approach provides further insight into the factors that determine electrophilicity and nucleophilicity. To test the performance of cubic reactivity indices, in Section 4, we correlated the CDFT indexes with the experimental Mayr reactivity parameters for two reference cases: benzhydrylium ions (charge electrophiles) and alkyl nucleophiles. We note, that quantitative correlations of classical electrophilicity measurements with Mayr's reactivity parameters have previously been reported.49 Since the cubic indices introduce additional degrees of freedom, compared with the classical ones, a better numerical correlation is obtained, in particular when nucleophiles are employed. The concluding remarks and future directions are presented in the final section.
![]() | (2) |
![]() | (3) |
These derivatives are commonly interpreted as the molecule's sensitivity to electron transfer processes (expressed as a change in the number of electrons) and thus indicate the molecule's susceptibility to chemical processes that involve electron transfer. The evaluation of eqn (2) and (3) require an expression of the energy as a function of Ne at a constant external potential (e.g., molecular geometry). At this point, it is worth noting that (at zero-temperature) the energy as a function of the number of electrons consists of a series of straight lines connecting the ground state energies of integer numbers of particles.53,54 Despite the challenges associated with energy discontinuities of the exact interpolation, from a chemical standpoint, it makes sense to assume a smooth expansion energy with a change in the number of electrons; in fact, various flavors of these expansions are described in the literature.55 The simplest and most common approach to define the energy as a function of the number of electrons is the use of a quadratic expansion around a reference number of electrons (N0),50,52,55,56
![]() | (4) |
This expansion is reasonable because a fragment of a reacting system can have fractional charge, and the dependence of the energy on the amount of fractional charge is, based on both computational evidence and theoretical arguments,34,57,58 expected to be smooth. It is also common to consider the interpolation of eqn (4) between integer values around N0 using the ionization potential (I = E[N0 − 1] − E[N0]) and the electron affinity (A = E[N0] − E[N0 + 1]). Assuming that the quadratic expansion is valid, it is possible to obtain both μ and η from I and A as
![]() | (5) |
η = I − A. | (6) |
eqn (5) and (6) are finite-difference approximations to the first and second derivatives of eqn (2) and (3), respectively. Further, approximations to eqn (5) and (6) can be obtained based on Koopmans' theorem in terms of the highest occupied (εHOMO) and lowest unoccupied (εLUMO) molecular orbital energies assuming that I = −εHOMO and A = −εLUMO. This approach is justified within the framework of the Hatree–Fock approximation; however, DFT is a theory of the total energy, and while the Kohn–Sham auxiliary system is a powerful construct, the energies of the Kohn–Sham orbitals are not necessarily related to the energy of the charge excitations. The Janak theorem established the equality of orbital energies with the derivatives of total energy with respect to the orbital occupation numbers.59 For the exact Kohn–Sham potential, it has been proven that the highest occupied Kohn–Sham orbital energy (HOMO) is negative of the first ionization energy.60 Additionally, the ground state chemical potential theorem proves that in any ground-state Kohn–Sham calculation with an exchange–correlation functional that is a continuous functional of the density or the Kohn–Sham density matrix, the energy of the HOMO is the chemical potential of the electron removal and the energy of the lowest unoccupied molecular orbital (LUMO) is the chemical potential of electron addition.61,62 The ground-state chemical potential theorem justifies using the Kohn–Sham orbital energies to approximate experimental I and A. Throughout this study, all ionization potentials and electron affinities refer to vertical rather than adiabatic quantities.
Based on kinetic data, in 1998, Maynard empirically proposed that the ratio between the square of the electronegativity and the hardness measures the electrophilic power of a molecule, that is, its tendency to accept electrons.63 This work, inspired by Parr, von Szentpaly, and Liu, defined the electrophilicity index as the energy change of an electrophile when it becomes saturated with electrons,27 considering the case where an electrophilic species is immersed in an idealized zero temperature electron sea with zero chemical potential. In such a situation, the electrophile becomes saturated with electrons when its chemical potential becomes equal to that of the electron sea. Using the quadratic expansion, the system reaches saturation when ∂ΔEquadratic/∂ΔNe = 0, and this occurs when
![]() | (7) |
By inserting eqn (7) in the quadratic expansion, we obtain the electrophilicity index of a molecule, just as was suggested by Maynard,63
![]() | (8) |
In the previous equation, we explicitly indicate that this expression is based on the quadratic expansion, eqn (4). Eventually, it is possible to generalize this expression for the spin-polarized case.64–66 Clearly, the electrophilicity index corresponds to the minimum energy in the quadratic interpolation. The electron affinity of a chemical system is assessed by its ability to accept one electron from a donor, but the maximum amount of electron flow is determined by the electrophilicity, which depends on the chemical potential and hardness as described above. From eqn (5) and (6), it is also clear that as long as the quadratic expansion is valid, the electrophilicity depends not only on the electron affinity, A, but also on the ionization potential, I,
![]() | (9) |
The electrophilicity index sounds appealing because it is based on the stabilization energy produced by electron transfer. Several comprehensive reviews on the usefulness of the electrophilic index are available in the literature.67–72 We encourage readers to refer to these reviews to assess the significance of the electrophilicity index in many chemical contexts. As shown in these reviews, this index has been quite successful in comparing electrophilicity differences among molecules. The minimum electrophilicity principle and the electrophilicity equalization principle have provided some insights into the possible underlying rules governing this quantity.73–75 The influence of dynamical effects on the electrophilicity was investigated by Hoffmann, Tognetti and Joubert.35 The authors discovered substantial differences when compared to static reactivity descriptors. Recently, Miranda-Quintana raised some concerns about the definition of the electrophilicity index,33 and redefined the electrophilicity index in terms of the electronic Helmholtz free energy, focusing on the zero-temperature limit. This new definition, referred to as thermodynamic electrophilicity, addresses two key issues present in the quadratic electrophilicity index: the use of a truncated second-order Taylor expansion to estimate energy changes and the assumption of a zero-temperature environment. These approaches often lead to inconsistencies, particularly when applied to systems with very low electron affinities or multicharged cations. By focusing on thermodynamic electrophilicity, these challenges are mitigated, offering a more reliable framework for such systems. Furthermore, an electrophilicity index has been defined for the electronic Helmholtz free energy, and it is observed that, for some temperature values, this index corresponds to the electrophilicity index defined within the quadratic interpolation.34
One big issue of eqn (9) is related to the fact that I > A (sometimes much larger). Therefore, it is convenient to expand this equation in powers of . The result is as follows,
![]() | (10) |
This equation establishes that the leading term of ωquadratic goes as therefore in general, the electrophilic index follows the ionization potential, not the electron affinity as expected.33 The success of ωquadratic comes from the significant correlation between I and A.
In 2007, Gázquez, Cedillo, and Vela distinguished between charge-donating and charge-accepting scenarios by minimizing the change in the grand potential with respect to the amount of transferred charge.28 In the grand canonical ensemble, chemical reactivity is characterized in terms of derivatives of the grand potential. In this context, changes in the grand potential due to electron transfer between a molecular system and an electron sea, with chemical potential μ0, are defined as
ΔΩ = ΔE − μ0ΔN. | (11) |
Using the quadratic expansion for ΔE, the number of electrons transferred that minimize the grand potential is
![]() | (12) |
In contrast with eqn (7) where μ0 = 0, this equation states that the number of electrons transferred is determined solely by the difference between the chemical potential of the molecular system and that of the environment. Thus, depending on this difference, the number of electrons in the molecule can be increased or decreased. In principle, this approach may provide a way to define electrophilicity and nucleophilicity at the same time.
Consider, for example, the situations where μ0 is close to μ, and therefore Δμ = μ − μ0 is small. One may assume that μ0 = sμ, where if s > 1 we obtain that the number of electrons in the molecule decreases, while if s < 1 the system increases the number of electrons. Of course, if s = 1, electron transfer does not occur. Thus, the energy change in the molecular system is given by,
![]() | (13) |
In addition to including the change in the grand potential, eqn (13), Gázquez, Cedillo and Vela consider that, in contrast to the finite differences approximation to μ, eqn (5), the derivative of the energy evaluated at some integral value of N generally has one result when evaluated from the left and a different result when evaluated from the right.28 That is, from the left is the negative of the ionization potential,
![]() | (14) |
![]() | (15) |
The use of different values of chemical potential for the electron donation and withdrawal cases is equivalent to the use of two quadratic interpolations: one for ΔNe < 0 and a different one for ΔNe > 0. Because s in eqn (13) is a constant that characterizes the environment; one can define for the charge donating process the electrodonating power as,
![]() | (16) |
![]() | (17) |
The right-hand side of eqn (16) and (17) are based on an interpolation in which the chemical hardness for the electron donation and electron accepting process is equal. In contrast to ωquadratic, ω−GCV gives more emphasis to the ionization potential than to the electron affinity, while ω+GCV shows the opposite behavior. Using the reasoning presented in the expansion of eqn (10), we obtain the leading term of eqn (16) and (17), and
. The electrodonating and electroacepting power is closely related to the nucleofugality and electrofugality that is used to predict the quality of leaving groups in organic reactions.76 Alternatively, Chamorro, Duque-Noreña and Perez approximated the electrodonating and electroaccepting powers using the chemical potentials given by eqn (14) and (15),29,30 that is,
![]() | (18) |
![]() | (19) |
In 2009, Chattaraj, Chakraborty, and Giri introduced the concept of net electrophilicity as the electron accepting ability of a species relative to its electron donating ability,31 which is defined from the electrodonating and electroaccepting power in the form,
Δω± = ω+ + ω−, | (20) |
![]() | (21) |
The idea of combining ω+ and ω− appears to be interesting based on the fact that a good electrophile is, in some way, a bad nucleophile, and vice versa. However, eqn (21) has the disadvantage of having two quantities with different units. As a result, eqn (20) can be regarded as a more adequate expression for theoretical developments because it presents units consistency. Moreover, eqn (20), divided by 2, can be seen as the arithmetic mean of ω− and ω+. In the following, these descriptors are referred to as Δω±arithmetic = 0.5(ω+ + ω−). The geometric mean can be used as an alternative to the arithmetic mean of eqn (20),
![]() | (22) |
The last equation contains the same ingredients as eqn (20), but the leading term, in this case, is the electron affinity.
As commented earlier, previous indexes emphasized electrophilicity rather than nucleophilicity. Therefore, the final part of this section is devoted to the less explored nucleophilicity index. Regarding this, Partihar and Roy have proposed, with limited success, the inverse of the electrodonating power as the nucleophilicity index.37,77 Later, Domingo, Chamorro, and Perez connected nucleophilicity to the highest occupied Kohn–Sham orbital energy, which was a partially successful approach.37,78 Some additional attempts were made towards the definition of a relative measure of nucleophilicity.36,38 Finally, Liu and co-workers have proposed a method to simultaneously determine electrophilicity and nucleophilicity values through the use of information-theoretic quantities, resulting in good correlations with experimental scales.79–82 Notwithstanding, as we mentioned in the introduction, a descriptor that accounts for all major trends in the experimental nucleophilic index is still missing.
![]() | (23) |
Due to the greater order of the expression, a better description of the chemical reactivity can be expected. Considering eqn (23), the dependence of μ, η and γ on the ionization potential (I = E[N0 − 1] − E[N0]) and electron affinity (A = E[N0] − E[N0 + 1]) of an N0 electron reference system can be defined in the same way as in the case of the quadratic interpolation. In fact, it is straightforward to demonstrate that the hardness formula of the cubic expression coincides with the one obtained from the quadratic model; that is, η = I − A. On the other hand, it follows that the chemical potential and the hyperhardness (γ) satisfy the relation,
− 3(I + A) = 6μ + γ | (24) |
Therefore, based on this constrains, γ depends on the chemical potential in the cubic interpolation. At this point, to establish a connection with the previous elaborations of reactivity parameters, we introduce an α-dependent μ in the form,
μ = −(αI + (1 − α)A). | (25) |
Now, eqn (23) results in,
![]() | (26) |
Clearly, if α = 1/2, eqn (26) reduces to the quadratic interpolation. This equation shows that α represents the weighted contribution of both the ionization potential and the electron affinity to the chemical potential. We notice that based on the constrains imposed in the present work, in eqn (26), ΔEcubic = I if ΔN = −1 and ΔEcubic = A if ΔN = +1, for any value of the parameter α. Recently, Miranda-Quintana interpreted eqn (25) as a perturbed chemical potential that incorporates the effect of the molecular environment.83–87 Moreover, as shown in the work of Heidar-Zadeh et al.,55eqn (26) incorporates the correct interpolated electron density and the Fukui function in the hyperhardness. Eventually, the parameter α can be adjusted based on additional constrains, or alternatively, α can be envisaged as the propensity of a given chemical species to donate or accept fractional amounts of charge and can be treated as an adjustable parameter for correlations with experimental data.
In Fig. 1 we compare the exact behaviour (blue line), the quadratic interpolation (black line) and the cubic interpolation with α = 0.25 (grey line) and α = 0.75 (red line). Compared to the quadratic interpolation, the cubic interpolation with α = 0.25 is closer to the straight lines connecting the integer ΔN values for electron gain, that is when ΔN > 0; in contrast, the cubic interpolation with α = 0.75 resembles the blue line for electron loss.
![]() | ||
Fig. 1 Plot of the total energy as a function of the number of electrons. Straight lines correspond to the exact behaviour (blue line), black line correspond to the quadratic model (eqn (4)), gray line correspond to the cubic interpolation (eqn (26)) for α = 0.25 and red line for α = 0.75. |
The global electrophilicity is defined as the stabilization energy when the system is saturated with the maximum allowed number of electrons.27 Using this definition, the maximum number of electrons transferred is obtained by minimizing the eqn (23). Since the global minimum of eqn (23) is at ΔN = −∞, the number of electrons transferred in the local minima is,
![]() | (27) |
![]() | (28) |
Since η > 0, ΔNmax is negative when γ is positive, that is for α > 0.5; whereas ΔNmax is positive for γ negative (α < 0.5). Thus by changing the value of α we can describe both electron accepting and electron donation process. This number of electrons results in the following stabilization energy when inserting in eqn (23),
![]() | (29) |
At first glance, eqn (29) seems different from the previous expressions of the electrophilic index. To connect ωcubic to its quadratic model counterpart, we use the power expansion of the square root in eqn (28), that is . The number of electrons transferred is then approximately given by
![]() | (30) |
![]() | (31) |
The first term in this equation resembles the previous elaborations of the electrophilicity index (the electrophilicity index and the electrodonating power). This equation is similar to that found by Hoffmann, Chermette, and Morell47 by taking the local maxima rather than the local minima of the cubic energy expansion. Since, eqn (31) is an approximation of eqn (29), it is instructive to determine the conditions under which this approximation holds. In Fig. 2, we plot ωcubicversus cubic using the values of I and A for the 61 atoms and 55 molecules reported in the work of Parr, von Szentpaly, and Liu.27 In both indexes we assume that μ = − (αI + (1 − α)A), η = I − A and γ = 3(2α − 1)(I − A). We include different values of α in the range between {0.0, 1.0}. The straight blue line corresponds to the value of α = 0.5 for which both equations are identical, and corresponds to the case of ωquadratic. An important observation in this figure is that for α ≠ 0.5 the values of eqn (29) always tend to be larger than eqn (31), for this reason, the points are in the inferior quadrant of the figure. In the range, {0.40, 0.60} a good correlation is observed, and the approximation is safe. For α < 0.5 the values of ωquadratic are lower than for α > 0.5, indicating that the range of values of ω tends to increase with α. For α = 0.0 the ω is in the range [0.05, 0.60], but for α = 1.0 in [3.9, 1.1], and this is observed for eqn (29) and (31). Therefore, as α increases, the electrophilicity also grows. The difference between the two equations is greater for α > 0.5 than for α < 0.5. For α = 0.0 the mean deviation is 2.2%, but for α = 1.0 is 37.4%. For α < 0.5 the term
is positive, since μ < 0 and γ < 0. It is important to notice that for small values of α, the term
is often larger than 1, and the square root of eqn (28) is undetermined. This implies that for low values of α, the cubic interpolation do not present a local minimum for certain combinations of I and A.
![]() | ||
Fig. 2 Correlation between ωcubiceqn (31) and ![]() |
Since the differentiation of a molecule's response to donation and acceptance of charge is important from a chemical perspective, it is plausible to use different values of α depending on the sign of ΔN. For instance, we assume that for the charge donation process (ΔN < 0) α > 0.5, while for ΔN > 0 we employ an α < 0.5. Using two α values not only distinguishes between charge receiving and donating processes, as also observed in Fig. 1, it also provides a better approximation of energy. Using the previous approximations, one can define for the charge donating process the electrodonating power as
![]() | (32) |
![]() | (33) |
Finally, in the same spirit of the net electrophilicity index, a general net nucleophilicity and electrophilicity index can be defined as a weighted average between and
. Thus, we define the net nucleophilicity as a weighted arithmetic mean,
![]() | (34) |
![]() | (35) |
Alternatively, a geometrical index can be defined as
![]() | (36) |
![]() | (37) |
These formulations incorporate a correction to the classical index as well as new degrees of freedom in the parameters α−, α+, and β. As commented before, the α parameters can be interpreted as the weighted contribution of the ionization potential and electron affinity to the chemical potential. Likewise, β can be envisaged as the weighted contribution of − and
+ to the net nucleophilicity or electrophilicity. In the following, we will analyze how the previously discussed electrophilicity and nucleophilicity correlate with the Mayr's scale.
Linear regression models were used to test the correlation between the experimental reactivity scale and the CDFT reactivity indexes. The following metrics were used to assess the model performance: root mean square error (RMSE), mean absolute error (MAE), maximum error, the coefficient of determination (r2) and Kendall rank correlation coefficient (τKendall). In contrast to the coefficient of determination, τKendall is used to measure the strength and direction of the association between two variables, even if the correlation is not linear. If τKendall ∼ r2, the linear correlation is reliable. Statistical analysis was performed using Scikit-learn (version 1.6.1) and τKendall is obtained from the stats library of Scipy (version 1.11.4). The parameters α and β of eqn (34)–(37) are optimized using the Levenberg–Marquardt algorithm as implemented in the least-square library of Scikit. The parameters α and β are limited to being between 0 and 1, and the computed indexes of eqn (34)–(37) must be positives. A total of 12 CDFT indexes are used in the correlations:
• The ionization potential (1) and the electron affinity (2). As previously described, these descriptors are calculated in two forms.
• The electrophilicity index obtained from the quadratic expansion (3), ωquadratic (eqn (8)). Even though this descriptor was originally proposed as an electrophilic measure, we tested it also in the case of nucleophilic compounds based on the (not necessarily erroneous) premise that a good electrophile is a bad nucleophile. Furthermore, as shown in eqn (9), the leading term of this descriptor is dependent on the ionization potential and may describe electron loss.
• The electrodonating and electroaccepting power. We used the electrodonating power to correlate nucleophilicity and the electroacepting power to correlate electrophilicity. Two definitions are used, depending on the form of μ+ and μ−: ω+GCV and ω−GCV correspond to descriptor (4) and ω+CDP and ω−CDP to descriptor (5).
• The net electrophilicity. We employ the value taken from the arithmetic mean, eqn (20), and the geometric mean introduced in this work, eqn (22). For each of these two net electrophilicity definitions, the electrodonating and electroacepting power can be calculated using the two definitions previously mentioned. Thus, for the arithmetic mean, when using ω+GCV and ω−GCV, the descriptor is denoted as Δω±arithmetic,GCV (6), and for ω+CDP and ω−CDP the descriptor is denoted as Δω±arithmetic,CDP (7). For the geometric mean, when using ω+GCV and ω−GCV, the descriptor is Δω±geometric,GCV (8), and for ω+CDP and ω−CDP the descriptor is Δω±geometric,CDP (9).
• The approximated electrophilic index obtained from the cubic expansion, eqn (31), by considering α as an optimized parameter. This descriptor is (10).
• The generalized net arithmetic nucleophilicity of eqn (34) or electrophilicity of eqn (35), (11), and the generalized net geometric nucleophilicity eqn (36) or electrophilicity of eqn (37), (12).
Table 1 presents the data obtained for the reference benzhydrylium ions, where the names of the compounds are taken from Mayr's database.17 This set of carbocations represents the most precise estimate of the electrophilicity parameter, E, which covers almost 18 orders of magnitude on the rate constant; that is: from −10.04 to 7.52 (in log scale). Using the data in Table 1, we perform linear correlations of E with each one of the 12 descriptors mentioned previously. The ESI,† provides the value of each descriptor for the entire collection of electrophiles. The statistics of the correlations are presented in Table 2, where some relevant aspects can be highlighted: the first point is that the correlation is very similar for the two definitions used of I and A. Removing or adding an electron does not significantly alter the numbers obtained from the HOMO and LUMO energies. This is because even if IHOMO > I+ and ALUMO < A−, both descriptors are strongly correlated. The second point to mention is the fact that both I and A strongly correlate with E, in fact, their correlation is better than most of the classical descriptors. We anticipate the correlation of A, but not of I. This is a consequence of the high correlation between I and A for this set. For the classical descriptors (3–9) the RMSE are around 1.2 and the MAE is close to 1, this mean that an error in an order of magnitude is expected for rate constant. We can improve the correlation in the case of cubic descriptors introduced in this work (10–12) by including additional degrees of freedom. By using an optimized value of α+ = 0.65(0.62) in 10 the correlation improves to r2 = 0.973(0.976) and the MAE decrease to 0.735(0.728). This is a 25% reduction in MAE respect the classical descriptors. Notice that an α+ value greater than 0.5 (α+ > 0.5) in the descriptor 10 is counterintuitive, because it suggests that the contribution of the ionization potential is more significant than the electron affinity in the estimation of the chemical potential. However, this specific value of α+ guarantees the existence of a minimum in the cubic interpolation, being, this behavior, in line with the existing correlation between I and A quantities of the database. Notwithstanding, it must be indicated that its interpretation deserves some caution. Inclusion of the parameter β+ in eqn (34) and (36) improve much more the correlation only in the case of the geometric average. The best fitting is to use the geometric average with β+ = 0.73(0.76), α+ = 0.00(0.00) and α− = 0.76(0.74). The improvements in correlation due to the descriptors 10 and 12, even at the cost of introducing new degrees of freedom, open the doors to comprehending similar set of electrophiles with an attempt to understand the variability of the parameters α and β. Additional studies in this direction is now underway.
Molecule | E | I HOMO | A LUMO | I + | A − |
---|---|---|---|---|---|
(ani)2CH+ | 0.00 | 11.52 | 5.36 | 11.34 | 5.60 |
(dfp)(mfp)CH+ | 7.52 | 13.42 | 6.88 | 13.29 | 7.05 |
(dfp)PhCH+ | 6.74 | 13.30 | 6.67 | 13.15 | 6.84 |
(dma)2CH+ | −7.02 | 10.20 | 4.55 | 9.89 | 4.84 |
(dpa)2CH+ | −4.72 | 9.75 | 4.42 | 9.27 | 4.87 |
(ftol)2CH+ | 5.24 | 12.56 | 6.23 | 12.39 | 6.46 |
(fur)(ani)CH+ | −0.81 | 11.35 | 5.27 | 11.15 | 5.52 |
(fur)2CH+ | −1.36 | 11.22 | 5.21 | 11.01 | 5.47 |
(ind)2CH+ | −8.76 | 9.88 | 4.36 | 9.54 | 4.67 |
(jul)2CH+ | −9.45 | 9.62 | 4.17 | 9.20 | 4.54 |
(lil)2CH+ | −10.04 | 9.71 | 4.19 | 9.36 | 4.52 |
(mfa)2CH+ | −3.85 | 10.64 | 4.92 | 10.31 | 5.23 |
(mfp)2CH+ | 6.87 | 13.21 | 6.69 | 13.13 | 6.86 |
(mfp)PhCH+ | 6.23 | 13.10 | 6.47 | 13.01 | 6.64 |
(mor)2CH+ | −5.53 | 10.21 | 4.63 | 9.83 | 4.97 |
(mpa)2CH+ | −5.89 | 10.08 | 4.40 | 9.66 | 4.77 |
(pcp)2CH+ | 5.48 | 12.55 | 6.36 | 12.44 | 6.60 |
(pfa)2CH+ | −3.14 | 10.48 | 4.77 | 10.07 | 5.16 |
(pfp)2CH+ | 5.01 | 12.92 | 6.32 | 12.85 | 6.48 |
(pyr)2CH+ | −7.69 | 9.98 | 4.36 | 9.60 | 4.69 |
(tfm)PhCH+ | 6.70 | 13.36 | 6.59 | 13.53 | 6.78 |
(thq)2CH+ | −8.22 | 9.87 | 4.32 | 9.50 | 4.65 |
(tol)2CH+ | 3.63 | 12.36 | 5.86 | 12.19 | 6.09 |
(ani)(Ph)CH+ | 2.11 | 12.24 | 5.74 | 12.10 | 5.94 |
(ani)(pop)CH+ | 0.61 | 11.48 | 5.33 | 11.25 | 5.61 |
(ani)(tol)CH+ | 1.48 | 11.94 | 5.59 | 11.77 | 5.82 |
Benzhydrilium ion | 5.47 | 13.05 | 6.27 | 12.95 | 6.44 |
(pfp)(Ph)CH+ | 5.20 | 12.98 | 6.29 | 12.89 | 6.46 |
(pop)(Ph)CH+ | 2.90 | 11.96 | 5.66 | 11.93 | 5.91 |
(tol)(Ph)CH+ | 4.43 | 12.69 | 6.05 | 12.56 | 6.25 |
Descriptors | RMSE | MAE | Max error | r 2 | τ Kendall |
---|---|---|---|---|---|
a α + = 0.65(0.62). b β + = 1.00(0.98), α+ = 0.65(0.62), α− = 0.49(0.48). c β + = 0.73(0.66), α+ = 0.00(0.00), α− = 0.76(0.74). | |||||
1 I | 1.041 (1.133) | 0.803 (0.830) | 3.137 (3.412) | 0.967 (0.961) | 0.894 (0.885) |
2 A | 0.997 (0.919) | 0.835 (0.802) | 2.002 (1.697) | 0.969 (0.974) | 0.936 (0.949) |
3 ωquadratic | 1.161 (1.222) | 1.006 (1.048) | 2.174 (2.665) | 0.959 (0.954) | 0.936 (0.959) |
4 ω+GCV | 1.274 (1.505) | 1.106 (1.342) | 2.501 (3.140) | 0.950 (0.931) | 0.940 (0.949) |
5 ω+CDP | 1.434 (1.910) | 1.242 (1.735) | 2.878 (3.590) | 0.937 (0.888) | 0.940 (0.917) |
6 Δω±arithmetic,GCV | 1.147 (1.187) | 0.992 (1.011) | 2.128 (2.593) | 0.960 (0.957) | 0.936 (0.959) |
7 Δω±arithmetic,CDP | 1.111 (1.096) | 0.953 (0.937) | 1.997 (2.379) | 0.962 (0.963) | 0.931 (0.954) |
8 Δω±geometric,GCV | 1.282 (1.260) | 1.110 (1.087) | 2.720 (2.739) | 0.946 (0.951) | 0.931 (0.959) |
9 Δω±geometric,CDP | 1.224 (1.391) | 1.064 (1.224) | 2.366 (2.961) | 0.954 (0.941) | 0.936 (0.959) |
![]() |
0.930 (0.883) | 0.735 (0.728) | 2.645 (1.960) | 0.973 (0.976) | 0.913 (0.931) |
![]() |
0.929 (0.882) | 0.740 (0.733) | 2.574 (1.886) | 0.974 (0.976) | 0.917 (0.931) |
![]() |
0.784 (0.830) | 0.632 (0.734) | 1.732 (1.844) | 0.981 (0.979) | 0.894 (0.922) |
Table 3 presents the data for a set of 15 references nucleophiles (11 Alkenes and 4 Arenes). In all cases, the solvent employed in determining the N scale is dichloromethane; thus, no solvent influence is expected in this dataset. N spans nine orders of magnitude, from −3.65 (2-chloropropene) to 5.85 (N-methylpyrrole). The electron affinity in all cases is negative and very small, as computed by both approaches. In contrast to the electrophile data set, in nucleophiles the correlation between I and A is very small, r2 < 0.1. Since I ≫ A in this case, the CDFT descriptors are strongly dependent on the values of I. Using the data in Table 3 we perform linear correlations of N with each one of the 12 descriptors previously mentioned. The ESI,† provides the value of each descriptor for the entire collection of nucleophiles. The statistical data is presented in Table 4, where it is observed that the correlations are lower than in the case of electrophiles, sometimes remarkably low for the descriptors A 2, Δω±geometric,GCV 8 and Δω±geometric,CDP 9. These three descriptors are strongly dependent on A, and this last descriptor is almost constant and close to −1.1 eV in all cases, except for 1-methyl-4-vinyl-benzene and styrene. Notice that even if the regression is not as good for this case, when compared with the electrophilic case, the errors in RMSE, MAE, and Max Error are not significant. τKendall shows that for most cases, the correlation is negative, which means that the larger descriptors are the poor nucleophiles. As in the case of the electrophilic index, the generalized net nucleophilicity based on the cubic interpolation outperforms the classical descriptors, in particular the geometric mean. In the case of the arithmetic mean 11, β− = 0.25(0.11), α+ = 0.23(0.14) and α− = 1.00(0.99), the r2 = 0.803(0.763) and a MAE of 0.873(0.938), taking the geometric mean with β− = 0.99(0.99), α+ = 0.11(0.09) and α− = 0.85(0.72), increase the fit to r2 = 0.827(0.831) and the MAE decrease to 0.751(0.792). In the last descriptor β− is close to 1 indicating the importance of I in the correlation.
Molecule | N | I HOMO | A LUMO | I + | A − |
---|---|---|---|---|---|
1-Hexene | −2.77 | 9.21 | −1.17 | 9.53 | −1.01 |
1-Methyl-4-vinyl-benzene | 1.70 | 8.10 | −0.67 | 8.14 | −0.67 |
1-Methylcyclopentene | 1.18 | 8.38 | −1.16 | 8.59 | −0.99 |
1,3-Dimethoxybenzene | 2.48 | 8.07 | −1.05 | 8.13 | −0.85 |
2-Chloropropene | −3.65 | 9.24 | −1.10 | 9.64 | −0.88 |
2-Methyl-pent-1-ene | 0.84 | 8.87 | −1.11 | 9.10 | −0.95 |
2-Methyl-furan | 3.61 | 8.13 | −1.12 | 8.41 | −0.93 |
2,3-Dimethyl-but-1-ene | 0.65 | 8.89 | −1.09 | 9.10 | −0.93 |
2,3,3-Trimethyl-but-1-ene | 0.06 | 8.91 | −1.08 | 9.07 | −0.92 |
2,4,4-Trimethyl-pent-1-ene | 0.79 | 8.82 | −1.07 | 8.95 | −0.92 |
Furan | 1.33 | 8.55 | −1.21 | 8.94 | −1.00 |
Isobutylene | 1.11 | 8.91 | −1.13 | 9.27 | −0.97 |
N-Methylpyrrole | 5.85 | 7.86 | −1.04 | 8.09 | −0.80 |
Norbornene | −0.25 | 8.68 | −1.09 | 8.86 | −0.93 |
Styrene | 0.78 | 8.33 | −0.54 | 8.42 | −0.58 |
Descriptors | RMSE | MAE | Max error | r 2 | τ Kendall |
---|---|---|---|---|---|
a α − = 0.79(0.67). b β − = 0.25(0.11), α+ = 0.23(0.14), α− = 1.00(0.99). c β − = 0.99(0.99), α+ = 0.11(0.09), α− = 0.85(0.72). | |||||
1 I | 1.108 (1.257) | 1.024 (1.128) | 1.660 (2.229) | 0.743 (0.669) | −0.657 (−0.600) |
2 A | 2.166 (2.120) | 1.472 (1.502) | 4.933 (4.614) | 0.017 (0.058) | 0.086 (0.048) |
3 ωquadratic | 1.644 (1.225) | 1.269 (0.964) | 3.004 (2.563) | 0.434 (0.686) | −0.600 (−0.657) |
4 ω−GCV | 1.185 (1.089) | 1.063 (0.920) | 1.919 (2.132) | 0.706 (0.751) | −0.752 (−0.638) |
5 ω−CDP | 0.998 (1.124) | 0.903 (1.011) | 1.545 (2.096) | 0.791 (0.735) | −0.714 (−0.619) |
6 Δω±arithmetic,GCV | 1.335 (1.102) | 1.144 (0.923) | 2.420 (2.210) | 0.627 (0.746) | −0.714 (−0.657) |
7 Δω±arithmetic,CDP | 0.979 (1.143) | 0.884 (1.036) | 1.506 (2.098) | 0.799 (0.726) | −0.714 (−0.619) |
8 Δω±geometric,GCV | 1.946 (1.573) | 1.428 (1.206) | 3.864 (3.459) | 0.206 (0.482) | −0.505 (−0.543) |
9 Δω±geometric,CDP | 2.158 (2.107) | 1.471 (1.509) | 4.917 (4.577) | 0.024 (0.070) | −0.086 (−0.086) |
![]() |
0.983 (1.090) | 0.880 (0.920) | 1.511 (2.135) | 0.798 (0.751) | −0.733 (−0.638) |
![]() |
0.969 (1.064) | 0.873 (0.938) | 1.555 (1.796) | 0.803 (0.763) | −0.714 (−0.657) |
![]() |
0.909 (0.831) | 0.751 (0.792) | 1.867 (1.705) | 0.827 (0.831) | −0.581 (−0.543) |
Electrophilicity and nucleophilicity are widely used throughout organic and inorganic chemistry. Despite their impressive popularity, no satisfactory quantitative definition exists, neither as an experimental reactivity scale nor as a theoretical reactive index (or descriptor). All methods for describing the reactivity have some limitations. Perhaps one of the most common errors is attempting to separate electrophile and nucleophile reactivity to establish an universal scale. In some way, a nucleophile's reactivity is determined by the electrophile with which it reacts. Therefore, at best, an average scale can be obtained. In many parts of the text we insist that reactivity scales, like the Mayr's ones, and CDFT reactivity index may not necessarily be equal. In general, they are constructed from different perspectives. Reactivity scales are based on correlations obtained from the logarithm of the experimental rate constants (or the activation energy), the basis idea in eqn (1) is that nucleophilicity and electrophilicity are separable and additive. CDFT reactivity index is intended to capture how the energy of a molecular system changes as a result of electrons transfer. Its main idea is to treat chemical processes as molecular perturbations with respect to the total number of electrons and the external potential. These two points of view are similar but not equivalent, and in the case of the nucleophilicity a problem arises because the minima of the energy always occur for electron attachment, not for electron donation. To bring together of these two visions is not always possible because a large part of the activation energy in an electrophile–nucleophile reaction is due to geometric distortions, rather than electron transfer. For this reason, we consider both visions complementary, not equivalent.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5cp00994d |
This journal is © the Owner Societies 2025 |