The intrinsic view of ionization equilibria of polyprotic molecules

The intrinsic approach describing microscopic ionization equilibria is presented. This description massively reduces the number of parameters needed to characterize microequilibria. Particularly, by exploring molecular symmetries and group transferability, this approach is capable of resolving such equilibria even for rather complex molecules. Intrinsic constants are assigned to each ionizable group and interactions between these sites are introduced. These interactions involve pairs or triplets of sites. The strength of these interactions decreases rapidly with the distance between the sites. Once these parameters are known, one can obtain macroconstants, microconstants, microstate mole fractions, and overall or site-specific titration curves. These quantities provide insight into the protonation of the molecules in question. The knowledge of such properties is relevant for a wide range of phenomena, including receptor–ligand interactions, action of drugs, or geochemical processes.


Introduction
Binding of ligands to receptors plays a key role not only in chemistry, but also in biology, material sciences, or geochemistry.0][11] Binding of oxygen to hemoglobin as treated by Pauling probably represents one of the best studied systems since. 1 Ion binding is further essential to the performance of novel smart materials, such as self-healing rubber or supramolecular polymers. 12,13Binding of ions to minerals and natural organic matter determines the interactions between surfaces relevant to geochemical processes in subsurface transport of chemicals, colloidal particles, or microorganisms. 14he chemical nature of the binding sites where the ligands dock to receptors is primordial.However, this information is often unavailable, and systematic tools to resolve these questions are essential.Here we show how to tackle this problem in the archetypal ionization of polyprotic acids and bases.While monoprotic acids or bases are treated in all introductory chemistry courses, the diprotic case is seldom discussed in textbooks appropriately. 1,9,15Binding of protons to more complex receptors, such as proteins, polyelectrolytes, or dendrimers is discussed in specialized literature only. 4,16,17is reluctance might be related to the popularity of macroscopic equilibria in treating polyprotic systems.However, macroconstants only tell us that binding of a proton takes place, but not at which binding site.This information is contained in the microconstants, which specify equilibria between the different microspecies.For molecules with few sites, the number of such microconstants is limited, and their values can be deduced using various tools, for example NMR. 9,18,19However, the number of microconstants grows very rapidly with the number of sites, and for larger molecules attempts to obtain all these constants may seem hopeless. 20,21he intrinsic description of ionization equilibria advocated here circumvents this problem by focusing on interactions between sites.This approach drastically reduces the number of parameters involved, and takes molecular symmetries into account.Kirkwood proposed such a framework to deal with proton binding to proteins, 22 which was later adapted to polyelectrolytes. 23,248][29][30] The intrinsic description will be presented with examples of increasing complexity, thereby highlighting various strategies to extract the relevant parameters.We address the notion of group transferability, then the central idea of cluster expansion of the free energy, and finally molecular symmetries and homologous series.Concerning the technical background, we refer to the original articles 26,[31][32][33] and reviews. 4,16,27,34roton affinities of functional groups are influenced by their close chemical environment only.
Glycine has two different ionizable groups, namely an amine and a carboxylic group (Fig. 1a).Depending on the solution pH, this molecule is either deprotonated, which makes it negatively charged, singly protonated and hence neutral, or doubly protonated, which renders it positively charged.This macroscopic description thus invokes three so-called macrospecies.These species are in chemical equilibrium, which can be described with two equilibrium constants called macroconstants.These constants are 2.33 and 9.57 at an ionic strength of 0.1 M and 25 1C as determined by potentiometric titration (Fig. 1c). 35We report common logarithms of formation constants throughout.These macroconstants determine the mole fractions of macrospecies in the prevalence diagram (Fig. 1f).However, the macroscopic description does not tell us which groups ionize.Only the microscopic description provides this information.However, this description involves 4 microspecies and thus 3 independent microconstants have to be determined.Since these microconstants are related to the macroconstants, only one additional constant is needed to complete the microscopic description.
The missing constant can be obtained by invoking group transferability.The microscopic ionization constant of the amine group, provided the carboxylic group is neutral, will be very similar to the ionization constant of glycine methyl ester, which is 7.66 under the same conditions (Fig. 1a). 36The perturbation of the amine group through the modification of the distant carboxylic group will be negligible.With this value at hand, the 4 microconstants can be deduced (Fig. 1d).From these constants it follows that the predominant microspecies is the zwitterion, namely where the amine group is protonated and positively charged, while the carboxylic group is deprotonated and negatively charged.The other microspecies, where both groups are neutral, occurs in traces only.One also finds that the mole fraction of each of these microspecies with respect to both singly protonated microspecies is constant.From the microscopic description, the site-specific titration curves can be inferred (Fig. 1e).They reveal that the amine group is fully deprotonated above pH 8, while the carboxylic group protonates only below pH 4.
Since only 3 out of the 4 microconstants are independent, the situation is better summarized with intrinsic (or cluster) parameters.The intrinsic constants correspond to the ionization constants of the neutral molecule, and they are 4.24 for the carboxylic group and 7.66 for the amine group.The mutual influence of these groups can be quantified by a pair interaction parameter, which turns out to be 1.91.This parameter tells us that upon protonation of the carboxylic group the microscopic ionization constant of the amine group decreases by 1.91, and vice versa.This situation is normally referred to as negative or anti-cooperativity. 5Microscopic equilibria of glycine can thus be summarized in terms of 3 parameters, namely with 2 intrinsic constants and one interaction parameter (Fig. 1b).

Cluster expansion
The suggested grouping of microscopic parameters in intrinsic constants and interaction parameters is a powerful concept as it reflects the expansion of the free energy in terms of clusters of sites.When all clusters are included, this description is equivalent to classical microscopic equilibria.This grouping suggests, however, that the most important contributions originate from individual sites, less important ones from pairs of sites, while triplets or quadruplets contribute the least.Such an expansion converges rapidly, and when truncated, the parameter space is reduced substantially.Let us now illustrate how this approach can be used to resolve microscopic ionization equilibria of more complex molecules.We will use inositol phosphates as examples since their site-specific titration curves can be directly measured by 31 P-NMR.To a good approximation, the chemical shift of each phosphorus nucleus is a linear function of the degree of protonation of the respective phosphate group, since cross-coupling between different nuclei is negligible due to the distances involved. 19,26,37ernissi-Arifi et al. 37 reported site-specific titration of D-myoinositol 1,2,6-trisphosphate. 26 From these data, one obtains the 3 intrinsic ionization constants and the 3 pair interaction parameters from the site-specific titration curves with a least-squares fit (Fig. 2).The intrinsic constants have values of about 6.The most basic phosphate group is situated in position 1, while the most acidic in position 2. The strongest interaction is found between the equatorial groups 1 and 6, while the interaction between the equatorial group 1 and the axial group 2 is weaker.The weakest interaction is between groups 2 and 6, since they are well separated.The pair interaction can be interpreted as the shift of the microconstant of one of these sites due to the protonation of another one.These interactions reflect the mutual configuration of the phosphate groups, and they typically decrease with separation.This distance dependence originates from the electrostatic nature of this interaction, and hence the actual values may also be influenced by conformational degrees of freedom. 38nce the intrinsic parameters are known, they can be used to calculate other properties of interest (Fig. 2).In particular, one can evaluate all 12 possible microconstants, among which 7 are independent.The missing parameter among the 6 intrinsic parameters is the triplet interaction, which is probably small and can be assumed to vanish. 26Introducing this parameter does not improve the fit.The respective microstate fractions are also presented (Fig. 2d).While the concentration of the macrospecies varies with pH as indicated in the macroscopic prevalence diagram, the respective fractions of microspecies within a given macrospecies remain constant.Very similar results were obtained by direct analysis of the site-specific titration curves in terms of all microscopic ionization equilibria. 26ased on this information, the deprotonation mechanism of the D-myo-inositol 1,2,6-trisphosphate can be deduced.
The molecule deprotonates in the 2 position first, as this group is the most acidic one.The second deprotonation step principally involves the group in position 6.However, other microspecies remain significant, and one cannot identify a predominant ionization mechanism, as was the case for glycine.
Triplet interactions become important for D-myo-inositol 1,2,4,5-tetrakisphosphate. The site-specific titration curves obtained by 31 P-NMR are shown in Fig. 3.By considering pair interactions only, these curves could be described only moderately well. 39A triplet interaction between the groups 1, 4, and 5 improves the description substantially, and the results are summarized in Fig. 3.The pair interaction between the groups 2 and 5 turns out to be negligible due to their substantial separation.A non-vanishing triplet interaction indicates that the shift in the microconstant of a given group due to the protonation of the two neighboring groups is not pair-wise additive.This interaction reflects an additional shift in the microconstant when both neighboring groups are protonated.Triplet interactions originate from averaging of different site-site interactions in various conformers. 38A negative triplet interaction indicates positive cooperativity, which may result from the coupling between ionization and conformational degrees of freedom.This point is discussed in more detail elsewhere. 5The deprotonation mechanism of D-myo-inositol 1,2,4,5tetrakisphosphate can now be established.Groups 2, 4, and 5 dissociate the first proton almost simultaneously, but the species where group 2 is dissociated dominates.When the second proton dissociates, the prevalent microstate is the one where groups 2 and 5 are dissociated.Within the last deprotonation step, the relevant microspecies feature a deprotonated group 2.
The major simplification introduced by the cluster expansion should be clear now.The D-myo-inositol 1,2,4,5-tetrakisphosphate involves 32 different microconstants, from which 15 are independent, but only 13 of those can be determined from titration curves. 21The cluster expansion groups these parameters into 4 intrinsic constants and 11 interaction parameters, whereby 6 involve pairs, 4 triplets, and one quadruplet.For this molecule, the quadruplet contribution corresponds to the highest-order interaction present, and can be neglected.By considering their distance dependence, we identify 6 as the most important ones, namely 5 pair interactions and one triplet.The intrinsic description now involves 10 parameters.For this molecule, a direct analysis in terms of microscopic equilibria would be a daunting task.

Molecular symmetries
The number of independent parameters entering the intrinsic model can be further reduced by invoking molecular symmetries.The reduction in the number of parameters can be so substantial that microequilibria can be fully resolved from the macroconstants of a single molecule in favorable cases.
The simplest example of this situation is realized with symmetric diprotic acids or bases (e.g., aliphatic dicarboxylic acids and diamines). 31,40,41Since both ionizable groups are equivalent, one has one intrinsic constant and one pair interaction parameter.These two parameters can be obtained from the two macroconstants accessible by potentiometry.One finds that the resulting interaction parameter decreases strongly with increasing separation between the ionizable groups.This dependence reflects the importance of electrostatic interactions and confirms that interactions between more distant groups are negligible. 31These interactions depend on the ionic strength only weakly, since the electrostatic interactions are mainly mediated through the molecular backbone of low dielectric permittivity. 40,42y invoking molecular symmetries and group transferability simultaneously, microscopic equilibria can be resolved from macroconstants even for rather complex molecules.For example, consider linear aliphatic pentamine 2,2,2,2-pent (Fig. 4).Trying to obtain 80 microconstants from 5 macroconstants may seem impossible.However, these microconstants can be obtained by applying the above principles.We first assume that there are only two different intrinsic constants, namely one for the primary amine and one for the secondary amine.The number of interaction parameters can be reduced by exploiting their short ranged nature.When one assumes nearest neighbor pair interactions only, the resulting model fits poorly.The quality of the fit improves substantially when interactions involving three neighboring sites are included, namely triplet interactions and pair interactions between next nearest neighbors.These 5 parameters can be obtained from the 5 known macroconstants, and one finds that the latter pair interaction is negligible.The final intrinsic model thus involves 4 parameters, namely two intrinsic constants, one nearest neighbor pair interaction, and one triplet interaction (Fig. 4).These 4 parameters can be obtained with a least squares fit of the 5 macroconstants, and this model describes these constants within experimental error.Once these parameters are known, one can evaluate all microconstants and other properties of interest.The present results can be confirmed by an analysis of site-specific 13 C-NMR titration curves of 2,2,2,2-pent, which yields very similar parameters. 26In this case, however, crosscoupling of the chemical shifts between different carbon atoms must be considered. 19Such coupling is also important in 15 N-NMR titrations. 26These coupling effects may be further complicated by conformational changes, which can be sometimes resolved with multi-dimensional NMR techniques. 43rom this intrinsic model the protonation mechanism of 2,2,2,2-pent can be deduced (Fig. 4).When decreasing the pH, the primary amine group protonates first.In the second protonation step, the other primary amine group is protonated, even though the species with the central amine group protonated is present in small quantities as well.The third protonation step forms the dominant microspecies where the primary amines and the central secondary amine are protonated.The two prevalent microspecies, where one group remains deprotonated, involve the ones with deprotonated secondary amines.This microspecies leads to the pronounced minimum in the site specific titration curve, which has also been observed with 13 C-NMR. 26

Homologous series
Symmetry considerations are sometimes insufficient to constrain the problem such that these parameters can be determined from macroconstants only.In this case, consideration of a homologous series may help.As an example consider the resolution of microequilibria of the biologically relevant 3,4,3-tet or spermine (Fig. 5).Since the amine groups are well separated, one may consider nearest neighbor pair interactions only.There are two such parameters and two different intrinsic constants, which yield 4 unknown parameters.While 4 macroconstants should be sufficient to obtain those unknowns, their values are strongly correlated.This problem can be circumvented by simultaneously considering the homologue 3,3,3-tet or thermine.Supposing that the pair interactions are the same in both molecules, there are 2 additional unknown intrinsic constants.Least-squares fit now yields 6 unknown constants from 8 known macroconstants unanimously.These results can be confirmed by analyzing 13 C-NMR titration data of spermine, which yield a very similar intrinsic parameter set. 27he intrinsic protonation mechanism of spermine can now be deduced (Fig. 5).Under basic conditions, the secondary amine is preferentially protonated due to its somewhat larger basicity.In the next step, the distant primary amine is principally protonated, since in this fashion the molecule can avoid all pair interactions.Subsequently, the other secondary amine is protonated due to the larger basicity of this group.
Our results for 2,2,2,2-pent can also be confirmed with a homologous series.When 2,2-tri and 2,2,2-tet are considered together with 2,2,2,2-pent, one has 12 macroconstants that should be described with the same intrinsic model containing 4 parameters.The best fit results are shown in Table 1, and the resulting parameters turn out to be very similar to the ones obtained from 2,2,2,2-pent only. 31

Polyelectrolytes, dendrimers, and proteins
The intrinsic model can also be used to describe the ionization behavior of molecules with a larger number of sites. 4,16Linear polyelectrolytes with closely spaced sites ionize in two steps (e.g., linear polyethylene imine and polymaleic acid).Their titration curve features an intermediate plateau at a protonation degree of 1/2.This plateau originates from the stability of the intermediate microstate with alternating protonated and deprotonated sites (Fig. 4d and 6a).Comb-like polyelectrolytes show two intermediate plateaus in the titration curve at 1/2 and 3/4, which can be traced to two stable microstates (Fig. 6b). 44he intermediate plateau at 2/3 observed for polypropyleneimine dendrimers is due to a stable even-odd shell microstate (Fig. 6c). 32The ionization mechanism of polyamidoamine dendrimers is different, however.They feature an intermediate plateau at 1/2, which is a consequence of the stability of a The intrinsic model was calibrated on the experimental macroconstants of 3,3,3-tet and 3,4,3-tet (Table 1).Calculated (e) site-specific titration curves (f) and the macroscopic prevalence diagram.microstate where all primary amine groups are protonated (Fig. 6d). 45The different structure of the stable microstate results from weak site interactions, but widely different proton affinities of the primary and tertiary amine groups.For polypropyleneimine dendrimers, the site interactions are much stronger, while the affinities of the different amine groups are similar.
When the ionizable groups are situated further apart, the effect of interactions remains weak.When a larger number of neighboring groups are involved, however, their effect may be still important, and usually leads to a diffuse broadening of the titration curve.One observes this situation in more weakly charged polyelectrolytes (e.g., polyvinylamine and polyacrylic acid).Proteins are characterized by a wide distribution in the affinities of the ionizable residues, but by weak site interactions as well. 16

Outlook
The intrinsic approach to fully resolve microscopic ionization equilibria is discussed.Thereby, one assigns an intrinsic constant to each ionizable group, and introduces interactions between these sites.The interactions among pairs of neighboring sites are the most important ones.Their strength decreases rapidly with increasing distance between the sites.While higher order interactions may also be considered, only triplet interactions between neighboring sites seem relevant.By exploring molecular symmetries and group transferability, the intrinsic model can describe even rather complex molecules with a small number of parameters.Once these parameters are known, one can calculate other quantities of interest, including macroconstants, microconstants, and microstate mole fractions.These quantities provide insight into the protonation mechanism of the molecules in question.
The parameters entering the intrinsic model can be obtained by fitting experimental data, but two approaches appear to be most promising.The first approach relies on potentiometric titration data or the corresponding macroconstants.The parameter determination can be simplified by exploiting molecular symmetries, group transferability, or homologous series.The other powerful approach is the analysis of NMR titration curves, from which site-specific titration curves can be extracted.One might be also able to use other spectroscopic techniques, but as a rule, NMR is unsurpassed in terms of selectivity and resolution. 19,27However, even site-specific titration curves may be insufficient to determine all relevant intrinsic parameters. 21In such cases, a combined analysis of homologous series and of group transferability might help to resolve the problem. 46hile various approaches are now available to determine the intrinsic parameters, their molecular interpretation may be nontrivial.The microconstants often obey the group additivity relationships, but these shifts may be complicated by longer ranged interactions and resonance effects. 2 For example, secondary amines are more basic than primary ones in spermine, but more acidic in 2,2,2,2-pent.The latter shift is related to the electron withdrawing effect of neighboring amine groups. 2 The interaction parameters normally decrease with increasing distance between the ionizable groups, mainly due to the important role of Coulombic forces in these interactions.However, these parameters can be strongly influenced through their coupling to conformational degrees of freedom.In some cases, this coupling may even induce negative interactions and cooperative effects, as illustrated with 1,2,4,5-tetrakisphosphate here.

Fig. 1
Fig. 1 Ionization of glycine at an ionic strength of 0.1 M and 25 1C.(a) Structural formula and comparison with glycine methyl ester, (b) intrinsic, (c) macroscopic, and (d) microscopic models.Ionization constants are indicated in normal fonts (c, d) and in (b) in bold.Mole fractions with respect to all other microspecies within a macrostate are given in (d) in italic.The interaction parameter is given in (b) in italic bold.The intrinsic model was calibrated with the macroscopic constants and the ionization constant of glycine methyl ester.Calculated (e) site-specific titration curves (f) and the macroscopic prevalence diagram.

Table 1
Macroscopic protonation constants of linear aliphatic polyamines at 0.1 M and 25 1C 35 together with calculated values from the intrinsic model