Diazo compounds for the bioreversible esterification of proteins

A diazo compound is shown to convert carboxylic acids to esters efficiently in an aqueous environment.


Introduction
Esters are nearly absent from human cells, aside from acylglycerols and amino-acyl tRNAs. One reason is their lability in the presence of intracellular esterases. 1 This reactivity is the basis for the clinical utility of the many prodrugs that are unmasked by enzyme-catalyzed ester hydrolysis. [2][3][4][5] Inspired by prodrugs, we envisioned esterication as a means to modify proteins covalently but reversibly with desirable pendants, such as pharmacokinetic enhancing, cell-targeting, or cell-penetration moieties. The acetylation of serine and threonine residues is a natural post-translational modication. 6,7 There is not, however, an efficient means to generate esters from protein carboxyl or hydroxyl groups by chemical synthesis. Carbodimides and other reagents can be used to activate protein carboxyl groups, but solvent water and protein amino, hydroxyl, and sulydryl groups compete effectively with exogenous alcohols for the ensuing activated acyl groups. 8-10 Accordingly, we sought a different strategy-one that enlists Oalkylation rather than acyl transfer.
Diazo compounds are in widespread use in synthetic organic chemistry. [11][12][13] The simplest-diazomethane-readily converts carboxylic acids into methyl esters with molecular nitrogen as the only byproduct. Diazomethane suffers, however, from non-specic reactivity. For example, diazomethane is known to react with water and with the side chains of lysine and tyrosine residues. 14 Other non-stabilized diazo compounds or stabilized metal carbenoids are capable of esterifying carboxylic acids, [15][16][17][18][19][20] but their high reactivity likewise limits utility with biomolecules.
We were aware of precedent for the use of stabilized diazo compounds in a biochemical context. In the 1960s, 2-diazoacetamide, 21,22 N-(diazoacetyl)glycinamide, 23 and diphenyldiazomethane 24 were used to identify the most reactive carboxylic acid groups in proteins. These efforts required adding a vast molar excess (>10 3 -fold) of the diazo compound and tedious monitoring of reaction pH, all to achieve modest labelling. We suspected that the problem was non-specic reactivity, as with diazomethane. 14 Here, we report on the reactivity of still more stabilized diazo compounds with relevant carboxylic acids.

Results and discussion
We began by screening the reactivity of diazo compounds 1 and 2 with small-molecule carboxylic acids of varying acidity and bearing a variety of reactive functional groups. These diazo compounds were accessed from their corresponding azide by a deimidogenation reaction that uses a phosphinoester to convert an azide into a diazo-compound. 25,26 In acetonitrile, both diazo compounds were unreactive toward some nucleophiles common in biomolecules: amino, hydroxyl, and sulydryl groups. Both, however, did react with carboxylic acids to give comparable yields of ester product (Scheme 1). In contrast to diazo compounds 1 and 2, diethyl 2-diazomalonate was unreactive with a-h. Tellingly, b-alanine (h) proved unreactive with all three diazo compounds under any tested condition. This acid alone exists as a zwitterion, and its lack of reactivity is consistent with esterication occurring via a diazoniumcarboxylate salt (Fig. 1). 27,28 Moreover, we noted that a carboxylic acid (a-g) is acidic enough to promote the reaction, but a phenolic hydroxyl (d), sulydryl (f), or ammonium group (h) is not.
Next, we investigated the analogous reactions in an aqueous environment (Scheme 2). Whereas diazo compound 1 was competent for esterication in a 3 : 1 mixture of acetonitrile and 10 mM MES-NaOH buffer, pH 5.5, the major product was alcohol 5 formed when water attacks the diazonium ion. On the other hand, diazo compound 2 gave primarily the desired ester 4 in all cases. Interestingly, the ester : alcohol product ratios with 1 were variable, whereas those with 2 were approximately two. 29 These data are consistent with the nascent diazoniumcarboxylate salt being maintained in a solvent cage by Coulombic forces (Fig. 1), as described by others. 28,30 Carrying out the reaction in a 1 : 1 mixture of acetonitrile and 10 mM MES-NaOH buffer resulted in greatly deleterious partitioning with 1 but not 2. Diethyl 2-diazomalonate was found to be unreactive in these aqueous conditions, as in neat acetonitrile.
What is the basis for this differential reactivity? The pK a values of relevant acids have been measured in dimethyl sulfoxide. 31 These pK a values correlate with the observed reactivity via the mechanism of Fig. 1. Specically, the conjugate base of diethylmalonate (pK a 16.4) 32 is weak, and diethyl 2-diazomalonate cannot abstract a proton from a carboxylic acid. On the other hand, the conjugate base of diethylacetamide (pK a 35) 33 is strong, and diazo compound 1 lacks chemoselectivity in an aqueous environment (Scheme 2). Only the conjugate base of uorene (pK a 22.6) 34 is matched to the task, enabling diazo compound 2 access to the mechanism of Fig. 1 with high chemoselectivity.
We used thioacetic acid to probe the mechanism of esteri-cation. Its acidic proton resides primarily on sulfur, 35 but an intermediate diazonium-thiocarboxylate salt could lead to either a thioester or thionoester product. In anhydrous acetonitrile, complete selectivity for thioester formation was obtained with diazo compound 2 (Scheme 3). This selectivity rules out a cyclic transition state reminiscent of an ene reaction, 36,37 which would provide the thionoester as the product. On the other hand, diazo compound 1 produced a mixture of thio-and thionoester products. In an aqueous environment, these same reactants led exclusively to a thio-rather than a thionoester, along with the hydrolysis product from diazo compound 1. 38 The lack of thionoester formation can be attributed to the greater nucleophilicity of sulfur on the diazonium salt and the greater ability of oxygen to form a hydrogen bond with solvent water, 39 which would decrease its nucleophilicity still further.
Finally, we assessed the ability of diazo compounds 1 and 2 to esterify the carboxyl groups present in a model protein, bovine pancreatic ribonuclease (RNase A; Fig. 2A). The catalytic activity of this well-characterized enzyme provides an extremely sensitive measure of native structure and function. 40 RNase A was incubated for 4 h at 37 C in a 1 : 1 mixture of 10 mM MES-NaOH buffer, pH 5.5, and acetonitrile containing a diazo compound (10 equiv.). Under these conditions, diazo compound 1 proved incapable of labelling the protein. Only in the presence of 200 equiv. was any esterication observed with this reagent, consistent with its tendency towards hydrolysis in an aqueous environment (Scheme 2). In contrast, diazo compound 2 esteried, on average, three of the eleven carboxylates (Fig. S1 †). Using trypsin digestion coupled with mass spectrometry, we were able to identify the esteried residues. Diazo compound 2 (10 equiv.) labelled Asp14, Glu49, Glu111, and Asp121, almost exclusively. Diazo compound 1 (200 equiv.) labelled a similar subset of residues (Glu9, Asp14, Glu49, Glu111, and the C terminus). The basis for reaction with these particular residues is not clear, but likely integrates the reactivity (i.e., high pK a ) and accessibility of their carboxyl groups.
Most importantly, we probed the bioreversibility of our protein esterication reaction (Fig. 2B). To enhance the relevance of this experiment, we sought to employ esterases endogenous to human cells rather than commercial enzymes. Accordingly, we used recombinant DNA technology to add an 8residue FLAG tag to the N-terminus of RNase A, and we esteri-ed the ensuing FLAG-RNase A with diazo compound 2 as described above. We found labelling to be comparable to that of wild-type RNase A (Fig. S12 †), and we observed that the enzyme lost half of its catalytic activity upon esterication (Fig. S13 †). 41 Then, we incubated the esteried protein with a HeLa cell extract. We recovered the enzyme from this complex mixture with an anti-FLAG antibody immobilized on magnetic beads. We could not detect any esters in the extract-treated enzyme with mass spectrometry (Fig. S12 †), 42 and we found that the enzyme had recovered all of its catalytic activity (Fig. S13 †).
These data indicate that the protein esterication reaction was fully bioreversible.
Finally, we sought to demonstrate the generality of our esterication method by employing a second protein substrate with a different tag for its purication. Specically, we applied diazo compound 2 (10 equiv.) and the conditions described above to the mCherry variant of red uorescent protein (RFP) from Discosoma sp. We found that His 6 -RFP became esteried with 1-3 uorenyl groups. The protein did not lose its uorescence during the esterication reaction. As with RNase A, the esterication of RFP was bioreversible. Incubation with a HeLa cell extract followed by recovery with a Ni-affinity resin, yielded unmodied His 6 -RFP (Fig. S13 †). Success with both FLAGlabeled RNase A and His 6 -labeled RFP indicates that our ester-ication method has a broad scope.

Concluding remarks
We nd that a diazo compound can effect the efficient Oalkylation of carboxylic acids in an aqueous environment near neutral pH, and that this reactivity extends to proteins. The esters in the ensuing protein can be hydrolysed by esterases that are endogenous to human cells, thereby recreating the wild-type protein and avoiding a compromise to function or the display of an epitope. By enabling the facile semisynthesis of "proproteins" containing transitory pharmacokinetic enhancing, celltargeting, or cell-penetration moieties, we envision that Oalkylation of carboxylic acids by diazo compounds could broaden the utility of antibodies, enzymes, hormones, and other proteins in chemical biology and pharmacology.