Jayani A. Christopher,
Sahan A. Galbada Liyanage,
Eve M. Nicholson,
William D. Kinney and
T. Ashton Cropp*
Department of Chemistry, Virginia Commonwealth University, Richmond, VA 23284, USA. E-mail: tacropp@vcu.edu
First published on 29th November 2022
Here we report the synthesis and genetic encoding of the lysine post translational modifications, β-hydroxybutyryl-lysine, isobutyryl-lysine and isovaleryl-lysine. The ability to obtain a homogenous protein samples with site-specific incorporation of these acylated lysine residues can serve as a powerful tool to study the biological role of lysine post translational modifications.
The lysine PTM, β-hydroxybutyryl-lysine, (HBK) (Fig. 1), was discovered in 2016 as a modification of histone proteins.9 It was shown that this modification is directly related to the concentration of the corresponding ketone bodies available in the cell, and is thus heavily influenced by metabolism. Interestingly, HBK not only quenches the lysine positive charge, but the presence of a hydroxyl group introduces the potential for additional hydrogen bonding, that could contribute to the alteration of protein–protein or protein DNA interactions.8 Interestingly a similar structure, isobutyryl-lysine (IBK), was also recently discovered as a histone modification mark.16 Indeed, many other acylations that are derived from the corresponding acyl-CoAs have been observed, including isovaleryl-lysine (IVK).17 Given the structural similarity of these amino acids, and the importance of understanding epigenetic regulation, we attempted to add these three amino acids to the genetic code.
Previously, we reported the successful incorporation of a different lysine PTM, 2-hydroxyisobutyryl lysine PTM into the genetic code of E. coli using the pyrrolysyl-tRNA synthetase (PylRS) and the orthogonal tRNACUA originally from Methanosarcina mazei species.18 Xiao and co-workers also successfully incorporated this PTM into recombinant histone proteins in E. coli and mammalian cells using an evolved PylRS/tRNACUA pair which was initially obtained from Methanosarcina barkeri (Mb) species.19 Here we show that this pair can also be used for the successful genetic encoding of these related PTMs. These analogues have been identified as potential lysine PTMs due to the availability of the corresponding acyl-CoAs from metabolic pathways and similar counter parts have been discovered in different proteins.17,20
We first prepared the HBK amino acid in several steps from commercially available benzoyl protected Boc-lysine (Nε-Cbz-Boc-lysine). The acylating agent, ethyl-3-hydroxybutyrate was protected by a TBDMS group to avoid reactivity of the 3-hydroxyl group, which we found critical for success (Scheme S2†). The sequence of our synthesis allowed us to produce the amino acid as a methyl ester (HBK-OMe), which was expected to be more bioavailable than the free amino acid. Methyl esters of amino acids can be hydrolysed in the cell,21 and we found that in the case of HBK, esterification was essential for successful protein production. Both IVK and IBK were synthesized as free amino acids using a similar synthetic route starting with Boc-lysine and isovaleryl chloride and isobutyl chloride for acylation, respectively (Scheme S3†). It was not necessary to use these as methyl esters.
As an initial approach, we performed a genetic selection against a Methanosarcina mazei PylRS (MbPylRS) active site library using HBKOMe. Surprisingly, the vast majority of enriched clones from this library consisted of wild-type active PylRS sequences. Importantly, these gene sequences consisted of many variants of degenerate codons that encoded the wild-type residues, suggesting that the wild-type sequence was enriched from the selection. To confirm the substrate specificity of the synthetase, a screening was performed by using a reporter plasmid that expresses GFPuv from a T7 promoter, and a T7-RNA polymerase gene containing an in-frame amber stop codon. This reporter is quite sensitive as a single amber suppression even can lead to multiple copies of GFPuv being expressed. As shown in Fig. 1, fluorescence was observed only in the presence of the acylated lysines, with little to no background expression in the absence of supplemented amino acids. The observed activity was similar to a positive control, Nε-Boc-Lys, which is often used as a benchmark in unnatural amino acid mutagenesis studies. This indicates that all three amino acids serve as substrates for the wild-type MbPylRS.
To isolate protein containing these amino acids, we performed an expression of superfolder GFP (His6-sfGFP) containing an amber stop codon mutation in place of Y151, and an N-terminal 6X-histidine tag. The expression was performed by growing the cultures in the presence of 2 mM IBK, IVK and HBK-OMe and purified the His tagged proteins with Ni2+ affinity chromatography under native purification conditions. Protein expression was only observed in the presence of the amino acids (Fig. 2), again confirming that endogenous amino acids are not substrates for the synthetase. The ESI-MS mass spectra of the intact proteins confirmed incorporation of the correct amino acid (Fig. S1†). The mass spectra did indicate the presence of some free lysine residues, perhaps the result of hydrolysis reactions in vivo, or during protein purification (Fig. S1†).
Finally, to demonstrate expression of the proteins most likely to be studied using these amino acids, we expressed human histone H3 protein containing the amber stop codon in place of the codon for lysine 9. This position has been observed to contain lysine modifications, including HBK, in cellular studies. The protein expression was performed with 4 mM BocK, IBK, IVK and HBK-OMe cultures alongside a wild-type histone H3 containing a native K9 residue. H3-proteins were purified by Ni2+ affinity chromatography under denaturing conditions, which has been used in the past to successfully re-fold functional protein.3 The protein expression was observed only in the presence of the acylated and wildtype lysines. The expression with IVK showed the highest yield when, whereas HBK-OMe expression was observed to be the lowest (Fig. 3). Nevertheless, as seen before, no expression was observed in the absence of exogenous amino acids indicating efficient site-specific incorporation. While HBK expression was low in this example, in other protein expressions, yields are comparable.
In summary, we were able to successfully synthesize and add new lysine PTMs to the genetic code of E. coli using the Mb PylRS/PylRS tRNACUA orthogonal pair. This leverages this system and enables the synthesis of natural human histone proteins containing newly discovered PTMs at site-specific locations. These protein products could serve as unique substrates to be used in deciphering the substrate specificity of enzymes that act on PTMs such as histone deacetylases. Moreover, this approach should enable the in-depth study of the consequences of these modifications with residue-level precision.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2ra04898a |
This journal is © The Royal Society of Chemistry 2022 |