Natalya
Voloshchuk
a and
Jin Kim
Montclare
ab
aDepartment of Chemical and Biological Sciences, Polytechnic Institute of New York University, Brooklyn, NY 11201, USA
bDepartment of Biochemistry, SUNY-Downstate Medical Center, Brooklyn, NY 11203, USA. E-mail: jmontcla@poly.edu; Fax: +1 718-260-3676; Tel: +1 718-260-3679
First published on 4th September 2009
The challenge of synthetic biology lies in the construction of artificial cellular systems. This requires the development of modular “parts” that can be integrated into living systems to elicit an artificial, yet programmed, response or function. The development of methods to engineer proteins bearing unnatural amino acids (UAAs) provides essential components that may address this challenge. Here we review the emerging strategies for incorporating UAAs into proteins with the endgame of engineering artificial cells and organisms.
Proteins are essential components of synthetic cellular networks. Synthesis of cellular pathways with the ultimate goal of creating functional artificial cells can benefit greatly from the generation of proteins with desirable properties that can be effectively regulated. Proteins capable of functioning independently of the host organism’s endogenous circuitry require further developments in “parts” design.
The recent advances in reassigning the genetic code to unnatural amino acids (UAAs)12,14–16 expand the molecular toolbox of protein “parts”. In fact, introduction of UAA can impart new functions that are difficult or impossible to create with proteins comprised of the natural 20 amino acid building blocks such as photo-induced switching,17IR probe active18 and redox sensitive19proteins. Moreover, proteins bearing UAAs may result in improved properties such as hyperstability20,21 and protease resistance.22 Such features may be useful for tailoring orthogonal proteins in the context of synthetic biological networks. Here we discuss approaches that focus on protein engineering and that can potentially expand synthetic biology methods by UAA incorporation for the creation of artificial proteins.
Fig. 1 Proteins in synthetic biology. (A) Repressilator23 (left) is a synthetic network under transcriptional regulation where repressor protein, LacI, inhibits the transcription of tetR repressor gene. The product of the tetRgene, in turn, inhibits the transcription of λcIgene. To complete the cycle, λCI inhibits lacI expression. TetR also inhibits transcription of the gfp reporter gene. Oscillations of the repressilator network are observed in E. coli cells by monitoring GFP fluorescence. Genetic toggle switch24 (right) is constructed of two repressors and two constitutive promoters, resulting in oscillations produced by transient inducers. In the network, promoters are inhibited by repressors that are under transcriptional control of the opposing promoter, i.e. Rep 1 inhibits expression of repressor 2gene. (B) Non-native protein–protein interactions (top)28 are designed to rewire cellular pathway. Ste5 mutant contains mutations that destroy recruitment of Ste7 kinase resulting in non-functional mating pathway in yeast. Replacement of defective recruitment interaction with a designed heterologous protein–protein interactions rewires the signal transduction pathway and alleviates mating deficiency. A single input protein switch26 (bottom) is turned on and off by introduction of competitive ligand (CL). In the “on” position, N-WASP domain is able to carry out actin polymerization. (C) The RBP receptor is computationally designed to bind non-biological ligand (TNT)32 and initializes signal transduction pathway through interaction with transmembrane histidine kinase, Trz, resulting in autophosphorylation and phosphorylation of transcription regulator OmpR. Phosphorylated OmpR activates the lacZ reporter gene expression. (D) Signalling metabolite acetyl phosphate is used to regulate an oscillating synthetic circuit.33 The circuit is comprised of two enzymes that are under the transcriptional control of acetyl phosphate and two metabolites (acetyl CoA and acetyl phosphate) to generate oscillations. |
In addition to protein–nucleic acid interactions, protein–protein interactions have also been manipulated to re-wire signal transduction cascades.26–31 As the first example of engineering protein–protein networks, Lim et al. inserted a heterologous protein interaction within the yeast MAP kinase pathway and demonstrated that simple tethering can produce the appropriate function or output.28 Known mutations that disabled the kinase function in the mating pathway in Saccharomyces cerevisiae were rescued by introduction of alternative interactions in the scaffold. Most remarkably, a synthetic pathway was designed where native interactions were destroyed by mutagenesis and non-native interactions were established through tethering a different kinase set, allowing for a non-natural input-output scheme (Fig. 1B).28 This confirmed that proteins could form functional pathways that were “modular and flexible”.28 Moreover, using the actin regulatory protein N-WASP as the model system, Lim and coworkers designed a synthetic switch where the output of N-WASP was essentially spliced into a PDZ domain-ligand pair.26 This resulted in the switch being in the “off” state and upon addition of PDZ ligand, the switch was turned “on” (Fig. 1B). To further expand the ability of these switches to process multiple inputs, a combinatorial library was synthesized.
Four parameters were varied: (1) domain type; (2) domain–ligand interaction; (3) linker length and (4) domain architecture.26 The library resulted in synthetic switches with a variety of novel gating behaviours to non-physiological inputs. In order to engineer artificial pathways and functions like the ones described,26–31,34 a method to systematically identify common protein motifs and protein–protein interactions is needed. Programs and databases are being developed to recognize, classify, and organize productive protein–protein interactions with the overall objective for construction and reprogramming of intracellular interaction for synthetic biosystems.35–37
Signal transduction cascades as well as metabolic pathways often rely on protein–small molecule interactions. Key elements of molecular recognition for effective receptor–ligand interaction can be extracted for building highly specific cellular pathways. An initial example of engineering proteins for new ligands focused on employing computational methods to alter the specificities of periplasmic binding protein (PBP) receptors that were experimentally tested for binding artificial ligands (Fig. 1C).32 When the re-engineered receptors for trinitrotoluene (TNT) and seratonin were integrated into a 2-component synthetic signal transduction pathway with a reporter gene in Escherichia coli, expected dose response signals were observed.32
In the case of metabolic pathway engineering,9,33,38 the small molecule and how it interacts with various proteins in a pathway are critical. The Liao group designed an oscillatory circuit that coupled metabolite inputs and outputs with transcriptional regulation.33 The metabolite acetyl coenzyme A (AcCoA) was designed to be converted into acetyl phosphate (AcP) by phosphate acetyltransferase (Pta) and acetate by acetate kinase (Ack). The enzyme acetyl coenzyme A synthetase (Acs) could then process acetate back into AcCoA (Fig. 1D). This two-metabolite system was connected to transcription by engineering the acsgene under the glnAp2 promoter, which could be activated by AcP. The glnAp2 promoter was also engineered to control the lac repressor to repress the ptagene.33 The oscillatory circuit demonstrated the importance of small molecules in the network connectivity, in addition to protein–protein and protein–DNA interactions within the network.
Because of the integral role proteins play in synthetic biology as described above, protein engineering methods are increasingly in demand for the design of transcription networks,39–41 novel input-output signal transduction cascades,27,42 and metabolic pathways.38,43 As the applications for synthetic biology become focused on generating an “artificial” or “non-natural” function or product, approaches that can enhance orthogonality within the system will become vital. Protein modifications such as UAA incorporation provides a valuable mode of integrating orthogonality and may enable tuning of a complex circuit beyond the constrains of nature.44
Fig. 2 Cotranslational insertion mechanism of selenocysteine and pyrrolysine.46,47 (A) An opal suppressor tRNA is charged with serine by seryl-tRNA synthetase. In the presence of selenium monophosphate, serine is converted into selenocysteine in a reaction catalyzed by selenocysteine synthase (SelA). SelB mediates selenocysteine insertion at the UGA codon adjacent to the SECIS element on the mRNA. (B) Pyrrolysine is charged to amber suppressor tRNA by pyrrolysyl-tRNA synthetase and inserted at the amber codon next to the PYLIS element. EF-Tu aids in the cotranslational insertion. |
Pyrrolysine is determined to be genetically encoded by Methanosarcinaceae by the UAG stop codon in the methylamine methyltransferasegenes required for methylamine metabolism as well as the trimethylamine methyltransferase homolog from the gram-positive Desulfitobacterium hafniense.48,49 Similar to selenocysteine, pyrrolysine incorporation is reported to depend on a specific structural element, termed pyrrolysine insertion sequence (PYLIS) in the mRNA.48 Whereas selenocysteine is synthesized on the tRNA, pyrrolysine is directly charged to tRNAPyl (pylT) by the designated pyrrolysyl-tRNA synthetase (PylS) (Fig. 2).48 Elongation factor EF-Tu binds tRNAPyl and is able to decode UAG as pyrrolysine as long as the pylT and pylSgenes are also provided.48 Because both selenocysteine and pyrrolysine utilize a stop codon for incorporation, it has been a challenge to identify these amino acids until recently.
Although the first two steps can be achieved by site-directed mutagenesis and use of suppressor tRNAs, the aminoacylation of the tRNA with UAA requires more effort. As an alternative to employing the AARS, tRNA aminoacylation with UAA can be accomplished with T4 RNA ligasein vitro.50 The UAA can be aminoacylated chemically upon reaction with pdCpA dinucleotide and then ligated to the truncated supressor tRNA lacking pCpA dinucleotide. Such aminoacylated tRNAs have been initially used in a cell-free translation mixture to incorporate UAA.51In vitro translation systems use cellular extracts that contain all the required components for efficient translation. The site of UAA incorporation is based on a readthrough of an amber (UAG) codon in mRNA by a suppressor tRNA aminoacylated with the analog. From initial in vitro experiments, it was recognized that the suppressor tRNA should not be charged by the endogenous AARSs from the cellular extracts since this would result in reduced efficiency of UAA incorporation. Thus, S. cerevisiae tRNACUAPhe is utilized in the in vitro E. coli expression system because S. cerevisiaetRNAs are orthogonal and not recognized by E. coli AARS.51 Although this allows for efficient incorporation of UAA, the chemically aminoacylated tRNA is not regenerated, limiting the amount of protein that can be produced.
Fig. 3 Selection strategies of orthogonal AARS/tRNA pairs for in vivo, site-specific incorporation using nonsense codons. (A) Negative and positive selection using barnase and β-lactamase as reporter genes for orthogonal amber suppressor MjtRNA in E. coli.15,61 When MjtRNA is charged by endogenous AARS, barnase is produced and cells die. Orthogonal MjtRNA cannot be recognized by cellular AARS and cells survive. Cognate AARS was identified by ability of cells to grow on ampicillin. (B) Positive and negative selection scheme for MjTyrRS specific for UAA. The selected mutRNACUATyr as shown in (A) is used for the following selections.54MjTyrRS library variants capable of activating UAA survive on chloramphenicol in presence of UAA, but cannot survive in media containing no UAA due to read through of the catgene. |
MjTyrRS variants that are able to charge suppressor MjtRNACUATyr with a particular UAA were identified from a library.18,54,57–61 Five residues in the active site of the MjTyrRS were randomized via saturation mutagenesis to generate a library of 1.6 × 109 variants.15,61 The residues were identified from the crystal structure of a TyrRS homolog from Bacillus stearothermophilus. A MjTyrRS variant capable of charging the suppressor MjtRNACUATyr with a selected UAA with high fidelity was identified through several rounds of positive and negative selection (Fig. 3B).
As a positive selection, the MjTyrRS library was transformed into E. coli bearing a plasmid with a gene for the orthogonal suppressor MjtRNACUATyr and chloramphenicol acetyltransferase (CAT) with an in-frame TAG codon. Surviving cells grown on chloramphenicol in the presence of UAA indicated that MjTyrRS was able to charge MjtRNACUATyr with UAA or natural amino acids. This positive selection was followed by negative selection, where plasmid containing MjTyrRS variants from survived cells was transformed into E. coli bearing a plasmid encoding barnase with several in-frame nonsense codons. Cells were grown in the media depleted of UAA. Cells expressing MjTyrRS variants capable of charging suppressor MjtRNACUATyr only with UAA survived because barnase was not produced. Genes for MjTyrRS variants that charged MjtRNACUATyr with UAA could be isolated and serve as a base for a new library created by DNA shuffling. The cycle of positive and negative selections could be repeated several times to identify the most effective and efficient MjTyrRS variants for UAA incorporation. Another selection method for synthetase specificity involved combination of chloramphenicol resistance and a fluorescent reporter allowing use of a single E. coli strain for positive and negative screens.54–55,63 The MjtRNACUATyr/TyrRS pair was used to incorporate a variety of UAA into proteins in E. coli15,18,54–60,63–65 (Table 1). In addition, other orthogonal tRNA/AARS pairs were developed for in vivo incorporation in E. coli66,67 (Table 1). Recently, a single plasmid selection system for the evolution of AARSs was reported by Schultz and coworkers.68 The fusion genechloramphenicol acetyltransferase (cat) and uracil phosphoribosyltransferase (uprt) was used as a positive/negative selection marker.68 Cells that were able to incorporate UAA in response to the amber codon were selected for in the presence of chloramphenicol and cells that harbored AARS variants that incorporated endogenous amino acids were selected against by adding 5-fluorouracil.68 The creation of orthogonal pairs further expands the molecular toolbox of UAAs that can be cotranslationally inserted into proteins.
Organism | tRNA/AARS pair | Derived from | Analog |
---|---|---|---|
E. coli | tRNACUAPhe/PheRS | S. cerevisiae | p-fluorophenylalanine 53 |
tRNACUATyr/TyrRS | M. jannaschii | p-aminophenylalanine 55,60,63 | |
naphthylalanine 55 | |||
p-benzoylphenylalanine 55,65 | |||
p-methyoxyphenylalanine55 | |||
p-aminomethylphenylalanine 55 | |||
p-methylphenylalanine 55 | |||
p-trifluoromethylphenylalanine55,56 | |||
p-nitrophenylalanine 55 | |||
p-isopropylphenylalanine 63 | |||
O-allyl-tyrosine 63 | |||
p-acetylphenylalanine 57,65 | |||
p-azidophenylalanine 54,65 | |||
p-iodophenylalanine 58,65,68 | |||
β-N-acetylglucosamine-serine59 | |||
p-cyanophenylalanine 18 | |||
p-hydroxyphenyllactic acid 64 | |||
O-methyltyrosine74 | |||
phenyl-selenocysteine 75 | |||
(2,2′-bipyridin-5-yl)alanine76,77 | |||
(8-hydroxyquinolin-3-yl)alanine 78 | |||
p-iodophenylalanine 68 | |||
2-amino-3-(8-hydroxyquinolin-3-yl)propanoic acid 78 | |||
tRNACUAPyl/PylRS | M. maize | N ε -cyclopentyloxycarbonyl-L-lysine79 | |
O-nitrobenzyl-oxycarbonyl-Nε-L-lysine79 | |||
tRNAAAAPhe/PheRS T415G | S. cerevisiae | 3-(2-naphthyl)alanine 80 | |
tRNACUAPhe/PheRS T415G | S. cerevisiae | p-bromophenylalanine 67 | |
benzothienylalanine66 | |||
6-bromotryptophan 66 | |||
6-chlorotryptophan 66 | |||
tRNACUAPyl/PylRS | M. barkeri | N ε -acetyllysine81 | |
S. cerevisiae | tRNACUATyr/TyrRS | E. coli | p-benzoylphenylalanine 62,69 |
p-azidophenylalanine 62,69 | |||
p-acetylphenylalanine 62,69 | |||
p-propargyloxyphenylalanine69 | |||
O-methyltyrosine62 | |||
p-iodotyrosine 62 | |||
4,5-dimehtoxy-2-nitrobenzylserine62 | |||
tRNACUALeu/LeuRS | E. coli | methionine and cysteine homologs82 | |
2-aminocaprylic acid82 | |||
2-aminononanoic acid 82 | |||
2-aminodecanoic acid 82 | |||
P. pastoris | tRNACUATyr/TyrRS | E. coli | p-benzoylphenylalanine 83 |
p-azidophenylalanine 83 | |||
p-(propargyloxy)-phenylalanine83 | |||
p-methoxyphenylalanine83 | |||
p-iodophenylalanine 83 | |||
tRNACUALeu/LeuRS | E. coli | 4,5-dimethoxy-2-nitrobenzylserine 83 | |
dansylalanine83 | |||
Mammalian cells | tRNACUATyr/TyrRS | B. stearothermophilus | 3-iodotyrosine 70 |
E. coli | p-acetylphenylalanine 73 | ||
p-benzoylphenylalanine 73 | |||
tRNACUATrp/TrpRS | B. subtilis | 5-hydroxytryptophan 71 | |
tRNACUATyr/TyrRS | E. coli | O-methyltyrosine72 | |
p-benzoylphenylalanine 72 | |||
tRNACUALeu/LeuRS | E. coli | 2-amino-3-(5-(dimethylamino)naphthalene-1-sulfoamido) | |
propanoic acid 72 | |||
tRNACUAPyl/PylRS | M. maize | o-nitrobenzyl-oxycarbonyl-Nε-L-lysine79 | |
N ε -tert-butyloxycarbonyl-L-lysine84 | |||
N ε -benzyloxycarbonyl-L-lysine84 | |||
N ε -acetyllysine84 |
Fig. 4 Positive and negative selection of orthogonal EcTyrRS/mutRNACUATyr pair specific for UAA in S. cerevisiae.62 Only S. cerevisiae cells carrying genes for orthogonal EcTyrRS/mutRNACUATyr pair that mediate UAA incorporation can survive on 3-AT or without uracil when grown in presence of UAA. In the absence of UAA, cells that contain active EcTyrRS/mutRNACUATyr pair cannot survive in presence of 5-FOA. When GAL4 amber mutations are suppressed with natural amino acids, URA3 is expressed and cells are converting 5-FOA to a toxic product.62 |
Exploiting the orthogonal pairs from previous selections, the Wang group utilized the EctRNACUATyr/TyrRS pair selected from yeast.72 Several expression constructs were tested in HeLa cell lines . In order to overcome the challenge of expressing sufficient quantities of functional tRNAs, the H1 promoter was utilized to drive the functional biosynthesis of EctRNACUATyr. Moreover, the 3′-flanking sequence of human tRNAfMet was also needed for efficient expression of the EctRNACUATyr in mammalian cells. The genes for EctRNACUATyr and EcTyrRS was engineered on a single tRNA/AARS expression plasmid. This plasmid was coexpressed with another plasmid containing the GFP gene with an in-frame TAG codon. Those EctRNACUATyr that were expressed and correctly processed was aminoacylated, yielding a fluorescent cell through suppression of the TAG codon in GFP. Orthogonality of EctRNACUATyr was tested by removing EcTyrRS gene from the expression construct. Transfection of the resultant plasmid into HeLa cells did not change fluorescent intensity, indicating inability of the endogenous synthetases to aminoacylate the EctRNACUATyr. In addition, they proved that the H1 promoter could efficiently express E. colitRNAs by demonstrating fluorescence using a different orthogonal EctRNACUALeu/EcLeuRS pair. For UAA incorporation, the EcTyrRS gene in the tRNA/AARS expression plasmid were replaced by the OmeTyrRS gene, a synthetase specific for O-methyltyrosine (OmeTyr) previously evolved in yeast.72
Successful suppresssion of TAG in GFP was demonstrated in HeLa cells grown in media containing UAA, with a 41% incorporation efficiency. Expression of BraRS, a synthetase specific for p-benzoylphenylalanine (Bra) evolved in yeast, led to 13% incorporation of the UAA. EctRNACUALeu and mutant synthetase specific for 2-amino-3-(5-(dimethylamino)-naphthalene-1-sulfonamido)propanoic acid (DanAla) were expressed in HeLa cells with a 13% incorporation efficiency. Similarly, incorporation of OmeTyr and Bra was achieved in mouse hippocampal neurons. While OmeTyr was incorporated into position 19 of the inactivation peptide of potassium channel Kv1.4 expressed in HEK293T cells using the EctRNACUATyr/OmeTyrRS pair, DanAla was incorporated using the EctRNACUALeu/DanAlaRS pair. The presence of different sized UAAs allowed insight into the inactivation mechanism of Kv1.4.72 RajBhandary, Sakmar and coworkers employed amber suppressor tRNA derived from B. stearothermophilus tRNACUATyr, and E. coli TyrRS variants developed previously to incorporate p-acetylphenylalanine (Acp) and p-benzoylphenylalanine (Bzp) into CCR5 and rhodopsin in HEK293T cells.73 In this case, the firefly luciferase reporter was used to screen for orthogonality and specificity of UAA incorporation. The luciferase reporter contained a TAG codon at position Y70, which in the presence of the appropriate orthogonal tRNA/TyrRS pair and UAA produced full length luciferase.
Fig. 5 Alternative approaches for site-specific incorporation. (A) UAA incorporation through nonsense suppression in Xenopusoocytes.88 Designed tRNA with a chemically ligated UAA is introduced into Xenopusoocytes by microinjection and translates the mRNA bearing the stop codon. (B) Frameshift method for site-specific incorporation of UAA using four-base codons in cell-free expression systems. When CGGG is translated by UAA-charged tRNA, full length protein is produced with the UAA inserted at the CGGG site.89 When CGGG is read as CGG by endogenous tRNAprotein synthesis is terminated. |
Expression system | tRNA | Analog |
---|---|---|
Xenopus oocytes | S. cerevisiae tRNA CUA Phe | 3-N-(7-nitrobenz-2-oxa-1,3-diazol-4-yl)2,3-diaminopropionic acid86 |
T. thermophila tRNACUA | over 10088 | |
T. thermophila tRNACUAGln | 5-fluorotryptophan 90 | |
T. thermophila tRNAACUGln | 5-fluorotryptophan 85 | |
S. cerevisiae tRNACCCGPhe | 5-fluorotryptophan 85 | |
S. cerevisiae tRNAACCCPhe | α-aminobutyric acid85 | |
norvaline 85 | ||
E. coli | Archae tRNAAGGALys/LysRS | L-homoglutamine93 |
Archae tRNAAGGALys/LysRS and Mj tRNACUATyr/TyrRS | L-homoglutamine and O-methyltyrosine93 | |
E. coli cell-free expression system | S. cerevisiae tRNAACCUPhe | p-biphenylalanine94 |
S. cerevisiae tRNAACCCPhe | p-benzoylphenylalanine 94 | |
S. cerevisiae tRNAACCGPhe | p-phenylazophenylalanine94 | |
p-naphthylalanine 94 | ||
S. cerevisiae tRNACCCGPhe | BODIPY FL p-aminophenylalanine95 | |
S. cerevisiae tRNAUACCGPhe | p-nitrophenylalanine 96 | |
ε-nitrobenzoxadiazolyllysine96 | ||
Eukaryotic rabbit reticulocyte cell-free expression system | S. cerevisiae tRNAACCGPhe | p-nitrophenylalanine 97 |
S. cerevisiae tRNAAGCGPhe | p-nitrophenylalanine 97 | |
S. cerevisiae tRNAAGGGPhe | p-nitrophenylalanine 97 | |
S. cerevisiae tRNAAGAGPhe | p-nitrophenylalanine 97 | |
S. cerevisiae tRNAAUAGPhe | p-nitrophenylalanine 97 | |
S. cerevisiae tRNAACCCPhe | p-nitrophenylalanine 97 | |
Sf21 insect cell-free expression system | M. acetivorans tRNAPylCCCG or | p-nitrophenylalanine 98 |
M. acetivorans tRNAPylUCCG | 1-naphthylalanine 98 | |
2-naphthylalanine 98 |
A selection system for efficient frameshift suppressor tRNAs with high decoding activity was developed recently.95 Synthetic tRNAs were ligated with 5′-phosphodeoxycytidyl-phospho-puromycin (pdCp-puromycin) by T4 RNA ligase. These puromycin-tRNAs were added to the cell-free translation system with streptavidin-linked mRNA that contained the CGGG codon at the C-terminus. Puromycin-tRNAs that bound to the ribosome produced steptavidin-tRNA fusions that could be released from the ribosome complex through reaction with EDTA and recovered with biotin-covered beads. To create a DNA library that encoded functional tRNAs, recovered steptavidin-tRNAs were subjected to RT-PCR. Several rounds of selection were performed through reaction with EDTA and recovered with biotin-covered beads. To create a DNA library that encoded functional tRNAs, recovered steptavidin-tRNAs were subjected to RT-PCR. Several rounds of selection were carried out to identify frameshift tRNACGGG with high efficient decoding activity in E. coli cell-free expression system.85 Frameshift suppression was also used in vivo to introduce L-homoglutamine in response to the four-base codon AGGA in E. coli.93 The orthogonal LysRS/tRNA pair derived from Archaea was used in the system.93 Simultaneous incorporation of two amino acids (L-homoglutamine and O-methyltyrosine) at two different positions in myoglobin was demonstrated in vivo in E. coli using both the developed frameshift suppression (Archae tRNAAGGALys/LysRS) and amber suppression (MjtRNACUATyr/TyrRS).93
In vivo UAA incorporation in Xenopusoocytes using frameshift suppression was demonstrated with SctRNACCCGPhe for 5-fluorotryptophan and SctRNAACCCPhe for α-aminobutyric acid or norvaline.85 Simultaneous incorporation of 5-fluorotryptophan and α-aminobutyric acid (or norvaline) was done by microinjection of both tRNACUA (THG73, described above) and SctRNACCCGPhe (or SctRNAACCCPhe) with a neuroreceptor mRNA. For incorporation of three amino acids, three tRNAs were microinjected into oocytes: 5-fluorotryptophan-tRNACUA/THG73, α-aminobutyric acid-SctRNACCCGPhe and norvaline-tRNAACCCPhe. Expression levels were adequate for electric current measurements of the functional neuroreceptor.89
Fig. 6 Orthogonal O-mRNA/O-ribosome selection in E. coli for improved UAA incorporation.99 (A) The ribosome binding site (RBS) on the mRNA is modified to eliminate endogenous ribosome binding. Uracil phosphoribosyltransferase (UPRT) is encoded by uppgene. 5-fluorouracil is converted by UPRT into an inhibitor of thymidylate synthase, causing cell death. Translation of cat allows O-ribosome containing cells survive on chloramphenicol. (B) Selection of O-ribosome with improved UAGA decoding in E. coli.100 Cells are transformed with O-rRNA library and the reporter plasmido-cat. Survival of cells on chloramphenicol indicates efficient UAGA decoding. |
The ribosome library was co-transformed with a reporter plasmid bearing the catgene with 2 UAGA stop codons fused downstream of the orthogonal ribosome-binding site (o-cat) to isolate ribosomes capabe of translating the O-mRNA.100 Cells containing o-cat (UAGA103,UAGA146)/tRNAser2UCUA and O-ribosome library were grown in the presence of chloramphenicol. The ribosome binding site sequence of the O-mRNA was altered so that it was not recognized by the endogenous ribosome.99,100 Through this selection U531G, U534A 16S rRNA mutant ribosome was identified for efficient UAGA decoding. This resulted in an O-ribosome variant that could only carry out translation of the O-mRNA, and not the cellular mRNAs.100 The O-ribosome/O-mRNA pair was capable of functioning in parallel to the cellular translation.100 The system offered the unique advantage of not interrupting cellular translation that could consequently cause cell death.100
Using a previously developed p-benzoylphenylalanyl-tRNA synthetase/tRNACUA (BraRS/tRNACUA) pair evolved from the MjTyrRS/tRNACUA pair in E. coli,63 in conjunction with the evolved O-ribosome with corresponding O-mRNA, they reported an increase in efficiency of 62% for site-specific p-benzoylphenylalanine (Bpa) incorporation into protein expressed in E. coli100,102Bpa incorporation at two amber stop codon sites in the test protein resulted in a slightly lower efficiency of 38%.100 Thus O-ribosome/O-mRNA pair was shown to effectively read through the amber codon for UAA incorporation. Essentially, the evolved O-ribosome decreased the functional interaction with E. coli RF-1, allowing suppressor tRNA to compete more efficiently for A-site binding in the presence of a UAG codon on the mRNA.100,102
In order to expand the structural diversity of UAA incorporated globally, modifications to the cellular biosynthetic machinery have been made. Of the components involved in protein translational, the AARS serves as the gatekeeper for accepting or rejecting a particular amino acid to be coupled with its tRNA. Specifically, the methods implemented to shepherd UAA incorporation relies on: (i) introducing additional copies of the natural AARS; (ii) enlargening the AARS binding pocket; and (iii) shrinking the editing domain of the AARS.
Substrate recognition specificity and editing function of the AARS have been modified in order to expand incorporation of non-isosteric amino acid analogs. In order to introduce bulkier UAAs, Henneke and coworkers created an A294G mutation in the E. coliphenylalanyl-tRNA synthetase (PheRS).136,137 This increased the size of the binding pocket to allow charging of tRNAPhe with the para-halogenated phenylalanine analogs: p-chlorophenylalanine and p-bromophenylalanine (Table 3).136–139 The same mutant PheRS was used by the Tirrell group to incorporate additional phenylalanine analogs p-azidophenylalanine, p-iodophenylalanine, p-ethynylphenylalanine and p-cyanophenylalanine in E. coli (Table 3).127,140,141 The introduction of a T251G in addition to the A294G mutation in PheRS enabled the incorporation of ketone functionality of p-acetylphenylalanine into recombinant dihydrofolate reductase (DHFR) (Table 3).143 As an alternative to altering the binding pocket of the AARS, others have made changes to disable the editing domain to facilitate UAA incorporation. First demonstrated by Schimmel and coworkers, a single amino acid substitution in the editing pocket of valyl-tRNA synthetase (ValRS) from threonine to proline resulted in the misacylation of tRNAVal.144 This enabled aminobutyrate incorporation in response to valinecodons with 20% efficiency.144 Subsequently Tang and Tirrell illustrated that substitution of T252 in the editing pocket of leucyl-tRNA synthetase (LeuRS) with larger amino acids led to misacylation of tRNALeu, allowing the incorporation of norvaline, norleucine, allylglycine, homoallylglycine, homopropargylglycine and 2-butynylglycine (Table 3).145
Recently, the development of screening methods to identify variants of AARS capable of charging amino acid analogs has allowed for the rapid expansion of residue-specific UAA incorporation.103,104,120 A saturation mutagenesis library of MetRS variants was screened to identify mutants that enable azidonorleucine incorporation by Tirrell et al.103 (Fig. 7A, Table 3). The library was cloned into a vector that also contained the gene for outer membrane protein C (OmpC). Reactive UAA, such as azidohomoalanine or azidonorleucine, was introduced into OmpC in E. coli. Since the OmpC was expressed on the outer surface of the E. coli, cells displaying UAA could be covalently biotinylated by a Cu-catalyzed azide-alkyne ligation and labeled with fluorescent avidin.120Fluorescence-activated cell sorting (FACS) was employed to sort and differentiate labeled from unlabeled cells. Selected cells could undergo another round of sorting viaFACS to identify MetRS variants able to charge the azide-bearing UAAs. Through this screen a MetRS variant with a L13G mutation was identified to incorporate azidonorleucine.127
Fig. 7 Screening methods of AARS mutants for residue-specific incorporation of UAAs. (A) Identification of MetRS variants in E. coli for incorporation of azidonorleucine by exploiting its reactive side chain via bio-orthogonal 3 + 2 cycloaddition.103 Cells are transformed with plasmid bearing a MetRS library and the reporter OmpCgene. OmpC expression is induced in presence of UAA. Successful incorporation of UAA results in cell surface display of reactive side chain that forms covalent bond with reactive biotin in the presence of Cu(II). Biotin on the cell surface is subsequently bound to fluorescent avidin. Employing fluorescence-activated cell sorting (FACS), labelled cells can be isolated. Sorted cells can be analyzed to identify MetRS mutants or subject to additional screening. (B) Screening for MetRS variants that activate trifluoronorleucine for residue-specific incorporation in E. coli.104 Cells transformed with a construct containing a MetRS library and the GFP reporter gene are grown in media containing 19 amino acids in the presence (positive screening) or absence (negative screening) of UAA. Cells collected in negative screen are subjected to further positive and negative screening to identify cells that grow in the presence of UAA and exhibit high fluorescence. |
For this type of screen, the UAA must be limited to a bioorthogonal chemical handle for reaction with the label. To overcome this limitation, OmpC was replaced with a GFP reporter (Fig. 7B).104 All the methionine residues in the β-barrel region of GFP were randomized first to generate a mutant that retained its fluorescence even upon methionine replacement. A GFP mutant in which all the methionine residues were replaced and exhibited fluorescence comparable to wild type was identified. Five methionine residues were then introduced into this mutant at positions that did not alter fluorescence of the folded GFP. This resulting mutant GFP was used for the screening process. Cells were transformed with a plasmid bearing the MetRS library and mutant GFP were grown in media supplemented with 19 amino acids in the presence of 6,6,6-trifluoronorleucine (Tfn) and sorted by FACS as a positive screen (Table 3). A negative screen was perfomed in which the selected cells were expressed in 19 amino acids in the absence of methionine to remove any MetRS variants that could activate natural amino acids other than methionine The postive and negative screening was repeated several times to enrich for clones active towards UAA incorporation. The resulting MetRS variant (L13S, Y260L, H301L) was identified to demonstrate Tfn incorporation into the recombinant test protein DHFR with near complete replacement.
Reassignment of degenerate sense codons was used to incorporate L-3-(2-naphthyl)alanine (Nal) at five positions in DHFR by Tirrell and coworkers80 (Table 1, Fig. 8). Similar to site-specific incorporation, an orthogonal SctRNA/PheRS pair was introduced into the E. coli expression system. Rather than utilizing a suppressor tRNA, they employed the tRNAGAAPhe from yeast and altered the anticodon to produce tRNAAAAPhe. This allowed for exact Watson–Crick pairing with the UUU minor codon such that Nal could be incorporated in response to their presence. The yeast PheRS T415G mutant was selected to accommodate the bulky Nal. This mutant ScPheRS/ tRNAAAAPhe pair was used for the expression of DHFR in an E. coliphenylalanine auxotrophic strain.80 Although the tRNAAAAPhe was shown to compete with endogenous tRNA for UUU decoding, the codons were assigned to Nal with 80% efficiency.105 The efficiency of incorporation was comparable to the efficiency of site-specific incorporation.55 Moreover, by this method, five residues were replaced with the UAA.
Fig. 8 Reassignment of degenerate sense codonsin vivo in E. coli.80,105 A modified orthogonal tRNAAAA/PheRS pair from S. cerevisiae are introduced into E. coli that enable incorporation of naphthylalanine in response to the UUU codon. At the same time, endogenous E. colitRNA/PheRS pair mediates phenylalanine incorporation in response to the UUC codon. |
Fig. 9 Biosynthesis and site-specific incorporation of p-aminophenylalanine (pAF) in E. coli.60 The genespapA, papB, and papC from S. venezuelae were engineered into the constructs to enable the in vivo production of pAF. |
Sequence optimization experiments were reported for GFP containing UAA at one position by Sisido et al. (Fig. 10).149 These experiments were carried out in cell-free E. colitranscription system. Specifically, p-aminophenylalanine (pAF) was incorporated into GFP at Y66 in response to the frameshift CGGG codon, resulting in reduced fluorescence intensity as compared to wild type GFP. Saturation mutagenesis was used to create library of mutants with amino acid substitutions at S65 and Y145 (Fig. 10). From this library, 42 colonies were subjected to colony PCR, in vitro translation and fluorescence measurements. Three mutants, Y145L, Y145F and Y145M, were identified to exhibit improved fluorescence intensity with pAF at position 66. By combining site-specific incorporation with directed evolution, they were able to enhance the fluorescent property of the GFP bearing the UAA.
Fig. 10 Identification of a variant with improved fluorescence of GFP bearing as site-specific inserted UAA.149 GFP library, created by saturation mutagenesis, was transformed into E. coli. Selected colonies were isolated for GFP gene amplification . These genes were translated in cell-free E. coli expression system in the presence of UAA and expressed variants were analyzed by fluorescence measurements. |
Residue-specific incorporation and directed evolution was integrated to recover or enhance the function of proteins bearing UAAs (Fig. 11). The incorporation of 5′,5′,5′-trifluoroleucine (TFL) into chloramphenicol acetyltransferase (CAT) yielded a loss in thermostability.129,148,156 In the case of CAT, Montclare and Tirrell created random mutagenesis libraries by error-prone PCR, transformed them into E. colileucine auxotrophic strains and expressed them in the presence of TFL in 96-well plates.129
Fig. 11 Scheme of directed evolution protocol for CAT129 and GFP148proteins with UAA incorporated in residue-specific manner expressed in E. coli. Gene library generated by error-prone PCR was transformed in E. coli and grown in media containing UAA. Variants were screened for desirable properties: activity for CAT, fluorescence intensity for GFP. The best variants were chosen for additional rounds of directed evolution. |
A high-throughput colorimetric screening method was developed for cell lysates bearing the fluorinated variants.129 Most active clones were identified for the additional rounds of random mutagenesis and screening. A fluorinated CAT variant with enhanced thermal stability containing three mutations (K46M, S87N and M142I) was identified.129 Recently, this screen was also employed by the Montclare and coworkers to obtain an activity and thermostability profile of single-TFL-to-isoleucine mutants.157 This protocol of applying directed evolution to proteins bearing UAA was performed on GFP using FACS as a screening method. The incorporation of TFL into GFP resulted in a loss of fluorescence.148 To identify GFP variants with enhanced folding character, a random library was generated via error-prone PCR similar to the CAT work but with a slightly higher mutation rate.129,148 Cells underwent additional rounds of directed evolution resulting in the isolation of a fluorinated GFP mutant bearing twenty mutations with a total of fifteen fluorinated analogs.148 These experiments demonstrate that through combination of residue-specific UAA incorporation and directed evolution, functional proteins can be achieved.129,148
Recently site-specific incorporation in combination with phage display was employed to select a highly functional artificial antibody containing UAA by the Schultz and Smider groups.153 Multivalent hyperphage display was employed in which the C-terminus of the antibody scFv fragment was fused to pIII. Phagemid-encoded scFv was engineered to bear an amber codon within its DNA sequence, resulting in the incorporation of UAA at position 111 in the protein.153 Read-through of the amber codon yielded expression of functional pIII leading to phage assembly and scFv display on the phage surface153 (Fig. 12). By optimizing expression system, the bias of low yield of UAA containing scFv was eliminated and directed evolution experiments were carried out.153 The phage displayed scFv bearing the UAA sulfotyrosine (sTyr) was selected for gp 120 binding.153 Enhanced binding affinity and specificity of gp120 binding by the identified antibody was further confirmed and characterized by ELISA. These results demonstrate that directed evolution of proteins with UAA can offer a selective advantage through the specific functional role it provides beyond the natural set of 20 amino acids.
Fig. 12 Illustration of phage display integrated with site-specific UAA incorporation.153–155 Phage produced by E.coli displayed scFv antibody that undergoes directed evolution. After cleavage with protease and elution, scFv antibodies were assayed for gp120 binding, and the best variants were selected to undergo another round of directed evolution. |
This journal is © The Royal Society of Chemistry 2010 |