Anup Adhikari†
a,
Bibek Raj Bhattarai†a,
Ashika Aryalb,
Niru Thapaa,
Puja KCa,
Ashma Adhikaria,
Sushila Maharjana,
Prem B. Chandac,
Bishnu P. Regmid and
Niranjan Parajuli*a
aBiological Chemistry Lab, Central Department of Chemistry, Tribhuvan University, Kritipur, 44618, Kathmandu, Nepal. E-mail: niranjan.parajuli@cdc.tu.edu.np
bDepartment of Chemistry, Birendra Multiple Campus, Tribhuvan University, Bharatpur, Chitwan, Nepal
cDepartment of Chemistry and Physics, Southeastern Louisiana University, Hammond, Louisiana 70402, USA
dDepartment of Chemistry, Florida Agricultural and Mechanical University, Tallahassee, Florida 32307, USA
First published on 26th November 2021
Unnatural amino acids have gained significant attention in protein engineering and drug discovery as they allow the evolution of proteins with enhanced stability and activity. The incorporation of unnatural amino acids into proteins offers a rational approach to engineer enzymes for designing efficient biocatalysts that exhibit versatile physicochemical properties and biological functions. This review highlights the biological and synthetic routes of unnatural amino acids to yield a modified protein with altered functionality and their incorporation methods. Unnatural amino acids offer a wide array of applications such as antibody-drug conjugates, probes for change in protein conformation and structure–activity relationships, peptide-based imaging, antimicrobial activities, etc. Besides their emerging applications in fundamental and applied science, systemic research is necessary to explore unnatural amino acids with novel side chains that can address the limitations of natural amino acids.
The structure of UAAs may resemble or differ significantly from natural amino acids and they are referred to as analogues or surrogates, respectively.3 The synthesis and applications of UAAs have received considerable attention, particularly in the field of enzymology and drug discovery. In protein engineering, the ability to substitute any natural amino acid with UAA at a particular point has emerged as a valuable molecular tool because such substitution allows the introduction of altered physicochemical and biological properties.5 UAAs can be incorporated into many structural units to develop potential leads in peptidic and non-peptidic complexes. The incorporation of UAAs into protein expands its functional diversity in biophysics, spectroscopy and optical probes, bio-orthogonal chemistry and protein labelling, post-translational modifications; their mimetics, signal transduction, protein interactions mapping, and photoactivated motifs.6 Furthermore, UAAs dramatically expand the possibilities for using chiral building blocks and molecular scaffolds to construct combinatorial libraries.7
UAAs can be synthesized chemically or they are produced naturally as secondary metabolites in several organisms, such as bacteria, fungi, plants, or marine organisms.8 Over the past few years, by constructing engineered microbial strains, various metabolic pathways have aided in the production of UAAs. Modern discoveries allow promising methods to produce sufficient amounts of proteins in vitro and in vivo using UAAs. The site-specific incorporation of UAAs into enzymes has enabled structural modification, accepting a wide range of substrates such as ketone, azide, alkyne, alkene, tetrazine, etc. Various probes, designed using UAAs, can be used for diverse applications, such as evaluating the effects of small molecule proteasome stimulators in live cells and comparing proteasome activity in different cancer cell types.9 Here, in this review, we have focused on the synthesis, incorporation approaches, applications of UAAs, as well as their limitations and prospects. We believe this review would provide useful information that helps researchers to reprogram natural proteins by incorporating UAAs.
The two primary biocatalytic methods for the synthesis of UAAs are kinetic resolution and asymmetric synthesis. While the kinetic resolution is limited by a maximum theoretical yield of 50%, asymmetric synthesis provides a theoretical yield of 100%.19,20
Asymmetric synthesis is an effective process that occurs without any protective group(s) and is based on straightforward reaction steps to synthesize new UAAs. Substrates are asymmetrically converted into their optically pure form by using transaminase and dehydrogenase.21 Deamination of L-threonine 1 followed by hydrogenation gives a better theoretical yield than other processes (Fig. 1e).22 Various amino acid dehydrogenases have been developed and utilized for enzymatic chiral amino acid synthesis.20,23 Even naturally occurring amine dehydrogenases have been engineered to improve their substrate specificity and catalytic efficiency. Cai et al.24 have synthesized γ-aminobutyric acid from biobased lignocellulosic waste using an engineered amine dehydrogenase.
Similarly, asymmetric synthesis by transaminase is much simpler and effective since the substrates like keto acids can easily be converted into amino acids. Displaying broad substrate specificities, transaminase also plays a vital role in nitrogen metabolism as an industrial biocatalyst. It exhibits a rapid reaction rate and requires no external cofactor regeneration. Hence, α-transaminases like tyrosine, aspartate, valine–alanine, branched-chain, and aromatic amino acid transaminases are widely studied.25 Approximately 50% of UAAs successfully incorporated into proteins are derived from tyrosine or phenylalanine.26 Synthetically easily accessible α-keto acid is taken as a target intermediate that can be easily converted into an amino acid with the help of enzymes in a single step. The keto group is replaced with an amino group in a stereospecific manner. In this process, aminotransferase (transaminase), which uses pyridoxal 5′-phosphate as a cofactor, reversibly catalyses the transamination reaction.26 The transaminases possessing relaxed substrate specificity and high enantioselectivity are widely employed for the biosynthesis of many UAAs. L-Homophenylalanine was synthesized using an enzyme of transaminase class, namely aspartate aminotransferase isolated from Paracoccus denitrifican.27 Similarly, several L-thienylalanines28 and L-phosphinothricin29 have been synthesized by employing tyrosine aminotransferase.
The biological synthesis of UAAs by the kinetic resolution uses different enzymes such as lipases and nitrilases during the synthetic process.30 Acylase I (aminoacylase; N-acyl amino acids amidohydrolase), the most widely used and applicable enzymatic catalyst, has been employed for the kinetic resolution of UAAs and α-amino acids. This enzyme accepts the substrate with a broad range of functionality and structure. Chenault et al.31 have studied this approach to synthesize UAAs by utilizing acylase from Aspergillus oryzae and porcine kidney. In this method, acylase-I catalysed the enantioselective hydrolysis of N-acyl-L-amino acids ±11 to 11 and 12. D-Amino acid and L-amino acid products having a variety of uses were obtained with high enantiomeric excess (Fig. 1f).
Fig. 5 Synthesis of UAAs (a) synthesis of PrDiAzk 57 (b) synthesis of BN-tryptophan ester 63 (c) synthesis of UAA from oxidation of proline (d) synthesis of chiral spin-labeled amino acid 69. |
Of specific interest are catalytic methods with high levels of enantioselective control. A ruthenium catalyst was used for asymmetric hydrogenation of N-acetylamino phenyl acrylic acids 18 to yield chiral acids 20 (Fig. 2b), which are expected to be important in pharmaceuticals.34,35 Conformationally constrained peptides have a lower penalty for folding, which facilitates bioactive conformation. A new generation of chemical and drug discovery tools has emerged from conformationally constrained peptides. Asymmetric alkylation of a fluorine-modified Ni(II) Schiff base complex (23, 24) yielded an unnatural alkenyl amino acid (25, 26) required for peptide ‘stapling’ (Fig. 2c).36 The most apparent technique for producing UAAs is the alkylation of glycine.34 Moving to a more economical and environmentally suitable chiral phase transfer catalyst, the asymmetric alkylation of glycine has a yield between 60% and 90%. However, these synthetic routes depend on toxic reagents such as methylsulfonyl chloride.37,38 To minimize the effect of toxic reagents, 1-ethyl-3-(3-dimethyl aminopropyl)carbodiimide/4-dimethylaminopyridine (EDCI/DMAP) as a coupling agent, a chiral intermediate i.e. derivative of glycine non-proline 27 was synthesized starting with proline and then alkylating in the presence of sodium tert-butoxide which obtained 28 (Fig. 2d). By using this method, five UAAs for peptide stapling 29 were prepared with high enantioselectivity >99% ee and yields ranging from 60% to 70%.37 Furthermore, a new methodology for the asymmetric synthesis of UAAs-containing peptides 32 was also developed using imino peptides 30 such as α-imino perfluro alkylesters, imides, or thioesters (Fig. 2e).39
Known as an inexpensive and abundant metal, iron is commonly utilized as a catalyst in organic synthesis. Developing an effective and practical diastereoselective method to synthesize unnatural chiral α-amino acids using iron as a catalyst is highly desired. Toward this end, an iron-catalysed diastereoselective method has been developed; the method employs widely accessible iron salts, 2-phthaloyl acrylamide 35, and alkenes 36 as starting materials, and phenyl silanes as a reductant for unnatural chiral (S)-α-amino acids with γ-quaternary carbon centres 37 (Fig. 4b).41 This protocol has several benefits, including simple and broad substrates, moderate conditions, excellent diastereoselectivity, and simple workup methods.
It has been demonstrated that bifunctional UAAs called PrDiAzK 57 can be inserted into the protein interface, using genetic code expansion, with a minimum structural perturbation. Hoffmann et al.47 demonstrated bifunctionality for UAAs based on Z-lysine, benzophenone-alkyne “BPKyne,” and γ-selenolysine, in which a lysine-based bifunctional amino acid combines a photoreactive diazirine group with a terminal alkyne handle for reaction with azides using copper-catalysed click chemistry. The starting material was commercially available dihydroxyacetone 51. In liquid ammonia, the ketone was converted into diazirine (52, 53) in a two-step reaction. Propargyl bromide was used to alkylate one of the two hydroxyl groups. The single-substituted product 54 was isolated, converted into a reactive carbonate, and bound to lysine (56, 57) (Fig. 5a).47 In the hope to utilize this UAA in further protein studies and classify its properties concerning the natural substrate, boron and nitrogen-containing unnatural analogues of tryptophan 63 have also been synthesized through the functionalization of BN-indole 58. In this case, TBS-BN-indole 58 was subjected to a substitution reaction with dimethyliminium chloride. Iodomethane was used to methylate the resulting product 59, which was then displaced with silyl-ketene-acetal. The Schiff base protecting group of 62 can be removed in aqueous acidic conditions. Following this, the silyl protecting group was deprotected to yield BN-tryptophan ethyl ester 63 (Fig. 5b).48 More notably, these studies demonstrated the first example of an azaborine containing amino acid being introduced into proteins.
Furthermore, the production of genetically encoded fluorescent α-amino acids that produce greenish-blue light has widespread applications in research, biotechnology, and the pharmaceutical sector. The amino acid 4-phenanthren-9-yl-L-phenylalanine (Phen-AA) 79 that emits greenish-blue light in the visible region has good stability with a 75% quantum yield. As shown in Fig. 6c, its production was initiated with the synthesis of N-(tert-butoxycarbonyl)-4-iodo-L-phenylalanine methyl ester intermediate (Boc-Phe(4-I)-OMe) 77 where the carboxylate group through methylation of the commercially available N-(tert-butoxycarbonyl)-4-iodo-L-phenylalanine 76. The next step involved Suzuki–Miyaura cross-coupling reaction which coupled α-amino acids bearing vinyl or aryl halide side-chains with polyaromatic boronic acids and resulted in N-(Boc)-4-(9-phenanthracenyl)-L-phenylalanine methyl ester 78 intermediate. De-protection of the methyl and Boc protecting groups of 4 produced the final L-α-amino acid 4-phenanthracen9-yl-L-phenylalanine 79 i.e. novel fluorescent α-amino acid that emits greenish-blue fluorescence.53
Also, a synthetic method for new enantiomerically enriched non-protein (S)-α-amino acids 86 was developed to generate new carbon–carbon bonds in the side chain group of the amino acid moiety with high chemical yields and optical purity using the reaction of Suzuki. In this method, as the initial synthon, the Schiff base Ni(II) complex 84 of the chiral auxiliary (S)-BPB (N-benzyl proline benzophenone) and 4-bromo-L-phenylalanine amino acid were employed. The final Ni(II) complexes 85 were decomposed with aqueous HCl, and the amino acids were isolated with excellent enantioselectivities (>99% ee). The chiral auxiliary ligand (S)-BPB can be recycled and reused for the synthesis of starting Ni(II) complex (Fig. 6f).57
The Negishi cross-coupling is a powerful C–C bond-forming reaction widely utilized in many areas of organic synthesis. The Negishi cross-couplings were used to synthesize various UAAs like aromatic, heteroaromatic, and complex amino acid derivatives such as aspartic and glutamic acid derivatives 89, aryl glycine derivatives, phenylalanine derivatives, amino acids containing metal-coordinating side chains, β-amino acids, aryl bromides with an iodo serine derivative, naphthyl-appended amino acid, bis(amino acid) derivative, fluoroaromatic amino acid, p-(C-glucopyranosyl) phenylalanine derivative, cycloalkenyl-protected amino acid derivatives (Fig. 4g and 6g).58
Derivatization of natural amino acids including cysteine, threonine, serine, and tyrosine containing heteroatoms such as N, S, O could open the door to synthesizing many possible UAAs. Undoubtedly, novel synthetic methods to construct UAAs will continue to provide new ways for peptidomimetics.
UAA(s) placed in proteins at strategically selected locations offer a new and fascinating variety of protein research and engineering options.61 In vivo incorporation of UAAs can be carried out in two ways: in site-specific approach and residue-specific approach. The site-specific incorporation uses an exogenously evolved tRNA or synthetase pair, and residue-specific incorporation uses the misacylation of the endogenous tRNA.60 In vivo incorporation methods are marked with many engineering challenges like UAAs transportation across the cell membrane, cytotoxicity, and low incorporation efficiency.59
To overcome the challenges of in vivo approach, strategies for in vitro incorporation of UAAs62 have received considerable attention to gain broader specificity and increase efficiency.59 The cell-free incorporation of UAAs results in the formation of complex proteins like integral membrane proteins, physiologically toxic proteins, and large protein complexes.63 The cell-free incorporation of UAAs can be achieved by global suppression methods and orthogonal translation systems. The former utilizes natural biological mechanisms while later reengineers tRNA, aminoacyl-tRNA synthetase, ribosome, elongation factor, and release factor via directed evolution or rational design to incorporate UAAs. The orthogonal translation system is commonly used because of its effectiveness towards site-specific UAAs incorporation in comparison to the global suppression method.59 Other ways to incorporate UAAs include solid-phase synthesis and chemical ligation methods. Solid-phase synthesis allows the direct transformation of peptides' side-chains into novel peptides on the solid support. Here, the accumulation of side products prevails because of incomplete deprotection or coupling reactions. This drawback is overcome by chemical ligation methods which permits peptide fragments to the couple in aqueous solutions.64–66 Chemical alteration of a synthetic fluorophore protein after translation and the use of chemically mis-attached tRNAs suppressors to insert UAAs are the most straightforward methods. These methods, however, give limited yields of proteins and are restricted to readily available protein locations.45 The incorporation of UAAs via codon reassignment offers novel chemical and biological functions to translated products.67 The amber stop codon (UAG), showing a low level of amino acids misincorporation, is most commonly targeted to reassign and incorporate UAAs during translation in comparison to opal (UGA) and ochre (UAA) stop codons.63,67 The UAA mutagenesis was found to enhance the efficiency and regioselectivity of CYP P450 oxidation catalysts.68 The detailed mechanism on UAA incorporation is well explained by Young and Schultz,2 and Lang and Chin.66
Furthermore, incorporating UAAs, such as replacing methionine with norleucine, into enzymes may protect proteins from methionine oxidation. This approach is useful for biocatalysts that become inactive in oxidative conditions or require oxidizing substrates or cofactors.17 Another method was genetic code expansion (GCE) technology that has been used to introduce small labelling sites in the form of uniquely reactive ncAAs in a target protein. Low incorporation efficiency of UAAs and high background fluorescence limit have super-resolution microscopy (SRM) applications. SRM benefits immensely from the ability to mount photographic fluorescent labels on proteins.71 By choosing a sequential allylic C–H amination/vinylic C–H arylation, which began with inexpensive commercially available α-olefins and boronic acids, UAAs precursors were also obtained using Pd(II)/sulfoxide catalysis. Using coupled enzyme reactions, which are a novel biocatalytic method for synthesizing L-homoalanine from L-threonine consisting of a threonine deaminase (TD) and ω-TA. TD catalyses the dehydration/deamination of L-threonine, asymmetrically converting to L-homoalanine through transamination with benzylamine executed by ω-TA.72
N-Alkylated-α-amino acids produced via enantioselective methods are valuable building blocks for pharmaceutical and fine chemical industries. Therefore, they are treasured and widely investigated. While there are many chemical methods for their synthesis, biocatalytic approaches can give a greener and cleaner alternative to current practices. Alternative processes such as methylation in other proteins and peptides, including N-α-methylation, have biological activity of peptides.73 For example, cypemycin is a naturally occurring bacterial peptide with post-translational changes, including N-α-dimethylation, and it has decisive antibacterial action and in vitro efficacy against murine leukemia cells.74 Also, N-arylated α-amino acids and pyrazolidin-3-ones are widely used in pharmaceuticals and agrochemicals as chiral building blocks. They are biocatalytically produced by utilizing ethylenediamine-N,N′-disuccinic acid lyase (EDDS lyase) as a biocatalyst. This enzyme has a broad substrate range and high conversions, resulting in high isolated yields and enantiomeric excess of the relevant N-arylated aspartic acids.75 The use of biocatalyst engineering has greatly aided enzyme discovery and applications in industrial and pharmaceutical applications. The service and advancement of enzyme engineering techniques have grown tremendously in recent years. Importantly, engineering techniques incorporating UAAs successfully produce enzymes with more excellent stability, selectivity, and altered catalytic properties.
In recent decades, applications of enzymes have increased significantly, especially in scientific methodology, pharmaceutical science, food alteration, laundry, biofuel production, agro-industry, and many others. The increasing demand for biocatalysts as a replacement for traditional chemical catalysts has grown progressively. The engineering strategies of enzymes rely on nature's genetically encoded alphabet of twenty canonical amino acids. In the past few years, the emergence of genetic code expansion methods that allow several structurally diverse amino acids to be introduced into the proteins have been observed.76
UAA incorporation in enzymes produces enzyme resistance towards temperature and organic solvents, resulting in effective catalytic properties. Various UAAs with different side chains overcome the limitations encountered by NAAs in enzymes, thereby improving their potential applications. The residue-specific method and site-specific method are favoured for producing enzymes with improved and modified functions. Also, it can be done with the coupling of residue-specific and site-specific incorporation methods. Multifunctional green fluorescent proteins (GFP) were constructed through site-specific incorporation of L-3,4-dihydroxyphenylalanine and residue-specific incorporation of (2S,4S)-4-fluoroproline (4S-FP) or L-homopropargylglycine (hpg).77 Site-specific methods have attracted considerable attention as proteins or enzymes that bear multi-UAAs display improved functionalities. Interestingly, it has been discovered that integrating two chemically distinct UAAs into GFP by employing two orthogonal pairs in a single expression shows no mutual cross-reactivity and thus can be developed for efficient double labelling. In addition, a highly efficient suppressor plasmid pUltra has been generated, exhibiting higher suppression activity, enabling the efficient suppression of three different amber stop codons in GFP.78 In the residue-specific incorporation method, incorporation of 3-fluorotyrosine into ω-transaminase showed a 2 fold higher half-life with enhanced catalytic activity.79 Furthermore, the global incorporation of UAAs into β-galactopyranoside resulted in a two-fold increase in Vmax for ortho-nitrophenyl-β-D-galactopyranoside at pH 7.0 and a 4–5-fold increase in Vmax toward phenyl-β-D-galactopyranoside at the same pH.60 Global substitution using Klentaq DNA polymerase resulted in comparable activity in the same range and similar deoxyribonucleoside triphosphate conversion (dNTP). The addition of (4R)-fluoroproline to the DNA polymerase resulted in a fluorinated enzyme that was highly active.80
Cytochromes P450 (CYP) are biocatalysts that catalyse the transfer of an oxygen atom from molecular oxygen to an organic substrate upon a donation of electrons by coenzymes, such as NAD(P)H. In P450 BM-3, methionine residue was replaced with the isoteric methionine analogue norleucine to test whether the enzyme stability was hampered by Met oxidation throughout the reaction; a two-fold increase in peroxygenase activity along with a significant reduction in thermal stability has been demonstrated.81 Kolev et al.82 reported P450 BM-3 variant that oxidizes (+)-nootkatone as a representative substrate (since (+)-nootkatone has a various number of different C–H bonds (primary, secondary, tertiary, aromatic) as well as many functional groups such as carbonyl, ester and olefinic group) resulted with the higher turnover number (kcat) for an engineered CYP on a complex molecule. A five-fold increase in the turnover number (kcat) by the substitution of Leu75 with para-amino phenylalanine (pAmF) by stop codon suppression (SCS) in the active site of the enzyme was observed.82
Transaminases catalyse the transfer of an amino group between amino acids and α-keto acids, and they act as biocatalysts in the production of optically pure α-amines. The incorporation of meta-fluorotyrosine enhanced thermostability and organic solvent tolerance in ω-transaminase by using the selective pressure incorporation (SPI) method.79 N-Terminally truncated version of DNA polymerase I from Thermus aquaticus (Klentaq) was generated. Klentaq is a highly thermostable polymerase with no nuclease activity. Replacing 32 proline residues by (4R)-fluoroproline in Klentaq DNA polymerase by SPI method was 92% efficient with a highly active fluorinating enzyme.80 The substitution of the critical Tyr309 of phosphotriesterase from Agrobacterium radiobacter (arPTE) by the SCS method, i.e. arPTE variant containing L-(7-hydroxycoumarin-4-yl)ethyl glycine (HCEtG) in place of Tyr results in an eight-fold increase in kcat values. Phosphotriesterase hydrolyses organophosphates, such as pesticides with target residues located in the substrate-binding site. The electrostatic repulsion between the HCEtG and the product, which are both negatively charged, contributes to the increased rate-limiting product release step of substrate turnover. The 7-hydroxyl group of L-(7-hydroxycoumarin-4-yl)ethylglycine promoted the hydrolysis of pesticide paraoxon due to interactions between the bacterial phosphotriesterase enzyme and substrate during the Michaelis complex and product release.83 Disruption of Asp–His hydrogen bond by replacing the proximal histidine ligand with the unnatural structural analogue N-methylhistidine (NMH) by SCS into an engineered ascorbate peroxidase (APX2) achieved significantly increased turnover numbers. Heme peroxidases catalyse a range of oxidative transformations for various biotechnological applications.84 Introducing 3-fluorotyrosine to organophosphate hydrolase showed extended optimal pH activity and improved thermal stability at alkaline pH.85 Global fluorination of aromatic residues of lipase B from Candida antarctica enhanced stability with 4-fluorophenylalanine residue-specific substitution.86 Enhanced protein foldability and increased thermostability were shown by the fluorination of phosphotriesterase (PTE) with UAA 4-fluorophenylalanine. Approximately 30% of the enzymes' native structure was maintained.86 Replacement of hydrophobic amino acids, such as methionine, proline, and phenylalanine from Thermoanaerobacter thermohydrosulfuricus lipase related synthetic analogues like lipase analogues, in a residue-specific manner, resulted in a 25% increase in catalytic activity, substrate tolerance by up to 40%, changes in optimal temperature by up to 20 °C and pH by up to 3.69 A change in substrate specificity of the enzyme, N-acetylneuraminic acid lyase (NAL) mutants was obtained after incorporating various UAAs, thereby increasing activity towards aldol reactions of erythrose and pyruvate.87 Incorporation of UAAs in proteins leads to the mutation of amino acids intimately involved in an enzyme's catalytic mechanism, thereby overcoming challenges such as the substrate or product inhibition, organic solvent tolerance, catalytic efficiency, turnover number, thermal and chemical stability.
Enzymes | Name of UAA | Structure of UAA | Characteristics | Incorporation method | References |
---|---|---|---|---|---|
ω-Transaminase | 3-Fluorotyrosine | Enhanced catalytic activity and thermostability | Residue-specific incorporation of 4-fluoroproline | 79 | |
Organophosphate hydrolase | 3-Fluorotyrosine | Thermal stability increment at alkaline pH | Residue-specific incorporation | 85 | |
P450 | Nor-leucine | A two-fold increase in peroxygenase activity | Residue-specific incorporation | 81 | |
Lipase B | 4-Fluorophenylalae | Prolonged the shelf life of lipase activity | Residue-specific incorporation of 4-fluorophenylalanine | 86 | |
Phospotriesterase (PTE) | 4-Fluorophenylalanine | Enhanced protein refolding | Residue-specific incorporation of 4-fluorophenylalanine | 115 | |
Green fluorescent protein (GFP) | L-3,4-Dihydroxyphenylalanine | Forming protein–chitosan complexes enhancing stability | Site-specific incorporation of L-DOPA and residue-specific incorporation of 4-fluoroproline | 77 | |
Glutathione S-transferase (GST) | pNCSF | Formation of stable thiourea crosslinks | Site-specific incorporation | 83 | |
Lipase | Nor-leucine | A 10-fold increase in catalytic activity | Residue-specific incorporation | 69 | |
Phosphotriester (PTE) | L-(7-Hydroxycoumarin-4-yl)ethylglycine | A 8-fold increase in turnover number; promotion of Michaelis complex formation | Site-specific incorporation | 83 | |
NAL | 2,3-Dihydroxypropyl cysteine | Alters substrate selectivity | Site-specific incorporation | 87 |
UAA | Structure | Method of synthesis | Application | Mode of use | References |
---|---|---|---|---|---|
Substituted arylglycine | Chemical synthesis | Building blocks in many bioactive compounds and natural products | Assist in drug discovery | 46 | |
PrDiAzK | Chemical synthesis | Potential application in the system-wide mapping of protein–protein interaction | Site-specifically incorporated into proteins in both bacterial and mammalian cell culture | 47 | |
α-Substituted glutamic acid derivatives | Chemical synthesis | Serves as a new pathway to open new route to biologically active molecules | Shortcut to active biomolecule | 124 | |
2-Indolylglycine derivatives | Chemical synthesis | Building blocks in biomolecules | Assist in drug development | 34 | |
(S)-N-(2-Benzoylphenyl)-1-(2-flurobenzyl)-pyrolidine-2-carboxamide | Chemical synthesis | Brings ease and efficiency to stapled peptide research | Peptide stapling | 37 | |
4-Phenanthracen-9-yl-L-phenylalanine | Chemical synthesis | Might find broad application in research, biotechnology, and pharmaceutical industry | Gets into human cells, being visible upon 450 nm laser excitation | 53 | |
BN (boron and nitrogen) tryptophan analog | Chemical synthesis | Incorporation with proteins | Employ BN isoterism of arenes in a biological context where tryptophanyl-tRNA synthetase can recognize and azaborine containing amino acid | 48 | |
p-Acetylphenylalanine | Biological synthesis | Antibody drug conjugates | Part of antibody-drug conjugates | 61 | |
γ-Aminobutyric acid (GABA) | Biological synthesis | In metabolic engineering | To develop engineered strain | 14 | |
L-Azidohomoalanine | Biological synthesis | Investigate more engineering approaches and metabolic pathways | Biosynthesis of other pharmaceutically valuable UUAs | 125 | |
p-Azidomethyl-L-phenylalanine | Chemical synthesis | Antibody drug conjugates | Part of antibody-drug conjugates | 93 | |
O-[18F]-Fluoroethyl-L-tyrosine | Radiochemical synthesis | Tumor imaging agent for PET | Radiolabeled tracer | 74 | |
18F-FDOPA | Chemical synthesis | Tumor imaging agent, PET | Radiolabeled tracer | 100 | |
18F-Fluciclovine | Chemical synthesis | PET tumor imaging agent | Radiolabeled tracer | 101 | |
L-(7-Hydroxycoumarin-4-yl)ethylglycine | Chemical synthesis | Preparation of fluorescently labeled protein | Incorporated to improve catalytic activity | 126 | |
N-Acetylated fluorophenylalanine | Chemical synthesis | Antiviral, anticancer drug conjugates | Part of antibody-drug conjugates | 127 | |
4-Fluoroproline | Chemical synthesis | Protein design and engineering | Enhances conformational stability upon proteins | 128 | |
Sarcosine (N-methyl-α-L-glycine) | Biocatalytic synthesis | Used as a dietary supplement and as a non-specific glycine transport | Building blocks for the pharmaceutical | 73 | |
N,N-Dimethylglycine | Biocatalytic synthesis | Acts as an athletic performance enhancer, displays anticonvulsant activity inhibitor | Building blocks for the pharmaceutical | 73 | |
Betaine (glycine betaine) | Biocatalytic synthesis | Treatment for homocystinuria and can offer benefits to human health, attenuation of liver injury | Display antioxidant and antihistaminic activity | 73 |
Genetically encoded ncAA with orthogonal chemical reactivity are used for the synthesis of ADCs. Such produced ADCs have precise control over linkage site and stoichiometry, facilitating optimization of ADCs as therapeutic carriers and coupling agents.92 Another ADC with greater effect in vitro cell cytotoxic assays was produced through strain-promoted alkyne–azide cycloaddition (SPACC) copper-free click chemistry when unnatural amino acid, para-azidomethyl-L-phenylalanine (pAmF), was site-specifically incorporated, facilitating almost complete conjugation of DBCO-PEG-MMEF drug to the tumour-specific Her2-binding IgG trastuzumab.93
These days the main challenge in applying peptide therapeutics is their relatively short plasma half-life and the corresponding in vivo stability.94 A new molecule is thus introduced, called cell-penetrating peptides (CPPs), a short-chain of 5–30 amino acid residues that are widely used for the effective intracellular transportation of huge varieties of biologically active drug molecules, drug delivery vectors as well as cargos via biological membranes.95 CPPs can be a crucial aspect of therapeutic vaccines and drugs. A bio-reducible CPP, a branched m-R9 (B-mR9), was synthesized from nona-arginine (mR9) modification using disulfide bonds whose unusual polyplex B-mR9/siVEGF efficiently inhibited tumour sites and tumour growth as well as showed high accumulation and strong retention in comparison to other groups.96 Some of the restricted secondary structures of the peptides due to the presence of UAAs make them enzyme resistant compared to those containing natural amino acids.60 This greatly benefits the strength of CPPs.
18F-FDOPA(6-[18F]-fluoro-L-3,4-dihydroxyphenylalanine) based PET/CT imaging was found to be very sensitive to post-treatment diagnosis of head and neck paragangliomas (HNPGLs),100 and diagnosis of pheochromocytoma-paraganglioma induced tumour.99 These treatments are found to be more effective than conventional MRI/CT by many such experimental trials. It could be a useful tool in the diagnosis of many neuroendocrine tumours. 18F-Fluciclovine (anti-1-amino-3-18F-fluorocyclobutane-1-carboxylic acid (FACBC)) PET/CT is an effective tool for imaging breast cancers even malignant ones such as invasive ductal carcinoma (IDC) and invasive lobular carcinoma (ILC), and also detection of extra-axillary nodular metastases.101 It also distinguishes between benign tissues and malignant tumours.102,103 PET/CT scans have been proved to be up-and-coming tools for imaging and a more precise diagnosis of nodal disorders in preclinical investigations. These studies imply that many of the flaws in traditional diagnostic approaches may be overcome.
A coumarin-based fluorescent amino acid (S)-1-carboxy-3-(7-hydroxy-2-oxo-2H-chromen-4-yl)propan-1-aminium (CouAA) was incorporated into E. coli FtsZ using an evolved Mj tRNATyr/TyrRS pair for the visualization of in vivo localization of particular proteins.105 Tryptophan is the most widely used amino acid as a fluorescent reporter for studying protein dynamics and functions. It has high sensitivity towards the electrostatic environment and greater quantum field.106 Various modifications are performed on tryptophan to synthesize more suitable and better-functioning fluorophores. A deep blue coloured tryptophan analogue, beta-(1-azulenyl)-L-alanine, having characteristic fluorescent and photophysical properties, was synthesized using Neighisi coupling. Upon incorporating this UAA into argyrin C, the group showed great potential in the localization and visualization of targeted proteins.107 Another tryptophan analogue, 4CN-Trp being a small fluorescent UAA possessing unique photochemical properties, is a potential candidate for its use in vitro and in vivo spectroscopic and microscopic studies of proteins.108
Due to dominating activity that is ineffective and low serum stability, systematic Cbf-14 modification by incorporating UAAs (ornithine [Orn], norleucine [Ile], and D-amino acids) results in a positively charged peptide mutant Cbf-14-2 with similar stability, negligible toxicity, and superior antimicrobial activities against penicillin-resistant bacteria both in vivo/in vitro.112 An in vitro study hinted toward a selective anti-cancer effect of Cbf-K16 against human non-small cell lung carcinoma H460 cells.113 Antibacterial peptides (ABPs) incorporated with UAAs (uABPs) are essential for host defence against microbial infections.
UAAs can be utilized to improve bio-redox processes. For example, the abiotic nicotinamide dinucleotide flucytosine cofactor can assist in identifying protein structures and modifying them after translation.114 A novel medication type, which can replace or be an alternative to treatments now utilized, has been found by UAAs synthesizers with its unique advantages. Further therapeutic research uses UAAs to aid unriddle emerging and unsolved pathogenic diseases.
Whereas UAA incorporation into proteins, being a powerful tool to modify the properties of proteins, this approach is becoming more attractive for biotechnological uses. Some biotechnological tools include molecular imaging probes, protein–protein interaction probes, therapeutic proteins, enzymes, fluorescent probes, biocontainment, biomaterials, and biocatalysts with novel functional and structural properties which can be prepared via UAA incorporation into proteins. Although the results obtained are eye-catching and promising, the potential is still far from achieved. It has been demonstrated that methyl ester form of UAAs against free acid form showed much better yield; the improved yield is possibly due to the improved membrane permeability of the methyl ester forms of the UAAs to E. coli cells as compared to the free acid forms, even though the same tRNA synthetases and E. coli strains were used for incorporation.129 A study in 2020 optimized the direct incorporation of a fluorescent UAA (7-hydroxy-4-coumarin-yl)-ethyl glycine, where MS methods confirm that the UAA is introduced at only one site and that there is limited precedent.116
The biochemical implications of altering the cellular translation machinery, as employed in GCE-based tagging, have yet to be extensively explored. First, cells were not examined or had an impact on ribosomal function in the cellular response to an exogene tRNA/tRNA synthetase combination. Second, the physiological effects of protein reading induced by UAAs are not yet fully examined in TAG endogenous codons.130 The integration of UAAs into proteins has been an area of active interest for several fundamental and applicable research sciences. Despite the various types of synthetic UAAs, due to the limited number of studies available, further studies are highly recommended. In the field of amino acids and peptide/protein chemistry, directing groups (e.g., 8-aminoquinoline) and protecting groups (e.g., Phth) are uncommon. The introduction and removal of these protecting and directing groups necessitate additional steps, lowering the atom and phase efficiency of the modification process. Even with the aforementioned current challenges, we predict that C–H functionalization would become a cost-effective solution for synthesizing many UAAs as the repertoire of the C–H functionalization technique grows.
E. coli is primarily utilized for large-scale protein synthesis. On the other hand, this prokaryotic workhorse is less frequent and incapable of producing post-translational modifications, including glycosylation, ubiquitination, phosphorylation, and other eukaryotic and proteolytic protein maturation. Furthermore, eukaryotic and mammalian cell systems are prone to contamination, frequently requiring specific growth sites and bypassing or disabling glycosylation systems to produce humanized therapeutic proteins without introducing additional factors that may induce immunogenicity. Both stationary phase biology and post-translational modifications in prokaryotic systems are still active research areas with many unanswered questions. Synthesizing appropriate UAAs and assembling UAAs and scaffolds to produce functional catalysts are distinct synthetic problems that cannot be overlooked or obscured by similar research aimed at uses other than catalysis. The absence of library techniques for iterative scaffold optimization is perhaps the biggest roadblock to creating hybrid catalysts that use UAAs.131
When acidic conditions were adopted for deprotection at the oxazolidinone centre, 96 percent of the free amino acid was produced. In addition, oxazolidinone species I and II may be easily derived to get more complex UAAs. Control data indicates that to introduce alkylation reactions under normal circumstances, there should not be the presence of any light or photocatalyst.52 Recent primary and secondary C(sp3)–H bond functionalization processes have selectively changed natural amino acids and peptides. Sequential C–H activation and C–X (X = C, B, O, N, F, etc.) bond formation reactions have successfully changed the side chains of amino acids. Even though substantial progress has been made in this area, there are still several obstacles to overcome. The reactivity of unactivated C(sp3)–H bonds, especially secondary C(sp3)–H bonds, is still low, which can be overcome by strategically developing new ligands and directing groups.132 ncAA-mediated methods broaden the range of things identified and generated, such as conjugates, vaccinations, and cell-based treatments. It is also becoming clear how important it is to apply medicinal chemistry ideas to bigger proteins, i.e., protein medicinal chemistry. Given the potency of channels for discovering therapies that employ just canonical amino acids, granting them full access to the chemical cabinet would very definitely result in totally new therapeutic groupings.131
The asymmetric synthesis of UAAs is a significant step to incorporate optically pure UAAs in natural proteins during the reprogramming of natural proteins by using UAAs. Therefore, an extensive study on enantioselective, catalytic, and chiral pool synthesis of optically pure UAAs has been reported in recent years.43–51 Further study in the design of novel catalysts and chiral auxiliaries, as well as proper selection of chiral precursors for chiral pool synthesis, could enhance the synthesis of chiral UAAs with an excellent enantiomeric excess (ee) and high yield. Consideration of the green chemistry approach towards the asymmetric synthesis of UAAs is highly recommended.
There are a limited number of UAAs in the current defined metabolic pathways, and in some cases, essential pieces of information are missing, including the enzymes involved. During the biosynthesis process, some UAAs inhibit the growth of chassis strain and may cause trouble in fermentation processes. Hence, the prospect needs the involvement of crucial enzyme representation and clear metabolic pathways. For balanced metabolism and increased resistance, more chassis strain is needed to be metabolically engineered or screened. After examining secondary structures of polypeptides containing proteinogenic amino acid residues, it isn't easy to anticipate the conformation of contemporary, non-natural polypeptide materials based on their side-chain structures. To elucidate their effects on secondary structures, fundamental understandings of side-chain interactions, such as charge, H-bonding, hydrophobic, and dipole–dipole interactions, are needed. Ultimately, synthetic polypeptides can be used as standalone therapeutics in addition to using polypeptide products as nano-carriers. Glatiramer acetate (GA), a statistical copolypeptide with four natural amino acid residues, is one noteworthy example.133
The full promise of protein medicinal chemistry and other ncAA applications in biotherapeutics will be realized only when high-performance ncAA integration systems in bacterial and eukaryotic expression systems are available. Given the strength of biological platforms using only canonical amino acids, full access to the chemical cabinet provides these platforms with completely new therapeutic classes.134 Future research should concentrate on reaction designation, enzyme property enhancement, and process optimization to develop commercially feasible platforms for derivatizing amino acids. Biocatalysts will likely be used to manufacture additional fine chemicals from amino acids on a commercial scale in the future.135 Scopes of UAAs in biotechnology, synthetic biology, and traditional tools should be upgraded by well-designed algorithms and computer-aided enzyme/protein molecular prediction, which supports the practical implementation of UAAs and helps to design different artificial enzymes.
Footnote |
† Equal contribution. |
This journal is © The Royal Society of Chemistry 2021 |