 Open Access Article
 Open Access Article
      
        
          
            Jaeseung 
            Yu
          
        
      , 
      
        
          
            Jinsol 
            Yang
          
        
      , 
      
        
          
            Chaok 
            Seok
          
        
       and 
      
        
          
            Woon Ju 
            Song
          
        
       *
*
      
Department of Chemistry, College of Natural Sciences, Seoul National University, Seoul 08826, Republic of Korea. E-mail: woonjusong@snu.ac.kr
    
First published on 17th February 2021
Directed evolution has provided us with great opportunities and prospects in the synthesis of tailor-made proteins. It, however, often requires at least mid to high throughput screening, necessitating more effective strategies for laboratory evolution. We herein demonstrate that protein symmetry can be a versatile criterion for searching for promising hotspots for the directed evolution of de novo oligomeric enzymes. The randomization of symmetry-related residues located at the rotational axes of artificial metallo-β-lactamase yields drastic effects on catalytic activities, whereas that of non-symmetry-related, yet, proximal residues to the active site results in negligible perturbations. Structural and biochemical analysis of the positive hits indicates that seemingly trivial mutations at symmetry-related spots yield significant alterations in overall structures, metal-coordination geometry, and chemical environments of active sites. Our work implicates that numerous artificially designed and natural oligomeric proteins might have evolutionary advantages of propagating beneficial mutations using their global symmetry.
Despite significant progress in protein design and evolution,16–18 there is no standard rule prioritizing the residues for sequence optimization. The amino acid residues in the vicinity of active sites are often the first and primary candidates. B-Factor and sequence conservation analyses have also been carried out to generate focused mutant libraries.19,20 However, our current state-of-the-art understanding of the protein sequence-structure–function relationship is incomplete in that seemingly trivial or distant mutations are essential to induce the desired properties,21,22 while even proximal residues to the active site may cause no detectable changes in structure and function.23 In addition, multiple mutations often exhibit non-additive effects so that iterative mutations and screenings are necessary,24 requiring at least mid to high throughput screening.25–30 Therefore, an alternative guideline to pinpoint key residues for sequence optimization would be advantageous to facilitate enzyme design and to elucidate intertwined protein sequence networks.
In the exploration of this complicated yet, essential question, de novo enzymes could be a versatile model system. Their sequence and structure are not strongly correlated with the nascent function, and thus, enzyme evolution may occur with little underlying interrelation to the intrinsic nature of proteins. Previously, numerous protein scaffolds have been transformed into artificial enzymes, particularly metalloenzymes.31 A large portion of these examples is found as homomeric proteins, having catalytic metal-sites on the protein–protein interfaces.32–34 One of these examples is an artificial metallo-β-lactamase, AB5, where a catalytic center, Zn-OH2/OH unit, was created on the protein–protein interfaces of an α-helical homo-tetramer (Fig. 1a).35 Intriguingly, we found that a seemingly trivial mutation (C96T) yielded substantial structural alterations (Fig. 1b), resulting in the conversion of a tetramer with a large void space into a closely packed one. Besides, hydrolytic activities with ampicillin were enhanced. In contrast, several attempts to optimize K85, E92, Q103, and K104 residues, close to the Zn-active sites (Fig. 1a), exerted no significant enhancement in the hydrolytic activities. Based on these observations, we surmised that C96 residue is a more effective hotspot than four proximal residues (K85, E92, Q103, and K104) because of its location on the C2-rotation axis. If the mutations of residues related to symmetry operations such as rotation may produce significant impacts on overall protein structure and function, possibly using the fluxional protein–protein interface, we further inquired whether other residues on rotational axes can be efficiently targeted for directed evolution of de novo designed oligomeric enzymes, despite being distantly located from active sites and seemingly unrelated to enzyme catalysis.
|  | ||
| Fig. 1 X-ray crystal structures of de novo Zn-dependent β-lactamases related to this study. (a) AB5 protein with C96 residue (PDB 5XZI). (b) C96T variant (PDB 5XZJ). Residues located at the C2 rotational axes, such as 96, 38, 39, and 81, are shown with magenta sticks. Residues located at proximal to the active sites, such as 85, 92, 97, 103, and 104, are shown with blue sticks. Zn ions located at the catalytic sites and metal-ligating residues are shown with grey spheres and lines, respectively. The metal-bound water or chloride ions in catalytic sites are depicted with red and green spheres, respectively. The Zn ions at the structural sites and crystal packing interactions are omitted for clarity. | ||
Both symmetry and proximity-related mutant libraries were screened for the presence of whole-cell hydrolytic activities with an antibiotic β-lactam substrate, ampicillin. For full coverage with a 95% confidence level,36 greater than 94 colonies for each single-site saturated mutant library and 1953 colonies in total were screened (Fig. 2a). For more efficient and quantitative cell-based screening, we slightly modified the method from previous studies35 so that we measured the cell-growth rates upon the addition of a high concentration (10–15 mg L−1) of ampicillin instead of using LB/agar plates containing a relatively low concentration of ampicillin. Then, mutations detrimental to β-lactamase activity would lead to zero or negative cell-growth rates upon the addition of antibiotic substrates, if the cells no longer grow or get ruptured by the cell wall-targeting antibiotics, respectively. In contrast, beneficial mutations would lead to the rates faster than that of the parent protein. When the individual cells of each mutant library were sorted in increasing order of cell-growth rate relative to that of the parent protein, the fitness effect, which can be estimated by the magnitude of alterations upon mutations, irrespective of the occurrence of beneficial or detrimental effects upon mutation, followed the order T97X ≈ K104X ≪ E81X ≤ A38X ≈ C96X < D39X (Fig. 2b and c), where X indicates the 20 randomized amino acid. Notably, the four libraries comprising mutants randomized at the symmetry-related residues, 96, 38, 39, and 81 (Fig. 2b), exhibited considerably greater magnitudes of alterations in cell-growth rate than those of the two libraries saturated at residues 97 and 104 (Fig. 2c), indicating that symmetry-related residues exert greater degrees of perturbation to the protein than proximal residues. The distribution of fitness effects37–39 was also depicted by box charts (Fig. 2c inset), where the mutations at symmetry-related residues exhibit much more outspread cell-growth rates than those of proximal residues. The fastest-growing cells from each library were sequenced, resulting in T97T (parent), K104S, E81G, A38D, C96I, C96K, and D38E mutants.
For more accurate kinetic measurements of the single-site mutants, we carried out in vitro steady-state kinetic analysis of the purified Zn-complexed β-lactamase AB5 variants. By applying various substrate concentrations, the Michaelis–Menten kinetic parameters of the C96K and C96I variants, the best hits from the first-round screening, were obtained (Fig. 3a and b). These variants indeed exhibited up to 2.9- and 2.7-fold increased kcat and kcat/KM values, respectively relative to those of the parent AB5 protein (Table S4 in the ESI†). The results contrasted with the previous attempts to evolve the protein by randomizing the residues at the active sites, 85, 92, 103, and 104.35
Intriguingly, we noted that the enhanced β-lactamase activities were primarily derived from the elevated kcat rather than lowered KM, where the latter value is related to the substrate-binding affinity or dissociation constant (Kd). These results implicate that the single mutations selectively accelerated turnover rates rather than increasing substrate affinity. The molecular origin of acceleration contrasted previous work with an analogous protein, AB3, where enhanced catalytic activities were derived from elevated substrate-binding affinity.40 To identify whether the discrete enhancement mechanism was related to the modified screening conditions using higher concentrations of ampicillin, we reduced the concentration of ampicillin (1.8 mg L−1) for the screening of the C96X mutant library (Fig. S4b in the ESI†). While C96I is still a positive readout, the parent (C96) and C96V became the dominant hits. When we further modified the screening conditions by replacing ampicillin with carbenicillin, another β-lactam antibiotic, at low concentration (Fig. S4d in the ESI†), cells containing C96V and C96L grew relatively fast, implying that screening conditions, such as substrate concentrations and structures, alter the selected readouts.
More importantly, the kinetic parameters of the newly detected variants, C96V and C96L, exhibited substantially lower KM values for ampicillin and/or carbenicillin, respectively, than did the parent protein or previously detected active variants, C96I and C96K (Fig. 3b and d). These data indicated that the tighter substrate-binding affinity obtained by C96V and C96L mutations is likely to be beneficial when low concentrations of β-lactams are applied for activity-based screening. In contrast, C96I and C96K variants with higher turnover rates and catalytic efficiency are likely to be more advantageous when the substrate concentration and binding affinity are no longer limited.
The correlation between the kinetic parameters and the screening conditions was further supported by measuring the substrate-binding affinity or dissociation constant (Kd) directly from intrinsic fluorescence assays (Fig. S5 and Table S5 in the ESI†). The Stern–Volmer plots41,42 of the C96 variant demonstrate that the thermodynamic parameters were consistent with the KM values from kinetic analysis, suggesting that the screening conditions and the chemical properties of the selected products might have promoted specific evolutionary trajectories beneficial for the applied chemical pressures.
To further monitor the effects of single mutations in symmetry-related residues, X-ray crystal structures of three variants from the first-round screening, C96I, C96K, and C96V, were determined (Fig. 4 and Table S6 in the ESI†). The C96I, C96K, and C96V proteins were isolated as tetramers, consistent with the oligomeric states determined in solution (Fig. S6a in ESI†). They were also similar in that all tetramers possess two sets of Zn-binding sites, structural and catalytic ones (Fig. S7 and S8 in the ESI†). Notably, N-terminal residues, surface-exposed acidic residues, and/or H59 and E81 also formed Zn-binding sites, although they are likely to be catalytically irrelevant and generated due to the crystal packing interactions in the presence of excess Zn ions for crystallization (Fig. S9 in the ESI†).
|  | ||
| Fig. 4 X-ray crystal structures of C96 variants from the first-round screening. The catalytic Zn sites of (a) the parent protein, AB5 (PDB 5XZI) (b) C96I (c) C96K and (d) C96V variants. Zn ion and metal-binding residues are shown with navy spheres and navy sticks, respectively. Metal-bound water molecules and chloride ions are shown with red and green spheres, respectively. Two rotamers of H100 residue in C96V protein are highlighted with black arrows. The first metal-coordination sites are shown with 2Fo–Fc electron density contoured at 1.0σ overlaid. | ||
All three variants were closely packed tetramers, similar to the C96T mutant (Fig. 1b) and different from the parent protein with the C96 residue (Fig. 1a). Consequently, both the structural and catalytic Zn-binding sites were drastically altered by the single mutations. In particular, the directionality of the catalytic Zn-OH2/OH species in all C96 variants was flipped towards I67 and T97, instead of E92 and Q103 (Fig. 4). In addition, the first coordination sphere of the Zn center was greatly altered. Whereas the C96 parent protein exhibits a catalytic Zn ion ligated by 2His/1Glu (E89, H93, H100) and a buffer-derived molecule/ion (Fig. 4a), E89 was no longer coordinated to the catalytic Zn ions in the C96I variant (Fig. 4b, S8a and b in the ESI†). Instead, solvent-derived anions or molecules were bound to Zn, creating the distinct first metal-coordination spheres, the secondary microenvironments, and substrate-binding pockets, which would have enhanced metal-dependent enzyme catalysis. Notably, significantly shorter distances between Zn ions and ligating N atoms of H93 and H100 were observed in C96I protein (1.79–2.03 Å) relative to the parent protein (2.17–2.34 Å) (Table S7 in the ESI†). In addition, two discrete angles of εN (H93)–Zn–δN (H100), 91.3° and 122.7°, were observed, one of which significantly deviated from the parent protein (92.8°), creating two asymmetric bis-His motifs for hydrolytically active Zn-centers.
The C96K mutation also altered the first coordination spheres of the catalytic Zn-binding sites (Fig. 4c). The coordination spheres were composed of H93 and H100 residues with one water molecule, and Zn-OH2/OH was pointed towards I67 and T97. E89 was also dissociated from the Zn ion and was hydrogen-bonded to E92, Q103, and one ordered water molecule near the catalytic Zn-binding sites (Fig. S8c and d in the ESI†). Two discrete angles of two histidine residues and zinc ion were again observed as 103.3 and 125.1°, even further deviated from those in the parent protein. In addition, two conformations of the C96K side chains were observed, forming hydrogen bonds with H93, possibly tuning the chemical properties of the Zn site, such as the pKa value of catalytically critical residues, the nucleophilicity of Zn-OH species, and Zn-binding affinity.
In the C96V protein, Zn ions were ligated by H93 and H100, and E89 was again no longer directly coordinated to Zn upon C96 mutation (Fig. 4d). Instead, E89 was hydrogen-bonded to a metal-bound water molecule or was pointed away from the metal-coordination site. Consequently, two solvent-derived molecules, such as Cl− and H2O, were coordinated to the Zn ion in a tetrahedral geometry. In addition, one of the two H100 residues in the asymmetric unit was in two discrete rotameric states, which was not observed in other variants (Fig. S8e and f in the ESI†). As a result, the angle of εN (H93)–Zn–δN (H100) became 95.3° on one side and 90.2° and 148.6° on the other side, resulting in the mutant possessing either nearly symmetric or the most asymmetric Zn sites on the α-helical protein–protein interface.
To explore whether these geometric perturbations are related to catalytic activities, we carried out substrate-docking simulations using Galaxydock43,44 (Fig. S10 in the ESI†). The data suggest that the discrete Zn-binding sites induced by C96 mutations may favor alternative modes in substrate-binding. Because C96I, C96K, and C96V variants exhibit the unique kinetic parameters or substrate-binding affinity, even the slightly modified Zn-coordination sites might not be an artifact of the crystalline packing interactions but related to the dynamic snapshots of the protein structure in solution. Then, these results indicate that seemingly trivial single mutations on the rotational axis can modify the chemical properties of catalytically active Zn sites.
After structural characterization, we further measured the pH-dependent hydrolytic activities to estimate the pKa value of catalytically critical residues or Zn-OH2 species (Table S8 and Fig. S11 in the ESI†). The parent protein exhibited a pKa value of 8.4(1),35 whereas those of C96I, C96K, and C96V were estimated to be 8.6(1), 9.3(1), and 9.2(2). These data indicate that a single mutation at a symmetry-related position alters the net concentration of catalytically active species by tuning the chemical environments of catalytic sites.
Because C96I/A38S double variants yielded the highest catalytic efficiency, we constructed a third round of the mutant library by randomizing the E81 residue of C96I/A38S as the template (Fig. 5d). The outputs exhibit variations in cell-growth rates by ca. 3- and 5-fold in positive and negative directions, respectively, and the best-hit was sequenced to be E81H, resulting in a triple variant, C96I/A38S/E81H (Fig. 5c). Then, we further randomized and screened the last symmetry-related residue, D39. The randomized single-mutant library yielded significantly altered cell-growth rates by up to ca. 5-fold relative to that of the C96I/A38S double mutant (Fig. 5d). The sequencing results indicate that the quadruple variant, C96I/A38S/E81H/D39N, displayed the fastest cell-growth rate, resulting in 8.1-, 3.2-, and 2.2-fold enhancements relative to that of the best hit from the first, second, and third round of screening, C96I, C96I/A38S, and C96I/A38S/E81H, respectively. These data suggest that the iterative sequence optimizations were carried out efficiently by targeting symmetry-related residues.
The Michaelis–Menten kinetic analysis of the best hits from each round was also carried out (Fig. 5f; Table S8 and Fig. S14 in the ESI†). The overall activities were consecutively elevated throughout every round of evolution, resulting in higher kcat and kcat/KM values than those of the variants from the previous rounds of screening. Relative to the uncatalyzed rate (kuncat) of 3.0(1) × 10−6 min−1, the activities of the quadruple mutant accounted for 4.0(7) × 105 and 3.3(3) × 107 M−1 in terms of rate enhancement (kcat/kuncat) and catalytic proficiency (kcat/KM/kuncat), respectively (Table S8 and Fig. S15 in the ESI†). Notably, no improvement in KM was observed throughout the evolution, and C96I/A38S even lost a saturation behavior with increasing substrate concentration, resulting in a second-order rate constant (k2), instead of kcat/KM. The weaker substrate-binding affinities of the screened mutants might be attributed to the screening conditions, where substantially high concentrations of antibiotics were applied. Then, the microscopic catalytic properties of the evolved variants might be the result of β-lactamases discretely evolved to elevate turnover rates rather than to lower substrate-binding affinity.
In addition, we measured the pH-dependent hydrolytic activities of the quadruple variant. The pKa value was 8.4(1) (Table S9 and Fig. S16 in the ESI†), which differs by 0.6 from that of C96I/A38S. Notably, we observed an inverse correlation between the pKa values and kinetic parameters (such as kcat and kcat/KM) of the mutants having compact tetrameric structures, C96T before the screening, C96I, C96I/A38S, C96I/A38S/E81H, and C96I/A38S/E81H/D39N from evolution, implying that distant mutations increased the net concentration of catalytically essential species, such as Zn-OH species, over that of Zn-OH2.
The structural features of the variants were evaluated by size exclusion chromatography. The parent protein exhibits two disulfide bonds formed via four C96 residues in tetramer form,35 which enables the formation of a tetramer even in the absence of Zn ions at the protein–protein interfaces. Upon the removal of the disulfide bond, Zn-free apo-protein was isolated as a mixture of monomers and dimers, similar to C96T35 (Fig. S17 in the ESI†). The isolated mutants from the screening, C96V, C96K, C96I/A38S, C96I/A38S/E81H, and C96I/A38S/E81H/D39N, exhibit analogous Zn-dependent oligomerization, exclusively forming tetramers even after a series of mutations on the protein–protein interfaces. Intriguingly, exceptions were detected for the C96I and C96L variants from the first-round of screening. They form nearly exclusively tetramers even in the absence of Zn ions. The hydrophobic residues in the two opposing α-helical domains are analogous to those in leucine zippers,45,46 where repeated hydrophobic amino acids form noncovalent interactions, inducing oligomerization. Although it is unclear whether the enhanced protein–protein interfaces kinetically promote the in vivo activity of C96I and C96L by forming tetramers prior to metal binding and increasing the effective concentrations of catalytically active species inside the cells, the results indicate that the impact of single-site mutations at symmetry-related spots is indeed substantial in both structural and functional aspects, and can therefore give rise to significant impacts on protein evolution.
Finally, one of the evolved variants was characterized by X-ray crystallography (Fig. 6). The C96I/A38S variant from the second-round screening was a compact tetramer, similar to the C96I, C96K, and C96V variants. The catalytic Zn-binding sites were also similar in that they are composed of Zn ions coordinated to H93, H100 (Zn–N = 2.0–2.3 Å), and one or two water molecules. Notably, the angle of εN (H93)–Zn–δN (H100) was measured to be 92.2° and 94.5°, resulting in more symmetric metal-centers located at the α-helical domains than in the C96I single variant. The subtle yet significant geometric perturbation at the first metal-coordination sphere, induced by A38S mutation, might be associated with the enhanced catalytic activities, possibly by adjusting the nucleophilicity of the Zn-OH species, optimizing noncovalent interactions with the secondary coordination spheres, and/or shaping the internal active site pocket. Notably, A38S is 17 Å distant from the catalytic Zn center. Therefore, the mutation effect exerted at the rotational axis is likely to be transferred through fluxional protein–protein interfaces, suggesting that distant and beneficial mutations can be created efficiently by targeting symmetry-related residues for protein evolution.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 rpm (18
000 rpm (18![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 800g) at 7 °C for 30 min, the supernatants were adjusted to pH 8 by adding 2 M NaOH solution. Then, the solutions were manually loaded onto a Q-sepharose column pre-equilibrated with 10 mM NaPi pH 8.0 buffer. By applying the step gradients of 0–1 M NaCl in NaPi buffer, red fractions due to the presence of heme cofactor were collected and concentrated using stirred cells (Amicon) with 10 kDa cutoff membranes. Then, the samples were loaded onto a HiTrap Q HP anion exchange column (GE Healthcare Life Sciences) pre-equilibrated with 10 mM NaPi pH 8.0 buffer at 4 °C using FPLC (ÄKTA pure). A linear gradient of 0–1 M NaCl was applied, and the fractions with A415/A280 ≥ 4 measured using a UV-Vis spectrophotometer (Agilent Cary 8454) were collected using and concentrated.
800g) at 7 °C for 30 min, the supernatants were adjusted to pH 8 by adding 2 M NaOH solution. Then, the solutions were manually loaded onto a Q-sepharose column pre-equilibrated with 10 mM NaPi pH 8.0 buffer. By applying the step gradients of 0–1 M NaCl in NaPi buffer, red fractions due to the presence of heme cofactor were collected and concentrated using stirred cells (Amicon) with 10 kDa cutoff membranes. Then, the samples were loaded onto a HiTrap Q HP anion exchange column (GE Healthcare Life Sciences) pre-equilibrated with 10 mM NaPi pH 8.0 buffer at 4 °C using FPLC (ÄKTA pure). A linear gradient of 0–1 M NaCl was applied, and the fractions with A415/A280 ≥ 4 measured using a UV-Vis spectrophotometer (Agilent Cary 8454) were collected using and concentrated.
        Then, the protein sample was loaded onto a HiLoad 16/600 Superdex 75 pg column (GE Healthcare Life Sciences) pre-equilibrated with 20 mM Tris/HCl pH 7.0 buffer with 150 mM NaCl. After elution, pure fractions were collected and concentrated. To prepare the metal-free, apo protein, 10-fold EDTA to the protein was treated for 1 h at 4 °C, and the excess EDTA was removed by 10DG desalting column (Biorad) chromatography. Metal content was measured by a colorimetric assay using 4-(2-pyridylazo) resorcinol (PAR) as described previously.54,55 The purified metal-free, apo-protein was concentrated up to 1.0 mM and stored at −80 °C until further use. The protein concentration was determined by using ε415 nm = 148![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 cm−1 M−1.
000 cm−1 M−1.
The pH-dependent hydrolytic activities of Zn-complexed positive hits were monitored by measuring the specific activity with 3 mM ampicillin under various pH conditions; 100 mM MOPS buffer for pH 7.0–7.5 and 100 mM sodium borate buffer for pH 8.0–10.0 ranges (Table S9, Fig. S11 and S16 in the ESI†). Then, the following equation was used for iterative non-linear fitting using Origin 2016 software; y = (kmax × 10(pH−pKa))/(1 + 10(pH−pKa)).
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 56 and CCP4i.57 Molecular replacements were performed with Molrep58 by using a structure of C96RIDC1 monomer (PDB 3IQ6) as a search model. Rigid-body and restrained refinements were carried out using REFMAC5
56 and CCP4i.57 Molecular replacements were performed with Molrep58 by using a structure of C96RIDC1 monomer (PDB 3IQ6) as a search model. Rigid-body and restrained refinements were carried out using REFMAC5![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 59 along with manual inspection and refinements with COOT60 (Table S6†). The resolutions of C96I, C96K, C96V, and C96I/A38S protein structures were determined as 1.98, 2.35, 2.00, and 2.45 Å, respectively. Structural and catalytic Zn-binding sites in C96X variants (X = I, K, and V) are shown in Fig. S7–S9.† The geometric parameters of the Zn-binding sites are summarized in Table S7 in the ESI.†
59 along with manual inspection and refinements with COOT60 (Table S6†). The resolutions of C96I, C96K, C96V, and C96I/A38S protein structures were determined as 1.98, 2.35, 2.00, and 2.45 Å, respectively. Structural and catalytic Zn-binding sites in C96X variants (X = I, K, and V) are shown in Fig. S7–S9.† The geometric parameters of the Zn-binding sites are summarized in Table S7 in the ESI.†
      
      
        | Footnote | 
| † Electronic supplementary information (ESI) available: Supporting figures, supporting tables, and X-ray diffraction analysis data are included. See DOI: 10.1039/d0sc06823c | 
| This journal is © The Royal Society of Chemistry 2021 |