Selvakumar
Edwardraja
a,
Sokalingam
Sriram
a,
Raghunathan
Govindan
a,
Nediljko
Budisa
b and
Sun-Gu
Lee
*a
aDept. of Chemical Engineering, Pusan National University, Busan, Republic of Korea. E-mail: sungulee@pusan.ac.kr; Fax: +82 51 512 8563; Tel: +82 51 510 3680
bDept. of Biocatalysis, Institute of Chemistry, Technical University of Berlin, Juni 124 Berlin, Germany. E-mail: budisa@biocat.tu-berlin.de; Fax: +49 30 314 79 651; Tel: +49 30 314 23 661
First published on 19th November 2010
Single-chain Fv (scFv) format protein is a commonly used analytical tool for diagnostic and therapeutic applications. The usage of scFv antibody fragments in therapeutic applications has demonstrated that they need to have high thermostability. Many rational or irrational methods have been described erstwhile to engineer or improve the stability of scFv proteins by modifications of natural amino acid. Here we have demonstrated an alternate strategy to efficiently improve the thermostability of scFvs by non-canonical amino acid. Previously, fluoroprolines have been proven as a choice to tune the stability of many polypeptides and few globular proteins. Hence we exploited the usage of fluoroproline to enhance the thermal stability of scFv by replacing the natural proline on the framework regions of scFv that influence the folding or stability. To demonstrate our approach, a bacterial cytoplasmic foldable and humanized anti-c-Met scFv (hu-MscFv) was used. The hu-MscFv proline sites were successfully incorporated with (2S,4R)-4-fluoroproline without affecting its structure and function by the in vivo residue specific global replacement method which exploits bacterial auxotrophic system. The time-dependent temperature effect on the activity of fluorinated hu-MscFv revealed its increased stability at 40 °C along with improved half-life than the hu-MscFv with natural proline. Further model based structure analysis on hu-MscFv with fluoroproline indicated that the fluorine atoms were able to establish new favourable dipole interactions with neighbouring polar groups in their local micro environments that rationalizes its improved thermostability. Moreover the scFv sequence based statistical analysis strongly supports the fact that this method can be applied to any target scFv, since they contain high frequency conserved proline sites in their framework regions.
ScFv is constructed by linking two variable domains of IgG, i.e.VH and VL. Each variable domain is composed of a highly conserved framework region (FR) that primarily affects the folding/stability, and a hyper-variable complementarity determining region (CDR) that plays a major a role in the binding activity.8,9 There have been lots of studies devoted to engineering the scFv proteins for stability improvement,9,10 which can be categorized into two ways. The first way is the rational11–15 or irrational engineering16–20 of FR sequences in the target scFv. The second way of stabilization is the grafting of CDR sequences of the target scFv into stable FR sequences.21–23 Indeed both methods have been successfully applied to improve the biophysical stabilities of scFv and helped us overcome the problems caused by the poor stability of scFv.
Here we propose an alternative strategy for enhancing the stability of scFv proteins. The strategy is based on the approach that fluorinates proline residues of the target protein and the following features of the approach tempted us to test it for stabilizing scFv. First, fluorination of proline residues induces stereoelectronic effects that can affect the folding/stability of polypeptides. A number of reports have demonstrated that it is possible to tune or enhance the stability of polypeptide structures by exploiting the effect.24–26 For instance, hyperstable collagen could be synthesized by introducing fluoroproline into the sequence. Second, fluoroprolines such as (2S,4R)-4-fluoroproline ((4R)-FPro) and (2S,4S)-4-fluoroproline ((4S)-FPro) are efficiently recognized by the natural translation system of Escherichia coli. Thus we can synthesize recombinant protein with (4R)-FPro or (4S)-FProin vivo on a large scale.27,28 In fact, most studies on the fluoroproline effect have focused on peptide sequences rather than globular proteins partly due to the difficulty in fluorination of globular proteins. However, a recent report has clearly shown that (4R)-FPro and (4S)-FPro were very efficiently incorporated into the proline sites of recombinant protein by the residue specific substitution method which exploits auxotrophic strain.26 Moreover, the report demonstrated that folding efficiency of green fluorescent protein (GFP) could be tuned with the fluoroprolines, which specially encouraged us to apply the approach for scFv. Third, the in vivo residue specific substitution method using existing residues on the target protein allows us to engineer proteins without any modifications of primary sequences.29,30 Thus we can employ the approach not only to engineer protein with its original sequences but also to further modify the sequences engineered by traditional protein engineering methods.
The availability of proline sites in target proteins is important in order to apply the approach using fluoroprolines. We analyzed the proline profile of scFv protein sequences from the Kabat database,31 showing that approximately 90% of scFv sequences contain at least seven or more proline frequencies in their composition (Fig. 1). Further analyses on scFv sequences for proline distribution and conservation revealed the presence of 10 conserved proline sites, in which eight sites (VH9, VH14, VH41, VL8, VL15, VL40, VL44, VL59) were present in FR sequences and two sites (VH52a and VL95) were in CDR (Fig. S1, ESI†). These statistical analyses indicated the abundance of proline residues in FR that plays a critical role in the folding and stability of scFv, which favored our strategy as a fruitful one to be carried out.
![]() | ||
Fig. 1 Statistical calculation of scFv sequences from Kabat database for proline frequencies. Plot shows the percentage of scFv sequences with their proline frequencies. Almost 90% of scFv sequences contain prolines with the frequencies of 7 or above indicated as black bars and fewer frequencies as gray bars. |
Scheme 1 outlines our strategy of single step stabilization of scFv proteins with fluoroprolines. To demonstrate it, an engineered anti-c-Met scFv (hu-MscFv)32 was employed as a target scFv sequence. The hu-MscFv recognizing c-Met, a receptor of hepatocyte growth factor/scattering factor, is expected to be used in clinical treatment or imaging of many cancer cells.33 Previously, we generated the hu-MscFv sequence by grafting the CDR of anti-c-Met scFv from rabbit into the consensus derived human FR (huFR) for the production in E. colicytoplasm as well as for humanization, and predicted its tertiary structure via homology modeling method.32 From the sequence and modeled structure of hu-MscFv, we could identify the distribution, cis/trans state, conservation, solvent accessibility and secondary structure of proline residues in the scFv protein (Table 1). The hu-MscFv sequence lacks proline residue in its CDR and contains a total of eight proline residues in its FR sequence, among which seven prolines are distributed in the conserved proline sites and only one proline exists at the non-conserved site. These properties of hu-MscFv made it as an ideal candidate to demonstrate our strategy. In this study, we introduced (4R)-FPro or (4S)-FPro into the target scFv, investigated the thermal stability of scFv with fluoroproline, and performed the structural analysis of the fluorinated protein.
![]() | ||
Scheme 1 Schematic representation of the approach for stability improvement of scFv proteins with fluoroprolines. The proline conserved sites are indicated with the numerical values in parentheses for the relative frequency of proline on the VH and VL domains that were examined from the Kabat database (Fig. S1, ESI†). As a single step stabilization of scFv, the available conserved proline residues in the target scFvs can be easily replaced with fluoroprolines by residue specific substitution method in the cytoplasm of E. coli, to enhance the stability of the target scFv. |
Chain | Kabat position | FR a/CDRb | Amino acid position | cis/trans isomerism | Evolutionary significance | Solvent accessibility | Secondary structure/motifsc |
---|---|---|---|---|---|---|---|
a FR: framework region. b CDR: complementary determining region. c Analysed by PDBSum Generate online tool. | |||||||
Heavy | 14 | FR | 30 | trans | Conserved | Exposed | Loop/β turn |
Heavy | 41 | FR | 57 | trans | Conserved | Exposed | Loop/β turn/β hairpins |
Light | 8 | FR | 162 | cis | Conserved | Buried | Loop |
Light | 15 | FR | 169 | trans | Conserved | Exposed | Loop/β turn |
Light | 40 | FR | 195 | trans | Conserved | Exposed | Loop/β turn/β hairpins |
Light | 44 | FR | 199 | trans | Conserved | Exposed | β strand |
Light | 59 | FR | 218 | trans | Conserved | Exposed | Loop |
Light | 80 | FR | 239 | trans | Non-conserved | Exposed | Helix |
The total cell protein profile clearly showed that the expression levels of 4S-FP-hu-MscFv and 4R-FP-hu-MscFv were almost comparable to the expression level of scFv with natural proline NP-hu-MscFv (Fig. S3, ESI†). This result suggests that 4S-FPro and 4R-FPro are efficiently recognized by E. colitranslation system as previously reported.26 The expression profiles of soluble and insoluble fractions from each variant were analyzed by western blot (Fig. 2a). It showed that the folding level of hu-MscFv was significantly affected by the incorporated proline analogues. Almost 95% of the 4S-FP-hu-MscFv was not able to fold properly thereby found mostly in the insoluble fractions, which suggested that the incorporation of 4S-FPro into the hu-MscFv proline sites negatively affected the folding of the protein. On the other hand, the 4R-FP-hu-MscFv was favored to fold properly to the same level as NP-hu-MscFv even with the incorporation of 4R-FPro (Fig. 2a).
![]() | ||
Fig. 2 Expression and activity confirmation analysis of fluorinated hu-MscFvs. (a) Western blotting of soluble and insoluble fractions of BL21(DE3)pLysS (KC1325) for the NP-hu-MscFv, 4S-FP-hu-MscFv and 4R-FP-hu-MscFv. Lane NP, NP-hu-MscFv; lane −ve, negative control (hu-MscFv over expressed without proline or its analogue); lane RP, 4R-FP-hu-MscFv; lane SP, 4S-FP-hu-MscFv. (b) The relative activities from soluble fractions of BL21(DE3) pLysS(KC1325) for the NP-hu-MscFv, 4S-FP-hu-MscFv and 4R-FP-hu-MscFv were analyzed by ELISA. Lane −ve, negative control (hu-MscFv over expressed without proline or its analogue); lane NP, NP-hu-MscFv; lane RP, 4R-FP-hu-MscFv; lane SP, 4S-FP-hu-MscFv. The activity of NP-hu-MscFv was taken to be 100%. |
The binding activities of hu-MscFv variants were also examined using the soluble fractions of the respective variants. The analysis confirmed that the folded variant 4R-FP-hu-MscFv was active and not affected significantly by the replacement of prolines with 4R-FPro (Fig. 2b). As expected, there was no significant amount of activity signal from the soluble fraction of 4S-FP-hu-MscFv, which again ensured that the folded or active form of 4S-FP-hu-MscFv in the soluble fractions was very limited. These results indicated that the 4R-FPro had no negative influence on the folding and activity of hu-MscFv.
The trypsin digested peptides containing proline sites (L.VLTQSATL.S/VHPro8, W.YQQK
GQA
RYL.I/VHPro40 and VHPro44, and Y.TKGTGV
ARF.S/VHPro59) were ionized and analyzed by ion-trap MS/MS. The hu-MscFv with 4R-FPro had most abundant three different fragment ions m/z (947.53022, 1484.76447 and 1051.57890) peptides. Similarly, the same three fragment ion peptides of NP-hu-MscFv with natural prolines were observed at m/z 929.53022, 1448.76447 and 1033.57890 respectively. The mass shift (+18, +36 and +18 Da) clearly indicated that 4R-FPro (133.12 Da) was incorporated instead of prolines (115.13 Da) in the selected four different proline sites of 4R-FP-hu-MscFv. The incorporation efficiency of 4R-FPro was estimated to be approximately more than 90% on the basis of the results of SDS-PAGE (Fig. S3, ESI†) and the HPLC-MS-MS, as already demonstrated in the previous reports.26,35
![]() | ||
Fig. 3 Specific activity analysis of hu-MscFv with natural proline or its analogues. The relative specific activities of purified NP-hu-MscFv (hu-MscFv Proline) and 4R-FP-hu-MscFv (hu-MscFv 4R-FPro) were determined by ELISA assay against cMet. The specific activity of NP-hu-MscFv was taken to be 100%. Values are the average of duplicate experiments and represented as mean ± standard deviation. |
![]() | ||
Fig. 4 Effect of temperature on the stability of hu-MscFv with fluoroproline. Percentage of relative binding activities of NP-hu-MscFv (N-Pro) and 4R-FP-hu-MscFv (R-Pro) were determined against c-Met by ELISA after incubating the samples in different temperatures (30, 40, 50, 60, 70, 80 and 90 °C) for 30 min. The binding activity of control (without any incubation) was taken to be 100%. |
![]() | ||
Fig. 5 Time-dependent activity profile of hu-MscFvs. Percentage of relative binding activities of NP-hu-MscFv (N-Pro) and 4R-FP-hu-MscFv (R-Pro) were determined against cMet by ELISA after incubating the samples at 40 °C for different time durations (30, 45, 60 and 120 min). The binding activity of control (without any incubation) was taken to be 100%. Each value represents the mean of triplicates. Values are the average of triplicate experiments and represented as mean ± standard deviation. |
To investigate the above point, the model structure of the 4R-FP-hu-MscFv was generated from the previously modeled hu-MscFv.32 The structural replacement of hydrogen atom into fluorine atom in the Cγ-(C4) position of proline residues was successfully done with the Discovery Studio 2.0 (Accelrys Inc., San Diego, CA, USA). It is always necessary to have energy minimization for the relaxation of structure after any modification. Hence the modified structure was prepared and energy was minimized as described in the Materials and Methods section. The energy minimization cycles were done with Steepest descent and followed by Conjugate gradient method, and both exited by satisfying the RMS gradient tolerance less than 0.1 kcal mol−1 Å−1, which indicated the structure attained a local minimum conformation. This energy minimized structure was further used for structural analysis (Fig. S8, ESI†).
The structural analysis for the identification of dipole interactions between the fluorine atom of 4R-FPro and the other polar groups (amide, hydroxyl or carbonyl) in the modeled structure of 4R-FP-hu-MscFv revealed the establishment of ten new interactions (Fig. 6). Among the identified ten interactions three show stereoelectronic repulsive position, in which two repulsions may be considered to be nullified by the favorable interactions of the neighbor atoms and remaining seven interactions are located in the distance to show favorable interactions with their local environment polar groups. These results indicate that the newly generated dipole interactions may induce the enhancement of the thermal stability.
![]() | ||
Fig. 6 Calculated dipole interactions in the modelled structure of 4R-FP-hu-MscFv. Fluorines are cyan, the new interactions are shown in green line except three repulsive interactions, which are indicated in black line. Four interactions are found in fluorinated VH domain: (a) VH(4R)FPro14 interacts favourably with the backbone –NH– of both VHSer113 (3.95 Å) and VHAla84 (3.84 Å) but with backbone –CO– of VHVal111 (3.90 Å) sterically located as repulsive, which may be nullified with the other two favourable interactions on the neighbouring residues. (b) The VH(4R)FPro41 has the most closer interaction with the –OH– group of VHThr87 (2.78 Å). Remaining six interactions were found in the fluorinated VL domain. (c) The VL(4R)FPro8 shows strong repulsive interaction with the backbone carbonyl of VLThr10 (3.31 Å). (d) The 4R-FPro of VL(4R)FPro44 has two favourable interactions with the side chain –NH– groups (E22: 3.60 Å and E21: 3.71 Å) and one repulsive steric contact with the side chain –CO(E1)– group of VLGln38 (3.21 Å), which may be nullified with other two favourable interactions. (e) The final two favourable interactions are identified between the VL(4R)FPro59 and the side chain –NH(H12)– and –NH(H22)– groups of VLArg61 in the measured distance of 3.26 Å and 3.21 Å respectively. Sums of 10 interactions are newly created by the fluorine atoms in 4R-FP-hu-MscFv that are absent in NP-hu-MscFv. |
Proline is a unique amino acid due to the existence of cis/transpeptidyl proline isomeric conformations on protein structures, which plays a key role as a rate limiting element for protein folding.36,37 Especially it was found to have correlation with the puckering effect in its pyrrolidine ring conformations as Cγ-exo or Cγ-endo.38,39 Previously these properties were structurally tuned with 4R-FPro and 4S-FPro that favor the Cγ-exo puckering/trans conformation and Cγ-endo puckering/cis transformation, respectively.35 These stereoelectronic effects have generally influence on the folding rate of protein depending on the peptidyl proline isomeric conformations on the protein structure. We could also observe this effect in our study. As shown in Fig. 2, only 4R-FPro favored the folding of hu-MscFv in vivo and 4S-FPro substantially disfavored it, which may be related to the isomeric conformation on hu-MscFv. According to our modeled structure of hu-MscFv, there are seven trans-prolines and only one cis-proline site which may prefer 4R-FPro and 4S-FPro, respectively. Therefore the folding of hu-MscFv with 4R-FPro might have a negative influence on the VL8 cis-proline site, but the in vivo soluble expression levels of scFv with 4R-FPro or natural proline were not different. This result indicates that the cis-proline residue in hu-MscFv may not be so sensitive to the stereoelectronic effect caused by incorporation of fluoroproline, which can be supported by the previous report showing that the VL8 proline site is insensitive to mutation.40 On the other hand, the negative effect of 4S-FPro incorporation on the folding of hu-MscFv suggests that there might be very sensitive sites in the seven trans-proline residues against the stereoelectronic effect.
We attempted the analysis of dipole interactions to understand the enhanced thermal stability of scFv, which allowed us to identify the newly formed seven favorable dipole interactions from the replaced fluorine atom in 4R-FPro. In fact, there would be other intramolecular interactions that can be newly generated from the introduced 4R-FPro and have influence on the stabilization of hu-MscFv. For example, the fluorination of the buried proline residues may increase the hydrophobic interactions in the protein core, which can synergetically stabilize the protein with dipole interactions.40 However, we presume that the newly formed dipole interactions are the major source for the stabilization of hu-MscFv rather than the hydrophobic effect because seven residues are surface-exposed and only one is buried among the eight proline residues in the protein (Table 1).
Briefly 1 mM of natural proline, 4R-FPro, and 4S-FPro were added into separate culture flasks and immediately induced with 0.05 mM IPTG for the cytoplasmic expressions of NP-hu-MscFv, 4R-FP-hu-MscFv, and 4S-FP-hu-MscFv respectively, for 5 h at 25 °C. Finally the OD600 was measured and cells were harvested by centrifugation for 10 min (5000 rpm) at 4 °C and stored at −70 °C until use. Lysis of the cells was carried out by using BugBuster protein extraction kit (Novagen). Briefly, collected cell pellet corresponding to 1 mL of 1 OD cells were resuspended in 100 μL of lysis buffer, incubated at room temperature for 20 min, and centrifuged for 45 min at 12000 rpm at 4 °C. The supernatant was saved as a soluble fraction, and the pellet was resuspended in 8 M urea for insoluble fraction. The amount of the soluble fraction as well as insoluble fraction was analyzed by SDS-PAGE (12% acrylamide gel) and Western blotting as described previously.32
Footnote |
† Electronic supplementary information (ESI) available: Frequency profile, sequence information, total cell expression profile, purification, LC-ESI-MS/MS, model structure. See DOI: 10.1039/c0mb00154f |
This journal is © The Royal Society of Chemistry 2011 |