Manxi
Wang
and
Feng
Gai
*
Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China. E-mail: fgai@pku.edu.cn
First published on 14th March 2025
4-Cyanotryptophan is a blue fluorescent unnatural amino acid (UAA) that not only has a large fluorescence quantum yield (>0.8) and a long fluorescence lifetime (>13 ns) in water but is also chemically and photophysically stable, making it a valuable UAA-based fluorophore in biological spectroscopy and microscopy. However, its broad utility requires further development of simple methods for introducing this UAA into large proteins. Herein, we demonstrated two such methods, one is based on a post-translational modification strategy and the other is based on a selective pressure incorporation approach. In addition, in a proof-of-principle application, we demonstrated that incorporation of this UAA at the 66 position of a green fluorescent protein (GFP) variant can modify its photophysical properties.
As shown (Scheme 1), the first method is chemically based, which utilizes an alkylation reaction to append a 4-cyanoindole (4CNI) moiety, which is the chromophore of 4CN-Trp, to the desired protein via a free sulfhydryl group (–SH) of a cysteine (C) residue. Because the underlying SN2 reaction is relatively simple and can be carried out under mild conditions with good yield, it has been used to incorporate various small-sized spectroscopic probes, such as boron dipyrromethene difluoride (BODIPY) fluorescent probes,15,16 nitrile-based NMR probes,17 and an ester infrared probe18 into proteins.
![]() | ||
| Scheme 1 The alkylation reaction used to append the fluorophore of 4CN-Trp to the protein (blue sphere) of interest. | ||
As shown (Scheme 2), the second method is biologically based, which employs the selective pressure incorporation (SPI) method to incorporate 4CN-Trp (the L-form) into the protein of interest. The SPI method19 relies on the promiscuity of a natural aminoacyl-tRNA synthetase (AARS) to recognizes a UAA for protein translation. This method is most effective when an auxotroph is used, because it requires a specific amino acid to be supplied exogenously and therefore an analogue of this amino acid can be used for substitution. Because its simplicity in practice, the SPI method has been widely used to incorporate various UAAs into proteins in a residue-specific manner. In particular, tryptophan-autotrophs have been used to introduce various Trp analogues into different proteins. For example, Bronskill and Wong incorporated fluorine-substituted Trps into ribosomal proteins to suppress the intrinsic fluorescence of Trp and to facilitate the spectroscopic investigation of other chromophores,20 Boknevitz et al.21 incorporated a boron and nitrogen-containing tryptophan analogue into superfolder GFP (sfGFP) to serve as a fluorescent probe, and Ohler et al.22 incorporated 4-fluoro-5-hydroxytryptophan and 6-fluoro-5-hydroxytryptophan into Azurin to serve as EPR probes.
In the current study, we used sfGFP as our model system to first test whether 4CN-Trp can be incorporate into proteins via the SPI method. This is because sfGFP can be highly expressed in E. coli23 and contains only one Trp residue (i.e., Trp57). Subsequently, we employed cerulearn (pdb ID: 2WSO),24 which is a variant of enhanced cyan fluorescent protein (eCFP)25 that consists of a Trp-based chromophore, to show that 4CN-Trp can potentially be used to develop fluorescent proteins with different photophysical characteristics. Our results demonstrated that 4CN-Trp not only can be incorporated into both protein systems, but it can also be used to tune their absorption and emission properties.
The peptide used in the current study was synthesized via standard 9-fluorenylmethoxy-carbonyl (Fmoc) solid phase peptide synthesis method and purified by reverse-phase high-performance liquid chromatography on an Agilent Technologies 1260 Infinity equipped with a C18 column. The masses of the unlabelled and labelled peptides were verified using a liquid chromatography mass spectrometer from Waters. All proteins were purified by AKTA fast protein liquid chromatography equipped with desalting column and His Trap HP column. For spectroscopic measurements, protein samples were prepared in PBS buffer (50 mM, pH = 7.4).
Time-resolved fluorescence measurements were carried out on a FLS980 time-correlated single-photon counting (TCSPC) apparatus (Edinburgh Instruments). Fluorescence excitation at 405 nm was achieved by an EPL-405 laser, and fluorescence emission was collected at 480 nm. The time base was set to be 0–60 ns with a total of 2048 channels and the maximum peak count was set at 10
000. The fluorescence decay kinetics were analyzed by the F980 software using an instrument response function (IRF) determined by measuring the scattering light from a silica solution under the same experimental conditions. In addition, the optical density (OD) of the sample at the excitation wavelength of 405 nm was in the range of 0.10–0.15.
The fluorescence QY of a particular sample was determined using the following equation:
To demonstrate the feasibility of using 3B4CNI to carry out the aforementioned cysteine alkylation reaction, we first tested its reaction efficiency with a short peptide that contains a single cysteine residue (sequence: YGGCGG). We found that, as expected, 3B4CNI can react with YGGCGG quickly under mild conditions (i.e., in PBS buffer of pH 8.0 and at 37 °C, see the ESI,† for details). For example, through the use of mass spectrometry as a means to monitor the progress of the reaction, we found that, at a 3B4CNI/peptide molar ratio of 10
:
1, the yield of the alkylation reaction can reach up to ca. 95% in three hours (Fig. S1 in the ESI†). In addition, we found that in the presence of 8 M urea, which is commonly used to unfold proteins, this reaction also works well (Fig. S2 in the ESI†), suggesting that 3B4CNI can also be used to react with cysteine residues that are buried in the interior of the protein of interest under denaturation conditions.
As shown (Fig. 1A and Fig. S3 in the ESI†), the absorption spectrum of the 4CNI-labelled YGGCGG peptide (hereafter referred to as YGGC*GG) is similar to that of GWCNG (where WCN represents 4CN-Trp). However, the maximum emission wavelength (λem) of the 4CNI fluorophore in YGGC*GG is at ca. 437 nm, which is redshifted by ca. 15 nm from that of the 4CNI fluorophore in GWCNG (Fig. 1B and Fig. S3 in the ESI†). In addition, the absorption spectrum of YGGC*GG is insensitive to solvent (Fig. S4 in the ESI†), and its fluorescence QYs measured in different solvents, including H2O, ethanol (EtOH), isopropanol (IPA) and tetrahydrofuran (THF), are similar (Fig. S5 and Table S1 in the ESI†). For example, the QY values were determined to be 0.21 ± 0.03 in H2O and 0.23 ± 0.07 in THF, respectively. Taken together, these results indicate that the sulfur atom on the sidechain can significantly affect the photophysical property of 4CNI. Nevertheless, as observed for 4CNI and 4CN-Trp,9 the λem value of YGGC*GG exhibits a sensitive dependence on solvent (Fig. 1B), suggesting that the 4CNI fluorophore thus appended on a protein could also be used as a fluorescence probe of local electric field.9
![]() | ||
| Fig. 1 (A) Absorption spectrum of YGGC*GG in H2O. (B) Normalized fluorescence spectra of YGGC*GG in H2O, EtOH, IPA and THF, as indicated. These data were obtained with an λex of 325 nm. | ||
Next, we tested the utility of 3B4CNI to incorporate a 4CNI fluorophore into large protein systems (see the ESI,† for details). Specifically, we chose bovine serum albumin (BSA) as it is easily available and has only one free cysteine residue (i.e., Cys34). Moreover, Cys34 is only partially exposed to solvent, making the test more representative. Specifically, the BSA solution was prepared in phosphate buffer (pH 8.0, 50 mM) with a concentration of 13.2 mg mL−1 and the alkylation reaction was started by mixing a 100 μL of this BSA solution with a 3B4CNI solution (4.6 mg in 10 μL DMF). Then, the resultant reaction mixture was vortexed and shaken vigorously for 3 hours at 37 °C. Subsequently, the solid in the reaction mixture was removed by centrifugation (5 min, 4 °C, 10
000 rpm), followed by removal of the unreacted 3B4CNI in the supernatant by an AKTA FPLC desalting column. Then, the protein-containing solution, which contains the labelled BSA and unlabelled BSA molecules, was lyophilized and the resultant powders were used in the subsequent experiments.
The relevant mass spectrometry results exhibit a mass difference of ca. 160 Da, indicating the successful labelling (Fig. S6 in the ESI†). As shown (Fig. 2A), the absorption spectrum of the labelled BSA further confirms the successful incorporation of the 4CNI chromophore into the protein, since in the wavelength range of >300 nm it exhibits the characteristics of the absorption spectrum of YGGC*GG. Similarly, the fluorescence spectrum of the labelled BSA in H2O, obtained with an λex of 325 nm, also corroborates this notion (Fig. 2B). This is because the other native aromatic amino acids in BSA have a negligible absorbance at 325 nm; hence the fluorescence must arise from the 4CNI fluorophore. Moreover, the λem (415 nm) of the 4CNI-labelled BSA is blue-shifted ca. 22 nm from that (437 nm) of 4CNI-labelled peptide in H2O (Fig. 1B), which is consistent with the fact that the Cys34 is partially buried in BSA.
![]() | ||
| Fig. 2 (A) Normalized absorption spectra of BSA (black) and 4CNI-labelled BSA (red) in H2O. (B) Fluorescence spectrum of 4CNI-labelled BSA in H2O, obtained with an λex of 325 nm. | ||
Result from SDS-PAGE analysis, which shows the existence of a ∼27 kDa protein (Fig. S7 in the ESI†), indicates that sfGFP can be successfully expressed in media that contains 4CN-Trp but no Trp and that 4CN-Trp is recognized by the AARSs from the auxotrophic strain and the native Trp residue is mutated to 4CN-Trp. As shown (Table S2 in the ESI†), mass spectroscopic measurements on the expressed protein (hereafter refeed to as 4CN-Trp-sfGFP) also confirm that it contains a 4CN-Trp residue. Further spectroscopic measurements on 4CN-Trp-sfGFP provide additional supporting evidences. As indicated (Fig. 3A), compared to that of sfGFP, the absorption spectrum of 4CN-Trp-sfGFP contains an additional band in the 300–350 nm region, which is characteristic of the absorption feature of 4CN-Trp. Interestingly, the fluorescence spectrum of 4CN-Trp-sfGFP obtained with an λex of 340 nm, where the intrinsic fluorophore of sfGFP has a minimum absorbance, indicates that the 4CN-Trp is only weakly emissive in this case (Fig. 3B). However, in comparison to that of the fluorescence spectrum of sfGFP of equal concentration, the intensity of the intrinsic sfGFP fluorescence spectrum of 4CN-Trp-sfGFP is significantly increased (Fig. 3B). This indicates that the 4CN-Trp fluorescence is significantly quenched by the intrinsic fluorophore of sfGFP, likely through a fluorescence resonance energy transfer (FRET) mechanism. This is consistent with the fact that the fluorescence spectrum of 4CN-Trp completely overlaps with the absorption spectrum arising from the intrinsic chromophore of sfGFP as well as with the fact that, according to the crystal structure of sfGFP,23 the Trp57 residue is only ca. 13.0 Å away from this chromophore. Finally, the fluorescence spectrum of the 4CN-Trp residue in 4CN-Trp-sfGFP is peaked at 397 nm, indicating that it is situated in a dehydrated environment, which is consistent with the fact that Trp57 is buried. Taken together, our results not only confirm that 4CN-Trp can be incorporated into proteins via the SPI method but also indicate that in the case of sfGFP (or similar GFP variants) the Trp to 4CN-Trp mutation can effectively increase the overall brightness of the chromophore in the 300–350 nm excitation wavelength region.
Because nitrile is a strong electron-withdrawing group,38 we hypothesize that replacing the Trp66 residue with 4CN-Trp in Cerulean can further tune its absorption and emission properties. As shown (Table S3 in the ESI†), the two native Trp residues in Cerulean can be successfully replaced by 4CN-Trp using E. coli ATCC49980 that was routinely transformed with plasmid: pET22b-Cerulean (with a His-tag at the protein C-terminus) and the same expression protocols described above for sfGFP, except that that in the current case a lower induction temperature (27 °C) and a longer incubation time (24 h) were used.
As indicated (Fig. 4), the 4CN-Trp-containing Cerulean (hereafter referred to as 4CN-Trp-Cerulean) indeed exhibits different photophysical behaviors from Cerulean. Specifically, (1) its absorption spectrum is blue-shifted by ca. 25 nm from that of Cerulen, with an λab of ca. 413 nm; (2) its fluorescence spectrum becomes narrower and is also blue-shifted (ca. 8 nm) from that of cerulean; (3) its fluorescence QY (0.32 ± 0.08; determined with an λex of 420 nm; Fig. S8 in the ESI†) is lower than that of Cerulean (0.49);39 and (4) while similar to that observed for Cerulean, a double-exponential function is required to describe the fluorescence decay kinetics of 4CN-Trp-Cerulean (Fig. S9 in the ESI†), the decay lifetimes (τ1 = 0.8 ± 0.1 ns and τ2 = 3.1 ± 0.1 ns) are different from those of Cerulean (τ1 = 2.3 ns and τ2 = 4.5 ns).25 Therefore, taken together, these results confirm our hypothesis that 4CN-Trp can be used to more effectively modify the photophysical properties of fluorescent proteins, compared to Trp.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5cp00112a |
| This journal is © the Owner Societies 2025 |