Matthew S.
Luchansky
a,
Bryan S.
Der
a,
Sabato
D’Auria
b,
Gabriella
Pocsfalvi
b,
Luisa
Iozzino
b,
Daniela
Marasco
c and
Jonathan D.
Dattelbaum
*a
aDepartment of Chemistry, University of Richmond, Gottwald Center for the Sciences, 28 Westhampton Way, Richmond, VA 23173, USA. E-mail: jdattelb@richmond.edu
bInstitute of Protein Biochemistry, CNR, Naples, Italy
cUniversity of Naples Federico II, Naples, Italy
First published on 22nd September 2009
Members of the periplasmic binding protein superfamily are involved in the selective passage of ligands through bacterial cell membranes. The hyperthermophilic eubacterium Thermotoga maritima was found to encode a highly stable and specific periplasmic arginine-binding protein (TM0593). Following signal sequence removal and overexpression in Escherichia coli, TM0593 was purified by thermoprecipitation and affinity chromatography. The ultra-stable protein with a monomeric molecular weight of 27.7 kDa was found to exist as both a homodimer and homotrimer at appreciable concentrations even under strongly denaturing conditions, with an estimated transition temperature of 116 °C. Its multimeric structure may provide further evidence of the importance of quaternary structure in the movement of nutrients across bacterial membranes. Purified and refolded TM0593 was further characterized by fluorescence spectroscopy, mass spectrometry, and circular dichroism to demonstrate the specificity of the protein for arginine and to elucidate structural changes associated with arginine binding. The protein binds arginine with a dissociation constant of 20 μM as determined by surface plasmon resonance measurements. Due to its high thermodynamic stability, TM0593 may serve as a scaffold for the creation of a robust fluorescent biosensor .
As in many organisms, the T. maritimagenome contains a diverse range of ATP-binding cassettes, which couple the hydrolysis of ATP to the passage of ligands through the cell membrane.5 The expressed ATP-binding cassette consists of two transmembrane domains and two ATP-binding domains for transporting a specific ligand across the cell membrane.6 Each cassette relies on a soluble periplasmic binding protein (PBP) capable of interacting with a specific cognate ligand necessary for bacterial metabolic and nutrient gathering processes. T. maritima exhibits a high level of redundancy in its transport proteins, and the duplicated function provides augmented nutrient gathering and metabolic capabilities that represent a selective advantage.7 Combined with its position as a link between Archaea and Eubacteria, the wide array of transport systems that arose from operon duplication or horizontal gene transfer makes T. maritima an interesting and valuable subject for phylogenetic studies.
The PBPs that are associated with the ABC and other transport systems are also of interest for the construction of protein-based biosensors . Due to the variety of binding specificities, PBPs from Escherichia coli have been utilized as design platforms for fluorescent proteinbiosensors capable of targeting many naturally-occurring ligands, including sugars, anions, and amino acids.8–13 While diverse in ligand affinity, this family of proteins contains a highly conserved structural organization. PBPs are formed from a single polypeptide chain composed of two easily distinguishable domains that are connected by a hinge region which undergoes a large rotational-bending movement upon ligand binding.14,15 Using spectroscopic techniques, such as environmental-sensitivity, FRET, or plasmonic interactions, rational placement of fluorescent probes allows transduction of the binding event into a quantifiable optical signal that varies as a function of ligand concentration.16–18 In addition to detecting the natural set of ligands associated with PBPs, re-engineering the PBP binding pocket has significantly expanded the number of small-molecule analytes for which sensors may be constructed.19–21 However, there is a considerable cost in thermal stability for these re-engineered biosensors as a result of the large number of mutations required to alter the binding specificity.22 For this reason, putative thermophilic PBPs are being isolated and characterized because of their potential value in the design of new biosensing technology that features enhanced stability23 and the ability to monitor metabolite trafficking in vivo.24 Since T. maritima thrives at temperatures near the optimal growth temperature of 80 °C,1 its thermostable proteins are capable of retaining functional stability necessary to augment the current array of mesophilic scaffolds available for biosensing applications.
Here, we investigate a putative amino acid transport system in the T. maritimagenome .4 A periplasmic binding protein designated tm0593 at position chr: 627081–627
821 exhibits high homology to several E. coli amino acid-binding PBPs, particularly those for glutamine and histidine. As a hyperthermophilic protein, the gene product may be of interest for the creation of fluorescent biosensors based on a periplasmic binding protein scaffold, as well as the study of Archaea-like proteins in Eubacteria. In this study, we present the purification and spectroscopic characterization of the tm0593 gene product. Based on mass spectrometry, circular dichroism, fluorescence spectroscopy, and surface plasmon resonance, tm0593 was found to encode a thermostable periplasmic arginine-binding protein.
To serve as a starting point for determining the cognate ligand of TM0593, we measured the absorption and emission spectra in order to search for an altered spectroscopic response based on changes in Trp microenvironment polarity.27 The emission spectrum of the single Trp residue in TM0593 displays an unstructured peak with a maximum occurring at ∼320 nm (Fig. 1). This indicates an environment of low polarity similar to that observed for the common reference proteinRibonuclease T1 which is known to possess a buried Trp residue.28 Since TM0593 displays sequence similarity to many periplasmic amino acid-binding proteins, we tested for changes in Trp intensity and spectral shape in the presence of potential amino acid and sugar ligands, including arginine, glutamine, histidine, asparagine, glucose, ribose, and fructose (data not shown). Trp emission intensity did not change following prolonged incubations with any of these potential ligands. Furthermore, increasing the temperature for this thermophilic protein did not alter these observations, suggesting that the wild-type Trp residue location is not optimal for direct detection of ligand binding.
![]() | ||
Fig. 1 Absorption and emission spectral properties of TM0593. Excitation of the single Trp residue was performed at 295 nm in 5 mM phosphate buffer, pH 7.0. |
![]() | ||
Fig. 2 Thermal unfolding of TM0593 measured by circular dichroism in the presence of Gdn-HCl (top). The extrapolated unfolding temperature was found to be 116 ± 3 °C. In the presence of 6 M Gdn-HCl, the addition of 1 mM arginine produced a significant 16 °C increase in the melting temperature (bottom). |
Determination of the cognate ligand binding partner was accomplished using CD thermal unfolding experiments in the presence and absence of substrates as described previously.23,29,30 An increase in thermal stability in the presence of a particular ligand indicates a strong preference for binding. To test for binding, we performed CD thermal melting experiments on dialyzed TM0593 in the presence of 6 M Gdn-HCl and 1.0 mM of ligand: aspartate, glutamine, histidine, asparagine, lysine, or arginine (Table 1). Equilibration of each sample was complete following 24 h of incubation at room temperature. Of the ligands tested in this experiment, arginine alone produces a significant 16 °C increase in the TM0593 melting temperature (Fig. 2, bottom).
Protein a | Unfolding temperature/°C | Standard error/°C |
---|---|---|
a All CD spectra were obtained using 1 μM protein equilibrated in 6 M Gdn-HCl for 24 h prior to taking measurements. b Ligand was added to 1 mM. | ||
Native TM0593 (bound form) | 78.4 | 0.7 |
TM0593 no ligand | 62.2 | 1.8 |
TM0593 + Aspb | 62.7 | 0.9 |
TM0593 + Glu | 63.4 | 0.9 |
TM0593 + Asn | 62.8 | 0.8 |
TM0593 + Lys | 62.5 | 0.2 |
TM0593 + His | 62.8 | 0.5 |
TM0593 + Arg | 78.1 | 1.9 |
Non-denaturing nano-ESI-MS was also used to study the selectivity of TM0593 for various amino acid ligands. Low concentration volatile buffer (10 mM ammonium formate) was used to maintain the native protein conformation in the solution phase, and carefully controlled low energy ionization conditions were applied to transfer weak molecular associations from the solution into the gas phase. Purified TM0593 protein (10 μM) was incubated with arginine (20 μM) and resulted in three distinct charge distributions in the nano-ESI mass spectrum corresponding to native TM0593, denatured TM0593, and native TM0593 plus ligand (Fig. 3). The most abundant distribution was observed in the mass range of m/z 1400–2300 which comprises the protonated molecular ion peaks of TM0593 with 19 to 12 positive charges. These peaks are indicative of the native non-denatured form of the protein. The denatured form, on the other hand, yields a second distribution of highly charged (34 to 24 positive charges) molecular ion peaks in the lower mass range m/z 800–1400. These two charge distributions were used to calculate the average molecular mass for the intact protein and yielded identical values of 27701.13 Da, which is the expected mass of TM0593 expressed with a linker to a hexahistidine peptide tag sequence.
![]() | ||
Fig. 3 Nano-ESI mass spectrum of purified TM0593 (10 μM) incubated with 20 μM arginine in 10 mM ammonium formate. The three distinct charge distributions in the spectrum correspond to native TM0593 (circles), denatured TM0593 (shaded), and native TM0593 plus Arg (triangles). |
The third distribution observed in Fig. 3 occurs at slightly higher m/z than the non-denatured protein distribution. The molecular mass at this peak was determined to be 27875.6 Da, which corresponds to TM0593 in complex with the amino acid arginine (mw = 174.2 g mol−1) in a 1 : 1 protein–ligand stoichiometry. This indicates that only the native form of the protein takes part in the protein–ligand association, as would be predicted. Similar experiments performed with Asn and His did not lead to the formation of protein–ligand complex under the conditions used.
To quantitate the binding affinity and to further characterize the binding specificity of this protein for arginine, surface plasmon resonance (SPR) experiments were performed on TM0593. Two model peptides were designed using the crystallographic data for the E. coliglutamine bindingprotein in complex with glutamine (pdb code: 1WDN), which is a protein with high sequence homology to TM0593. The glutamine ligand is bound in a cleft between two protein domains and stabilized by hydrogen bonds and ionic interactions with Asp10, Gly68, Thr70, Ala67, Asp157, Arg75, Lys115, Gly119 and His156.31Peptides were constructed with either an N-terminal arginine or glutamine residue with sequences of (Arg-Gly)2-Lys-(Gly)3-Cys-NH2 and (Gln-Gly)2-Lys-(Gly)3-Cys-NH2. Five glycine residues were introduced as spacers to displace the N-terminal bindingamino acid from the C-terminus, whereas the cysteines were inserted into the peptide for conjugation with thiol-reactive dyes in future fluorescence studies. The branched end structure was added to produce peptides with a suitable molecular size for SPR binding studies. TM0593 was efficiently immobilized on the chip surface, and an overall 3500 RU immobilization level was achieved under the reported conditions. SPR analyses were performed by injecting increasing concentrations of each peptide: (Arg-Gly)2-Lys-(Gly)3-Cys-NH2 (0 to 400 μM ) and (Gln-Gly)2-Lys-(Gly)3-Cys-NH2 (0 to 700 μM ). RUmax values from each SPR binding experiment were plotted, and the EC50 values were evaluated to be 23.4 ± 1.1 μM and 160 ± 1.0 μM for (Arg-Gly)2-Lys-(Gly)3-Cys-NH2 and (Gln-Gly)2-Lys-(Gly)3-Cys-NH2, respectively (Fig. 4). These data demonstrate a highly preferential binding affinity of TM0593 for the polar amino acidarginine.
![]() | ||
Fig. 4 SPR binding data for immobilized TM0593 with (Arg-Gly)2-Lys-(Gly)3-Cys-NH2 (top) and (Gln-Gly)2-Lys-(Gly)3-Cys-NH2 (bottom). The line drawn represents the best fit by non-linear regression analysis and was used to determine the EC50 values reported. |
![]() | ||
Fig. 5 Representative data for TM0593 thermal unfolding monitored by a decrease in Trp emission in the presence and absence of 1 mM arginine or asparagine. Fluorescence measurements were obtained at an excitation wavelength of 280 nm in 5 mM phosphate buffer, pH 7.0. |
![]() | ||
Fig. 6 TM0593 conformational change is identified by Stern–Volmer quenching with iodide in the presence and absence of 1 mM Arg. Error bars represent the standard deviation associated with the average of three independent trials. |
One of the major findings of this work is that it represents the first use of nano-ESI-MS to aid in the identification of the cognate ligand of a PBP. While sequence similarity searches are a good approach to narrow the pool of potential cognate ligands, we believe that the MS experiments presented here represent a more rapid alternative to traditional methodologies that involve screening libraries of ligands using fluorescence, CD, or radioactive binding assays . By combining the structural characterization capabilities of mass spectrometry with more traditional fluorescence and CD-based spectroscopic methods, it is possible to convincingly identify specific protein–ligand interactions.
This study also provides evidence that the arginine-binding PBP in T. maritima exists as both a homodimer and homotrimer at appreciable concentrations. Two previous studies, using the periplasmic binding proteins TM032226 and the Rhodobacter shpaeroides α-keto acid-binding protein,32 detected the presence of higher order oligomers in the native state. It is speculated that ligand-induced conformational changes lead to multimerization and that these changes are necessary for ligand transport.26,32 Additionally, the arginine–agmatineexchange transporter, a component of the arginine-dependent acid-resistance system in E. coli, has been demonstrated to exist as a homodimer within lipid membranes.33 Though increasingly observed in a variety of bacterial systems, the specific function of binding protein multimeric association is not well understood, as it is possible that either each monomer acts independently or that there exists significant cross-talk or cooperation between the monomers. The dimeric and trimeric nature of TM0593, even in strongly denaturing conditions, provides further evidence that quaternary structure and multimerization are significant elements of metabolic and homeostatic processes that involve arginine. Taken together, these results support a pattern of binding protein multimerization that is hypothesized to be important for the translocation of some ligands across the bacterial membrane.
Comprehensive phylogenetic analyses of T. maritima have been performed, and notable lateral gene transfer events between T. maritima and Archaeal species living in the same environment have been widely observed.34–36 TM0593 was originally assigned as a glutamine bindingprotein belonging to the 4th largest protein family, consisting of putative response regulators and ABC transport proteins. In E. coli, there are 4 well-characterized transport systems for polar amino acids, including glutamine37 (glnHPQ), glutamate–aspartate38,39 (gltIJKL), arginine40 (artPIQM artJ), and lysine–arginine–ornithine–histidine41,42 (argT hisJQMP). The confirmation of function for putative proteinsvia biochemical analysis is time-consuming and requires utilization of a broad array of experimental tools,36 so the ability to draw phylogenetic connections is particularly valuable. The ABC uptake system described here is organized from TM0593 (soluble periplasmic binding protein), TM0592 (transmembrane permease), and TM0591 (ATP-binding protein). Although functionally more similar to the arginine transport system, this gene organization is most similar to the glutamine permease operon (glnHPQ) where dimers of the permease and ATP-binding protein are necessary for activity.5 TM0592 displays 30% amino acid sequence identity (50% similarity) to the E. coli ArtQ transmembraneortholog, and TM0591 displays 43% identity (66% similarity) to E. coli ArtP ATP-binding protein. Based on the functional assignment of TM0593 as an arginine-binding protein and to reflect the nomenclature used to describe genes of ABC transport operons, we propose to rename the annotated T. maritimagenes (TM0593, TM0592, TM0591) to reflect an operon organization denoted artJQP.
Each of the ABC operons is dependent upon a periplasmic binding protein for transport specificity. A neighbor-joining phylogenetic analysis comparing TM0593 with PBPs from a variety of mesophilic and thermophilic organisms is shown in Fig. 7. Each of the proteins used has been experimentally characterized for ligand binding and for three-dimensional structure determination. While many PBP family members possess high levels of amino acid sequence similarity and overall three-dimensional structures reflecting their common origin, there are two distinct structural classes of PBPs based on the number and arrangement of β-strands present among these proteins.43 Class I proteins (red in Fig. 7) contain six β-strands ordered 213456, while class II proteins (blue in Fig. 7), which were derived later in evolutionary time, have five β-strands organized 21354 and also possess a more complex pattern of connectivity between upper and lower domains.43 Experiments analyzing the evolutionary significance of this structural differentiation have shown distinct differences in folding pathways; class I proteins typically follow simple two-state pathways, while more complex folding is observed in class II PBPs.44,45 However, caution in interpreting folding data on PBPs is important because additional data on GBP show that folding proceeds along a complex route of intermediates prior to the addition of glucose, and then a two-state mechanism of complementary folding is observed only after ligand binding.46
![]() | ||
Fig. 7 Unrooted neighbor-joining tree of PBPs with experimentally established 3D structure and ligand specificity. Proteins representing class I (red) and class II (blue) structures are shown.43 Phylogenetic analysis of aligned sequences was performed using PAUP*, and the tree was constructed using 500 bootstrap replicates employing a distance optimality criterion.52 Cognate ligand designations: ABP, arabinose; ArtJ_EC, Arg; ArtJ_GST, Arg–His; DPBP, dipeptide; GBP, glucose; HBP, His; LBP, Leu; LIVBP, Leu–Iso–Val; LAO, Lys–Arg–Orn; MBP, maltose; NiBP, nickel(II); OPBP, oligopeptide; PhBP, phosphate; QBP, Gln RBP, ribose; SBP, sulfate. Abbreviations used for organisms: EC, E. coli; GST, G. stearothermophilus; ST, S. typhimurium; TM, T. maritima; TT, T. thermophilus. |
In the tree presented in Fig. 7, three distinct branches are evident, with the two structural classes grouping to a significant extent. On the TM0593 branch, binding proteins for the polar amino acidshistidine, glutamine, ornithine, and lysine, as well as Arg only, are found. The Geobacillus stearothermophiluslysine-, arginine-, histidine-binding protein (ArtJ_GST) which was recently characterized47 was the closest thermophilic protein neighbor in the diagram. Located on the same branch as TM0593, one of the most well-characterized periplasmic binding proteins is the Salmonella typhimurium LAO binding protein (LAO_ST).41 By performing X-ray crystallography and binding assays on LAO_ST, sub-micromolar dissociation constants were found for lysine, arginine, and ornithine, as well as for histidine.42,48 While broad ligand binding capabilities are found in many binding proteins with arginine affinity, our CD and SPR data indicate that TM0593 binds exclusively to Arg with micromolar affinity.
It is possible to use this phylogenetic analysis as a predictive tool for structural classification of newly characterized PBPs, and these results lead us to propose that TM0593 should crystallize as a class II PBP. These data suggest that the ability to acquire polar amino acids from the environment may have evolved later than transport systems for sugars or nonpolar amino acids based on the structural distinctions used here as well as by others. Our work represents the continuing advancement of biochemical comparisons among amino acid transport systems in mesophiles and thermophiles .
Additionally, this work succeeds in the characterization of a highly thermostable protein which may be utilized as a scaffold for biosensor design. In its native form, TM0593 is an ultra-stable protein that has the potential to provide the basis for a durable and highly specific sensor for arginine. Employing techniques such as FRET would allow for detection of arginine and its quantitation in unknown samples. Furthermore, through gene shuffling into less stable PBP family members specific for other analytes or by computationally-derived site-directed mutagenesis of the binding pocket, it may be possible to alter the specificity of TM0593 for other small-molecule targets. Ideally, this would allow access to a broad array of sensing targets while still maintaining the high structural stability characteristic of the native arginine-binding protein.
Cells were thawed at room temperature and incubated with lysozyme (2 mg) and DNase (0.3 mg) for 20 min. The re-suspended cells were lysed by sonication, and the cell debris was removed by centrifugation (13000 rpm, 30 min). The protein supernatant was initially purified by thermoprecipitation (68 °C water bath, 35 min) and the heat-denatured proteins were removed by centrifugation (13
000 rpm, 30 min). The remaining soluble protein extract was added to a Ni-NTA column for purification by affinity chromatography. TM0593 was bound to the column via a C-terminal 6×His tag and was extensively washed with 20 mM imidazole prior to elution with 250 mM imidazole. At this point, the protein was buffer exchanged into 5 mM phosphate, pH 7.0, and could be stored for more than 4 months at 4 °C. The protein concentration was determined by UV absorbance at 280 nm by a Nanodrop Spectrophotometer (Thermo Scientific, Wilmington, DE) using 14
400 M−1 cm−1 as a predicted extinction coefficient.49
To remove any bound ligand remaining following purification, extensive dialysis or chemical denaturation/refolding was performed. For the latter method, isolated protein was unfolded by incubating the protein overnight at 68 °C in 7.2 M Gdn-HCl. The denatured protein was bound to a Ni-NTA column, washed, and refolded by incremental addition of decreasing Gdn-HCl concentrations. The refolded protein was eluted with 250 mM imidazole and buffer exchanged into 5 mM phosphate pH = 7.0 using a PD-10 column (Bio-Rad, Hercules, CA) to yield approximately 8 mg of TM0593 per litre of cells. Alternatively, isolated protein was dialyzed against 5 mM phosphate buffer for 3 days with twice daily buffer changes. Both methods yielded purified TM0593 with identical spectroscopic and chromatographic properties. Though TM0593 containing the signal sequence was isolatable from the periplasm using sucrose shock,11 higher levels of total protein without the signal sequence were obtained from the cytoplasm from an equivalent number of cells. The molecular weight of purified TM0593 was characterized using 15% SDS-PAGE and visualized using coomassie brilliant blue.
Nano-ESI-MS experiments were performed on a quadrupole time-of-flight (Q-TOF) instrument, QSTAR Elite, from Applied Biosystems (Foster City, CA). Sample (10 μM) was introduced via a distal coated silica PicoTip (od 150 μm, id 20 μm, tip id 10 μm; New Objective) by infusion at 300 nL min−1. For MS analysis under native conditions, an aliquot of the TM05953 solution was buffer exchanged into 10 mM ammonium formate solution using a Millipore microcon column (10 kDa cut-off) immediately prior to injection. For analysis under denaturing conditions, a 10 μM sample was diluted twice in 50% acetonitrile, 0.1% formic acid. The following experimental parameters were used: ion spray voltage, 1.8 kV; curtain gas, 15 L h−1; declustering potential 1 and 2 were set to 40 V and 15 V, respectively; focusing potential, 280 V.
PBP | bacterial periplasmic binding protein |
TM0593 | Thermotoga maritima arginine-binding protein |
Gdn-HCl | guanidine hydrochloride |
T M | melting temperature |
CD | circular dichroism |
Trp | tryptophan |
Arg | arginine |
MALDI-TOF | matrix assisted laser desorption ionization–time-of-flight |
nano-ESI-MS | nanoflow electrospray ionization mass spectrometry |
TIS | triisopropylsilane |
TFA | trifluoroacetic acid |
DMF | dimethylformamide |
DCM | dichloromethane |
HBTU | 1-H-benzotriazolium, 1-[bis(dimethylamino)methylene]-hexafluorophosphate(1-),3-oxide |
HOBt | N-hydroxybenzotriazole |
DIEA | diisopropylethylamine |
Fmoc | fluorenylmethoxycarbonyl |
TCEP | tris(2-carboxyethyl)phosphine |
LC-MS | liquid chromatography–mass spectrometry |
EDC | 1-ethyl-3-[3-dimethylaminopropyl]carbodiimide hydrochloride |
NHS | N-hydroxysuccinimide |
Footnote |
† Electronic supplementary information (ESI) available: Structures of the peptides used for SPR measurements are provided. Links to view 3D visualisations of structures using FirstGlance. See DOI: 10.1039/b908412f |
This journal is © The Royal Society of Chemistry 2009 |