 Open Access Article
 Open Access Article
Jacopo Marino
 a, 
Reto Walser
db, 
Martin Poms
a, 
Reto Walser
db, 
Martin Poms
 dc and 
Oliver Zerbe
dc and 
Oliver Zerbe
 *d
*d
aLaboratory of Biomolecular Research, Paul Scherrer Institute, Villigen, Switzerland
bAstex Pharmaceuticals, 436 Cambridge Science Park, Milton Road, Cambridge, CB4 0QA, UK
cDepartment of Neurology, Children's Hospital Zurich, Steinwiesstrasse 75, CH-8032 Zurich, Switzerland
dDepartment of Chemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland. E-mail: oliver.zerbe@chem.uzh.ch;  Fax: +41-44-6356882;   Tel: +41-44-6354263
First published on 9th March 2018
Cotranslational protein folding is a vectorial process, and for membrane proteins, N-terminal helical segments are the first that become available for membrane insertion. While structures of many G-protein coupled receptors (GPCRs) in various states have been determined, the details of their folding pathways are largely unknown. The seven transmembrane (TM) helices of GPCRs often contain polar residues within the hydrophobic core, and some of the helices in isolation are predicted to be only marginally stable in a membrane environment. Here we review our efforts to describe how marginally hydrophobic TM helices of GPCRs integrate into the membrane in the absence of all compensating interhelical contacts, ideally capturing early biogenesis events. To this end, we use truncated GPCRs, here referred to as fragments. We present data from the human Y4 and the yeast Ste2p receptors in detergent micelles derived from solution NMR techniques. We find that the secondary structure in the fragments is similar to corresponding parts of the entire receptors. However, uncompensated polar or charged residues destabilize the helices, and prevent proper integration into the lipid bilayer, in agreement with the biophysical scales from Wimley and White for the partitioning of amino acids into the membrane-interior. We observe that the stability and integration of single TM helices is improved by adding neighboring helices. We describe a topology study, in which all possible forms of the Y4 receptor were made so that the entire receptor is truncated from the N-terminus by one TM helix at a time. We discover that proteins with an increasing number of helices assume a more defined topology. In a parallel study, we focused on the role of extracellular loops in ligand recognition. We demonstrate that transferring all loops of the human Y1 receptor onto the E. coli outer membrane protein OmpA in a suitable topology results in a chimeric receptor that displays, albeit reduced, affinity and specificity for the cognate ligand. Our data indicate that not all TM helices will spontaneously insert into the helix, and we suggest that at least for some GPCRs, N-terminal segments might remain associated with the translocon until their interacting partners are biosynthesized.
While the first low-resolution data emerged from cryo-electron microscopy in 1993![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 4 the scientific community had to wait until 2000 for the first crystal structure to appear, the structure of bovine rhodopsin.5 Since then we have witnessed the publication of new structures at an ever-accelerating pace culminating in the Nobel prize in chemistry awarded to Brian Kobilka6 and Bob Lefkowitz7 in 2012. The major breakthroughs in this area of structural biology have recently been reviewed by Grisshammer.8
4 the scientific community had to wait until 2000 for the first crystal structure to appear, the structure of bovine rhodopsin.5 Since then we have witnessed the publication of new structures at an ever-accelerating pace culminating in the Nobel prize in chemistry awarded to Brian Kobilka6 and Bob Lefkowitz7 in 2012. The major breakthroughs in this area of structural biology have recently been reviewed by Grisshammer.8
Early studies of folding of GPCRs and GPCR-like proteins comprise the seminal work of the Khorana group that demonstrated that bacteriorhodopsin (bR) can be refolded from its fully denatured state.9 While refolding of most GPCRs from the denatured state is difficult but not impossible, no structure of a refolded GPCR has been published so far. Most of the folding studies of GPCRs rather focused on unfolding them in chaotropic agents10 using a number of biophysical or computational11 techniques to characterize the folding intermediates and to detect folding cores. However, most of the experimental techniques cannot deliver dynamical information at atomic resolution with the notable exception of single-molecule techniques such as atomic force microscopy (AFM). Those can be used to map overall conformal preferences of membrane proteins, even if they do not adopt a unique topology.12–14
The potential of investigating dynamical features of GPCRs by NMR has been recognized early on, but studies of GPCRs by NMR are largely hampered by technical issues.15 The seminal paper by the Nietlispach group reporting on sensory rhodopsin16 and the structure of proteorhodopsin17 have indicated that solution NMR in principle is capable of determining the structures of 7-TM membrane proteins. In addition, GPCRs labeled with only certain amino acid types18–22 or receptors covalently linked to fluorine moieties23,24 have been used to investigate aspects of receptor activation.
In this review we summarize our work on the use of truncated GPCRs, here referred to as “fragments”, to study interhelical contact formation in neighboring TM helices, and ligand recognition by extracellular loops and the N-terminal domain. The different approaches we followed for using fragments are depicted in Fig. 1.
We focus on examples from two GPCR families: the human Y receptors and the yeast mating pheromone receptor Ste2p. The Y receptor family consists of four members (Y1, Y2, Y4 and Y5), which all are class A receptors that bind to peptides of the NPY family. The Y receptors mediate vasoconstriction, stimulation of food intake, intestinal functions, regulation of circadian rhythms, and the release of pituitary sex hormones.25 We began by studying the binding mechanisms of the NPY ligands to the Y receptors. We discovered that binding is orchestrated via a two-step mechanism, which includes first association to the cellular membrane and a second recognition by the extracellular N-terminal domain and by the extracellular loops of the GPCRs26 following the membrane compartment model initially postulated by Schwyzer.27,28 For these reasons, the peptides of the NPY family represent an excellent class of ligands whose binding to membranes or membrane mimetics can be studied in great detail by NMR. We further used the Y4 receptor to study interhelical contact formation in neighboring TM helices. The S. cerevisiae Ste2p receptor binds the tridecapeptide α-factor mating pheromone, resulting in growth arrest and gene regulation in preparation for mating.29 Studies on this receptor were carried out in collaboration with the groups of Naider and Becker, and largely concern the question of how uncompensated polar residues at central helix position influence protein topology.
The membrane compartment model indicates that the extracellular N-terminal domains or the extracellular loops of the Y receptors may interact with their ligands. For this reason, we produced the peptides corresponding to the N terminal domains of the Y-receptors and studied by NMR their interaction with their ligands (Fig. 1A). We then created a model for the extracellular loops of the Y1 receptor by grafting the latter in a topologically related fashion onto a well-behaved membrane protein scaffold derived from the bacterial outer membrane protein OmpA (Fig. 1A).30,31 This allowed us to study recognition of the peptide ligands by the loops.32,33 Subsequently, we made 2–3 TM fragments to study the importance of inter-helical contacts (Fig. 1B), and complemented this approach by studying folding of a series of overlapping N-terminal fragments of increasing length (Fig. 1C) by NMR. We further complemented the NMR data with studies on the orientation of these fragments in the inner membrane of E. coli. Finally, we briefly mention the use of complementary fragments to make a mimic of a segmentally labeled entire GPCR.
The two-stage membrane-protein folding model proposed by Popot and Engelman postulates that segments of membrane proteins form helical structures upon partitioning into the water–membrane interface.41 These helices subsequently insert into the hydrophobic core of the membrane and diffuse within the bilayer until contacts with other helices are made eventually leading to the assembly of the helical bundle. If secondary structure in membrane environments is predominantly determined by sequence, and does not depend on the formation of tertiary contacts, it should be retained in fragments. As described below in more detail, we usually observe good agreement between the location of helical segments and structural models of our GPCRs, an indication that formation of secondary structure of membrane proteins is indeed primarily encoded in the immediately neighboring amino acid sequence.
Is tertiary structure correctly retained in fragments? A number of features were proposed to be important for the formation of inter-helical contacts.42–45 Those are for example shape-complementary amino acids, e.g. the knobs-in-hole arrangement in GXXXG motifs.46,47 Another way to mediate helical contacts is through formation of aromatic sidechain interactions48 or via formation of hydrogen bonds.49 The latter requires the presence of polar residues in sequence locations where they would not be expected based on a match of hydrophobicity with the surrounding lipid environment. It is obvious that exposure of polar residues to lipids will hamper proper integration of the corresponding stretch into the membrane, and will additionally destabilize secondary structure.50 The location of polar residues situated within the membrane is important since it provides specificity for interhelical contacts, which should be taken into account by selecting the fragment such that most of the polar residues find their interaction partners within the same fragment if possible. On the other hand, looking at fragments also offers the possibility to study the behavior of an N-terminal segment of the protein in the absence of the remainder of the protein, ideally mimicking what happens during early biogenesis events within the lipid bilayer.
Experience from our and other laboratories indicates that it is possible to generate milligram quantities of purified GPCR fragments from 1–2 L of E. coli culture, even when growing cultures with minimal media in deuterated water.50–57 We have observed that some GPCR fragments are well produced in E. coli, while others require the use of an N-terminal fusion partner to increase expression. Among the N-terminal fusion proteins that we have employed to increase expression are the ΔTrp-leader sequence58 or Mistic.59
During protein biogenesis in E. coli, GPCR fragments are intrinsically prone to aggregation due to their hydrophobic nature and lack of a recognition sequence for the Sec translocon. Thus, they accumulate in the form of inclusion bodies. However, inclusion bodies can be solubilized with the aid of organic solvents such as trifluoroacetic acid (TFA),60 or detergents such as SDS or fos-choline-12,51 among other strategies followed in our group to solubilize insoluble proteins.61,62
Once GPCR fragments are solubilized from inclusion bodies, we performed a chromatographic separation in presence of denaturants, such as reverse-phase column or ion metal affinity chromatography (IMAC), followed by dialysis against water to remove the denaturant, and lyophilization. Purity and the correct mass of the protein obtained is verified by mass-spectrometry. This procedure offers the unique advantage that the lyophilized material can be solubilized in the required volume of buffer containing detergent, a prerequisite to obtain reproducible NMR measurements. Furthermore, solubilization of lyophilized material into small volumes of buffers minimizes the use of expensive deuterated detergents.
We have experienced that some GPCRs fragments could not be efficiently re-solubilized after lyophilization. This prompted us to investigate the expression into the inner membrane of E. coli. GPCRs lack N-terminal signal sequences for insertion into the membrane, but expression of functional GPCRs in E. coli sometimes can be achieved by using N-terminal fusion partners such as the maltose binding protein (MBP),63 or Mistic.59 The use of N-terminal fusions proved to be useful for GPCR fragments, and allowed us to investigate how the transmembrane helices of the fragments are oriented in respect to the lipid bilayer of the E. coli inner membrane (vide infra).
For larger ligands, and in particular in case of peptide ligands, photoaffinity-labeling studies revealed that contact points between receptors and their ligands can be assigned to the extracellular face of GPCRs, which is comprised of an N-terminal domain and three extracellular loops (ECLs) of variable size.69–71 To investigate this aspect we have studied the interactions of peptides of the NPY family with isolated N-terminal domains72 or with a chimeric receptor in which all extracellular loops as well as the N-terminal domain were grafted onto a β-barrel scaffold, the membrane protein OmpA.32,33
Since formation of proper tertiary structure per se is questionable in fragments, it is highly unlikely that the loop conformations in isolated TM fragments will be meaningful. Grafting the loops onto a suitable scaffold could allow for the simultaneous display of several (ideally all) binding epitopes of a GPCR in approximately correct relative orientations and could hence serve as a suitable GPRC-mimic for biophysical or structural studies. We therefore selected a scaffold that can accommodate at least three extracellular loops of the Y1 receptor, can easily be biosynthesized and handled, resides in the membrane, and also can be characterized by solution NMR.
In the resulting chimeric receptor, we grafted the loops of the Y1 receptor onto the β-barrel scaffold derived from the E. coli outer membrane protein A (OmpA) by replacing three of its four loops at one side of the barrel by the Y1 receptor extracellular loop sequences (Fig. 3).32,33 The three loops can be transferred in various arrangements onto the scaffold. We ranked the different topomers according to how similar distances of loop-anchoring residues are with respect to known GPCR structures and chose to biosynthesize four candidates that most closely resemble the native receptors. All of those constructs expressed well and could be refolded. Using chemical shift mapping and saturation-transfer NMR techniques we demonstrated that the chimeric receptor recognizes the native ligand NPY in a specific manner.32 We have also transferred the extracellular domain onto the remaining accessible β-barrel scaffold loop acceptor position, but could not detect an increase in binding affinity.33 The affinity between these chimeric Y1 receptors and NPY are in the micromolar range, which is 2–3 orders of magnitude weaker than the interaction between NPY and the native Y1 receptor. This discrepancy can be attributed to differences in the topology of the chosen template, in particular in different relative locations of the loop anchoring points. Alternatively, X-ray structures of GPCRs with peptide ligands,75 unknown at the outset of our study, later revealed that additional interactions formed between the ligand and TM residues are involved in ligand binding.
Grafting loops of GPCRs onto soluble scaffolds has been successfully demonstrated in case of the RXFP1 receptor76,77 using the B1 immunoglobulin binding domain of the streptococcal protein G (GB1) as the scaffold. Similarly, the N terminal domain and the third extracellular loop of the PTH receptor were grafted onto the four-helix bundle protein TM1526 from the thermophilic archaea bacteria Thermotoga maritime.78 Last but not least it was also demonstrated that a polypeptide, in which the extracellular loop sequences of the CXCR4 receptor were connected via appropriate spacers recognizes gp120 with good affinity.79,80
Initially, we determined the structure of TM1 in LPPG/DPC mixed micelles. Instead of one continuous helix our NMR analysis revealed that this fragment is composed of two separate, rather well-defined short helices comprising residues Thr48–Phe55 and Ala61–Ile71 (Fig. 4C). These two helical segments are spaced by the amino acid sequence GSRVG. The orientation of the two short helices relative to each other was poorly defined.
When adding residues from the second TM, we detected interhelical contacts between TM1 and TM2 via NOEs between methyl groups, and TM1 now formed a continuous helix. This observation indicates that the presence of TM2 stabilizes secondary structure (around the GXXXG motif) in TM1 when compared to the isolated TM1, resulting in a helical hairpin for TM12 (Fig. 4C and D).
Unfortunately, the fragment TM123 is rather unstable and tends to aggregate. However, backbone assignments of TM123 were still possible, and both deuterium amide exchange and attenuation from soluble spinlabels indicated that there is water-access to the central part of TM3. This indicates that TM3 does not stably pack against TM1 and/or TM2. In contrast, the construct TM127, in which TM7 is artificially covalently linked to the loop of TM2, proved to be much more stable. Assuming that TM7 packs against TM1 and TM2 in the entire receptor,82 this fragment allowed us to investigate the formation of inter-helical contacts. Spectroscopic properties of TM1 and TM2 in TM127 changed only minimally, indicating that the helical hairpin is not significantly altered upon addition of TM7. However, TM7 similarly as TM3 does not stably pack against TM1 or TM2, but displays motions around an internal flexible hinge comprised by the central Leu-Pro-Leu tripeptide motif. In contrast to TM3, however, TM7 appears to be better integrated into the hydrophobic core, as we did not observe access of water to TM7.
Two principles emerged from that study that are particularly worth mentioning: (i) the stability of secondary and tertiary structure depends to a large extent on the presence of polar or charged residues in regions that are placed within the membrane interior. (ii) Fragments in general do not stably insert into the membrane.
We like to briefly mention that we started our work on fragments of the Ste2p receptor with TM7, using a construct that also comprises 40 residues from the C-terminal intracellular domain.83 The protein was expressed and purified in the lab of Fred Naider, and we determined the structure in DPC micelles. Similarly to TM1, the NMR data revealed that TM7 is not stably folded but rather exists in form of two helical stretches interrupted around a central Pro residue. The two helix portions are not stably anchored in the hydrophobic core of the micelles but transiently buried.
All NMR studies described above are performed using detergent micelles as membrane mimetics, and these have been used also for many crystallographic studies of membrane proteins. Detergent micelles, however, do not perfectly reflect planar bilayers as encountered in biological membranes nor do they contain non-lipid components such as other membrane proteins (integral or peripheral) or glycolipids. To obtain data on partitioning into real biological membranes we have collaborated with the group of Ismael Mingarro from Alicante University. His group performed assays on liver microsomes, relying on the absence or presence of glycosylation to probe for the periplasmic location of the C-terminus of the fragments.81 The conclusion from that work was that TM1 is not integrated into the membrane but TM12 and TM123 or TM127 mostly assumed native-like topology. A peptide corresponding to the more hydrophobic TM2 did insert into the membrane, and replacing Arg-58 at the center of TM1 by Leu or Ala also resulted in insertion. These studies indicate a pivotal role of uncompensated polar or charged residues within central locations of TM helices for folding and membrane-integration.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 51 to obtain a series of overlapping polypeptides that would span the entire Y4 receptor. We managed to obtain close-to-complete backbone chemical shift assignments for most residues, both from loops as well as from the TM helices, with the notable exception of most residues from TM3 and TM4. Secondary chemical shifts of assigned TM residues indicate that the secondary structure in the putative TM portions is correct in the 2-TM fragments. We suspect that resonances from residues TM3 and TM4 were broadened through conformational exchange, reflecting the fact that TM helices often do not pack stably against each other in fragments. However, conformational broadening is also observed on entire receptors and the intrinsic instability is of importance for the mode of receptor activation.
51 to obtain a series of overlapping polypeptides that would span the entire Y4 receptor. We managed to obtain close-to-complete backbone chemical shift assignments for most residues, both from loops as well as from the TM helices, with the notable exception of most residues from TM3 and TM4. Secondary chemical shifts of assigned TM residues indicate that the secondary structure in the putative TM portions is correct in the 2-TM fragments. We suspect that resonances from residues TM3 and TM4 were broadened through conformational exchange, reflecting the fact that TM helices often do not pack stably against each other in fragments. However, conformational broadening is also observed on entire receptors and the intrinsic instability is of importance for the mode of receptor activation.Interestingly, in contrast to the corresponding fragment from the Ste2p receptor the structure of TM12 of the Y4 receptor did not show tertiary contacts.50 Secondary structure is largely formed, however, and interrupts occur close to Gly, Pro or polar residues. A comparison of these two structures is depicted in Fig. 5. Please note the much more frequent occurrence of polar residues in TM1 or TM2 of the Y4 receptor when compared to the Ste2p receptor.
In principle, one could try to construct an entire receptor from two complementary fragments. If such a fragment complex would be capable of still binding the ligand, the method would facilitate the NMR resonance assignment tremendously. Seminal work by the Khorana lab, which reconstituted active bacteriorhodopsin from complementary fragments, indicated that this should indeed be feasible.85 Similarly Dumont could reconstitute the Ste2p receptor from several pairs of complementary fragments.38 Note that in the work by the Dumont group the two fragments were co-expressed, and the Khorana group used cleavage of reconstituted bacteriorhodopsin to generate the fragments. We have tried to mix separately expressed fragments of the Ste2p receptor, e.g. TM12![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 86 with TM34567,87 but so far failed to develop protocols to form protein complexes from fragments individually dissolved in detergents.
86 with TM34567,87 but so far failed to develop protocols to form protein complexes from fragments individually dissolved in detergents.
To compare the data obtained in vitro by NMR with the topology of fragments in a cellular membrane environment, and to possibly determine whether the lack of interhelical contacts that we observed for TM12 of the Y4 receptor was influenced by the detergent used for NMR (Fig. 5), we have performed a topology analysis by truncating the entire receptor by one helix at a time, starting from both the N-terminus and the C-terminus.89 We used GFP as a reporter for cytoplasmic and alkaline phosphatase (PhoA) as a reporter for periplasmic localization.88 A schematic of the approach is presented in Fig. 6A and B.
|  | ||
| Fig. 6 Topology analysis of Y4 GPCR fragments of increasing length into the inner membrane of E. coli. (A) Schematic representation of the assay used to study the topology of membrane insertion. When the C-terminus of a membrane protein is located in the periplasm, fusion to alkaline phosphatase (PhoA) allows detection of enzymatic activity. In case of cytoplasmic location of the C-terminus, GFP fluorescence can be measured. (B) Schematic representation of the Lep-fusion to TM1 and TM12 of the Y4 receptor and their possible topologies in the E. coli inner membrane, considering the distribution of positive and negative charges in the loops and the occurrence of polar or charged residues within the TM helices. (C) Experimental results of the topology assay of the Lep-fusions constructs, reporting the Y4 fragments of increasing length starting from the N-terminus of the Y4 receptor. (D) Schematic representation of the Mistic-fusion to TM1 and TM12 of the Y4 receptor, and possible topologies considering the distribution of positive and negative charges in the loops. (E) Experimental results of the topology assay for fragments of the Y4 receptor fused to Mistic. (F) Free energies for partitioning the entire TM peptide into the membrane using the scale from Wimley and White.92,93 | ||
Since the N-terminus is likely either in the periplasmic or in the cytoplasmic compartments for the N-terminally truncated constructs, depending on how many helices were removed, we have used two different N-terminal fusion proteins to perform the study: the transmembrane region of the bacterial peptidase Lep was used to ensure proper periplasmic orientation,90 while Mistic was used to ensure cytoplasmic orientation (Fig. 6B and D).
The information acquired by combining the results from the GFP and PhoA reporters allowed quantifying the fraction of expressed protein that folds in a topology that resembles the organization in the entire receptor. Interestingly, we could observe that N-terminal fragments do not acquire a unique topology, however, the fraction displaying the correct localization of the C-terminus increased with the number of helices that were added. A scheme displaying the possible topologies for the two short fragments TM1 and TM12 of the Y4 receptor, and experimental data on the location of the C-terminus are depicted in Fig. 6B and D.
In case of TM1, two topologies are possible for each fusion (Fig. 6B): a dual topology is observed in case of fusion with Lep (Fig. 6B and C), while a predominant cytosolic location exists for Mistic-TM1 (Fig. 6D and E). Conversely, for TM12 when fused to Lep, it predominantly adopts a periplasmic location (Fig. 6B and C), while when fused to Mistic it adopts dual topology (Fig. 6D). Possible orientation of the TM helices with respect to the E. coli inner membrane, considering the positive inside rule91 are reported (Fig. 6B and D). Our topology data are in good agreement with predictions of insertion based on the free energies of transferring entire TM helices into the membrane (Fig. 6F).92,93 They reveal that the first three helices, and in particular TM1, are unlikely to insert, while the sequences of TM4–6 strongly favor insertion.
In eukaryotes membrane protein biosynthesis occurs at the ribosome in the ER starting at the N terminus from which the nascent chain is transferred via the signal recognition particle (SRP), which targets the nascent chain to the ER inner membrane.94,95 From the SRP the chain is transferred to the large channel of the translocon machinery (Fig. 7A). In the original co-translational folding model, the emerging polypeptide chain is translocated or partitioned into the hydrophobic environment of the membrane bilayer, where the translocon decides which parts of the chain to translocate and which to insert.96–98 The insertion of helices into hydrophobic environments depends on biophysical properties of the amino acid sequence in the corresponding stretch. Wimley and White experimentally determined the free energy of transferring amino acids into the membrane interior or the membrane interface.92,93 Similarly, von Heijne has established an in vivo translocon-based scale, that qualitatively largely agrees with the in vitro data.99,100 Both scales qualitatively describe the membrane integration behavior and conformational preferences surprisingly well.
|  | ||
| Fig. 7 Models for co-translational folding. (A) TM helix segments are immediately released into the hydrophobic core, remain there and diffuse in the membrane until proper inter-helical contacts are made. (B) Immediate release of TM segments into the hydrophobic core. Depending on hydrophobicity some will locate in the interface. Once all TMs are made and helices transiently located in the interface, they will fully insert when proper contacts are made. (C) TM helices remain associated within the translocon until all helices are made, and then the bundle is transferred into the hydrophobic core. In (1) the TMs accumulate inside the translocon, while in (2) they are stored outside the translocon such that the hydrophilic sides (shown in blue) point into the interior of the channel. Figure modified from Skach.94 | ||
Engelman and Popot in their two-stage model for folding of helical membrane proteins41,101 postulated that helices spontaneously form once the segments coding for the TM helices insert into the membrane. This is the case because formation of secondary structure prevents that polar moieties of the peptide bond form unfavorable contacts with lipids. Following the coupled insertion/folding step helices diffuse in the membrane until the proper interhelical contacts are formed. In this simplest form the model requires that helices remain inserted until the entire chain has been biosynthesized, all inter-helical contacts are formed, and folding is completed. Recent evidence, however, indicates that individual helices of integral membrane proteins are often marginally stable in the hydrophobic interior,102 and many helices are not even expected to insert at all. The reason for this at first sight surprising property is the fact that TM helices often contain polar or even charged residues – these mediate the specific inter-helical contacts that drive the assembly of the helical bundle.103–106 It is a legitimate question to ask what happens with these marginally stable TM helices when other helices that provide the compensating interactions are not present because they have not been synthesized yet. Will they remain in the hydrophobic interior, or will they become surface associated? If they partition into the interface, will they fully re-insert once the missing TMs have been synthesized? Or do they need to be stored close to the translocon until all parts of the chain have become available?
We propose that these questions can be best addressed when studying protein fragments. These offer the conceptual advantage, when compared to the study of the entire proteins under slightly denaturing conditions, that parts, which are biosynthesized later, are not contained in the construct and hence cannot form any interactions. Solution NMR studies of fragments in membrane-mimicking environments allow studying structural details of early folding intermediates. In this review we have outlined our efforts to study folding and recognition of GPCRs from fragments.
To summarize, we have learnt that secondary structure in these fragments largely corresponds to the prediction from homology models. Hence, secondary structure is likely encoded in the local sequence, and does not completely depend on the formation of tertiary contacts as in the case of soluble proteins. In contrast, tertiary contacts are mostly not formed, and the occurrence and persistence of these contacts is related to the location and number of polar or charged residues located within the hydrophobic compartment of the membrane. The work on TM1, TM12 and TM123 or TM127 of Ste2p indicates that longer fragments integrate better into the membrane and more likely form tertiary structure. Similarly, the systematic topogenesis study of the series of Y4 receptor fragments clearly indicates that fragments containing more TM helices insert more stably and in a more defined topology. Importantly, TM1 of the Y4 receptor is predicted to not partition into the membrane,107 and when taking this feature into account the topology of the truncated fragments can be predicted rather reliably (see Fig. 7).
Inside the translocon, hydrophobic segments are laterally gated through a cleft into the membrane interior.108 Which segments are partitioned into the membrane interior is believed to be primarily decided based on hydrophobicity, a property that is reflected in the biophysical properties of the amino acid sequence.109 If TM helices are sufficiently hydrophobic they will remain inserted in the membrane and diffuse until the native contacts are made and the protein has assumed its final topology (Fig. 7A). However, when TM helices contain charged or many polar residues they may not properly insert and rather become surface associated. Once all remaining TM helices are released by the translocon, these surface-associated helices may form the compensating interactions and re-insert (topological maturation94) (Fig. 7B). Alternatively, TM segments may remain within or near to the translocon (Fig. 7C) to be released into the hydrophobic interior only once the entire polypeptide chain has been made (Fig. 7C).94
Interesting experimental results concerning cellular mechanisms of α-helical membrane protein biogenesis were obtained on the two closely related water-channel proteins aquaporin-1 (AQP1) and aquaporin-4 (AQP4). Using truncated fragments it was demonstrated that TM segments of AQP4 leave the translocon one after another in a perfectly sequential mode, and insert into the membrane in the correct topology (corresponding to Fig. 7A).110,111 In contrast, TM2 of AQP1 does not terminate translocation and therefore is partitioned into the ER lumen instead of into the membrane resulting in a four-TM instead of the native 6-TM bundle.112 The group of Dowhan has demonstrated how changes in the composition of the phospholipids switch the topology of helical membrane proteins in a non-assisted manner,113,114 and hence spontaneous topology maturation may occur (Fig. 7B). On the other hand, photo-crosslinking experiments indicated that TM helices might form either specific or nonspecific contacts with residues from the translocon,115 and that TM helix bundles could leave the translocon and enter the lipid bilayer in a concerted manner.110,116–118 Using chemical cross-linking it was also shown that large portions of opsin remain bound or associated to the translocon during ribosomal synthesis.119 Recent data on the structure of a ribosome-nascent chain translocon complex revealed that TM1 and TM2 of proteorhodopsin inserted into the membrane close to the lateral gate.120 Based on this observation von Heijne and White have suggested that TM helices never fully insert into the translocon but rather slide along the lateral gate.121 Possibly, polar side chains may then still point into the translocon channel in order to escape forming contacts to lipids, while most of the hydrophobic part would be placed rather outside the translocon (Fig. 7C (path 2)). We noticed that such a scenario would help to reduce space requirements for all TMs in the translocon interior. Our data will not allow deciding on the exact details of Ste2p or Y4 GPCR biogenesis, but they provide molecular details of early states during the folding in a valid environment and help understanding the underlying biophysics.
In principle, recognition of peptide ligands by their GPCRs again concerns the interaction of polypeptide chains in a lipid environment, this time of course for separate chains. Our studies revealed interactions of the pancreatic polypeptide PP with the N-terminal extracellular domain. These suggest that the ligand is transferred via a fly-casting mechanism from the membrane-associated state into the ligand-binding pocket, which is mostly formed by the extracellular loops. In fragments the orthosteric ligand-binding site is either not present or not fully formed, and hence this weak interaction, while likely short-lived in the entire receptor, can still be observed. The grafting work demonstrates that extracellular loops provide binding affinity for the ligands when provided in an approximately correct topology. However, the work also showed that the binding affinity for the grafted model is significantly lower than for the entire receptor. This work therefore highlights a constant dilemma when working with fragments – they provide much conceptual insight into folding or binding but can never fully reproduce the natural system. Nevertheless, we believe that the information gained on these systems justifies using fragments to address specific questions of folding and binding.
| This journal is © The Royal Society of Chemistry 2018 |