G-Protein coupled receptors: structure and function in drug discovery

Chiemela S. Odoemelam; Benita Percival; Helen Wallis; Ming-Wei Chang; Zeeshan Ahmad; Dawn Scholey; Emily Burton; Ian H. Williams; Caroline Lynn Kamerlin; Philippe B. Wilson

doi:10.1039/D0RA08003A


This Open Access Article is licensed under a Creative Commons Attribution-Non Commercial 3.0 Unported Licence

DOI: 10.1039/D0RA08003A (Review Article) RSC Adv., 2020, 10, 36337-36348


Chiemela S. Odoemelam^a, Benita Percival^a, Helen Wallis^a, Ming-Wei Chang^b, Zeeshan Ahmad^c, Dawn Scholey^a, Emily Burton^a, Ian H. Williams^d, Caroline Lynn Kamerlin^e and Philippe B. Wilson*^a
^aNottingham Trent University, 50 Shakespeare St, Nottingham NG1 4FQ, UK. E-mail: philippe.wilson@ntu.ac.uk
^bNanotechnology and Integrated Bioengineering Centre, University of Ulster, Jordanstown Campus, Newtownabbey, BT37 0QB, Northern Ireland, UK
^cDe Montfort University, The Gateway, Leicester, LE1 9BH, UK
^dDepartment of Chemistry, University of Bath, Claverton Down, Bath, BA1 7AY, UK
^eDepartment of Chemistry – BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden

Received 20th July 2020, Accepted 22nd September 2020

First published on 1st October 2020

Abstract

The G-protein coupled receptor (GPCR) superfamily comprises similar proteins arranged into families or classes, and is one of the largest in the mammalian genome. GPCRs take part in many vital physiological functions, making them targets for numerous novel drugs. GPCRs share some distinctive features, such as the seven transmembrane domains, but differ in the number of conserved residues in their transmembrane domains. Here we provide an introductory and accessible review detailing the computational advances in GPCR pharmacology and drug discovery. An overview is provided of family A–C GPCRs: their structural differences, GPCR signalling, allosteric binding and cooperativity. The dielectric constant (relative permittivity) of proteins is also discussed in the context of site-specific environmental effects.

Background

The G-protein coupled receptor (GPCR) superfamily consists of structurally similar proteins arranged into families (classes), and is one of the most abundant protein classes in the mammalian genome.^1–5 GPCRs undertake a plethora of essential physiological functions and are targets for numerous novel drugs.^4,5 Their ligands are structurally heterogeneous, including natural odorants, nucleotides, amines, peptides, proteins, and lipids.⁴ The conserved structure of GPCRs consists of seven transmembrane domains (TMD), each of approximately 25–35 successive amino acid residues that show moderately high hydrophobicity⁴ and form α-helices which span the plasma membrane.⁴ The primary function of GPCRs is the transduction of extracellular stimuli into intracellular signals.² Currently, approximately thirty to forty percent of marketed pharmaceuticals target GPCRs.^1,6–10 Hence, there is enormous potential for the development of new drugs targeting these receptors.³ Examples of drugs targeting GPCRs include histamine receptor blockers, opioid agonists, β-blockers and angiotensin receptor blockers.⁵ Computational biology methods are currently being employed to understand GPCRs as drug targets.^6,11,12 Breakthroughs in GPCR crystallography have facilitated novel discovery through virtual screening as well as better off-target rationalisation.⁶ Recently, the Tikhonova group developed a computational protocol which combines concepts from statistical mechanics and cheminformatics to explore the flexibility of the bioamine receptors and to identify the geometrical and physicochemical properties which characterise the conformational space of the bioamine family.¹³ Multiple-microsecond timescale molecular dynamics (MD) simulations have been used to capture the process of several drugs binding to β₁- and β₂-adrenergic receptors.¹⁴ Molecular docking is one of the most commonly used methods in GPCR structure-based drug design (SBDD).¹⁴ Esguerra et al. developed GPCR-ModSim, a web-based portal designed specifically for the homology modelling and MD simulation of GPCRs.¹⁵

It was historically assumed that GPCRs exist in two conformations: active and inactive.^16–18 The long-established extended ternary-complex model of GPCR-driven signalling was based on this concept.^16,19,20 This model suggested that the active GPCR conformation favoured by G-protein-coupled receptor kinases (GRKs), arrestins and G proteins is uniform.¹⁶ Nevertheless, biophysical investigations with a refined, fluorescently labelled β₂-adrenergic receptor (β₂AR) demonstrated that a receptor can exist in numerous conformations and that the conformational equilibrium is influenced both by the bound ligand and by the proximity to the related G protein.¹⁶

The human genome alone contains approximately 800 GPCRs, making them the largest family of membrane proteins.^5,21 GPCRs have been classified based on structural and physiological features.⁴ Some classification systems group them by the location of the ligand binding pocket, while others use both structural and physiological properties.^4,22 The A–F system was the first classification system to be introduced; it was proposed by Kolakowski in 1994 as classes A–F and O for the (now obsolete) GCRDb database.²³ The GCRDb system was further developed by Horn et al., leading to the GPCRDB database,^24,25 with the rhodopsin family (class A) being the largest and consisting of four main groups (α, β, γ, and δ) and 13 sub-branches.^4,23,24 All GPCRs comprise seven TMD helices (Fig. 1), alongside an eighth helix and a palmitoylated cysteine at the C-terminal tail.²⁶


	Fig. 1 A schematic representation of a GPCR showing the transmembrane domains, N-terminus, C-terminus, the intracellular and extracellular loops (generated using GPCRDB Tools, https://gpcrdb.org/).²⁷

The diversity of GPCRs has resulted in a perceived difficulty in developing a comprehensive classification system.⁵ The A–F system orders the GPCRs into six classes on the basis of their sequence homology and functional similarity, namely: family A (rhodopsin-like receptors), family B (secretin receptor family), family C (metabotropic glutamate receptors), family D (fungal mating pheromone receptors), family E (cyclic AMP receptors) and family F (frizzled and smoothened receptors).⁵ Based on phylogenetic studies, human GPCRs have been classified under a system called “GRAFS”, which comprises five main families, namely glutamate (G), rhodopsin (R), adhesion (A), frizzled/taste2 (F), and secretin (S).^4,21,26 The major difference between the two systems concerns the additional division of family B into the adhesion and secretin families within GRAFS.²⁶ This division was based on early findings describing a distinct evolutionary history for each of the two families.²⁶
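The relationship between the two systems can be summarised as a simple lookup. The sketch below is illustrative only: the dictionary and function names are hypothetical, the one-to-one pairing of classes A, C and F with the rhodopsin, glutamate and frizzled/taste2 families is a simplifying assumption, and classes D and E are left empty on the assumption that they have no human members and therefore do not appear in the human-based GRAFS system.

```python
# Illustrative (simplified) correspondence between A-F classes and GRAFS families.
# Assumption: classes D and E have no human members, so they map to no GRAFS family.
A_F_TO_GRAFS = {
    "A": ["rhodopsin"],
    "B": ["adhesion", "secretin"],  # family B is split in two within GRAFS
    "C": ["glutamate"],
    "D": [],
    "E": [],
    "F": ["frizzled/taste2"],
}


def grafs_families(af_class):
    """Return the GRAFS families assumed to correspond to an A-F class letter."""
    return A_F_TO_GRAFS[af_class.upper()]


if __name__ == "__main__":
    for letter in sorted(A_F_TO_GRAFS):
        print(letter, "->", grafs_families(letter) or "no human members")
```

Keeping the values as lists makes the single structural difference described above, the split of family B, explicit in the data rather than in special-case code.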

Family A (rhodopsin-like receptors)

The rhodopsin receptor family (RRF) is the largest of the GPCR families, comprising approximately 680 members, and accounts for 80% of GPCRs in humans.^4,28 The RRF is classified into four groups (α, β, γ, δ) and 13 main subdivisions,^4,29 and it has numerous characteristics which indicate a common ancestry.^4,29 These characteristics include the DRY motif situated at the border between TM3 and intracellular loop (IL) 2 and the NSxxNPxxY motif in TM7 (Fig. 2).^4,29 The N-terminal region of family A GPCRs is situated extracellularly,^29,30 while the C-terminus is located within the cytoplasm (Fig. 3).^29,30 The ligand binding site is located within the extracellular region of the TMD bundle.²⁹


	Fig. 2 Schematic diagram showing the structure of family A GPCRs generated using ClustalW.³¹ Reprinted with permission from Springer Nature: Springer Nature, Nature Reviews Drug Discovery, Structural diversity of G protein-coupled receptors and significance for drug discovery, M. C. Lagerström and H. B. Schiöth, Copyright (2008). The upper section shows the differences in the secondary structure of the N termini of the family A receptors.³¹ The scissor image indicates the cleavage site of the protease-activated receptors, whilst in the lower part of the image the schematic TMD regions show the consensus of an alignment generated using ClustalW 1.82.³¹ In addition, the area circled in red describes the elliptical orientation.³¹ Residues conserved in all eight sequences are displayed as circles, in which conserved aromatic residues are shown in purple, polar in orange, aliphatic in beige, positively charged in red and negatively charged in blue.³¹


	Fig. 3 Illustration showing the modification of rhodopsin and its orientation in membranes.³⁰ Reprinted with permission from Annual Reviews: Annual Reviews, Annual review of biochemistry, G protein–coupled receptor rhodopsin, K. Palczewski, Copyright (2006). (a) Two-dimensional illustration of rhodopsin. The polypeptide of rhodopsin is seen to cross the membrane seven times, with C-I, C-II, C-III corresponding to the cytoplasmic loops and E-I, E-II, E-III to the extracellular loops. The yellow cylinders represent the transmembrane region. (b) Location of the chromophore and the charges on the extracellular and cytoplasmic surfaces of rhodopsin. Red and blue colours represent negatively and positively charged residues respectively, while the location of the chromophore is revealed by deleting fragments of the transmembrane helices.³⁰

According to Palczewski, the arrangement of the seven TMD helices, which vary in length from 20 to 30 residues, is responsible for the overall elliptic, cylindrical shape of rhodopsin (Fig. 3).³⁰ The family A GPCRs vary greatly when their ligand preference and primary structure are considered.³¹ There is homogeneity in the N-termini of family A GPCRs but heterogeneity within the TMD regions;³¹ nevertheless, some of the family A GPCRs share specific sequence motifs within the TMD region.³¹

Palczewski reported the dimensions of rhodopsin as an ellipsoid of approximately 35 × 48 × 75 Å, with the long axis perpendicular to the membrane in the standard view.³⁰ The surface area of the section protruding from the membrane is approximately 1200 Å², with the cytoplasmic projection being larger in surface area and volume than the extracellular surface (Fig. 3b).³⁰ The TMD helices of rhodopsin are irregularly shaped due to the conformational changes associated with the Gly–Pro residues; they are also inclined at several angles relative to the anticipated membrane surface.³² Teller et al. reported that helix 1 is tilted from the membrane plane at 25° and contains a 12° kink as a result of the presence of Pro53.³² Helix 2 is kinked at an angle of 30° around Gly89 and Gly90, and the most significant bend, at an angle of 36°, is in helix 6 owing to the presence of Pro267.³²

Family B (secretin receptor family)

The family B GPCRs form a small group that binds large peptides through an extracellular hormone-binding site.³¹ The family name “secretin” derives from the secretin receptor, which was the first to be cloned in this family.³ In 1975, Sasaki et al.³³ solved the first X-ray crystal structure of glucagon, the peptide ligand of a family B GPCR.³⁴ The family corresponds to group B of the A–F system of classification,³ and comprises 15 members including: vasoactive intestinal peptide receptors (VIPR1, VIPR2), glucagon-like peptide receptors (GLP1R, GLP2R), adenylate cyclase activating polypeptide receptor (PAC1/ADCYAP1R1), growth-hormone-releasing hormone receptor (GHRHR), calcitonin and calcitonin-like receptors (CALCR, CALCRL), gastric inhibitory polypeptide receptor (GIPR), secretin receptor (SCTR), corticotropin-releasing hormone receptors (CRHR1, CRHR2), glucagon receptor (GCGR), and parathyroid hormone receptors (PTHR1, PTHR2).^3,31 These 15 receptors share between 21 and 67% sequence identity, and a large portion of the dissimilarity is found in the N-terminal sequence.^31,35 These receptors contain conserved cysteine residues in the first and second extracellular loops of the TMD regions (Fig. 4).³¹ In addition, the majority of the receptors within this family contain conserved cysteine residues that make up a cluster of cysteine bridges in the N-terminus.³¹ The binding profile of the secretin receptors is defined by three binding domains comprising the proximal region and the juxtamembrane region of the N-terminus, as well as the extracellular loops together with TM6 (Fig. 4).³¹ The ligand is thought to activate the receptor by spanning the N-terminus and the TMD extracellular loops, thereby mediating the active conformation of the receptor, which increases the probability of activation of the signalling units.³⁶
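To make the notion of pairwise sequence identity concrete, the following minimal sketch computes percent identity for two pre-aligned sequences. It is an illustration under stated assumptions rather than the procedure used in the cited studies: the function name is hypothetical, the toy fragments are invented and are not receptor sequences, and the denominator (columns in which neither sequence has a gap) is only one of several conventions in use.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumes the two sequences are already aligned (equal length, gaps as '-').
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # skip gapped columns under this convention
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Invented toy fragments, for illustration only.
    print(round(percent_identity("ACDEFGHIK-", "ACDQFGHLKW"), 1))
```

Because the result depends on both the alignment and the choice of denominator, identity values from different sources are only comparable when the same convention is used.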


	Fig. 4 Schematic diagram showing the structure of family B GPCRs generated using ClustalW.³¹ Reprinted with permission from Springer Nature: Springer Nature, Nature Reviews Drug Discovery, Structural diversity of G protein-coupled receptors and significance for drug discovery, M. C. Lagerström and H. B. Schiöth, Copyright (2008). The residues conserved in all 15 sequences are displayed as circles: the conserved polar residues are shown in orange, the aromatic residues in purple, the aliphatic residues in beige, and the positively and negatively charged residues in red and blue respectively.³¹ The uppercase letters show the completely conserved positions, the lowercase letters show the well-conserved positions (>50%), while the letter “x” shows the variable positions. The conserved sequence motifs which are found in the TMD of the family B GPCRs are surrounded by red boxes.³¹ The conserved cysteine residues are depicted as yellow circles, the cysteine bridges between EL1 and EL2 are shown as two straight lines, while the N-terminal cysteine bridges are drawn as lines.³¹

Members of the family B GPCRs are structurally similar: in addition to an extracellular N-terminal domain (ECD) of 120–160 residues, they possess a TMD region of 310–420 residues in which seven helices (TM1–TM7) are interconnected by three intracellular loops (IL) and three extracellular loops (EL).^37,38 According to Parthier et al., hormone recognition in family B GPCRs is believed to follow the ‘two-domain’ binding mode: the N- and C-terminal regions of the peptides interact with the J- and N-domains of the receptors respectively. That is, the C terminus of the peptide initiates peptide recognition by the ECD, allowing the peptide N terminus to bind the TMD ligand-binding pocket, which activates the receptor and prompts a downstream signalling cascade.^34,38–40 The presence of a conserved ECD structure and the ‘two-domain’ binding mode across the family B GPCRs suggests a similar receptor activation mechanism across the family.³⁸

The secretin receptors have immense potential in drug discovery due to their importance in fundamental homeostatic functions.^31,38 To date, three of these hormones (glucagon, parathyroid hormone and calcitonin) are used clinically for the treatment of hypoglycaemia, osteoporosis and hypercalcaemia, respectively.³¹ GLP1R and GLP2R are particularly relevant targets because of their role in appetite control and the treatment of type 2 diabetes.³¹

Family C (metabotropic glutamate receptors)

The family C GPCRs comprise the two γ-aminobutyric acid_B receptors (GABA_B receptors), odorant receptors in fish, eight metabotropic glutamate receptors (mGlu receptors or GRM), pheromone receptors, Ca²⁺-sensing receptors (CaS receptors or CASR), sweet and umami taste receptors (TAS1R1-3), GPCR class C group 6 member A (GPRC6A) and seven orphan receptors.^3,4,31,41 The taste receptors in this GPCR family are targeted by the taste additives used in the food industry.⁴¹ The CaS, mGlu and GABA_B receptors belong to a novel category of drug targets that are relevant to conditions which affect the central nervous system and calcium homeostasis.⁴² Currently, family C GPCRs are targeted by two therapeutic drugs on the market. One is Cinacalcet,^41–45 the first GPCR allosteric modulator to be marketed, which targets the CaS receptor. The other is Baclofen (now sold under the brand names Lioresal, Liofen, Gablofen, etc.), which is a GABA_B agonist used in the treatment of muscle spasms.^41,42,44–46

The family C GPCRs differ from other families in possessing a large extracellular domain, distal to the TMD, which contains the orthosteric sites; they also form constitutive dimers with unique activation mechanisms.⁴¹ Like the other families, family C GPCRs exhibit the typical motif of seven TMD helices; structurally, they comprise an unusually large extracellular domain, a heptahelical TMD and an intracellular carboxyl-terminal (C-terminal) domain (Fig. 5A).⁴¹ The extracellular domain includes a Venus flytrap module (VFT) and a cysteine-rich domain (CRD, absent in the GABA_B receptor).^31,41 The TM domain of family C GPCRs contains only allosteric binding sites, whereas the orthosteric sites are situated in the VFT module; this differs from other families, in which the TM domains are conserved.^41,44 Each domain of the family C GPCRs provides sites of ligand action, except the intracellular C-terminal domain, which is highly variable and plays an essential role in scaffolding and in coupling to signalling proteins.⁴¹ The family C GPCRs are unique in their obligatory dimerisation, either as heterodimers (GABA_B receptor and T1Rs) or as homodimers (mGlu and CaS receptors) (Fig. 5B).^41,47,48


	Fig. 5 Graphical illustration of family C GPCR structure.⁴¹ Reprinted with permission from Springer Nature: Springer Nature, Acta Pharmacologica Sinica, Structure and ligand recognition of class C GPCRs, L. Chun, W.-h. Zhang and J.-f. Liu (2012). (A) Structural organisation of family C GPCRs. Family C GPCRs have a distinctive structure which comprises a VFT with two lobes separated by an orthosteric binding pocket, a CRD (except in the GABA_B receptor) and a TMD. (B) Graphical illustration of two members of the family C GPCRs: the GABA_B receptor (heterodimer) and the mGlu receptor (homodimer). In the GABA_B receptors there is a direct link between the VFT and the TMD, and the two subunits, GABA_B1 and GABA_B2, form an obligatory heterodimer, while in the mGlu receptors the VFT connects to the TMD through the CRD. The mGlu receptors form homodimers which can potentially offer two orthosteric binding pockets per dimer.⁴¹

Structural differences

GPCRs share a common structural characteristic, the TMD region, with its intracellular C-terminus and extracellular N-terminus, which exhibits the greatest homology.^21,28,49,50 The intracellular loops which span TM5 and TM6, the amino terminus and the carboxyl terminus are among the most variable structures in GPCRs, with substantial variation observed in the amino terminus (N-terminus).^21,51 The N-terminal sequence is relatively short for peptide and monoamine receptors, comprising about 10–50 amino acids,^21,51 and larger for glutamate family receptors and glycoprotein hormone receptors (350–600 amino acids).^21,51 The largest amino terminal domains were observed in the adhesion family receptors.^21,51

Bortolato et al. compared crystal structures of family B and family A GPCRs using receptors from both classes (glucagon receptor, corticotropin-releasing factor receptor 1 (CRF₁) and dopamine D₃ receptor).⁵² Comparison of the CRF₁ and glucagon receptor crystal structures with that of the dopamine D₃ receptor, a family A GPCR, showed that their cytoplasmic regions superimposed well.⁵² However, the TM6 regions of both the glucagon receptor and CRF₁ extend outwards, while the cytoplasmic moieties are situated in proximity to the TM3 regions at sites similar to those in the dopamine D₃ receptor and other class A receptors.⁵² The family B GPCRs lack the direct connectivity between TM3 and TM6 which is regarded as the classical ‘ionic lock’ and which plays an important role in family A GPCR activation.^52,53 The family C GPCRs differ structurally from families A and B in their remarkably large extracellular domain, which comprises a cysteine-rich domain and a VFT, and in their intracellular carboxyl-terminal (C-terminal) domain. The TMD regions in family A and B GPCRs are conserved; however, family C GPCRs have the allosteric binding site within the TMD region (Fig. 5).⁴¹ Table 1 shows some of the characteristics of the GPCR families discussed in this review.

Table 1 Some characteristics of family A–C GPCRs^a

Feature	Family A	Family B	Family C	Reference
Transmembrane domains	All families possess seven transmembrane domains			31, 54 and 55
Orthosteric binding site	TM region	Extracellular loops, extracellular N-terminus, TM6	Extracellular N-terminus (VFTM, SUSHI)	12, 41, 55 and 56
Number of approved and marketed drugs	33	16	22	57
Motifs	All GPCRs share the D/E-R-Y/W motifs			30, 49 and 54
Number of conserved residues in TMD regions	25	33	94	55
Type of ligand	Small molecules, proteins, peptides	Proteins, peptides	Small molecules, cations, amino acids	31
Suitable as drug targets?	Yes, except the sensory receptors	Yes	Yes, except the sensory receptors	31
^a TM: transmembrane, GPCR(s): G-protein coupled receptor(s), VFTM: Venus flytrap module, SUSHI: short consensus repeats.

Allosteric binding and cooperativity

Allostery is a widespread biological process, defined as the ability of interactions occurring at one site on a molecule to modulate actions at a different binding site on the same molecule.^58,59 For example, the binding of an allosteric modulator to a molecule changes the conformation of its binding pocket, as shown in Fig. 6. Currently, there are two types of marketed pharmaceuticals: allosteric modulators, which bind at the allosteric binding site on the receptor and thereby change the structural conformation of the receptor binding site, and orthosteric modulators, which bind at the active site of the receptor.⁶⁰ A major challenge for orthosterically binding drugs is limiting the potential side effects arising from binding to homologous proteins sharing similar binding sites.⁶⁰ Hence an orthosterically binding drug must have a very high affinity for its target, in order for a small dose to selectively achieve the goal of target-only binding.⁶⁰ The binding of transcription factors (TFs) to DNA regulatory elements (REs) provides a good example illustrating the specificity in orthosteric drugs.⁶⁰


	Fig. 6 Mechanism of action of allosteric modulators.⁶³ Reprinted with permission from Springer Nature: Springer Nature, Nature Reviews Drug Discovery, Allosteric modulators of GPCRs: a novel approach for the treatment of CNS disorders, P. J. Conn, A. Christopoulos and C. W. Lindsley, Copyright (2009). (a) Allosteric ligands bind to an alternative binding site on a receptor to modulate the efficacy (blue) and/or affinity (red) of an orthosteric ligand. A number of allosteric ligands can also directly perturb signalling in their own right (green). (b) Simulations showing the effects on the binding (left) or function (right) of an orthosteric agonist mediated by three allosteric potentiators depicted in red, blue and green: red enhanced orthosteric agonist affinity only, blue enhanced only the efficacy, and green modestly enhanced both efficacy and affinity, as well as showing allosteric agonism.⁶³

The process of GPCR signalling begins when an endogenous extracellular signal interacts with the orthosteric binding site of a GPCR, resulting in a conformational change which passes the signal across the plasma membrane through the TMD region, eventually activating intracellular signalling cascades through heterotrimeric G proteins and other adjunct proteins.^58,61,62 A different approach, demonstrated for ligand-gated ion channels, is the development of allosteric modulators of the receptor subtypes. These small molecules do not bind to the traditional orthosteric binding site, instead interacting with an allosteric binding site to either enhance or inhibit receptor activation.⁶³

Allosteric GPCR modulators show at least one of the following pharmacological properties (Fig. 6). Agonism/inverse agonism: the allosteric modulator perturbs receptor signalling in either a positive (agonism) or negative (inverse agonism) manner, irrespective of the presence or absence of an orthosteric ligand.⁶³ Efficacy modulation: the allosteric effect changes intracellular responses, altering the intrinsic efficacy of an orthosteric ligand.⁶³ Affinity modulation: a conformational change influences the orthosteric binding pocket, modifying the association rate or dissociation rate (or sometimes both) of the ligand (Fig. 6).⁶³ Known allosteric modulators of family B GPCRs include, for the GLP-1 receptor, NovoNordisk compounds 1–6⁶³ and T-0632, which blocks GLP-1-induced cAMP production;^63,64 and, for the CRF₁ receptor, DMP696, which blocks CRF-stimulated adenylyl cyclase activity in a cell line expressing the CRF₁ receptor,^63,65 NBI 27914, which blocks the CRF₁ receptor,^63,66 NBI 35965⁶³ and antalarmin.⁶³
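Affinity modulation can be illustrated with the simple allosteric ternary complex model, a standard textbook treatment that is assumed here and is not taken from the studies cited above. In this sketch, K_A and K_B are the equilibrium dissociation constants of the orthosteric ligand A and the modulator B, and α is a cooperativity factor (α > 1 enhances, α < 1 reduces and α = 1 leaves unchanged the affinity of A). All function and parameter names are hypothetical, and the model deliberately ignores efficacy modulation and allosteric agonism.

```python
def orthosteric_occupancy(a, k_a, b=0.0, k_b=1.0, alpha=1.0):
    """Fractional receptor occupancy by orthosteric ligand A.

    a, b   : free concentrations of orthosteric ligand and allosteric modulator
    k_a,k_b: their equilibrium dissociation constants (same units as a and b)
    alpha  : cooperativity factor between the two sites (assumed constant)
    """
    if min(a, b) < 0 or min(k_a, k_b) <= 0 or alpha < 0:
        raise ValueError("concentrations must be >= 0 and constants > 0")
    # Apparent dissociation constant of A in the presence of modulator B.
    k_app = k_a * (1.0 + b / k_b) / (1.0 + alpha * b / k_b)
    return a / (a + k_app)


if __name__ == "__main__":
    # Occupancy at [A] = K_A with [B] = K_B for three illustrative alpha values.
    for alpha in (0.1, 1.0, 10.0):
        occ = orthosteric_occupancy(1.0, 1.0, b=1.0, k_b=1.0, alpha=alpha)
        print(f"alpha = {alpha:>4}: occupancy = {occ:.3f}")
```

A useful property of this form is saturability: as [B] grows, the apparent dissociation constant tends to K_A/α rather than changing without limit, which is one reason allosteric effects are often described as having a ceiling.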

Cooperativity is a thermodynamic term which has varying meanings in different biochemical contexts.^67,68 It is used to explain the complex interactions of identical ligands with a receptor at multiple binding sites.⁶⁷ Cooperativity also describes the thermodynamics of macromolecular conformational transitions, which include nucleic acid helix–coil transitions and protein folding.⁶⁷ Positive cooperativity is defined as the increase of binding affinity at one site of a receptor when a ligand is bound elsewhere.⁶⁹ A classic example of positive cooperativity is the binding of oxygen to haemoglobin; the binding of one oxygen molecule to the ferrous iron of the heme molecule increases the affinity of deoxyhaemoglobin for oxygen.⁶⁹ Negative cooperativity is observed when 2,3-bisphosphoglycerate binds to an allosteric binding site of haemoglobin and the affinity for oxygen is reduced.^67,69
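The haemoglobin example can be sketched with the empirical Hill equation, which is assumed here as a convenient description and is not drawn from the cited references. The Hill coefficient n summarises the degree of cooperativity (n = 1 corresponds to independent sites) and P50 is the oxygen pressure at half saturation. The numerical values used below (n ≈ 2.8, P50 ≈ 26 mmHg, and a higher P50 to mimic the effect of 2,3-bisphosphoglycerate) are approximate, illustrative assumptions, and the function name is hypothetical.

```python
def hill_saturation(p_o2, p50, n):
    """Fractional saturation from the Hill equation: Y = p^n / (P50^n + p^n)."""
    if p_o2 < 0 or p50 <= 0 or n <= 0:
        raise ValueError("p_o2 must be >= 0; p50 and n must be > 0")
    return p_o2 ** n / (p50 ** n + p_o2 ** n)


if __name__ == "__main__":
    # Compare a non-cooperative binder (n = 1) with a cooperative one (n = 2.8),
    # and a cooperative binder whose P50 has been raised (lower oxygen affinity).
    for p in (10.0, 26.0, 40.0, 100.0):
        independent = hill_saturation(p, 26.0, 1.0)
        cooperative = hill_saturation(p, 26.0, 2.8)
        shifted = hill_saturation(p, 30.0, 2.8)
        print(f"pO2 = {p:5.1f} mmHg: n=1 {independent:.2f}, "
              f"n=2.8 {cooperative:.2f}, n=2.8 with raised P50 {shifted:.2f}")
```

The steeper, sigmoidal curve obtained for n > 1 is the signature of positive cooperativity, while raising P50 at fixed n lowers saturation at every oxygen pressure.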

GPCR signalling via G-proteins

G-proteins consist of several families of diverse cellular proteins which contribute to a range of cellular functions, such as contractility, angiogenesis, learning and memory.^70,71 These proteins bind the guanine nucleotides guanosine diphosphate (GDP) and guanosine triphosphate (GTP) and also have inherent GTPase activity.⁷¹ They play a principal role in many cellular processes, including protein synthesis and cell development, vesicular transport, and cytoskeleton assembly, in addition to signal transduction.⁷¹ Trimeric G-proteins comprise two functional components: an alpha subunit (39–52 kDa), which catalyses GTPase activity, and a beta-gamma dimer (35 and 8 kDa), which associates closely with the alpha subunit when the latter is bound to GDP.⁷² Human G proteins are divided into two classes, namely small (monomeric) and heterotrimeric G proteins.^71,72

GPCRs are the largest superfamily of cell-surface receptors involved in transmembrane signalling, transmitting signals into cells in response to a range of extracellular stimuli, such as glycoproteins, polypeptides and ions, and hence regulating a wide variety of physiological and developmental functions.⁷³ The intracellular signalling cascades activated by GPCRs have proven to be remarkably complex.^73,74 The binding of a ligand to the GPCR binding site leads to a conformational change in the receptor, in turn promoting the binding of the heterotrimeric G protein, consisting of G_α-GDP and G_βγ subunits, to the intracellular moiety of the receptor.⁷⁴ The exchange of GDP for GTP on the G_α-subunit results in the reversible dissociation of the G protein subunits, initiating downstream signalling via G_α-GTP and G_βγ.^73,74

Dielectric constant

The most effective way of correlating the structure and function of macromolecules is through the examination of their electrostatic energies.⁷⁵ The intermolecular interactions present are affected by the effective dielectric constant (relative permittivity, ε_r),⁷⁶ which differs according to the size and composition of the protein.⁷⁷ The accuracy of its determination is important in understanding various biochemical processes such as protein–ligand and protein–protein interactions, charge separation, ion channel selectivity, electron and proton transfer, signal transduction and macromolecular assembly;^77,78 these are influenced by the electrostatic potential of the protein surface.^77–79 Direct measurement gives a dielectric constant of 2.5 to 3.5 for dry proteins.⁷⁸ Theoretical calculation of the local dielectric constant of isolated proteins based on their amino acid composition yielded an average of 2.7.⁸⁰ Computational studies using continuum electrostatics and molecular dynamics simulations have shown that the polarity of the residues which make up the structural motifs within a protein affects its dielectric constant.^77,78

According to Warshel and Åqvist, the value of the dielectric constant of proteins is dependent on the property used to define it. They highlighted several possible ways of defining the dielectric constant in proteins, as outlined in Table 2,⁸¹ where Q₁ and Q₂ are charges on ionisable groups separated by distance r, μ is a group dipole moment (in units of electron Ångström), ΔG is the electrostatic Gibbs free energy, ā is the effective radius of charge, and ε_B is the effective dielectric constant associated with a given interaction.

Table 2 Some rules for the definition of dielectric constants in proteins

Definition	Value	Comments
Polar = ε large	ε = large	Protein sites are always polar near small radii ions.
Nonpolar = ε small	ε = large	Protein sites are always polar near small radii ions.
	ε(r) > 10 often ε(r) ≥ 40	The value of ε is large for charge–charge interactions.
	ε_B > 10	Proteins can provide as much solvation as water for ionised groups with small radii.
	ε ≥ 4	For functionally important charge–dipole interactions, the value of ε could be as small as 4. Such a low value, however, requires relatively fixed dipoles with little energy for reorganisation.

Li et al. reported that the average dielectric constant inside a protein is relatively low, about 6–7, but this figure reaches about 20–30 on the surface of the protein.⁸² The high average local dielectric constant values are often linked to the charged residues while the low values are assigned automatically to the regions comprised of mostly hydrophobic residues.⁸²
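The practical consequence of these differing dielectric values can be seen in a short numerical sketch based on Coulomb's law with a uniform effective dielectric constant, ΔG = 332.06 Q₁Q₂/(εr) kcal mol⁻¹ for charges in units of the electron charge and r in Å. This textbook expression is assumed here for illustration and is not reproduced from the cited work; the function name, the 4 Å separation and the selection of ε values (4 and 6 as low, interior-like values, 20 as a surface-like value and 80 as an approximate value for bulk water) are likewise illustrative assumptions.

```python
# Coulomb constant e^2/(4*pi*eps0) expressed in kcal mol^-1 Å per squared electron charge.
COULOMB_KCAL_MOL_ANGSTROM = 332.06


def coulomb_energy(q1, q2, r_angstrom, epsilon):
    """Charge-charge interaction energy (kcal/mol) in a uniform dielectric.

    q1, q2     : charges in units of the electron charge
    r_angstrom : separation in Å
    epsilon    : effective dielectric constant (assumed uniform, >= 1)
    """
    if r_angstrom <= 0 or epsilon < 1:
        raise ValueError("r must be > 0 and epsilon must be >= 1")
    return COULOMB_KCAL_MOL_ANGSTROM * q1 * q2 / (epsilon * r_angstrom)


if __name__ == "__main__":
    # An ion pair (+1/-1) at 4 Å under different assumed dielectric constants.
    for eps in (4, 6, 20, 80):
        energy = coulomb_energy(+1, -1, 4.0, eps)
        print(f"epsilon = {eps:>2}: {energy:7.2f} kcal/mol")
```

Since the energy scales as 1/ε, the same ion pair is an order of magnitude more strongly stabilised at ε = 4 than at ε = 40, which is why the choice of dielectric definition matters so much in practice.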

According to Wilson et al., solvent effects on reaction mechanisms are well established, but their influence on kinetic isotope effects (KIEs) is less well understood.⁸³ A change in solvent can alter the KIE indirectly by changing the transition-state (TS) structure. It can also affect the KIE directly by altering isotopically sensitive vibrational frequencies, irrespective of the TS structure or the identity of the rate-determining step.⁸³ Wilson et al. investigated medium effects on KIEs for S_N2 methyl transfer using the UFF or UA0 cavity method within the polarisable continuum model (PCM) and a hybrid quantum mechanical/molecular mechanical (QM/MM) method.⁸³ Their findings showed that the majority of the variation in the equilibrium isotope effects (EIEs) occurs within the same range of dielectric constants (1 ≤ ε ≤ 10) as is considered to apply within enzyme active sites and proteins.⁸³ It is therefore possible that any reaction involving charge separation, neutralisation or redistribution within an enzyme active site could show variations in KIEs between wild-type and mutant forms of the enzyme, originating from changes in the local dielectric response within the diverse protein environment.⁸³ Applying the UFF or UA0 cavity method within the PCM, together with a hybrid QM/MM method, to characterise ligand binding in GPCRs would further assist in understanding the interactions which occur in both the active and inactive states of GPCRs, as well as the changes which occur during the transition from the inactive to the active state upon ligand activation.

Computational biology techniques in GPCR research

The first major breakthrough in human GPCR structural biology took place in 2007 with the solving of the structure of the β₂-adrenergic receptor (β₂AR) bound to a diffusible ligand. A modified lipidic cubic phase (LCP) procedure was used to produce β₂AR-T4L crystals which diffracted to a resolution of 2.2 Å, and the structure was refined at a resolution of 2.4 Å.¹² Presently, 64 structures of unique GPCRs have been solved at varying resolutions using spectroscopic methods such as fluorescence, electron paramagnetic resonance (EPR) and nuclear magnetic resonance (NMR) spectroscopy, and structural techniques such as cryogenic electron microscopy (cryo-EM). This provides opportunities for employing computational biology techniques such as molecular modelling and molecular docking in drug discovery research.^84,85 The milestones achieved in GPCR structural studies have provided insights into the arrangement of the transmembrane domains,^1–5,11,12 the location of the orthosteric,^12,31,41 allosteric,^12,31,41 bitopic¹² and biased ligand binding sites,¹² the homo- or hetero-oligomerization of receptors¹² and the structural rearrangements associated with conformational changes upon GPCR activation and inactivation.¹² This base of structural information on GPCRs is vital for structure-based drug design (SBDD),^12,86 ligand-based drug design (LBDD)¹² and integrated models which complement drug discovery efforts.¹²

In 2012, Sosei Heptares published a detailed account of the use of the A₂A adenosine receptor (A₂AR) structure to identify a series of agents as potential antagonists; this became the first published example of GPCR SBDD.⁸⁷ Using structure-based virtual screening (SBVS), de Graaf et al. identified allosteric modulators of two family B receptors, namely the glucagon receptor and the glucagon-like peptide 1 receptor.⁸⁸ SBDD approaches have also led to the development of new agonists of the A₃ adenosine receptor (A₃AR).⁸⁹

Conclusion and future prospects

GPCRs are multifaceted proteins which exist in varying conformations, and the conformational equilibrium of this group of receptors is influenced both by the bound ligand and by the proximity of the related G protein. Their structure is highly conserved, comprising seven TMDs. These receptors possess different binding domains, namely allosteric and orthosteric binding domains. Progress in GPCR structural biology has substantially accelerated our understanding of GPCRs as potential drug targets using SBDD and LBDD approaches. Further computational studies assessing nuclear quantum effects on ligand–receptor binding, as well as the use of hybrid QM/MM and empirical valence bond theory in mechanistic studies of GPCRs, would allow further insight into the interactions which occur in both the active and inactive states of GPCRs, as well as the changes which occur during the transition between these states upon ligand activation. This review has aimed to provide an accessible and introductory perspective on advances in GPCR-based drug discovery approaches; many reviews on the topics highlighted herein are highly detailed and authoritative, but may not provide as accessible an account for a less specialised or more general audience in the chemical sciences.

Conflicts of interest

There are no conflicts to declare.

References

P. M. Dijkman, O. K. Castell, A. D. Goddard, J. C. Munoz-Garcia, C. de Graaf, M. I. Wallace and A. Watts, Nat. Commun., 2018, 9, 1710.
W. K. Kroeze, D. J. Sheffler and B. L. Roth, J. Cell Sci., 2003, 116, 4867.
R. Fredriksson, M. C. Lagerström, L.-G. Lundin and H. B. Schiöth, Mol. Pharmacol., 2003, 63, 1256.
H. B. Schiöth and R. Fredriksson, Gen. Comp. Endocrinol., 2005, 142, 94–101.
E. Ghosh, P. Kumari, D. Jaiman and A. K. Shukla, Nat. Rev. Mol. Cell Biol., 2015, 16, 69–81.
A. S. Hauser, M. M. Attwood, M. Rask-Andersen, H. B. Schiöth and D. E. Gloriam, Nat. Rev. Drug Discovery, 2017, 16, 829–842.
X.-l. Tang, Y. Wang, D.-l. Li, J. Luo and M.-y. Liu, Acta Pharmacol. Sin., 2012, 33, 363–371.
H. R. Kim, N. M. Duc and K. Y. Chung, Biomol. Ther., 2018, 26, 101–108.
K. Sriram and P. A. Insel, Mol. Pharmacol., 2018, 93, 251–258.
P. A. Insel, K. Sriram, M. W. Gorr, S. Z. Wiley, A. Michkov, C. Salmerón and A. M. Chinn, Trends Pharmacol. Sci., 2019, 40, 378–387.
K. A. Jacobson, S. Costanzi and S. Paoletta, Trends Pharmacol. Sci., 2014, 35, 658–663.
S. Basith, M. Cui, S. J. Y. Macalino, J. Park, N. A. B. Clavio, S. Kang and S. Choi, Front. Pharmacol., 2018, 9, 128.
A. Heifetz, G. F. X. Schertler, R. Seifert, C. G. Tate, P. M. Sexton, V. V. Gurevich, D. Fourmy, V. Cherezov, F. H. Marshall, R. I. Storer, I. Moraes, I. G. Tikhonova, C. S. Tautermann, P. Hunt, T. Ceska, S. Hodgson, M. J. Bodkin, S. Singh, R. J. Law and P. C. Biggin, Naunyn-Schmiedeberg's Arch. Pharmacol., 2015, 388, 883–903.
X. Yuan and Y. Xu, Int. J. Mol. Sci., 2018, 19, 2105.
M. Esguerra, A. Siretskiy, X. Bello, J. Sallander and H. Gutiérrez-de-Terán, Nucleic Acids Res., 2016, 44, W455–W462.
V. V. Gurevich and E. V. Gurevich, Int. J. Mol. Sci., 2017, 18, 2519.
P. S. H. Park, Curr. Med. Chem., 2012, 19, 1146–1154.
D. Provasi, M. C. Artacho, A. Negri, J. C. Mobarec and M. Filizola, PLoS Comput. Biol., 2011, 7, e1002193.
P. Samama, S. Cotecchia, T. Costa and R. J. Lefkowitz, J. Biol. Chem., 1993, 268, 4625–4636.
S. M. de Munnik, M. J. Smit, R. Leurs and H. F. Vischer, Front. Pharmacol., 2015, 6, 40.
B. K. Kobilka, Biochim. Biophys. Acta, 2007, 1768, 794–807.
J. Bockaert and J. Philippe Pin, EMBO J., 1999, 18, 1723–1729.
M. N. Davies, A. Secker, A. A. Freitas, M. Mendao, J. Timmis and D. R. Flower, Bioinformatics, 2007, 23, 3113–3118.
F. Horn, E. Bettler, L. Oliveira, F. Campagne, F. E. Cohen and G. Vriend, Nucleic Acids Res., 2003, 31, 294–297.
F. Horn, J. Weare, M. W. Beukers, S. Hörsch, A. Bairoch, W. Chen, Ø. Edvardsen, F. Campagne and G. Vriend, Nucleic Acids Res., 1998, 26, 275–279.
G.-M. Hu, T.-L. Mai and C.-M. Chen, Sci. Rep., 2017, 7, 15495.
G. Pándy-Szekeres, C. Munk, T. M. Tsonkov, S. Mordalski, K. Harpsøe, A. S. Hauser, A. J. Bojarski and D. E. Gloriam, Nucleic Acids Res., 2018, 46, D440–D446.
D. M. Rosenbaum, S. G. F. Rasmussen and B. K. Kobilka, Nature, 2009, 459, 356–363.
S. B. Gacasan, D. L. Baker and A. L. Parrill, AIMS Biophys., 2017, 4, 491–527.
K. Palczewski, Annu. Rev. Biochem., 2006, 75, 743–767.
M. C. Lagerström and H. B. Schiöth, Nat. Rev. Drug Discovery, 2008, 7, 339–357.
D. C. Teller, T. Okada, C. A. Behnke, K. Palczewski and R. E. Stenkamp, Biochemistry, 2001, 40, 7761–7772.
K. Sasaki, S. Dockerill, D. A. Adamiak, I. J. Tickle and T. Blundell, Nature, 1975, 257, 751–757.
C. Parthier, S. Reedtz-Runge, R. Rudolph and M. T. Stubbs, Trends Biochem. Sci., 2009, 34, 303–310.
M. Wheatley, D. Wootten, M. T. Conner, J. Simms, R. Kendrick, R. T. Logan, D. R. Poyner and J. Barwell, Br. J. Pharmacol., 2012, 165, 1688–1703.
C. R. R. Grace, M. H. Perrin, M. R. DiGruccio, C. L. Miller, J. E. Rivier, W. W. Vale and R. Riek, Proc. Natl. Acad. Sci. U. S. A., 2004, 101, 12836–12841.
V. Karageorgos, M. Venihaki, S. Sakellaris, M. Pardalos, G. Kontakis, M.-T. Matsoukas, A. Gravanis, A. Margioris and G. Liapakis, Hormones, 2018, 17, 45–59.
C. de Graaf, G. Song, C. Cao, Q. Zhao, M.-W. Wang, B. Wu and R. C. Stevens, Trends Biochem. Sci., 2017, 42, 946–960.
F. Wu, L. Yang, K. Hang, M. Laursen, L. Wu, G. W. Han, Q. Ren, N. K. Roed, G. Lin, M. A. Hanson, H. Jiang, M.-W. Wang, S. Reedtz-Runge, G. Song and R. C. Stevens, Nat. Commun., 2020, 11, 1272.
K. Hollenstein, C. de Graaf, A. Bortolato, M.-W. Wang, F. H. Marshall and R. C. Stevens, Trends Pharmacol. Sci., 2014, 35, 12–22.
L. Chun, W.-h. Zhang and J.-f. Liu, Acta Pharmacol. Sin., 2012, 33, 312–323.
P. Rondard, C. Goudet, J. Kniazeff, J.-P. Pin and L. Prézeau, Neuropharmacology, 2011, 60, 82–92.
S. D. Hellyer, S. Albold, T. Wang, A. N. Y. Chen, L. T. May, K. Leach and K. J. Gregory, Mol. Pharmacol., 2018, 93, 504.
B.-O. Hans, P. Wellendorph and J. Anders, Curr. Drug Targets, 2007, 8, 169–184.
C. S. Tautermann, Bioorg. Med. Chem. Lett., 2014, 24, 4073–4079.
B. L. Roth, Nat. Struct. Mol. Biol., 2019, 26, 535–544.
X. C. Zhang, J. Liu and D. Jiang, Protein Cell, 2014, 5, 492–495.
T. C. Møller, D. Moreno-Delgado, J.-P. Pin and J. Kniazeff, Biophys. Rep., 2017, 3, 57–63.
D. Zhang, Q. Zhao and B. Wu, Mol. Cells, 2015, 38, 836–842.
M. Dong, C. Koole, D. Wootten, P. M. Sexton and L. J. Miller, Br. J. Pharmacol., 2014, 171, 1085–1101.
M. Orel, E. Padrós and J. Manyosa, FEBS Journal, 2012, 279, 2357–2367.
A. Bortolato, A. S. Doré, K. Hollenstein, B. G. Tehan, J. S. Mason and F. H. Marshall, Br. J. Pharmacol., 2014, 171, 3132–3145.
K. J. Culhane, Y. Liu, Y. Cai and E. C. Y. Yan, Front. Pharmacol., 2015, 6, 264.
P. Tewatia, N. Agrawal, M. Gaur and S. Sahi, Biochimie, 2014, 101, 168–182.
B. Trzaskowski, D. Latek, S. Yuan, U. Ghoshdastider, A. Debinski and S. Filipek, Curr. Med. Chem., 2012, 19, 1090–1109.
W. I. Weis and B. K. Kobilka, Annu. Rev. Biochem., 2018, 87, 897–919.
D. Wacker, R. C. Stevens and B. L. Roth, Cell, 2017, 170, 414–427.
P. R. Gentry, P. M. Sexton and A. Christopoulos, J. Biol. Chem., 2015, 290, 19478–19488.
H. N. Motlagh, J. O. Wrabl, J. Li and V. J. Hilser, Nature, 2014, 508, 331–339.
R. Nussinov and T. Chung-Jung, Curr. Pharm. Des., 2012, 18, 1311–1316.
N. Tuteja, Plant Signaling Behav., 2009, 4, 942–947.
C. D. Hanlon and D. J. Andrew, J. Cell Sci., 2015, 128, 3533–3542.
P. J. Conn, A. Christopoulos and C. W. Lindsley, Nat. Rev. Drug Discovery, 2009, 8, 41–54.
E. C. Tibaduiza, C. Chen and M. Beinborn, J. Biol. Chem., 2001, 276, 37787–37793.
Y.-W. Li, L. Fitzgerald, H. Wong, S. Lelas, G. Zhang, M. D. Lindner, T. Wallace, J. McElroy, N. J. Lodge, P. Gilligan and R. Zaczek, CNS Drug Rev., 2005, 11, 21–52.
T. Z. Baram, D. T. Chalmers, C. Chen, Y. Koutsoukos and E. B. De Souza, Brain Res., 1997, 770, 89–95.
J. R. Williamson, Nat. Chem. Biol., 2008, 4, 458–465.
T. Lenaerts, J. Ferkinghoff-Borg, J. Schymkowitz and F. Rousseau, BMC Syst. Biol., 2009, 3, 9.
I. G. Denisov and S. G. Sligar, Arch. Biochem. Biophys., 2012, 519, 91–102.
Y.-J. I. Jong, S. K. Harmon and K. L. O'Malley, Br. J. Pharmacol., 2018, 175, 4026–4035.
V. Zachariou, R. S. Duman and E. J. Nestler, in Basic Neurochemistry, ed. S. T. Brady, G. J. Siegel, R. W. Albers and D. L. Price, Academic Press, New York, 8th edn, 2012, pp. 411–422.
H. Schulman, in From Molecules to Networks, ed. J. H. Byrne, R. Heidelberger and M. N. Waxham, Academic Press, Boston, 3rd edn, 2014, pp. 119–148.
J. Doijen, T. Van Loy, B. Landuyt, W. Luyten, D. Schols and L. Schoofs, Biosens. Bioelectron., 2019, 137, 33–44.
G. J. Augustine, Neuroscience, ed. D. Purves, G. Augustine, D. Fitzpatrick, L. Katz, A.-S. LaMantia, J. McNamara and M. Williams, Sinauer Associates, Sunderland MA, 3rd edn, 2004.
A. Warshel and A. Papazyan, Curr. Opin. Struct. Biol., 1998, 8, 211–217.
S. E. Braslavsky, Pure Appl. Chem., 2007, 79, 293–465.
M. Amin and J. Küpper, ChemistryOpen, 2020, 9, 691–694.
M. Amin and J. Küpper, 2020, arXiv e-prints, arXiv:2001.07053.
C. N. Schutz and A. Warshel, Proteins: Struct., Funct., Bioinf., 2001, 44, 400–417.
A. S. Alshami, J. Tang and B. Rasco, Food Bioprocess Technol., 2017, 10, 1548–1561.
A. Warshel and J. Aqvist, Annu. Rev. Biophys. Biophys. Chem., 1991, 20, 267–298.
L. Li, C. Li, Z. Zhang and E. Alexov, J. Chem. Theory Comput., 2013, 9, 2126–2136.
P. B. Wilson, P. J. Weaver, I. R. Greig and I. H. Williams, J. Phys. Chem. B, 2015, 119, 802–809.
M. Jaiteh, I. Rodríguez-Espigares, J. Selent and J. Carlsson, PLoS Comput. Biol., 2020, 16, e1007680.
D. Hilger, M. Masureel and B. K. Kobilka, Nat. Struct. Mol. Biol., 2018, 25, 4–12.
P. Nakliang, R. Lazim, H. Chang and S. Choi, Biomolecules, 2020, 10, 631.
M. Congreve, C. de Graaf, N. A. Swain and C. G. Tate, Cell, 2020, 181, 81–91.
C. de Graaf, C. Rein, D. Piwnica, F. Giordanetto and D. Rognan, ChemMedChem, 2011, 6, 2159–2169.
A. Ciancetta and K. A. Jacobson, in Computational Methods for GPCR Drug Discovery, ed. A. Heifetz, Springer New York, New York, NY, 2018, pp. 45–72.
