Bruno L. Sousa*a,
Ito L. Barroso-Netob,
Evanildo F. Oliveiraa,
Emerson Fonsecac,
Pedro Lima-Netob,
Luiz O. Ladeirac and
Valder N. Freirea
aDepartamento de Física, Universidade Federal do Ceará, 60455-760, Fortaleza, Ceará, Brazil. E-mail: brunolopesdesousa@gmail.com
bDepartamento de Química Analítica e Físico-Química, Universidade Federal do Ceará, 60455-760, Fortaleza, Ceará, Brazil
cDepartamento de Física, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
First published on 22nd August 2016
Osteoporosis is a degenerative disease associated with excessive bone resorption, a natural process performed by osteoclasts. In turn, osteoclast maturation is critically regulated by the receptor activator of nuclear factor kB ligand (RANKL), its signalling receptor (RANK), and its decoy receptor osteoprotegerin (OPG). The critical role of the protein triad, RANK–RANKL–OPG, in osteoclastogenesis has made their binding an important target for the rational development of drugs against osteoporosis. Based on this, we have performed a quantum biochemistry investigation of the binding between RANKL and its decoy receptor, OPG, in order to analyse the individual contributions of all amino acid residues involved in the complex formation, providing a deeper understanding of the inhibition process. The role of specific residues in the RANKL–OPG binding was evaluated through quantum biochemistry computations performed within the molecular fractionation with conjugate caps (MFCC) methodology, and inter-residue binding energies were calculated within the framework of density functional theory (DFT). Our simulations, considering water effects (implicit and explicit) and the role of the dielectric constant background, attested the major importance of site II, when compared to site I, over OPG binding and functionality, mainly through interactions performed by the tripeptide OPG core, I94–E95–F96. The obtained results also explain (i) the impact of a specific OPG mutation (F96L) on Paget's disease development; (ii) how some pioneers proposed that peptides efficiently inhibit the RANKL–OPG complex, acting as promising drugs for the treatment of osteoporosis. In conclusion, our quantum biochemistry approach provides a solid base that allows important insights into peptide and drug design for the treatment of osteoporosis based on RANKL–OPG binding inhibition.
RANKL is a member of the tumor necrosis factor (TNF) superfamily, expressed as a homotrimeric transmembrane or secreted protein in osteoblasts, which plays a central role in osteoclast formation and bone resorption.6–8 The binding of RANKL to RANK, present at the membrane of osteoclast precursors, leads to the trimerization of this receptor and promotes osteoclast differentiation and activation through a downstream signalling cascade.9 On the other hand, OPG is secreted by osteoblasts and inhibits the binding of RANKL to RANK, thereby limiting osteoclastogenesis.9 OPG occurs as a dimer and can act as a decoy receptor of RANKL in view of structural similarities to the RANK ectodomain.10 Recently, it has been reported that the leucine-rich repeat-containing G-protein-coupled receptor 4 (LGR4) is another receptor for RANKL, inhibiting osteoclastogenesis through competition with RANK and by downstream signalling. However, the soluble version of this protein (LGR4-ECD) presented lower binding affinity than OPG for RANKL, and had little physiological effect on osteoclast differentiation.11
Bone remodelling becomes perturbed in a variety of pathological conditions that affect the skeleton, including postmenopausal osteoporosis and rheumatoid arthritis,12 in which there is local and/or systemic alteration in the levels of hormones or proinflammatory cytokines that are known to stimulate or inhibit bone resorption in vitro and in vivo.4 Functional mutations in RANKL lead to human autosomal recessive osteopetrosis (ARO), whereas RANKL overexpression has been implicated in the pathogenesis of bone degenerative diseases, such as osteoporosis, demonstrating that RANKL and RANK are indispensable for osteoclastogenesis, whereas the absence of OPG causes increased osteoclastogenesis and osteopenia.13 Although RANKL is best known for its role in bone resorption, it also plays multiple roles in the immune system,6,7,14–18 mammary gland development during pregnancy,19 thermoregulation,20 heart diseases,21 cancer metastasis,22,23 and hormone-derived breast development.24 Therefore, the RANK–RANKL–OPG molecular triad is an attractive target for the development of rational therapies to prevent bone destruction in many osteopenic conditions, such as osteoporosis and arthritis.10,25
Among the currently available agents to prevent or treat bone loss, we can mention estrogen, calcitonin and bisphosphonates.26 Estrogen plays a key role in regulating bone loss and it has been used for the treatment of postmenopausal women, reducing the percentage of RANKL-expressing cells, but it remains unclear if this is a direct or indirect effect of estrogen.27 On the other hand, calcitonin has proven efficiency in inhibiting osteoclastogenesis, but it is frequently associated with side effects, such as nausea and flushing.28 In clinical trials, bisphosphonates are widely used as therapeutic agents, but they may cause some adverse gastrointestinal effects, such as osteonecrosis of the jaw, and renal toxicity.29
Alternative therapies based on the development of specific antibodies for the RANK–RANKL–OPG molecular triad are currently under development. The first RANKL inhibitor approved for the treatment of osteoporosis and the prevention of skeleton-related events, named denosumab, consists of a fully human monoclonal antibody against RANKL, which binds to a specific loop region (DE loop) on the ligand and prevents its binding to RANK on responding cells, avoiding osteoclast maturation.5,30 However, the use of macromolecules, such as antibodies and cytokines, for therapeutics is associated with many disadvantages, such as poor bioavailability, neutralizing antibody production, rebound symptoms, limited long term efficiency, low stability, and a risk of severe and sometimes life-compromising side effects resulting from the antigen–antibody reaction.31 Additionally, they are costly and there are insurance issues stemming from their high expense.32,33 Recently, it has been reported that the leucine-rich repeat-containing G-protein-coupled receptor 4 (LGR4) is another receptor for RANKL, inhibiting osteoclastogenesis through competition with RANK and by downstream signalling, but the soluble version of this protein (LGR4-ECD) presented lower binding affinity than OPG for RANKL and had little physiological effect on osteoclast differentiation.11 A different strategy is exemplified by the use of a truncated version of the human parathyroid hormone (PTH) as a therapeutic for the treatment of osteoporosis.34 The recombinant version of this long peptide (34 residues), named teriparatide (Forteo®), increases bone remodelling, formation, and density, improves bone microstructure and reduces fracture risk.35 Nevertheless, this therapeutic presented a high incidence of osteosarcomas, documented in preclinical animal models, with a few cases confirmed for humans.36
In the current scenario, the design of biocompatible inhibitory peptides targeting specific protein–protein interactions (PPIs) is available as an interesting alternative approach for the treatment of different pathological conditions, since these interactions are involved in most cellular processes and influence biological functions through proximity-induced changes in protein features.37–40 Additionally, peptide-based therapeutics are attractive in view of their high biological activity, associated with low toxicity and high specificity, including benefits such as specific binding, minimization of undesired interactions between drugs and reduced tissue accumulation, decreasing risks of complications due to intermediate metabolites.41 Based on this, the modulation of PPIs is considered a promising strategy towards next-generation therapeutics.37
Peptides are unique candidates for rational drug development, since they are flexible in adopting and mimicking local structural features of proteins, being generally non-immunogenic in view of their low molecular weight.42 In contrast to small molecular ligands that bind to defined protein pockets, the interfaces of PPIs often involve rather flat protein surfaces that exhibit large contact areas. The classical structure-based method for designing therapeutic peptides takes advantage of the amino acid sequences found at PPI sites, which can occur between two structured protein domains, a structured domain and a relatively short peptide, or between two peptide stretches. In many cases, additional weak contacts distant to the defined interaction area contribute to binding, complicating the prediction of PPI characteristics.43 However, the analysis of several PPI interfaces revealed that specific residue side chains mainly contribute to the Gibbs energy of protein–protein binding. These so-called hot-spot residues often overlap with structurally conserved regions and represent a common feature of PPI interfaces.44,45
The molecular structure of murine RANKL has been available for a long time, and recently the molecular structures for murine RANK–RANKL and human RANKL–OPG complexes, as well as the hybrid complex between murine RANK and human RANKL, have also been determined, providing important details related to ligand/receptor interactions in the RANK–RANKL–OPG molecular triad.10,46,47 RANKL interacts with OPG through three distinct but equivalent shallow grooves located between monomers in the trimer (Fig. 1). Biochemical studies have indicated a ratio of 1:
1 in the trimer/dimer RANKL–OPG complex, where OPG monomers both bind to the same RANKL trimer, and each monomer of OPG cannot bind to a separate RANKL trimer.48 This allows efficient inhibition of RANKL-induced RANK trimerization, which is essential for subsequent signalling and osteoclast maturation. The RANKL–OPG binding interface presents two anchorage sites, named site I and II (Fig. 1). At site I, the 50s loop (CDR2) from OPG is placed along a groove in RANKL, forming patches of relatively small and separate contacts. Conversely, site II presents the 90s loop (CDR3) from OPG, deeply inserted into the groove surrounded by specific loops from neighbouring RANKL monomers, establishing a continuous patch of interactions.10
The available structural and biochemical data have suggested that site II is the major binding determinant of RANKL for the anchorage of OPG and RANK, despite subtle differences in a few interactions. Structural analysis of human and murine complexes of RANKL with OPG and RANK, respectively, has demonstrated that OPG exerts its decoy receptor function by directly occupying site II and blocking the access to key interacting residues, such as R223, Y241, and K257 in human RANKL.10,47 The importance of key residues involved in the interaction between RANKL and OPG has been determined based on biochemical and biological assays, such as surface plasmon resonance, site-directed mutagenesis, and osteoclast differentiation assays.10,47 Nevertheless, detailed energetic characterization of the PPI involved in the complex is not yet available, and would provide important information about the inhibition process and assist the design of new therapeutic molecules for the treatment of bone-related diseases.49 In this context, the purpose of this work is to allow a deeper understanding of the RANKL–OPG interface within a quantum biochemistry framework.
Quantum biochemistry methods are important tools when a relatively qualitative estimate of binding affinities is required, being frequently used to describe systems with tens up to thousands of atoms within the scope of divide-to-conquer methodologies.50–53 Despite the complexity of biological systems, increasing computational power, combined with recent developments in density functional theory (DFT),54,55 has allowed the application of quantum biochemistry (frequently combined with classical molecular mechanics techniques) to study biological issues, such as enzymatic reactions and binding processes mediated by proteins.56,57 For these systems, fragment-based methods were also developed to accelerate the calculations, seeking a balance between the computational cost and the accuracy required to produce trustworthy results.58,59 These methods enable the computational treatment of large molecules or molecular systems, reaching consistent descriptions at the quantum biochemistry level. The principle is to divide the molecule of interest into fragments, employ a quantum biochemistry approach to calculate some properties of each fragment, and then combine the results to predict the same properties for the whole.60 Within this picture, it was possible to describe the inactivation of ovine cyclooxygenase-1 by bromoaspirin and aspirin;61 statin inhibition effectiveness in HMG-CoA reductase;62 human dopamine D3 receptor interaction with the selective antagonist eticlopride and the inverse agonist haloperidol;63,64 the activation and antagonism strength of an AMPA-sensitive glutamate receptor;65 the investigation of willardine partial agonism in AMPA receptors;66 the binding of isoniazid NADH adduct to InhA reductase,67 and ibuprofen transportation by human serum albumin,68 as well as estimating the residue–monomer and residue–residue interaction energies of the triple helix structure of collagen-like peptide T3-785.69
Moreover, an important feature to be considered during biological system simulations is a better description of the electrostatic effects. Almost all biological processes are controlled or modulated by electrostatic effects, which crucially demands the ability to perform accurate electrostatic calculations to describe quantitative structure–function correlation in proteins.70 According to Schutz and Warshel (2001) electrostatic effects on biological systems play many roles, being present in enzyme catalysis, electron transfer, proton transport, ion channeling, ligand binding, macromolecular assembly, and signal transduction.71 The electric charge pattern in a protein gives rise to electrostatic effects, which are described by an inhomogeneous dielectric function ε(r), where r is the spatial coordinate describing the 3D structure of the protein. Generally, the dielectric function of the protein is assumed to be constant, within a mean field approximation, and various values have been chosen for it, usually in the range 4–40.72 Small values for the dielectric constant (ε < 10) are associated with hydrophobic patches or cavities, like binding pockets, while large values of dielectric constants (ε = 20, ε = 40, or ε = 80) are associated with solvated hydrophilic patches and metal binding sites.45
Here, we take full advantage of the 2.7 Å resolution X-ray structure of the human RANKL–OPG complex (PDB ID: 3URF) to employ quantum methods, in order to investigate the individual energetic contribution of all interactions established in the complex.10 The DFT formalism was chosen to estimate the binding energies within the molecular fractionation with conjugate caps (MFCC) framework,38,50,51 a methodology specifically designed to facilitate the quantum biochemistry description of protein systems.50 Moreover, explicit water molecules were considered in the calculations in two different ways, composing the molecular caps or as part of the analysed residues, in order to precisely estimate their effect on the binding energies. Furthermore, implicit water effects were considered by assuming two different values for the RANKL–OPG dielectric constant (ε = 10 and ε = 40) in the computations.
Our quantum calculations provide a detailed profile of all the PPIs involved in the RANKL–OPG interaction, presenting important information about the complex organization. The obtained results confirm the previously reported importance of residues such as R223 and K257 (but deny others, such as Y241), as well as reinforcing the importance of neglected residues, such as K181 and R224.10 Additionally, the impact of a specific mutation in the development of Paget's disease is explained through a comparison between the non-mutant and mutant residue contribution to the RANKL–OPG binding energy.47 Finally, previously proposed OPG-based peptides targeting the RANKL–OPG complex have also been analysed with respect to their binding energy to RANKL, corroborating the critical role of a few interactions. Therefore, the obtained results establish a quantum biochemistry-based picture of the inhibition mechanism exerted by OPG on RANKL within a binding energy framework, going beyond a strict distance-based analysis of the complex, as furnished by the crystallographic data, thus providing important new and quite relevant information for the design of new peptides for the treatment of osteopenic conditions.
DFT simulations on the classically optimized structures were performed using DMOL3 code, implemented in Materials Studio, adopting the dispersion correction GGA + D (generalized gradient approximation) exchange–correlation functional, which is adequate to describe systems where hydrogen bonds and dispersive forces, such as van der Waals interactions, are present.74 To model the dispersive forces, a state-of-the-art semi-empirical correction scheme proposed by Tkatchenko and Scheffler (TS) was adopted.75 A double numerical plus polarization (DNP 4.4) basis set was chosen to expand the Kohn–Sham orbitals within an all-electron treatment scheme. The orbital cutoff for these computations was set to 3.7 Å, and a total energy variation smaller than 10−6 Ha was imposed as a threshold to achieve self-consistency.
EI(Ri–Rj) = E(Ci−1RiCi+1Cj−1RjCj+1) − E(Ci−1RiCi+1Cj−1Cj+1) − E(Ci−1Ci+1Cj−1RjCj+1) + E(Ci−1Ci+1Cj−1Cj+1) | (1) |
In the above equation, the Ck terms refer to the conjugate caps, which must be chosen carefully to reproduce the local electronic environment of the amino acid residues.53 In our study, these caps are the residues covalently bound to Rk; Ck±1 = Rk±1 plus hydrogen atoms placed at any dangling bonds. At the right-hand side of eqn (1), the first term, E(Ci−1RiCi+1Cj−1RjCj+1), is the total energy of the system formed by two interacting capped residues. The second term, E(Ci−1RiCi+1Cj−1Cj+1), gives the total energy of the system formed by the capped residue Ri and the hydrogenated caps of Rj. The third term, E(Ci−1Ci+1Cj−1RjCj+1), is the total energy of the system formed by Rj and the set of caps of Ri. Finally, E(Ci−1Ci+1Cj−1Cj+1) is the total energy of the system formed by the caps only.69
Following this scheme, structural files (protein data bank format) were prepared and used as input for calculations with DMOL3 code (ESI Fig. S1†).74 The total energies obtained for each system were then inserted in the above equation to determine the interaction energy between a specific pair of amino acid residues.
All interaction energy calculations were performed using the COSMO continuum solvation model, i.e. an implicit water representation, ε = 10 and ε = 40.78,79 These values are in agreement with the results of the work of Vicatos et al., who have investigated the absolute folding free energies in a diverse set of 45 proteins, finding that the best fitted values of the dielectric constant for charge–charge interactions and for self-energies are in the range 1 < ε < 40.79 Currently, the application of the COSMO continuum solvation model is a physically reasonable and also practical solution.80 Warshel and co-workers (1984) have shown that any model with a large dielectric constant for charge–charge interactions will usually looks like an excellent model in cases with surface groups.81 In addition, Antony and Grimme (2012) considered the inclusion of the crystallographic water molecules into the fractionation scheme as the next step to improve the efficiency of GGA-based DFT energy calculations of full protein-ligand interactions.80 Recently, it was shown that a value assigned to the dielectric constants of human serum albumin and the laccase enzyme can influence the binding pattern of the anti-inflammatory, ibuprofen, in the former case,68 and the pesticide, formetanate, in the latter.82
Molecular docking calculations were performed with AutoDock Vina, version 1.1.2, which applies an iterated local search global optimizer for the optimization procedure. The succession of each step consists of a mutation and local optimization, with the acceptance decisions made according to the Metropolis criterion. This uses the efficient quasi-Newton method, Broyden–Fletcher–Goldfarb–Shanno (BFGS) for local optimization.86 The Autodock graphical interface, AutoDockTools, version 1.5.6, was used to retain polar hydrogens and add partial charges to the proteins and ligands using the Kollman united charges.61 The receptor (RANKL) and ligands (peptides) were treated as rigid and flexible molecules, respectively. The search space for the docking calculations was defined by a 20 Å × 20 Å × 20 Å cube centred on binding site II. Exhaustiveness was set to 15, and all other parameters were used as default. For each ligand, the ten top-ranked generations based on the predicted binding affinity (in kilocalories per mole) were analysed. The solutions were first chosen based on the coordination of the crystal structure for peptide-2, and then the most suitable results were further ranked based on the theoretical binding energy (given as a negative score in kcal mol−1).
DFT calculations were employed to assess the relative energetic contribution of each pair of interactions, as well as the individual contribution of each amino acid residue, at binding sites I and II, from the RANKL–OPG complex.87 As described above, treatments one and two were applied aiming to further analyse the influence of water molecules over energy estimates. Nevertheless, the current discussion will focus solely on treatment 1 (T1:ε = 10 and T1:ε = 40 for dielectric constants of 10 and 40, respectively), presenting treatment 2 (T2:ε = 10 and T2:ε = 40 for dielectric constants of 10 and 40, respectively) as supplementary material.
The current strategy, using a distance-based binding interface (top distance of 4.0 Å), has found a few extra residues on RANKL, beyond those previously mentioned in the literature, composing binding sites I and II.10,47 Therefore, for RANKL, site I is composed of residues Y188, D190, R191, G192, K248, I249, H253, K282, R284, P301, D302, and Q303, while site II presents residues S179, H180, K181, R223, H224, H225, T227, Q237, M239, Y241, K257, T261, F270, H271, F272, S294, and N295 (newly identified residues highlighted in bold) (Table 1). In the case of OPG, the current approach has confirmed all previously described residues for sites I (H47, Y48, S56, E58, L60, Y61, P64, and V65) and II (K67, E68, L69, R90, Y91, L92, E93, I94, E95, F96, C97, L98, and K99) (Table 2). As described earlier, OPG interacts with RANKL at binding sites I and II mainly through loops 50s and 90s, respectively, which represent linear fragments of the protein structure.10,47 Based on this, we have named here as peptides 1 and 2 the protein fragments from OPG corresponding to loops 50s (T55–P64) and 90s (R90–K99), respectively, which directly interact with RANKL (Fig. 2).
Residue | Site | T1:ε = 10 | T1:ε = 40 | T2:ε = 10 | T2:ε = 40 |
---|---|---|---|---|---|
H47 | I | −4 | −1 | −5 | −2 |
Y48 | I | −10 | −9 | −9 | −9 |
Y49 | I | 1 | 1 | 1 | 1 |
S56 | I | 6 | 5 | 6 | 5 |
D57 | I | −9.2 | −2.6 | −9.2 | −2.6 |
E58 | I | −27 | −10 | −26 | −10 |
L60 | I | −4 | −2 | −4 | −2 |
Y61 | I | −8 | −5 | −7 | −5 |
S63 | I | −1 | −1 | −1 | −1 |
P64 | I | −3.3 | −2.7 | −3.8 | −3.3 |
V65 | I | −3 | −3 | −3 | −4 |
K67 | I/II | 14 | 7 | 11 | 8 |
E68 | II | −7 | −2 | −8 | −2 |
L69 | II | −1 | −4 | −2 | −4 |
R90 | II | −2.5 | −0.7 | −3.5 | −0.7 |
Y91 | II | 0 | 0 | −1 | 0 |
L92 | II | −1 | 0 | 1 | −1 |
E93 | II | 19 | 19 | 17.4 | 17.5 |
I94 | II | −3 | −1 | −7 | −5 |
E95 | II | −38.6 | −8.9 | −36.7 | −7.8 |
F96 | II | −15 | −15 | −21 | −21 |
C97 | II | −2 | −3 | −7 | −5 |
L98 | II | −5 | −2 | −5 | −2 |
K99 | II | −2 | 0 | −2 | 0 |
In the case of binding site II, forty-two interactions are present within a range of 4.0 Å, disposed in a much more intricate net. Unlike site I, the presence of a comparable number of attractive and repulsive interactions confers to site II a considerably lower total binding energy (−38.1 and −24.0 kcal mol−1, applying T1:ε = 10 and T1:ε = 40, respectively) (Fig. 6 and 10, and ESI Fig. S4 and S5†). However, the number and importance of all interactions, determined by quantum calculations, corroborate their critical role for the RANKL–OPG complex (Table 4).10 In peptide-2, the most energetically important residues are I94, E95, and F96, with individual contributions of −3.0, −38.6, and −15.0 kcal mol−1 (T1:ε = 10), respectively, comprising more than a half of all interactions established in site II (Fig. 6, 7, and 8, and ESI Fig. S6†). These results agree with previous biological assays and attest the importance of this tripeptide in OPG binding and its inhibitory activity (Fig. 7 and 8).
![]() | ||
Fig. 9 Total interaction energy for site I as a function of distance, applying treatments 1 and 2 and distinct values of the dielectric constant. Energy values are represented as kcal mol−1. |
Complementarily, quantum calculations have indicated R191, K282, and R284 as the energetically most relevant residues at site I, and K181, R223, and K257 at site II, for OPG recognition by RANKL (ESI Fig. S6–S9†). These residues are potential templates for alternative approaches focused on designing peptides targeting RANK instead of RANKL, aiming to block key residues on RANK's surface for the complex formation. Recently, peptides based on the RANKL structure (RANK antagonist) were proposed to specifically block RANK–RANKL interactions.88 Nevertheless, these RANKL-based peptides presented a lower efficiency over osteoclast differentiation when compared to OPG or OPG-based peptides.84,85,88 Interestingly, these antagonist peptides were designed not to target the binding sites between RANK and RANKL, but a hinge region between two cysteine-rich domains on RANK, which is responsible for the conformational change upon RANK association.89 Therefore, efficient inhibition of osteoclastogenesis seems to hinge on the proper blockage of key PPIs in the RANK–RANKL complex.
Interestingly, the energy analysis revealed that calculations involving a dielectric constant of 10 (ε = 10) generate increased (sometimes significantly) binding values (attractive or repulsive) for polar interactions when compared to a dielectric constant of 40 (ε = 40). However, the same is not true for hydrophobic interactions, which eventually present slightly increased values for treatments, considering ε = 40. The values of dielectric constants used in this work (ε = 10 and ε = 40) are representative enough to consider the inhomogeneous nature of protein interfaces, which may include a great number of polar, charged, and hydrophobic residues. Protein systems may include patches presenting variable values of the dielectric constant within the 4–80 interval, with a value of 80 representing a completely solvated environment.45
Additionally, treatment two (waters considered as part of the analysed residues) presents increased values for attractive interactions, especially those of lower significance. Thereby, we can conclude that this energy increment is a contribution of water molecules involved in these interactions, since treatment one considers it as part of the caps, which highlights its importance over the protein binding.
The key role of E95 was analysed by Luan and co-workers (2012), who proved the inability of an OPG mutant at this position (E95A) to act as an inhibitor of the osteoclast differentiation induced by RANK.10 Now, our quantum calculations have indicated E95 as the energetically most important residue in the complex, which corroborates the available biological data. This residue is placed on the tip of the 90s loop, being inserted upon OPG binding into the bottom of the cavity built by RANKL units on site II. The most significant attractive interactions involving E95 are established with residues K181 (RANKL-B), R223 (RANKL-A), and K257 (RANKL-B), and include the most important interactions in the complex. These interactions are directly dependent on two other repulsive ones established with residues Q237 and Y241 (both from RANKL-B), which induce proper coordination of the E95 side chain, as detailed later.
In the case of I94, despite presenting a low energetic contribution, this residue is essential for the coordination of peptide-2, allowing and assisting the blockage of key residues for RANK anchorage (Fig. 7 and 8). Attractive interactions are established through hydrophobic contacts with surrounding side chains, especially K181 (RANKL-B), while repulsive interactions arise from pairing to charged groups, such as those present on the Q237 side chain.
For F96, a structural comparison between the human RANKL–OPG and the hybrid mRANK–hRANKL complexes suggests that the pi-stacking interaction involving the phenyl group of this residue is critical for RANKL–OPG complex stabilization. Quantum calculations showed that F96 is the second energetically most important residue on peptide-2 (after E95), establishing attractive interactions to eight different RANKL residues, and forming a compact hydrophobic core (Fig. 7 and 8). Comparatively, these interactions are absent in the RANK–RANKL complex, this residue being replaced by a cysteine (C127) in the RANK structure.47 Indeed, a sulphate ion occupies the position of the F96 side chain in the hybrid mRANK–hRANKL complex, indicating a lack of stabilizing interactions. This remarkable difference between RANK and OPG presents a strict correlation to the stronger binding performed by the latter with RANKL. This statement is reinforced by pathological conditions associated with mutations on this very residue, such as Paget's disease, in which a leucine residue replaces the original phenylalanine on the OPG structure (F96L), causing a disorder of accelerated bone resorption.90
Aiming to understand the structural basis of this severe condition, we performed in silico mutagenesis of the phenylalanine residue at position 96 by a leucine, followed by quantum calculations of the mutated residue to analyse the energy impact of this substitution. The structural analysis revealed that a leucine residue at this position on the OPG structure is able to establish only five interactions to RANKL residues, compared to eight performed by phenylalanine, despite presenting similar coordination of its side chain. Furthermore, most of these interactions are energetically less significant when compared to those from phenylalanine, which generates for the latter a total binding energy that is twofold higher (Fig. 11a). Based on this, we can assume that substitutions at this position that may disrupt these hydrophobic contacts would significantly decrease the binding affinity of OPG for RANKL, which is in accordance with the available biological data, which reports an affinity 2900 times lower of the mutant OPG-F96L for RANKL, when compared to the wild-type protein.47 In this way, quantum calculations have consistently described the energy impacts of this substitution over the complex, emphasizing the critical role of the tripeptide core, I94–E95–F96, for OPG biological function.
![]() | ||
Fig. 10 Total interaction energy for site II as a function of distance, applying treatments 1 and 2 and distinct values of the dielectric constant. Energy values are represented as kcal mol−1. |
However, two peptides among the several proposed have attracted attention due to their high inhibitory potential. These peptides were named YR-11 (YLEIEFSLKHR) and OP3-4 (YCEIEFCYLIR), both presenting 11 amino acid residues but with different structures, strictly based on loop 90s (Y91 to R101). YR-11 is a linear peptide presenting a single modification when compared to peptide-2, where a cysteine residue at position 7 (involved in a disulphide bridge with C86 on OPG) was replaced by a serine (highlighted in bold). On the other hand, OP3-4 is an exocyclic peptide (through a disulphide bridge between C2 and C7) presenting four sequence alterations in comparison to peptide-2 (highlighted in bold). In this sequence, a tyrosine residue replaces a leucine at position 8 and the original C-terminal residues, K9–H10, were replaced by L9–I10. The inhibitory effect of these peptides relies on the blockage of key residues on RANKL for RANK anchorage at binding site II, such as K181 and Y241, preventing receptor trimerization and consequently osteoclastogenesis.84,85
Molecular docking calculations were performed to simulate the complexes between RANKL and the proposed peptides, YR-11 and OP3-4. The best results obtained for both peptides presented the tripeptide core, I4–E5–F6 (corresponding to I94–E95–F96 on OPG), with coordinations almost identical to the crystal structure, which was used as a selection parameter. Based on the generated complexes, we could infer the influence of the alterations inserted on the peptide sequences over the binding to RANKL. In the case of YR-11, the presence of a serine residue on position 7 (instead of a cysteine) probably increases the peptide solubility and maximizes the polar contacts with H225 on RANKL. For OP3-4, Y8 maximizes the contacts with F270 and Q237 on RANKL, also enhancing the peptide stability through pi-stacking interactions with F6. Interestingly, the replacement in OP3-4 of the original C-terminal residues has increased the inhibitory potency of this peptide, which was attributed to a stronger interaction with RANKL.85
After docking, a distance-based binding interface (within a range of 4.0 Å) was determined for both complexes to detect all interactions involved in the peptide binding to RANKL. Quantum calculations were also performed to analyse the energetic contribution of each peptide residue to the complex formation. Following this strategy, no interactions were detected for the last C-terminal residues, H11–R12 on YR-11 and L10–I11–R12 on OP3-4 (Fig. 11). These results are similar to that obtained for peptide-2 (R90–K99), which presented no interactions involving residues H100 and R101. Additionally, no energy contribution was attributed by quantum calculations to residue K10 from YR-11. Based on this, we can conclude that the proposed peptides are too long, each presenting three extra C-terminal residues that do not participate in the binding to RANKL and may disturb the peptide stability after binding, especially when long side chains are present (as in C-terminal R12).
Quantum calculations also revealed similar total binding energies for both peptides, YR-11 (−53.9 kcal mol−1 with ε = 10) and OP3-4 (−57.4 kcal mol−1 with ε = 10), which is in agreement with their equivalent inhibitory potential (around 90%). Interestingly, the average energetic contribution of the tripeptide core, I4–E5–F6, is very similar (−63.0 and −64.8 kcal mol−1 for YR-11 and OP3-4, respectively, with ε = 10), although it presents a higher value for E5 on OP3-4, and higher values for I4 and F6 on YR-11 (Fig. 11). In both cases, the binding energy presented by this tripeptide core is higher (more attractive) than the binding energy of the whole peptide. In fact, most of the remaining residues present less significant contributions, except for E93 (corresponding to E93 in OPG), which presents a high repulsive energy (17.5 and 19.0 kcal mol−1 for YR-11 and OP3-4, respectively, with ε = 10), offering little benefit or even disturbing peptide binding (Fig. 11). Since the dissociation rate of mimetics has been shown to correlate with biological activity, the presence of these extra amino acid residues may impact the efficiency of these peptides as therapeutic agents.85 Based on this, our results emphasize that the key to osteoclastogenesis inhibition relies on the tripeptide core, I94–E95–F96, which is available as a template for designing small peptides with potential for the treatment of osteopenias.
We have considered water molecules in two different ways (treatments one and two) during the calculations, also applying two different values of the dielectric constant (ε = 10 and ε = 40), with the aim of properly analyzing the influence of the solvent over the binding energies. Our results indicate that both treatments are suitable for considering water molecules in the calculations, although the majority of analysed residues display an increased total binding energy for treatment two, pointing to a contributive role of water molecules over protein binding. Furthermore, lower values of the dielectric constant (around ε = 10) have proven appropriate to estimate important polar interactions without underestimating hydrophobic contacts.
Quantum calculations have confirmed the importance of site II for OPG binding and activity, mainly through the coordination of interactions performed by the tripeptide core, I94–E95–F96. This tripeptide coordinates more than a half of all interactions involved in site II, contributing 53% of the total binding energy in the complex. The disruption of these interactions may lead to a severe impact on OPG functionality, such as that observed in Paget's disease. Our results have proven that the single mutation responsible for this autosomal recessive disease (F96L) generates a mutant OPG that is unable to properly bind RANKL due to a lack of stabilizing interactions. This fact emphasizes the crucial role of this tripeptide core, which is also the key to the high efficiency of the analysed inhibitory peptides, YR-11 and OP3-4, as attested by the structural and energy analyses. These assumptions corroborate the previous statement that OPG exerts its decoy receptor function by directly occupying binding site II and blocking the accessibilities of key interacting residues, such as R223 and K257, on RANKL, preventing RANK recognition.10
Nevertheless, it is important to highlight that, despite presenting a secondary role, site I stabilizes OPG along RANKL's surface, allowing precise insertion of the 90s loop into binding site II. Previous results have indicated that mutations affecting OPG's residues on binding site I had almost no impact over the complex formation, but we have attested through quantum calculations a collaborative association of several minor interactions at this site.10 This result indicates peptide-1 as a potential template for the design of inhibitory molecules targeting a broader spot on RANKL.
Currently, the design of inhibitory peptides targeting the RANK–RANKL–OPG molecular triad is based solely on the structural analysis of crystal structures, validated by biological assays. Some of the proposed peptides have proven useful and able to efficiently inhibit osteoclastogenesis. Nevertheless, our quantum calculations indicate that the inhibitory potential of these peptides relies on the tripeptide core, I–E–F, attributing to the remaining residues a minor role (or even a disturbing role) over the process. Based on this, we propose the design of small and specific peptides based on this tripeptide core as an innovative strategy for RANK–RANKL complex inhibition. These peptides would present better stability, specificity and cost, emerging as important tools for the treatment of osteoporosis and other osteopenias.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c6ra16712h |
This journal is © The Royal Society of Chemistry 2016 |