Lucas S.
Franco
ab,
Rodolfo C.
Maia
bc and
Eliezer J.
Barreiro
*abc
aPrograma de Pós-Graduação em Farmacologia e Química Medicinal, Instituto de Ciências Biomédicas, Universidade Federal do Rio de Janeiro, Avenida Carlos Chagas Filho, 373, Ilha do Fundão, 21941-912, Rio de Janeiro, RJ, Brazil
bLaboratório de Avaliação e Síntese de Substâncias Bioativas (LASSBio®, http://www.lassbio.icb.ufrj.br), Instituto de Ciências Biomédicas, CCS, Universidade Federal do Rio de Janeiro, Cidade Universitária, Rio de Janeiro, RJ, Brazil. E-mail: ejbarreiro@ccsdecania.ufrj.br
cInstituto Nacional de Ciência e Tecnologia de Fármacos e Medicamentos (INCT-INOFAR; http://www.inct-inofar.ccs.ufrj.br/), CCS, Universidade Federal do Rio de Janeiro, Cidade Universitária, Rio de Janeiro, RJ, Brazil
First published on 2nd November 2020
In December 2019, an infectious disease was detected in Wuhan, China, caused by a new pathogenic coronavirus, named SARS-CoV-2. It spread very rapidly, and on March 11th of 2020, the outbreak was declared a pandemic by the World Health Organization. Currently, effective treatment options remain limited. SARS-CoV-2 enzyme main protease (MPRO) plays a pivotal role in the viral life cycle, making it a putative drug target. In order to identify suitable hits to develop inhibitors with adequate antiviral properties, we explored the LASSBio Chemical Library employing multiple strategies of virtual screening. A fragment-based pharmacophore model enabled the identification of key interactions involved in the molecular recognition at the catalytic site of MPRO, namely, with amino acid residues His41, His163 and Glu166. Docking-based virtual screening was performed, leading to the identification of LASSBio-1945 (9), a new hit of MPRO, presenting an IC50 = 15.97 μM. This compound, an 1,3-benzodioxolyl sulfonamide, represents an interesting starting point for subsequent hit-to-lead optimization steps and, to the best of our knowledge, a new distinct chemotype for MPRO inhibition.
The new virus has been named SARS-CoV-2 because the RNA genome (GenBank ID: MN908947.3) is about 82% identical to that of SARS-CoV (GenBank ID: NC_004718.3);2 both viruses belong to clade b of the genus Betacoronavirus.3,4 SARS-CoV-2 spread very rapidly from China to all countries, and on March 11th of 2020, the outbreak was declared a pandemic by the World Health Organization (WHO).5
Current disease management is limited to social measures, such as social distancing, travel ban, and lockdown in many regions. Thus, there is an urgent need for the discovery of prevention and treatment strategies for COVID-19. It is acknowledged that the introduction of a vaccine would happen, in the earliest, at the beginning of 2021. While the discovery of a new drug should take even longer, the drug repositioning approach,6i.e. the use of an existing drug to treat COVID-19, seems the fastest strategy since these compounds have either regulatory approval as drugs or have already cleared safety studies.7,8
Although several known drugs are currently under investigation,7,9 there is a clear need to identify other therapeutic alternatives within the drug repurposing approach. As illustrated by strategies employed to treat HIV nowadays, such as the highly active antiretroviral therapy (HAART), the long-term management of endemic viruses normally includes the use of multiple antiretroviral drugs, interfering with different stages of the virus life-cycle, in an attempt to control infection and the appearance of resistance.10,11 Protease inhibition encompasses one of the strategies employed in HIV multi-therapy, and could be successfully mirrored in the treatment of COVID-19. SARS-CoV-2 presents a main protease (MPRO) that plays a pivotal role in mediating viral replication and transcription, making it an attractive drug target.12,13 Additionally, it has been demonstrated that the active site of MPRO remains highly conserved across different CoVs, suggesting the possibility of wide-spectrum anti-CoV drug design.14,15
In fact, some approaches towards the identification of antiviral agents acting as MPRO inhibitors have already been reported.15,16 In the context of drug repositioning, a combination of SARS-CoV-2 MPRO structure-based virtual screening and high-throughput screening of more than 10000 compounds (containing approved drugs, clinical trials, and other active substances) identified six compounds as promising MPRO inhibitors (IC50 range: 0.67–21.4 μM). Among these identified inhibitors, disulfiram and carmofur are FDA-approved drugs, whereas ebselen, tideglusib, shikonin, TDZD8 and PX-12 are currently under preclinical studies.16
In another study, a combined strategy employing structure-based drug design, virtual screening and high-throughput screening identified N3, a Michael acceptor inhibitor of MPRO of SARS-CoV and MERS-CoV, as a potential SARS-CoV-2 MPRO inhibitor. Corroborating the high conservation of the active site of MPRO across different CoVs, N3 was shown to form a covalent bond with and to be an irreversible inhibitor of SARS-CoV-2 MPRO.15
In the same study, ebselen was identified as a SARS-CoV2 MPRO inhibitor after a high-throughput screening approach. Ultimately, in a plaque-reduction assay with simian Vero cells infected with SARS-CoV-2, N3 and ebselen displayed antiviral and cell protection efficacy at EC50 values of 16.77 μM and 4.67 μM, respectively,15 demonstrating their antiviral potential against SARS-CoV-2.
In this work, we developed a screening strategy to identify MPRO ligands within the LASSBio Chemical Library (LCL),17 in order to provide a suitable chemical starting point to develop novel inhibitors with adequate antiviral properties. A fragment-based pharmacophore model was created to identify key interactions involved in the molecular recognition at the catalytic site of MPRO by analysing the fragments previously obtained by the Diamond Light Source researchers.18 Afterwards, molecular docking-based virtual screening on MPRO with the LCL was performed. Next, the information generated by the fragment-based pharmacophore model was employed to assist the analysis of the docking poses, thus refining our final ranking. Additionally, this contribution complements the work done by the Diamond Light Source researchers by focusing the analysis on the non-covalent fragments identified by them and, therefore, it is our hope to collaborate with the worldwide efforts to deliver a strategy that can enhance the success rate of attempts to provide novel therapeutic agents to treat COVID-19.
To build the FBPM, we analysed the 22 non-covalent fragments‡ (detailed in the ESI†) co-crystallized with MPRO by the Diamond research group.18 Considering that the vast majority of LCL compounds were designed as non-covalent ligands, the decision to use only non-covalent fragments was adopted.
Initially, the interactions and crystallographic binding poses of each fragment were individually analysed, aiming to identify if there were patterns of interaction that could highlight specific amino acid residues and/or types of interactions (Table S1, ESI†). Curiously, none of the 22 fragments interacts with MPRO through ionic interactions. After mapping out all the interactions, it became clear that residues His41, His163 and Glu166 interacted more frequently than the others. Glu166 was the higher frequency interacting residue, mainly performing hydrogen bonding by means of its backbone and also forming water-mediated interactions, with its side chain carboxylic acid oriented towards the exterior part of the active site (Fig. 1A).
Fig. 1 A) Fragment x0434 (1, magenta) at the MPRO active site and its interactions with His41, Phe140, His163 and Glu166. Hydrogen bonds are shown as red dashed lines. B) Fragments x0434 (1, magenta) and x0540 (2, cyan) superimposed at the MPRO active site. Slight structural changes and differences in the linker size resulted in significantly different binding modes. These images were generated using UCSF Chimera.22 |
Interestingly, His41 and His163 presented unique and distinct behaviours. His41 was the only residue identified in the active site able to perform π–π stacking interaction with molecular fragments, and it seems to prefer hydrophobic contacts rather than interactions of more polar nature. On the other hand, His163 only interacted through hydrogen bond formation with the fragments, mostly as a hydrogen bond donor. We could also observe that, in most cases, a “convenient” non-classical hydrogen bond with the backbone of the Phe140 residue occurred together with the interaction with His163 (Fig. 1A). This probably happened because of the specific orientation needed by the fragment to interact with His163 which, in turn, projected the electron-deficient hydrogen bonded to the carbon adjacent to the nitrogen to the carbonyl of the Phe140 backbone, favouring this interaction. As mentioned, the interaction with His163 seems to demand a specific orientation inside the MPRO site, because even slight structural changes among the fragments that interact with this residue result in different conformational binding orientations (Fig. 1B). Apparently, adopting the conformation that allows for this specific hydrogen bond formation is preferred, indicating that interacting in this fashion with His163 and Phe140 could be a pharmacophoric feature in the molecular recognition process. The way fragment x540 (2) binds corroborates this hypothesis since it orientated its cyclohexyl-urea subunit towards the exterior of the active site in order to adopt the most adequate orientation to allow the pyridine ring to interact properly with His163 instead of trying to occupy other parts of the active site to interact with additional residues at the expense of a not so ideal orientation to interact with His163 (Fig. 1B).
Another relevant observation was that fragments interacting with His41 did not interact with His163 and vice versa. The only exception was fragment x0434 (1), which interacted with both His41 and His163 (Fig. 1A). This indicated that there were two subgroups of fragments, and two clear distinct regions inside the active site of MPRO to be explored during the molecular design. Additionally, this signalled that in order to interact with both His41 and His163 a ligand would need to display an optimal distance between the respective interacting subunits. With this in mind, the fragments were also analysed regarding (i) the presence of subunits able to interact with His41 and His163; (ii) the size of the linker between these subunits, and (iii) the distance between the central atom of the linker and both subunits (Table S2, ESI†). This analysis suggested that in order to interact simultaneously with His41 and His163, the fragment needed to have a linker of no more than 3 atoms (approximately 2.50 Å) and the optimal distance between the central atom of the linker and the atom interacting as a hydrogen bond acceptor in the aromatic subunit occupying the His163 site was ca. 4.40 Å. The distance of 3.85 Å was the optimal between the central atom of the linker and the centroid of the aromatic subunit interacting with His41, while keeping the adequate orientation to interact with His163 (Table S2, ESI†). These findings corroborate why fragments x0434 (1) and x0540 (2) display such diverse binding orientations. Despite the flexible nature of the linker in 2, it is too long (4.46 Å – five atoms) to accommodate both rings inside the active site (Fig. 1B).
All these features resulted in the building of the 3D pharmacophore model based on residues His41, His163 and Glu166 as pharmacophoric points for the establishment of an energetically favoured binding complex (Fig. 2A). By molecular complementarity, a potential ligand would need to present a hydrophobic aromatic subunit to interact with His41 via π–π stacking, a second aromatic subunit bearing a hydrogen bond acceptor to interact with His163, and a linker that can interact with Glu166 through H-bonding, and that can also provide a distance in the range of 6–7 Å between the atom interacting as a hydrogen bond acceptor with His163 and the aromatic subunit interacting with His41 (Fig. 2B).
The fragment screening performed by the Diamond Light Source group is part of an international initiative called COVID Moonshot.23 After the fragments were published, the following actions of the initiative were crowdsourcing molecular designs inspired by these fragments, synthetizing and testing the most promising compounds submitted. Some of these results have already been released in their platform,24 which allowed us to evaluate the hypothesis of our FBPM against some experimental data from non-covalent MPRO inhibitors. These experimental data confirm our suggestion that the His163 residue interacting as a hydrogen bond donor is important for the molecular recognition process and formation of the binding complex. This is illustrated by the activities displayed by compounds TRY-UNI-714a760b-6 (3) (IC50 = 24.57 μM) and EDG-MED-0da5ad92-2 (4) (IC50 = 53.72 μM). The subtle insertion of a methyl group in position 6 of the 3-pyridine ring was able to promote a 2-fold increase in MPRO inhibition,25 probably due to the induction of a favourable pyridine ring conformation for the interaction with His163 (Fig. 3A). This conformational effect induced by the methyl group is confirmed by other related isomers, where the methyl group was introduced at other positions of the pyridine ring, and all were found to be inactive (Fig. 3B).
The preference of His41 for interacting with more hydrophobic subunits is another feature of the pharmacophore model in agreement with the experimental data released by the Moonshot initiative. It suggests a relationship between the increase in the lipophilic character of the aromatic ring that interacts with His41 and a higher MPRO inhibition (Fig. S2, ESI†). It was delightful and reassuring to acknowledge that some of the pharmacophore model's key interaction hypotheses were supported by the experimental data disclosed so far. The next step of this work consisted of applying the information provided by the pharmacophore model to the analysis of the virtual screening results, which will be better detailed in the discussion of the molecular docking results.
Along with rapid development of techniques for determination of biomacromolecules' structures, docking became an important approach of computer-aided drug design.31 Although docking is a well-established technique that contributes to the early drug discovery process, one of its flaws is the frequent false positives arising from ranking compounds based on biased scoring functions.32,33 Therefore, achieving a successful hit identification rate from a chemical library in a virtual screening campaign is also directly related to the combination of appropriate in silico strategies.34 In this context, the integration of both ligand and structure-based methods with visual inspection should be used to help the identification of a subset of compounds from a library to be tested on adequate pharmacological assays.35,36 Hence, ligand-based methods such as molecular similarity and substructure searches were implemented to complement molecular docking analysis.37
In this work, the GOLD (Genetic Optimization for Ligand Docking) program of version 5.8.1 (ref. 38) was employed for docking simulations. GOLD has four dimensionless fitness scores named GoldScore,39 ChemScore,40 ASP (the Astex Statistical Potential),41 and ChemPLP.42 Even though in each case the scale of the score is different, it is a guide as to how good the pose is, i.e. the higher the score, the better the docking result is likely to be. Given the fact that the scoring functions combined had better performance in redocking procedures when compared to each scoring function alone (Table S3, ESI†), rank-by-rank consensus scoring was also applied to aid the analysis of docking poses in this study.43
The developed pharmacophore model (Fig. 2) highlighted three residues mainly responsible for molecular recognition, i.e. His41, His163 and Glu166. These amino acid residues are located at two conserved pockets of the MPRO active site, on which His41 forms hydrophobic interactions, His163 acts as a hydrogen bond donor, and Glu166 acts either as a hydrogen bond donor or acceptor. This structural information was used as a criterion to analyse the docking poses of the compounds obtained from the molecular similarity search44 using the non-covalent co-crystalized fragments disclosed by the Diamond research group as queries (Scheme S1†). The established similarity coefficient cut-off of 0.4 led to the identification of LASSBio-1962 (8)45 and LASSBio-1945 (9)46 (Table 1), which were able to interact with the residues of the pharmacophore model. Additionally, the top 2% ranked LCL compounds were analysed according to the pharmacophore model, and compounds LASSBio-428 (10),47 and LASSBio-1615 (11)48 were selected.
The binding modes of the compounds selected by the molecular similarity search (8–9) illustrated the general molecular recognition pattern (Fig. 4) of the identified compounds in Table 1. The compounds occupied the pocket near His163, as depicted by the binding mode of LASSBio-1962 (8). In addition, LASSBio-1945 (9) was also able to occupy the pocket near His41. LASSBio-1962 (8) shares some degree of similarity with x0434 (1) and forms a H-bond with His163 using the quinoxaline system. However, probably due to the nature of the bicyclic ring, the distance for H-bonding with this residue is increased for 8 (2.1 Å), when compared to 1 (1.7 Å), which might also reflect the inability of the same linker to adopt a conformation similar to 1 and interact with Glu166 (Fig. 4A). However, the urea linker was able to form a bidentate H-bond with His41, suggesting that it could still bind to the active site. On the other hand, the docking pose of LASSBio-1945 (9) (Fig. 4B) suggests that the oxygen atom in the 1,3-benzodioxole ring mimics the hydrogen bond acceptor interaction made by the pyridine nitrogen atom of 1, with the His163 residue. Additionally, the n-propyl side chain occupies the so-called S2 hydrophobic pocket, next to His41. X-ray diffraction studies of previously described peptide inhibitors of MPRO show that isobutyl,15 cyclopropyl,49 and cyclohexyl50 aliphatic side chains are able to occupy this pocket, supporting the binding mode depicted in Fig. 4B. The hydrogen bonding interaction between Gln189 and the oxygen of the sulphonamide group of LASSBio-1945 (9) may also contribute to its molecular recognition.
Fig. 4 Docking poses of (A) LASSBio-1962 (8),45 (B) LASSBio-1945 (9),46 (C) LASSBio-428 (10),47 and (D) LASSBio-1615 (11).48 The structure of SARS-CoV-2 MPRO is shown as lines and surface, with pharmacophore model residues and additional predicted interacting residues shown as sticks (carbon atoms in grey). Docked compounds are shown as sticks (carbon atoms in green). Hydrogen bonds are shown as red dashed lines. These images were generated using UCSF Chimera.22 |
The binding mode of compounds selected from the top 2% illustrated that the structures which fitted in the His163 and His41 pockets scored higher. LASSBio-428 (10) and LASSBio-1615 (11) occupied both pockets with aromatic residues, of which 11 interacted with the 2-phenyl-imidazo[1,2-a]pyridine system, and 10 fitted the His163 pocket with its phenyl ring and the His41 pocket with the 3-trifluormethylaniline subunit. Even though these compounds were able to form H-bonds with amino acid residues in the binding site, none interacted with His163.
Due to the fact that the oxygen atom of the 1,3-benzodioxole moiety acted as a H-bond acceptor, interacting with the pharmacophoric residue His163, as it is depicted in the binding mode of LASSBio-1945 (9) (Fig. 4B), we performed further substructure-based searches in the chemical library (Table 2). Firstly, we searched for 1,3-benzodioxole derivatives, leading to the identification of LASSBio-1649 (13).51 Secondly, we defined pyridine and imidazole rings as queries based on pyridine co-crystallized fragments disclosed by Diamond Light Source (e.g.1, Fig. 1), and on co-crystallized imidazole inhibitors of SARS-CoV MPRO, in which the imidazole ring was able to interact with the conserved histidine residue equivalent to His163.52 In an attempt to identify additional chemotypes, a similarity search based on inhibitors of SARS-CoV MPRO, obtained from the scientific literature and patents available on the Integrity database, was also conducted (Table 2). After visual analysis based on the pharmacophore model (Fig. S3–S6, ESI†), LASSBio-1649 (13),51 LASSBio-1891 (14),53 and LASSBio-1600 (15)54 were selected as a result of searches using 1,3-benzodioxole, pyridine and imidazole subunits as queries, respectively. LASSBio-1652 (16)51 was selected from the similarity search which resulted in compound 17 (IC50 = 17 μM; MPRO SARS-CoV)55 as the nearest neighbour with a Tanimoto coefficient of 0.51 (Table 2).
All these compounds belong to the class of N-acylhydrazones (NAHs), and this is probably due to the nature of the LASSBio Chemical Library. In fact, the NAH functional group is the major chemotype studied in LASSBio56 and, consequently, it is present in approximately 50% of the LCL compounds. In the particular example of 1,3-benzodioxole, which was also selected in the similarity search (LASSBio-1652, 16), it is an outcome of the continuous studies based on the use of safrole, an abundant Brazilian natural product and the principal chemical constituent of sassafras oil (Ocotea pretiosa).57 In conclusion, these additional searches allowed the identification of another four compounds (13–16) for investigation of potential MPRO inhibition.
Compound | IC50 (μM) | Inhibition at 50 μM (%) |
---|---|---|
N.D. – not determined. | ||
LASSBio-428 (10) | N.D. | −10.63 |
LASSBio-1600 (15) | N.D. | −16.29 |
LASSBio-1615 (11) | N.D. | 7.53 |
LASSBio-1649 (13) | N.D. | −6.66 |
LASSBio-1652 (16) | N.D. | −12.83 |
LASSBio-1891 (14) | N.D. | −6.32 |
LASSBio-1945 (9) | 15.97 | 68.26 |
LASSBio-1962 (8) | N.D. | 0.77 |
As demonstrated in Table 3, this iteration of hit discovery allowed us to identify, among the eight compounds evaluated, compound LASSBio-1945 (9)46 as a MPRO inhibitor in the micromolar range (IC50 = 15.97 μM), which means a success rate of 12.5%.
While it was observed that the 1,3-benzodioxole subunit of LASSBio-1945 (9) could mimic the interaction pattern of the 3-amino-pyridinyl subunit of other MPRO inhibitors (Fig. 4B), it was intriguing to note that compounds LASSBio-1649 (13) and LASSBio-1652 (16), also bearing the 1,3-benzodioxole subunit, were inactive. One of the reasons might be the size of the linker of both compounds. The distances between the carbonyl and the imine carbon were 2.9 Å and 3.6 Å for LASSBio-1649 (15) and LASSBio-1652 (18), respectively.
As already discussed in the pharmacophore model section, the linker should not have more than 3 atoms (ca. 2.50 Å) in order to be able to interact simultaneously with His41 and His163. Probably, NAH derivatives 13 and 16 behave in the same way as fragment x0540 (2) (Fig. 1B), i.e. in order to accommodate the 1,3-benzodioxole moiety to interact with His163, they adopt a binding pose where the aromatic subunit supposed to interact in the His41 pocket is oriented towards another position within the active site, resulting in the loss of that additional, relevant interaction. This size of the linker issue could also be one of the factors contributing to the inactivity of compounds LASSBio-1891 (14) and LASSBio-1600 (15), since they also have the NAH linker, despite 15 being able to interact with His41 via hydrophobic interactions and with His163 acting as a hydrogen bond acceptor (Fig. S5, ESI†). The results of LASSBio-1962 (8) support the hypothesis that the interactions with both His163 and His41 are important for activity. Even though the docking pose suggests that 8 was able to interact with His163, LASSBio-1962 (8) was not able to form aromatic interactions with His41. In the cases of LASSBio-428 (10) and LASSBio-1615 (11), the absence of the H-bond interaction with His163 may be the reason for the inactivity.
In summary, these results uncover a potential bioisosteric relationship between the 1,3-benzodioxole moiety present in LASSBio-1945 (9) and the 3-amino-pyridinyl moiety displayed by several fragments, such as 1, TRY-UNI-714a760b-6 (3) (IC50 = 24.57 μM) and EDG-MED-0da5ad92-2 (4) (IC50 = 53.72 μM). The experimental results suggest that the interactions with the residues His41 and His163 of the pharmacophore model are essential for molecular recognition. In addition, the interaction of LASSBio-1945 (9) with Gln189 suggests that perhaps the interaction with Glu166 is not essential if the linker interacts via H-bonding with another close residue. The superimposition of LASSBio-1945 (9) with fragment x0434 (1) illustrates how the n-propyl chain occupies the pocket next to His41, and a possible non-classical hydrogen bond formed by the methylene group of 1,3-benzodioxole, which might be equivalent to the one pointed out for x0434 (1) (Fig. 5). Moreover, LASSBio-1945 (9) is a MPRO inhibitor with a unique structural pattern and a suitable candidate for subsequent optimization steps through medicinal chemistry-oriented SAR cycles.
Fig. 5 Superimposition of the docking pose of LASSBio-1945 (9, green, non-polar hydrogen atoms visible)46 and x0434 (1, magenta). The structure of SARS-CoV-2 MPRO is shown as lines and surface, with pharmacophore model residues and additional predicted interacting residues shown as sticks (carbon atoms in grey). Putative interactions are shown as red dashed lines. Sub-pockets S1–S4 are indicated in orange. This image was generated using UCSF Chimera.22 |
CoV | Coronavirus |
FBDD | Fragment-based drug discovery |
FBPM | Fragment-based pharmacophore model |
FDA | Food and Drug Administration |
GOLD | Genetic optimization for ligand docking |
HAART | Highly active antiretroviral therapy |
HIV | Human immunodeficiency virus |
MPRO | Main protease |
MERS-CoV | Middle East respiratory syndrome coronavirus |
MW | Molecular weight |
SARS-CoV | Severe acute respiratory syndrome coronavirus |
WHO | World Health Organization |
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0md00282h |
‡ When this study was performed, there were 22 fragments released. By the time the XRD data of the 23rd fragment was made available, we had the analysis completed. |
This journal is © The Royal Society of Chemistry 2021 |