Marisa G. Santibáñez-Morána,
Edgar López-Lópezb,
Fernando D. Prieto-Martíneza,
Norberto Sánchez-Cruza and
José L. Medina-Franco*a
aDIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Mexico City, Mexico. E-mail: medinajl@unam.mx; jose.medina.franco@gmail.com; Tel: +52 (55) 5622-3899, ext. 44458
bDepartment of Pharmacology, Center of Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV), Mexico City, Mexico
First published on 1st July 2020
The pandemic caused by SARS-CoV-2 (COVID-19 disease) has claimed more than 500000 lives worldwide, and more than nine million people are infected. Unfortunately, an effective drug or vaccine for its treatment is yet to be found. The increasing information available on critical molecular targets of SARS-CoV-2 and active compounds against related coronaviruses facilitates the proposal (or repurposing) of drug candidates for the treatment of COVID-19, with the aid of in silico methods. As part of a global effort to fight the COVID-19 pandemic, herein we report a consensus virtual screening of extensive collections of food chemicals and compounds known as dark chemical matter. The rationale is to contribute to global efforts with a description of currently underexplored chemical space regions. The consensus approach included combining similarity searching with various queries and fingerprints, molecular docking with two docking protocols, and ADMETox profiling. We propose compounds commercially available for experimental testing. The full list of virtual screening hits is disclosed.
SARS-CoV-2 has a complex architecture, and as happens with different viruses, there are several proteins involved in viral internalization and replication. The life cycle of SARS-CoV-2 starts with the viral recognition of its spike protein by a cellular receptor (ACE receptor and TMPRSS2). After that, the internalization and uncoating process is mediated by membrane proteins. Once into the host cell, RNA replication, and biosynthesis of viral polypeptides are carried out (RdRp – ribosomes). Finally, the processing of precursors proteins by the main protease (3CLpro or Mpro) and the assembly of these, contributes to the generation of new viruses.3–5 These main targets offer a venue for the development of new treatments via rational drug design. Examples include spike protein, RNA polymerase, and chymotrypsin-like cysteine protease (3CLpro or Mpro) which are presented in Fig. 1.3–5 Of these, the main protease (Mpro) is a promising target for the design and proposal of new therapies due to the lack of homologous proteins in humans.6 Also, its selective inhibition would take advantage of the natural life cycle of SARS-CoV-2, avoiding its replication and dissemination. Several research groups are actively pursuing Mpro as a molecular target to identify drug candidates for the treatment of COVID-19.
Computational methods represent an approach with the power of efficiently filter large and diverse compound libraries to select potential candidates for drug development.7,8 Recently published works show a tendency towards drug repurposing and to search structurally different libraries (e.g., with broad scaffold diversity), and natural products.9–13 Moreover, the search for novel compounds commercially available or with the possibility of being synthesized has had a vital rebound (e.g., screening part or the entire ZINC database).9,14–16 Table 1 summarizes representative examples of virtual screening (VS) studies directed to different molecular targets, including SARS-CoV-2 Mpro. Most of these efforts relied on structure-based drug design (SBDD). Few others include similarity searching and quantitative structure–activity relationship (QSAR) modeling.17 In this sense, there are many compounds suggested by computational methods that could be evaluated quickly with in vitro techniques. However, the use of computational consensus methodologies could improve the performance of each technique.
Target | Experimental methods | Libraries | Compounds screened/outcome | Ref. |
---|---|---|---|---|
a Computational hits.b Active hits. | ||||
Mpro | Deep docking | ZINC 15 | 1.3 billion/1,000a | 9 |
Mpro | Pharmacophore model, molecular docking, and dynamics | Marine natural products | 14064/17a | 10 |
Mpro | Pharmacophore screening and molecular docking | ZINC | 50000/10a | 15 |
Spike protein | Homology modeling and molecular docking | FDA | 3300/12a | 18 |
Mpro, PLpro and RdRp | Homology modeling, molecular docking, and dynamics | DrugBank and traditional Chinese medicine | 1973/57a | 11 |
ACE2 | Molecular docking | Literature compilation (natural products) | —/5a | 12 |
Mpro | Molecular docking | Literature compilation (natural products) | 80/8a | 13 |
Mpro | Molecular docking | FDA | 486/20a | 19 |
Mpro | Molecular docking, and dynamics | ZINC | 606 million/12a | 20 |
Mpro | Similarity search and QSAR modeling | DrugBank (marketed, withdrawn, experimental, and investigational) | 9615/41a | 17 |
Mpro | Molecular docking and dynamics | DrugBank (approved and drug candidates in clinical trials) | 2201/5a | 21 |
Mpro and TMPRSS2 | Homology modeling and molecular docking | ZINC | 34500/8a | 14 |
Mpro | Induced fit docking | In-house | 10000/6b | 22 |
The goal of this work is to propose active compounds against Mpro from SARS-CoV-2 and related coronaviruses. One of the novelties of the present study relies on the probed chemical space: food chemicals and molecules in the Dark Chemical Matter (DCM), which to the best of our knowledge, have been explored for SARS-CoV-2 on a limited basis. Thus, the rationale was to expand the search of chemical space and suggest molecules for experimental screening. Active compounds could be later optimized to increase activity. As a screening strategy, we started with similarity searching using different fingerprints to pre-select compounds using data fusion strategies. Selected compounds from similarity searching were screened with molecular docking with two different software. The final selection of computational hits was based on consensus scoring, information of protein–ligand contacts, and the ADMETox (absorption, distribution, metabolism, excretion, and toxicity) profile of compounds. Additional criteria used to guide the selection of hit candidates for testing included predictions by machine learning (ML) models for SARS-CoV-2 activity developed by Collaborations Pharmaceuticals, Inc and freely available.23
Dataset | Content overview and sizea | Rationale | Ref. |
---|---|---|---|
a After data curation. | |||
Actives | N3, alpha-ketoamides 11a, 11r, and 11s, carmofur, cinaserin, disulfiram, ebselen, PX12, shikonin, and tideglusib | Reference compounds used in docking to compare docking scores and predicted binding modes | 22 and 27 |
FooDB | 22880 compounds | Large library of food chemicals. Smaller food chemical data sets have been screened | 28 |
Dark chemical matter (DCM) | 139329 compounds | Large screening library underexplored. Likelihood to shade light into the darkness of the COVID-19 pandemic | 29 |
ZINC (top-ranked hits) | 10 top-ranked virtual screening hits of ZINC using deep docking/Glide and SARS-CoV-2 Mpro (PDB ID: 6LU7) | Further consensus of published computational hits with other docking programs (Vina and MOE) | 9 |
One of the screening databases was the public food chemical database (FooDB) with 23883 compounds.28 The chemical diversity and coverage of chemical space of FooDB have been reported revealing that food chemicals are structurally diverse and have, in general, large molecular complexity.30 DCM was the other screening database. DCM is a collection of 139352 compounds that had shown no activity when tested in at least 100 screening assays.29 Even though DCM has a low activity profile against common targets, the rationale of screening this collection was to explore regions in chemical space currently overlooked. Moreover, DCM has yielded active molecules in other assays31,32 probing the value of screening this region of the chemical space. The structures of FooDB and DCM were curated and standardized, employing RDKit, CDK (Chemistry Development Kit), and ChemAxon tools. The largest component of molecules with more than one fragment was retained, compounds containing an atom type other than H, C, O, N, S, P, F, Cl, Br, I, B, Si, and Se were removed. The tautomer with the lowest energy for each remaining compound was generated.
Active compounds from the study of Jin et al.22 were used as a reference. These were the peptide-like inhibitor N3, carmofur, cinaserin, disulfiram, ebselen, PX12, shikonin, tideglusib, and alpha-keto amides (11a, 11r, 11s).27 Lopinavir, nelfinavir, and ritonavir were other reference compounds for the molecular docking performed in AutoDock Vina.
To identify additional potential hit compounds, we included the top 10 ranked virtual screening hits from the study of Ton et al.9 Authors of that work screened the ZINC database against the SARS-CoV-2 Mpro (PDB ID 6LU7) using the docking program Glide. The rationale of using this set was to explore further the predicted profile of top-ranked compounds using different docking programs (i.e., Vina and MOE, vide infra).
In the first virtual screening approach (Fig. 2), nelfinavir, lopinavir, and ritonavir were used as independent queries (vide infra). The molecular similarity between each of the queries and each of the molecules in FooDB and DCM was estimated with the Tanimoto coefficient.33 The compounds with a Tanimoto coefficient higher than the median plus two standard deviations were considered as a hit. The molecules labeled as hits according to more than one molecular fingerprint (consensus hits), were selected. The consensus hits for the three queries were additionally analyzed by molecular docking.
In the second approach (Fig. 2), 1052 compounds with potential affinity for SARS-CoV-2 Mpro or SARS-CoV Mpro were selected from published molecular docking studies9,19,27,34,35,59 and used as queries. The structure file with the chemical structures of the 1052 compounds is available in the ESI.† Mean-fusion similarity scores and max-fusion similarity scores were determined using the eight molecular fingerprints and the Tanimoto coefficient.36 Compounds with max-fusion similarity scores and mean-fusion similarity scores higher than the median plus two standard deviations for more than one fingerprint were selected as consensus hits and evaluated by molecular docking.
The molecular similarity analyses were generated in KNIME employing the RDKit node for molecular fingerprints generation and the CDK node for the similarity calculation.37,38
Docking with Autodock Vina was conducted with two crystallographic structures obtained from the Protein Data Bank (PDB),41 namely, SARS-CoV-2 Mpro (PDB ID 6LU7)22 and the structurally related SARS-CoV Mpro (PDB ID 5N5O).42 Both structures are co-crystallized with a peptide-like (N3) and an alpha-ketoamide (11s) inhibitor, respectively. The crystal structures were prepared in Autodock Tools. The grid-box was constructed based on the binding site of the alpha-ketoamide inhibitors 11a and 11s. The ligands were normalized, their clean 3D form was generated, hydrogens were added, and molecules were optimized using the Universal Force Field (UFF) in KNIME. The results were visualized in PyMol (version 2.3).
Induced fit docking protocol for the Mpro (PDB ID 6LU7) of SARS-CoV-2 was carried out with MOE software v.2019. The protein was prepared with the “Quick prepare” tool using the parameters assigned by the PFROSST force field. The peptide-like inhibitor N3 was removed, and their binding site was used to direct the docking. Triangle matcher method was refined with the induced fit protocol, and the other parameters were established by default. This protocol was validated using experimental information recently published by Jin et al.22 The binding poses were successfully reproduced. The binding scores showed a correlation of 0.703 with the in vitro inhibition values of the data set.
After the ligands were prepared (as described in the Methods Section 2.3), 143 consensus hits from FooDB were found to be highly similar to nelfinavir, lopinavir, and ritonavir (i.e., with similarity values above than the median plus two standard deviations). From the 143 consensus hits, 40 compounds with drug-like properties were selected for more analyses. Five hundred compounds were selected from the DCM database with significantly high Tanimoto similarity values to nelfinavir, lopinavir, and ritonavir. DCM compounds are constantly tested in HTS assays, and therefore, they were considered to have suitable physicochemical properties for drug development. In this sense, it is not surprising that a more significant number of consensus hits for the three drugs were found in DCM, considering that the molecular and physicochemical properties of DCM do not significantly differ from approved drugs. In contrast, FooDB was not assembled to be “drug-like.”
A small dataset of 1052 compounds with predicted affinity to SARS-CoV-2 Mpro was assembled to broaden the search of potential Mpro inhibitors. Although these alternative reference compounds are potentially (but not confirmed) active, it has been suggested that they can increase the likelihood to identify active molecules. Such an approach is reminiscent of what has been described as “turbo-similarity searching”.58 As more data becomes available, a more chemically diverse and larger set could be integrated. Meanwhile, the top hits reported in six peer-reviewed molecular docking studies were included.9,19,27,34,35,59 After ligand preparation, 178 and 174 consensus hits from FooDB and DCM were recovered, respectively. Significant hits were found for five of the eight molecular fingerprints, highlighting the advantages of using multiple molecular fingerprints.60
Four compounds were overlapping consensus hits from both similarities searching methods. DBB13044 and DBB18117 from FooDB, and DCM33835 and DCM97265 from DCM database.
The total number of consensus hits further analyzed by molecular docking and ADMETox in silico profiling was 888 compounds (including stereoisomers).
Fig. 3 shows the predicted binding mode of representative hits compounds with Mpro. As discussed hereunder in the Section 3.4 Hit Selection (vide infra), the selected hit compounds shown in Fig. 3 had favorable docking scores with Vina and MOE and had at least one interaction with the catalytic residues His41, Cys145 and/or Glu166 (key interactions reported).22 According to the docking models, other important key interactions were observed. DBB2790 makes Pi–H interactions with sidechain of His 41, H-bond interaction with the sidechain of Cys 145 and H-bonds interactions with the sidechain and backbone of Glu 166; DCM78683 makes H-bond interactions with the sidechain of Asn 142 and Cys 145, and DCM111769 makes Pi–H interactions with Glu 166. These proposed compounds are predicted to preferentially bind to the P1, P2, and P3 regions.
Fig. 3 Binding modes of three selected hits within SARS-CoV-2 Mpro (PDB ID 6LU7) as predicted by Molecular Operating Environment. |
A literature survey revealed that the VS hit DBB2790 (Fig. 3) has a high structural similarity to compound GC373 (a molecule with nanomolar activity against Mpro from SARS-CoV-2).61 In 2013 Kim et al. reported GC373 as an inhibitor of Mpro from feline coronavirus.62 Moreover, the protein–ligand interactions of both compounds and Mpro are similar. These observations support the potential antiviral activity of DBB2790.
Group | Number of compounds | Commercial availabilitya | In silico safety criteriab | Hydrogen bonds with H41 or C145 | Active according to machine learning |
---|---|---|---|---|---|
a Compounds reported as “in-stock” in the ZINC database were considered commercially available.b Compounds that do not have PAINS alerts, do not pass through the BBB, and are predicted to not inhibit CYP1A2, CYP2C19, CYP2C9, CYP2D6 or CYP3A4. | |||||
1 | 41 | Available | Safe | Present | Active/inactive |
2 | 10 | Available | Safe | Not present | Active |
Available | Not safe | Present | Active | ||
3 | 34 | Not available | Safe | Present | Active/inactive |
4 | 20 | Not available | Safe | Not present | Active |
Not available | Not safe | Present | Active |
Group 1 includes commercially available compounds that meet our safety criteria (based on the predictions of SwissADME), i.e., they do not have PAINS alerts, do not pass through the BBB, and do not inhibit CYP1A2, CYP2C19, CYP2C9, CYP2D6 or CYP3A4. The molecules in this group are predicted to form hydrogen bonds with at least one of the catalytic residues of PDB ID 6LU7. Table 4 summarizes the 41 molecules that fell into this top priority group.
Set | ID | ZINC ID | Vina's score 5N5O kcal mol−1 | MOE's score 6LU7 kcal mol−1 | GIa absorption | Pgpb substrate | Alic log_S | Ali class | Lipinski violations | Brenk violations | Bioavailabilityd |
---|---|---|---|---|---|---|---|---|---|---|---|
a GI gastrointestinal.b Pgp P-glycoprotein.c Ali topological method implemented from Ali J. et al. 2012.63d Probability that the compound will have F > 10%.e Compounds that do not violate any of the following rules: Lipinski, Ghose, Veber, Egan, and Muegge.f Compounds predicted to be active by the ML model. | |||||||||||
foodb_mfsm | DBB9450 | 169676920 | −6.6 | −10.9 | Low | Yes | −8.39 | Poorly soluble | 3 | 4 | 0.17 |
foodb_mfsm | DBB5554 | 85545908 | −7.9 | −10.9 | Low | Yes | −6.76 | Poorly soluble | 3 | 2 | 0.17 |
foodb_mfsm | DBB2790 | 4217536 | −7.8 | −9.3 | Low | Yes | −6.4 | Poorly soluble | 3 | 3 | 0.17 |
dcm_ch | DCM110214 | 34805301 | −7.4 | −9.2 | Low | Yes | −2.7 | Soluble | 1 | 1 | 0.55 |
dcm_ch | DCM122034 | 15990331 | −7 | −8.9 | High | Yes | −3.55 | Soluble | 0 | 1 | 0.55 |
dcm_ch | DCM73598 | 8918473 | −7.2 | −8.7 | Low | Yes | −4.01 | Moderately soluble | 1 | 1 | 0.55 |
foodb_mfsm | DBB2455 | 53057130 | −7.6 | −8.6 | Low | Yes | −4.76 | Moderately soluble | 1 | 1 | 0.55 |
dcm_ch | DCM2279 | 38144961 | −6.8 | −8.5 | Low | Yes | −3.84 | Soluble | 1 | 1 | 0.55 |
dcm_ch | DCM82216 | 4270581 | −7.1 | −8.3 | High | Yes | −2.36 | Soluble | 0 | 1 | 0.55 |
dcm_ch | DCM55533 | 8917865 | −6.4 | −8.3 | High | Yes | −1.82 | Very soluble | 0 | 1 | 0.55 |
dcm_ch | DCM119353 | 9409555 | −7.8 | −8.2 | Low | Yes | −4.18 | Moderately soluble | 0 | 0 | 0.56 |
dcm_ch | DCM65267 | 100771995 | −6.2 | −8.2 | Low | Yes | −1.75 | Very soluble | 0 | 0 | 0.55 |
foodb_mfsm | DBB13825 | 4228235 | −7.4 | −8.1 | Low | No | 0.85 | Highly soluble | 2 | 4 | 0.17 |
dcm_ch | DCM131779 | 9159501 | −6.4 | −8 | Low | Yes | −3.37 | Soluble | 1 | 1 | 0.55 |
dcm_ch | DCM65270 | 100778159 | −7.2 | −7.9 | High | Yes | −0.63 | Very soluble | 0 | 0 | 0.55 |
dcm_ch | DCM82831 | 9109751 | −7.8 | −7.8 | Low | No | −2.37 | Soluble | 0 | 1 | 0.55 |
foodb_mfsm | DBB13483 | 5283951 | −6.3 | −7.8 | High | No | −3.7 | Soluble | 0 | 2 | 0.55 |
foodb_mfsm | DBB13002 | 2005305 | −7.3 | −7.8 | Low | No | −3.27 | Soluble | 2 | 4 | 0.11 |
foodb_mfsm | DBB14163 | 8577218 | −7.4 | −7.7 | Low | No | −2.11 | Soluble | 2 | 2 | 0.11 |
dcm_ch | DCM131783 | 15954557 | −6.9 | −7.7 | High | Yes | −2.24 | Soluble | 0 | 1 | 0.55 |
dcm_ch | DCM93255e | 32980237 | −7.2 | −7.7 | High | No | −3.22 | Soluble | 0 | 1 | 0.55 |
foodb_mfsm | DBB13917 | 2036915 | −7.5 | −7.7 | Low | No | −2.74 | Soluble | 2 | 2 | 0.11 |
dcm_mfsm | DCM116923e | 2970717 | −6.5 | −7.7 | High | Yes | −2.54 | Soluble | 0 | 1 | 0.55 |
dcm_ch | DCM10478 | 4083870 | −6.6 | −7.6 | Low | No | 0.02 | Highly soluble | 1 | 3 | 0.55 |
dcm_ch | DCM28770 | 100778693 | −6.7 | −7.6 | High | Yes | −2.52 | Soluble | 0 | 0 | 0.55 |
dcm_ch | DCM33486e | 1181094 | −6.6 | −7.5 | High | Yes | −3.62 | Soluble | 0 | 1 | 0.55 |
dcm_ch | DCM30682e | 1577795 | −6.4 | −7.5 | High | No | −3.76 | Soluble | 0 | 2 | 0.55 |
dcm_ch | DCM110206e | 12652624 | −7.2 | −7.5 | High | Yes | −4.15 | Moderately soluble | 0 | 2 | 0.55 |
dcm_mfsm | DCM91011e | 6754750 | −7.7 | −7.4 | High | No | −1.49 | Very soluble | 0 | 1 | 0.55 |
foodb_mfsm | DBB13919 | 4228265 | −7.7 | −7.4 | Low | No | −1.97 | Very soluble | 2 | 2 | 0.17 |
foodb_mfsm | DBB17132e | 20431033 | −6.2 | −7.1 | High | No | −1.7 | Very soluble | 0 | 2 | 0.55 |
dcm_ch | DCM131782e | 2126038 | −7.1 | −7.1 | High | No | −0.08 | Very soluble | 0 | 1 | 0.55 |
dcm_mfsm | DCM71724 | 18056800 | −6.2 | −7.1 | Low | Yes | −4.37 | Moderately soluble | 0 | 2 | 0.55 |
dcm_mfsm | DCM94188e | 18143600 | −7.1 | −6.9 | High | No | −2.63 | Soluble | 0 | 0 | 0.55 |
foodb_mfsm | DBB20185 | 2242693 | −6.1 | −6.6 | Low | No | 0.98 | Highly soluble | 0 | 2 | 0.55 |
foodb_mfsm | DBB17114 | 4090721 | −7 | −6.5 | High | No | −1.38 | Very soluble | 0 | 2 | 0.55 |
foodb_mfsm | DBB18961 | 4321512 | −6.8 | −6.5 | Low | No | −0.96 | Very soluble | 0 | 0 | 0.55 |
foodb_mfsm | DBB18947 | 1303441 | −6.1 | −6.1 | High | No | −0.42 | Very soluble | 0 | 0 | 0.55 |
foodb_mfsm | DBB19736 | 2040854 | −5.4 | −6.1 | High | No | 2.05 | Highly soluble | 0 | 0 | 0.55 |
foodb_mfsm | DBB19719 | 1532770 | −5.6 | −5.9 | High | No | 1.67 | Highly soluble | 0 | 0 | 0.55 |
foodb_mfsm | DBB21857e,f | 895813 | −5.8 | −5.6 | High | No | −1.75 | Very soluble | 0 | 0 | 0.56 |
Group 2 comprises ten commercially available compounds that are predicted to be active by ML, but they violate one of the other two criteria. They can meet our safety criteria and do not form hydrogen bonds with the catalytic residues. Else, they can form hydrogen bonds with the catalytic residues but do not meet our safety criteria.
Group 3 consists of 34 molecules that are not commercially available but meet the safety criteria and form hydrogen bonds with at least one of the catalytic residues. These compounds would be suited for synthesis and testing.
Group 4 contains 20 molecules that are not commercially available and are predicted to be active by ML. However, they do not meet the safety criteria or do not form hydrogen bonds with the catalytic residues. According to our classification, compounds in this group would have the lowest priority for acquiring (synthesizing since they are not commercially available) and testing. Compounds that do not fall into any of these four groups were considered as non-priority for acquisition.
Table 4 summarizes the in silico profile of representative hit compounds selected for experimental validation.
Table 5 summarizes the information of 18 compounds listed in group 1 from FooDB with their corresponding IDs and annotated sources. Interestingly, some of the selected hits that were structurally similar to potential Mpro inhibitors were from endogenous sources. For instance, angiotensin II (DBB9450) and angiotensin IV (DBB5554) (a degradation product) were predicted as binders of the active site of SARS-CoV-2 Mpro. Key interactions predicted were hydrogen-bonds with His41, Ser46, Cys145, Gln189 (DBB9450) and Thr26, Met49, Cys145, and Glu166 (DBB5554). Angiotensin II (ANG-II) is an octapeptide hormone product of angiotensin I's cleavage by the angiotensin-converting enzyme (ACE). ANG-II binds to AT1 and AT2 receptors; the activation of AT1 receptors by ANG-II induces vasoconstriction, vasopressin and aldosterone release, thirst, renal sodium reabsorption, angiogenesis, vascular aging, and inflammation. ANG-II can be converted to angiotensin 1–7 by the angiotensin-converting enzyme II (ACE2). The action of aminopeptidase A and aminopeptidase N produces angiotensin III and angiotensin IV, respectively.
IDs | FooDB annotation |
---|---|
DBB9450/FDB022383 | Angiotensin II, endogenous |
DBB5554/FDB022385 | Angiotensin IV |
DBB2790/FDB023765 | Tetragastrin, endogenous |
DBB2455/FDB023767 | Morphiceptin, endogenous |
DBB13825/FDB031192 | Tetrahydrofolate |
DBB13483/FDB013079 | Neotame, artificial sweetener |
DBB13002/FDB022600 | 5-Methyltetrahydrofolic acid (5-MTHF) |
DBB14163/FDB014504 | Folic acid |
DBB13917/FDB022702 | Aminopterin |
DBB13919/FDB022395 | Dihydrofolic acid |
DBB17132/FDB028374 | Phenylbutyrylglutamine, metabolite of phenylbutyrate |
DBB20185/FDB003618 | Gamma-L-glutamyl-L-phenylalanine, soft-necked garlic |
DBB17114/FDB029352 | Indole acetyl glutamine, endogenous |
DBB18961/FDB023789 | N4-Acetylcytidine, endogenous |
DBB18947/FDB022917 | 5-Methyldeoxycytidine (5-mdc) |
DBB19736/FDB012937 | Carnosine 44A |
DBB19719/FDB022217 | Homocarnosine, metabolite |
DBB21857/FDB022212 | Hydroxyphenylacetylglycine, endogenous human metabolite |
Angiotensin 1–7 has opposite actions to ANG-II. Because ACE2 mediates the entry of SARS-CoV-2 to the host cells and ACE2 activity may be downregulated after virus infection, the accumulation of ANG-II could be linked to the development of severe symptoms of COVID-19 disease. If Mpro inhibitors are structurally similar to ANG-II, their potential binding affinity for the active site of ACE2 should be evaluated. Some studies have assessed the ability of ACE2 inhibitors to prevent SARS-CoV from entering into the cells.64 However, the inhibition of the ACE2 function could cause overaccumulation of ANG-II and promote its undesired effects.
Nonetheless, probably, DCM compounds may not elicit a dual inhibition of SARS-CoV-2 Mpro and ACE2, considering that these molecules had shown no activity against common targets evaluated in HTS assays.
Food folates like 5-MTHF, folic acid, dihydrofolic acid, and tetrahydrofolate (Table 5) were also among the compounds in the top priority group with observed hydrogen bonds to the catalytic residues of the SARS-CoV-2 Mpro, and favorable docking scores (below −7.4 kcal mol−1). Folates are cofactors in many one-carbon transfer reactions, including nucleotide synthesis for DNA and RNA synthesis, interconversion of serine and glycine, methionine generation and methylation of histones, DNA, proteins, phospholipids, and neurotransmitters. Folate deficiency has been linked to neural tube defects, brain dysfunction, coronary heart disease, and increased risk of colorectal and breast cancer.65 Since mammalian cells cannot synthesize de novo folate, naturally occurring food folates and synthetic folic acid are used in dietary supplements and fortified food.
Nevertheless, recent studies showed that a high intake of folic acid might be associated with a risk of developing leukemia and other conditions such as cancer, arthritis, insulin resistance, and masking deficiency of vitamin B12.66 Thus, the implications of low and high plasma levels of folates in COVID-19 patients must be evaluated. Our results suggest that folates could inhibit SARS-CoV-2 Mpro, but their activity in in vitro and in vivo assays remains to be confirmed. To broaden our knowledge of the impact of a healthy diet, and the specific mechanisms through which food chemicals participate in the progression of COVID-19 disease could be a simple approach for the prevention and combat of the current pandemic.
Intriguingly, aminopterin (DBB13917), a folic acid analog that inhibits the dihydrofolate reductase enzyme was also a potential Mpro inhibitor. Aminopterin is one of the so-called antifolates that interfere with folate metabolism and in turn nucleotide synthesis. Currently, an aminopterin analog with less toxic effects, methotrexate, is under clinical trials for the treatment of COVID-19 disease (NCT04352465). Methotrexate is an immunosuppressant used in the treatment of cancer and inflammatory conditions; it is often concurrently administered with folic acid.
Footnote |
† Electronic supplementary information (ESI) available: Excel file with ten worksheets that report all similarity values, docking scores, and ADMETox profile of the hit compounds outlined in Fig. 2. Structure file of the 1052 queries used for the similarity searching. See DOI: 10.1039/d0ra04922k. |
This journal is © The Royal Society of Chemistry 2020 |