Investigating the structure–activity relationship of marine natural polyketides as promising SARS-CoV-2 main protease inhibitors

Since its first report in December 2019, the novel coronavirus virus, SARS-CoV-2, has caused an unprecedented global health crisis and economic loss imposing a tremendous burden on the worldwide finance, healthcare system, and even daily life. Even with the introduction of different preventive vaccines, there is still a dire need for effective antiviral therapeutics. Nature has been considered as the historical trove of drug discovery and development, particularly in cases of worldwide crises. Herein, a comprehensive in silico investigation of a highly focused chemical library of 34 pederin-structurally related marine compounds, belonging to four polyketides families, was initiated against the SARS-CoV-2 main protease, Mpro, being the key replicating element of the virus and main target in many drugs development programs. Two of the most potent SARS-CoV-2 Mpro co-crystallized inhibitors, O6K and N3, were added to the tested database as reference standards. Through molecular docking simulation, promising compounds including Pederin (1), Dihydro-onnamide A (11), Onnamide C (14), Pseudo-onnamide A (17), and Theopederin G (29) have been identified from different families based on their superior ligand–protein energies and relevant binding profiles with the key Mpro pocket residues. Thermodynamic behaviors of the identified compounds were investigated through 200 ns all-atom molecular dynamics simulation illustrating their significant stability and pocket accommodation. Furthermore, structural activity preferentiality was identified for the pederin-based marine compounds highlighting the importance of the terminal guanidine and cyclic hemiacetal linker, and the length of the sidechain. Our findings highlight the challenges of targeting SARS-CoV-2 Mpro as well as recommending further in vitro and in vivo studies regarding the examined marine products either alone or in combination paving the way for promising lead molecules.


Introduction
The coronavirus disease 2019 (COVID-19) outbreak declared as a pandemic by World Health Organisation (WHO) in March 2020 is caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), a newly identied coronavirus, which was rst recognized in Wuhan, China. The global pandemic of SARS-CoV-2 is still ongoing. Although effective vaccines and vaccination programs are underway, there is an urgent need to identify new effective therapeutic agents to treat SARS-CoV-2 infection. SARS-CoV-2 is the seventh CoV capable of infecting humans (HCoV), but the rst and only HCoV with pandemic potential. 1 The previous six human pathogenic HCoVs are HCoV-229E, HCoV-OC43, severe acute respiratory syndrome coronavirus (SARS-CoV), HCoV-HKU1, HCoV-NL63, and Middle East respiratory syndrome (MERS-CoV). 1,2 SARS-CoV-2 is a 100 nanometer in diameter pleomorphic circularshaped protein complex that contains approximately 100 units of 10 nanometer length trimeric spike proteins on each virion. 3,4 The structure SARS-CoV-2 consists of different structural and non-structural proteins to include spike (S) glycoprotein, envelop (E), membrane (M), nucleocapsid (N), hemagglutinin esterase dimer (HE), and non-structural proteins (NSP). 5 The SARS-CoV-2 main protease (Mpro) is a non-structural protein that proteolytically cuts the overlapping pp1a and pp1ab polyproteins resulting in functional proteins that are required to mediate viral replication and transcription. 6 Since the Mpro of SARS-CoV-2 is a very important key for the conversion of its primary polypeptides into essential functional proteins through its master role in both viral transcription and replication processes. 7,8 Therefore, targeting SARS-CoV-2 Mpro is a very promising approach to get a lead compound combating the COVID-19 pandemic as soon as possible. [9][10][11] Considering that oceans and seas are covering almost 70% of the planet's surface, the marine environment had privileged to represent the extremely largest ecological system on the earth with 92% of total phyla exists in life, is featuring a huge number of wealthy and advantageous produces yet undiscovered. [12][13][14][15][16] Over seven decades and since the emerging of the rst marine bioactive nucleotides in the 1950s by Brigman et al., marine natural products (MNPs) have been marked as sustainable prolic pipelines for numerous structurally diverse bioactive candidates. Since more than 35 000 discovered compounds bearing unique structural features and unprecedented biological potentialities along with hundreds of new chemical entities are being unearthed each year, MNPs have preliviged a central focus of the worldwide drug lead discovery program. [17][18][19][20] Those exploitations have led to the identication of 15 successful approved/commercialized marine-based drug leads including, along with more than 33 candidates that are currently under different preclinical investigations. [21][22][23] Particularly, in 2020, the FAD has approved two marinebased compounds as anticancer, including belantamab mafodotin-blmf (Blenrep™), a synthetic drug-conjugate analog of the marine peptide dolastatin 10, originally isolated from a mollusk-derived cyanobacterium, recently developed by GlaxoSmithKline (GSK) and approved for the treatment of relapsed or refractory multiple myeloma. 24 Lurbinectedin, a synthetic tetrahydropyrrolo [4,3,2-de]quinolin-8(1H)-one analog of the marine alkaloid trabectedin, which was originally isolated from the tunicate Ecteinascidia turbinate, recently dened by PharmaMar and approved for the treatment of metastatic small cell lung carcinoma. 25 Pederins/mycalamides/onnamides and theopederins are distinct families of structurally related polyketides-containing nitrogen which are biosynthetically derived from a PKS-NRPS gene cluster (Schemes 1-4). Chemically, they compromise the main core of two tetrahydropyran rings linked together through an N-acyl aminal and decorated by variant oxidation degrees. 26 Pederin (1) was the rst member to be reported in 1949 from the female beetle called Paederus littoralis, however, its structure was complete and revised in 1968. 27 Later, in 1989-1990, perry et al. reported the isolation of structurally related members of marine origin, named mycalamides A-B (4-5) from a marine sponge of the genus Mycale collected in New Zealand. 28,29 Subsequentially, two additional compounds, mycalamide C-D (7)(8) were isolated from marine sponges of the genus Stylinos and Mycale sp. 30,31 Venturi et al., isolated the last member of this family namely mycalamide E (6) from the marine sponge Mycale hentscheli collected in Pelorus Sound, New Zealand. 32 Onnamides (9)(10)(11)(12)(13)(14)(15)(16)(17)(18)(19)(20)(21) are considered the largest group of this family. They have been isolated from the marine sponges of the genus Theonella and Trachycladus. [33][34][35][36] Theopederins A-J (22)(23)(24)(25)(26)(27)(28)(29)(30)(31), were reported from marine sponges of the genus Theonella, 37,38 however, theopederins K-L (32)(33) were recorded from marine sponge Discodermia sp. 39 Pharmacologically, pederin-related compounds are classi-ed as protein synthesis inhibitors, 32,40,41 which preliviged them to display a myriad of biological activities including cytotoxicity with IC 50 under 5 nM (ref. [42][43][44] and immunosuppressive. 45 Pederin and related derivatives were tested on a panel of tumor mammalian cells including HeLa and KB and showed signicant cytotoxicity at 2 nM concentrations. 46 Recently, Nakabachi et al. reported the potent cytotoxic activity of diaphorin, a structurally related pederin derivative isolated from a bacterial symbiont of the Asian citrus psyllid against 39 human cancer cells with GI 50 , TGI, and LC 50 values at the micromolar range. 47 More interestingly, they display in vitro potent antiviral activities. Mycalamide A (4), exhibited promising antiviral activity against A59 coronavirus. Moreover, mycalamides A-B (4-5) exhibited signicant antiviral inhibition activity against Herpes simplex type-1 and Polio type-1 viruses active at 3.5-5.0 and 1.0-2.0 ng per disk respectively. 28,29 Burres et al., have demonstrated that these compounds can inhibit DNA and RNA replication, meanwhile, further reports documented their action to be more like protein synthesis inhibitors rather than DNA or RNA synthesis inhibitors. 40 Additional four synthetic analogs of mycalamide A (4) were evaluated as strong protein synthesis inhibitors and showed potent binding to the nucleoprotein of the inuenza virus (H1N1) and prevent its replication. 48 For detailed chemistry including the isolation, synthesis, and biological potentialities of those families of natural products, see the comprehensive report by Mosey et al. 26 Therefore, taking into consideration the crucial role of SARS-CoV-2 Mpro, besides, the previously discussed antiviral activities of the examined marine compounds and as a part of our continuous program to identify potentially active marine natural products [49][50][51][52][53][54] with competent antiviral therapeutic activity 55 and to search for therapeutics combating SARSCoV-2 Mpro, [8][9][10][11][56][57][58][59][60][61] we decided to examine the anti-SARS-CoV-2 activities of the thirty-four marine compounds  and propose their mechanism of action as promising SARS-CoV-2 Mpro inhibitors using molecular docking approach, conrm their docking results through applying detailed molecular dynamics calculations, and nally study the structure-activity relationships for the obtained results in order to help scientists in the future discovery, design and synthesis of new effective anti-SARS-CoV-2 therapeutics in the near future.

Molecular docking study
Molecular docking studies were performed for the pederins, mycalamides, onnamides and theopederins related compounds  listed in (Schemes 1-4) against the dimeric form of the Mpro of SARS-CoV-2 using the MOE 2019.012 suite. 62 Moreover, both the co-crystallized inhibitor (O6K, 35) of the used dimeric protein (6Y2G), 63 besides the co-crystallized one (N3, 36) extracted from the monomeric protein (6LU7) 64 of SARS-CoV-2 Mpro were added to the tested database as two reference standards.
2.1.1. Pederins, mycalamides, onnamides and theopederins (1-33) preparation. The ChemDraw professional 17.0 was used to sketch the 2D chemical structures of the selected pederin, mycalamide, onnamide, and theopederin compounds  which were copied to the MOE window individually. Each transferred compound was converted to its 3D form, energy minimized aer the adjustment of its partial charges as well, and saved as (.moe) extension to be ready for the docking step as described earlier. [65][66][67] Moreover, the co-crystallized inhibitors of both the used dimeric SARS-CoV-2 Mpro (6Y2G) besides that of the monomeric form (6LU7) (35 and 36, respectively) were extracted and saved separately to be used as reference standards. Finally, all the aforementioned prepared compounds  were inserted in one database le and saved as (.mdb) extension to be uploaded to the ligand site during the docking process.
2.1.2. The target dimeric Mpro of SARS-CoV-2 preparation. The target dimeric form of Mpro enzyme of SARS-CoV-2 was downloaded from the Protein Data Bank (PDB code: 6Y2G). 63 Also, it was subjected to correction, 3D protonation, and energy minimization as described before [68][69][70] to be ready for the docking step.
2.1.3. Docking of the prepared database  to the dimeric Mpro of SARS-CoV-2. A general docking process was initiated aer uploading the previously mentioned database in place of the ligand and the prepared protein in place of the receptor. The docking site was selected to be the binding site of the co-crystallized a-ketoamide inhibitor (O6K, 35) of the dimeric SARS-CoV-2 Mpro pocket. Also, the general docking specications were selected to be triangle matcher, London dG, GBVI/WSA dG, and rigid receptor for the placement methodology, rst scoring methodology, nal scoring methodology, and renement methodology, respectively. [71][72][73] Aer completion of the docking process, the best pose for each examined compound-according to the score and RMSD values-was selected for further investigations.
of the co-crystallized a-ketoamide inhibitor (O6K, 35) of the used dimeric Mpro of SARS-CoV-2 at its binding pocket. The valid performance was conrmed by the observed low RMSD values (1.46) describing the root mean squared deviation between the native and redocked poses of the co-crystallized aketoamide inhibitor (35) (Fig. 1). 74-76

Molecular dynamics simulations
Models showing the best docking scores of the most promising leads, as well as reference ligands (O6K and N3) in complex with SARS-CoV-2 Mpro, were chosen as starting coordinates for 200 ns all-atom molecular dynamics simulation using the GROMACS-2019 soware package. 77,78 The symmetry operations (rotations and translations) by the VMD soware were applied using the transformation matrices within the PDB crystalline les to obtain the ligand-Mpro dimers. Parameterization of all investigated ligands and generation of their respective topology les were automatically generated using the CHARMM-General Force Field program (Param-Chem project; https:// cgenff.umaryland.edu/). 79 The CHARMM36m force eld was preferred for the proteins within all MD simulations. 9,80,81 Each ligand-Mpro model was solvated within a cubic box of the TIP3P water model under periodic boundary conditions implementation allowing a minimum of 10 A marginal distance between each 3D box side and the protein. 82 The residues of the Mpro target protein were assigned at their standard ionization states under physiological conditions (pH ¼ 7.0), while the net charge of the entire systems were neutralized using sufficient numbers of potassium and chloride ions being added via the Monte-Carlo ion-placing approach (ESI; Fig. S1 †). 83 The MD simulations were conducted over three conventional stages; one-staged minimization, double-staged equilibration, and production. The minimization step involved initial system geometry optimization through 5 ps (5000 iterations) under the steepest descent algorithm. 8 The second two-staged equilibration step proceeded for 100 ps (100 000 iterations) per stage. Under a constant number of particles, volume, and temperature (NVT) ensemble, the rst equilibration was conducted at 303.15 K being regulated by the Berendsen temperature coupling method. 84 Whereas the second equilibration stage was performed under a constant number of particles, pressure, and temperature (NPT) ensemble at 303.15 K and 1 atmospheric pressure regulated by the Parrinello-Rahman barostat method. 85 A force was constant of 1000 kJ mol À1 nm À2 was used for preserving original protein folding and restraining all heavy atoms during the minimization and equilibration processes. The production stage involved 200 ns MD simulation runs under constant pressure (NPT ensemble) while using the Particle Mesh Ewald (PME) algorithm for computing the longrange electrostatic interactions. 86 All covalent bond lengths, including hydrogens, were modeled under the implemented linear constraint LINCS method allowing an integration time step size of 2 fs. 87 Both Coulomb's and van der Waals nonbonded interactions were truncated at 10 A using the Verlet cut-off scheme. 88 Computing comparative analysis tools, including root-meansquare deviation (RMSD) and root-mean-square uctuation (RMSF), were performed through analyzing the MD trajectories using the GROMACS built-in tools. The difference RMSF (DRMSF) was estimated for each ligand-bound protein relative to the SARS-CoV-2 Mpro apo/unliganded state (PDB code: 7C2Q; atomic resolution 1.93 A), where DRMSF ¼ RMSF apo À RMSF holo . The same previous preparation, minimization, equilibration, and 200 ns all-atom MD simulation production were applied to the Mpro apo state, except no ligand preparation was performed. The Hydrogen Bond Analysis within Visual Molecular Dynamics ver.1.9.3 soware (VMD; University of Illinois, Urbana-Champaign, USA) was utilized to estimate and monitor the number of ligand-Mpro intermolecular hydrogen bonding over the whole simulation periods. The cut-off values for all hydrogen bond (donor-H/acceptor) distance and angle were assigned at 3.0 A and 20 , respectively. 89,90 Finally, the binding-free energy between the ligand and protein was estimated via the Molecular Mechanics/Poisson-Boltzmann Surface Area (MM/PBSA) using the GROMACS "g_mmpbsa" module. The MM/PBSA calculations provided more insights regarding the magnitude of ligandprotein affinity, the nature of the interaction, in addition to the residue-wise contributions within the binding-free energy calculations. 91 Important MM/PBSA parameters for polar/solvation calculations were set at solvent dielectric constant (80 pdie), solute dielectric constant (2 pdie), the radius of solvent probe (1.40 A), and reference vacuum (1 vdie). Concerning SASA apolar solvation; the radius of SASA solvent probe, offset constant, and solvent surface tension were set at 1.40 A, 3.8493 kJ mol À1 , and 0.0227 kJ mol À1 A À2 , respectively. Finally, parameters for the continuum-integral-based model were set as solvent probe radius 1.25 A, bulk solvent density (0.0334 A À3 ), and 200 for numbers of quadrature points per A 2 . The MM/PBSA calculations of all simulated systems were applied on representative frames for the whole MD simulation runs (200 ns). Using the GROMACS command-lines "gmx trjcat", four hundred representative snapshots/frames were spared out of the whole trajectory le at specied time intervals (i.e. one frame/snapshot every 500 ps). For representing the ligand-protein conformational analysis across specic timeframes, the Schrödinger™ Pymol™ graphical soware ver. 2.0.6 was used. 92

Docking studies
Analyzing the binding modes of the co-crystallized inhibitors (35 and 36) of the dimeric and monomeric forms of SARS-CoV-2 Mpro showed an asymmetric binding in each case. Furthermore, molecular docking of the pederins, mycalamides, onnamides and theopederins related compounds (1-34) against the dimeric form of SARS-CoV-2 Mpro achieved very promising results to be discussed in detail. Generally, the binding score order of the tested compounds was found to be in the following order: onnamides > theopederins > pederins > mycalamides. Many compounds were found to be superior to the cocrystallized inhibitor of the dimeric form (35) especially those of the onnamides family (9)(10)(11)(12)(13)(14)(15)(16)(17)(18)(19)21). It was also noted that the cocrystallized inhibitor of the monomeric form (36) got a binding score of À10.24 kcal mol À1 which was better than that of the dimeric form (35) which recorded a binding score of À8.77 kcal mol À1 (Table 1).
Regarding the docking results depicted in Table 1, we decided to further study pederin (1) as the most promising member of pederins, dihydro-onnamide A (11), onnamide C (14), and pseudo-onnamide A (17) as the most promising members of onnamides, and theopederin G (29) as the most promising member of theopederins as well, besides the two docked co-crystallized inhibitors (35 and 36) as represented in Tables 1 and 2. Also, the 2D binding interactions of the aforementioned compounds were depicted in (ESI; Fig. S2 †).
Regarding both Tables 1 and 2; the co-crystallized inhibitor of the dimeric SARS-CoV-2 Mpro (O6K, 35) was found to form one H-bond with His163 at 2.98 A and two pi-H bonds with Ala191 and Gln192 at 3.64 and 4.58 A, respectively. On the other hand, the co-crystallized inhibitor of the monomeric SARS-CoV-2 Mpro (N3, 36) got stabilized inside the binding pocket of the dimeric SARS-CoV-2 Mpro through the formation of only one Hbond with Ser1 at 3.29 A.
Collectively, the aforementioned results referred to very promising binding scores and interactions which indicate the expected promising intrinsic activities of the tested marine compounds at the same time.

Molecular dynamics simulations
Being an effective tool for investigating the relative stability of ligand-target complex as well as their respective dynamic behavior, MD simulation studies were performed. The latter computational tool is considered particularly benecial for exploring the conformation space of ligand-target complex being more efficiently than other in silico tools including molecular docking and mechanics energy minimization approaches for just static image analysis. 93 Showing relevant ligand-Mpro docking interactions, the top docked poses of the investigated compounds related to different families of the studied polyketides (pederin (1, D1), dihydro-onnamide A (11, D2), onnamide C (14, D3), pseudo-onnamide A (17, D4), and theopederin G (29, D5)), besides the two reference ligands (N3 and O6K) within the SARS-CoV-2 Mpro canonical binding site were subjected to 200 ns all-atom MD simulation.
3.2.1. Stability analysis of ligand-protein complexes. Throughout the 200 ns all-atom MD runs, several examined agents illustrated signicant global stability within the target's canonical binding site as being conrmed through the monitored RMSD trajectories. Generally, RMSD estimates the molecular deviation of a particular ligand relative to a designated original/reference structure. Such an analytical tool would provide a good indication for the ligand-target stability and the adopted MD simulation protocol was valid. Target's instability and signicant conformational alterations are associated with high RMSD trajectories. 94 On the other hand, high complex RMSD would correlate to limited ligand-target affinity where the ligand is unable to be conned within the target's canonical binding site along the simulation periods. 95 The estimated RMSD deviations for Mpro proteins, in reference to their respective alpha-carbon atoms (C-a RMSD), depicted an overall typical behavior for MD simulations ( Fig. 2A). Over the initial frames, the protein's C-a RMSD tones increases as a result of constraining release at the beginning of MD simulation runs. Following the rst 20 ns of the MD runs, steady protein's C-a RMSD trajectories were obtained for more than half of the simulation run time (>150 ns), except for minimal uctuation for D4 and N3-bound protein around 100-200 ns and near the end of MD simulation runs, respectively. Notably, almost all investigated ligands leveled off at comparable RMSD trajectories across the trajectory plateau and till the end of MD simulation courses (D1 2.32 AE 0.16 A, D2 2.39 AE 0.22 Slightly higher values assigned for D4 and N3 (2.43 AE 0.22 A and 2.55 AE 0.26 A, respectively) were correlated to the depicted uctuations across the MD simulation timeframes. On the other hand, the D1-bound Mpro systems managed to exhibit the steadiest C-a RMSD tones, exhibiting the lowest standard deviation value aer the equilibration was attained. The described dynamic behavior of the investigated Mpro proteins indicates the successful convergence of the target proteins. Moreover, the above-depicted protein's C-a RMSD tones also infer that successful system minimization, relaxation, and thermal equilibration stages have been adopted before the MD production step and thus, no further extension of the MD simulation beyond the 200 ns period was needed.
For gaining more insights concerning the ligand's connement within the Mpro canonical binding site across the MD run, the RMSD uctuations were monitored for the combined ligand-protein complex in reference to the protein backbone initial frame (Fig. 2B). Despite limited uctuations, the binary complexes of Mpro with almost all examined compounds and reference ligands managed to reach their respective dynamic equilibrium illustrating backbone RMSD plateau, beyond the 30 ns, indicating sufficient complex stability. Despite the differential backbone RMSD tones at the initial MD simulation frames, the investigated complexes managed to converge along the last 200 ns reaching a nal RMSD around 2.54 AE 0.17 A. Nevertheless, this dynamic behavior was not the same for the D4-Mpro complex since beyond 90 ns high uctuations were depicted ($3.51 AE 0.27 A) till the end of the MD simulation run indicating signicant ligand orientation shi. Interestingly, compounds D1, D5, and crystalline reference O6K achieved early equilibration with the steadiest complex RMSD trajectories and low comparative average values (2.33 AE 0.15 A, 2.39 AE 0.19 A, and 2.43 AE 0.18 A, respectively) as compared to other ligands ($2.67 AE 0.37 A). The latter observation highlights the better ligand's retainment for D1 and D5 within the protein pocket as compared to the rest of examined compounds as well as the reference N3 potent inhibitor.
Being considered as an additional descriptor for ligandpocket connement and convergence of the simulated proteins, the sole ligand RMSDs relative to the reference protein backbone frame were monitored along the MD simulation runs (Fig. 2C) with the above D4-Mpro complex RMSD trajectories, higher uctuations and average ligand RMSD trajectories were assigned for D4 (average 4.09 AE 0.81 A) ensuring its signicant orientation shi following the 90 ns of MD run. It is worth mentioning that protein RMSD trajectories were within 1.5-fold those of their respective ligands, with higher values for D4, the thing that further conrms the successful convergence of the ligand-protein complexes inferring the suitability of 200 ns simulation timeframe needing no further MD extension.
The MD simulation convergence was further validated and monitored via Principal Component Analysis (PCA). The latter approach examines the protein's collective dynamic motion and behavior out of the MD trajectories through constructing and diagonalizing covariance matrix from protein's C-a atomic coordinates. 96 The average covariance matrix allows capturing strenuous atom motions throughout both the eigenvectors and eigenvalues. Typically, the covariance matrix eigenvectors elucidate the atom's overall motion direction, while the eigenvalues represent the atom-wise contribution values within such motion. In these regards, both the covariance matrix eigenvectors and eigenvalues furnish the modes of collective motion and their respective amplitudes. The GROMACS "gmx_covar" command script was used for constructing and diagonalizing the covariance matrices, whereas, "gmx_anaeig" was used for presenting the most dominant modes (eigenvectors-1 and -2) in addition to estimating the overlap between principal components and trajectory coordinates. Since the corresponding eigenvalues indicate the dynamic behavior and degree of uctuations, covariance matrix with lower trace values would correlate with the minimal escalation of collective protein motion which further denote the MD simulation convergence. [97][98][99] Applying the PCA technique on the last 50 ns MD trajectories while comparing it with that for the rest of MD simulation frames would be fundamental for monitoring and validating the MD simulation convergence.
Interestingly, the average trace value of the covariance matrix at the last 50 ns was of lower magnitudes as compared to that along the rst 150 ns MD simulation trajectories (Fig. 3). The average trace value of the covariance matrix for almost all investigated models showed a nearly 30% decrease for the MD trajectories at the last 50 ns (Table 3). The latter ensures higher stability of the protein atoms at the last 50 ns which in turn conferring a validated convergence of the adopted MD simulation. On the other hand, only the D4 model exhibited $15% reduction which came in good agreement with the depicted uctuations observed at its previously described C-a RMSD trajectories. Concerning comparative PCA analysis, it was noticed that comparable MD convergence patterns were assigned for the Mpro proteins in complex with D1, D2, D5, and O6K (average 7.419155 AE 0.24 and 5.45171 AE 0.18 nm 2 for initial 150 ns and last 50 ns frames, respectively). The lowest magnitude was obtained for the D1 system at the last 50 ns correlating with the observed steady C-a RMSD following equilibration being superior over other investigated systems ( Fig. 2A). Both D3 and N3 showed higher covariance matrix traces at both investigated MD trajectory frames as compared to those for the above stably converged models. Nevertheless, both systems showed the same magnitude of reduction for their respective average trace value of the covariance matrix being around 30%. Based on the above ndings, validated MD simulation convergence was obtained for the above-investigated models ensuring the adequacy of the 200 ns MD simulation timeframe for exploring the ligand-Mpro thermodynamic behaviors.
Since both the RMSD analysis and PCA techniques highlight the signicant ligand-target stability for several examined ligands, it was benecial to further investigate the local protein exibility and how this could be contributed to the ligand binding. The uctuation of the target's residues was monitored by estimating the RMSF stability validation parameter which was able to highlight the residue-wise contribution within the target protein stability. Typically, RMSF provides a valuable evaluation of the target's residues dynamic behavior represented as both uctuation and exibility, through estimating the average deviation of each protein's amino acid concerning its respective reference position across time. 100 Within the presented manuscript, the difference root-meansquare uctuation (DRMSF) was a better estimation of the protein local exibility being the RMSF difference for each ligand-bound protein relative to the SARS-CoV-2 Mpro apo state (DRMSF ¼ apo RMSF À holo RMSF). A DRMSF cut-off value of 0.30 A was relevant for estimating the signicant alterations within the protein's structural movements meaning that amino acids with DRMSF above 0.30 were considered of limited mobility. This adopted cut-off was able to identify the immobile residues, whereas excluding those exhibiting inherited exibility including those at exible protein secondary structure (loops) and terminal segments. 89,101 Investigating the RMSF trajectories essentially execute for a trajectory region considered stable. Based on the above protein's C-a RMSD analysis ( Fig. 2A), the Mpro proteins target were of signicant conformational stability along the 200 ns MD simulations for all systems (<3.5 A), despite the limited uctuations for D4 and N3 systems. Therefore, the C-a RMSF calculations were reasoned for estimation across the whole MD simulation trajectories.
Throughout the DRMSF analysis, the free terminals residues showed a typical uctuation pattern with the highest negative DRMSF values in comparison to the core residues (Fig. 4). This behavior is highly reasoned since these terminal residues are most likely to uctuate at the highest deviations in comparison to core residues the thing that is typical for a well-behaved MD simulation. Interestingly, higher uctuation patterns were depicted for the residues of each ligand-protein complex at the Mpro C-terminus as compared to those located near the NH 2 end (average À1.32 AE 0.46 versus À0.26 AE 0.58 A). The terminal exible residues are at regions located >15 A from the protein's canonical binding site. The latter infers to the ability of the active site to accommodate bulkier ligands. Several distinct residue ranges including; 41-48, 162-167, 185-188, and 203-296, illustrated signicant immobility possessing an average DRMSF above the 0.30 A threshold. Notably, the residue range 289-296 being vicinal to the protein's C-terminal showed one of the highest immobility proles (DRMSF up to 3.81 AE 0.13 A). Such dynamic behavior confers signicant inuence of ligand's binding upon the stability of these C-terminal vicinal residues which came in great agreement with reported studies. 102 Concerning comparative protein's local stability, lower DRMSF trends were assigned for D4-bound Mpro residues relative to those of the other isolated and reference ligands. This was recognized across several ranges of protein residues, most notably for the 203-270 residue range. On the other hand, the high stability of D1, D5, and O6K-bound proteins conrmed with the presented DRMSF ndings which further conrms the superior stability of these three latter systems as being previously discussed via RMSD and PCA data.
3.2.2. Local protein exibility and uctuation of target's residues. For gaining more insights regarding the ligand interactions with Mpro pocket, a comparative analysis of the furnished DRMSF trajectories has proceeded regarding the specic exibility of Mpro key lining pocket residues. Interestingly, several canonical pocket residues depicted signicant immobility with DRMSF above the cut-off mobility threshold 0.30 A (Table 4). Residues lining the S1 0 subsite showed significant mobility with the lowest of all DRMSF values (lots of negative values) the thing that infers the great mobility indices  His41-vicinal residues (average DRMSF ¼ 1.28 AE 0.49 A, 1.43 AE 0.53, and 1.38 AE 0.57 A, respectively). Thus, it was suggested that the ligand-His41 hydrogen bond pair has a much more pronounced impact on ligand-protein stability over that of the other catalytic residue, Cys145. This was in great concordance with our previous study investigating promising natural scalaranes sesterterpenes isolated from the Red Sea marine sponge Hyrtios erectus as promising inhibitors of SARS-CoV-2 Mpro target. 77 Moving towards Mpro S1 subsite, signicant-high DRMSF values were depicted across the investigated ligand-bound proteins regarding a couple of pocket lining residues (Phe140, His163, and Glu166). Signicant immobility for the Glu166 (from 0.48 and up to 0.80 A) ensures the reported data within the current literature suggesting the crucial role of S1 subsite Glu166 residues for stabilizing several drug-like and peptidomimetic ligands at the Mpro active site. 8,9,63,77,[103][104][105][106] It is worth mentioning that the stability proles of Phe140 were only associated with D3 (0.35 A) and O6K (0.33 A) highlighting the signicant role of hydrophobic interactions in stabilizing both ligands at the Mpro binding site. Finally, almost all residues lining the S2 sub-pocket and a couple of residues comprising the S3 one (Met165 and Leu167) showed high trends of significant immobility and limited uctuations ranging from 0.32 A and up to 0.94 A. Interestingly, this residue-wise immobility trade was of the highest positive numbers for the S2 Met49 residue, particularly at the D1, O6K, and N3-bound proteins. Several vicinal residues for the S2 subsite depicted signicant rigidity. These immobile residues include Phe185 and Val-186 inferring the stability of ligands within these two respective protein subsites. In brief, the provided DRMSF ndings highlighted the key role of several S2 amino acids in addition to S3 Met165/Leu167, S1 His163/Glu166, S1 0 catalytic His41, as well as vicinal residues for stabilizing the investigated compounds and both reference ligands within the Mpro canonical pocket. Additionally, the DRMSF trajectories positively add to suggested sustained stability and compactness of the ligand-Mpro investigated complexes, particularly for D1, across the all-atoms MD simulations. All these came in high concordance with the above presented dynamic behaviors presented by the RMSD and PCA ndings.
3.2.3. Conformational analysis and intermolecular hydrogen bonding of ligand-Mpro complexes. Analysis of key conformational alterations across the MD simulation timeframe was performed through examining the ligand-Mpro models at trajectories of regular intervals. Selected frames at 0, 50, 100, 150, and 200 ns for each ligand-protein model were extracted and minimized to a 0.001 kcal mol À1 A À2 gradient using the MOE system preparation package. A stable binding prole was assigned for almost all isolated compounds as well as reference ligands. Notably, D1 showed the most limited conformational changes across the selected trajectories, particularly following the 100 ns MD simulation time frame where the ligand exhibited comparable spatial orientation and pocket connement (Fig. 5A). This could be reasoned as D1 exhibited a lower number of rotatable bonds, particularly since it lacks the long aliphatic tail, the thing that could signicantly impact its special stability within the Mpro pocket. Similar behavior was depicted for D5 where comparable spatial orientations were depicted following the 50 ns simulation time with the ligand's tail substitution being directed towards the pocket's solvent-exposed side (Fig. 5B). Despite having a tail substitution, this theopederin family member (D5) is with reasonable rigidity since the tail is being unsaturated with two double bonds. Both depicted D1 and D5 preferential stability came in great agreement with the previously described RMSD ndings.
Concerning the onnamides family members, both D2 and D3 exhibited higher aliphatic extensions with reasonable conformational changes being limited to their respective aliphatic tails rather than their tetrahydropyran rings (Fig. 5C  and D). However, both ligands showed reasonable connement within the Mpro pocket site despite their inherited conformational exibility. Higher tail-related conformational alterations were assigned for D2 rather than D3 since the latter exhibited tail rigidication owing to possessing an additional tetrahydropyran ring due to hydroxyl group-mediated tail cyclization. Ligand connement within Mpro pocket was not depicted for the other close related onnamide member where D4 abandoned the Mpro canonical binding site at the middle and end of the MD simulation (Fig. 5E). The latter dynamic behavior explains the exhibited higher uctuations within the complex and sole ligand RMSDs beyond the 90 ns MD simulation time frame. It was suggested that the presence of higher unsaturation (three double bonds conjugation) within the D4 tail could limit its conformational exibility while limiting its favored maneuvers towards the signicant pocket residues. Exhibiting unfavoured tail orientations might have cost D4 to lose signicant polar contacts with Mpro pocket residues mediated by the ligand's terminal polar functionalities (guanidine, carboxylic group, and central amide) causing a compromised ligand-target pocket accommodation.
Thus, the depicted differential conformational changes among the onnamide family members highlight the impact of the nature of the aliphatic tail substitution for guiding preferential ligand-target stability. Regarding the reference ligands, both O6K and N3 are comparable proteomimetic ligands with several rotatable bonds exhibiting signicant rotation at their dihedral/torsion angles. Interestingly, such inherited exibility caused both ligands to twist along an anti-clockwise direction yet being maintained within the same orientations in respect to the Mpro pocket site (Fig. 5F and G). Notably, slightly higher orientation changes were illustrated for N3 over those of O6K providing reasonable explanations for the N3-complex RMSD trajectory uctuations near the end of the MD simulation run.
Further examining the proposed ligand-Mpro pocket stability was proceeded through monitoring the hydrogen bonding established between the ligand and target protein across the MD simulation trajectories. Plotting the number of formed hydrogen bonds between the simulated ligand and protein across the MD trajectories has revealed differential hydrogen bonding patterns across different systems. The simulated mycalamide family member, D1, showed a lower number of average hydrogen bond interactions as compared to members of the onnamide and theopederin family members. Lacking the terminal aliphatic tail causes D1 to possess the lowest number of hydrogen bond donors/acceptors available for predicted polar contacts with Mpro pocket residues. Additionally, D1 exhibited a nearly consistent number of hydrogen bonds with the target protein at the middle and till the end of the MD simulation time (Fig. 6A). This was highly correlated with the limited D1 conformational changes, particularly following the 100 ns MD simulation time frame, where the ligand exhibited comparable spatial orientation and pocket connement. On the contrarily, both the D2 and D3 of the onnamide family showed the highest number of hydrogen bond interactions across the whole MD simulation time frame (Fig. 6B and  C). Showing such high polar interaction proles could be reasoned since both D2 and D3 possess a higher number of hydrogen bond donors/acceptors (incorporated within both the core tetrahydropyran rings and tail substitution) besides showing signicant ligand-pocket accommodation along with the whole simulation. As expected, the polar interaction prole of the above onnamide family members was not relevant for D4 since the latter exhibited poor pocket accommodation abounding the Mpro binding site within the middle of the MD simulation run. Interestingly, the D4-Mpro system lack any signicant hydrogen bonding across several MD simulation frames, particularly around 130 and 160 ns simulation times (Fig. 6D). Regarding the simulated theopederin member D5-Mpro model, higher polar interacting proles up to 5 hydrogen bonds were illustrated across the rst 100 ns MD simulation trajectories (Fig. 6E). Aerward, a consistent number of hydrogen bonding was achieved till the end of the MD simulation except only for few frames missing relevant hydrogen bonding. Moving towards the two reference ligands, both O6K and N3 showed somewhat comparable hydrogen bonding proles across MD simulation runs ( Fig. 6F and G). This may be related to the close proteomimetic nature of both ligands incorporating similar N-terminal amino acids within their respective structures. However, N3 showed slightly higher average hydrogen bond numbers as compared to O6K owing to N3's longer residue sequence reecting more hydrogen bond donors/acceptors incorporated within its structure.
3.2.4. Binding-free energy calculations. In an attempt to further understand the nature of the ligand-protein interaction, explore the comparative ligand-binding site affinity, and obtain more information concerning individual ligand/residue contributions, the calculation of the binding-free energy was performed. 107 In this regard, the MM/PBSA calculation was implemented for binding-free energy estimation, where higher negative binding energy explains more ligand affinity towards its respective target pocket. 91 The MM/PBSA is considered of comparable accuracy to the free-energy perturbation approaches, yet with much smaller computational expenses. 91 Using the solvent-accessible surface area (SASA), the only model of the free-binding energy calculation (DG total ¼ DG molecular mechanics + DG polar + DGA polar ), as well as the single-trajectory approach, each energy term and their average values, were calculated across the representative frames extracted/saved from the whole 200 ns MD simulation trajectories. The single-trajectory approach was chosen since dealing with one trajectory of the ligand-Mpro complex rather than separated trajectories of the complex, receptor, and ligand was shown to be not only much faster but also less noisy. 108 Adopting the calculation across the 200 ns MD simulation time course was rationalized by the rapidly attained equilibration/ convergence of the complex RMSD trajectories for all investigated compounds and reference ligands following few initial MD frames (Fig. 2B).
To our delight, several investigated natural compounds depicted signicant free-binding and affinity to the target's pocket ( Table 5). The free binding energies of both the theopederin and onnamide family members (D2, D3, and D5) were estimated at signicant negative values reaching up to 2-fold those of the two reference ligands. Considering the reported superior inhibition activity of the N3 ligand against SARS-CoV-2 Mpro, 105 the obtained data for these latter drug class members highlights their potential activity against the same target enzyme. On the other hand, the less stabilized onnamide ligand, D4, depicted calculated binding-free energy being comparable to that of the reference ligands especially in relation to the N3-Mpro system. Finally, the calculated free binding energy for the D1-Mpro model was the least of all investigated ligands the thing that highlights the signicant role of the extended aliphatic tail substitution for guiding the ligandtarget interaction.
Dissecting the obtained binding-free energy into its contributing energy terms showed a dominant energy contribution of the van der Waals interactions within the free-binding energy calculation of both the reference ligands. The higher hydrophobic and lower electrostatic energy contributions were assigned for O6K over N3 owing to the higher aromaticity and lower number of hydrogen bond donors/acceptors incorporated within the O6K core skeleton. The same preferential free binding energy contribution of the van der Waals potentials was depicted with D1-Mpro system where the absence of polar functionality related to the tail substitution deprived the ligand of relevant anchoring potentiality with the pocket polar residues. The above hydrophobic/electrostatic contribution prole was contrarily for the top-ranked isolated compounds (D2, D3, and D5) where the Coulomb's electrostatic potential energy was much higher than that of the van der Waals non-bonding interactions reaching up to $2-fold for D2 and D5 systems. The latter energy contribution pattern could be related to the highly polar extended tail substitutions, where these tails allowed D2, D3, and D5 to exhibit extended orientations within the Mpro pocket positioning their hydrophilically decorated bis-tetrahydropyran rings at one side of the pocket and the polar functionalities related to their long tail at the far pocket side (Fig. 5). Such depicted ligand-Mpro orientations have allowed these extended polyketides (D2, D3, and D5) to exhibit extensive polar interactions with Mpro pocket hydrophilic residues, whereas, D1 was deprived from such relevant anchoring potentiality for lacking the tail substitution. This came in good reason since the latter ligands showed higher number and frequency of hydrogen bonding with the Mpro protein pocket as compared to those of D1, N3 or O6K (Fig. 6). It is worth mentioning that the reported data within the current literature has considered the Mpro pocket to be more hydrophobic being deep, less solvent exposed, and with conserved hydrophobic pocket lining residues. 63,105,106 Nevertheless, the ability of D2, D3, and D5 to establish favored strong polar interactions with the pocket's key residues allow them to be deeply anchored and attaining signicant pocket specicity. This was obvious through the previously described conformational and hydrogen bonding analysis. All above data conrms the signicant role of the polar functionalities related to the terminal tail substitution for ligand anchoring within Mpro binding site. Notably, the presence of polar functionalities related to the terminal tail substitution could act as a double-bladed inuencer on ligandprotein binding since such functional groups impose higher DG solvation that might compromise the ligand anchoring since the binding process is a solvent-substitution approach. Thus, optimizing these isolated compounds through the introduction of ionizable groups yet with higher hydrophobic characteristics (i.e., tetrazole functionality) would be suggested relevant for minimizing the DG solvation , extending the ligand-Mpro binding, and furnishing potential target inhibition. Finally, the total non-polar interactions (DG vanderWaals plus DG SASA ) were higher for the top-binding compounds (D2 and D3) as compared to those of reference ligand N3 the thing that would have been directly related to the pocket's large surface area. Being hydrophobic and with a large surface area, the Mpro binding site would favor higher non-polar interactions with D2 and D3 since the latter ligands are capable to attain a more extended conformation within the target's pocket.
For gaining more insights regarding ligand-residues interactions, the binding-free energy decomposition within the g_mmpbsa module was utilized to identify the key residues involved within the obtained binding free energies. 91 Interestingly, similar residue-wise energy contribution patterns were assigned for the top-binding onnamide and theopederin family members, D2, D3, and D5 (Fig. 7A). This was of no surprise since these ligands depicted comparable energy terms as well as total binding-free energy values. On the other hand, the family member compound (D1) shares, to some extent, several residuewise energy contribution patterns similar to those of the three top-binding ligands, yet of lower magnitude (Fig. 7B). As expected, the D4 compound showed the lowest residue-wise energy contributions with most of the residues involved with the top-binding compounds. Additionally, several Mpro pocket-specic residues showed positive energy contributions with D4 inferring repulsion effect and unfavoured ligand-pocket accommodation.
Notably, residues of the S2 subsites and their vicinal residues showed the most extended and one of the highest residuebinding energy contributions with values up to twodigit kJ mol À1 values. Residues such as; Glu47, Asp48, Met49, Glu55, Asp56, Asp187, and Arg188 illustrated very high energy contributions ranging from À6.20 AE 0.761 kJ mol À1 and up to À15.32 AE 0.98 kJ mol À1 . Similar energy contribution patterns, yet with smaller magnitudes, were assigned for the S3 subsite residues and vicinal amino acids including Leu167, Pro168, Thr169, Asp176, and Glu178 (À5.10 AE 1.63 up to À11.46 AE 0.38 kJ mol À1 ). The profound and widespread energy contribution of the S2 residues came in great concordance with the previously described RMSF analysis inferring the crucial role of these residues within the ligand anchoring. Contributions of the key S1 sub-pocket residues were only assigned high for Glu166 exhibiting high residue-associated energy contribution of À22.173 AE 2.26 kJ mol À1 and À13.16 AE 4.21 kJ mol À1 for D2 and D3, respectively. On the contrary, the rest of the S1 residues either showed low negative (Phe140, Leu141, Asn142) or even positive energy contribution values as with His163 inferring their limited or unfavoured role for ligand binding, respectively.

Structure-activity relationship study
The investigated compounds share a common core composed of two tetrahydropyran rings; one of them possesses an aminal moiety whilst the other bears an acyl functional group, which are coupled via an amide linkage. Despite possessing a similar core, the estimated highly variable binding score might be a reection of the presence or absence of some moieties, additional rings, functional groups, and/or variance of the oxygenation pattern (numbers and positions of mainly hydroxyl and/or methoxy groups). Relating the structures (1-34; Schemes 1-4) to the estimated binding scores (Table 1) suggests that the most inuential structural element on the predicted binding score is the carbon chain present as a substituent on the tetrahydropyran ring bearing the aminal moiety, especially when contains a terminal arginine residue (Fig. 8). The impact of other structural elements was relatively lower on the calculated binding sore.
Thus, the high scores of onnamides (9-19 and 21) that surpassed the calculated binding score of the co-crystallized inhibitor (O6K, 35) of the dimeric form (Table 1) might be attributed mainly to the presence of 10-12 unsaturated acyl carbon chain (green-colored; Fig. 8) coupled with an arginine residue (redcolored; Fig. 8). This might be conrmed upon comparison with the binding score onnamide (20) that lacks the arginine residue, which possessed a lower binding score relative to the calculated score of the co-crystallized inhibitor (35) of the dimeric form. Probably, the guanidinic moiety of the arginine reside offers favorable binding interactions. Amongst onnamides, the best score was for 6,7-dihydro-onnamide A (11) whose substituent acyl chain coupled with arginine is of 12 carbons length and incorporates trans-8,10-diene. Its calculated binding score (À10.19 kcal mol À1 ) was much better than the calculated score of the co-crystallized inhibitor (35) of the dimeric form and almost similar to the calculated score of the co-crystallized inhibitor (36) of the monomeric form (Table 1). Such a high score of 6,7dihydro-onnamide A (11) might be a result of an optimum distance between arginine residue and the core tetrahydropyran rings offered by the 12 carbons-length trans-2,4-diene substituent. This might be inferred from the found relatively lower score of compounds possessing 10 carbons-length trans-2,4-diene substituents (12 and 13) in comparison with compounds possessing 12 carbons-length substituents. Although of minimal effect, it was found that compounds having 17-hydroxy functionality (12) are slightly better than 17-oxo functionality (13). However, this lowering effect for conversion of the corresponding 19-hydroxy functionality in 6,7-dihydro-onnamide A (11) to a 19oxo functionality in 6,7-dihydro-oxo-onnamide A (18) was much larger. Nevertheless, this order was reversed as the conversion of the corresponding 19-hydroxy functionality in onnamide D (15) to a 19-oxo functionality in oxo-onnamide A (19) resulted in minimal enhancement of the binding score. Introduction of a double bond at 6-position of the 12 carbon-length trans-2,4diene to become trans-2,4,6-triene (pseudo-onnamide A (17), onnamide E (16), onnamide A (9), oxo-onnamide A (19), and onnamide D (15)) afforded notably high scoring compounds, but less than 6,7-dihydro-onnamide A (11). Similarly, the formation of a cyclic hemiacetal between the 19-hydroxy functionality and an introduced 15-carbonyl functionality in side-chain affording an additional tetrahydropyran ring (onnamide C (14)) resulted in a notably high scoring compound but, also, less than 6,7-dihydroonnamide A (11). The scores of these compounds were very close but in order pseudo-onnamide A (17) > onnamide C (14) > onnamide E (16) > onnamide A (9) > oxo-onnamide A (19) > and 6,7-dihydro-oxo-onnamide A (18) z onnamide D (15). As revealed from these results, the 1,3-dioxane ring fused to tetrahydropyran ring bearing the aminal moiety in onnamides has little effect as onnamide D (15) and onnamide E (16).
In the case of pederins, the size of the inuential side chain is 3 carbons-length. Such chain bears two methoxy groups which are among the features that distinguish pederins from mycalamides in which one of/both of these methoxy groups exist(s) as hydroxyl group(s). As noted above, the 1,3-dioxane ring fused to tetrahydropyran ring bearing the aminal moiety has little effect. Accordingly, pederins showed good binding scores although they lack such 1,3-dioxane ring. The highest score among pederins was triggered by pederin (1) which has a derivative 2-methoxytetrahydropyran as the core tetrahydropyran ring bearing the acyl functional group. Conversion of the 4-hydroxytetrahydropyran that bears the aminal moiety in pederin (1) into tetrahydro-4-pyrone in pederone (3) results in the minimal reduction of the calculated binding score. However, the noticed reduction of the calculated binding score upon replacement of the derivative of 2-methoxytetrahydropyran in pederin (1) by the corresponding derivative of 2hydroxytetrahydropyran in pseudopederin (2) suggests a potential role for this methoxy function. This might be supported by the found much lower binding score of mycalamide E (6) that contains a similar 2-hydroxytetrahydropyran moiety relative to other mycalamides possessing 2-methoxytetrahydropyran. Considering the effects of the other structural elements, mycalamides in which the presence of 2methoxytetrahydropyran ring was combined with a side chain bearing two hydroxy groups; mycalamide A (4) and mycalamide D (8), were the best scoring mycalamides. Out of them, mycalamide A (4) was better, probably because the other 4-hydroxy functionality on the other tetrahydropyran ring of mycalamide D (8) is converted in a 4-methoxy group. When the side chain possessed one methoxy and one hydroxyl group; mycalamide B (5) and mycalamide C (7), there was a binding score reduction relative to the corresponding compounds. The more score reduction was associated with mycalamide C (7) which lacks also the 1,3-dioxane ring fused to tetrahydropyran ring bearing the aminal moiety. Over members of pederins and mycalamides, the best scores order was pederin (1) > mycalamide A (4) > pederone (3) > mycalamide D (8) > mycalamide B (5) > mycalamide C (7) > pseudopederin (2) > mycalamide E (6).

Conclusion
Thirty-three focused marine natural products related to the pederins, mycalamides, onnamides and theopederins polyketide families were comprehensively examined for their binding affinities against the dimeric form of the Mpro of SARS-CoV-2 using an integrated set of computational methods including molecular docking and molecular dynamics simulations studies. Our results disclosed that most of the examined members are exhibiting promising binding scores and modes especially dihydro-onnamide A (11), onnamide C (14), and pseudo-onnamide A (17) which exceeded the co-crystallized inhibitor (O6K, 34) binding score. Molecular dynamic simulation illustrated the preferential stability of almost all investigated compounds at the Mpro binding site, however, D2, D3, and D5 exhibited the most preferential stability and highest free binding energies being nearly 2-fold those of the potent Mpro inhibitors, O6K and N3. For future lead development and optimization of these top stable ligands, it is recommended that introducing ionizable groups yet with higher hydrophobic characteristics (i.e. tetrazole functionality) would be relevant for minimizing the DG solvation , extending the ligand-protein binding, as well as furnishing potential target inhibition. Furthermore, an interesting SAR study was performed to correlate the diverse structural modications on the proposed activity. Those encouraging ndings highlight that such fantastic molecular architectures could open a gate for the development of promising antiviral treatments for beating the COVID-19 outbreak. Moreover, considering exible chemical syntheses for plenty of those compounds/congeners or structurally related modied members could be urged for more preclinical and clinical examinations either alone or in combination with each other for COVID-19 management.