 Open Access Article
 Open Access Article
      
        
          
            Xuan 
            Zhou
          
        
       ab, 
      
        
          
            Abhishek 
            Khetan
ab, 
      
        
          
            Abhishek 
            Khetan
          
        
       ad, 
      
        
          
            Jie 
            Zheng
          
        
      e, 
      
        
          
            Mark 
            Huijben
ad, 
      
        
          
            Jie 
            Zheng
          
        
      e, 
      
        
          
            Mark 
            Huijben
          
        
       e, 
      
        
          
            René A. J. 
            Janssen
e, 
      
        
          
            René A. J. 
            Janssen
          
        
       ac and 
      
        
          
            Süleyman 
            Er
ac and 
      
        
          
            Süleyman 
            Er
          
        
       *a
*a
      
aDIFFER – Dutch Institute for Fundamental Energy Research, De Zaale 20, 5612 AJ Eindhoven, the Netherlands. E-mail: s.er@differ.nl
      
bDepartment of Applied Physics, Eindhoven University of Technology, Eindhoven 5600 MB, the Netherlands
      
cMolecular Materials and Nanosystems, Institute for Complex Molecular System, Eindhoven University of Technology, Eindhoven 5600 MB, the Netherlands
      
dMultiscale Modeling of Heterogeneous Catalysis in Energy Systems, RWTH Aachen University, Aachen 52062, Germany
      
eMESA+ Institute for Nanotechnology, University of Twente, Enschede 7500 AE, the Netherlands
    
First published on 30th May 2023
Organic cathode materials are attractive candidates for the development of high-performance Li-ion batteries (LIBs). The chemical space of candidate molecules is too vast to be explored solely by experiments; however, it can be systematically explored by a high-throughput computational search that incorporates a spectrum of screening techniques. Here, we present a time- and resource-efficient computational scheme that incorporates machine learning and semi-empirical quantum mechanical methods to study the chemical space of approximately 200![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 quinone-based molecules for use as cathode materials in LIBs. By performing an automated search on a commercial vendor database, computing battery-relevant properties such as redox potential, gravimetric charge capacity, gravimetric energy density, and synthetic complexity score, and evaluating the structural integrity upon the lithiation process, a total of 349 molecules were identified as potentially high-performing cathode materials for LIBs. The chemical space of the screened candidates was visualized using dimensionality reduction methods with the aim of further downselecting the best candidates for experimental validation. One such directly purchasable candidate, 1,4,9,10-anthracenetetraone, was analyzed through cyclic voltammetry experiments. The measured redox potentials of the two lithiation steps,
000 quinone-based molecules for use as cathode materials in LIBs. By performing an automated search on a commercial vendor database, computing battery-relevant properties such as redox potential, gravimetric charge capacity, gravimetric energy density, and synthetic complexity score, and evaluating the structural integrity upon the lithiation process, a total of 349 molecules were identified as potentially high-performing cathode materials for LIBs. The chemical space of the screened candidates was visualized using dimensionality reduction methods with the aim of further downselecting the best candidates for experimental validation. One such directly purchasable candidate, 1,4,9,10-anthracenetetraone, was analyzed through cyclic voltammetry experiments. The measured redox potentials of the two lithiation steps,  , of 3.3 and 2.4 V, were in good agreement with the predicted redox potentials,
, of 3.3 and 2.4 V, were in good agreement with the predicted redox potentials,  , of 3.2 and 2.3 V vs. Li/Li+, respectively. Lastly, to lay out the principles for rational design of quinone-based cathode materials beyond the current work, we constructed and discussed the quantitative structure property relationships of quinones based on the data generated from the calculations.
, of 3.2 and 2.3 V vs. Li/Li+, respectively. Lastly, to lay out the principles for rational design of quinone-based cathode materials beyond the current work, we constructed and discussed the quantitative structure property relationships of quinones based on the data generated from the calculations.
Quinone-based small molecules are particularly attractive in the research community as they offer high theoretical specific capacity and promising redox stability and can be sourced from biomass.7,11 However, in comparison to state-of-the-art inorganic cathode materials, small quinones have not yet been found to be sufficiently viable in terms of actual redox potentials and cycling stability.12–14 Improvements in the electrochemical performance of quinone-based cathode materials are possible by fine-tuning their molecular structure through different approaches, including functionalization with R-groups,15 fusing aromatic16 or heteroaromatic rings17 together, and incorporation of additional redox-active carbonyl groups.18
Going beyond the Edisonian experimental approach of trial-and-error, high-throughput computational screening (HTCS) has emerged as a powerful approach for accelerating the discovery of high-performing material candidates. Owing to the ever-increasing computational power and the new analysis tools that are based on robust theoretical approaches, HTCS has become an efficient means for the exploration of large ‘virtual chemical spaces’.19–24 However, previous computational studies on quinone-based LIB cathode materials have been restricted to a modest number of molecules which were derived from a limited range of often intuitively chosen quinone core structures, including cyclohexa-2,5-diene-1,4-dione (BQ),25–27 naphthalene-1,4-dione (NQ),25,28,29 anthracene-9,10-dione (AQ),29–34 and phenanthrene-9,10-dione (PQ).17,29,34 These investigations of chemical spaces, although around few quinone core structures, have offered useful insights on the effectiveness of various strategies, such as functionalization and ring-fusion. However, the effect of these strategies, especially in terms of often competing performance requirements, has not yet been sufficiently well explored for a large range of quinones that span across several functional groups, ring numbers and the number of redox-active carbonyl groups that can interact with Li.
In this work, we tapped into a large chemical space of systematically enumerated quinones by means of HTCS with two principal goals: (1) finding the top-performing candidates which can be validated experimentally and (2) deciphering the trends in battery performance-related properties induced by pondered molecular engineering. The structure-property relationships emerging from the second step can be used as guiding design principles for future discovery efforts. The workflow of chemical space exploration of organic molecules conceptualized and applied in this work is shown in Fig. 1. The HTCS approach has been carried out in five major steps:
|  | ||
| Fig. 1 An overview of the data-driven computational and experimental workflow that has been applied for the exploration of the quinone-based LIB cathode materials in the current study. | ||
• Library generation: a virtual library consisting of 199![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 498 unique quinone molecules was designed and developed by employing three elementary molecular building-blocks and four chemically different R-groups.
498 unique quinone molecules was designed and developed by employing three elementary molecular building-blocks and four chemically different R-groups.
• Property computations: an automated search for the commercial availability of the virtual library molecules in the ZINC database35 was performed, in which matching records for a total of 245 compounds were found. In addition, the simplified molecular input line entry system (SMILES) representations of all the virtual library molecules were used as inputs for a machine learning (ML) model that predicts the synthetic complexity score (SCScore) of the molecules. Next, we employed semi-empirical quantum mechanics (SEQM) calculations to optimize the geometries of molecules and to calculate their redox potentials via a linear regression model. Furthermore, the theoretical gravimetric charge capacity (Q) and energy density (W) of the virtual molecules were calculated.
• DFT simulations: a small proportion of the entire library of 199![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 498 molecules, which were either commercially available or easily synthesizable, was treated with density functional theory (DFT) calculations. Accordingly, 641 molecules were further optimized and their properties were recalculated at a higher level of accuracy. In addition, to determine the stability of molecules, we used a structural alignment algorithm and analyzed the deviations between DFT-optimized structures of the pre- and post-lithiated molecules.
498 molecules, which were either commercially available or easily synthesizable, was treated with density functional theory (DFT) calculations. Accordingly, 641 molecules were further optimized and their properties were recalculated at a higher level of accuracy. In addition, to determine the stability of molecules, we used a structural alignment algorithm and analyzed the deviations between DFT-optimized structures of the pre- and post-lithiated molecules.
• Virtual screening: out of the 641 candidates that have been considered in DFT-calculations, 430 molecules have satisfied the molecular stability criterion enforced in the current work. Moreover, 349 of these 430 molecules have met the prescribed criterion for redox potential. Lastly, we employed the Uniform Manifold Approximation and Projection (UMAP) dimension reduction technique in combination with the tailored similarity method in order to obtain a non-linear dimensional reduction of the top-candidate molecular chemical space.
• Experimental validation: for real-lab validation, we performed electrochemical measurements on a readily purchasable molecule from the list of top-compounds that had been identified via the virtual screening.
In addition to providing a list of top-grade quinone-based candidate molecules for LIBs, we also included here a structure–property analysis of the entire virtual library molecules.
In addition to the size constraint, the CQSs contained either two or four C![[double bond, length as m-dash]](https://www.rsc.org/images/entities/char_e001.gif) O groups; in other words, a maximum of two quinone building block molecules per CQS were allowed. Applying these two structural constraints, 170 unique CQSs were generated, which can be classified into five main groups based on the number and position of carbonyl groups. The molecules that consist of one 1,4-benzoquinone ([P]), one 1,2-benzoquinone ([O]), two 1,4-benzoquinone ([P + P]), two 1,2-benzoquinone ([O + O]), and one 1,4-benzoquinone and one 1,2-benzoquinone ([P + O]), are shown in Table 1. Accordingly, the molecular skeletons of [P] and [O] can undergo one pair of lithiation reactions leading to aromatization, whereas [P + P], [O + O], and [P + O] can undergo two pairs of lithiation reactions.
O groups; in other words, a maximum of two quinone building block molecules per CQS were allowed. Applying these two structural constraints, 170 unique CQSs were generated, which can be classified into five main groups based on the number and position of carbonyl groups. The molecules that consist of one 1,4-benzoquinone ([P]), one 1,2-benzoquinone ([O]), two 1,4-benzoquinone ([P + P]), two 1,2-benzoquinone ([O + O]), and one 1,4-benzoquinone and one 1,2-benzoquinone ([P + O]), are shown in Table 1. Accordingly, the molecular skeletons of [P] and [O] can undergo one pair of lithiation reactions leading to aromatization, whereas [P + P], [O + O], and [P + O] can undergo two pairs of lithiation reactions.
Finally, the CQSs were functionalized by replacing the peripheral H atoms on the rings with four different R-groups: –Cl, –CH3, –OCH3, and –NH2. These electron-withdrawing and electron-donating chemical groups were chosen to enlarge the electrochemical window of the CQSs.13,37,38 Each CQS was exhaustively enumerated using a single type of R-group without combining different R-groups. The range of functionalized positions of R-groups varied between zero and the maximum number of H atoms that are present on that specific CQS. The enumeration of the virtual library was performed automatically using the Custom R-Group Enumeration tool as implemented in Schrödinger Materials Science Suite (SMSS).39 An overview of the statistical decomposition of the virtual library, which contained 199![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 498 unique quinone-based molecules, is shown in Table 1. The structures of the complete family of 170 CQSs, including their 2D structure drawings and SMILES representations, as well as the total number of R-group functionalized derivatives per CQS, are given in ESI Table S1.†
498 unique quinone-based molecules, is shown in Table 1. The structures of the complete family of 170 CQSs, including their 2D structure drawings and SMILES representations, as well as the total number of R-group functionalized derivatives per CQS, are given in ESI Table S1.†
After generating the virtual library, an automated search for finding the commercially available (CA) molecules was performed using an in-house developed tool on the ZINC database, which contains vendor information data on 750 million purchasable compounds.35 This tool uses SMILES strings of molecules as inputs and collects the available vendor information from the ZINC database. The search resulted in matching records for 245 organic compounds, which represents a tiny fraction (0.13%) of the grand virtual library that has been built in the current work. This indicates a vast chemical space of molecules that can possibly be synthesized and explored for application as quinone-based cathode materials in LIBs.
|  | (1) | 
 and
 and  denote the predicted redox potential and LUMO energy of quinone-based molecules computed at the SEQM-level, respectively. For a group of 641 down-selected candidates, further OPT and SPE calculations were performed at a higher DFT-level of accuracy using the B3LYP functional46,47 with the LACVP++**48 basis set with diffuse and polarization functions. The DFT calculations were performed using the Jaguar program49 as implemented in SMSS. The change in energy and RMS density matrix were separately converged to within 5.0 × 10−5 and 5.0 × 10−6 Hartree, respectively. A ‘medium’ grid density was chosen for performing OPT, whereas a ‘fine’ grid density was used for SPE calculations. Similar to the above, the DFT-calculated LUMO energy of the optimized quinone reactant molecule was employed to predict redox potential. Meanwhile, it has been reported that the electronic structure, particularly the average electric potential of carbon atoms in carbonyl-equipped rings, demonstrates a linear correlation with the redox potential of quinone molecules.17 Here, we compared the prediction performance of the two chemical descriptors, and the fitting results are shown in Fig. S1.† It can be observed that the LUMO energy of the quinone molecule (R2 = 0.97) shows better prediction performance than the average electric potential of carbon atoms in carbonyl-equipped rings (R2 = 0.81). Similarly, the linear regression equation is given as:
 denote the predicted redox potential and LUMO energy of quinone-based molecules computed at the SEQM-level, respectively. For a group of 641 down-selected candidates, further OPT and SPE calculations were performed at a higher DFT-level of accuracy using the B3LYP functional46,47 with the LACVP++**48 basis set with diffuse and polarization functions. The DFT calculations were performed using the Jaguar program49 as implemented in SMSS. The change in energy and RMS density matrix were separately converged to within 5.0 × 10−5 and 5.0 × 10−6 Hartree, respectively. A ‘medium’ grid density was chosen for performing OPT, whereas a ‘fine’ grid density was used for SPE calculations. Similar to the above, the DFT-calculated LUMO energy of the optimized quinone reactant molecule was employed to predict redox potential. Meanwhile, it has been reported that the electronic structure, particularly the average electric potential of carbon atoms in carbonyl-equipped rings, demonstrates a linear correlation with the redox potential of quinone molecules.17 Here, we compared the prediction performance of the two chemical descriptors, and the fitting results are shown in Fig. S1.† It can be observed that the LUMO energy of the quinone molecule (R2 = 0.97) shows better prediction performance than the average electric potential of carbon atoms in carbonyl-equipped rings (R2 = 0.81). Similarly, the linear regression equation is given as:|  | (2) | 
 and
 and  denote the predicted redox potential and LUMO energy of quinone-based molecules computed at the DFT-level, respectively. It should be noted that both eqn (1) and (2) are valid for an electrolyte formulation of 1 M LiPF6 with a (w/w = 1
 denote the predicted redox potential and LUMO energy of quinone-based molecules computed at the DFT-level, respectively. It should be noted that both eqn (1) and (2) are valid for an electrolyte formulation of 1 M LiPF6 with a (w/w = 1![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) :
:![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 1) solution mixture of ethylene carbonate (EC) and dimethyl carbonate (DMC).50
1) solution mixture of ethylene carbonate (EC) and dimethyl carbonate (DMC).50
      
      
        
        |  | (3) | 
|  | (4) | 
In the current study, to scrutinize the structural changes upon lithiation, we calculated the root mean square deviation (RMSD) between the reactant and product molecules for all the down-selected 641 candidates. For this task, we employed the LS-align algorithm,58 which is an iterative and heuristic tool for atom-level structural comparisons of the molecules. We used the DFT-level optimized reactant and product molecular geometries for the comparisons. To ensure consistency over the explored space of candidate molecules, the calculation of RMSD was based only on the carbon backbones of the pre- and post-lithiated CQSs.65 The backbone carbon atoms were selected not only because they represent proportionally the largest atomic fraction count in quinone backbones but also because they undergo significant aromatization-driven changes due to lithiation. It must be noted that the inclusion of the contribution of non-core atoms to RMSD is anyway likely to be erroneous in the gas phase approximation that has been adopted here. This is because the reorganization of these peripheral atoms are highly likely to be affected by the inter-molecular interactions in the solid phase.
It is important to note that some CQSs, such as BQ,26,27 NQ,29 AQ,29–34 PQ,17,29,34 and several others18,61,62 found in our screening library, have been investigated in recent literature as cathode materials for LIBs. Thus, these molecules were classified as already-investigated. After excluding the already-investigated molecules from the screened dataset of 349 molecules, the rest of the molecules that were commercially-available were classified as CA otherwise as low-SCScore. To utilize ChemPlot, we fed it a two-column dataset comprising the SMILES representation of molecules in the first column and their corresponding three categorical target values (low-SCScore, CA, and already-investigated) in the second column. ChemPlot's tailored similarity approach works by initially computing all available descriptors for each molecule, which are then collated into a matrix format, with the rows representing the individual molecules and the columns representing their respective descriptors. The software then identifies which calculated descriptors are correlated with the categorical target values. Finally, the resulting matrix of selected descriptors is utilized as an input for the dimensionality reduction phase.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 g mol−1, Sigma-Aldrich) with a mass ratio of 8
000 g mol−1, Sigma-Aldrich) with a mass ratio of 8![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) :
:![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 1
1![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) :
:![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 1. In particular, the organic active material was ground with super P in an agate mortar for 15 minutes to obtain the black mixed powder, which was subsequently dispersed in N-methylpyrrolidone (NMP, ≤99%, Sigma-Aldrich) solution with the dissolved PVDF (0.05 g ml−1). The mixed slurry was treated by ultrasonication for 30 minutes and cast on Al foil. The half-cell consisted of a working electrode, lithium metal (99.9%, Sigma-Aldrich), and a glass fiber separator (ECC1-01-0012-B/L, EL-CELL), while the cell assembly was performed in an argon-filled glove box. The electrolyte is composed of a solution mixture of EC and DMC (v/v = 1
1. In particular, the organic active material was ground with super P in an agate mortar for 15 minutes to obtain the black mixed powder, which was subsequently dispersed in N-methylpyrrolidone (NMP, ≤99%, Sigma-Aldrich) solution with the dissolved PVDF (0.05 g ml−1). The mixed slurry was treated by ultrasonication for 30 minutes and cast on Al foil. The half-cell consisted of a working electrode, lithium metal (99.9%, Sigma-Aldrich), and a glass fiber separator (ECC1-01-0012-B/L, EL-CELL), while the cell assembly was performed in an argon-filled glove box. The electrolyte is composed of a solution mixture of EC and DMC (v/v = 1![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) :
:![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 1) with 1.0 M LiPF6 (Sigma-Aldrich, battery grade). The cyclic voltammetry (CV) measurements were conducted using a galvanostat/potentiostat (VMP-300, BioLogic) with a scan rate of 0.2 mV s−1.
1) with 1.0 M LiPF6 (Sigma-Aldrich, battery grade). The cyclic voltammetry (CV) measurements were conducted using a galvanostat/potentiostat (VMP-300, BioLogic) with a scan rate of 0.2 mV s−1.
      
    
    
      
       values of molecules in the virtual library is shown in Fig. 2a. Each grey dot on the map represents a unique molecule, while the entire group of 245 CA molecules is shown using a spectrum of colors, where the colors for molecules were assigned according to the total number of R-groups accommodated on them. As can be observed, there is a wide-ranging distribution of molecules across the two metrics with no apparent correlation. The CA molecules are found to accumulate on the bottom-left of the distribution map. Remarkably, with med(SCScores of CA molecules) = 2.2, 95% of the CA molecules had an SCScore < μ(SCScores of all molecules). In addition, the SCScore distribution of molecules is observed to be positively biased towards the molecules that accommodate a small number of R-groups. This is expected since the synthesis of molecules that contain a large number of functional groups will be more complex. All these observations support both the aptness of the SCScore as a first-order metric for describing the synthesizability of the virtual molecules and the screening of these molecules on the basis of their SCScores.
 values of molecules in the virtual library is shown in Fig. 2a. Each grey dot on the map represents a unique molecule, while the entire group of 245 CA molecules is shown using a spectrum of colors, where the colors for molecules were assigned according to the total number of R-groups accommodated on them. As can be observed, there is a wide-ranging distribution of molecules across the two metrics with no apparent correlation. The CA molecules are found to accumulate on the bottom-left of the distribution map. Remarkably, with med(SCScores of CA molecules) = 2.2, 95% of the CA molecules had an SCScore < μ(SCScores of all molecules). In addition, the SCScore distribution of molecules is observed to be positively biased towards the molecules that accommodate a small number of R-groups. This is expected since the synthesis of molecules that contain a large number of functional groups will be more complex. All these observations support both the aptness of the SCScore as a first-order metric for describing the synthesizability of the virtual molecules and the screening of these molecules on the basis of their SCScores.
        The predicted redox potentials of the CA molecules were found to be in the lower range, as shown in Fig. 2a. 180 out of 245 CA molecules had  values of 2.5 ± 0.2 V vs. Li/Li+. On a more detailed inspection, it is found that a majority (∼81%) of the CA molecules were the functionalized derivatives of only four different quinones, namely, BQ (24 molecules; ∼13%), NQ (43 molecules; ∼24%), AQ (63 molecules; ∼35%), and PQ (16 molecules; ∼9%). In previous experiments, the
 values of 2.5 ± 0.2 V vs. Li/Li+. On a more detailed inspection, it is found that a majority (∼81%) of the CA molecules were the functionalized derivatives of only four different quinones, namely, BQ (24 molecules; ∼13%), NQ (43 molecules; ∼24%), AQ (63 molecules; ∼35%), and PQ (16 molecules; ∼9%). In previous experiments, the  values of NQ and AQ were between 2.3 and 2.5 V vs. Li/Li+, whereas PQ had
 values of NQ and AQ were between 2.3 and 2.5 V vs. Li/Li+, whereas PQ had  .29,50,61 Next, molecules that have both a low-SCScore (≤2.2) and small number of functional groups (≤4) were down-selected for further screening. This resulted in 396 low-SCScore molecules from the library that were not CA. These molecules had 1.90 <
.29,50,61 Next, molecules that have both a low-SCScore (≤2.2) and small number of functional groups (≤4) were down-selected for further screening. This resulted in 396 low-SCScore molecules from the library that were not CA. These molecules had 1.90 <  < 4.25 (±0.03) vs. Li/Li+, which is a wider potential window than has previously been exercised with quinone-based materials at LIB cathodes.45 Given the large variety of functional groups available in the chemical literature, we surmise that the synthesis of quinones with targeted redox potentials is unlikely to be a bottleneck.
 < 4.25 (±0.03) vs. Li/Li+, which is a wider potential window than has previously been exercised with quinone-based materials at LIB cathodes.45 Given the large variety of functional groups available in the chemical literature, we surmise that the synthesis of quinones with targeted redox potentials is unlikely to be a bottleneck.
Next, we grouped together 245 CA and 396 low-SCScore molecules, thus reaching a total of 641 molecules, for further screening. It has previously been shown that DFT is more accurate than SEQM when predicting the redox potentials of quinones, with the normalized root-mean-square errors (NRMSE) being 12.65% and 16.10%, respectively.45 In order to proceed further with higher accuracy, DFT calculations have been performed to recalculate the redox potentials of the 641 molecules. As a side note, the parity plot comparing SEQM (AM1) data against DFT (B3LYP) data is shown in Fig. 2b. The observed RMSE and coefficient of determination (R2) of 0.12 V vs. Li/Li+ and 0.94, respectively, imply that the AM1 method is sufficiently accurate when used for the purpose of the initial ranking of molecules in a large virtual library.
To evaluate the 641 molecules based on their structural stability and reaction kinetics, we calculated the RMSD of backbone carbon positions between the DFT-optimized structures of pre- and post-lithiated molecules. Fig. 3 shows the distribution of calculated RMSD values of molecules against  . We found no clear correlations between the calculated RMSD values and either of
. We found no clear correlations between the calculated RMSD values and either of  or chemical functional group type. Nevertheless, it is observed that the molecules functionalized with –Cl groups tend to have higher RMSD (above the horizontal dotted line: ∼11% of 641 molecules for Cl, ∼6% for CH3, ∼7% for OCH3, and ∼5% for NH2), thus making them more likely to undergo degradation. Drawing any inferences on the stability of candidate molecules based on RMSD analysis needs validation against experimentally backed metrics. However, such analysis is currently not possible due to the lack of such corroborating experimental data in the literature. Still, we recommend candidates with low RMSD values for experimental validation. Theoretically, the lengths of single, aromatic, and double bonds between the two neighboring carbon atoms in a molecule are 1.54, 1.40, and 1.33 Å, respectively.63 Thus, as a crude approximation, we posit a maximum tolerable value of change in the bond length for an aromatic bond as 0.14 Å (i.e. single-aromatic) because bond length changes larger than 0.14 Å will likely be more prone to structural deformation and degradation. Subsequently, applying this criterion to the group of 641 molecules, which is demarcated by the horizontal line in Fig. 3, 430 promising molecules, which included 216 CA molecules, were down-selected.
 or chemical functional group type. Nevertheless, it is observed that the molecules functionalized with –Cl groups tend to have higher RMSD (above the horizontal dotted line: ∼11% of 641 molecules for Cl, ∼6% for CH3, ∼7% for OCH3, and ∼5% for NH2), thus making them more likely to undergo degradation. Drawing any inferences on the stability of candidate molecules based on RMSD analysis needs validation against experimentally backed metrics. However, such analysis is currently not possible due to the lack of such corroborating experimental data in the literature. Still, we recommend candidates with low RMSD values for experimental validation. Theoretically, the lengths of single, aromatic, and double bonds between the two neighboring carbon atoms in a molecule are 1.54, 1.40, and 1.33 Å, respectively.63 Thus, as a crude approximation, we posit a maximum tolerable value of change in the bond length for an aromatic bond as 0.14 Å (i.e. single-aromatic) because bond length changes larger than 0.14 Å will likely be more prone to structural deformation and degradation. Subsequently, applying this criterion to the group of 641 molecules, which is demarcated by the horizontal line in Fig. 3, 430 promising molecules, which included 216 CA molecules, were down-selected.
Continuing the down-selection process, we considered the predicted redox potential values as a metric. The most extensively investigated quinone, both in experiments and computations, is AQ.29–34 The calculated  and
 and  values for AQ were 2.30 and 2.28 V vs. Li/Li+, respectively, which are both consistent with the experimentally reported value of 2.27 V.50,61 Targeting molecules with a redox potential higher than AQ, 349 of 430 molecules were found to have
 values for AQ were 2.30 and 2.28 V vs. Li/Li+, respectively, which are both consistent with the experimentally reported value of 2.27 V.50,61 Targeting molecules with a redox potential higher than AQ, 349 of 430 molecules were found to have  ≥ 2.28 V vs. Li/Li+, as demarcated by the vertical line in Fig. 3. This filtered set was found to be composed of 55 distinct CQSs, and it included 140 CA molecules. These encouraging findings indicate a wealth of diverse, purchasable and potentially high-performing molecules which are worthy of in-depth analysis aimed at digging out good cathode materials for LIBs. A CSV file containing comprehensive information on each of the 349 molecules can be found in the Data availability section.
 ≥ 2.28 V vs. Li/Li+, as demarcated by the vertical line in Fig. 3. This filtered set was found to be composed of 55 distinct CQSs, and it included 140 CA molecules. These encouraging findings indicate a wealth of diverse, purchasable and potentially high-performing molecules which are worthy of in-depth analysis aimed at digging out good cathode materials for LIBs. A CSV file containing comprehensive information on each of the 349 molecules can be found in the Data availability section.
Finally, to determine the molecules for experimentation in the current study, we visualized the clustering of the 349 quinones by using ChemPlot (Fig. 4). The principle that guides the use of ChemPlot is to identify molecules that are both intriguing for further validation and characterization, while also being structurally distinct from the molecules that have already been studied in the literature, considering that there are numerous molecules available to choose from.
The Fig. 4 inset shows the distribution of  and Q of the 349 molecules. As can be observed in Fig. 3 and 4, there are ample possibilities in terms of a multitude of properties, including redox potential, stability, charge capacity and chemical variety in functional groups. Quite remarkably, it is apparent from the Fig. 4 inset that the CA (green dots) and already-investigated (blue dots) molecules tend to show only modest energy-related properties in comparison to the virtual library molecules with low SCScores (orange dots). The CA and already-investigated molecules were distributed in the low
 and Q of the 349 molecules. As can be observed in Fig. 3 and 4, there are ample possibilities in terms of a multitude of properties, including redox potential, stability, charge capacity and chemical variety in functional groups. Quite remarkably, it is apparent from the Fig. 4 inset that the CA (green dots) and already-investigated (blue dots) molecules tend to show only modest energy-related properties in comparison to the virtual library molecules with low SCScores (orange dots). The CA and already-investigated molecules were distributed in the low  and Q regions. Interestingly, many of the already-investigated molecules were located in proximity to the CA molecules. Consequently, these molecules were not prioritized for experimental validation in the current study, as our main interest is in molecules that have not been studied for LIB cathodes.
 and Q regions. Interestingly, many of the already-investigated molecules were located in proximity to the CA molecules. Consequently, these molecules were not prioritized for experimental validation in the current study, as our main interest is in molecules that have not been studied for LIB cathodes.
As the prime candidates for our experiments, we focused on the CA molecules that were located in the same clusters as a large number of low-SCScore molecules. Accordingly, five CA quinones were identified in three different clusters of low SCScore virtual molecules that are pointed out using the black arrows in Fig. 4. The same five CA molecules are also highlighted using the black crosses in the inset figure of Fig. 4. The calculated cathode-related properties of these five CA molecules are summarized in Table 2. The molecules #1 and #5 are predicted to exhibit lower cathode-related performance than the rest of the three molecules based on the calculated properties of  , Q, and W. Thus, we prioritized molecules #2, #3 and #4 for experimentation and among them we selected molecule #4 (AT) for electrochemical characterization based on its in-stock commercial availability.
, Q, and W. Thus, we prioritized molecules #2, #3 and #4 for experimentation and among them we selected molecule #4 (AT) for electrochemical characterization based on its in-stock commercial availability.
 , gravimetric charge capacity (Q), and gravimetric energy density (W) of the top five CA quinones for LIB cathodes
, gravimetric charge capacity (Q), and gravimetric energy density (W) of the top five CA quinones for LIB cathodes
		 and the B3LYP
 and the B3LYP  methods (Table 2). For the low redox potential, we used the computed LUMO+1 energy of AT
 methods (Table 2). For the low redox potential, we used the computed LUMO+1 energy of AT  and eqn (2) to obtain the predicted
 and eqn (2) to obtain the predicted  between AT-Li2 and AT-Li4. The corresponding calculated value of 2.29 V was close to the experimental value of 2.4 V.
 between AT-Li2 and AT-Li4. The corresponding calculated value of 2.29 V was close to the experimental value of 2.4 V.
        |  | ||
| Fig. 5  Cyclic voltammograms in the first four cycles of 1,4,9,10-anthracenetetraone (AT) in 1 mol L−1 LiPF6 and a v/v = 1 ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) : ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 1 solution mixture of EC and DMC. | ||
The effects of functional groups on  and SCScore are shown in Fig. 6a and d, respectively, and the more detailed distributions in terms of the different total number of rings (∼size) are shown in Fig. S2.† It is found that the presence of highly electron-withdrawing functional groups (–Cl > –CH3 > –OCH3 > –NH2) on molecules shifts the predicted redox distribution towards higher potentials (Fig. 6a and S2†). On the other hand, as shown in Fig. 6d, the SCScore was the lowest for –Cl functionalized molecules, but it had no clear dependence on the choice of the functional group otherwise. Also observable in Fig. S2† is the fact that the effect of functional groups on
 and SCScore are shown in Fig. 6a and d, respectively, and the more detailed distributions in terms of the different total number of rings (∼size) are shown in Fig. S2.† It is found that the presence of highly electron-withdrawing functional groups (–Cl > –CH3 > –OCH3 > –NH2) on molecules shifts the predicted redox distribution towards higher potentials (Fig. 6a and S2†). On the other hand, as shown in Fig. 6d, the SCScore was the lowest for –Cl functionalized molecules, but it had no clear dependence on the choice of the functional group otherwise. Also observable in Fig. S2† is the fact that the effect of functional groups on  and SCScore is consistent across the different groups of total rings. The potential window of one-ring quinones is quite narrow, while the redox potentials of other size groups span a much broader range. Surprisingly, there are hardly any differences in the potential range spanned by two-ring, three-ring, and four-ring quinone groups despite having very different number positions available for functionalization. The detailed distributions in Fig. S2† also show that the SCScore increases with an increasing number of total rings.
 and SCScore is consistent across the different groups of total rings. The potential window of one-ring quinones is quite narrow, while the redox potentials of other size groups span a much broader range. Surprisingly, there are hardly any differences in the potential range spanned by two-ring, three-ring, and four-ring quinone groups despite having very different number positions available for functionalization. The detailed distributions in Fig. S2† also show that the SCScore increases with an increasing number of total rings.
The number of aromatic rings, which is essentially the number of rings that include no carbonyl groups of a reactant molecule CQS, also had an influence on both properties (Fig. S3†). An increase in the number of aromatic rings in reactant molecules resulted in shifts towards lower redox potentials (Fig. 6b) and higher SCScores (Fig. 6e). This anti-correlation implies that it will be difficult to independently tune the redox potential and the synthesizability of molecules just by varying the number of aromatic rings. It is also observed that the ranges over which both  and SCScore vary also expand with the number of aromatic rings found on molecules. This is not so surprising, as molecules with a large number of rings will have more atomic node positions available for functionalization and they are therefore more likely to show a high degree of structural and electrochemical tunability.
 and SCScore vary also expand with the number of aromatic rings found on molecules. This is not so surprising, as molecules with a large number of rings will have more atomic node positions available for functionalization and they are therefore more likely to show a high degree of structural and electrochemical tunability.
Lastly, we analyzed the effects of the type and number of carbonyl groups on  and SCScore. The distribution of redox potentials, with respect to the type of carbonyl groups, is shown in Fig. 6c, and the detailed information in terms of the total number of rings is shown in Fig. S3.† More detailed 2D histograms of molecules having different carbonyl groups are shown in Fig. S4–S7.† For all the molecules that had the same number of carbonyl groups, the ones that contained 1,2-benzoquinone, [O], spanned a wider range of redox potentials when compared to the ones that contained 1,4-benzoquinone, [P]. Furthermore, it should be noted that [P] and [O] have a quite similar distribution of
 and SCScore. The distribution of redox potentials, with respect to the type of carbonyl groups, is shown in Fig. 6c, and the detailed information in terms of the total number of rings is shown in Fig. S3.† More detailed 2D histograms of molecules having different carbonyl groups are shown in Fig. S4–S7.† For all the molecules that had the same number of carbonyl groups, the ones that contained 1,2-benzoquinone, [O], spanned a wider range of redox potentials when compared to the ones that contained 1,4-benzoquinone, [P]. Furthermore, it should be noted that [P] and [O] have a quite similar distribution of  for one-ring molecules as shown in Fig. S3.† As the total number of rings increases, [O] shows a wider potential window and higher
 for one-ring molecules as shown in Fig. S3.† As the total number of rings increases, [O] shows a wider potential window and higher  than [P]. It can also be observed in Fig. S3† that the distributions of
 than [P]. It can also be observed in Fig. S3† that the distributions of  show little difference for [P + P], [O + O] and [P + O] conformers in two-ring quinones. For the three-ring and four-ring quinones, [O + O] and [P + O] conformers have relatively higher
 show little difference for [P + P], [O + O] and [P + O] conformers in two-ring quinones. For the three-ring and four-ring quinones, [O + O] and [P + O] conformers have relatively higher  than [P + P] conformers. Interestingly, the molecules with two pairs of carbonyl groups showed higher redox potentials, which also motivated our choice of the AT compound for the CV experiments as discussed above. The [O + O] group of molecules achieved the highest median redox potential among all the molecules with four carbonyl groups. This observation can be attributed to favorable energetic interactions between a Li atom and two C
 than [P + P] conformers. Interestingly, the molecules with two pairs of carbonyl groups showed higher redox potentials, which also motivated our choice of the AT compound for the CV experiments as discussed above. The [O + O] group of molecules achieved the highest median redox potential among all the molecules with four carbonyl groups. This observation can be attributed to favorable energetic interactions between a Li atom and two C![[double bond, length as m-dash]](https://www.rsc.org/images/entities/char_e001.gif) O groups at neighboring positions on the same ring, thus resulting in a significantly higher stabilization of the product compound. A similar trend was also pointed out by Poizot et al., in which they observed that ortho-quinones can achieve a redox potential about 300 mV higher than para-quinones.64 The overall influence of carbonyl groups on pushing the redox potential of the molecules to higher values can be summarized as follows: [O + O] ≥ [P + O] > [P + P] > [O] > [P]. As shown in Fig. 6f, the SCScore metric showed a relatively weak correlation with the type of carbonyl group present, although quinones containing two pairs of carbonyl groups had marginally lower SCScores compared to those containing a single pair. Fig. S3† serves as further evidence that an increase in the number of rings present in quinone reactants leads to a higher SCScore.
O groups at neighboring positions on the same ring, thus resulting in a significantly higher stabilization of the product compound. A similar trend was also pointed out by Poizot et al., in which they observed that ortho-quinones can achieve a redox potential about 300 mV higher than para-quinones.64 The overall influence of carbonyl groups on pushing the redox potential of the molecules to higher values can be summarized as follows: [O + O] ≥ [P + O] > [P + P] > [O] > [P]. As shown in Fig. 6f, the SCScore metric showed a relatively weak correlation with the type of carbonyl group present, although quinones containing two pairs of carbonyl groups had marginally lower SCScores compared to those containing a single pair. Fig. S3† serves as further evidence that an increase in the number of rings present in quinone reactants leads to a higher SCScore.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 498 quinone-based molecules for LIBs. To rank the molecules based on their synthesizability, the SCScore metric of each compound was evaluated using a ML-based model, where a low SCScore indicated a high possibility of synthesizability. In parallel, a search for vendor information for the virtually designed molecules was performed in the ZINC database, which revealed the commercial availability records for 245 unique molecules. Interestingly, most of the commercially available molecules were found to have low SCScores, thus validating the usefulness of this metric. Subsequently considering molecules that have low SCScores and a small number of functional groups, 396 additional virtual molecules that had no ZINC records were down-selected from the screening library. Thus, a cumulative number of 641 candidate molecules were subjected to DFT calculations for further validation. Next, an RMSD analysis of the DFT-calculated structural distortions, due to lithiation, was applied to evaluate the propensity of these 641 molecules to structural degradation. Consequently, 349 quinone-based molecules were identified that had low RMSD and high
498 quinone-based molecules for LIBs. To rank the molecules based on their synthesizability, the SCScore metric of each compound was evaluated using a ML-based model, where a low SCScore indicated a high possibility of synthesizability. In parallel, a search for vendor information for the virtually designed molecules was performed in the ZINC database, which revealed the commercial availability records for 245 unique molecules. Interestingly, most of the commercially available molecules were found to have low SCScores, thus validating the usefulness of this metric. Subsequently considering molecules that have low SCScores and a small number of functional groups, 396 additional virtual molecules that had no ZINC records were down-selected from the screening library. Thus, a cumulative number of 641 candidate molecules were subjected to DFT calculations for further validation. Next, an RMSD analysis of the DFT-calculated structural distortions, due to lithiation, was applied to evaluate the propensity of these 641 molecules to structural degradation. Consequently, 349 quinone-based molecules were identified that had low RMSD and high  values. After the use of dimensionality reduction and tailored similarity methods for chemical space exploration, we targeted one readily purchasable molecule, AT, for validation. This compound showed highly promising redox features in CV experiments. The approach presented in the current study resulted in a collection of top-candidates that are not only predicted to-be high-performing battery materials but are also deemed to be synthetically accessible, which together make them attractive for further investigations. Finally, the accompanying calculation dataset of molecules is anticipated to be a useful resource as both a reference and a training data source for the development of new materials and predictive models, respectively.
 values. After the use of dimensionality reduction and tailored similarity methods for chemical space exploration, we targeted one readily purchasable molecule, AT, for validation. This compound showed highly promising redox features in CV experiments. The approach presented in the current study resulted in a collection of top-candidates that are not only predicted to-be high-performing battery materials but are also deemed to be synthetically accessible, which together make them attractive for further investigations. Finally, the accompanying calculation dataset of molecules is anticipated to be a useful resource as both a reference and a training data source for the development of new materials and predictive models, respectively.
    
    
       ,
,  , W, SCScore, and RMSD values, and commercial vendor page retrieved from the ZINC database whenever available. This data frame is available as a self-contained ESI† in CSV format.
, W, SCScore, and RMSD values, and commercial vendor page retrieved from the ZINC database whenever available. This data frame is available as a self-contained ESI† in CSV format.
    
    
      | Footnote | 
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2dd00112h | 
| This journal is © The Royal Society of Chemistry 2023 |