Siyu
Gao‡
a,
Yiqun
Luo‡
b,
Xingyu
Liu
a and
Noa
Marom
*abc
aDepartment of Materials Science and Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, USA. E-mail: nmarom@andrew.cmu.edu
bDepartment of Physics, Carnegie Mellon University, Pittsburgh, PA 15213, USA
cDepartment of Chemistry, Carnegie Mellon University, Pittsburgh, PA 15213, USA
First published on 7th April 2025
Excited-state properties of crystalline organic semiconductors are key to organic electronic device applications. Machine learning (ML) models capable of predicting these properties could significantly accelerate materials discovery. We use the sure-independence-screening-and-sparsifying-operator (SISSO) ML algorithm to generate models to predict the first singlet excitation energy, which corresponds to the optical gap, the first triplet excitation energy, the singlet–triplet gap, and the singlet exciton binding energy of organic molecular crystals. To train the models we use the “PAH101” dataset of many-body perturbation theory calculations within the GW approximation and Bethe–Salpeter equation (GW+BSE) for 101 crystals of polycyclic aromatic hydrocarbons (PAHs). The best performing SISSO models yield predictions within about 0.2 eV of the GW+BSE reference values. SISSO models are selected based on considerations of accuracy and computational cost to construct materials screening workflows for each property. The screening targets are chosen to demonstrate typical use-cases relevant for organic electronic devices. We show that the workflows based on SISSO models can effectively screen out most of the materials that are not of interest and significantly reduce the number of candidates selected for further evaluation using computationally expensive excited-state theory.
Key parameters for device performance are derived from excited-state properties of crystalline organic semiconductors. The optical gap, which corresponds to the lowest singlet exciton energy, is of fundamental importance for any device based on absorption or emission of light. The triplet exciton energy is important for devices that involve conversion between singlet and triplet states. To increase the efficiency of OLEDs, it is desirable to convert electrically generated triplet excitons into emissive singlet excitons via thermally activated delayed fluorescence (TADF).47–53 In OPV, the thermalization loss may be reduced by converting high-energy singlet excitons into two triplet excitons, via singlet fission (SF).54–62 Conversely, the transmission loss of photons with energies below the optical gap of the absorber may be reduced by up-converting two low-energy triplet excitons into one singlet exciton via triplet–triplet annihilation (TTA).63–71 TTA also plays a crucial role in the detection of X-rays, gamma rays, and neutrons in scintillators.30 Another important parameter for organic devices is the exciton binding energy, which is the electrostatic attraction between the electron and hole that must be overcome in order to separate an exciton into free charge carriers.72 The band lineup e.g., between the donor and acceptor in organic solar cells, is often engineered to overcome the exciton binding energy and provide the driving force for charge separation.
Different applications impose different requirements on the excited-state energies of molecular crystals. For example, solar cells require materials with broad absorption in the visible range, whereas OLEDs require materials with sharp emission in specific colors. Other applications may require absorption in the infrared (IR) or ultra-violet (UV) ranges. SF requires the triplet excitation energy to be slightly smaller than half the singlet excitation energy, whereas TTA requires the triplet energy to be slightly larger than half the singlet energy. TADF requires the difference between the singlet excitation energy and the triplet excitation energy, also known as the singlet–triplet gap, to be as small as possible or even negative. In addition, compatibility with other device components may require singlet and/or triplet energies in a specific range and/or specific band edge positions. Although the structures of over a million organic molecular crystals are known,73,74 the electronic and optical properties of most of them are unknown. A material originally produced for one purpose may turn out to be useful for another.75 Even materials that formed as reaction byproducts may turn out to have useful properties.76 Therefore, the ability to predict the excited-state properties of molecular crystals could lead to materials discovery and advances in organic semiconductor devices.
On present day supercomputers, it is possible to screen thousands of materials in a “high-throughput” manner using density functional theory (DFT) with computationally efficient semi-local exchange-correlation functionals.77,78 However, DFT is a ground-state theory, which inherently cannot describe the excited-state properties required for organic semiconductor devices. The excited-state properties of isolated molecules may be calculated relatively efficiently with time dependent DFT (TDDFT). Indeed, TDDFT has been employed for high-throughput screening efforts to discover organic chromophores.79–86 The properties of crystalline organic semiconductors depend not only on the molecular properties, but also on the crystal packing and the resulting electronic interactions between molecules. There is a vast body of literature reporting how changes in the crystal packing affect device relevant properties of organic semiconductors from excited-state energies to charge carrier mobility.32,36–38,40,44,87–90 Therefore, for applications based on crystalline organic semiconductors it is important for computational materials discovery efforts to focus on predicting the excited-state properties of crystals as opposed to isolated molecules.
The excited-state properties of molecular crystals can be calculated within the framework of Green's function based many-body perturbation theory. The GW approximation, where G represents the one-particle Green's function and W represents the screened Coulomb interaction, can be used to calculate properties associated with charged excitations, such as the fundamental band gap. Subsequently, the GW quasiparticle energies are fed into the Bethe–Salpeter equation (BSE) to calculate properties associated with neutral excitations, such as the singlet and triplet excitation energies.91–95 The high computational cost of GW+BSE is prohibitive for large-scale materials screening. Machine learning (ML) can be used to perform preliminary screening and reduce the need for expensive simulations to a smaller number of promising candidates.96–107 However, training ML models such as neural networks typically requires a large amount of data,108,109 which is difficult to acquire with GW+BSE. Compared to DFT and even TDDFT datasets, GW+BSE datasets are scarce and contain a relatively small amount of data for isolated molecules77,110,111 or for inorganic crystals with a small number of atoms in the unit cell.112 As a consequence, efforts to use ML to predict the outcomes of GW+BSE calculations have also been restricted to small organic molecules and small inorganic systems.112–116 This has limited the ability to train ML models to predict the excited state properties of molecular crystals.
Recently, we have published a first of its kind dataset of GW+BSE calculations for 101 molecular crystals of polycyclic aromatic hydrocarbons (PAHs) with up to ∼500 atoms in the unit cell, known as PAH101.117 We have chosen to focus on PAHs because they are the fundamental building blocks of materials for organic electronics.7,88,118–126Fig. 1a and b shows the size distribution of the molecules and the unit cells in the PAH101 set. The data records contain the GW+BSE singlet and triplet excitation energies, whose distributions are shown in Fig. 1c and d as well as the GW fundamental band gaps. From these quantities we may also calculate the singlet–triplet gap and the singlet exciton binding energy, whose distributions are shown in Fig. 1e and f. Detailed technical validation of the PAH101 dataset is provided in ref. 117, including comparison of the relaxed geometries with experimental data, convergence tests of the GW+BSE calculations, and comparison of the optical gaps and absorption spectra to available experimental data.
The PAH101 dataset was originally generated for the purpose of SF materials discovery.127 To address the challenge of constructing transferable ML models with a small amount of training data, we used the sure-independence-screening-and-sparsifying-operator (SISSO)128,129 algorithm. SISSO takes advantage of physical/chemical knowledge to train ML models based on “small data”. The input of SISSO is a set of scalar primary features, which are physical/chemical descriptors thought to be related to the target property. The primary features used in ref. 127 are also provided in the PAH101 data records.117 SISSO generates a huge feature space by repeatedly combining the primary features using a set of linear and nonlinear algebraic operations with rules to avoid unphysical combinations. Subsequently, linear regression is performed to identify the most predictive models. In the last few years SISSO has been applied increasingly broadly for diverse classes of materials and target properties and continues to perform well with relatively small amounts of data.130–149 In ref. 127 SISSO successfully produced several models that were able to predict the GW+BSE values of the SF driving force, which corresponds to the difference between the singlet exciton energy and twice the triplet exciton energy, with a root mean square error (RMSE) below 0.2 eV. Based on considerations of model accuracy and primary feature computational cost, two of the SISSO models were selected to build a classifier for materials screening. Later, the simpler model of the two was found to deliver robust performance outside of the PAH101 set, whereas the more complex model was found to be overfitted.54
Here, we use the PAH101 dataset to train SISSO models for the singlet exciton energy, which corresponds to the optical gap, the triplet exciton energy, the singlet–triplet gap, and the singlet exciton binding energy. For all four properties, the best performing SISSO models yield predictions within about 0.2 eV of the GW+BSE reference values. We then select SISSO models based on considerations of accuracy and computational cost to construct materials screening workflows for each property. The screening targets are chosen to demonstrate typical use-cases relevant for organic electronic devices. We demonstrate screening for materials with an optical gap in a particular color; materials meeting the requirements for up-conversion of infrared light to the visible range via TTA, namely, a triplet excitation energy in the infrared, optical gap in the visible range, and a higher TTA driving force than rubrene; materials with a small singlet–triplet gap, which is desirable for OLEDs; and materials with a small singlet exciton binding energy to facilitate the separation of excitons into free charge carriers. We show that the workflows based on SISSO models can effectively screen out most of the materials that are not of interest and significantly reduce the number of candidates selected for further evaluation using computationally expensive excited-state theory.
Feature | Cost | Description |
---|---|---|
GapS | 1 | Single molecule gap, i.e., the energy difference between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO) |
IPS | 2 | Single molecule ionization potential, evaluated as the total energy difference between a cation and neutral molecule |
PolarTensorS | 2 | Trace of the molecular polarization tensor, calculated using PBE with the many-body dispersion (MBD) method155 |
E ST | 3 | Single molecule triplet formation energy, corresponding to the total energy difference between the ground-state and triplet-state molecule |
EAS | 3 | Single molecule electron affinity, evaluated as the total energy difference between an anion and neutral molecule |
ΔESST | 3 | Estimated single molecule singlet–triplet gap, evaluated as GapS − EST. |
GapC | 33 | The crystal band gap, extracted from the band structure |
VBCdisp | 33 | Valence band dispersion, corresponding to the energy range of the HOMO-derived band |
CBCdisp | 33 | Conduction band dispersion, corresponding to the energy range of the LUMO-derived band |
ε C | 42 | Dielectric constant calculated using the PBE + MBD@rsSCS polarizability in the Clausius–Mossotti equation156 |
H ab | 93 | The highest value of the transfer integral obtained out of all the unique molecular dimers extracted from the crystal, calculated with fragment orbital DFT157 |
E CT | 148 | Crystal triplet formation energy, corresponding to the total energy difference between the ground-state and triplet-state crystal |
ΔECST | 181 | Estimated crystal singlet–triplet gap, evaluated as GapC − ECT |
MolWtS | 0 | Molecular weight in atomic mass units (amu) |
ρ C | 0 | Crystal density in amu Å−3 |
AtomNumC | 0 | Number of atoms in the crystal unit cell |
In ref. 127, A subset of 10 PAH crystals of different sizes with a range of SF driving force values were completely left out of the SISSO training to serve as the test set of unseen data. The same 10 structures were withheld here as an unseen validation set and kept completely separate from the remaining 91 structures, which were used for model training. An additional set of 9 hydrocarbon crystals, not included in the PAH101 set, was used to test the performance of the best SISSO-generated models outside of the PAH101 set. This set, referred to as “Test 2”, includes: terrylene (CSD reference code AZOXOF), benzo [e]dinaphtho[2,3-a; 10,20,30,40-ghi]fluoranthene (CSD reference code ZERXED), 7,14-diphenylnaphtho[1,2,3,4-cde]bisanthene (CSD reference code ZERXIH), 9,18-diphenyldibenzo[a,o]naphtho[1,2,3,4-ghi]perylene (CSD reference code ZERXON), 8,9-bis(4-methylphenyl)-10-phenylpentaleno[1,2-a]naphthalene (CSD reference code BEGJOO), 3,6-bis-(diphenylmethylene)-1,4-cyclohexadiene (CSD reference code DUPRIP), heptafulvalene (CSD reference code HEPFUL10), 3,12-Di-t-butyl-2,2,13,13-tetramethyl-tetradeca-3,5,7,9,11-pentaene (CSD reference code GAFDUO), and (E)-1-cycloocta-tetraenyl-2-phenylethene (CSD reference code GIWHUP). GW+BSE results were published in ref. 154 for terrylene, in ref. 76 for ZERXED, ZERXIH, and ZERXON, and in ref. 54 for the remaining materials. Some of the materials in the second test set are structurally very different from the materials in PAH101. HEPFUL10 and GIWHUP contain 7- and 8-membered carbon rings, respectively, and GAFDUO comprises a long polyene chain. Therefore, they can provide an estimate for the performance of the SISSO models outside of the PAH101 set.
To generate the feature space, features were constructed with a maximum rung (the number of times primary features are combined) of 3 and a maximum dimension (Dim) of 4. Features were combined using the operator set . The maximum complexity, i.e., the maximum number of operators in one combined model, was set to 10. For the singlet exciton energy, triplet exciton energy, and singlet exciton binding energy a total of roughly 5 × 102, 4 × 105, and 6 × 1010 features were generated by SISSO with a rung of 1, 2, and 3, respectively. For the singlet–triplet gap, a total of roughly 6.5 × 102, 7.5 × 105, and 2 × 1011 features were generated by SISSO with a Rung of 1, 2, and 3, respectively.
After feature generation, SISSO performs linear regression to yield the model prediction, where each model is the scalar product of the SISSO-generated feature with a vector of fitted coefficients. Then, the models are ranked according to their prediction performance. Sure independence screening (SIS) is used to select optimal subspaces from the huge feature space. The number of features saved after SIS was set to 20. SISSO then uses ℓ0-norm minimization as a sparsifying operator (SO) to determine the sparse solution for each such subspace. For each combination of dimension and rung, 40 rounds of leave-10-out cross validation (LCV) were performed. In each round, 10 data points (out of the 91 points used for model training) were randomly selected and held out as an unseen validation set. The model with the lowest RMSE for the validation set was selected in each round. Finally, the model with the lowest RMSE for the combined training and validation data was selected out of the 40 models. This model is denoted as MDim, Rung. A full account of the SISSO models for each target property is provided in the SI. For each model, the training RMSE was calculated for the whole training set with 91 structures and the test RMSE was calculated for the 10 withheld materials, which were excluded from the LCV.
The computational cost of SISSO-generated models varies depending on the number and type of primary features they contain. The relative cost of each primary feature was evaluated in a similar manner to ref. 127, relative to the calculation of the single molecule gap, GapS, whose cost is the lowest of all the primary features. The computer time required to calculate GapS was assigned a value of 1 cost unit and the cost of other features is given in Table 1 as multiples of that unit. The cost of all the primary features has been updated to account for new developments in the latest version of FHI-aims. In particular, the many-body dispersion (MBD)155 calculation has become significantly more efficient than in older versions of the code. In addition, we averaged the cost over the 10 structures in the validation set, rather than picking one system of average size, as in ref. 127. The cost of each SISSO model was evaluated by summing over the costs of all the primary features included in it. The cost of features that appear in the model more than once was counted only once. The cost of features that do not require additional calculations was also counted only once. For example, the crystal band gap, GapC, valence band dispersion, VBCdisp, and conduction band dispersion, CBCdisp, are extracted from the same band structure calculation. Therefore, if two or more of them are included in a SISSO model the cost is counted only once.
A comparison of the SISSO models to baseline models is provided in the ESI.† Linear regression (LR) and Gaussian process regression (GPR) models for each property were trained based on the SISSO primary features and the many-body tensor representation (MBTR).158 The LR and GPR models were trained using the scikit-learn159 and GPyTorch160 Python packages, respectively. For the singlet excitation energy, the triplet excitation energy, and the singlet–triplet gap, we also provide a comparison with a linear fit based only on the corresponding DFT-level approximations, namely the single molecule HOMO–LUMO gap and crystal band gap (GapS, GapC) for the singlet excitation energy, the single molecule and crystal triplet formation energy (EST, ECT) for the triplet excitation energy, and the DFT estimate for the single molecule and crystal singlet–triplet gap (ΔESST, ΔECST), evaluated by calculating the difference between the aforementioned features. For the singlet exciton binding energy there is no corresponding DFT feature. In all cases, the SISSO models provide better prediction performance with less over-fitting than the baseline models (see detailed discussion in the ESI†).
In Fig. 2, the models generated by SISSO for each property are evaluated based on considerations of cost vs. accuracy, represented by Pareto plots. For accuracy, we consider both the cross validation “train” RMSE, obtained for the training set of 91 materials, and the “test” RMSE, obtained for the unseen validation set of 10 materials. In general, the computational cost tends to increase with the model complexity because the more complex models contain more primary features. The training set RMSE generally tends to decrease with the model complexity. The validation set RMSE is generally higher than the training set RMSE. For some of the more complex models, the validation set RMSE is significantly higher than the training set RMSE, indicating over-fitting to the training data.
A detailed discussion of the results for each property is provided in the corresponding sub-sections below. For each property, we select the three models that deliver the most robust performance out of each group of models with a similar computational cost, based on the Pareto plots shown in Fig. 2. For these models, we perform a more detailed analysis of the correlation between the model predictions and the GW+BSE reference data. We examine the most significant outliers and evaluate whether the models perform robustly for the additional materials not included in the PAH101 set. The models that perform robustly for both the PAH101 set and the additional materials are selected to construct hierarchical screening workflows, as suggested in ref. 127, in which a decreasing number of candidate materials are evaluated with increasingly accurate and more expensive models. The performance of the workflows is evaluated in terms of the number of true positives and false positives that pass each filtering stage out of the entire set of 110 materials (PAH101 and the additional test set). In the Conclusion section, we remark on the overall performance of SISSO for predicting the excited-state properties of crystalline organic semiconductors and provide a comparative assessment of the performance for different properties.
We select models out of each cost group that are close to the Pareto front for both the train and test RMSE. In the lower-cost group M4,1 and M3,1 yield the lowest RMSE of 0.15 eV and 0.16 eV, respectively, for the training set. However, their performance for the test set is significantly worse with RMSEs of 0.27 eV and 0.28 eV, respectively. In comparison, M1,3 yields a similar train RMSE of 0.16 eV and a lower test RMSE of 0.24 eV. For this reason, we select M1,3 out of the lower-cost group:
![]() | (1) |
![]() | (2) |
![]() | (3) |
Fig. 3 shows the predictions of the SISSO models selected based on considerations of cost and accuracy as a function of the GW+BSE reference values of the singlet exciton energy (optical gap). Parity plots for all other SISSO models are provided in the ESI.† Overall the model predictions are quite close to the reference values. The low-cost model M1,3 has a few outliers with larger deviations from the reference values. Some of the outliers are chemically and/or structurally distinct from the majority of systems in the dataset.54,127 For example, 2-(naphthalen-2-yl)azulene (CSD reference code PUJQIV) contains a 7-membered ring fused with a 5-membered ring and heptafulvalene (HEPFUL10) contains two 7-membered rings. Biphenyl (CSD reference code BIPHEN) and terphenyl (CSD reference code TERPHE02) comprise benzene rings connected by single bonds, rather than the large aromatic systems of fused rings characteristic of most of the PAH101 set. For biphenyl in particular, the optical gap value of 3.41 eV included in the PAH101 dataset117 is a significant underestimation compared to the experimental values of 4.1–4.18 eV.162–165 Therefore, it could be argued that this data point is less reliable and the model prediction is in fact closer to experiment. Coronene (CSD reference code CORONE01), 9,18-diphenyltetrabenz(a,c,h,j)anthracene (CSD reference code FACPEE), and 9,18-diphenyldibenzo[a,o]naphtho[1,2,3,4-ghi]perylene (CSD reference code ZERXON) do not appear chemically or structurally distinct from most of the materials in the PAH101 set. Coronene and FACPEE have unusually high values for the crystal triplet formation energies, ECT, which may explain why their optical gaps are overestimated by M1,3. Conversely, ZERXON has a particularly low ECT, which causes its optical gap to be significantly underestimated. For many of the M1,3 outliers out of the PAH101 set, the predictions of the intermediate-cost M3,3 and higher-cost M4,3 models are closer to the reference values, with the exception of terphenyl, which remains an outlier. For the additional test set, which was not included in PAH101, the RMSE deteriorates with the model complexity, which indicates over-fitting.
The PAH101 set contains materials with a wide range of optical gaps, as shown in Fig. 1c. Crystalline quaterrylene (QUATER10) and hexacene (ZZZDKE01) have the smallest optical gaps of 1.33 eV and 1.17 eV, respectively. Fig. 4 demonstrates a two-stage screening workflow constructed based on the SISSO models selected for the optical gap. The first stage is M1,3 and the second stage is M3,3. We have decided not to use M4,3 because of its markedly worse performance for the Test 2 set. For demonstration purposes, we screen for materials with an optical gap in the range of 2.2–2.5 eV, corresponding to green color. There are 17 such materials in total in the combined set of PAH101 and the additional test set. The screening thresholds for each stage of the workflow are set to 2.2 eV–0.5 n × RMSE to 2.5 eV + 0.5 n × RMSE, where n = 1,2,3 and RMSE refers to the training set RMSE of each model.
With n = 1, the first-stage model, M1,3, screens out most of the materials, whose optical gap is not within the target range, leaving 9 false positives. 16 of the 17 materials, whose gaps are in the target range are successfully identified. Further screening by the second-stage model, M3,3, significantly reduces the number of false positives from 9 to 4, while retaining the 16 true positives. Setting n = 2 does not improve the performance of the workflow because the final outcome is the same 16 true positives only with a higher number of false positives. With n = 3 all 17 true positives pass the screening along with 19 and 16 false positives after the first and second stage of screening, respectively. Using the same workflow to screen for materials with optical gaps in the red range and blue range produces very similar results, as shown in the ESI.† Based on this, we would suggest a two-stage workflow with thresholds defined depending on one's tolerance for false positives. We also note that the optical gaps obtained from GW+BSE are typically within 0.2 eV from experiment.117 The training RMSE values of M1,3 and M3,3 are 0.16 eV and 0.11 eV, respectively. Therefore, it may be reasonable to set n = 2 to account for the errors of GW+BSE on top of the errors of the ML models.
We select models out of each cost group that are close to the Pareto front for both the train and test RMSE. Of the lower-cost group M3,1 yields the lowest RMSEs of 0.11 eV and 0.22 eV, respectively, for the training set and the validation set.
![]() | (4) |
None of the models in the intermediate-cost group is on the Pareto front for both the training set and the validation set. M3,3 yields the lowest RMSE of 0.07 eV for the training set, however its performance deteriorates considerably for the validation set with an RMSE of 0.22 eV. The next best model for the training set, M2,3, with an RMSE of 0.09 eV also performs significantly worse for the validation set with an RMSE of 0.19 eV. We therefore choose the third model, M3,2, because its RMSE of 0.09 eV for the training set is similar to M2,3, but its performance for the validation set is markedly better with an RMSE of 0.17 eV.
![]() | (5) |
Of the high-cost group, M4,3 is on the Pareto front for both the training set and the validation set with RMSEs of 0.06 eV and 0.14 eV, respectively.
![]() | (6) |
Fig. 5 shows the predictions of the SISSO models selected based on considerations of cost and accuracy as a function of the GW+BSE reference values of the triplet exciton energy. Parity plots for all other SISSO models are provided in the ESI.† Similar to the optical gap, the SISSO model predictions are quite close to the reference values. The low-cost model M3,1 has a few outliers with larger deviations from the reference values. Biphenyl and terphenyl are significant outliers, similar to the optical gap models. 1,2,3,4-Tetraphenylbenzene (CSD reference code FOVVOB) also comprises benzene rings connected by single bonds. Biphenyl, 9,9′-bianthracenyl (CSD reference code KUBWAF01), hexabenzo(bc,ef,hi,kl,no,qr)coronene (CSD reference code HBZCOR), and 1,12-benzoperylene (CSD reference code BNPERY) have unusually high single molecule triplet formation energy, EST, values exceeding the GW+BSE reference values of their crystal triplet excitation energies, which causes the M3,1 predictions to be overestimated. 7,14-Diphenylnaphtho[1,2,3,4-cde]bisanthene (CSD reference code ZERXIH) has an unusually large value for the trace of the molecular polarization tensor PolarTensorS, which causes it to become a significant outlier for M3,2. Interestingly, the lowest-cost model M3,1 has a lower RMSE for the Test 2 set than for the PAH101 held out validation set. In contrast, the performance of the more complex models M3,2 and M4,3 deteriorates very significantly for the additional test set, indicating over-fitting.
The PAH101 set contains materials with a wide range of triplet excitation energies, as shown in Fig. 1d. Fig. 6 demonstrates a one-stage screening workflow for the triplet exciton energy based on the M3,1 SISSO model, which was selected because it retains robust performance for the additional test set, unlike the more complex models. Because of the recent interest in TTA up-conversion of infrared (IR) light to the visible range,71 we screen for materials, whose triplet excitation energy is in the IR, below 1.6 eV. There are 37 such materials in total in the combined set of PAH101 and the additional test set. The screening thresholds are set to 1.6 eV + n × RMSE, where n = 1,2,3 and RMSE refers to the training set RMSE of M3,1, 0.11 eV. With n = 1, the M3,1 model correctly classifies all 37 materials with triplet excitation energy in the IR with only 3 false positives. Setting n to 2 or 3 is not beneficial because it only increases the number of false positives. In this case, screening only with M3,1 delivers robust performance.
Next, we assess whether the 37 materials in the combined set of PAH101 and the additional test set with triplet excitation energies in the IR are likely to undergo TTA. The thermodynamic driving force for TTA is given by the difference between twice the triplet excitation energy and the singlet excitation energy (the opposite of the SF driving force). As discussed in ref. 54,63 and 127, GW+BSE systematically underestimates the SF driving force and therefore overestimates the TTA driving force. Hence, we assess the likelihood of a given material to undergo SF/TTA relative to other known SF/TTA materials. Only 8 out of the 37 materials have a higher TTA driving force than rubrene and perylene: pyracylene (KEGHEJ01), 11-phenylbenzo[a]naphtho[2,1,8-cde]perylene (KAGFUV), the two polymorphs of diindeno[1,2,3 cd:1′,2′,3′-lm]perylene (POBPIG and POBPIG06), anthra(2,1,9,8-hijkl)benzo(de)naphtho(2,1,8,7-stuv)pentacene (BOXGAW), dibenzo(def,i)naphtho(1,8,7-v,w,x)pyranthrene (DUPHAX), benzo[e]dinaphtho[2,3-a; 10,20,30,40 -ghi]fluoranthene (ZERXED), and heptafulvalene (HEPFUL10). All of these have GW+BSE triplet excitation energies in the near-IR and their GW+BSE optical gaps range from 1.89 eV for crystalline heptafulvalene to 2.41 eV for KAGFUV. Materials in the PAH101 set with lower triplet excitation energies tend to be more likely to undergo SF than TTA. Five of these compounds have been previously assessed in ref. 63 as isolated molecules rather than crystals, using a different GW+BSE implementation. Therein, the likelihood of the competing process of conversion of two excitons in the first triplet state to one exciton in a higher triplet state instead of a singlet exciton was also considered. Based on these energetic considerations, pyracylene (labeled as compound C1 in ref. 63) appeared promising, however it is not a good TTA chromophore because of its short triplet state lifetime and the rapid non-radiative decay of its lowest singlet state.166 POBPIG is a perylene derivative (labeled as compound D16 in ref. 63), for which the competing conversion to a higher triplet state was found to be too energetically favorable for high-yield TTA. KAGFUV, BOXGAW, and DUPHAX are large aromatic compounds comprising several fused six-membered rings that can be classified as graphene flakes. Of these, KAGFUV (labeled as compound E2 in ref. 63) and DUPHAX (labeled as compound E3 in ref. 63) were identified as TTA candidates based on energetic considerations. ZERXED has been evaluated based on the same criteria in ref. 76 (therein it was labeled “Compound I”) and found to be a promising TTA candidate. Heptafulvalene is reported here for the first time as a potential TTA candidate and could indicate an interesting new direction of exploring compounds with 7-membered rings as TTA/SF candidates.
The workflow presented here demonstrates how ML models can be used to identify new potential TTA candidates with triplet exciton energies in the IR, optical gaps in the visible range, and favorable TTA energetics. We note that the optical gap SISSO models presented above and the SF driving force SISSO models from ref. 127 could be used to perform further fast screening of materials that pass the triplet exciton energy screening. Thus, the number of candidates selected for computationally expensive excited-state calculations can be significantly reduced.
We select models out of each cost group that are close to the Pareto front for both the train and test RMSE. In the very-low-cost group M1,2 and M2,1 contain the same primary features and have a similar performance with train RMSE of 0.14 eV and test RMSEs of 0.15 eV and 0.14 eV, respectively. Therefore we use an ensemble of the two models:
![]() | (7) |
![]() | (8) |
![]() | (9) |
Fig. 7 shows the predictions of the SISSO models selected based on considerations of cost and accuracy as a function of the GW+BSE reference values of the singlet–triplet gap. Parity plots for all other SISSO models are provided in the ESI.† We note that the energy range of GW+BSE reference data for the singlet–triplet gap is significantly smaller than the energy range of the singlet and triplet exciton energies (see the distributions in Fig. 1). Therefore, the deviations of the SISSO model predictions appear larger in comparison to the reference data even though they are of comparable magnitude or even smaller than the deviations of the SISSO models for the singlet and triplet exciton energies. In the discussion of the outliers we focus on materials with relatively small singlet–triplet gaps because these would be of most interest for OLEDs. The very-low-cost model M1,2 & M2,1 has several significant outliers in the range of singlet–triplet gaps below 0.6 eV. These include materials that are also among the outliers for the singlet and triplet exciton energy models, PUJQIV, FACPEE, HEPFUL10, and coronene. Hexaphenylbenzene (CSD reference code HPHBNZ03) comprises phenyl rings connected by single bonds like some of the outliers for the singlet and triplet excitation energies. Trinaphtho[1,2,3,4-fgh:1′,2′,3′,4′-pqr:1″,2″,3″,4″-za_1_b_1_]trinaphthylene (CSD reference code GUQZUP) and (5)helicene (CSD reference code DBPHEN02) are not structurally or chemically distinct from most of the materials in the PAH101 set. In addition, they do not have any primary features with unusual values. It is possible that the relatively large number of outliers, especially in the low singlet–triplet gap range, is a reflection of the difficulty of training reliable models to predict very small target values, in particular considering that there are few materials with a singlet–triplet gap below 0.5 eV in the training data (See Fig. 1e). The performance of all three models for the Test 2 set is worse than for the PAH101 validation set. In particular, for the more complex model, M3,3, the Test 2 RMSE is significantly higher than the lower cost models. For HEPFUL10, M3,3 even predicts a negative value. The poor performance of M3,3 for the Test 2 set indicates overfitting.
Fig. 1e shows the distribution of singlet–triplet gaps in the PAH101 dataset. Small singlet–triplet gaps are rare among this class of materials. The materials with lowest singlet–triplet gaps (in parentheses) are: trinaphtho[1,2,3,4-fgh:1′,2′,3′,4′-pqr:1″,2″,3″,4″-za_1_b_1_]trinaphthylene (GUQZUP; 0.36 eV), 9,18-diphenyltetrabenz(a,c,h,j)anthracene (FACPEE; 0.38 eV), acenaphtho[3,2,1,8-fghij]tetrabenzo[a,c,m,o]picene (VUFHUA; 0.435 eV), benzo(1,2,3-bc:4,5,6 b′,c′)dicoronene (YOFCUR; 0.44 eV), and 2-(naphthalen-2-yl)azulene (PUJQIV; 0.45 eV). Even the lowest singlet–triplet gaps in the PAH101 set would be considered marginal or too high for TADF. It is interesting to note that with the exception of PUJQIV (shown in Fig. 3 and 7), the materials with the smallest singlet–triplet gaps in the PAH101 set bear no resemblance to the donor–acceptor compounds typically used for TADF.47,167 Rather, they are large PAHs with extended π systems. FACPEE (shown in Fig. 3 and 7), VUFHUA, and YOFCUR have segments that could lead to charge-transfer-like intramolecular excitations. GUQZUP can be described as a graphene flake with no obvious segments. The twisted conformation it adopts in the crystal structure may contribute to orbital localization and charge-transfer-like excitations. The effect of crystal packing and intermolecular vs. intramolecular charge-transfer excitations on singlet–triplet gaps is not well-understood and should be further investigated in relation to TADF in crystalline materials.168,169
Fig. 8 demonstrates a two-stage screening workflow constructed based on the SISSO models selected for the singlet–triplet gap. The M1,2 & M2,1 and M2,2 models were selected for the first and second stages of the workflow, respectively, based on their performance for the Test 2 set. For demonstration purposes, we screen for materials with a singlet–triplet gap below 0.5 eV. There are 10 such materials in total in the combined set of PAH101 and the additional test set. The screening thresholds for each stage of the workflow are set to 0.5 eV + n × RMSE, where n = 1,2,3 and RMSE refers to the training set RMSE of each model. With n = 1, the first-stage ensemble model of M1,2 & M2,1 screens out most of the materials, whose singlet–triplet gap is above 0.5 eV, leaving only 8 false positives. However, 5 of the 10 materials whose singlet–triplet gaps are below 0.5 eV are also screened out, leaving only 5 true positives. Further screening by the second-stage model, M2,2 reduces the number of false positives to 5 without losing any additional true positives. Setting n = 2 results in 7 and 6 true positives passing the first and second stage, respectively, with a significantly higher number (23) of false positives passing the screening. With n = 3, 9 true positives pass the screening along with 55 false positives. One may define the screening thresholds depending on their tolerance for false positives. With the thresholds set to one RMSE, the workflow presented here effectively eliminates most of the materials that are not of interest, which significantly reduces the number of candidates that need to be evaluated using more accurate and computationally expensive methods. Training better models for predicting the singlet–triplet gap would require acquiring more GW+BSE data for materials with small singlet–triplet gaps. To this end, the present models could be used to select materials for data acquisition.
We select models out of each cost group that are close to the Pareto front for both the train and test RMSE. The very-low-cost model M1,1 yields RMSEs of 0.17 eV for the training set and 0.28 eV for the validation set (0.19 eV without terphenyl).
M1,1 = 0.24 × (IPS/ρC) −1.35 | (10) |
Of the low-cost group, M4,1 and M2,2 yield the same RMSEs of 0.14 eV for the training set and 0.27 eV for the test set. M2,2 delivers a slightly better performance for the validation set without terphenyl with an RMSE of 0.18 eV, compared to 0.20 eV for M4,1. Therefore, we select M2,2:
M2,2 = 0.012 × (EST + IPS) × (IPS/ρC) + 0.020 × ((EST − GapS) × (CBCdisp × AtomNumC)) −0.17 | (11) |
Of the intermediate-cost group M4,2 yields the lowest RMSE of 0.12 eV for the training set, however its performance deteriorates significantly for the validation set with an RMSE of 0.27 eV (0.18 eV without terphenyl), indicating over-fitting. The next model M2,3 has a more robust performance with RMSEs of 0.12 eV for the training set and 0.22 eV for the validation set (0.13 eV without terphenyl).
![]() | (12) |
Both models in the high-cost group are over-fitted, as indicated by the large difference between their performance for the training set vs. the validation set. Therefore, these overly complex models are not useful.
Fig. 9 shows the predictions of the SISSO models selected based on considerations of cost and accuracy as a function of the GW+BSE reference values of the singlet exciton binding energy. Parity plots for all other SISSO models are provided in the ESI.† We note that, similar to the singlet–triplet gap, the energy range of the GW+BSE reference data for the singlet exciton binding energy is significantly smaller than the energy range of the singlet and triplet exciton energies (see the distributions in Fig. 1). Therefore, the deviations of the SISSO model predictions appear larger in comparison to the reference data even though they are of comparable magnitude or even smaller than the deviations of the SISSO models for the singlet and triplet exciton energies. Terphenyl is a major outlier to the point that it noticeably skews the test RMSE. This is perhaps not surprising because terphenyl is an outlier for most other properties considered here. Some of the outliers encountered for other properties are also outliers for the singlet exciton binding energy models, including PUJQIV and FACPEE (not shown), as well as HEPFUL10. Some of the outliers, such as β-tribenzopyrene (CSD reference code TBZPYR) and dinaphtho(1,2-a:1′,2′-h)anthracene (CSD reference code DNAPAN), do not appear structurally or chemically distinct from most of the materials in the PAH101 set. For M1,1, a relatively high value of the single molecule ionization potential, IPS, combined with a low crystal density, ρC can lead to significant overestimation (e.g., for terphenyl, DNAPAN, and GAFDUO). Conversely, a low IPS value combined with a high ρC can lead to significant underestimation (e.g., for TBZPYR, BEGJOO and HEPFUL10). The relatively large number of persistent outliers for the singlet exciton binding energy models could be a reflection of the difficulty of training reliable models to predict very small target values. The simpler models M1,1 and M2,2 have a lower RMSE for the Test 2 set than for the PAH101 validation set, whereas the more complex model M2,2 has a significantly higher RMSE for the Test 2 set, indicating over-fitting.
Fig. 1f shows the distribution of singlet exciton binding energies in the PAH101 dataset. In most organic materials the exciton binding energy is significant compared to inorganic materials because the dielectric screening of charges is not as strong. However, some materials in the PAH101 set have low exciton binding energies (in parentheses), including: anthra(2,1,9,8-hijkl)benzo(de)naphtho(2,1,8,7-stuv)pentacene (BOXGAW; 0.013 eV), dinaphtho(1,2-a:1′,2′-h)anthracene (DNAPAN; 0.071 eV), tetrabenzo(de,no,st,c1d1)heptacene (TBZHCE; 0.130 eV), benzo[lm]chryseno[1,12,11,10-opqrab]perylene (YUNYAJ; 0.165 eV), and hexabenzo(bc,ef,hi,kl,no,qr)coronene (HBZCOR; 0.169 eV). All of these compounds are characterized by very extended and/or elongated π systems, which likely lead to an already low molecular exciton binding energy (not calculated here), further reduced by dielectric screening in the solid form.
Fig. 10 demonstrates a two-stage screening workflow constructed based on the SISSO models selected for the singlet exciton binding energy. The M1,1 and M2,2 models have been selected for the first and second stage, respectively, based on their robust performance for the Test 2 set. A small exciton binding energy means it is easier to separate excitons into free charge carriers. It is also often associated with strong dielectric screening and better charge transport. For demonstration purposes, we screen for materials with a singlet exciton binding energy below 0.3 eV. There are 17 such materials in total in the combined set of PAH101 and the additional test set. The screening thresholds for each stage of the workflow are set to 0.3 eV + n × RMSE, where n = 1,2,3 and RMSE refers to the training set RMSE of each model. With n = 1, the first-stage model, M1,1, screens out many of the materials, whose exciton binding energy is above 0.3 eV, leaving 21 false positives. However, 4 of the 17 materials with exciton binding energy below 0.3 eV are also screened out, leaving 13 true positives. Further screening by the second-stage model, M2,2 reduces the number of false positives to 15 with two additional true positives lost. Setting n = 2 results in 16 out of 17 true positives (all except terphenyl) passing the screening, but with a high number (41) of false positives. Setting n = 3 is not beneficial because the same 16 true positives pass the screening (terphenyl is still misclassified) along with 57 false positives. One may define the screening thresholds depending on their tolerance for false positives. With the thresholds set to one RMSE, the workflow presented here effectively eliminates most of the materials that are not of interest, which significantly reduces the number of candidates that need to be evaluated using more accurate and computationally expensive methods. Training better models for predicting the singlet exciton binding energy would require acquiring more GW+BSE data for materials with small exciton binding energies. To this end, the present models could be used to select materials for data acquisition.
For all four properties, the workflows based on SISSO models can effectively screen out most of the materials that are not of interest. However, the classification performance varies across properties and screening targets in terms of the number of false positives that pass the workflow and the number of true positives missed. The workflows for the optical gap and triplet excitation energy yield a more robust performance than the workflows for the singlet–triplet gap and exciton binding energy. The SISSO models for the triplet exciton energy performed particularly well. The lowest cost model, which only requires evaluating single molecule DFT features, successfully classified all the materials with triplet exciton energies in the IR with a very small number of false positives. This analysis helped identify compounds with 7-membered rings, such as heptafulvalene, as a potential new direction to be explored for TTA/SF.
The overall narrow energy range of the singlet–triplet gap and exciton binding energy in the PAH101 dataset, as well as the small number of samples with the desirable low values, made it more challenging to train reliable models for these properties. Improving the models would require additional data acquisition. To this end, the present models could be used to select materials likely to have low singlet–triplet gaps or low exciton binding energy for additional GW+BSE calculations. We also note that ML models trained on PAH101 are not guaranteed to perform well for materials that are significantly different chemically or structurally. This has been demonstrated here and in ref. 54 by the worse performance of some of the more complex SISSO models for an additional test set of 9 hydrocarbon crystals. Therefore, we recommend carefully validating the performance of these models for materials outside of the PAH101 set before using them for large scale screening.
In conclusion, we have demonstrated that the SISSO algorithm can generate ML models to predict the excited-state properties of molecular crystals using only a small amount of GW+BSE training data by incorporating physical/chemical knowledge into the selection of primary features. The resulting ML models, which require only relatively low-cost DFT calculations to evaluate, can provide estimates for excited-state properties of molecular crystals including the optical gap, triplet exciton energy, singlet–triplet gap, and exciton binding energy. These properties would not otherwise be accessible via DFT calculations. Using ML models in the early stages of materials screening workflows can effectively narrow down the number of candidates selected for further evaluation using computationally expensive excited-state theory. ML models can thus significantly accelerate the discovery of crystalline organic semiconductors with desirable properties for applications in optoelectronic devices.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4dd00396a |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2025 |