Taehyun Wona,
Naoya Aizawa*abcd,
Yu Harabuchi*de,
Reo Kuriharaa,
Mitsuharu Suzuki
a,
Satoshi Maeda
def,
Yong-Jin Pu
c and
Ken-ichi Nakayama
a
aDivision of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
bCenter for Future Innovation, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
cRIKEN Center for Emergent Matter Science (CEMS), Wako, Saitama 351-0198, Japan
dInstitute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Sapporo, Hokkaido 001-0021, Japan
eJST, ERATO Maeda Artificial Intelligence in Chemical Reaction Design and Discovery Project, Sapporo, Hokkaido 060-0810, Japan
fDepartment of Chemistry, Faculty of Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
First published on 15th April 2025
Spin conversion in molecular excited states is crucial for the development of next-generation optoelectronic devices. However, optimizing molecular structures for rapid spin conversion has relied on time-consuming experimental trial-and-error, which limits the elucidation of the structure–property relationships. Here, we report a Bayesian molecular optimization approach that accelerates virtual screening for rapid triplet-to-singlet reverse intersystem crossing (RISC). One of the molecules identified through this virtual screening exhibits a fast RISC rate constant of 1.3 × 108 s−1 and a high external electroluminescence quantum efficiency of 25.7%, which remains as high as 22.8% even at a practical luminance of 5000 cd m−2 in organic light-emitting diodes. Post-hoc analysis of the trained machine learning model reveals the impact of molecular structural features on spin conversion, paving the way for informed and precise materials development.
The time constants of RISC are generally in the order of microseconds,19–22 which presents a challenge in the form of competing bimolecular recombination processes, such as triplet–triplet annihilation. Such detrimental processes become increasingly dominant at higher current densities in OLEDs, resulting in an appreciable decrease in device efficiency, a phenomenon known as efficiency roll-off.23 Thus, accelerating RISC and reducing efficiency roll-off are important hurdles to overcome for the practical use of TADF molecules in high-performance OLEDs.
Traditionally, TADF materials have been discovered through time-intensive experimental approaches; however, the advent of high-throughput virtual screening has offered a significant change in the materials design process.24–29 By employing computationally inexpensive density functional theory (DFT) and machine learning techniques, researchers can extensively explore the relevant molecular space for potential TADF materials with a small singlet–triplet energy gap (ΔEST). However, despite these advances, calculating ΔEST alone remains an inefficient way to estimate the rate constant of RISC (kRISC) and narrow down the most promising candidate molecules.
In the simplest physical picture, RISC is driven by a spin–orbit coupling at the crossing seam between the potential energy surfaces of singlet and triplet excited states. In cases of weak spin–orbit coupling in organic molecules, kRISC is expected to follow the Marcus-type non-adiabatic rate expression:
![]() | (1) |
Here we implement a machine learning technique called Bayesian optimization to computationally screen for TADF molecules with high kRISC. While conventional machine learning models for virtual screening require large number of training datasets, Bayesian optimization iteratively trains a probabilistic surrogate model with a limited number of datasets, strategically selecting the next data points to evaluate based on both exploration of uncertain space and exploitation of known space.37–42 This dual focus allows Bayesian optimization to rapidly identify the optimal molecules with a minimized number of high-fidelity excited-state calculations.
In the following section, we describe the development of our Bayesian optimization-based virtual screening that identified a TADF molecule with experimentally determined kRISC of 1.3 × 108 s−1. The external electroluminescence quantum efficiencies of the identified molecule reached 25.7%, which remained as high as 22.8% even at a practical luminance of 5000 cd m−2.
For each molecule, four quantum chemical properties were represented as descriptors for a vector search space: EHOMO, ELUMO, ΔEST, and HSO, derived from low-cost DFT calculations with the 6-31G basis set without diffuse and polarization functions, as detailed in ‘Methods’. We also used in-house binary molecular fingerprints FP as descriptors, which explicitly represent the number, positions, and categories of donor units (i.e., Cz, MCz, Acr, or Pxz) to classify candidate molecules in a six-dimensional space (Fig. 1B).
Fig. 1C compares the search performance of the Bayesian molecular optimization with different descriptor sets to that of uniform random sampling. This comparison evaluates the cumulative probability of finding the molecule with the highest kRISC as a function of the number of iterations, derived from 100 independent trials of the Bayesian optimization procedures, each with a different starting molecule. Essentially, Bayesian optimization outperforms uniform random sampling; however, its performance is highly dependent on the descriptor sets. In all 100 independent Bayesian optimization trials, the descriptor set (ΔEST, HSO) consistently found the highest kRISC within 82 iterations, whereas (EHOMO, ELUMO) required up to 148 iterations. Superficially, the superior performance of (ΔEST, HSO) is not surprising, given their physical relationship to kRISC. What is surprising, however, is that the DFT calculations of these descriptors rely on overly broad approximations to remain cost-effective, whose results may not even qualitatively align with those of the high-fidelity calculations for kRISC. Such circumstances suggest that the surrogate model captures the complex correlation between the descriptors and kRISC.
Combining FP with the descriptors further improves the performance of Bayesian optimization, locating the maximum point within only 55 iterations using (ΔEST, HSO, FP). Incorporating FP offers an additional advantage of chemical interpretability, allowing us to quantify the impact of molecular features on kRISC based on the trained surrogate model, as will be discussed later.
To extract chemical knowledge from the trained machine learning model, we assessed the descriptor importance using Shapley additive explanations (SHAP). This method assigns a SHAP value to each descriptor, quantifying its contribution to the model output by fairly distributing the difference between a given output and the average across all descriptors.53 Fig. 1D presents a series of beeswarm plots of the SHAP values for each descriptor in the trained model. A positive SHAP value indicates that a specific descriptor value, represented by the color, increases the model output (i.e., log10 kRISC) above the average, whereas a negative SHAP value indicates that it decreases the output below the average. A wider spread of the SHAP values for ΔEST and HSO in Fig. 1D suggests that these two descriptors are the most influential. However, the relationship between the descriptor values and the SHAP values appears to be nonlinear, implying that a smaller ΔEST and a larger HSO do not necessarily equate to a higher kRISC. These results highlight the limitations of the low-cost DFT outputs, and the importance of accounting for the spin-vibronic effects through the high-fidelity calculations of singlet–triplet crossing seams in predicting kRISC. For FP, a simpler pattern emerges in the SHAP plots: the presence of MCz positively correlates with the output, whereas the other descriptors (Cz, Pxz, and Acr) tend to correlate negatively with the output. The SHAP plots for FP also indicate that two substitutions, especially at the 3,6-positions (para-positions) of thioxanthone, are more beneficial for enhancing kRISC than one substitution or those at the 2,7-positions (meta-positions).
To understand the mechanism responsible for the trends in the importance of FP for kRISC, excited-state energy landscapes were established for seven model molecules by the high-fidelity TD-DFT calculations (Fig. 2A and S1† for relevant FP). Four molecules, MCz-m1, MCz-m2, MCz-p1, and MCz-p2, share common MCz donor units, but possess different substitution numbers and positions. The calculated adiabatic ΔEST of MCz-p1 is smaller than that of MCz-m1 by increasing the substitution number from one to two and changing the meta-substitution to para-substitution, resulting in a lower EA (the activation energy to reach the minimum-energy singlet–triplet crossing seam from equilibrium T1) and thus a higher kRISC of MCz-p2 (Table 1). Furthermore, the singlet–triplet crossing seam of MCz-p2 comprises more distinct orbital characters (1CT and 3LE) compared to MCz-m2 (1CT and 3CT + 3LE) (Fig. S2†), which is responsible for a higher HSO of MCz-p2, consistent with El-Sayed's rule.54 Additional calculations reveal that the 3,6-positions of thioxanthone are located at the nodes of its HOMO (Fig. 2B), leading us to interpret that the para-substitution of donor units suppresses π-conjugation and thus maintain a relatively pure 3LE on the thioxanthone unit, as in MCz-p2, resulting in higher kRISC.
Molecule | ES1a (eV) | ET1b (eV) | ΔESTc (eV) | EAd (eV) | λe (eV) | HSOf (cm−1) | kRISCg (s−1) |
---|---|---|---|---|---|---|---|
a Vertical S0–S1 excitation energy at the optimized S1 geometry.b Vertical S0–T1 excitation energy at the optimized T1 geometry.c Adiabatic energy gap between S1 and T1 at their respective optimized geometries.d Activation energy from the optimized T1 state to the minimum-energy singlet–triplet crossing seam.e Reorganization energy from the optimized T1 state to the optimized S1 state.f Spin–orbit coupling matrix element between S1 and T2 at the minimum-energy singlet–triplet crossing seam.g Rate constant of RISC calculated by using eqn (1). | |||||||
MCz-m1 | 2.550 | 2.341 | 0.262 | 0.264 | 0.227 | 2.32 | 1.10 × 105 |
MCz-m2 | 2.334 | 2.254 | 0.161 | 0.191 | 0.134 | 1.44 | 9.20 × 105 |
MCz-p1 | 2.724 | 2.518 | 0.261 | 0.269 | 0.169 | 3.96 | 3.05 × 105 |
MCz-p2 | 2.524 | 2.257 | 0.110 | 0.121 | 0.0953 | 6.11 | 2.94 × 108 |
Cz-p2 | 2.614 | 2.305 | 0.380 | 0.452 | 0.235 | 3.74 | 1.96 × 102 |
Acr-p2 | 2.374 | 2.308 | 0.0424 | 0.100 | 0.0292 | 0.599 | 1.13 × 107 |
Pxz-p2 | 1.993 | 1.994 | 0.0181 | 0.0896 | 0.00669 | 0.385 | 1.47 × 107 |
The effect of the structural modification of the donor units is also evaluated in MCz-p2, Cz-p2, Acr-p2, and Pxz-p2 with the same substitution numbers and positions (Fig. 2A). Removal of the electron-donating methyl groups from MCz-p2 results in a blue-shifted S1 of Cz-p2, which explain its larger ΔEST between S1 (1CT) and T1 (3LE) and lower kRISC (Table 1, Fig. 2A and 3 ESI†). In contrast, Acr-p2 and Pxz-p2, with more electron-rich donor units, possess a red-shifted S1 (1CT) and T1 (3CT) with a smaller ΔEST, due to weak electron exchange coupling associated with the spatially separated orbitals comprising these CT excited states. However, as the 1CT and 3CT undergo a red shift, their energy separation with the higher-lying 3LE increases, and thus the minimum-energy singlet–triplet crossing seam comprises CT characters, rather than LE, for both singlet and triplet states (Fig. S2†). The resultant lack of orbital change at the crossing seam suppresses spin–orbit coupling, as corroborated by the lower HSO and kRISC of Acr-p2 and Pxz-p2 than that of MCz-p2 (Table 1). These quantum chemical interpretations of the seven model molecules are in good agreement with the general trends in the FP importance shown in Fig. 1D.
![]() | ||
Fig. 3 (A) Steady-state absorption and PL spectra of MCz-p2 in a toluene solution (10−5 M). The inset in (A) is a photograph of the MCz-p2 solution exposed to UV light. (B) Transient PL decays of MCz-p2 measured with picosecond pulsed excitation at 375 nm. The blue line in (B) shows the fit of the transient PL decay to eqn (2) in ‘Methods’. The inset in (B) is the log–log representation. (C) Transient PL decays of MCz-p2 measured at varying temperatures. The inset in (C) represents the temperature-dependence of kISC and kRISC with their fits to the Arrhenius equation. (D–F) Current density–voltage–luminance characteristics (D), EL spectra (E), and external quantum efficiency-luminance characteristics (F) of the fabricated OLEDs. The inset in (F) is a photograph of an OLED using MCz-p2 working at high luminance (>5000 cd m−2). |
λPLa (nm) | ΦPLb (%) | τPFc (ns) | τDFd (ns) | kre (s−1) | knrf (s−1) | kISCg (s−1) | kRISCh (s−1) | ΔEST (eV) |
---|---|---|---|---|---|---|---|---|
a Photoluminescence (PL) peak wavelength.b PL quantum yield.c Time constant of prompt fluorescence.d Time constant of delayed fluorescence.e Rate constant of radiative decay of the S1 state to the S0 state.f Rate constant of non-radiative decay of the S1 state to the S0 state.g Rate constant of intersystem crossing (ISC) of the S1 state to the T1 state.h Rate constant of reverse intersystem crossing (RISC) of the T1 state to the S1 state.i Energy gap between the S1 and T1 states, estimated from the kRISC/kISC ratio, assuming the equilibrium between the S1 state and the three T1 sublevels.j Energy gap between the S1 and T1 states, estimated as the difference between activation energies of RISC and ISC. | ||||||||
485 | 82 | 1.4 | 852 | 5.3 × 106 | 1.2 × 106 | 5.8 × 108 | 1.3 × 108 | 0.010i/0.050j |
Transient PL decay measurements of MCz-p2 confirmed both prompt and delayed fluorescence components, with time constants (τPF and τDF) of 1.4 ns and 852 ns, respectively (Fig. 3B and Table 2). In comparison to this sub-microsecond τDF of MCz-p2, typical TADF materials exhibit τDF in the microsecond to millisecond time range, e.g. 2.8 μs for the seminal TADF molecule 4CzIPN.6 Further kinetic analysis of the transient PL decay (described in ‘Methods’) revealed that the RISC of MCz-p2 is indeed on the order of nanoseconds with an exceptionally high kRISC of 1.3 × 108 s−1, in good quantitative agreement with the model-predicted kRISC of 1.11 × 108 s−1. We note that the rapid RISC of MCz-p2 can be retained in a solid-state host matrix 3,3′-bis(carbazol-9-yl)biphenyl (mCBP) (Table S1 and Fig. S4†), though the emission is slightly red-shifted to a maximum peak wavelength of 495 nm, indicating a stabilization of a 1CT state in a higher dielectric environment of mCBP than toluene.
We measured the temperature-dependent transient PL decays of MCz-p2 in toluene (Fig. 3C). The decay kinetics suggest that the delayed fluorescence is attributed to a thermally activated process, as evidenced by the increase of τDF with decreasing temperature. Using the Arrhenius equation, the corresponding activation energies of ISC and RISC were estimated to be 0.001 eV and 0.051 eV, respectively. The difference between these activation energies gives ΔEST of 0.050 eV. An estimate of ΔEST was also obtained from the kRISC/kISC ratio, yielding a value of 0.010 eV (Table 2). We note that the former estimate is reasonably closer to the calculated value of 0.110 eV (Table 1).
The electroluminescence properties of MCz-p2 were also evaluated to demonstrate the practical implications of high kRISC in OLED performance (Fig. 3D–F). The details of the OLED structures and fabrication procedures are described in ‘Methods’. The fabricated OLEDs were confirmed to have MCz-p2 as the only source of electroluminescence with a maximum peak wavelength of 495 nm (Fig. 3E). The measured external quantum efficiency reached a maximum of 25.7% (Fig. 3F), corresponding to an internal quantum efficiency of 85.7%, assuming a light-outcoupling efficiency of 30% for a bottom-emission OLED. Crucially, a high external quantum efficiency of 22.8% was retained even at a practically high luminance of 5000 cd m−2. The suppression of efficiency roll-off was further quantified by the critical current density (J80%) at which the external quantum efficiency decreases to 80% of its maximum. A J80% of 19 mA cm−2 was recorded for the MCz-p2 device, indeed higher than 8.2 mA cm−2 observed for a device using 4CzIPN as an emitter. These results are attributed to the higher kRISC of MCz-p2, which maintains a comparatively low triplet exciton concentration and thus suppresses bimolecular recombination processes at high luminance.
Bayesian optimization is poised to become an empowering tool for materials scientists in both academic and industrial settings by mitigating costly evaluations. Its efficient and scalable nature can further expedite the development of spin-conversion materials for organic optoelectronics, as demonstrated in this study, as well as in broader applications such as photocatalysis, fluorescent bioimaging, and photodynamic therapy.
For the high-fidelity calculations of kRISC using eqn (1), the geometries of the S1, T1, and singlet–triplet crossing seam were optimized for each molecule using the TD-DFT with the LC-BLYP functional and the 6-31+G(d) basis set within the Tamm–Dancoff approximation, as implemented in the GRRM17 program,57 which refers to the energies and gradients calculated by the Gaussian16 program. The minimum-energy singlet–triplet crossing seam was calculated between S1 and T2 for all molecules except for mol-00003 (its molecular structure is given in ‘ESI†’), where the calculations converged between S1 and T3. The range-separation parameter of the LC-BLYP functional was non-empirically optimized for each molecule to minimize the differences between the HOMO energies and the ionization potentials in both the neutral and radical anion systems.58 Geometry optimization calculations for the seven model molecules were carried out, considering relevant conformers.
The differential rate equation of the S1 and T1 populations (eqn (2)) was employed in the numerical fitting of the transient PL decays to obtain the rate constants of each excited-state transition, assuming negligible radiative and non-radiative decays from T1 to S0.
![]() | (2) |
Footnote |
† Electronic supplementary information (ESI) available: Fig. S1–S6 and Table S1, ESI method, materials, synthesis, note, and data for machine learning. See DOI: https://doi.org/10.1039/d5sc01903f |
This journal is © The Royal Society of Chemistry 2025 |