Generative design of singlet fission materials leveraging a fragment-oriented database

Thanapat Worakul; Rubén Laplaza; J. Terence Blaskovits; Clémence Corminboeuf

doi:10.1039/D5SC03184B

View PDF VersionPrevious ArticleNext Article

Open Access Article

This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

DOI: 10.1039/D5SC03184B (Edge Article) Chem. Sci., 2025, 16, 17956-17969

Generative design of singlet fission materials leveraging a fragment-oriented database

Thanapat Worakul ^a, Rubén Laplaza ^ab, J. Terence Blaskovits ^ac and Clémence Corminboeuf *^ab
^aLaboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland. E-mail: clemence.corminboeuf@epfl.ch
^bNational Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
^cMax-Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany

Received 1st May 2025 , Accepted 22nd August 2025

First published on 25th August 2025

Abstract

Recently, we leveraged the FORMED repository made up of 116 [thin space (1/6-em)] 687 synthesizeable molecules to deploy fragment-based high-throughput virtual screening (HTVS) and genetic algorithm (GA) searches of singlet fission (SF) molecular candidates. With these approaches, both prototypical (e.g., acenes, boron-dipyrromethane (BODIPY)) and unreported (e.g., heteroatom-rich mesoionic) classes of chromophore candidates fulfilling specific SF energetic requirements were identified. Yet, the reliance on predefined fragments limits chemical space exploration and, thus, the discovery of truly unforeseen molecular cores. Here, we exploit FORMED to train a generative learning framework driven by reinforcement learning and property predictions. The generative model rediscovers a diverse range of previously reported SF chromophore classes, including polyenes, benzofurans, fulvenoids and quinoidal systems, but also suggests an unexpected scaffold absent from the training data, neocoumarin (2-benzopyran-3-one), characterized by two endocyclic double bonds in an ortho arrangement and capped by a lactone group. An in-depth investigation reveals a diradicaloid behavior over the conjugated core comparable to 2-benzofuran, a widely known SF compound. This work highlights the potential of using both generative and property prediction models to discover candidates beyond derivatives of known chemistry for tailored material applications.

1 Introduction

The singlet fission (SF) process^1–4 refers to the spin-allowed conversion of a singlet excited-state (S₁) into two lower-lying triplet states (T₁). This phenomenon has the potential to improve the power conversion efficiency of silicon single-junction solar cells by exceeding the Shockley–Queisser thermodynamic efficiency limit of 33.7% (ref. 5 and 6) in silicon single-junction solar cells. However, materials suitable for SF must satisfy several stringent energy-based criteria: (1) the energy of S₁ must be at least twice or greater than that of T₁ for the process to be thermodynamically feasible,¹ (2) the energy of T₁ must be higher than the conduction band of silicon (∼1.12 eV) to ensure triplet energy transfer to a semiconductor^7,8 and (3) the S₁ energy must align with the energy of the incoming photon, typically in the visible or near-visible range (1.5–3.5 eV).⁹ These preliminary requirements make the identification and design of suitable SF materials a challenging task.

Extensive experimental and computational work has focused on designing SF materials, primarily through the screening and modification of known compounds, leading to tailored design rules.^10–21 In this spirit, some of us previously built the FORMED dataset by mining the Cambridge Structural Database (Fig. 1a) and characterizing 116 [thin space (1/6-em)] 687 experimentally accessible organic molecules with time-dependent density functional theory (TD-DFT).²² FORMED enabled the construction of over a million donor–acceptor copolymers by cross-coupling the C(sp²) sites of selected fragments in silico, which were subsequently screened using statistical models to identify systems with suited SF thermodynamics. Although this approach successfully identified several potential donor–acceptor systems, it relied on previously defined heuristic rules²³ to limit the combinatorial space.


	Fig. 1 Overview of molecular design strategies: (a) high-throughput virtual screening (HTVS), which evaluates the properties of compounds in a virtual library, ranks them, and selects candidate compounds with the top properties. (b) Genetic algorithm, where the property space is explored using predefined structure generation rules in the crossover and mutation steps. (c) Data-driven generative models, which learn structure generation rules by encoding chemical structures into a latent space. Structures are generated by sampling the learned latent space. With reinforcement learning, the generative process is biased toward molecules with tailored properties.

To navigate the chemical space more efficiently, this high-throughput effort was followed by the development of an uncertainty-controlled genetic algorithm (GA),²⁴ based on NavicatGA,^25,26 (Fig. 1b). Upon GA optimization, molecules were assembled from a FORMED-derived pool of fragments, called reFORMED, and ensemble machine learning predictive models trained on FORMED data served to score candidates. This approach led to the rediscovery of known SF compounds and to the identification of acceptors, such as heteroatom-rich mesoionic compounds, not previously investigated for SF. However, genetic optimization requires a predefined fragment database and fixed rules for their recombination, which inherently limits the potential of genetic approaches to identify structural motifs completely outside the box.^27–29

This limitation is potentially overcome by deep learning-based generative models,^30–37 which implicitly learn the rules for generating chemical structures. These models, trained on molecules, uncover the underlying structural patterns and relationships among them, encoding this information into a continuous latent space, a compressed representation of molecules. By sampling from this latent space, generative models create molecules that reside within the learned chemical domain. Free from the constraints of manually predefined recombination rules and fragment libraries, generative models enable a broader, unbiased chemical space exploration, with the potential of discovering compounds that traditional approaches would have missed.

Once the generative models have learned to create molecules, multiple conditioning strategies are devised to steer the generative process toward desirable properties, thereby enabling the inverse design process.^38–44 There are several approaches to direct the generation to the target molecules, such as gradient optimization in the latent space,⁴⁵ gradient-based guidance diffusion,³⁸ and classifier-free guidance diffusion.^46,47 Among these, reinforcement learning (RL)^39–44 is the optimization method that iteratively refines the model to meet target objectives. In each RL iteration, the model generates a batch of candidate molecules, which are then evaluated using a scoring function that quantifies how closely the generated molecules align with the desired properties. Based on the evaluation, the model parameters are updated through a feedback loop that improves its ability to generate higher-scoring molecules in subsequent iterations. Over multiple iterations, the generative model's outputs are refined toward compounds that satisfy the expected properties. Li and Tabor⁴⁸ demonstrated the potential of this approach by integrating a generative model with RL to identify SF candidates. Because their methodology relied on semi-empirical computations to evaluate excited-state energies during each RL iteration and because their model was trained on a ChEMBL database⁴⁹ containing only small drug-like molecules, the exploration of the chemical space remained limited, leading to the identification of thiophene and acene derivatives primarily. Replacing these expensive computations with machine learning models as scoring functions and leveraging a chemically diverse database to train the models offers an efficient alternative, enabling scalable and broader chemical space exploration.

Here, we develop a data-driven design platform in which both structure generation and structure–property estimation are accomplished via machine learning. We leverage the FORMED database to train both the generative and property prediction models (Fig. 1c). By combining a generative model capable of creating diverse molecules with a robust predictive model that estimates excited-state properties with minimal expense, our approach offers an efficient means for discovering unexpected molecular scaffolds fulfilling the energetic criteria of SF materials. Our approach specifically discovers a hitherto unknown cyclic ester core, neocoumarin not present in FORMED, that has been previously synthesized. This class of molecules follows the well-known diradical design principle elaborated by Michl and coworkers,¹⁷ while overcoming some of the limitations of the 2-benzofuran core. Importantly, the approach also rediscovered most other known or predicted classes of SF molecules, including acenes, polyenes, and benzofurans.

2 Methodology

The proposed generative design workflow involves both a generative and a property prediction model (blue boxes, Fig. 2). Both models utilize SMILES strings as the molecular representation. The pre-RL generative model is first trained to generate chemically valid molecules without specific property constraints, while the property prediction model evaluates the excited-state properties of these molecules. This pre-RL generative model is then coupled with the prediction property model through Reinforcement Learning, a training framework that iteratively updates the generative model's parameters to increase the likelihood of generating desirable molecules. We implement RL in two stages, following a curriculum of increasing complexity (red boxes, Fig. 2). The resulting optimized post-RL generative model (yellow box) is able to selectively generate molecules with properties suitable for SF. In what follows, we briefly describe all three components, while additional details are provided in the SI.


	Fig. 2 Workflow of the generative design pipeline, which incorporates three deep learning platforms: REINVENT v3.2 for training the generative model, Chemprop v1.5.2 for training the property prediction model, and REINVENT v4, which conditions the molecular generation process using a reinforcement learning curriculum approach, yielding the post-RL generative model that selectively creates molecules that meet the SF requirements. The reinforcement learning curriculum consists of two stages: the first stage focuses on structural constraints, while the second stage optimizes excited-state properties for singlet fission.

2.1 Prediction of excited-state properties

To predict the excited-state properties of interest, a GNN-based multi-target property prediction model was trained on the FORMED database, which consists of 116 [thin space (1/6-em)]

687 molecules, along with their excited-state properties, using Chemprop v1.5.2.⁵⁰ This model predicts four key electronic excited-state properties—singlet and triplet vertical excitation energies (E_S₁,ve, E_T₁,ve, and E_T₂,ve) as well as exciton size

—from SMILES as a molecular representation. The Chemprop model architecture consists of a 3-layer GNN with a hidden size of 300 and a dropout probability of 0.2. The dataset was randomly split into training, validation, and test sets with an 80/10/10 ratio. On the test set, the Chemprop model achieved mean absolute error (MAE) losses of 0.13 eV for E_S₁,ve, 0.12 eV for, 0.13 eV for E_T₂,ve, and 0.17 Å for the exciton size (Fig. 3).


	Fig. 3 Correlation plots of excited-state properties comparing the predicted values from the Chemprop model with the true values of molecules in the test set, obtained from a random split of the FORMED database: (a) E_S₁,ve, (b) E_T₁,ve, (c) E_T₂,ve, and (d) S1 exciton size .

In line with previous work,²⁴ we also evaluated the predictive performance of the Chemprop model on an external test set derived from the reFORMED database. The trained model demonstrated acceptable predictive accuracy across all excited-state energies, with MAE values of 0.22 eV for E_S₁,ve, 0.20 eV for E_T₁,ve, and 0.31 Å for the exciton size (Fig. S1). As such, we concluded that the multi-target Chemprop model is able to predict the excited-state properties of unseen molecules accurately enough for the RL process (vide infra).

2.2 Molecular generative models

Numerous generative deep learning models have been developed for creating molecules, including variational autoencoders,^51–53 generative adversarial networks,^53,54 flow-matching models,^55,56 and diffusion models.^57–63 Among these, REINVENT,^64,65 a recurrent neural network (RNN)-based framework designed to learn and generate SMILES strings, has emerged as one of the most effective tools for molecular design applications. REINVENT is especially appealing due to its user-friendly interface. Model training and fine-tuning can be easily configured via TOML or JSON files. In this work, we adopt REINVENT v3.2 (ref. 64) to learn canonicalized SMILES strings from the FORMED database.²² In this framework, a SMILES string is treated as a sequence of tokens, where each token is either a single character or a combination of characters. A token pool is created at the start of the training process, and the model is trained unconditionally to learn the joint probability P(T) of generating a SMILES sequence T of length

with tokens

. The joint probability is expressed as:


	(1)

The training involves minimizing the negative log-likelihood (NLL), which quantifies how well the model predicts the sequences in the training data. The NLL is defined as:


	(2)

Once trained, the model generates SMILES strings by sampling tokens sequentially from the learned probability distribution P(T). Starting with an initial token (^ in our case), the model predicts the probability distribution for the next token based on the conditional probabilities it has learned. The process is repeated iteratively, with each generated token influencing the prediction of the subsequent token, until sequence generation is complete. The sequence is terminated when a predefined stop token is added ($ in our case).

2.3 Reinforcement learning

The trained generative model (vide supra), is then optimized with RL using the REINVENT v4 implementation. In the RL process, molecules sampled from the agent model (that is, the generative model that undergoes optimization) are evaluated using a scoring function that quantifies their suitability for SF based on predicted excited-state properties. The construction of the scoring function is briefly outlined in the following and is covered in detail in the SI. Following the policy gradient approach,^66,67 these score values are used as optimization signals to adjust the generative process. Specifically, these scores are used to define the augmented likelihood (P_aug) for each SMILES sequence T as


logP_aug(T) = logP_prior(T) + σS(T),	(3)

where S(T) is the reward value associated with the SMILES sequence T, σ is the hyperparameter used to scale the reward value, and P_prior(T) is the likelihood of the SMILES sequence T in the prior model (the initial generative model in our case). Note that, in the case of invalid SMILES sequences, a reward value of zero is assigned, although their likelihoods are still taken into account.

To optimize the agent model, the loss function is defined as


	(4)

where and P_agent(T) is the likelihood of the SMILES sequence T in the agent model. The presence of P_prior(T) in P_aug(T) serves as a regularization mechanism, constraining the agent model to remain close to the learned chemical space and ensuring the generation of chemically valid SMILES sequences. The balance between prioritizing the reward and enforcing the regularization agent model's knowledge can be controlled with the hyperparameter σ.

During the optimization loop, the RL tends to overexploit specific SMILES sequence patterns, corresponding to high reward value, which leads to the generation of structures with similar scaffolds within an iteration loop. To promote structural diversity among the molecules generated from the agent model, we employed the diversity filter implemented in REINVENT v4.⁶⁵ This filter penalizes SMILES strings that are too similar to those already stored in a memory bucket, which keeps track of previously generated molecules. By discouraging the agent model from repeatedly generating structurally similar compounds, the filter ensures greater diversity. In addition, we used experience replay^68–70 to improve the convergence of the RL process by storing high-scoring molecules generated during previous iterations and periodically reintroducing them into the training process.

The energy score function taken from previous work,²⁴ where a higher score corresponds to a higher reward, was used to optimize the S₁/T₁ energy levels to satisfy three SF requirements:

(1) Thermodynamic constraint: E_S₁,ve − 2E_T₁,ve > −1 eV.

(2) Solar cell semiconductor compatibility: E_T₁,ve > 1.5 eV.

(3) Matching with the solar emission spectrum: E_S₁,ve < 3.8 eV.

A detailed mathematical definition of the energy score is given in the SI. In addition to optimizing the energy levels of S₁ and T₁, our goal was to maximize the exciton size (i.e., the root-mean square electron–hole separation) to promote delocalized singlet exciton formation. A larger exciton size is indicative of charge-transfer or delocalized excited-state character, which are both beneficial for the triplet-pair formation from the singlet state. We also consider the energy gap between the vertical second and first excited triplet states, which is to be maximized to reduce the likelihood of competing T₁ to T₂ upconversion processes.⁷¹ Furthermore, we bias the generative model against the generation of charged structures by penalizing the score of charged molecules.

Given the complexity of the score function, which involves multiple objectives to be optimized simultaneously, we adopted a two-stage curriculum for the RL optimization process (Fig. 2).⁷² In the first stage, we focused on structural constraints (more details are given in the SI). To avoid overfitting, this first stage of RL was limited to 20 iterations. In the second stage, we focus on optimizing the SF-related properties, namely the energy score, exciton size, and T₁/T₂ gap (E_T₂,ve − E_T₁,ve). Splitting the RL into two stages focused on different aspects significantly simplifies the learning. During each stage, the individual components of the score function were aggregated using a weighted geometric mean:


	(5)

where S_i(T) is the individual score component i for the SMILES sequence T, and w_i is the weight assigned to the i-th score component. Additional details regarding the implementation of the different objectives and the weighting strategy are given in Table S1.

At the end of the second stage of each trial, 1280 molecules were generated using the post-RL generative model, and their excited-state properties were predicted using the Chemprop model. Subsequently, the 10 best molecules, ranked according to their energy scores, were selected for further validation and optimized with DFT to have their properties evaluated using TD-DFT computations.

2.4 Computational details

The SMILES strings created with the generative models were converted into 3D geometries using the distance geometry approach implemented in RDKit,⁷³ followed by refinement of atomic positions with the MMFF94 force field.^74,75 These initial geometries were optimized at the GFN2-xTB^76,77 level, providing an initial 3D geometry for subsequent gas phase density functional theory (DFT) geometry optimization at the ωB97X-D⁷⁸/6-31G(d)^79–81 level. Using the same functional and basis set, TD-DFT computations with Tamm-Dancoff approximation⁸² were carried out to determine vertical excitation energies and to perform excited-state geometry optimizations to extract adiabatic excited-state energy values. The diradicaloid character was assessed by computing open-shell singlet wavefunctions at the UHF/6-31G(d) level on the DFT optimized geometries and extracting the diradical character (y₀) and the tetraradical character (y₁) diagnostics as introduced by Nakano and coworkers.^24,83–85 All electronic structure computations were performed using the Gaussian16 (ref. 86) (revision A.03) software package.

3 Results and discussion

3.1 Exploration of chemical space

To identify potential candidates for SF, we follow the workflow shown in Fig. 2 and optimize a generative model. In the RL optimization loop, we rely on the Chemprop property prediction model for fast and inexpensive scoring. Within 600 iterations, the overall score, the energy score, and

of the generated molecules reach convergence (Fig. 4a, b and S5). The E_T₂,ve − E_T₁,ve is also sufficiently high (∼1.2 eV) to suppress undesired triplet–triplet upconversion (Fig. 4c). For simplicity, we narrow the discussion to the energy score and the singlet and triplet excitation energies, which are the most critical factors for SF propensity. To evaluate the impact of reinforcement learning, we compare the average excited-state properties of molecules generated before and after the initial optimization trial. Specifically, we sample 1280 molecules from both the pre-RL and post-RL generative models (model 1) and analyze their chemical structures and predicted excited-state properties.


	Fig. 4 RL optimization curves during stage 2 of the curriculum for (a) energy score, (b) exciton size , and (c) E_T₂,ve − E_T₁,ve. Kernel density histogram plots of E_T₁,ve, E_T₁,ve, and predicted with Chemprop model for 1280 molecules generated from (d) the pre-RL generative model and (e) the post-RL generative model 1. The yellow region designates where S₁ → 2T₁ conversion is thermodynamically feasible and E_T₁,ve is aligned for potential integration into solar cell applications. (f) Property map between E_T₁,ve and E_T₁,ve of the 1280 generated structures from the post-RL generative model 1 (blue) overlaid on top of the FORMED database (gray). Note that energy values of the generated structures in the map are predicted with the Chemprop model.

The pre-RL generative model creates molecules that broadly span the chemical space of the FORMED database (Fig. S8). The resulting molecules are chemically diverse, exhibiting an average Tanimoto similarity score of 0.10 and yielding ∼367 unique scaffolds⁸⁷ from a sample of 1280 generated molecules (Table S3). Unique scaffolds are defined as a Murcko scaffold with a Tanimoto similarity below 0.70 relative to other Murcko scaffolds in the dataset. In terms of their excited-state properties, the distributions of vertical excitation energies for the singlet (E_T₁,ve) and triplet (E_T₁,ve) states are largely overlapping (Fig. 4d), suggesting that most generated molecules do not meet the thermodynamic requirements for SF. Of the 1280 molecules generated by the pre-RL model, only two are predicted to satisfy the energetic criteria, a low hit rate at this stage. In contrast, molecules generated by the post-RL generative model 1 are confined to a narrow region of chemical space (Fig. 5) and are less diverse, with an average Tanimoto similarity score of 0.35. These molecules mainly share the same Murcko scaffolds (1–5 unique scaffolds among 1280 generated molecules, Table S3). However, the distribution of their excited-state energy levels is well separated (Fig. 4e), with the E_T₁,ve centered at approximately half E_S₁,ve. Consequently, more than 750 structures of the 1280 molecules created by the post-RL generative model 1 are within the energetic target region for the desired SF property (Fig. 4f).


	Fig. 5 t-SNE plot generated from the Morgan fingerprint representation of the generated structures from the pre-RL and post-RL generative models.

Despite improving the excited-state properties, the post-RL generative model 1 predominantly suggests polyenes (e.g., 1 in Fig. 5 and 6) and similar molecules with extended π-conjugated systems that are known SF chromophores. This apparent preference for polyenes is sound considering the simplicity of their SMILES pattern, which consists of frequently occurring tokens in the FORMED database (e.g., C and =). Furthermore, these molecules are abundant in FORMED (used to train both the generative and property prediction models).


	Fig. 6 (a) Candidate molecules generated by the post-RL generative models from different optimization trial runs. E_T₁,ve and E_T₁,ve are computed using TD-DFT. (b) Property map between E_T₁,ve and E_T₁,ve of structures in the FORMED database colored by frontier molecular orbital (FMO) gap. The yellow region designates where S₁ → 2T₁ conversion is thermodynamically feasible. The corresponding adiabatic excitation energies and chemprop-predicted vertical excitation energies of these structures can be found in Fig. S12.

To steer the generative exploration toward other chemical space regions, we introduce custom alerts,³⁹ SMARTS-based filters for unwanted substructures into the scoring function at both stages of the RL curriculum. If a generated molecule contains any substructure from the predefined list, its score is set to zero, effectively penalizing the generation of such molecules. Molecules containing ring systems and extensive π-conjugated frameworks (entries 1–15 in the complete list of unwanted substructures provided in the SI) were initially penalized.

With these structural constraints, the second and third optimization trials guide the generative models toward more compact and tunable molecules featuring rigid π-conjugated cores. However, each post-RL generative model continues to create molecules within the same structural class. Specifically, the second and third post-RL models predominantly generate substituted acenes^88–93 (e.g., 2) and derivatives of boron-dipyrromethane (BODIPY)^94–97 (e.g., 3), respectively. Notably, despite these structural constraints, the diversity of molecules generated in these trials remains comparable to that of the first RL trial in terms of the number of unique scaffolds and similarity scores (Table S3), where no custom alert filter was used. Similar to polyenes, these molecule classes have been investigated for SF and are prominently represented in FORMED, with BODIPY and anthracene appearing 621 and 928 times, respectively.

We thus include BODIPY and acene in the unwanted structure list (entries 16–17 in the complete list of unwanted substructures provided in the SI) for further exploration. Incorporating the full list of unwanted substructures in the structural constraints leads to a more challenging optimization, requiring a larger number of iterations to improve the excited-state properties of the generated molecules (Fig. S4 and S5). Seven additional optimization trials were carried out with the full list of unwanted structures. Each trial directs the generative model toward a distinct family of SF chromophores, corresponding to a unique region of chemical space, as illustrated by the dimensionality reduction plot in Fig. 5. The RL optimization curves and kernel density histogram plots of the excited-state properties of all trials are presented in Fig. S3–S5, respectively. A summary of the performance of all generative models is provided in Tables S3 and S4.

At the end of each of the 10 optimizations, we collect the top 10 candidates from 1280 molecules created by the post-RL generative model according to their predicted energy scores. These top performers are then optimized with DFT, and their vertical energies computed with TD-DFT. Among the 100 evaluated molecules, 73 meet the SF criterion, demonstrating that Chemprop reliably guides the generative model toward high-reward regions of chemical space via RL. From this pool of computationally validated SF molecules, we (re)discover molecular classes with energy splittings that satisfy the SF thermodynamic criteria. One to three representative molecules from each post-RL generative model are presented in Fig. 6. Their adiabatic excited-state energies, along with the Chemprop-predicted vertical excitation energies, are provided in Fig. S10 and S11, respectively. The computed E_S₁,ve and E_T₁,ve of our candidate molecules fall within the thermodynamically favorable region for S₁ → 2T₁ conversion. Furthermore, these candidates' E_T₁,ve value is adequate for their potential integration into solar cell applications (Fig. 6) and the T₁–T₂ energy gaps are large enough to suppress unwanted triplet upconversion. As stated above, the three initial trials coincide with the rediscovery of polyenes (1), acenes (2), and BODIPY (3). In the following seven runs, which exclude the full list of unwanted substructures, the generative model uncovered derivatives such as substituted 2-benzofurans (4), which were initially screened and identified by Michl and coworkers¹⁷ as potential SF chromophores. Similarly to their findings, our candidates feature substitutions at the C1 and C3 positions,^98–100 lowering the triplet energies to better align with the energy requirements for solar cell applications than the unsubstituted counterparts (Fig. S14).

We also encountered a variety of fulvenoid and quinoidal compounds (5–8), recently identified as promising SF scaffolds.^12,101 The fulvenoid derivatives feature diverse heterocyclic rings such as furan, thiophene, imidazole, and hydantoin, linked to aromatic rings via an exocyclic bridge containing one or more methine moieties, often decorated with a cyano group. Similar to the substituted fulvenes previously identified,¹² these fulvenoid structures maintained a favorable E_S₁,ve [thin space (1/6-em)] :E_T₁,ve ratio. While these structures merit further investigation, they will not be the focus of the remainder of this work.

In line with a previous work by some of us, the post-RL generative model 9 yielded molecules containing mesoionic N-oxide motifs (9).¹⁰² Mesoionic heterocycles have been identified as good acceptor units for charge transfer in donor–acceptor systems.^14,21,24 Interestingly, structures embedding the N-oxide in an anthracene core exhibit lower E_T₁,ve and similar E_S₁,ve compared to the substituted acenes, which increases the splitting (E_T₁,ve = 1.4 eV, and E_T₁,ve = 3.4 eV).

Overall, the diversity of the (re)discovered singlet fission chromophore candidates demonstrates the capability of the optimization pipeline, powered by a tailored scoring function and the FORMED database, to identify SF molecules across different structural classes. Furthermore, the successive inclusion of unwanted substructures as a penalty term in the RL step underscores the flexibility of the workflow.

3.2 A coumarin isomer as a promising scaffold

Along with the above finding, a series of coumarin derivatives sharing the same 2-benzopyran-3-one core (10–12) and emerging from the generative model 10 caught our attention, as, to the best of our knowledge, such systems have not yet been explored for SF applications. We term this scaffold neocoumarin to distinguish it from the well-known coumarin (1-benzopyran-2-one) and from its primary constitutional isomer, isocoumarin (2-benzopyran-1-one). Akin to coumarin and isocoumarin, neocoumarin derivatives feature a bicyclic system consisting of a benzene ring fused to a 2-pyrone ring.

The derivatives selected by the model include amine and aryl substituents on the pyrone ring. Substituted neocoumarin structures (10 and 11) generated by our optimization trials exhibit proper energetics for SF, with E_T₁,ve ≈ 1.4 eV and E_S₁,ve ≈ 3.0 eV. The distribution of the excited-state character, as visualized through the hole and electron densities derived from the natural transition orbitals (Fig. S15), shows significant overlap between the singlet and triplet states, both being localized near the aromatic subsystem of the fused ring core.

To identify the molecular core responsible for the SF-relevant properties and extract concrete design principles, we performed adiabatic TD-DFT computations on the bare neocoumarin structure (13, Fig. 7), which confirmed good S₁/T₁ splitting (E_T₁,ad = 1.49 eV and E_S₁,ad = 3.34 eV). We also noted that the synthesis of neocoumarin and derivatives thereof has been accomplished in a number of previous works.^75,103–105 Interestingly, while coumarin (1-benzopyran-2-one) and isocoumarin (2-benzopyran-1-one) motifs appear 889 and 121 times, respectively, in the FORMED database, neocoumarin (2-benzopyran-3-one) is not present. Neither coumarin (E_T₁,ad = 2.75 eV, E_S₁,ad = 4.43 eV) nor isocoumarin (E_T₁,ad = 2.46 eV, E_S₁,ad = 4.43 eV) exhibit energies which are conducive to SF (green crosses in Fig. 8; see also Fig. S14). This implies that the optimization strategy succeeded at finding an out-of-sample, synthesizable chemical motif with good SF properties in spite of its absence from the training data and the inadequacy of its closest constitutional isomers.


	Fig. 7 Derivative structures of neocoumarin from (a) adding or removing fused rings, (b) modifying the capping functional group, (c) switching the benzene ring to other aromatic fused rings, and (d) altering the aromaticity of the conjugated system. Structural changes that preserve excited-state energies that satisfy SF criteria are indicated by the yellow background.


	Fig. 8 Property map between E_T₁,ad and E_S₁,ad of the neocoumarin-related structures. The yellow region designates where S₁ → 2T₁ conversion is thermodynamically feasible and E_T₁,ve is aligned for potential integration into solar cell applications.

The synthetic viability of neocoumarin opens the door to a whole class of compounds with shared structural and electronic characteristics. In order to exhaustively identify beneficial modifications and rule out modifications that disturb the SF energetics, we systematically explored the chemical space around the neocoumarin core by manually constructing diverse derivatives and computing their excited-state properties using adiabatic TD-DFT (Fig. 7). Vertical T₁ and T₂ excited state energy levels of these molecules are reported in Fig. S13.

We first varied the number of fused rings in the system. Removing the fused benzene ring increases E_S₁,ad to 4.38 eV (14), while extending the number of fused rings reduces E_T₁,ad to 0.86 eV for two fused benzene rings (15) and 0.47 eV for three fused benzene rings (16), leading to poor SF thermodynamics in both cases, thus establishing that an energetic sweet spot is achieved with two rings. The role of the lactone moiety was then explored by swapping the ester functional group in the lactone ring with an amide (17, forming a lactam), which lead to an increase of E_T₁,ad by around 0.2 eV, and thus a poorer splitting. Replacing the lactam with a thioester instead (19) lowers E_T₁,ad by 0.1 eV while E_S₁,ad decreases by about 0.2 eV, which keeps the splitting ratio approximately constant. These results corroborate that the ortho-quinoidal double bonds are key to the photophysical properties of the system, whereas the nature of the lactone-type endcapping functional group and substituents on the pyranone unit are synthetic handles of interest for potential kinetic stability or synthesizability reasons.

The choice of ring system was also explored by manually constructing and testing different 5- and 6-membered aromatic heterocycles as a replacement for benzene to understand the limitations of this strategy. With 5-membered heterocyclic rings such as pyrrole (21, and 22), thiophene (23), the E_S₁,ad [thin space (1/6-em)] :E_T₁,ad ratio remains close to ideal while E_T₁,ad increases to around 2.0 eV. However, replacing benzene with pyridine (24) has a negligible effect on SF energetics.

Since replacing benzene with other aromatic rings proved to be a viable strategy to fine-tune the neocoumarin core, we tested the role of aromaticity by using non-aromatic or anti-aromatic fused rings instead. Replacing benzene by cyclobutene or cyclooctatriene (25–26) leads to a dramatic lowering in E_T₁,ad, while breaking the delocalized π system through saturation (27–28) leads to a consistent increase in E_T₁,ad, in both cases hampering SF energetics.

We note the similarity between the neocoumarin core and the previously identified 2-benzofuran derivatives pioneered by Michl and coworkers (Fig. 9a).^17,106–109 In both cases, the presence of o-xylylene motif characterized by two ortho quinoid endocyclic double bonds appears to be conducive to good SF energies. As expected from this analysis, removing this motif by altering aromaticity (as in 27 and 28) or by changing the position of the ester functional group in the 2-pyrone ring (Fig. 8), disturbs the energetics of the system.


	Fig. 9 (a) Spin-density plots and comparison of coumarin, benzofuran, and neocoumarin. (b) Components of the neocoumarin scaffold. The ortho quinoid motif responsible for SF-relevant properties is highlighted in green.

In the case of 2-benzofuran, the presence of the o-xylylene motif (dark green in Fig. 9b), has been linked with diradicaloid character, an electronic property associated with SF propensity.^{100,103,104,110,111} To assess whether the same applies to the identified class of molecules, we evaluated the di- and tetraradical character (y₀ and y₁, respectively) of neocoumarin and selected derivatives via natural orbital analysis of their unrestricted Hartree–Fock (UHF) wavefunctions (see SI for additional analysis).^110,112,113 The results are compiled in Fig. 9. We find that the diradical character in neocoumarin (y₀ = 0.16) is slightly higher than in benzofuran (y₀ = 0.09, hence the lower T₁ energy of the former, cf.Fig. 8) while both retain very small tetraradical character (y₁ = 0.01). By comparison, coumarin has little of both diradical and tetraradical character (y₀ = 0.02, y₁ = 0.00). These findings, along with the features of the spin density plots (Fig. 9) highlight the critical role of the ortho quinoid arrangement and the necessity of preserving aromaticity in maintaining singlet fission (SF)-relevant electronic properties, offering valuable insights for the design of next-generation SF molecules.

4 Conclusions

In this work, we demonstrated the relevance of a data-driven generative framework for the discovery of potential SF materials, combining structure generation and property prediction models. Building upon the FORMED database and leveraging curriculum-based reinforcement learning, this approach successfully rediscovered a broader range of SF chromophores than our previous fragment-based design methods, including polyenes, acenes, boron-dipyrromethane (BODIPY), benzofurans, fulvenoids, quinoidal structures, and mesoionic compounds. More significantly, the generative framework identified a molecular class, neocoumarins (2-benzopyran-3-one), which is uncharted for optoelectronic applications but exhibits favorable excited-state energetics for SF. While the coumarin and isocoumarin systems, well-represented in the FORMED database, exhibited poor SF properties, the generative model, guided by our RL optimization strategy, uncovered this third coumarin isomer, which is absent from the FORMED database but is found to possess promising SF energetics. This ortho-xylylene core capped by a cyclic ester group follows Michl's diradical design principles while addressing some of the limitations of the benzofuran core and offering novel opportunities for SF material development.

Our results thus highlight the potential of generative design not only to rediscover known SF candidates but also to explore uncharted regions of chemical space, enabling the identification of out-of-the-box chromophores with tailored properties. This offers a promising pathway for advancing the discovery of functional materials. Although we successfully identified a variety of target molecules, our current approach relies on manually guiding the generation process by excluding previously discovered scaffolds. This process could be streamlined by automatically detecting key scaffolds identified at the end of each trial and dynamically updating the list of excluded motifs for subsequent trials. Furthermore, direct comparisons with other approaches (e.g., genetic optimization) across a broader range of practical chemical applications are still awaited.

Author contributions

T. W. proposed the project. T. W. developed the generative design pipeline and performed all experiments. T. W., R. L., and J. T. B. analyzed the data and wrote the initial draft of the manuscript. C. C. guided the research direction and supervised the project, provided funding, edited and modified the content of the manuscript.

Conflicts of interest

There are no conflicts to declare.

Data availability

The codes for training the generative models and performing the reinforcement learning, which include all the scripts, the configuration files, the pre-RL and post-RL generative models, and the Chemprop excited-state property prediction model, are available at GitHub: https://github.com/lcmd-epfl/SF_generative. The FORMED database, used to train the models, is publicly available in the Materials Cloud Archive at https://doi.org/10.24435/materialscloud:nh-gb.

The SI contains the Chemprop model performance metrics, the definitions of the scoring functions used for the reinforcement learning, the implementation details related to the training of the generative model and to the curriculum-based optimization, the complete results for all optimization trials, the additional reinforcement learning runs targeting unexplored scaffolds, the computed excited-state properties and the Chemprop prediction errors for the candidate molecules as well as further excited-state analyses of coumarin, isocoumarin, and neocoumarin derivatives. Supplementary information is available. See DOI: https://doi.org/10.1039/d5sc03184b.

Acknowledgements

The authors thank EPFL for computational resources. T. W acknowledges the Swiss National Science Foundation (grant number 204178) for financial support. This publication was created as part of NCCR Catalysis (grant number 180544), a National Centre of Competence in Research funded by the Swiss National Science Foundation.

Notes and references

M. B. Smith and J. Michl, Chem. Rev., 2010, 110, 6891–6936 CrossRef.
B. J. Walker, A. J. Musser, D. Beljonne and R. H. Friend, Nat. Chem., 2013, 5, 1019–1024 CrossRef PubMed.
S. Singh and B. Stoicheff, J. Chem. Phys., 1963, 38, 2032–2033 CrossRef.
S. Singh, W. Jones, W. Siebrand, B. Stoicheff and W. Schneider, J. Chem. Phys., 1965, 42, 330–342 CrossRef.
W. Shockley and H. J. Queisser, J. Appl. Phys., 1961, 32, 510–519 CrossRef.
A. De Vos and H. Pauwels, Appl. Phys., 1981, 25, 119–125 Search PubMed.
J. Xia, S. N. Sanders, W. Cheng, J. Z. Low, J. Liu, L. M. Campos and T. Sun, Adv. Mater., 2017, 29, 1601652 CrossRef.
A. B. Pun, S. N. Sanders, E. Kumarasamy, M. Y. Sfeir, D. N. Congreve and L. M. Campos, Adv. Mater., 2017, 29, 1701416 CrossRef.
R. Merrifield, P. Avakian and R. Groff, Chem. Phys. Lett., 1969, 3, 155–157 CrossRef.
M. B. Smith and J. Michl, Chem. Rev., 2010, 110, 6891–6936 CrossRef PubMed.
A. J. Baldacchino, M. I. Collins, M. P. Nielsen, T. W. Schmidt, D. R. McCamey and M. J. Y. Tayebjee, Chem. Phys. Rev., 2022, 3, 021304 CrossRef.
O. El Bakouri, J. R. Smith and H. Ottosson, J. Am. Chem. Soc., 2020, 142, 5602–5617 CrossRef PubMed.
Ö. H. Omar, D. Padula and A. Troisi, ChemPhotoChem, 2020, 4, 5223–5229 CrossRef.
D. Padula, Ö. H. Omar, T. Nematiaram and A. Troisi, Energy Environ. Sci., 2019, 12, 2412–2416 RSC.
W.-L. Chan, M. Ligges, A. Jailaubekov, L. Kaake, L. Miaja-Avila and X.-Y. Zhu, Science, 2011, 334, 1541–1545 CrossRef PubMed.
R. Casillas, I. Papadopoulos, T. Ullrich, D. Thiel, A. Kunzmann and D. M. Guldi, Energy Environ. Sci., 2020, 13, 2741–2804 RSC.
I. Paci, J. C. Johnson, X. Chen, G. Rana, D. Popović, D. E. David, A. J. Nozik, M. A. Ratner and J. Michl, J. Am. Chem. Soc., 2006, 128, 16546–16553 CrossRef PubMed.
J. T. Blaskovits, M. Fumanal, S. Vela and C. Corminboeuf, Chem. Mater., 2020, 32, 6515–6524 CrossRef.
S. Kawata, Y.-J. Pu, A. Saito, Y. Kurashige, T. Beppu, H. Katagiri, M. Hada and J. Kido, Adv. Mater., 2015, 28, 1585–1590 CrossRef PubMed.
T. Nagami, H. Miyamoto, W. Yoshida, K. Okada, T. Tonami and M. Nakano, J. Phys. Chem. A, 2020, 124, 6778–6789 CrossRef PubMed.
J. T. Blaskovits, M. Fumanal, S. Vela, Y. Cho and C. Corminboeuf, Chem. Commun., 2022, 58, 1338–1341 RSC.
J. T. Blaskovits, R. Laplaza, S. Vela and C. Corminboeuf, Adv. Mater., 2024, 36, 2305602 CrossRef PubMed.
J. T. Blaskovits, M. Fumanal, S. Vela, R. Fabregat and C. Corminboeuf, Chem. Mater., 2021, 33, 2567–2575 CrossRef.
L. Schaufelberger, J. T. Blaskovits, R. Laplaza, K. Jorner and C. Corminboeuf, Angew. Chem., Int. Ed., 2025, 64, e202415056 CrossRef PubMed.
R. Laplaza, S. Gallarati and C. Corminboeuf, Chem.:Methods, 2022, 2, e202100107 Search PubMed.
S. Gallarati, P. Van Gerwen, A. A. Schoepfer, R. Laplaza and C. Corminboeuf, Chimia, 2023, 77, 39–47 CrossRef PubMed.
B. L. Greenstein, D. C. Elsey and G. R. Hutchison, J. Chem. Phys., 2023, 159, 091501 CrossRef.
O. D. Abarbanel and G. R. Hutchison, Phys. Chem. Chem. Phys., 2023, 25, 11278–11285 RSC.
D. Douguet, E. Thoreau and G. Grassy, J. Comput.-Aided Mol. Des., 2000, 14, 449–466 CrossRef.
D. M. Anstine and O. Isayev, J. Am. Chem. Soc., 2023, 145, 8736–8750 CrossRef.
X. Tang, H. Dai, E. Knight, F. Wu, Y. Li, T. Li and M. Gerstein, Briefings Bioinf., 2024, 25, bbae338 CrossRef.
C. Bilodeau, W. Jin, T. Jaakkola, R. Barzilay and K. F. Jensen, Wiley Interdiscip. Rev.:Comput. Mol. Sci., 2022, 12, e1608 Search PubMed.
Y. Du, A. R. Jamasb, J. Guo, T. Fu, C. Harris, Y. Wang, C. Duan, P. Liò, P. Schwaller and T. L. Blundell, Nat. Mach. Intell., 2024, 6, 589–604 CrossRef.
J. Horwood and E. Noutahi, ACS Omega, 2020, 5, 32984–32994 CrossRef CAS PubMed.
M. Popova, O. Isayev and A. Tropsha, Sci. Adv., 2018, 4, eaap7885 CrossRef CAS PubMed.
S. Hochreiter and J. Schmidhuber, Neural Comput., 1997, 9, 1735–1780 CrossRef CAS PubMed.
L. Kreimendahl, M. Karnaukh and M. I. S. Röhr, J. Phys. Chem. A, 2025, 129, 407–414 CrossRef CAS PubMed.
T. Weiss, E. Mayo Yanes, S. Chakraborty, L. Cosmo, A. M. Bronstein and R. Gershoni-Poranne, Nat. Comput. Sci., 2023, 3, 873–882 CrossRef PubMed.
M. Olivecrona, T. Blaschke, O. Engkvist and H. Chen, J. Cheminf., 2017, 9, 48 Search PubMed.
Z. Zhou, S. Kearnes, L. Li, R. N. Zare and P. Riley, Sci. Rep., 2019, 9, 10752 CrossRef PubMed.
J. He, A. Tibo, J. P. Janet, E. Nittinger, C. Tyrchan, W. Czechtizky and O. Engkvist, J. Cheminf., 2024, 16, 95 CAS.
H. H. Loeffler, S. Wan, M. Klähn, A. P. Bhati and P. V. Coveney, J. Chem. Theory Comput., 2024, 20, 8308–8328 CAS.
N. Ståhl, G. Falkman, A. Karlsson, G. Mathiason and J. Boström, J. Chem. Inf. Model., 2019, 59, 3166–3176 CrossRef.
S. R. Atance, J. V. Diez, O. Engkvist, S. Olsson and R. Mercado, J. Chem. Inf. Model., 2022, 62, 4863–4872 CrossRef CAS.
R. Gómez-Bombarelli, J. N. Wei, D. Duvenaud, J. M. Hernández-Lobato, B. Sánchez-Lengeling, D. Sheberla, J. Aguilera-Iparraguirre, T. D. Hirzel, R. P. Adams and A. Aspuru-Guzik, ACS Cent. Sci., 2018, 4, 268–276 CrossRef.
J. Ho and T. Salimans, arXiv, 2022, preprint, arXiv:2207.12598, DOI:10.48550/arXiv:2207.12598.
C. Zeni, R. Pinsler, D. Zügner, A. Fowler, M. Horton, X. Fu, Z. Wang, A. Shysheya, J. Crabbé, S. Ueda, R. Sordillo, L. Sun, J. Smith, B. Nguyen, H. Schulz, S. Lewis, C.-W. Huang, Z. Lu, Y. Zhou, H. Yang, H. Hao, J. Li, C. Yang, W. Li, R. Tomioka and T. Xie, Nature, 2025, 639, 624–632 CrossRef CAS.
C.-H. Li and D. P. Tabor, Chem. Sci., 2023, 14, 11045–11055 RSC.
A. Gaulton, L. J. Bellis, A. P. Bento, J. Chambers, M. Davies, A. Hersey, Y. Light, S. McGlinchey, D. Michalovich, B. Al-Lazikani and J. P. Overington, Nucleic Acids Res., 2012, 40, D1100–D1107 CrossRef CAS PubMed.
E. Heid, K. P. Greenman, Y. Chung, S.-C. Li, D. E. Graff, F. H. Vermeire, H. Wu, W. H. Green and C. J. McGill, J. Chem. Inf. Model., 2023, 64, 9–17 CrossRef PubMed.
J. Lim, S. Ryu, J. W. Kim and W. Y. Kim, J. Cheminf., 2018, 10, 31 Search PubMed.
W. Jin, R. Barzilay and T. Jaakkola, International conference on machine learning, 2018, pp. 2323–2332 Search PubMed.
D. Merk, L. Friedrich, F. Grisoni and G. Schneider, Mol. Inf., 2018, 37, 1700153 CrossRef.
N. De Cao and T. Kipf, arXiv, 2018, preprint, arXiv:1805.11973, DOI:10.48550/arXiv:1805.11973.
C. Shi, M. Xu, Z. Zhu, W. Zhang, M. Zhang and J. Tang, arXiv, 2020, preprint, arXiv:2001.09382, DOI:10.48550/arXiv:2001.09382.
R. Irwin, A. Tibo, J. P. Janet and S. Olsson, ICML 2024 AI for Science Workshop, 2024 Search PubMed.
A. Alakhdar, B. Poczos and N. Washburn, J. Chem. Inf. Model., 2024, 64, 7238–7256 CrossRef CAS PubMed.
X. Peng, J. Guan, Q. Liu and J. Ma, arXiv, 2023, preprint, arXiv:2305.07508, DOI:10.48550/arXiv:2305.07508.
C. Vignac, N. Osman, L. Toni and P. Frossard, Joint European Conference on Machine Learning and Knowledge Discovery in Databases, 2023, pp. 560–576 Search PubMed.
C. K. Joshi, X. Fu, Y.-L. Liao, V. Gharakhanyan, B. K. Miller, A. Sriram and Z. W. Ulissi, arXiv, 2025, preprint, arXiv:2503.03965, DOI:10.48550/arXiv:2503.03965.
T. Le, J. Cremer, F. Noé, D.-A. Clevert and K. Schütt, arXiv, 2023, preprint, arXiv:2309.17296, DOI:10.48550/arXiv:2309.17296.
L. Huang, H. Zhang, T. Xu and K.-C. Wong, arXiv, 2022, preprint, arXiv:2209.05710, DOI:10.48550/arXiv:2209.05710.
E. Hoogeboom, V. G. Satorras, C. Vignac and M. Welling, International conference on machine learning, 2022, pp. 8867–8887 Search PubMed.
REINVENT 3.2, https://github.com/MolecularAI/Reinvent.
H. H. Loeffler, J. He, A. Tibo, J. P. Janet, A. Voronov, L. H. Mervin and O. Engkvist, J. Cheminf., 2024, 16, 20 Search PubMed.
R. S. Sutton, D. McAllester, S. Singh and Y. Mansour, Proceedings of the 13th International Conference on Neural Information Processing Systems, Cambridge, MA, USA, 1999, p. 1057–1063 Search PubMed.
V. Fialková, J. Zhao, K. Papadopoulos, O. Engkvist, E. J. Bjerrum, T. Kogej and A. Patronov, J. Chem. Inf. Model., 2021, 62, 2046–2063 CrossRef PubMed.
J. Guo and P. Schwaller, JACS Au, 2024, 4, 2160–2172 CrossRef CAS.
M. Korshunova, N. Huang, S. Capuzzi, D. S. Radchenko, O. Savych, Y. S. Moroz, C. I. Wells, T. M. Willson, A. Tropsha and O. Isayev, Commun. Chem., 2022, 5, 1–11 CrossRef PubMed.
T. Blaschke, O. Engkvist, J. Bajorath and H. Chen, J. Cheminf., 2020, 12, 68 CAS.
A. J. Carrod, V. Gray and K. Börjesson, Energy Environ. Sci., 2022, 15, 4982–5016 RSC.
J. Guo, V. Fialková, J. D. Arango, C. Margreitter, J. P. Janet, K. Papadopoulos, O. Engkvist and A. Patronov, Nat. Mach. Intell., 2022, 4, 555–563 CrossRef.
RDKit: Open-source cheminformatics, http://www.rdkit.org.
P. Tosco, N. Stiefl and G. Landrum, J. Cheminf., 2014, 6, 1–4 Search PubMed.
T. A. Halgren, J. Comput. Chem., 1996, 17, 490–519 CrossRef CAS.
C. Bannwarth, S. Ehlert and S. Grimme, J. Chem. Theory Comput., 2019, 15, 1652–1671 CrossRef CAS PubMed.
C. Bannwarth, E. Caldeweyher, S. Ehlert, A. Hansen, P. Pracht, J. Seibert, S. Spicher and S. Grimme, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2020, 11, e01493 Search PubMed.
J.-D. Chai and M. Head-Gordon, Phys. Chem. Chem. Phys., 2008, 10, 6615–6620 RSC.
P. C. Hariharan and J. A. Pople, Theor. Chim. Acta, 1973, 28, 213–222 CrossRef CAS.
A. Petersson, A. Bennett, T. G. Tensfeldt, M. A. Al-Laham, W. A. Shirley and J. Mantzaris, J. Chem. Phys., 1988, 89, 2193–2218 CrossRef.
G. Petersson and M. A. Al-Laham, J. Chem. Phys., 1991, 94, 6081–6090 CrossRef CAS.
S. Hirata and M. Head-Gordon, Chem. Phys. Lett., 1999, 314, 291–299 CrossRef CAS.
T. Minami and M. Nakano, J. Phys. Chem. Lett., 2012, 3, 145–150 CrossRef CAS.
T. Minami, S. Ito and M. Nakano, J. Phys. Chem. Lett., 2013, 4, 2133–2137 CrossRef CAS.
K. Yamaguchi, Chem. Phys. Lett., 1975, 33, 330–335 CrossRef CAS.
M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, M. A. Robb, J. R. Cheeseman, G. Scalmani, V. Barone, G. A. Petersson, H. Nakatsuji, X. Li, M. Caricato, A. V. Marenich, J. Bloino, B. G. Janesko, R. Gomperts, B. Mennucci, H. P. Hratchian, J. V. Ortiz, A. F. Izmaylov, J. L. Sonnenberg, D. Williams-Young, F. Ding, F. Lipparini, F. Egidi, J. Goings, B. Peng, A. Petrone, T. Henderson, D. Ranasinghe, V. G. Zakrzewski, J. Gao, N. Rega, G. Zheng, W. Liang, M. Hada, M. Ehara, K. Toyota, R. Fukuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, H. Nakai, T. Vreven, K. Throssell, J. A. Montgomery Jr, J. E. Peralta, F. Ogliaro, M. J. Bearpark, J. J. Heyd, E. N. Brothers, K. N. Kudin, V. N. Staroverov, T. A. Keith, R. Kobayashi, J. Normand, K. Raghavachari, A. P. Rendell, J. C. Burant, S. S. Iyengar, J. Tomasi, M. Cossi, J. M. Millam, M. Klene, C. Adamo, R. Cammi, J. W. Ochterski, R. L. Martin, K. Morokuma, O. Farkas, J. B. Foresman and D. J. Fox, Gaussian 16 Revision C.01, Gaussian Inc. Wallingford CT, 2016 Search PubMed.
G. W. Bemis and M. A. Murcko, J. Med. Chem., 1996, 39, 2887–2893 CrossRef CAS PubMed.
K. Bhattacharyya and A. Datta, J. Phys. Chem. C, 2017, 121, 1412–1420 CrossRef CAS.
Y. J. Bae, G. Kang, C. D. Malliakas, J. N. Nelson, J. Zhou, R. M. Young, Y.-L. Wu, R. P. Van Duyne, G. C. Schatz and M. R. Wasielewski, J. Am. Chem. Soc., 2018, 140, 15140–15144 CrossRef CAS PubMed.
J. Zhou, X. Wang, W. Wang, H. Liu, T. Wang, X. Song, Z. Li, Z. Liu, Y. Chen and X. Li, Mater. Chem. Front., 2022, 6, 3016–3022 RSC.
D. M. de Clercq, M. I. Collins, N. P. Sloane, J. Feng, D. R. McCamey, M. J. Tayebjee, M. P. Nielsen and T. W. Schmidt, Chem. Sci., 2024, 15, 6402–6409 RSC.
J. E. Anthony, Angew. Chem., Int. Ed., 2008, 47, 452–483 CrossRef CAS PubMed.
Z. Li, F. J. Hernández, C. Salguero, S. A. Lopez, R. Crespo-Otero and J. Li, Nat. Commun., 2025, 16, 1194 CrossRef CAS PubMed.
Y. Zhou, W. Ni, L. Ma, L. Sun, J. Zhao and G. G. Gurzadyan, J. Phys. Chem. C, 2022, 126, 17212–17222 CrossRef CAS.
T. Tsuneda and T. Taketsugu, Sci. Rep., 2022, 12, 19714 CrossRef CAS PubMed.
A. Stafford, S. R. Allen, T. A. Grusenmeyer, C. J. O'Dea, L. Estergreen, S. T. Roberts and Z. A. Page, J. Mater. Chem. A, 2023, 11, 22259–22266 RSC.
T. Tsuneda and T. Taketsugu, Sci. Rep., 2024, 14, 829 CrossRef CAS PubMed.
J. C. Johnson, A. J. Nozik and J. Michl, J. Am. Chem. Soc., 2010, 132, 16302–16303 CrossRef CAS PubMed.
P. I. Dron, J. Michl and J. C. Johnson, J. Phys. Chem. A, 2017, 121, 8596–8603 CrossRef CAS PubMed.
J. C. Johnson and J. Michl, Top. Curr. Chem., 2017, 375, 80 CrossRef PubMed.
A. V. Girija, W. Zeng, W. K. Myers, R. C. Kilbride, D. T. Toolan, C. Zhong, F. Plasser, A. Rao and H. Bronstein, J. Am. Chem. Soc., 2024, 146, 18253–18261 CrossRef CAS PubMed.
E. Pradhan and T. Zeng, J. Phys. Chem. Lett., 2022, 13, 11076–11085 CrossRef CAS PubMed.
D. W. Jones and G. Kneen, J. Chem. Soc., Perkin Trans. 1, 1976, 1, 1647–1654 RSC.
D. W. Jones, J. Chem. Soc., Perkin Trans. 1, 1977, 1, 980–987 RSC.
D. A. Bleasdale and D. W. Jones, J. Chem. Soc., Chem. Commun., 1985, 1027–1028 RSC.
J. N. Schrauben, J. L. Ryerson, J. Michl and J. C. Johnson, J. Am. Chem. Soc., 2014, 136, 7363–7373 CrossRef CAS PubMed.
E. A. Buchanan and J. Michl, Photochem. Photobiol. Sci., 2019, 18, 2112–2124 CrossRef CAS.
J. C. Johnson and J. Michl, Top. Curr. Chem., 2017, 375, 80 CrossRef PubMed.
A. F. Schwerin, J. C. Johnson, M. B. Smith, P. Sreearunothai, D. Popović, J. Černý, Z. Havlas, I. Paci, A. Akdag, M. K. MacLeod, X. Chen, D. E. David, M. A. Ratner, J. R. Miller, A. J. Nozik and J. Michl, J. Phys. Chem. A, 2010, 114, 1457–1473 CrossRef CAS PubMed.
T. Minami and M. Nakano, J. Phys. Chem. Lett., 2012, 3, 145–150 CrossRef CAS.
J. M. Holland and D. W. Jones, J. Chem. Soc. C, 1970, 536 Search PubMed.
S. Yamanaka, M. Okumura, M. Nakano and K. Yamaguchi, J. Mol. Struct., 1994, 310, 205–218 Search PubMed.
T. Minami, S. Ito and M. Nakano, J. Phys. Chem. Lett., 2013, 4, 2133–2137 CrossRef CAS.

Click here to see how this site uses Cookies. View our privacy policy here.