OM-Diff: inverse-design of organometallic catalysts with guided equivariant denoising diffusion

François Cornet; Bardi Benediktsson; Bjarke Hastrup; Mikkel N. Schmidt; Arghya Bhowmik

doi:10.1039/D4DD00099D

View PDF VersionPrevious ArticleNext Article

Open Access Article

This Open Access Article is licensed under a Creative Commons Attribution-Non Commercial 3.0 Unported Licence

DOI: 10.1039/D4DD00099D (Paper) Digital Discovery, 2024, 3, 1793-1811

OM-Diff: inverse-design of organometallic catalysts with guided equivariant denoising diffusion†

François Cornet ^a, Bardi Benediktsson ^b, Bjarke Hastrup ^b, Mikkel N. Schmidt ^a and Arghya Bhowmik *^b
^aDepartment of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs. Lyngby 2800, Denmark. E-mail: frjc@dtu.dk
^bDepartment of Energy Conversion and Storage, Technical University of Denmark, Kgs. Lyngby 2800, Denmark. E-mail: arbh@dtu.dk

Received 10th April 2024 , Accepted 19th July 2024

First published on 23rd July 2024

Abstract

Organometallic complexes are ubiquitous in numerous technological applications, and in particular in homogeneous catalysis. Optimization of such complexes for specific applications is challenging due to the large variety of possible metal–ligand combinations and ligand–ligand interactions. Here we present OM-Diff, an inverse-design framework based on a diffusion generative model for in silico design of such complexes. Due to the importance of the spatial structure of a catalyst, the model operates on all-atom (including H) representations in 3D space. To handle the symmetries inherent to that data representation, OM-Diff combines an equivariant diffusion model with an equivariant property predictor. The diffusion model generates ligands conditioned on a specified metal-center, while the property predictor guides the generation towards novel complexes with desired properties. We demonstrate the potential of OM-Diff by designing optimized catalysts for a family of cross-coupling reactions, and validating a selection of novel proposed compounds with DFT calculations.

1 Introduction

Organometallic complexes are ubiquitous in numerous application areas such as energy, medicine, functional materials, sensors, or optical devices. They are designed for therapeutics as metallodrugs,¹ can be used to form specialty polymers,² to make organic light emitting materials,³ as photovoltaic materials,⁴ are used in batteries⁵ or as sensors.⁶ Importantly, organometallics are also widely used in catalytic processes, in particular in homogeneous catalysis, and are crucial for many industrial chemical reactions, such as hydrogenation,⁷ hydroformylation,⁸ olefin metathesis,⁹ or cross-coupling reactions (e.g., Suzuki,¹⁰ Heck,¹¹ and Stille¹² reactions). In silico catalyst design is a grand chemical challenge,^13,14 and the combination of machine learning (ML) and quantum chemistry (QC) methods is an appealing strategy for such endeavor.¹⁵

Augmenting conventional high-throughput screening workflows with machine learned predictive models¹⁶ is a common approach. Comprehensive overviews of the role of ML predictive models in the computational discovery of organometallic complexes can be found in the review of Kulik et al.,¹⁷ and in the perspective of Aspuru-Guzik et al.¹⁸ for the specific case of homogeneous catalysis. In general, the usual workflow consists of an initial training phase on a moderately-sized database of representative examples,^19–25 followed by a screening campaign on a large virtual library containing similar compounds.^26,27 The screening library is usually constructed based on the chemical intuition of the practitioner, e.g. by combinatorial enumeration of pre-defined building blocks,^26,27 or via systematical functionalization of a starting scaffold.²⁸ ML models offer fast property evaluation while preserving an accuracy nearly identical to that of the reference method.

A more recent paradigm²⁹ attempts to move beyond chemical intuition and systematical enumeration by using generative models to learn the distribution of the chemical space of interest in a data-driven way.³⁰ Novel complexes that share aggregate properties with the training data can in turn be generated by sampling from the model. When combined with predictive modelling, generative models open the door to the inverse-design of materials and molecules with optimised properties, i.e. goal-directed generation.³¹ Homogeneous catalysts are molecular in nature, and numerous generative models for molecules have been proposed in the literature.³² Existing models mainly differ by the data representation and the generative paradigm they employ. Strings^33,34 and molecular graphs are the most extensively used representations. On these geometry-free representations, common approaches include Generative Adversarial Networks (GAN),^35,36 auto-regressive models,^37,38 Variational Auto-Encoders (VAE),^39–42 or Reinforcement Learning.^43–45 Genetic Algorithms (GA)^46,47 are a notable exception that does not involve learning but instead evolves a population of candidates via a set of predefined operations. Due to the importance of spatial information in molecular properties, another line of work has focused on the generation of 3D structures, as point clouds^48,49 or voxels.⁵⁰ Instead of relying on external software^51–53 to map the generated geometry-free descriptors to possible conformations, these models^{28,48–50,54,55} directly output structures in 3D space. Among the different generative paradigms, diffusion models^56,57 on point clouds⁴⁸ have emerged as particularly promising. Their expressivity, resulting from a series of (simple) transformations with shared parameters that map the prior distribution to the data distribution, combined with the possibility of leveraging architectural advances in neural network force fields⁵⁸ are supposedly two reasons behind this success. Since the introduction of the equivariant diffusion model for molecules,⁴⁸ multiple further developments have been done to tackle various related problems such as conformer generation,^59,60 transition-state generation,^61,62 linker design,⁶³ structure-based design,⁶⁴ target-aware design,⁶⁵ or molecular docking.⁶⁶ For the specific task of molecular inverse-design, conditional diffusion⁴⁸ and guidance⁶⁷ constitute the two principal approaches, and in particular guidance was recently demonstrated to be practically effective.⁶⁸ The success of diffusion models at inverse-design has not been limited to point cloud representations, but also to image representations encoding microstructures⁶⁹ and zeolites.⁷⁰

1.1 Contributions

In this work, we introduce OM-Diff, a generic inverse-design framework for generating organometallic complexes with optimized properties. Given the importance of geometry in catalytic activity, our framework operates on 3D atomistic structures. At its core, OM-Diff implements an equivariant diffusion model generating ligands conditioned on a fixed geometric description of the center. To perform inverse-design, an auxiliary equivariant property predictor steers the generative process of the diffusion model towards properties of interest via a guidance mechanism. Compared to a property-conditional generative model, guidance limits the amount of expensive task-specific labeled data required (e.g. energy barriers for a particular reaction) to what is needed for obtaining a surrogate model of satisfactory accuracy, and enables unconditional training over large relevant databases.^21–23 To demonstrate the potential of OM-Diff, we inverse-design organometallic catalysts for a family of cross-coupling reactions.²⁶ The corresponding workflow is depicted in Fig. 1.


	Fig. 1 Inverse-design workflow. (Top) Overview of the conditional generation process of an organometallic complex : (1) a metal center, , is sampled; (2) based on the centre, the number of atoms contained in the coordinated ligands is sampled, (3) random atomic types and positions are assigned to all ligand atoms, and (4) the conditional denoising runs for T steps. Each denoising step involves an unconditional denoising update (steering towards valid molecules, via ε_θ) followed by a property target correction (steering towards molecules with the desired properties, via y_ϕ). The inset shows an example of a denoising trajectory for a complex with a Pd center. The position and atomic type of the center are kept fixed during the whole trajectory, and only the surrounding atoms are denoised. Their positions and types are allowed to change over the course of the generation. (Bottom) After a validity check, the generated complexes are screened using a surrogate model y_ϕ′. The promising complexes are further validated with DFT calculations. In the experiments of this paper, the property of interest is an energy difference, vide infra in Section 3.3.

We summarize our contributions as follows:

• We propose OM-Diff, a framework that implements a 3D equivariant diffusion generative model, specifically designed for organometallic complexes, combined with an equivariant property predictor to perform regressor-guidance, and sample novel complexes with targeted properties;

• We analyze several key design choices needed to attain a practical performance level, such as treating the center as contextual information, and varying the expressivity of the denoising neural network architecture;

• For the specific problem of optimizing a critical step in a family of cross-coupling reactions, we close the loop by validating a selection of generated complexes using DFT calculations, and identify several novel compounds of potential interest.

1.2 Related work

Existing generative models that have been successfully applied to catalyst optimization are based on geometry-free representations. Most notably, GA^71–74 have proven very effective when coupled with a reliable and reasonably cheap fitness function. A VAE operating on SMILES⁴² has also shown promises. While concurrent work⁷⁵ similarly leverages an equivariant diffusion model to perform scaffold-conditioned ligand design for organometallic complexes, OM-Diff constitutes, to the best of our knowledge, the first⁷⁶ complete inverse-design framework based on equivariant diffusion for organometallic catalysts. Methodologically, OM-Diff bears similarities with previous work in different application areas. The 3D-conditioning mechanism employed in OM-Diff resembles that introduced in DiffSBDD⁶⁴ and in DiffLinker,⁶³ where an equivariant diffusion model also generates ligands conditioned on a fixed geometric context. In terms of inverse-design, the closest framework in the literature is GauDi,⁶⁸ where property-conditioned generation is performed similarly using the guidance mechanism proposed by Bao et al.⁶⁷ Compared to GauDi, OM-Diff is conditioned on the 3D geometry of the center, operates on all-atom representations instead of coarse-grained graphs of fragments, and targets a different chemical space (organic molecules vs. organometallic complexes). In summary, the combination of the 3D conditioning on the center, the guidance mechanism and target application is what delineates OM-Diff from previous work.

2 Methods

2.1 Problem statement

The goal of OM-Diff is to inverse-design novel organometallic complexes, that we refer to as

, with desired properties, that we denote y. We frame this task as a conditional sampling problem, where we aim to sample from the conditional distribution

. We approximate the conditional distribution,

, by combining an unconditional diffusion model,

, with a property predictor

2.2 Outline

In Section 2.3, we introduce how organometallic complexes

are represented in practice, i.e. the data representation that the different models operate on. In Section 2.4, we present the unconditional generative model. Using the unconditional generative model and an auxiliary property predictor, we present in Section 2.5 the guidance procedure for performing conditional sampling. Finally in Section 2.6, we present details about the surrogate model used to screen the complexes sampled from the generative model.

2.3 Data representation

Organometallic complexes. In a computer, molecular complexes can be represented as unordered point clouds of atoms embedded in Euclidean space. Each atom is assigned a position vector, an atom type, and potentially other features such as charges. Organometallic complexes are typically composed of a center, made of one or more transition metals, surrounded with ligands coordinated in specific ways. Based on that observation, we represent an organometallic complex,

, by two distinct subsets: one with the atoms belonging to the center, denoted

, and the other with the atoms belonging to the ligands, denoted

. Formally, we write


	(1)

where

represents the atomic coordinates, and

the atom types.

Modelling metal center and ligands separately is motivated by two factors. On one hand, the geometry of the region around the center often has to follow strict (known) rules, and is thereby often fixed. For instance, the square planar geometry is prevalent for transition metal complexes with d⁸ configuration. This is known and should therefore not be learnt by the model. On the other hand, when generating novel catalysts one wants to have full control over the center in order, for example, to enforce the central atom to be an earth-abundant transition metal. Depending on the problem under study, the positions (x^(C)), and/or the compositions (h^(C)), or possibly parts thereof, can therefore be fixed and viewed as a form of context around which the ligands should be built.

In our experiments in Section 3, we consider a simple case where only the metal center is fixed and only one coordination pattern is present in the training data. However, the formulation in eqn (1) entails more complicated cases such as centers featuring different coordination patterns, or scaffold-based design where some of the ligands can be also be considered part of the center subset.

Target properties. The goal of inverse-design to generate novel complexes with desired properties. Such properties can be of different natures such as polarizability, dipole moment magnitude, or binding energies as in our experiments in Section 3.6. To learn a structure–property relationship model, we often have at our disposal a database of complex – property pairs:

. In our experiments, we specifically consider a case where the target property corresponds to the binding energy of a specific reaction.

2.4 Diffusion model for organometallic catalysts

In our framework, the objective of the generative model is to learn an unconditional distribution over organometallic complexes,

, from which we can readily sample novel complexes. In practice, we implement a tailored extension of the equivariant diffusion model for atomistic point clouds (EDM).⁴⁸

Diffusion models^56,57 are generative models that include two distinct processes: (1) a fixed diffusion process that iteratively corrupts data points (atomistic structures) towards a known prior distribution (i.e. white noise) through additive noise, and (2) a generative denoising process optimised to reverse the diffusion process. Starting from complete noise, the trained denoising process produces realistic atomistic point clouds through iterative denoising.

Diffusion process. The forward diffusion gradually corrupts data points over T steps, ending up in complete noise. This procedure formally defines a distribution over T latent variables

that are increasingly noisy versions of an initial atomistic structure


	(2)

As is fixed, and transitions are Gaussian, each step is defined as


	(3)

where 0 < β_t < 1 is some user-specified noise schedule that determines how much information is destroyed, and δ(·) is the Dirac delta measure. Intuitively, eqn (3) expresses that, at each transition, the center is left unchanged, while the ligand features are corrupted by (1) being scaled down by

, and (2) being summed with Gaussian noise of variance β_t.

For large enough T, the terminal distribution of the ligand features (i.e. positions and atom types) becomes data-independent, . Due to the formulation of the diffusion process in eqn (2) and (3), i.e. Markovianity and Gaussian transitions, any time marginal, or said otherwise distribution of any given , can be derived analytically as


	(4)

where

, and σ_t is a function of the noise schedule

up to time t. This implies that any noisy version of

, can then be obtained without the need of going through the whole chain defined in eqn (2),


	(5)

where

. This formulation reveals particularly useful for learning the generative denoising process.

Denoising process. We seek to learn a denoising process that reverseseqn (3), i.e. that can denoise

into

. In what follows, we simplify notations by omitting the Dirac distribution for the center. With access to

, the true denoising process is another normal distribution that writes


	(6)

where the mean and variance are given by


	(7)


	(8)

While the variance of the true denoising process in eqn (8) only depends on the (known) time schedule, the mean in eqn (7) involves that is unknown at sampling time, as it is what we seek to generate. We therefore learn an approximation to . Using eqn (5), we can rewrite , such that can now be determined given its current noisy version , and the noise ε. The latter is not known but can be approximated using a denoising neural network ε_θ trained to map to ε. We can then parameterise our generative model using ε_θ, as


	(9)

where the variance comes from eqn (8), and the mean is expressed as


	(10)

obtained by replacing

by its approximation

in eqn (7). We note that, in addition to

, the denoising neural network ε_θ is also provided with the time step t to facilitate the learning of a time-dependent function.

Task and training procedure. As the denoising neural network ε_θ is presented with noisy atomistic structures,

, obtained using eqn (5), and is tasked with predicting the (unscaled) noise ε that was sampled to obtain the corrupted structures, its training objective follows naturally. The parameters of ε_θ are optimized by the so-called simplified loss objective,⁷⁷


	(11)

where

denotes the output of the denoising neural network, and

is the noise sampled to form

according to eqn (5).

Symmetries of the learning problem. Due to the geometric nature of atomistic structures, the learning problem defined in eqn (11) features different symmetries that should be accounted for. First, as atomistic structures have a set structure, i.e. their atoms feature no intrinsic order, they require ε_θ to be permutation-equivariant. This requirement intuitively means that permuting the order of the input atoms should result in a similar permutation of the output of ε_θ. Second, ε_θ has to be invariant to translations of the input. In other words, the output of ε_θ should not depend on the geometric center of the input atomistic structure. Finally, ε_θ should be equivariant to rotations and reflections of the input atomistic structure. In other words, rotating/reflecting the input structure should lead to an identical rotation/reflection of the output. Formally, the two last desirata (translation invariance and rotation equivariance) write


ε_θ([R_xx + t_x,h],t) = [R_x_x^(L),_h^(L)]	(12)

where

denotes any vector, and

denotes any orthogonal (rotation/reflection) matrix.

Denoising neural network parameterisation. To handle the symmetries of the learning problem, ε_θ should be implemented using an appropriate architecture. In the original EDM,⁴⁸ε_θ is based on EGNN,⁷⁸ an equivariant graph neural network. We instead parameterize our ε_θ using an architecture inspired from PaiNN.⁷⁹ Along with the usual scalar hidden states, a set of (equivariant) vectorial hidden states is also maintained and updated for each atom through message-passing.⁸⁰ The messages exchanged in our ε_θ are constructed using local descriptors based on pairwise distances and angles, whereas EGNN⁷⁸ only leverages pairwise distances to construct messages. This leads to a more expressive architecture, as local angular information can now be resolved,⁸¹ while remaining cheap to evaluate compared to architectures leveraging higher-order tensors. After the message-passing phase, the final scalar states are pooled to predict [small epsilon, Greek, circumflex]

_h^(L), while the final vectorial states are aggregated in a single vector [small epsilon, Greek, circumflex]

_x^(L). More details about the neural network architecture are given in Section S1.2.†

Invariance of the learned distribution. The likelihood of a given complex under the learned distribution

should be invariant under rigid transformations. In other words, all possible orientations of a given complex should have the same probability of being sampled. Rotation invariance is ensured by the combination of the rotation-equivariant architecture of ε_θ, and the isotropic Gaussian prior and transition distributions.⁵⁹ Translation invariance can be ensured by the translation-invariant architecture of ε_θ, and by having a prior and transition distributions over atomistic positions with fixed center of mass,

. Alternatively, translation invariance can be ensured by keeping the position of the metal center fixed, which is what we do.

Unconditional sampling procedure. Once trained, ε_θ can be used to generate novel samples. The unconditional sampling procedure is outlined as follows: (1) we sample the center

– possibly composed of multiple atoms, by drawing it from an empirical distribution over centers;‡ (2) we then sample the number of remaining atoms that will compose the ligands given the center,

; (3) we sample the initial positions and atom types of the ligand atoms,

; (4) we finally employ the trained denoising neural network ε_θ, and execute the standard ancestral sampling procedure by iteratively applying eqn (9).

2.5 Regressor guidance

So far, we have introduced the necessary tools for learning a generative model from which novel atomistic structures can be sampled unconditionally. When performing inverse-design, we are interested in being able to sample novel atomistic structures with optimised properties, i.e. sample from

. In this section, we present a procedure, named regressor-guidance, for performing conditional sampling using a pretrained unconditional generative model presented in Section 2.4 combined with an auxiliary property predictor.

Conditional model. Inspired by classifier-guidance,⁸² regressor-guidance⁶⁷ builds on the observation that conditioning on a property of interest y can be done by sampling from a conditional denoising process


	(13)

where

is the unconditional denoising process from eqn (9), and

is a conditional distribution over properties induced by a property predictor y_ϕ. Here, we define

in terms of an energy function,


	(14)

such that

. This formula is also similar to that of loss-guided diffusion (LGD),⁸³ with a loss function that includes a learnt component, i.e. y_ϕ. The conditional denoising process from eqn (13) writes as a corrected version of the unconditional denoising process defined in eqn (9),


	(15)

where the corrected mean is obtained as


	(16)

The correction is obtained by evaluating the gradient of f_ϕ with respect to the mean predicted by the unconditional model. In practice, sampling from the conditional distribution amounts to first evaluating the mean of the unconditional distribution, then evaluating the energy function in eqn (14) using the estimated mean, and finally computing the correction expressed as the gradient of eqn (14) with respect to .

Task and training procedure. As per eqn (14), the property predictor y_ϕ is tasked with estimating the property of interest y of structure

, given

a noisy version thereof. This implies that y_ϕ should be trained on noisy structures obtained with the same diffusion process as that of the generative model. In practice, the prediction task becomes very difficult when t → T, as the input structures are close to pure noise. To limit the negative influence of the noisiest structures on the learning process, a solution is to resort to a robust objective function such as the Huber loss


	(17)

where

denotes the model prediction.

Property predictor parameterisation. In Experimental section of this paper, we consider a case where the target property y is a scalar value that is invariant to rigid transformations of the input atomistic structure. The property predicted by y_ϕ should therefore not depend on the orientation nor the absolute position of

. We consequently parameterise the property predictor y_ϕ in eqn (14) with a neural network, that features an encoder similar to that of the denoising neural network ε_θ, i.e. that maintains and updates a set of scalar and vectorial states for each atom. The encoder is followed by a readout layer that aggregates the final scalar states into a sole complex-level state using an attention mechanism. That aggregated state is in turn passed to a fully-connected neural network outputting the predicted value of y.

Advanced conditioning. While the exposition of the guidance mechanism was particularised to a single target scalar value, it can readily be extended to more advanced conditioning schemes. To condition on multiple properties for instance, an energy function that rely on several property predictors, {y_{ϕ_i}}^I_i=1, can be designed. An example of such function could write,


	(18)

where [y_i]^I_i=1 refer to the individual targets, and [α_i]^I_i=1 with α_i > 0 denote how the individual targets should be weighted in order to e.g. emphasize certain targets more than others or account for differences in scale.

Open-ended target conditioning can also be achieved by altering eqn (14). An instance of suitable energy function can be , where s = 1 in the case of a minimization and s = −1 in the case of a maximization.

2.6 Screening surrogate

Once samples have been generated, it is not feasible to evaluate all of them using DFT calculations. We therefore employ a surrogate model for screening the generated samples, before running further calculations with DFT on the most promising candidates. The screening surrogate shares the same architecture as the time-conditioned regressor presented in Section 2.5, but without time input. We are interested in having a surrogate that behaves well over the whole property space, and avoids very large errors, as those could lead to missed candidates, or unfruitful expensive DFT calculations. We consequently train the screening surrogate with a loss function that penalizes large errors more. In particular, we resort to the reverse Huber loss⁸⁴ which, compared to eqn (17), apply the L₂-norm to outliers.

3 Experiments and results

As a test bench for OM-Diff, we choose the inverse-design of organometallic catalysts for cross-coupling reactions. This family of reactions constitutes an interesting case study for OM-Diff for two reasons: (1) its practical relevance, and (2) the compelling literature showing that relevant catalysts can be searched through optimization of a binding energy that acts as a proxy for the actual catalytic activity.⁸⁵

3.1 Practical relevance

As mentioned in the Introduction, cross-coupling reactions are important in several fields. They are crucial in the synthesis of various organic molecules, as they enable facile formation of carbon–carbon (C–C) and carbon–heteroatom (C–N, C–O, C–S, etc.) bonds. Examples of applications of the compounds synthesized by cross-coupling include active pharmaceutical ingredients,⁸⁶ organic semiconductors,⁸⁷ and conductive polymers.⁸⁸ Cross-coupling reactions can be made to be environmentally friendly, as it is possible to utilize catalytic systems that minimize waste and avoid the use of toxic reagents. Furthermore, the Suzuki cross-coupling reaction enables the formation of carbon–carbon bonds by coupling olefins with organoboron compounds under mild conditions. This method is popular due environmental friendliness, using less toxic and more stable reagents, thereby aligning with principles of green chemistry. The importance of the Suzuki cross-coupling reaction is perhaps best demonstrated by the fact that it led to the inventors to receive the 2010 Nobel prize in chemistry.^89,90

3.2 Volcano plots

The Suzuki reaction is part of a larger family of cross-coupling reactions (Suzuki, Kumada, Negishi, Stille and Hiyama)^91,92 – whose cycle is generically represented in Fig. 2a, where the leaving group is usually a halide (represented by an X in Fig. 2a). The five reactions differ by the nature of the cross-coupling partner (represented by Y in Fig. 2a). As Corminboeuf et al.^26,85 proposed, the rate limiting step of these reactions can be described with volcano plots. Although not common in homogeneous catalysis, volcano plots have proved to be very useful in heterogeneous catalysis. Volcano plots for the cross-coupling reactions under study have been shown to be similar,⁹² differing only by the width of the plateau region. The Hiyama reaction has widest plateau region spanning [−82.2, 2.0] kcal mol⁻¹, while the region of interest for the Suzuki reaction is only between −32.1 and −23.0 kcal mol⁻¹. The other reactions have a plateau region lying in between that of the two aforementioned reactions. As a consequence, catalysts that bind too strongly or too weakly for the Suzuki reaction might still be efficient for other cross-coupling reaction variants. Using volcano plots, the reaction energy of the oxidative addition process, highlighted in blue in Fig. 2a, can be used as a descriptor to estimate catalytic activity.^26,92 This approximation allows for faster screening of candidates, since the whole catalytic cycle does not need to be explored. All possible catalyst candidates should sit on the volcano plateau, or near the top.


	Fig. 2 (a) Reaction under study in this work—the model generates L₁–M–L₂ and optimises the reaction energy of the oxidative addition. (b) Distribution of binding energy in the considered dataset.²⁶ (c) Distribution of number of atoms in the considered dataset.²⁶ On average complexes are composed of 53 atoms.

3.3 Dataset

We perform our experiments using the DFT-level subset of the C–C cross-coupling database.²⁶ In the database, each catalyst is an inorganic complex made of a metal center that binds to two ligands, as L₁–M–L₂ with M ∈ {Ni, Pd, Pt, Cu, Ag, Au}. The ligands L₁ and L₂ can either be identical or different. In addition to the aforementioned DFT-level subset, the database also contains geometries of all the possible combinations of metals and ligands at the MMFF-level of theory,⁹³ that is, 6 metal centers with 91 distinct ligands. This makes for a total of 25 [thin space (1/6-em)]

116 possible combinations. However, in the database, optimized geometries and the corresponding oxidative addition binding energies were computed with DFT for around 7000 complexes. Those complexes cover the 6 different metal centers, and 72 unique ligands (all depicted in Fig. S16 and S17†). Pd is the only metal present with all possible ligands, while the other metals are only combined with a limited number thereof. An overview of the metal–ligand combinations present in the data is provided in Fig. S15.† Although DFT relaxed geometries and binding energies are available for around 7000 complexes, the diversity of the data is limited because only 72 ligands, or building blocks, were used. The binding energies are for the oxidative addition reaction of the substrate with the transition metal, depicted in blue in Fig. 2a. The corresponding distribution across the dataset is illustrated in Fig. 2b.

DFT computational details. We confirmed with DFT calculations (vide infra in Section 3.6) a few promising catalyst candidates generated by OM-Diff. We performed the calculations with the quantum chemistry software ORCA version 5.0.4 (ref. 94 and 95) using a protocol similar to that of the initial study²⁶ that generated the training data. Specifically, we used the B3LYP functional^96–98 with the Pople 3-21G basis set^99–102 for the geometry relaxation of Cu and Pd complexes and the Ahlrichs def2-SVP double-ζ basis set¹⁰³ (ORCA keyword

) for Pt complexes. The parameters for the 3-21G basis set were downloaded from the basis set exchange.¹⁰⁴ We used the original D3 dispersion correction¹⁰⁵ (ORCA keyword

), as per the original protocol.²⁶ The RIJCOSX approximation^106,107 was used to speed up Coulomb and Exchange integrals, with the automatic generation of an auxiliary basis set¹⁰⁸ (ORCA keyword

) for calculations that used the 3-21G basis set, and the def2/J auxiliary basis set¹⁰⁹ (ORCA keyword

) for the def2 family of basis sets. For energy evaluations, we performed single point calculations using the Ahlrichs def2 triple-ζ basis set¹⁰³ (ORCA keyword

Previous work. In the initial study,²⁶ the remaining MMFF-level⁹³ configurations were screened using a surrogate model trained to map MMFF-level geometries to DFT-level binding energies. While promising candidates were identified, none of them was investigated further with DFT. In a generative context, a VAE operating on string representations⁴² has also been applied to the dataset. The model displayed controllability, and could generate novel and promising candidates. Very recently, a GA⁷⁴ was successfully used to generate promising catalysts for the Suzuki reaction.

3.4 Unconditional generation

We first test the ability of OM-Diff to perform effective unconditional generation of organometallic complexes. This constitutes a prerequisite for effective conditional generation, and is also a valid inverse-design procedure when combined with screening. We evaluate and compare the ability of different ablated versions of OM-Diff to learn the unconditional data distribution. We specifically study two aspects of the generative diffusion model: (1) the modelling of the central region as context around which the model is tasked to build the ligands, and (2) the expressiveness of the neural network architecture.

Setting. After training each model variant, we generate 10 [thin space (1/6-em)]

000 samples, where the number of atoms is drawn from the empirical distribution displayed in Fig. 2c. Firstly, we evaluate the properties of the generated samples in terms of structural metrics: validity, uniqueness and novelty. Secondly, we compare the samples with the empirical data distribution in terms of the geometry around the metal center M, the binding energy as estimated by a surrogate model, and the composition. For the geometric metrics, we also include metrics computed on the force-field-level subset²⁶ to get an idea of the advantage yielded by a generation in 3D. In Fig. 6b, we additionally compare the distribution of the strain energy at xTB-level¹¹⁰ of the generated compounds, i.e. energy difference between generated structure and xTB-optimised structure. Additional details about the evaluation procedure and the different baselines are provided in Section S2,† where we additionally report the error of the diffusion variants on a held-out set as a function of the noise level Fig. S1.†

Effect of center. Modelling the center as context (i.e. the model only acts on the ligands) is beneficial for two things: (1) the geometry around the metal center, and (2) the composition. Regarding geometry, the model reproduces the training distribution of the geometry around M better, as seen from W^L_1,2–M and W^{L₁–M–L₂} in Table 1. Graphical depictions of the corresponding distributions can be found in Fig. S4 and S5.† Regarding composition, the generated distributions over atom types are closer to the training distributions when the center is modelled as context, as hinted by TV_M, TV_non-M, and TV_prox (all defined in eqn (S10)†) in Table 1. TV_M is virtually zero, since the metal-center is directly sampled from the empirical distribution obtained from the training data. The detailed metal center distribution for each variant of the model can be found in Table S5.† The distribution of center and non-center atomic elements are shown in Fig. 3a and b, whereas the distribution of center-proximal atomic elements can be found in Fig. S8† as well as the associated total variations in Table S6.†Fig. 3c and S9† also indicate that models where the center is part of the context tend to generate complexes whose molecular weight (proxy for joint distribution of atom types) tend to be closer to the training data distribution.

Table 1 Results of the unconditional sampling experiment. Structural metrics are reported in % of the number of generated samples. Distances to the empirical data distribution are Wasserstein (W) distances for continuous features and total variation (TV) for discrete features. Details are given in S2

	Fixed	Improved ε_θ	Structural metrics [%] (↑)			Distance to empirical distribution (↓)
			Structural metrics [%] (↑)			Center-proximal geom.		Binding energy [kcal mol⁻¹]	Generated atom types [TV]
			Valid	Valid and unique	Valid, unique and novel	Length [Å], W^L_1,2–M	Angle [rad], W^{L₁–M–L₂}	W ^ΔE	Center, TV_M	Non-center, TV_non-M	Center-proximal, TV_prox
a The TV_M is virtually 0 for models that sample directly from the empirical distribution.
OM-Diff	✗	✗	8.9	7.9	4.8	0.256	0.0113	0.0044	0.347	1.036	0.079
	✓	✗	19.6	16.4	7.5	0.165	0.0084	0.0044	≅0.0	0.482	0.029
	✗	✓	18.9	17.6	11.1	0.203	0.0089	0.0040	0.388	1.005	0.041
	✓	✓	28.2	23.9	11.8	0.147	0.0079	0.0036	≅0.0	0.593	0.019
Merck molecular force field			—	—	—	0.814	0.0257	—	—	—	—


	Fig. 3 Distribution of (a) metal-centers, (b) non-metal elements, and (c) molecular weight in unconditional sampling.

We hypothesize that the observed improvement is due to a simplified classification task. When inspecting Fig. S1,† we see lower losses on ε_h and h, especially for larger noise levels. In Table S2,† we can also see that chemical validity is improved, i.e. fixing the center can make up for a less expressive backbone in terms of validity.

Effect of representation. We find that more expressive geometric neural networks generally bring an increased validity, noticeably so as generated complexes get larger as highlighted in Fig. 4a. We also provide Table S2,† where we can clearly see that the added expressivity yields more configurations that pass the pairwise distances check, i.e. more expressive architecture leads to less atom clashes and disconnected fragments. In Fig. S1,† we can also observe that the more geometrically expressive variants yield lower losses on the ε_x and ultimately on x, i.e. the atomic coordinates. The difference is especially stronger for lower noise levels, when getting closer to the data manifold. Our findings are in line with previous work, e.g. ref. 111, that also showed improved generative capabilities resulting from geometrically more expressive architectures.


	Fig. 4 (a) Validity as a function of the complex size. (b) Sources of novelty for the different variants of the model, where ‘NC’ stands for ‘Novel Combination’, ‘1L’ refers to samples where 1 ligand is new, and ‘2L’ refers to samples where both ligands are novel. For each variant, 10000 complexes were sampled.

Validity. As described in details in Section S2.1,† we deem a complex valid, if it features exactly one metal center, has no isolated nor clashing atoms, and passes a validity check based on the RDKit software.⁵¹ OM-Diff(✗, ✗), the ablated version of OM-Diff similar to the original EDM model,⁴⁸ yields only around 900 valid complexes among the 10 [thin space (1/6-em)]

000 generated. The two proposed modifications yield a substantial improvement, allowing a better modelling of the chemical space under study. With nearly 3000 valid structures out of 10 [thin space (1/6-em)]

000, this corresponds to over threefold improved efficiency of valid sample generation. In Fig. 4a, we display validity as a function of the size of the generated complexes. As expected, larger complexes tend to be more difficult to generate than smaller ones. As a molecule is deemed invalid if e.g. the valence of any of its atoms is not respected, validity at complex-level decreases quickly with complex size, even if the atomwise validity is kept constant. In Fig. 4a, we can clearly see that the full OM-Diff is the only variant of the model able to generate larger valid complexes.

In Table S2,† we provide more details regarding the validity results. We observe that already 50% of the generated structures are deemed invalid due to abnormal pairwise distances, i.e. isolated or clashing atoms. Among the structures passing the distance check, only around 60% can be properly parsed by ,⁵¹i.e. the bonding structure can be inferred without charge and the resulting object contains exactly two fragments corresponding to L₁ and L₂. This highlights the most recurring failure modes that were observed: disconnected/clashing atoms, and missing protons leading to charged complexes. While an appropriate post-processing procedure, e.g. removing the disconnected fragments or fixing valences, could further increase the effective validity, modelling the bonding information^112,113 could help solve connectivity and valence issues at the source.

Uniqueness and novelty. While generating complexes that are chemically valid is an important first step, these complexes also need to be different from each other and novel. A generative model that only regenerates data it has been trained on is not extremely useful for inverse design. As seen in Table 1, all variants are able to produce varied and novel complexes. In Fig. S2,† we show how validity, uniqueness and novelty evolve with the number of sampled complexes. While validity remains rather constant as the number of generated compounds is increased, uniqueness tends to decrease, indicating that the model tends to generate given compounds multiple times. It needs to be emphasized that these metrics are obtained by converting the actual geometry into

strings and limiting the comparison based on

description. Two identical

strings can actually have been obtained from (slightly) different geometries. Novelty is also observed to remain constant as the number of generated complexes is increased. Interestingly, while OM-Diff clearly yields more valid (and unique) complexes, it is on par in terms of novelty with the variant where the metal-center is not modelled as context. Finally, when looking closer at non-unique complexes, we observe that it is mostly non-novel complexes (i.e. present in the training data) that are generated multiple times. However, the model is also able to generate novel compounds multiple times. In Fig. S3,† we show the number of novel ligands generated as a function of the number of generated complexes. We can see that it steadily increases with the number of complexes generated.

Sources of novelty. Due to the way the dataset was constructed, i.e. as combination of 72 ligands as detailed in Section 3.3, novelty can take different forms. Novel compounds fall into 3 different categories: (1) novel combinations of 2 existing ligands, (2) combinations of an existing ligand with a novel ligand, and (3) combination of 2 novel ligands. In Fig. 4b, we display the distribution of novelty for the different variants of the model, in percentage of the novel samples generated. We observe that variants where the center is part of the context tend to get a larger percentage of novelty coming from novel combinations, highlighting that they have learnt the data distribution (slightly) better than their counterparts that do not fix the center.

Generated complexes. Given the rather limited structural diversity of the training data: 6 metal centers and 72 ligands, the model only gets to see a very restricted part of the chemical space. We have shown in the previous section that while the model can be prone to regenerate the building blocks seen during training, all model variants were able to generate novel ligands, as highlighted in Fig. 4b. We provide an excerpt of 18 novel ligands in Fig. 5 generated with the full OM-Diff. Compared to the ones used to build the dataset, displayed in Fig. S16 and S17,† we observe that the generative model tends to produce ligands in the neighbourhood of the training data, hinting that the model has learnt the underlying distribution. As a concrete example, ligand

is similar to ligand

with an extra F atom.


	Fig. 5 Novel ligands generated by OM-Diff through unconditional sampling. The ligands have been randomly picked, and ordered by visual similarity.

Quality of generated complexes. In Table 1, we reported W^ΔE that quantified the discrepancy between the binding energy distribution of the generated complexes and the ground-truth one. We can additionally leverage an ensemble of screening surrogates to estimate uncertainty of the generated complexes. This is what we display in Fig. 6a and S7.† The predictive uncertainty is taken as the standard deviation across 10 surrogates. While uncertainty has not been calibrated to match the actual errors, relative comparisons can still be performed under the reasonable assumption that the uncertainty estimate is capable of ranking. As the different surrogate models have been trained on clean data only, we can expect them to disagree as data starts looking less and less realistic. Similar to the validity results, the full OM-Diff model variant generates samples about which the screening surrogates disagree the least. We additionally computed the strain energy for the samples generated by the different variants of OM-Diff. Their cumulative distributions are displayed in Fig. 6b. The strain energy per atom is defined as the energy difference between the structure generated by the generative model, and its energy after relaxation using xTB normalised by the number of atoms. For a fair comparison, we only included complexes with up to 75 atoms, as the simplest version of OM-Diff could not generate any larger sample that was valid. Interestingly, we find variants of OM-Diff where the center is modelled as context to generally lead lower strain energy. This can probably be explained by a better modelling of the region around the metal center, as also displayed in Table 1, via lower W^L_1,2–M and W^{L₁–M–L₂} values.


	Fig. 6 Cumulative distribution of (a) predictive uncertainty, as estimated by an ensemble of 10 surrogates, evaluated on the samples generated by the different variants of the generative diffusion model; (b) strain energy per atom as estimated by xTB.¹¹⁰

3.5 Performance of the surrogate models

As illustrated in Fig. 1, our framework heavily relies on surrogates trained to approximate DFT-level binding energies. Such models are involved at two different steps: (1) to steer the conditional generation, and (2) to screen and filter final samples prior to DFT computations. It is therefore crucial to evaluate the accuracy of the different surrogates across the chemical space of interest. The range of the binding energy across the dataset, is around 120 kcal mol⁻¹. Following previous work, we consider a model useful if its error is within 5% of that range, i.e. around 5 kcal mol⁻¹. A dummy model that outputs the conditional mean of the dataset gets an average root-mean-squared error of around 8.3 kcal mol⁻¹. To evaluate errors, we perform a stratified 10-fold cross-validation, where folds were designed to keep the proportion of metal centers approximately equal across folds, and to cover the property range uniformly. We compare against two baselines: one based on SLATM,²⁶ and another that trains a neural network surrogate in the latent space of a VAE.⁴²

Loss function. To reduce large errors on outlying data points, we experimented with the reverse Huber loss (coined ‘revHuber’) for the screening surrogate. We observed a slight improvement in performance for model trained with the ‘revHuber’ loss, as illustrated in Fig. 7a and S10.† We also summarize the error diagnostic of our screening surrogate in Tables 2 and S7–S10.† While our model is accurate on average, slightly more than the compared baselines, and within the 5 kcal mol⁻¹, we still observe large errors on some specific outliers. In Fig. S11,† we additionally show the surrogate error across the property space for both loss functions. The different curves display a U-shape, highlighting that the surrogate predictive accuracy decreases as we get closer to the tails of the training distribution. For instance, in the case of Cu the MAE of the surrogate is above 5 kcal mol⁻¹ in the plateau region of the Suzuki reaction. This has implications when looking for compounds at the boundaries of the training distribution – predictions in that area should be used cautiously.


	Fig. 7 (a) Residuals of the two variants of loss functions employed to train the screening regressor. ‘MSE’ refers to mean-square error, while ‘revHuber’ stands for reverse Huber. (b) Performance of the two variants of the time-conditioned regressor as a function of the diffusion time step. C+L refers to the noise model that jointly corrupt center and ligands, whereas L stands for the noise where the corruption is limited to the ligands. The horizontal dotted lines represent the errors of the mean and conditional mean predictors.

Table 2 Validation of the surrogate used for final screening. We report mean and standard deviation across 10-fold cross-validation of the mean absolute error (MAE), root mean squared error (RMSE), 95th quantile absolute error (Q95 AR), maximum absolute error (Max AE), and coefficient of determination (R²). Results for SLATM²⁶ and string + MLP⁴² are obtained from the respective papers

	MAE [kcal mol⁻¹] (↓)	RMSE [kcal mol⁻¹] (↓)	Q95 AE [kcal mol⁻¹] (↓)	Max AE [kcal mol⁻¹] (↓)	R ² [—] (↑)
μ _M	6.49	8.50	—	41.71 ± 12.38	—
SLATM²⁶	2.61	—	—	—	—
String + MLP⁴²	2.42	3.85	—	26.02	0.974
Ours (MSE)	2.14 ± 0.08	3.50 ± 0.33	6.98 ± 0.31	32.09 ± 16.95	0.978 ± 0.004
Ours (revHuber)	2.04 ± 0.08	3.42 ± 0.29	6.92 ± 0.45	32.36 ± 16.37	0.979 ± 0.004

Time-conditioned surrogate. In Fig. 7b and S12,† we display the error of the time-conditioned regressor for the two different variants of corruption, i.e. whether the center is part of the context (variant

) or not (variant

). In terms of error, models tend to behave similarly to dummy regressors – respectively mean predictor and conditional mean predictors (represented by the dotted horizontal lines), as structures are getting noisier. The intuition is that making meaningful predictions from (or close to) pure noise is difficult. Both types of models reach a similar accuracy for lower levels of noise. Initially, the error of model

is significantly larger than that of

as the model cannot guess what metal center the complex is going to feature. From Fig. 2b, we know that the metal center has a determinant influence on the binding energy.

3.6 Conditional generation and inverse-design

In this section, we study the ability of OM-Diff to generate novel optimized organometallic complexes, that can effectively catalyze cross-coupling reactions.

Conditional generation. We first investigate whether the conditioning mechanism works overall. To do so, we conditionally sample complexes whose target binding energies are spread across the property space. We study the generated compounds in terms of novelty, and examine their spread around the target value. We show that the conditioning mechanism is effective in parts of the space with sufficient data coverage, but that the effectiveness gradually decreases as we approach the tails of the distribution.

For each metal center, we choose the target values to be the following percentiles of the training data distribution: [0.05, 0.25, 0.50, 0.75, 0.95]. This is to make sure that the surrogate predictions remain somewhat reliable, as we cannot afford DFT calculations for all generated compounds. For each pair metal center – target value, we sample 10 [thin space (1/6-em)] 000 compounds, evaluate their binding energy using the surrogate model, and finally display the corresponding distribution. As an example in Fig. 8a, we show the conditional distributions obtained for Pd, while the same plots for the other metal centers are provided in Fig. S13.† We can observe that for target values above the median, the conditional distributions tend to become more spread out. We hypothesize that this is due to the fact that the unconditional distribution, shown in grey in the back of Fig. 8a, is not symmetric around its median, and usually that target values above the median are more sparsely distributed. Nonetheless, we can effectively steer the conditional distribution. We also expect that the conditional distributions can be made sharper by upscaling the contribution of the guidance term in eqn (16), at the cost of a lower validity and uniqueness.


	Fig. 8 (a) Binding energy distributions obtained through the conditional sampling of Pd, as evaluated by the surrogate. The distribution in grey in the background represents the training data distribution, i.e. DFT labels. Black vertical lines represent target values, and correspond to the [0.05, 0.25, 0.50, 0.75, 0.95] percentiles of the training data distribution. Only valid samples were taken into account. (b) #Valid, #(valid & unique) and #(valid & unique & novel) complexes for conditionally sampled Pd complexes. The novelty is further divided in 3 categories: ‘NC’ standing for ‘Novel Combination’, ‘1L’ referring to samples where 1 ligand is novel, and ‘2L’ referring to samples where both ligands are novel.

As for property controllability, metrics such as validity, uniqueness and novelty are impacted by the target value as seen in Fig. 8b for Pd complexes, and in Fig. S14† for the other metal centers. For parts of the property space that are less well covered by training data, the different metrics tend to drop. Specifically, when targeting the 0.95 percentile, the conditional distribution tends to spread out, and (abnormally) high binding energies are predicted by the model. While the binding energy of these samples is probably mistakenly estimated by the surrogate, we observe that the conditional generative model is still able to produce novel complexes, namely around 250.

Inverse-designing optimized catalysts for the Suzuki reaction. In the previous section, we showed that OM-Diff could be effectively steered towards target binding energies of interest. Here, we attempt to design optimized catalysts that are relevant for the Suzuki cross-coupling reaction. Among the family of reactions under study, the Suzuki reaction has the narrowest plateau region of the volcano plot, spanning [−32.1; −23.0] kcal mol⁻¹.²⁶ We therefore set the middle point of that interval, i.e. −27.55 kcal mol⁻¹, as a target value when performing conditional generation.

As previously, we generate 10 [thin space (1/6-em)] 000 complexes for Pd and Pt. After checking for validity and uniqueness, we only keep novel complexes. We additionally discard the ones with an estimated binding energy that does not fall within the range of interest. Of the remaining samples, we randomly keep 5 compounds for each metal center. For these compounds, we recompute the binding energies using DFT, employing the protocol described in Section 3.3. The novel catalysts are displayed in the two upper rows of Fig. 9, along with their binding energy estimated by the surrogate and calculated using DFT. For the Pd complexes, displayed in the top row, 4 out of 5 fall in the range of interest, and within 2 kcal mol⁻¹ of the energy calculated with DFT. For the Pt complexes, all 5 complexes fall in the range of interest and within 2.5 kcal mol⁻¹ of DFT. These slight discrepancies between DFT and surrogate predictions are in accordance with the errors reported Fig. S11,† where the estimated MAE in that region of the property space is shown to be around 2−3 kcal mol⁻¹.


	Fig. 9 Overview of the novel complexes validated with DFT. (Top row) Pd complexes, (middle row) Pt complexes, and (bottom row) Cu complexes. All binding energies are expressed in kcal mol⁻¹.

Interestingly, we note that although the training dataset contains various ligand types, eight out of ten Pd/Pt complexes contain phosphine-type ligands, that are σ-donors and π-acceptors. Furthermore, four out of five Pd-complexes contain a CO ligand, a σ-donor and π-acceptor. These ligand properties are known to be important for cross-coupling reactions. Although the model does not explicitly learn the electronic structure properties, its choice of ligands can nonetheless be rationalized.

Inverse-designing optimized Cu catalysts. We also tried to inverse-design Cu complexes relevant for the Suzuki reaction. This constitutes an interesting use-case as catalysts made of earth-abundant transition metals are highly desirable. As the screening surrogate could not identify valid samples in the said range of interest, we repeated the experiment with the 5% percentile, i.e. −20 kcal mol⁻¹, as a target value instead. Among the novel complexes generated, we kept 5 that were deemed close to the target value by the surrogate model. The considered complexes are illustrated in the bottom row of Fig. 9. With respect to the previous experiment, the spread between estimated and calculated binding energies appears to be larger. This can be explained by a less accurate surrogate model in that part of the property space, as illustrated in Fig. S11† where a sharp increase in MAE can be observed around −20 kcal mol⁻¹.

The inaccuracy of the screening surrogate is not the only explanation to the unsuccessful conditional generation for the initial target of −27.55 kcal mol⁻¹. The time-conditioned surrogate is also inaccurate in that region of the property space due to lack of training data, and thereby likely to drive the generation process towards complexes with erroneous binding energies. Finally, as the generative model has only seen a handful of complexes in that part of the property space, we can imagine that it is not extremely good at modelling the distribution in that particular area.

4 Discussion and conclusion

We have introduced OM-Diff, a framework for inverse-designing organometallic complexes for target applications. The framework is based on a guided equivariant denoising diffusion model specifically tailored for the generation of organometallic complexes with targeted properties. Instead of directly learning a property-conditional generative model, which would require large volume of computationally expensive data, OM-Diff decouples the structure generative section from the conditional guidance towards coveted property, allowing the use of larger molecular databases for training, and requiring only a limited set of task-specific labeled data for accurate model performance. To inverse-design organometallic catalysts for reactions of interest with OM-Diff, one needs to establish (1) the chemical space of the complexes that catalyze the reaction and (2) the mechanism of the catalyzed reaction along with the rate-determining step.

While OM-Diff is in principle a generic framework for inverse-designing homogeneous catalysts, the demonstration of its potential was limited to the inverse-design of catalyst candidates for a family of cross-coupling reactions, where the considered chemical space had restricted variety and only featured one coordination pattern. In that context, we first showed that the increased expressivity of the denoising neural network combined with a proper modelling of the metal center enables effective unconditional generation of novel complexes. Second, we showed that the model offered controllability, and that sampling could effectively be steered across the property space, while maintaining novelty in the sampled complexes. For the specific case of the Suzuki reaction, we further validated a handful of optimized complexes with DFT calculations. We could successfully generate promising Pd- and Pt-based complexes, and the compounds were shown to have a binding energy (for the activity determining step) within the range of interest. For Cu complexes, we could not generate complexes in the range of interest, highlighting the limitations of the proposed approach in parts of the property space sparsely represented in the training data. However, when the target value was around a less extreme percentile of the property distribution, novel complexes featuring the prompted binding energy (or close to) could successfully be generated. This is promising and indicates that our framework could effectively be integrated in an active learning setup towards real discovery, where the training data is progressively extended towards properties of interest.

As illustrated in the Experimental section, goal-directed generation is attainable but remains a difficult endeavor for models trained offline, with a static dataset. Other methods, based on GA or RL, are known to perform well in settings where they can query the function to be optimized. A hybrid method that uses an offline pretrained diffusion model, and that then further gets optimised online through RL or in an active learning setup is an interesting avenue for future work. As samples with attractive properties often lie at the boundaries of the training data, such candidates can be evaluated with the QM method of choices, added to the training data, and the surrogate retrained on the augmented dataset. An iterative framework,¹¹⁴ where data is gathered continuously, is also a promising avenue that would allow the generative model to progressively move towards regions of interest that were not well covered in the initial dataset.

We envision a significant scope for future work in both application areas and methodological development. While the framework was shown to work in a rather small data regime and with limited variety in the training data, pre-training on a relevant database,¹¹³e.g. the TMQM database,²³ before fine-tuning it in the chemical space of interest might allow for more valid, and novel molecules that lie beyond the neighborhood of the property-labeled training data. For practical relevance, validity and novelty are necessary but not sufficient, and an in silico inverse-design framework should also suggest compounds that are amenable to synthesis.³¹ In OM-Diff, the generation is steered towards promising parts of the chemical space based on a chemical target function, regardless of the potential validity and synthesizability. To address the limited effective validity of the generated complexes, a possible avenue would be to only consider heavy atoms in the generative process, and adding an hydrogenation post-processing step. This would also make the training and sampling faster by reducing the size of generated point clouds. Another related approach would consist in grouping atoms together, i.e. representing fragments as coarse-grained nodes.⁶⁸ Such procedure would require to define (1) a mapping transforming all-atom structures to coarse-grained representations, and (2) a post-processing step replacing the group nodes in the generated structures by their actual composition. The latter would additionally require to decide on the orientation and/or attachment point if multiple possibilities exist.¹¹⁵ A third approach could leverage the guidance setup described from Section 2.5, where a target function could be designed to include the feedback of a classifier trained to distinguish between valid and invalid compounds. Similarly the feedback of a surrogate trained to estimate synthesizability could be leveraged to favour the generation of easily synthesizable complexes. Another relevant direction for future work is multi-property conditioning. The scheme introduced in eqn (18) could for instance be leverage to inverse-design catalysts for more complex reactions, where high catalytic activity requires optimal binding energy in multiple reaction steps. Other methodological improvements in OM-Diff could include modelling of atom and bond types as categorical variables^112,113 (instead of the continuous relaxation used in this work) and predicting the denoised structures directly, as it has been shown to work better for atomistic data.¹¹³ Regarding the conditional sampling procedure, finding a better way to combine the feedback from the generative model and the guide would be useful as well. As seen in Fig. S12,† the time-conditioned property predictor is equivalent to a random guess on the early phases of the denoising process. If the surrogate provides uncertainty, this could also be leveraged to bias the generation, either to avoid uncertain regions or, in an active learning setting, to explore uncertain areas. Finally, while we have demonstrated the applicability of OM-Diff on the basis of generation of all ligands from scratch (i.e. where the center is composed of one atom), the formulation introduced in Section 2.4 is more general, and naturally extends to problems where the context is composed of multiple atoms, for instance in cases where the catalyst is designed based on a handful of scaffolds,¹¹⁶ or in a functionalization setting.²⁸

Data availability

Along with this paper, we release a code repository, that can be accessed at https://github.com/frcnt/om-diff, allowing other researchers to build upon our work.

Author contributions

F. C., M. N. S. and A. B. conceived the research idea. F. C. developed the code with help from B. H., ran the experiments, analyzed the results, and wrote the original draft based on inputs from all authors. B. B. performed validation with DFT calculations. M. N. S. and A. B. supervised the research. All authors discussed, commented on, and revised the manuscript.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

The authors acknowledge financial support from the Independent Research Fund Denmark with project DELIGHT (Grant No. 0217-00326B), TeraBatt project (Grant No. 2035-00232B) and ADANA project (Grant No. 3164-00297B).

References

P. Zhang and P. J. Sadler, Advances in the design of organometallic anticancer complexes, J. Organomet. Chem., 2017, 839, 5–14 CrossRef CAS.
K. A. Williams, A. J. Boydston and C. W. Bielawski, Main-chain organometallic polymers: synthetic strategies, applications, and perspectives, Chem. Soc. Rev., 2007, 36(5), 729–744 RSC.
J. Kalinowski, V. Fattori, M. Cocchi and J. A. G. Williams, Light-emitting devices based on organometallic platinum complexes as emitters, Coord. Chem. Rev., 2011, 255(21–22), 2401–2425 CrossRef CAS.
W.-Y. Wong and C.-L. Ho, Organometallic photovoltaics: a new and versatile approach for harvesting solar energy using conjugated polymetallaynes, Acc. Chem. Res., 2010, 43(9), 1246–1256 CrossRef CAS PubMed.
D.-Y. Wang, R. Liu, W. Guo, G. Li and Y. Fu, Recent advances of organometallic complexes for rechargeable batteries, Coord. Chem. Rev., 2021, 429, 213650 CrossRef CAS.
J. W. Steed, Coordination and organometallic compounds as anion receptors and sensors, Chem. Soc. Rev., 2009, 38(2), 506–519 RSC.
M. D. Jones, R. Raja, J. M. Thomas, B. F. G. Johnson, D. W. Lewis, J. Rouzaud and K. D. M. Harris, Enhancing the enantioselectivity of novel homogeneous organometallic hydrogenation catalysts, Angew. Chem., Int. Ed., 2003, 42(36), 4326–4331 CrossRef CAS PubMed.
J. Pospech, I. Fleischer, R. Franke, S. Buchholz and M. Beller, Alternative metals for homogeneous catalyzed hydroformylation reactions, Angew. Chem., Int. Ed., 2013, 52(10), 2852–2872 CrossRef CAS PubMed.
T. M. Trnka and R. H. Grubbs, The development of L₂X₂RuCHR olefin metathesis catalysts: an organometallic success story, Acc. Chem. Res., 2001, 34(1), 18–29 CrossRef CAS PubMed.
Z. Xi, X. Zhang, W. Chen, S. Fu and D. Wang, Synthesis and structural characterization of nickel(II) complexes supported by pyridine-functionalized N-heterocyclic carbene ligands and their catalytic activities for Suzuki coupling, Organometallics, 2007, 26(26), 6636–6642 CrossRef CAS.
A. Kumar, G. K. Rao, S. Kumar and A. K. Singh, Formation and role of palladium chalcogenide and other species in Suzuki–Miyaura and heck C–C coupling reactions catalyzed with palladium(II) complexes of organochalcogen ligands: realities and speculations, Organometallics, 2014, 33(12), 2921–2943 CrossRef CAS.
P. Espinet and A. M. Echavarren, The mechanisms of the Stille reaction, Angew. Chem., Int. Ed., 2004, 43(36), 4704–4734 CrossRef CAS PubMed.
J. G. Freeze, H. R. Kelly and V. S. Batista, Search for catalysts by inverse design: artificial intelligence, mountain climbers, and alchemists, Chem. Rev., 2019, 119(11), 6595–6612 CrossRef CAS PubMed.
M. Foscato and V. R. Jensen, Automated in silico design of homogeneous catalysts, ACS Catal., 2020, 10(3), 2354–2377 CrossRef CAS.
B. Sanchez-Lengeling and A. Aspuru-Guzik, Inverse molecular design using machine learning: Generative models for matter engineering, Science, 2018, 361(6400), 360–365 CrossRef CAS PubMed.
C. Duan, A. Nandy and H. J. Kulik, Machine learning for the discovery, design, and engineering of materials, Annu. Rev. Chem. Biomol. Eng., 2022, 13(1), 405–429 CrossRef CAS PubMed.
A. Nandy, C. Duan, M. G. Taylor, F. Liu, A. H. Steeves and H. J. Kulik, Computational discovery of transition-metal complexes: from high-throughput screening to machine learning, Chem. Rev., 2021, 121(16), 9927–10000 CrossRef CAS PubMed.
G. dos P. Gomes, R. Pollice and A. Aspuru-Guzik, Navigating through the maze of homogeneous catalyst design with machine learning, Trends Chem., 2021, 3(2), 96–110 CrossRef.
J. Hachmann, R. Olivares-Amaya, S. Atahan-Evrenk, C. Amador-Bedolla, R. S. Sánchez-Carrera, A. Gold-Parker, L. Vogt, A. M. Brockway and A. Aspuru-Guzik, The harvard clean energy project: large-scale computational screening and design of organic photovoltaics on the world community grid, J. Phys. Chem. Lett., 2011, 2(17), 2241–2251 CrossRef CAS.
L. Chanussot, A. Das, S. Goyal, T. Lavril, M. Shuaibi, M. Riviere, K. Tran, J. Heras-Domingo, C. Ho and W. Hu, et al., Open catalyst 2020 (OC20) dataset and community challenges, ACS Catal., 2021, 11(10), 6059–6072 CrossRef CAS.
J. J. Irwin and B. K. Shoichet, Zinc – a free database of commercially available compounds for virtual screening, J. Chem. Inf. Model., 2005, 45(1), 177–182 CrossRef CAS PubMed.
M. Nakata and T. Shimazaki, PubChemQC project: a large-scale first-principles electronic structure database for data-driven chemistry, J. Chem. Inf. Model., 2017, 57(6), 1300–1308 CrossRef CAS PubMed.
D. Balcells and B. B. Skjelstad, tmQM dataset—quantum geometries and properties of 86k transition metal complexes, J. Chem. Inf. Model., 2020, 60(12), 6135–6146 CrossRef CAS PubMed.
T. Gensch, G. dos P. Gomes, P. Friederich, E. Peters, T. Gaudin, R. Pollice, K. Jorner, A. K. Nigam, M. Lindner-D’Addario and M. S. Sigman, et al., A comprehensive discovery platform for organophosphorus ligands for catalysis, J. Am. Chem. Soc., 2022, 144(3), 1205–1217 CrossRef CAS PubMed.
C. R. Groom, I. J. Bruno, M. P. Lightfoot and S. C. Ward, The cambridge structural database, Acta Crystallogr., Sect. B: Struct. Sci., Cryst. Eng. Mater., 2016, 72(2), 171–179 CrossRef CAS PubMed.
B. Meyer, B. Sawatlon, S. Heinen, O. A. Von Lilienfeld and C. Corminboeuf, Machine learning meets volcano plots: computational discovery of cross-coupling catalysts, Chem. Sci., 2018, 9(35), 7069–7077 RSC.
P. Friederich, G. dos P. Gomes, R. De Bin, A. Aspuru-Guzik and D. Balcells, Machine learning dihydrogen activation in the chemical space surrounding Vaska's complex, Chem. Sci., 2020, 11(18), 4584–4601 RSC.
A. V. Kalikadien, E. A. Pidko and V. Sinha, ChemSpaX: exploration of chemical space by automated functionalization of molecular scaffold, Digital Discovery, 2022, 1(1), 8–25 RSC.
D. M. Anstine and O. Isayev, Generative models as an emerging paradigm in the chemical sciences, J. Am. Chem. Soc., 2023, 145(16), 8736–8750 CrossRef CAS PubMed.
S. Kang and K. Cho, Conditional molecular design with deep generative models, J. Chem. Inf. Model., 2018, 59(1), 43–52 CrossRef PubMed.
W. Gao and C. W. Coley, The synthesizability of molecules proposed by generative models, J. Chem. Inf. Model., 2020, 60(12), 5714–5723 CrossRef CAS PubMed.
C. Bilodeau, W. Jin, T. Jaakkola, R. Barzilay and K. F. Jensen, Generative models for molecular discovery: Recent advances and challenges, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2022, 12(5), e1608 Search PubMed.
M. Krenn, F. Häse, A. K. Nigam, P. Friederich and A. Aspuru-Guzik, Self-referencing embedded strings (selfies): A 100% robust molecular string representation, Mach. Learn.: Sci. Technol., 2020, 1(4), 045024 Search PubMed.
D. Weininger, Smiles, a chemical language and information system. 1. introduction to methodology and encoding rules, J. Chem. Inf. Comput. Sci., 1988, 28(1), 31–36 CrossRef CAS.
G. Lima Guimaraes, B. Sanchez-Lengeling, C. Outeiral, P. L. C. Farias and A. Aspuru-Guzik, Objective-reinforced generative adversarial networks (organ) for sequence generation models, arXiv, 2017, preprint, arXiv:1705.10843, DOI:10.48550/arXiv.1705.10843.
N. De Cao and T. Kipf, MolGAN: An implicit generative model for small molecular graphs, arXiv, 2018, preprint arXiv:1805.11973, DOI:10.48550/arXiv.1805.11973.
J. You, R. Ying, X. Ren, W. Hamilton, and J. Leskovec, GraphRNN: Generating realistic graphs with deep auto-regressive models, in International Conference on Machine Learning, PMLR, 2018, pp. 5708–5717 Search PubMed.
C. Shi, M. Xu, Z. Zhu, W. Zhang, M. Zhang and J. Tang, GraphAF: a flow-based autoregressive model for molecular graph generation, in International Conference on Learning Representations, 2020 Search PubMed.
M. J. Kusner, B. Paige and J. Miguel Hernández-Lobato, Grammar variational autoencoder, in International Conference on Machine Learning, PMLR, 2017, pp. 1945–1954 Search PubMed.
W. Jin, R. Barzilay and T. Jaakkola, Junction tree variational autoencoder for molecular graph generation, in International Conference on Machine Learning, PMLR, 2018, pp. 2323–2332 Search PubMed.
R. Gómez-Bombarelli, J. N. Wei, D. Duvenaud, J. M. Hernández-Lobato, B. Sánchez-Lengeling, D. Sheberla, J. Aguilera-Iparraguirre, T. D. Hirzel, R. P. Adams and A. Aspuru-Guzik, Automatic chemical design using a data-driven continuous representation of molecules, ACS Cent. Sci., 2018, 4(2), 268–276 CrossRef PubMed.
O. Schilter, A. Vaucher, P. Schwaller and T. Laino, Designing catalysts with deep generative models and computational data. A case study for Suzuki cross coupling reactions, Digital Discovery, 2023, 2(3), 728–735 RSC.
M. Olivecrona, T. Blaschke, O. Engkvist and H. Chen, Molecular de novo design through deep reinforcement learning, J. Cheminf., 2017, 9, 1–14 Search PubMed.
J. You, B. Liu, Z. Ying, V. Pande and J. Leskovec, Graph convolutional policy network for goal-directed molecular graph generation, Proceedings of the 32nd International Conference on Neural Information Processing Systems, 2018, pp. 6412–6422 Search PubMed.
E. Bengio, M. Jain, M. Korablyov, D. Precup and Y. Bengio, Flow network based generative models for non-iterative diverse candidate generation, Adv. Neural Inf. Process. Syst., 2021, 34, 27381–27394 Search PubMed.
J. H. Jensen, A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space, Chem. Sci., 2019, 10(12), 3567–3572 RSC.
A. K. Nigam, R. Pollice, M. Krenn, G. dos P. Gomes and A. Aspuru-Guzik, Beyond generative models: superfast traversal, optimization, novelty, exploration and discovery (stoned) algorithm for molecules using selfies, Chem. Sci., 2021, 12(20), 7079–7090 RSC.
E. Hoogeboom, V. Garcia Satorras, C. Vignac and M. Welling, Equivariant diffusion for molecule generation in 3D, in International Conference on Machine Learning, PMLR, 2022, pp. 8867–8887 Search PubMed.
N. Gebauer, M. Gastegger and K. Schütt, Symmetry-adapted generation of 3d point sets for the targeted discovery of molecules, Adv. Neural Inf. Process. Syst., 2019, 32, 7566–7578 Search PubMed.
M. Ragoza, T. Masuda and D. R. Koes, Generating 3d molecules conditional on receptor binding sites with deep generative models, Chem. Sci., 2022, 13(9), 2701–2713 RSC.
G. Landrum, et al., RDKit: A software suite for cheminformatics, computational chemistry, and predictive modeling, ed. G. Landrum, 2013, vol. 8, p. , p. 31 Search PubMed.
E. I. Ioannidis, T. Z. H. Gani and H. J. Kulik, molSimplify: A toolkit for automating discovery in inorganic chemistry, 2016 Search PubMed.
M. G. Taylor, D. J. Burrill, J. Janssen, E. R. Batista, D. Perez and P. Yang, Architector for high-throughput cross-periodic table 3d complex building, Nat. Commun., 2023, 14(1), 2786 CrossRef CAS PubMed.
M. Foscato, G. Occhipinti, V. Venkatraman, B. K. Alsberg and V. R. Jensen, Automated design of realistic organometallic molecules from fragments, J. Chem. Inf. Model., 2014, 54(3), 767–780 CrossRef CAS PubMed.
N. W. A. Gebauer, M. Gastegger, S. S. P. Hessmann, K.-R. Müller and K. T. Schütt, Inverse design of 3d molecular structures with conditional generative neural networks, Nat. Commun., 2022, 13(1), 973 CrossRef CAS PubMed.
J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan and S. Ganguli, Deep unsupervised learning using nonequilibrium thermodynamics, in International Conference on Machine Learning, PMLR, 2015, pp. 2256–2265 Search PubMed.
J. Ho, A. Jain and P. Abbeel, Denoising diffusion probabilistic models, Adv. Neural Inf. Process. Syst., 2020, 33, 6840–6851 Search PubMed.
O. T. Unke, S. Chmiela, H. E. Sauceda, M. Gastegger, I. Poltavsky, K. T. Schuett, A. Tkatchenko and K.-R. Mueller, Machine learning force fields, Chem. Rev., 2021, 121(16), 10142–10186 CrossRef CAS PubMed.
M. Xu, L. Yu, Y. Song, C. Shi, S. Ermon and J. Tang, GeoDiff: A geometric diffusion model for molecular conformation generation, in International Conference on Learning Representations, 2021 Search PubMed.
B. Jing, G. Corso, J. Chang, R. Barzilay and T. Jaakkola, Torsional diffusion for molecular conformer generation, Adv. Neural Inf. Process. Syst., 2022, 35, 24240–24253 Search PubMed.
C. Duan, Y. Du, H. Jia and H. J. Kulik, Accurate transition state generation with an object-aware equivariant elementary reaction diffusion model, Nat. Comput. Sci., 2023, 3(12), 1045–1055 CrossRef PubMed.
S. Kim, J. Woo and W. Y. Kim, Diffusion-based generative AI for exploring transition states from 2D molecular graphs, Nat. Commun., 2024, 15(1), 341 CrossRef CAS PubMed.
I. Igashov, H. Stärk, C. Vignac, A. Schneuing, V. Garcia Satorras, P. Frossard, M. Welling, M. Bronstein and B. Correia, Equivariant 3D-conditional diffusion model for molecular linker design, Nat. Mach. Intell., 2024, 1–11 Search PubMed.
A. Schneuing, Y. Du, C. Harris, A. Jamasb, I. Igashov, W. Du, T. Blundell, P. Lió, C. Gomes, M. Welling, et al., Structure-based drug design with equivariant diffusion models, arXiv, 2022, preprint, arXiv:2210.13695, DOI:10.48550/arXiv.2210.13695.
J. Guan, W. W. Qian, X. Peng, Y. Su, J. Peng and J. Ma. 3D equivariant diffusion for target-aware molecule generation and affinity prediction, in The Eleventh International Conference on Learning Representations, 2022 Search PubMed.
G. Corso, B. Jing, R. Barzilay, T. Jaakkola, et al., DiffDock: Diffusion steps, twists, and turns for molecular docking, in International Conference on Learning Representations (ICLR 2023), 2023 Search PubMed.
F. Bao, M. Zhao, Z. Hao, P. Li, C. Li and J. Zhu, Equivariant energy-guided sde for inverse molecular design, in The Eleventh International Conference on Learning Representations, 2022 Search PubMed.
T. Weiss, E. Mayo Yanes, S. Chakraborty, L. Cosmo, A. M. Bronstein and R. Gershoni-Poranne, Guided diffusion for inverse molecular design, Nat. Comput. Sci., 2023, 3(10), 873–882 CrossRef PubMed.
N. N. Vlassis and W. C. Sun, Denoising diffusion algorithm for inverse design of microstructures with fine-tuned nonlinear material properties, Comput. Methods Appl. Mech. Eng., 2023, 413, 116126 CrossRef.
J. Park, A. P. S. Gill, S. M. Moosavi and J. Kim, Inverse design of porous materials: a diffusion model approach, J. Mater. Chem. A, 2024, 12, 6507–6514 RSC.
R. Laplaza, S. Gallarati and C. Corminboeuf, Genetic optimization of homogeneous catalysts, Chem.: Methods, 2022, 2(6), e202100107 CAS.
J. Seumer, J. K. S. Hansen, M. B. Nielsen and J. H. Jensen, Computational evolution of new catalysts for the Morita–Baylis–Hillman reaction, Angew. Chem., Int. Ed., 2023, 62(18), e202218565 CrossRef CAS PubMed.
M. Strandgaard, J. Seumer, B. Benediktsson, A. Bhowmik, T. Vegge and J. H. Jensen, Genetic algorithm-based re-optimization of the Schrock catalyst for dinitrogen fixation, PeerJ Phys. Chem., 2023, 5, e30 CrossRef.
J. Seumer and J. H. Jensen, Beyond predefined ligand libraries: A genetic algorithm approach for de novo discovery of catalysts for the Suzuki coupling reactions, ChemRxiv, 2024, preprint, DOI:10.26434/chemrxiv-2024-9xh38-v2.
H. Jin and K. M. Merz Jr, LigandDiff: de novo ligand design for 3D transition metal complexes with diffusion models, J. Chem. Theory Comput., 2024, 20, 4377–4384 CrossRef CAS PubMed.
F. R. J. Cornet, B. Benediktsson, B. Hastrup, A. Bhowmik and M. N. Schmidt, Inverse-design of organometallic catalysts with guided equivariant diffusion, in AI for Accelerated Materials Design-NeurIPS 2023 Workshop, 2023 Search PubMed.
D. Kingma, T. Salimans, B. Poole and J. Ho, Variational diffusion models, Adv. Neural Inf. Process. Syst., 2021, 34, 21696–21707 Search PubMed.
V. Garcia Satorras, E. Hoogeboom and M. Welling, E(n) equivariant graph neural networks, in International Conference on Machine Learning, PMLR, 2021, pp. 9323–9332 Search PubMed.
K. Schütt, O. Unke and M. Gastegger, Equivariant message passing for the prediction of tensorial properties and molecular spectra, in International Conference on Machine Learning, PMLR, 2021, pp. 9377–9388 Search PubMed.
J. Gilmer, S. S. Schoenholz, P. F. Riley, O. Vinyals and G. E. Dahl, Neural message passing for quantum chemistry, in International Conference on Machine Learning, PMLR, 2017, pp. 1263–1272 Search PubMed.
C. K. Joshi, C. Bodnar, S. V. Mathis, T. Cohen and P. Lio, On the Expressive Power of Geometric Graph Neural Networks, 2023, https://openreview.net/forum?id=Rkxj1GXn9_ Search PubMed.
P. Dhariwal and A. Nichol, Diffusion models beat gans on image synthesis, Adv. Neural Inf. Process. Syst., 2021, 34, 8780–8794 Search PubMed.
J. Song, Q. Zhang, H. Yin, M. Mardani, M.-Y. Liu, J. Kautz, Y. Chen and A. Vahdat, Loss-guided diffusion models for plug-and-play controllable generation, in International Conference on Machine Learning, PMLR, 2023, pp. 32483–32498 Search PubMed.
S. Lambert-Lacroix and L. Zwald, The adaptive BerHu penalty in robust regression, J. Nonparametric Statistics, 2016, 28(3), 487–514 CrossRef.
R. Laplaza, S. Das, M. D. Wodrich and C. Corminboeuf, Constructing and interpreting volcano plots and activity maps to navigate homogeneous catalyst landscapes, Nat. Protoc., 2022, 17(11), 2550–2569 CrossRef CAS PubMed.
M. J. Buskes and M.-J. Blanco, Impact of cross-coupling reactions in drug discovery and development, Molecules, 2020, 25(15), 3493 CrossRef CAS PubMed.
A. Marrocchi, A. Facchetti, D. Lanari, C. Petrucci and L. Vaccaro, Current methodologies for a sustainable approach to π-conjugated organic semiconductors, Energy Environ. Sci., 2016, 9(3), 763–786 RSC.
K. Jiang, L. Zhang, Y. Zhao, J. Lin and M. Chen, Palladium-catalyzed cross-coupling polymerization: A new access to cross-conjugated polymers with modifiable structure and tunable optical/conductive properties, Macromolecules, 2018, 51(23), 9662–9668 CrossRef CAS.
A. Suzuki, Cross-coupling reactions of organoboranes: an easy way to construct C–C bonds (nobel lecture), Angew. Chem., Int. Ed., 2011, 30(50), 6722–6737 CrossRef PubMed.
N. Miyaura and A. Suzuki, Palladium-catalyzed cross-coupling reactions of organoboron compounds, Chem. Rev., 1995, 95(7), 2457–2483 CrossRef CAS.
M. Busch, M. D. Wodrich and C. Corminboeuf, A generalized picture of C–C cross-coupling, ACS Catal., 2017, 7(9), 5643–5653 CrossRef CAS.
B. Sawatlon, M. D. Wodrich, B. Meyer, A. Fabrizio and C. Corminboeuf, Data mining the C–C cross-coupling genome, ChemCatChem, 2019, 11(16), 4096–4107 CrossRef CAS.
T. A. Halgren, Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94, J. Comput. Chem., 1996, 17(5–6), 490–519 CrossRef CAS.
F. Neese, The ORCA program system, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2012, 2(1), 73–78 CAS.
F. Neese, Software update: The ORCA program system—version 5.0, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2022, 12(5), e1606 Search PubMed.
A. D. Beck, Density-functional thermochemistry. III. The role of exact exchange, J. Chem. Phys., 1993, 98(7), 5648–5656 CrossRef.
A. D. Becke, Density-functional exchange-energy approximation with correct asymptotic behavior, Phys. Rev. A: At., Mol., Opt. Phys., 1988, 38(6), 3098 CrossRef CAS PubMed.
C. Lee, W. Yang and R. G. Parr, Development of the Colle–Salvetti correlation-energy formula into a functional of the electron density, Phys. Rev. B: Condens. Matter Mater. Phys., 1988, 37(2), 785 CrossRef CAS PubMed.
J. S. Binkley, J. A. Pople and W. J. Hehre, Self-consistent molecular orbital methods. 21. Small split-valence basis sets for first-row elements, J. Am. Chem. Soc., 1980, 102(3), 939–947 CrossRef CAS.
W. J. Pietro, M. M. Francl, W. J. Hehre, D. J. DeFrees, J. A. Pople and J. S. Binkley, Self-consistent molecular orbital methods. 24. supplemented small split-valence basis sets for second-row elements, J. Am. Chem. Soc., 1982, 104(19), 5039–5048 CrossRef CAS.
K. D. Dobbs and W. J. Hehre, Molecular orbital theory of the properties of inorganic and organometallic compounds 5. Extended basis sets for first-row transition metals, J. Comput. Chem., 1987, 8(6), 861–879 CrossRef CAS.
K. D. Dobbs and W. J. Hehre, Molecular orbital theory of the properties of inorganic and organometallic compounds. 6. Extended basis sets for second-row transition metals, J. Comput. Chem., 1987, 8(6), 880–893 CrossRef CAS.
F. Weigend and R. Ahlrichs, Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy, Phys. Chem. Chem. Phys., 2005, 7(18), 3297–3305 RSC.
B. P. Pritchard, D. Altarawy, B. Didier, T. D. Gibson and T. L. Windus, New basis set exchange: An open, up-to-date resource for the molecular sciences community, J. Chem. Inf. Model., 2019, 59(11), 4814–4820 CrossRef CAS PubMed.
S. Grimme, J. Antony, S. Ehrlich and H. Krieg, A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu, J. Chem. Phys., 2010, 132(15), 154104 CrossRef PubMed.
F. Neese, F. Wennmohs, A. Hansen and U. Becker, Efficient, approximate and parallel Hartree–Fock and hybrid DFT calculations. A ‘chain-of-spheres’ algorithm for the Hartree–Fock exchange, Chem. Phys., 2009, 356(1–3), 98–109 CrossRef CAS.
R. Izsák and F. Neese, An overlap fitted chain of spheres exchange method, J. Chem. Phys., 2011, 135(14), 144105 CrossRef PubMed.
G. L. Stoychev, A. A. Auer and F. Neese, Automatic generation of auxiliary basis sets, J. Chem. Theory Comput., 2017, 13(2), 554–562 CrossRef CAS PubMed.
F. Weigend, Accurate Coulomb-fitting basis sets for H to Rn, Phys. Chem. Chem. Phys., 2006, 8(9), 1057–1065 RSC.
C. Bannwarth, S. Ehlert and S. Grimme, GFN2-xTB—an accurate and broadly parametrized self-consistent tight-binding quantum chemical method with multipole electrostatics and density-dependent dispersion contributions, J. Chem. Theory Comput., 2019, 15(3), 1652–1671 CrossRef CAS PubMed.
A. Morehead and J. Cheng, Geometry-complete diffusion for 3D molecule generation, in ICLR 2023 – Machine Learning for Drug Discovery Workshop, 2023 Search PubMed.
C. Vignac, N. Osman, L. Toni and P. Frossard, MiDi: Mixed graph and 3D denoising diffusion for molecule generation, in Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Springer, 2023, pp. 560–576 Search PubMed.
T. Le, J. Cremer, F. Noe, D.-A. Clevert and K. T. Schütt, Navigating the design space of equivariant diffusion-based generative models for de novo 3D molecule generation, in The Twelfth International Conference on Learning Representations, 2024 Search PubMed.
J. Westermayr, J. Gilkes, R. Barrett and R. J. Maurer, High-throughput property-driven generative design of functional organic molecules, Nat. Comput. Sci., 2023, 3(2), 139–148 CrossRef CAS PubMed.
X. Fu, T. Xie, A. S. Rosen, T. S. Jaakkola and J. A. Smith, MOFDiff: Coarse-grained diffusion for metal-organic framework design, in The Twelfth International Conference on Learning Representations, 2024 Search PubMed.
A. M. Krieger, V. Sinha, A. V. Kalikadien and E. A. Pidko, Metal-ligand cooperative activation of HX (X = H, Br, OR) bond on Mn based pincer complexes, Z. Anorg. Allg. Chem., 2021, 647(14), 1486–1494 CrossRef CAS.

Footnotes

† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4dd00099d

‡ In the simplest case, this is simply a distribution over a single metal center – as further in this paper. The proposed framework is however not limited to such simple case. Depending on the problem under study, one could for instance sample the metal first, and then sample the coordination pattern conditioned on the metal, leading to a set

made of multiple atoms.

Click here to see how this site uses Cookies. View our privacy policy here.