Robert A.
Ives‡
ab,
William
Maturi‡
ab,
Matthew T.
Gill
a,
Conor
Rankine
*a and
Paul R.
McGonigal
*ab
aDepartment of Chemistry, University of York, Heslington, York, YO10 5DD, UK. E-mail: paul.mcgonigal@york.ac.uk
bDepartment of Chemistry, Durham University, Lower Mountjoy, Stockton Road, Durham, DH1 3LE, UK
First published on 23rd August 2024
Here, we analyze the stereodynamic properties of bullvalenes using principal moments of inertia and exit vector plots to draw comparisons with commonly used ring systems in medicinal chemistry. To aid analyses, we first classify (i) the four elementary rearrangement steps available to substituted bullvalenes, which (ii) can be described by applying positional descriptors (α, β, γ, and δ) to the substituents. We also (iii) derive an intuitive equation to calculate the number of isomers for a given bullvalene system. Using DFT-modelled structures for di-, tri-, and tetrasubstituted bullvalenes, generated using a newly developed computational tool (bullviso), we show that their 3D shapes and the exit vectors available from the bullvalene scaffold make them comparable to other bioisosteres currently used to replace planar aromatic ring systems in drug discovery. Unlike conventional ring systems, the shapeshifting valence isomerism of bullvalenes gives rise to numerous shapes and substituent relationships attainable as a concentration-independent dynamic covalent library from a single compound. We visualize this property by applying population weightings to the principal moments of inertia and exit vector analyses to reflect the relative thermodynamic stabilities of the available isomers.
Fig. 1 (a) The BV isomer barcode labelling system, top, and relative positional labels, bottom. (b) The possible exchange processes following one Cope rearrangement step, enumerated for each BV position. Full isomerization requires sequential steps that include (P = participating) and exclude (NP = non-participating) the substituent in the rearranging 1,5-hexadiene motif (shown in orange).1b (c) Illustration of the higher symmetry in the transition state for γP ⇌ γP Cope rearrangement relative to the ground state. (d) The partial isomer network of a heterodisubstituted BV showing the positional exchange arising from three sequential Cope rearrangement steps. |
In general, building blocks with rigid three-dimensional (3D) structures are key components of materials, such as metal–organic frameworks10 and other porous networks.11 In the context of medicinal chemistry research—particularly for fragment-based drug discovery12 (FBDD) libraries—it has been acknowledged that drug candidates based on flat or rod-like molecules offer limited shape diversity.13 Consequently, there is a desire for a greater number of diverse 3D fragments to be included within these libraries to cover more chemical space and, ultimately, to provide better candidates for drug development.14 One way that this objective can be achieved is by including fragments that possess diverse aliphatic ring systems, particularly as the core ring system is considered the key factor in shape diversity.13
The BV ring system has the seemingly contradictory characteristics of, on one hand, being highly dynamic through its reversible Cope rearrangements, while on the other hand, being a rigidly 3D structure. Its tricyclic hydrocarbon skeleton is a shape-persistent structure with substituents projecting outwards at well-defined angles. Therefore, designing effective materials based on BV derivatives requires understanding of their stereodynamics, i.e., their overall 3D shapes, the relative orientations that are accessible to appended substituents, and the relative energies of the isomers at equilibrium.
Here, we quantify the 3D shape diversity accessible from BVs and demonstrate that their fluxional behavior enables them to reversibly access diverse areas of chemical space. We provide a concise guide to the rearrangement processes of substituted BVs and apply computational modelling to categorize and quantify their stereodynamics. To do so, we perform population-weighted principal moment of inertia (PMI)15 and exit vector (EV)16 analyses. Much of the analysis can be automated using a new computational tool we have developed, bullviso, that generates all the isomers of a given substituted BV, and the input files needed to compute relative energy levels of each isomer. To demonstrate its utility, we apply bullviso to examine di-, tri- and tetramethyl BVs. The analysis illustrates that shapeshifting networks of BV derivatives dynamically sample many different areas of chemical space from a single starting compound. They do so by positioning their substituents at a range of angles, extending beyond those typically found in cis-disubstituted rings, while maintaining a higher degree of sphericity than most common ring systems found in biologically active molecules.
Applying the α–δ and P/NP labels, it becomes clear that there are just four elementary types of positional exchange that occur during any Cope rearrangement step, which are illustrated in Fig. 1b using a monosubstituted BV as a model. Firstly, the sole possible outcome for the α position is migration to a newly formed δ position on the NP arm. A substituent at a β position will migrate to a γ position when on a NP arm, whereas P rearrangement exchanges the β position with a δ site. Finally, a γ substituent remains at a γ position on a P arm following the Cope rearrangement on account of symmetry in the transition state (Fig. 1c). As the Cope rearrangement is reversible, the reciprocal of each of these exchange processes must also occur. Overall, therefore, the elementary types of positional exchange that govern the outcome of any BV rearrangement are (i) αNP ⇌ δNP, (ii) βNP ⇌ γNP, (iii) βP ⇌ δP, and (iv) γP ⇌ γP. These elementary types of positional exchange apply equally to every BV substituent in every isomer, regardless of the total number of substituents or their relative positions. For example, the functional groups of a heterodisubstituted BV migrate relative to one another during sequential Cope rearrangement steps, allowing them to switch from being on separate arms to occupying the same arm and back again (Fig. 1d).
(1) |
Combinatorics is used to derive the total number of ways in which these substituents can be ordered. The 1/3 multiplier adjusts for the fact that the parent BV has C3 symmetry, offsetting triple counting in the term that arises for structures related by rotation. A further correction factor, 2S, readjusts for the BV isomers that have three identically substituted arms and so are represented just once each in the term. S is the number of ways that the substituents can be arranged around the BV core to retain its C3 symmetry, or in other words, where all three arms of the BV possess identical substitution patterns.
Taking heterotrisubstituted BV 1 as a worked example (Fig. 2a), there are seven hydrogen substituents and three distinct non-hydrogen substituents (colored circles). Therefore, the product operation in the denominator of the equation is 7! × 1! × 1! × 1!. The substituents on 1 cannot be arranged in any pattern that gives C3 symmetry, hence S = 0 and applying eqn (1) gives Niso = 240.
Taking structure 2 (Fig. 2b) as a second worked example, there are six hydrogens, a substituent type that occurs three times (blue circles), and another substituent type that occurs just once (orange circle), so the product operation is 6! × 3! × 1!. However, three of the substituents of 2 are the same and thus there are three different ways in which they can be arranged such that the BV possesses C3 symmetry, which are shown in Fig. 2b. These C3-symmetric isomers arise when the unique substituent is at the α position and the three identical substituents occupy the same position on each of the three BV arms, i.e., either all β, all γ, or all δ. Therefore, S = 3 for 2 and applying eqn (1) gives Niso = 282.
There are 42 possible variations of substituted BVs having different numbers and identities of substituents. Based on all of these possible substituent patterns, a comprehensive reference table is provided in the ESI (Table S1)† where this method and eqn (1) have been applied to generate Niso and S, as well as the numbers of achiral and chiral isomers.
We first generated all possible isomers of the methyl-substituted BVs using bullviso, then optimized their geometries by performing density functional theory (DFT) calculations. The PBE0 (ref. 27) functional with Grimme's D3 dispersion correction28 and the def2-SV(P)29 basis set were deemed suitable for modelling the energetics of BV systems. Using this level of theory, we constructed isomer interconversion networks and predicted relative isomer populations for Me2BV (Fig. 3), Me3BV (Fig. S1†), and Me4BV (Fig. S2†), which have 15, 42, and 72 unique isomers, respectively.21 For each of these isomers, we calculated the three principal moments of inertia (I1, I2, and I3 in ascending order) using a KNIME30 Vernalis31 chemoinformatic protocol, then used these values to calculate normal PMI ratios, NPR1 (I1/I3) and NPR2 (I2/I3). The resulting PMI plots (Fig. 4a–g) follow the standard layout (Fig. 4) where the vertices are defined by NPR values associated with rod-like shape [NPR1 = 0, NPR2 = 1], disc-like shape [NPR1 = 0.5, NPR2 = 0.5], and spherical shape [NPR1 = 1, NPR2 = 1]. The diagrams are also labelled with representative structures for each vertex, i.e., butadiyne (rod-like), benzene (disc-like), and adamantane (sphere-like). To guide the eye, parallel lines on the PMI plot correspond to increments of 0.1 in ∑NPR values (∑NPR = NPR1 + NPR2) between the limits of 1.0 and 2.0. The points that lie furthest from the rod–disc axis, i.e., toward the top right of the diagram, are associated with greater sphericity.
Fig. 4 (a–c) PMI plots for the shapeshifting networks of (a) Me2BV, (b) Me3BV, and (c) Me4BV. Substituent positional labels are given for Me2BV. For clarity, these labels are not shown on the plots for Me3BV and Me4BV. See Tables S6 and S7† for labelled data. (d–e) Population-weighted PMI plots for the shapeshifting networks of (d) Me2BV, (e) Me3BV, and (f) Me4BV where the data points are scaled by calculated Boltzmann distributions at 298 K (PBE0-D3/def2-SV(P)). The modelled structure of the lowest-energy isomer for each BV is shown inset. (g) An overlay of the PMI plots of Me2BV (blue), Me3BV (orange), and Me4BV (purple) showing that none of the isomers have ΣNPR values close to the rod–disc axis. (h) A PMI plot for common ring systems. See Table S8† for compound identities. (i) Structural formulae and ΣNPR values for 1,4-dimethyladamantane (3), 1,2-dimethylcubane (4), and 1,4-dimethylcubane (5). |
The PMI plot of Me2BV (Fig. 4a) shows the structural diversity in the population of constitutional isomers, which fall in the range 1.44 ≤ ∑NPR ≤ 1.57. For ease of reference, isomers in Fig. 4a are labelled using the α–δ locants, however, the full isomer barcodes are also given in the ESI (Tables S2–S4).† Darker colored points indicate overlap of enantiomers on the diagram as they give identical PMI coordinates, e.g., for (±)-β,γ′-Me2BV.
As the unique isomers of a substituted BV are non-degenerate, they are present in varying concentrations at equilibrium. To visualize how this property influences which molecular shapes are most prevalent, we made PMI plots with the data points scaled by the Boltzmann distribution at 298 K. A population-weighted PMI plot (Fig. 4d) shows that the shapeshifting network of Me2BV consists predominantly (∼93%) of the four lowest-energy isomers, which include two achiral isomers, β,β′-Me2BV and γ,γ′-Me2BV, and the enantiomeric pair of (±)-β,γ′-Me2BV. The β,δ-Me2BV, (±)-β,δ′-Me2BV, and (±)-γ,δ′-Me2BV isomers are also present in ∼0.5–2% each (Table S2†), leaving ∼0.3% of the remaining six isomers combined.
The introduction of more substituents to the BV scaffold (i.e., in Me3BV and Me4BV) increases structural variety and overall sphericity (Fig. 4b and c). Several structures extend beyond ΣNPR > 1.6, reaching maxima of 1.89 and 1.90 for Me3BV and Me4BV, respectively. In both cases, the PMI distributions are markedly broader than that of Me2BV, spanning 1.3 ≤ ΣNPR ≤ 1.9, which reflects the change in shape that can occur when several substituents are located close to one another around the BV scaffold (giving rod-like shape) or are spread around the BV evenly to maintain sphericity. Like Me2BV, a subset of the Me3BV and Me4BV constitutional isomers are most prevalent in the network at equilibrium. The 14 most stable isomers of Me3BV are within ∼15 kJ mol−1 of one another, so they are each present in amounts ranging from 0.1% to 60% (Table S3†). Similarly, there are 12 isomers of Me4BV present in proportions of 0.1% to 52%.||32 The energetically preferred isomers are those with most of their substituents at β and γ positions, and in which substituents do not neighbor one another directly. Therefore, the population-weighted PMI plots (Fig. 4e and f) show that the most spherical isomers are present in higher populations.
To contextualize the PMI data of the functionalized BVs, it is useful to compare the Me2BV isomers (Fig. 4a) to a small representative library of rings prominent in pharmaceuticals (Fig. S3†).33 PMI analysis of the dimethyl derivatives of these pharmaceutical building blocks (Fig. 4h) shows the relative lack of 3D diversity in most currently used ring systems.32 The majority of compounds are close to the rod–disc axis (ΣNPR ≤ 1.3) with the notable exception of a few structures such as substituted adamantanes and cubanes 3–5 (Fig. 4i), which are increasingly popular as more spherical replacements for flat ring systems.34 The PMI ratios of the Me2BV isomers (1.44 ≤ ∑NPR ≤ 1.57) and 3–5 (1.40 ≤ ∑NPR ≤ 1.53) are nearly identical, suggesting that BVs could be similarly useful as 3D scaffolds in medicinal chemistry. BVs have the additional, unique property of spontaneously sampling different structures through their Cope rearrangements and are now readily accessible by short synthetic routes.1c,2
Fig. 5 (a) The vectors v1 and v2 for two substituent attachment points on a BV (shown for (αS)-γ,δ′-Me2BV) which are defined by (b) the geometric parameters r, φ1, φ2, and θ. |
We constructed EV plots (Fig. 6) for the C–Me bonds of the methyl-substituted BVs to quantify the stereodynamics of the BV scaffold, i.e., the relative orientations and spacings of its substituents. Typically, EV plots span θ values of 0° to 180°, where all dihedral angles are defined as being positive. For BV isomer networks, however, it is also beneficial to extend the EV plots to include negative values of θ, allowing pairs of enantiomers that are present in the dynamic equilibrium to be shown on the same diagram (e.g., (±)-β,γ′-Me2BV, Fig. 6d), as well as revealing enantiotopic relationships between substituents at equivalent positions on different arms (e.g., the β′ and β′′ positions of β,β′,β′′,δ-Me4BV, Fig. 6f). As the cage-like structure of BV imposes dihedral angles between −60° and +60°, plotting θ from −90° to +90° (Fig. 6a–f) gives an informative representation of the data.
Fig. 6 (a–c) Distance–dihedral angle EV plots and (d–i) Boltzmann population-weighted distance–dihedral angle EV plots (298 K, PBE0-D3/def2-SV(P)) for the isomers of (a, d and g) Me2BV, (b, e and h) Me3BV, and (c, f and i) Me4BV. (j) Overlaid distance versus dihedral angle EV plot of all three methyl-substituted BVs. (g–j) Plots include characteristic areas of EV plots in grey that correspond to those found in disubstituted cycloalkanes,16 a = cis-1,2-disubstituted cyclopropanes, b = cis-1,3-disubstituted aliphatic rings and cis-1,4-disubstituted cyclohexanes, c = trans-1,4-disubstituted cyclohexanes, d = trans-1,3-disubstituted cyclopentanes and cyclohexanes. |
An EV plot of the Me2BV isomers reveals C–Me dihedral angles clustered in two regions of either θ ∼ 0° or θ ∼ ±30°. The points in the former region span distances of r ∼ 1.3–3.1 Å (Fig. 6a). The majority of coplanar EVs (θ ∼ 0°) arise from isomers that are either functionalized (i) at two different positions on the same arm of the BV or (ii) at the same type of position on BV on different arms, e.g., β,β′-Me2BV, accounting for nine of the 15 possible substitution patterns. The four points at θ ∼ ±30° correspond to two of the three enantiomer pairs of the isomers with differently substituted arms ((±)-β,γ′-Me2BV and (±)-β,δ′-Me2BV). The final enantiomer pair, (±)-γ,δ′-Me2BV, has coplanar EVs of θ = ±0.5°. Boltzmann population-weighted EV analysis of Me2BV (Fig. 6d) reveals that the most populated isomers β,β′-Me2BV (r = 2.5 Å, θ = 0°, p = 53%) and (±)-β,γ′-Me2BV (r = 3.1 Å, θ = 30°, p = 17% for each enantiomer) exhibit substantial changes in the dihedral angles between the C–Me EVs.
Standard EV plots of Me2BV, Me3BV, and Me4BV spanning θ values of 0° to 180° (Fig. 6g–j) are also shown to aid comparison to the four regions, a–d, determined by Grygorenko et al. that are characteristic of EVs found in common disubstituted cycloalkanes.16 Note that these four regions were determined by plotting the EVs found for ∼2900 cycloalkanes in the Cambridge Structural Database, so they represent the span of angles that are obtained using a variety of functional groups. The majority of data points for the BV isomers fall within the a or b region. EVs in region a are characteristic of cis-1,2-disubstituted cyclopropyl compounds while region b is associated with cis-1,3-disubstituted and cis-1,4-disubstituted aliphatic rings.16 The presence of EVs in these regions for BV is expected, therefore, as the structure of BV contains these motifs.
The diversity of EVs arising from the shapeshifting isomerization becomes even more apparent when considering the plane angles subtended by the C–Me EVs, φ1 and φ2 (Fig. 5), of the Me2BV isomers. There is a spread of possible plane angles spanning from 15° to 60° found in the most energetically accessible isomers (Fig. 7), extending to 67° for higher-energy isomers (Fig. S4†). Therefore, sequential Cope rearrangements between isomers significantly alter the relative orientations of substituents in space, granting BV its unique stereodynamic properties.
Fig. 7 The plane angles subtended by C–Me EVs in the nine lowest energy isomers of Me2BV. Data points are scaled according to the Boltzmann population at 298 K (PBE0-D3/def2-SV(P)). |
As the cage-like geometry of the BV scaffold is well defined and insensitive to the addition of more substituents, Me3BV and Me4BV would be expected to exhibit similar EVs between pairs of their C–Me bonds as those found for Me2BV. This generalization applies when considering the plane angles (Fig. S7–S14†) and distances between substituents. It is also the case for the dihedral angles, but only when considering the most energetically favorable isomers (Fig. 6e and f).
A wider spread of dihedral angles (Fig. 6b and c) that extends beyond region b to angles between −60° and +60° is apparent when the complete isomer networks of Me3BV and Me4BV are taken into account (Fig. 6j), including isomers that lie at higher energy. Significant deviations from the 0° and ±30° dihedral angles arise to minimize unfavorable steric interactions when substituents are close to one another in space. For the di-, tri-, and tetrasubstituted BVs investigated here, the isomers that bear methyl groups at neighboring positions are relatively high in energy, so are not very prevalent. The most significant (albeit still low) populations of such isomers are present for Me4BV because of the increased likelihood of substituents being close to one another in the tetrasubstituted system, such as the ∼0.1% of β,β′,γ,δ′-Me4BV, which has θ = 51° between its β and δ′ methyl groups. But more highly substituted BVs, or those bearing bulkier substituents, will likely have larger isomer populations with varied dihedral angles.
Of course, shape differentiation between the orientations of functional groups attached to BV diminishes if conformationally flexible linking groups are used, so elaborating BVs with short rigid groups2c,8b,35 or fused rings36 may be advantageous. Each of the molecular shapes described by the PMI (Fig. 4a–c) and EV plots (Fig. 6a–c and S4–S14†) correspond to accessible states for the methyl-substituted BVs, demonstrating that a single BV derivative can cover a significant region of chemical space on its own. Heterosubstituted systems with varied functional groups possess still greater structural diversity. Although realizing the full extent of this shape diversity in some BVs may require higher-energy isomers to be invoked, it is important to note that, in the contexts of drug discovery and materials chemistry, noncovalent bonding interactions with biomolecular targets, confinement effects, or forces imposed by the surrounding medium may compensate for the moderately low energy differences between isomers to amplify certain BV structures within the shapeshifting network.2d,4,6,37 Therefore, isomers that have otherwise low populations should not be dismissed, as they may become more significant components of the network in the right environment.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4sc03700f |
‡ These authors contributed equally. |
§ The use of Greek letter locants to label relative positions and prime symbols to label identical rings in multiple ring systems follows IUPAC conventions.18 |
¶ A Fortran code to generate input geometries of BV isomers is mentioned in ref. 2a but it has not been made publicly available. |
|| A consequence of there being more possible isomers in the networks of highly substituted BVs is that, statistically, any individual isomer is expected to be present in lower concentrations. However, certain BV systems, e.g., hexasubstituted BVs, are biased toward a relatively small number of isomers, which act as energetic sinks that also slow the rate of exchange.32 |
This journal is © The Royal Society of Chemistry 2024 |