Topological landscapes of porous organic cages

Valentina Santolini; Marcin Miklitz; Enrico Berardo; Kim E. Jelfs

doi:10.1039/C7NR00703E

View PDF VersionPrevious ArticleNext Article

Open Access Article

This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

DOI: 10.1039/C7NR00703E (Paper) Nanoscale, 2017, 9, 5280-5298

Topological landscapes of porous organic cages†

Valentina Santolini , Marcin Miklitz , Enrico Berardo and Kim E. Jelfs *
Department of Chemistry, Imperial College London, South Kensington, London, SW7 2AZ, UK. E-mail: k.jelfs@imperial.ac.uk; www.twitter.com/JelfsChem; Tel: +44 (0)20759 43438

Received 30th January 2017 , Accepted 10th March 2017

First published on 16th March 2017

Abstract

We define a nomenclature for the classification of porous organic cage molecules, enumerating the 20 most probable topologies, 12 of which have been synthetically realised to date. We then discuss the computational challenges encountered when trying to predict the most likely topological outcomes from dynamic covalent chemistry (DCC) reactions of organic building blocks. This allows us to explore the extent to which comparing the internal energies of possible reaction outcomes is successful in predicting the topology for a series of 10 different building block combinations.

Introduction

Molecular design is the perpetual ambition of chemists as they strive for more complex and unusual molecular assemblies, both for their inherent beauty and to encode more sophisticated functions. These assemblies include both network materials, such as metal–organic frameworks (MOFs), and discrete molecular architectures that are of the nanometre scale. For network materials, the enumeration of possible framework topologies^1–4 has allowed for the targeting of topologies through the design of ligands and nodes that meet the necessary geometric requirements. This “reticular design” has assisted in the synthetic realisation of >91 different MOF topologies,⁵ from a potential of >2500.⁴ However, although reticular design has achieved many notable successes, such as the series of IRMOFs with a cubic-based topology of increasing ligand length,⁶ too often the topological outcome of a synthesis remains a matter of serendipity, rather than successful a priori design.

For finite molecular hosts, a wide range of different motifs have been realised, including rings, cycles, knots, polyhedra and catenanes.^7–9 Here we focus upon molecular cage compounds, defined by IUPAC as polycyclic compounds with the shape of a cage. These have the potential to host other molecules inside their internal cavity, but, unlike macrocycles, also have three dimensional structures with multiple possible entry and exit routes through molecular windows. Organic molecular cages have been of increasing research interest in recent years,^9–13 although they have a longer history, including a range of cryptands,¹⁴ carcerands,^15,16 and capsule-like molecules. Potential applications for these molecular hosts include encapsulation,¹⁷ catalysis,¹⁸ molecular separations of organic molecules^19,20 or gases,^21,22 molecular sensing,^23,24 molecular reaction vessels, or as porous liquids.²⁵

Whilst framework materials can be related to underlying extended network topologies, the equivalent topologies for molecular materials include, but are not limited to, polyhedra such as Platonic and Archimedean solids, see Fig. 1. Throughout this work, we use the term topology to refer to the underlying connectivity of molecular building blocks (BBs) in the molecular cage, which is unchanged upon any physical deformation. Whilst initially capsule-like topologies (with two end groups such as cavitands connected by multiple ditopic ligands to form a cavity) dominated, in the last decade a broader range of topologies have been synthetically realised. These polyhedra include tetrahedra, cubes, octahedra, square antiprisms and cuboctahedra, although capsules, tetrahedra and cubes are most commonly observed. A molecule with a single topology could adopt multiple different geometrical shapes, dependent upon factors such as the geometry of the component BBs and their conformational flexibility. For example, as shown in Fig. 2, a molecule with an underlying tetrahedral topology could have the approximate geometrical shape of a tetrahedron or an octahedron, or related intermediate shapes, dependent upon the component BBs.


	Fig. 1 Platonic, Archimedean, and Catalan polyhedra that can be related to organic cage topologies. (1) Tetrahedron, (2) trigonal prism, (3) cube, (4) octahedron, (5) dodecahedron, (6) square antiprism, (7) rhombic dodecahedron, (8) cuboctahedron, (9) rhombicuboctahedron.


	Fig. 2 An example of how two porous organic cages that both have an underlying topology of a tetrahedron adopt different geometric shapes, in one case maintaining a tetrahedron shape²⁶ (right hand side), and in the other adopting an octahedral shape (left hand side).²⁷ The underlying topologies are shown in purple and the geometric shapes in orange. Hydrogen atoms and multiple bonds are omitted for clarity throughout the text.

The growing interest in the field of organic cage molecules makes it timely to identify the potential topological possibilities for these materials and to establish a uniform classification system for them. As stated by Brunner in 1981, the “synthesis of new structures requires not only chemical skill but also some knowledge of the principal topological possibilities”.²⁸ Hay has recently discussed design principles for metal–organic polygons and polyhedra (MOPs), emphasising that referring to these molecules by their shape can lead to multiple classifications for the same underlying topology, particularly for polyhedra that are face-directed, dependent on how you relate the metals and ligands to the vertices, edges or faces.²⁹ Instead, Hay proposes a nomenclature based upon the number of vertices, edges or faces involved in the MOP and its point group. We agree with this approach, and here lay out an extension of this nomenclature for wholly organic molecules.

We then examine which topologies have been synthetically reported and choose a set of molecules from each topological family described to computationally investigate the underlying factors that influence the observed topological forms. Firstly however, in the following section we discuss the factors that can influence the topology formed for a porous organic cage.

Topological control?

Currently, the ‘design’ of a particular topology in a new porous cage system is possible for fairly rigid BBs, although is not always successful, as there can be unexpected outcomes. There are examples of BBs designed to meet the appropriate geometrical requirements for, for example, forming a molecular cube in an [8 + 12] reaction of eight corner units (with angles of ∼90°) and 12 linear struts for the edge,^30,31 or forming a cuboctahedron via molecular panelling of tritopic and tetratopic BBs.³² More commonly, once a particular polyhedral cage has been made, a family of related polyhedra can be synthesised via small variations in the functionality of one or the other of the BBs. However, even this can fail, for example we reported a change from a cyclohexane diamine to a cyclopentane diamine resulting in a larger [8 + 12] molecule, with an underlying topology of a cube, forming in place of a [4 + 6] molecule.³³ Worse still, whilst the [4 + 6] molecule formed a porous solid state material, with a BET surface area of >1300 m² g⁻¹, the [8 + 12] molecule was found to lack shape persistence in the absence of a solvent and hence to form an amorphous, non-porous material. Thus, a very subtle difference in the reaction precursors had an enormous effect on the reaction outcome and therefore properties of the molecular cage material. Fujita describes this type of phenomenon as “emergent behaviour”³⁴ and it is clear that this can also thwart attempts at the topological design of organic cage materials. Further complicating the matter, there are some reports of the kinetic trapping of products²⁶ or of solvent choice influencing the topological outcome; in an example from Liu and Warmuth, where they reacted a tetraformylcavitand with ethylene diamine in an imine condensation reaction, it was found that changing between tetrahydrofuran, chloroform and dichloromethane as the solvent could change the dominant product from a [4 + 8] cage, to a [6 + 12] octahedron to a [8 + 16] square antiprism.³⁵

As the majority of organic cages are formed by reversible dynamic covalent chemistry (DCC), the product distribution should be thermodynamically controlled, with those molecules with the lowest free energy being the dominant reaction product. Of course, this depends upon the reaction mixture being able to afford this product, rather than being kinetically trapped into alternative products, which is known to have happened with alkyne metathesis formed cages²⁶ and for 2D imine molecular ladders,³⁶ and may become more common as the number of bond formation reactions in a structure increases. Alternative products could include a range of different size oligomeric products, a polymeric material, alternative molecules with different topologies, or catenated molecules. Whilst reports of catenanes for porous organic cages are so far rare, examples from Hasell et al.³⁷ and Zhang et al.³⁸ suggest that in some cases catenanes can be the thermodynamic product, rather than their monomeric equivalent, as a result of ‘self-templating’ driven by the introduction of intermolecular interactions such as π–π stacking and alkyl–π interactions in the catenated form. In general, it is likely that, dependent on after how much time the product is isolated and characterised, different products might be observed as the mixture evolves. Whilst many organic cage molecules have been isolated in high yield (up to 100%),³⁹ there are reports of systems where a mixture of products has been isolated from a single solution,³⁵ presumably a greater driving force for a single topology being required for isolation of a single product.

We now consider factors that can influence the topological energy landscape and thus the likely topology and shape of the organic cage molecules formed for a set of BBs.

Number of reactive end groups

Although two BBs that are ditopic can only form single cycles, larger numbers of end groups can form more complex, polycyclic topologies. Examples of common BBs used in the synthesis of organic cage molecules thus far are shown in Fig. 3. For organic BBs, where high symmetry is typically sought in the BBs, at least with a C_n symmetry axis with n equal to the number of reactive end groups, there are fewer options as n increases. This distinguishes organic cage molecules from MOPs, where higher coordination numbers at metal sites may be more easily accessed.


	Fig. 3 Examples of ditopic (Di), tritopic (Tri) and tetratopic (Tet) BBs reported for the synthesis of organic cages. Starred ditopic precursors have been used for the synthesis or boronate cages, where two hydroxy groups are required for the formation of one boronate bond. Thus, we still label these precursors as ditopic rather than tetratopic.

Stoichiometry of building blocks

The component BBs need to be combined in the correct stoichiometric ratio to access the targeted topology, for example a cube with 8 tritopic vertex BBs and 12 ditopic edge BBs is formed from a 2 [thin space (1/6-em)]

3 reactant ratio. Providing an excess of one BB has been known to skew the topological landscape towards alternative products, where not all of the reactive end groups have formed covalent bonds.³²

Building block geometry and rigidity

The geometry of the BB and the arrangement and direction of the reactive end groups can play a vital role in forming a desired topology. The angle between the reactive end groups and the centre of at least one of the BBs must obviously be less than 180° for a chance of making a finite molecular, rather than periodic, structure. If one wanted to synthesise a molecule with a cubic shape and underlying cubic topology, then we might consider that we want tritopic ligands with (end group)–(centre of mass)–(end group) angles of 90° to form a corner and ditopic ligands of (end group)–(centre of mass)–(end group) angles of 180° to form the edges. Indeed, this approach has been successful for the design of the two molecular cubes reported thus far from both the Warmuth³⁰ and Beuerle³¹ groups. For MOPs, it was reported that the angles need not be perfectly matched to the desired topology, with deviations of <10° likely to be tolerated, and even larger deviations often able to be compensated by the summation of small adaptions from other BBs, although significant deviations can lead to structures with shapes more severely distorted from the base topological shape.⁴⁰ For MOPs, design rules for both the topology and shape design through complementary end group vector selection in the BBs have been proposed,^8,40 however, recent analysis of the crystal structures of MOPs by Young and Hay found that whilst sometimes this approach works, it does not do so in the majority of cases.²⁹ These failures were due to either deviation from the ideal bonding vector through conformational flexibility or, more often, due to a concerted rotation about symmetry axes in the molecule.²⁹

If the lack of rigidity in MOP assemblies frustrates the simple geometric design of BBs, then this is even more problematic for organic cages that have less strongly directional bonding. In the successful cases of geometric BB design thus far, rigidity of the underlying BBs has assisted in this design simplification. However, rigidity of the precursors is not a requirement for an organic cage assembly and with many functions of these materials, such as the host–guest response behaviour, often relying on dynamic motion of the final assembly, we conclude that it is important not to rule out more flexible BBs as potential cage components. This does however significantly increase the design challenge, and we believe necessitates the application of computational modelling in order to predict the conformations of the resultant assemblies. Of course ‘rigidity’ and ‘flexibility’ are not absolutes; whilst some flexibility and motion of a component is inevitable, excessive flexibility in the linkers is linked to lower product yields⁹ and increases the likelihood that the end assembly will not be shape persistent.

Entropy

Following the second law of thermodynamics, every spontaneous process is associated with an increase in symmetry (dynamic or static) of the original system.⁴¹ The entropy of symmetry can play a role in favouring structures with a higher symmetry, for example disfavouring linear oligomers over high symmetry structures, such as the symmetric topologies based on Platonic or Archimedean solids discussed here. The entropy of symmetry can be correlated to the symmetry number (σ)^42,43 and as previously reported by Skowronek et al.,⁴⁴ these contributions range from 3–8 kJ mol⁻¹ from a C₃ to D₃ to T_d to O_h symmetry molecule. Whilst significant in driving the formation of symmetric finite molecules, it is however unlikely that these contributions would influence which molecular topology is formed. Likely, more energetically significant is that entropy can be expected to disfavour large molecules over the alternative outcome of multiple smaller assemblies. Thus, to synthesise large architectures, a significant driving force is required.

Solvent

The non-covalent interactions between solvent molecules and cages can alter both the thermodynamics and the kinetics of the cage formation process, potentially favouring the selective synthesis of a specific product.^45,46 Poor solubility of reaction intermediates could result in low-yielding synthesis or precipitation during the assembly process.⁴⁷ Warmuth and co-workers propose that the major contribution from solvent effects is played by the improper solvation of flexible BBs.³⁵ Solvent molecules enclosed in a cavity tend to interact with the capsule's internal surface for a prolonged time (e.g. from 10⁻³ to 10³ s), having enough time to establish a strong interaction with the BBs.⁴⁵ During this process, solvent molecules located in the inner cavity and in the large windows will favour specific geometrical conformations of the BBs, preferentially driving the assembly towards a specific topology. The complex role of solvent effects on cage formation can therefore be summarised as depending on three main factors: (a) the entropic cost associated with molecular reorganisation by collecting the n BBs from the solvent phase; (b) the energetics of BB–solvent interactions (e.g. hydrophobic effects) and (c) the amount of strain felt by the BB within the final assembled system. Due to the complex and very variable character of solvent effects, these interactions are very hard to quantify and predict. In the work of Warmuth and co-workers, the differences in solvent effects estimated to explain the yields observed for different imine cage topologies in different solvents amounted to around ∼17 kJ mol⁻¹ per imine bond.

Synthesis method

The majority of organic cage molecules have been synthesised with batch chemistry at high dilution and slow addition, however recently cage molecules have also been synthesised through a continuous flow methodology.^48,49 With the flow methodology, very short reaction times with less solvent were achieved, but with high purity and yield. These approaches and novel synthesis methods such as mechanochemistry⁵⁰ may also provide alternative synthesis routes to different topologies because they influence the kinetics of the reaction and the reaction pathways.

Other synthesis variables

These variables include the reaction temperature, concentration, rate of mixing, order of addition of precursors, use of catalysts, pH and template molecules.

Enumeration of organic cage topologies

In our enumeration of possible organic cage topologies, we restrict ourselves to the following criteria: (i) the topology must be a polycyclic cage molecule, with the potential for a 3D internal cavity, excluding macrocycles such as cyclodextrins, crown ethers, cucurbiturils and 2D topologies that are polygons such as triangles, squares, pentagons etc.; (ii) it must be possible to form the topology from a two-component reaction of BBs through DCC; (iii) combinations of components that are ditopic, tritopic and tetratopic only; (iv) we presume that the BBs have a C_n symmetry axis with n equal to the number of reactive end groups and (v) for topologies with larger numbers of BBs, we include only higher symmetry structures that are relatable to Platonic or Archimedean solids that have been reported for MOPs, such that they are more feasible targets for organic cage molecules.

These criteria, whilst not producing an exhaustive set of possible topologies, result in 20 topologies that are the most plausible organic cage structures and indeed, almost two-thirds have already been synthetically reported. Recently, mathematicians have developed an algorithm for generating maps of ‘stable planar cages’, where they did not restrict themselves to 2-component systems and generated >400 [thin space (1/6-em)] 000 unique maps.⁵¹ The majority of these maps are too complex to be feasible with organic chemistry and thus our limited number of plausible topologies is more practical. However, the topologies we discuss can still be used to describe the underlying connectivity in a multicomponent system and indeed, multiple components may be an attractive design approach for the synthesis of some of the lower symmetry topologies. Several examples of multiple component systems,^52–55 including those with orthogonal DCC formation reactions, for example both boronate and imine formation,^50,56 have been reported.

For our discussion of organic cage topologies, we introduce a new nomenclature, labelling each structure as:

X^m_pYⁿ

where X and Y are the two different component BBs that constitute the cage. X and Y are labelled Di if they are ditopic, Tri if they are tritopic and Tet if they are tetratopic. The first BB, X, has the highest number of reactive end groups and if the underlying topology relates to a polyhedron, it will lie at its vertices. The second BB, Y, can have a number of reactive end groups less than or equal to X. Where the number of reactive end groups for X and Y is equal, which is denoted as X and which is denoted as Y is arbitrary. The superscripts m and n denote the number of each BB incorporated into the topology for X and Y respectively. The majority of the time, X-type BBs are connected to other X-type BBs through only one Y-type BB; in this case no subscript p will be given. However, if two X-type BBs are directly connected through links with two distinct Y-type BBs, then they have p = 2. The subscript p thus gives the number of double connections between BB pairs within a topology. These multiple links are also reported for periodic nets² and result in topologies with multiple ring sizes. For some of the smaller topologies with only two X-type BBs, there is triple or quadruple linking of these BBs; in this case no subscript is given as there is no alternative connectivity for that topology. We finally suggest using the prefix c- in the case of catenated cages, equivalent to its usage as a prefix with interpenetrated periodic nets.⁴

We will now discuss each of the topologies within the four families that consist of combinations of different connectivity BBs. The topologies and their features are summarised in Table 1 and experimentally reported topologies in Table 2.

Table 1 Table of possible organic cage topologies and their key features. Ring sizes are defined as a^b, where a is the number of individual BBs that form part of the ring and b is the number of rings of size a

Topology	Edge-directed form	Face-directed form	Solid type	Multiple connections	Point group in high symmetry form	Ring sizes
Tri ² Di ³				1 triple	D _3h	4³
Tri ⁴ Di ⁶	Tetrahedron		Platonic		T _d	6⁴
Tri ⁴₂ Di ⁶				2 double	D _2h	4², 8²
Tri ⁶ Di ⁹	Triangular prism				D _3h	6², 8³
Tri ⁸ Di ¹²	Cube		Platonic		O _h	8⁶
Tri ²⁰ Di ³⁰	Dodecahedron		Platonic		I _h	10¹²

Tet ² Di ⁴				1 quadruple	D _4h	4⁴
Tet ³₃ Di ⁶				3 double	D _3h	4³, 6²
Tet ⁴₂ Di ⁸				2 double	C _2v	4², 6⁴
Tet ⁴₄ Di ⁸				4 double	D _4h	4⁴, 8²
Tet ⁵₅ Di ¹⁰				5 double	D _5h	4⁵, 10²
Tet ⁶ Di ¹²	Octahedron		Platonic		O _h	6⁸
Tet ⁸ Di ¹⁶	Square antiprism				D _4d	6⁸, 8²
Tet ¹⁶ Di ³²	Cuboctahedron		Archimedean		O _h	6⁸, 8⁶
Tet ²⁴ Di ⁴⁸	Rhombicuboctahedron		Archimedean		O _h	6⁸, 8¹⁸

Tri ¹ Tri ¹					C _3v	2³
Tri ²₂ Tri ²				2 double	D _2h	2², 4²
Tri ³₂ Tri ³				2 double	C ₁	2², 4⁴
Tri ⁴ Tri ⁴		Tetrahedron	Platonic		T _d	4⁶
Tet ⁶ Tri ⁸		Rhombic-dodecahedron	Catalan		O _h	4¹²

Table 2 Experimentally reported organic cage topologies, synthesised via two-component DCC reactions. Di stands for ditopic precursors, Tri for tritopic, Tet for tetratopic

Topology	Reported?	CSD reference codes for crystal structures of archetypal examples
a This topology has been reported for a single component alkene metathesis reaction.⁶⁹ b This topology has been reported with a bifunctional ditopic ligand and a cyclisation reaction.
Tri ² Di ³	Yes	ZUYPUG,⁵⁷ AJOHUD,⁵⁸ SATJAA⁵⁹
Tri ⁴ Di ⁶	Yes	PUDXES,²⁷ TOVWUY,⁶⁰ EKUKUR⁶¹
Tri ⁴₂ Di ⁶	No^a	—
Tri ⁶ Di ⁹	No	—
Tri ⁸ Di ¹²	Yes	KATJAS,³³ REYMAL,⁴⁴ ZIRCIO⁶²
Tri ²⁰ Di ³⁰	No	—
Tet ² Di ⁴	Yes	LUXVAB⁶³
Tet ³₃ Di ⁶	Yes	VILCEZ⁶⁴^b
Tet ⁴₂ Di ⁸	Yes	No reported crystal structures³⁵
Tet ⁴₄ Di ⁸	Yes	AVAFIN⁶⁵
Tet ⁵₅ Di ¹⁰	No	—
Tet ⁶ Di ¹²	Yes	No reported crystal structures³⁵
Tet ⁸ Di ¹⁶	Yes	No reported crystal structures³⁵
Tet ¹⁶ Di ³²	No
Tet ²⁴ Di ⁴⁸	No
Tri ¹ Tri ¹	Yes	No reported crystal structures⁶⁶
Tri ²₂ Tri ²	No	—
Tri ³₂ Tri ³	No	—
Tri ⁴ Tri ⁴	Yes	VOFROZ⁶⁷
Tet ⁶ Tri ⁸	Yes	QUFYIB, QUFYOH⁶⁸

Tritopic + ditopic topology family

Tritopic and ditopic BBs can be combined in a 2 [thin space (1/6-em)]

3 ratio to form molecular cages. The smallest topology for this family is the capsular Tri²Di³ topology formed from a [2 + 3] reaction of two tritopic BBs linked to three ditopic BBs, as shown in Fig. 4, where all the topologies of this family are reported. There are multiple synthetic reports of this topology; example references, including reference codes for structures in the Cambridge Structural Database (CSD) are given in Table 2, these cages have reported functions in gas separation⁷⁰ and sensing.²⁴ This structure can have an intrinsic, shape-persistent cavity if the BBs are relatively rigid and has three cycles which form windows to access the central cavity; all these have a ring size of 4 (where 4 is the number of BBs that constitute the ring).


	Fig. 4 The tritopic + ditopic topology family. Tritopic vertices are in blue, ditopic linkers in purple.

The Tri⁴Di⁶ topology is formed from a [4 + 6] reaction and is related to a tetrahedron via an edge-directed assembly, with the tritopic BB placed on each of the vertices of the tetrahedron and the ditopic BB along the edges. The topology has four potential access windows and all these have a ring size of 6. Again, this is a very commonly reported topology in the literature, including systems from Cooper and co-workers²⁷ for molecular separations^19,20,22 and as porous liquids,²⁵ and from Mastalerz and co-workers for sensing.²³

This topology has also been reported for CC1 (Covalent Cage 1), CC2, and CC4 molecules in a triply-interlocked catenated form, which we label c-Tri⁴Di⁶.³⁷ A molecule with a Tri⁴Di⁶ topology, if shape-persistent, can adopt final shapes ranging anywhere between a tetrahedron at one extreme to an octahedron at the opposite extreme. Reported examples of the Tri⁴Di⁶ topology molecules with these shapes are also shown in Fig. 2. For a tetrahedral shape to form, the tritopic BB could adopt a conformation to act as a capping vertex, with (end group)–(centre of mass)–(end group) angles of ∼60°, and the ditopic BB being linear. For an octahedral shape, the tritopic BB could be planar with (end group)–(centre of mass)–(end group) angles of ∼120° and the ditopic BB bent with an angle of ∼60°. With consideration of conformational flexibility in the BBs of organic cages, these geometric requirements are not essential, as compensation in each of the BBs could still achieve a given shape. The transformation of a tetrahedral shape to an octahedral shape occurs through a process of truncation of the tetrahedron's vertices. In Fig. 6, we show this transformation alongside a series of hypothetical porous organic cages of the Tri⁴Di⁶ topology, whose changing BBs show one way in which this range of shapes could be accessed. From an initial molecule with both the shape and topology of a tetrahedron, one can cause “flattening” of the corners of the molecule by swapping the starting tritopic BB with a planar tritopic precursor. Then, increasing the size of the planar tritopic BBs can lead to geometric shapes with higher degrees of truncation. Finally, an octahedral geometric shape is obtained by increasing the “bending” of the linear ditopic precursor. The key point is that all these molecules have a tetrahedral topology, despite adopting different geometrical shapes (refer to Fig. 2).


	Fig. 5 The tetratopic + ditopic topology family. Tetratopic vertices are in orange, ditopic linkers in purple.


	Fig. 6 Top row: The range of geometric shapes potentially accessible with a Tri⁴Di⁶ topology, from tetrahedron (left) to octahedron (right). The evolution of the shape is expressed through successive truncations of the vertices of the tetrahedron. The octahedron has been rotated into the page relative to the other shapes to give a clearer perspective. Bottom row: Stages of truncation reproduced in hypothetical porous organic cages. The initial cage with a tetrahedral shape is shown in grey, precursors that lead to increasing truncation highlighted in teal, and the final (completely truncated) cage with an octahedral shape is in teal. The cage molecules are not shown to scale.

The Tri⁴₂Di⁶ topology is also formed from a [4 + 6] reaction, but it contains two doubly connected tritopic BBs as well as two singularly connected BBs. The double connections result in two distinct window sizes, two rings of size 4 and two rings of size 8. This topology has not yet been reported for a DCC reaction, however, in 2014 Wang et al. reported the synthesis of this topology via a one-component alkene metathesis reaction.⁶⁹ The component BB has a C₃ symmetry axis and therefore reduced symmetry in the BB need not necessarily be employed to reach this topology if BB flexibility can account for this. This topology should be achievable through two-component DCC reactions.

The Tri⁶Di⁹ topology is formed from a [6 + 9] reaction and is related to a triangular prism via an edge-directed assembly and has five potential access windows, two with ring size 6 and three with ring size 8. We do not believe this topology has been synthetically reported, it will require tritopic BBs that have two (end group)–(centre of mass)–(end group) angles of 90° and one of 60°, to form both triangular and square connectivity rings, or BBs that can compensate for the differing angles required. This topology may be more easily achieved with lower symmetry BBs.

The Tri⁸Di¹² topology is formed from a [8 + 12] reaction and is related to a cube via an edge-directed assembly and has 6 potential access windows, all with ring size 8. Multiple examples of this topology have been synthetically reported, although several lack shape-persistence. A catenated form of this topology, c-Tri⁸Di¹² has been reported by Zhang et al.³⁸ This molecule can adopt final geometric shapes ranging from a cube to the opposite extreme of a shape formed from planar tritopic BBs; reported examples of Tri⁸Di¹² topology molecules with these shapes are shown in Fig. 7. For a cubic shape to form, the tritopic BB could adopt a conformation to act as a capping vertex, with (end group)–(centre of mass)–(end group) angles of ∼90°, and the ditopic BB would then be linear. For the vertex-folded shape, the tritopic BB could be more planar with (end group)–(centre of mass)–(end group) angles of ∼120° and the ditopic BB bent.


	Fig. 7 The range of geometric shapes possible for Tri⁸Di¹² topology cages, from a vertex-folded shape⁷¹ (top), to a cube³¹ (bottom).

The final topology that we suggest for this family is a Tri²⁰Di³⁰ molecule from a [20 + 30] reaction and is related to a dodecahedron via an edge-directed assembly and has twelve potential access windows, all with ring size 10. This topology has not been reported for an organic cage molecule, although it has been reported for a MOP.⁷² It remains an alluring synthetic target, albeit very challenging due to potential issues with the solubility of the intermediates and the need to have a thermodynamic driving force for a dodecahedron over other topologies or an infinite polymeric product. Furthermore, it will be particularly challenging for this large molecule to be shape-persistent. The most obvious way to design this topology is to have an (end group)–(centre of mass)–(end group) angle of ∼108° in the tritopic BB, to match that of a dodecahedron's vertices, and to combine this with a linear ditopic BB. The required angle of ∼108° is close to that of a tetrahedral carbon, thus tri-functionalised methane molecules, such as those used by Olenyuk et al. for the synthesis of the dodecahedral MOP,⁷² are plausible BBs.

Tetratopic + ditopic topology family

Tetratopic and ditopic BBs can be combined in a 2 [thin space (1/6-em)]

4 ratio to form molecular cages. The smallest topology in this family is the capsular Tet²Di⁴ topology, formed from a [2 + 4] reaction of two tetratopic end groups linked with three ditopic linkers, as shown in Fig. 5, where all the topologies of this family are reported. The topology has four windows of ring size 4. There are multiple synthetic reports of this topology (see Table 2) and the topology can have an intrinsic, shape-persistent cavity if the BBs are relatively rigid.

The Tet³₃Di⁶ topology is formed from a [3 + 6] reaction, containing three sets of doubly connected tetratopic BBs, so all have the same connectivity environment. The double connections result in two distinct window sizes, three rings of size 4 and two rings of size 6. This topology has been reported using a bifunctional ditopic ligand with a cyclisation reaction.⁶⁴ The Tet⁴₂Di⁸ topology is formed from a [4 + 8] reaction and contains two double connections, such that each tetratopic BB is part of one double-link and two single-links. The double connections result in two distinct window sizes, two rings of size 4 and four rings of size 6. Although there are no crystal structures reported, Warmuth and co-workers have reported this structure with tetratopic cavitands and ethane diamines in a solvent of tetrahydrofuran and a distorted tetrahedral shape.³⁵

The Tet⁴₄Di⁸ topology is also formed from a [4 + 8] reaction, but in this case there are four double connections, such that each tetratopic BB is doubly connected to its two neighbours. The topology has two distinct window sizes, four windows of ring size 4 and two windows of ring size 8. This structure has been reported by Warmuth and co-workers, again with tetratopic cavitands and ethane diamine, although even in a solvent it forms a folded structure that does not contain an internal void.⁶⁵ The Tet⁵₅Di¹⁰ topology is formed from a [5 + 10] reaction that also has all building blocks doubly connected to each of their neighbours, creating two distinct windows, four with ring size 5 and two with ring size 10. This topology has not been experimentally reported and would have a high likelihood of lacking an internal void, as with the Tet⁴₄Di⁸ topology, for this reason we exclude larger topologies with this type of connectivity.

The Tet⁶Di¹² topology is formed from a [6 + 12] reaction and is the first of the (tetratopic + ditopic) family that can be related to a polyhedron. Through an edge-directed assembly it relates to an octahedron, where the tetratopic BBs are placed on the vertices and the ditopic BBs are placed along the edges. The topology has eight windows of ring size 6. Whilst there are no reported crystal structures, this topology has been reported by Warmuth and co-workers, from the same BBs as the Tet⁴₂Di⁸ topology, but using chloroform as a solvent, rather than tetrahydrofuran.³⁵ A molecule with this topology could adopt final shapes ranging from an octahedron to a cube, as shown in Fig. 8. For an octahedral shape to form, the tetratopic BB could adopt a conformation to act as a capping vertex, with (end group)–(centre of mass)–(end group) angles of ∼60°, and the ditopic BB linear. For the cubic shape, the tetratopic BB could be more planar with (end group)–(centre of mass)–(end group) angles of ∼90° and the ditopic BB bent to angles of ∼90° also.


	Fig. 8 The range of geometric shapes possible for a Tet⁶Di¹² topology cage,³⁵ from an octahedron (middle) to a cuboctahedron (right). The molecule reported on the left is an example of an octahedral shaped cage.

The Tet⁸Di¹⁶ topology is formed from a [8 + 16] reaction and can be related to a square antiprism through an edge-directed assembly of tetratopic BBs on vertices and ditopic BBs on edges. The topology has eight windows of ring size 6 and two larger windows of size 8. There are no crystal structures for this topology, however it was reported by Warmuth and co-workers for the same building blocks as the Tet⁴₄Di⁸ and Tet⁶Di¹² topologies, but using a dichloromethane as the solvent.³⁵ Molecular mechanics simulations suggested that their molecule would maintain a square prism shape that is approximately a spherical ring, as shown in Fig. 9.³⁵


	Fig. 9 Example of a Tet⁸Di¹⁶ topology cage reported by Warmuth and co-workers (left) with the geometric shape of a square antiprism (right).³⁵

The Tet¹⁶Di³² topology is formed from a [16 + 32] reaction and can be related to the Archimedean solid of a cuboctahedron through an edge-directed assembly of tetratopic BBs on vertices and ditopic BBs on edges. There are two window sizes, eight of size 6 and six of size 8. There are no synthetic reports of this topology. Finally, there is a Tet²⁴Di⁴⁸ topology, formed from a [24 + 48] reaction that can be related to another Archimedean solid, the rhombicuboctahedron through edge-directed assembly. There are two window sizes, eight of size 6 and eighteen of size 8. There are also no synthetic reports of this topology, however, Fujita and co-workers have reported the synthesis of a MOP with this topology using square planar Pd²⁺ as the tetratopic vertex and a dipyridylfuran ligand with an angle of 127° as the edge.³⁴ This structure therefore remains a synthetic design target for an organic cage molecule.

Tritopic + tritopic topology family

Two different tritopic BBs can be combined in a 1 [thin space (1/6-em)]

1 ratio to form molecular cages. The smallest topology for this family is Tri¹Tri¹, which consists of a [1 + 1] reaction, with one of each tritopic BB being connected directly to the other tritopic BBs, resulting in three windows of ring size 2. This is shown in Fig. 10, where all the tritopic + tritopic topologies are shown. Provided the tritopic BBs have a concave shape, the resultant organic cage can have an intrinsic internal cavity. This type of topology has been reported by Kataoka et al. via a condensation reaction of a bowl-shaped triboronic acid and a bowl-shaped tri-diol molecule; the resultant capsule was shown to encapsulate small guests.⁶⁶


	Fig. 10 The tritopic + tritopic topology family. One of the tritopic precursors is in blue, the other in teal.

The Tri²₂Tri² topology is formed from a [2 + 2] reaction, containing two of each type of tritopic BB, with each BB having one set of double connections and one single connection to the opposite building block. Therefore all BBs have the same connectivity environment and the molecule has two windows of ring size 2 and two of ring size 4. The Tri³₂Tri³ topology is formed from a [3 + 3] reaction, with two pairs of BBs having double connections and the remainder having single connections. The molecule has two windows of size 2 and four of size 4. There have been no synthetic reports of either the Tri²₂Tri² or Tri³₂Tri³ topology to our knowledge.

The Tri⁴Tri⁴ topology is formed from a [4 + 4] reaction and can be related to a tetrahedron through a face-directed assembly, where one of the tritopic BBs is placed on the vertices and another on the faces of the Platonic solid. This type of face-directed assembly has been termed ‘molecular panelling’ when applied to the construction of MOPs.⁷³ All the BBs are singularly linked and there are six windows of ring size 4. This topology has been reported by both Mastalerz and co-workers⁷⁴ and Cooper and co-workers.⁶⁷ A molecule with this topology could adopt final shapes ranging from a cube⁷⁴ to a tetrapod,⁶⁷ as shown in Fig. 11. For a cubic shape to form, both tritopic BBs could have a similar conformation with (end group)–(centre of mass)–(end group) angles of ∼90°, such that they form the eight corners of the cube between them. For a more tetrapodal shape, one tritopic BB could be planar, with angles of ∼120°, whereas the other could have very narrow angles, acting as the end of each ‘foot’ of the tetrapod.


	Fig. 11 The range of shapes possible for the Tri⁴Tri⁴ topology cages, from a tetrapod⁶⁷ (top), to a cube⁷⁴ (bottom).

Tetratopic + tritopic topology family

A tetratopic and tritopic BB can be combined in a 3 [thin space (1/6-em)]

4 ratio to form molecular cages. We identify only one plausible organic cage topology for this system, Tet⁶Tri⁸, as shown in Fig. 12. Alternative topologies with fewer components do not form an organic cage, rather open bowl topologies, or require interweaved connections, thus we exclude them here. The Tet⁶Tri⁸ topology is formed from a [6 + 8] reaction that can be related to a rhombic dodecahedron by a face-directed assembly, where both tetratopic BBs and tritopic BBs are on the vertices of the polyhedron. This topology consists of twelve windows of ring size 4.


	Fig. 12 The Tet⁶Tri⁸ topology for the tetratopic + tritopic family. Tetratopic precursors are in orange, tritopic in blue.

The Tet⁶Tri⁸ topology has been reported by Warmuth and co-workers using a tetratopic cavitand and a triphenylamine (although no crystal structure was reported for this experiment)³² and two examples using porphyrin building blocks and triamines by Hong et al. (for which crystal structures are available); the resultant molecules showed selectivity for small gases.⁶⁸ The range of shapes possible for the Tet⁶Tri⁸ topology spans from a rhombic dodecahedron³² through an octahedron to a cube⁶⁸ as shown in Fig. 13. For a rhombic dodecahedron shape to form, the tritopic and tetratopic BBs should have (end group)–(centre of mass)–(end group) angles of ∼60° and ∼90°, respectively. For a cubic shape to form, the tetratopic BB could be planar, with angles of ∼90° and the tritopic BB acting as a cube corner, with angles of ∼60°.


	Fig. 13 The range of geometric shapes possible for a Tet⁶Tri⁸ topology cage. Top: The Tet⁶Tri⁸ cage³² can assume the shape of a rhombic dodecahedron (middle) when considering both tetratopic and tritopic precursors as vertices of a solid. It has the shape of an octahedron (right) when only the tetratopic vertices are linked together. Bottom: The Tet⁶Tri⁸ cage⁶⁸ can be seen as a rhombic dodecahedron (middle, as before). It assumes the shape of a cube (right) when only the tritopic precursors are linked together.

In the remainder of this article, we will examine specific case studies for each of the four topology families discussed above. This will allow us to compare the possible reaction outcomes – the alternative topologies that could be formed – and to investigate to what extent computer simulations can assist in predicting the topology from knowledge of the BBs alone.

Methods

In this work, where we consider the topological outcome resulting from the reaction of a pair of BBs, we assemble the possible topologies and then conduct a thorough search of the potential energy landscape for low energy conformations; this is essential, as without care, assumptions can lead to only local energy minimum conformations being found.⁷⁵ Either a conformer search calculation or a high temperature molecular dynamics simulation (MD) was performed to explore the potential energy surface for each molecule, and to locate the lowest energy conformations using the OPLS3 force field.⁷⁶ This conformational searching will typically take >1 week for a relatively small cage molecule. More flexible molecules can adopt a large number of different conformations; for example, Fig. 14 shows an overlay of 395 conformations sampled within a 20 kJ mol⁻¹ energy range of the lowest energy conformation during an MD simulation for Tet⁶Tri⁸-C (molecule described later in the text). In this specific example, the flexibility of the cage molecule is mainly a result of the possible phenyl ring rotations in the tritopic BB. Of course, more rigid molecules are characterised by a small number of low energy conformers; for example, the Tri⁴Di⁶ topology of CC3 has only a single conformation within a 20 kJ mol⁻¹ energy range.


	Fig. 14 Overlay of 395 conformations found within a 20 kJ mol⁻¹ energy window for Tet⁶Tri⁸-C (molecule described later in the text). For all conformations, carbons are coloured grey, oxygens red and nitrogens blue. The lowest energy conformation is depicted with a thicker stick representation, where the tetratopic BBs are orange and tritopic BBs blue.

In cases where the cage molecule is found to be non-shape persistent, with the lowest energy conformations lacking an internal void, we then proceed with our previously developed enhanced sampling technique to “inflate” the molecules and find the open conformations that lie higher on the potential energy landscape.⁷¹ This is essential to compare the conformations that the topologies would be adopting in solution when formed. The time requirements of this procedure naturally increase with molecular size, but as an example, the Tet⁸Di¹⁶ molecule described below required multiple simulations of several weeks duration. Once a collection of open conformations was generated, we performed a further refinement with density functional theory (DFT) calculations in order to obtain more reliable relative energies. Each conformation was geometry optimised with the PBE functional,⁷⁷ and then its energy value was refined further with a single point calculation with the meta-GGA M06-2X functional.⁷⁸ Following this, relative internal energies were compared for each family of cages. The use of DCC reactions allowed us to assume that the molecule with the lowest relative energy should be the most synthetically accessible. When the comparison of relative energies was not possible due to molecules containing different BBs, formation energies were calculated instead, with the assumption that the molecule with the lowest formation energy is the most likely to form. For full simulation details, please refer to the ESI.†

Computational challenges

The described procedure allows us to compare the relative internal energies of the hypothetical topologies for a given assembly of BBs, giving an indication of the relative strain (or lack of it), should the BBs adopt those particular topologies. At the start of this paper, we outlined a large number of factors that could influence the topological outcome, some of which we are not able to consider here, based on computational limitations that we will now explain. However, our calculations allow us to uncover to what extent these relative internal energies can explain previous experimental observations.

Firstly, in terms of solvent effects, our previously reported “inflation” procedure⁷¹ allows us to look at how a solvent influences the conformations of cage molecules. This approach allows us to reproduce the scaffolding or “templating” effect of the solvent molecules, but does not explicitly model individual solvent interactions. To reproduce explicit solvent interactions, we would firstly have to greatly increase the number of atoms in our simulations due to the necessity of including hundreds of solvent molecules; in previous work we found a single cage molecule to be solvated by ∼80 dichloromethane molecules (400 atoms) even in the solid state solvate structure.³³ Secondly, we would then need to sample the conformations of these molecules surrounding the cage molecule to consider the dynamic nature of the solvation. Combined, these simulations are not currently tractable for these systems. An alternative implicit solvation approach, where a dielectric constant is applied to reproduce the dielectric screening a molecule in solution experiences, does not reproduce the scaffolding effect of the solvent, and our preliminary tests have found this to make no significant difference to the relative energies of cage topologies. This issue of solvent effect remains a big challenge in the field and efficient approaches need to be developed for the modelling of such complex systems.

Ideally we would go beyond the relative internal energies from DFT calculations and compare free energies. This would include a consideration of the translational, rotational and vibrational entropy of the systems, which should tend to disfavour the formation of larger assemblies over smaller ones. Practically, this would require a frequency analysis for each molecular conformation, and this is computationally intractable on a routine basis even for the smaller cage systems. Indeed, in our previous work we had to resort to considering only fragments of a comparably small molecular cage for calculations of the free energy correction to guest binding energies to be tractable.⁷⁹

Whilst not the focus of our work, one can also consider the use of simulations to predict preferential catenane formation over monomeric forms of the topologies discussed. Catenanes could be expected to form as thermodynamic products if a combination of solvent interactions, steric compatibility of the BBs involved in the reaction, and the formation of stabilising interactions between the two monomeric units act as a thermodynamic driving force. The obvious starting point for simulations is to calculate the binding energy of a catenane pair by comparing the total energy to that of two gas phase monomers. An unfavourable binding energy, for example resulting from steric clashes of the molecules in a catenated form, would suggest that the monomeric form would be preferred. However, favourable binding energies would need to be considered with caution, as these would indicate stabilisation of the catenane relative only to isolated gas phase monomers, without consideration of stabilisation of those monomers by either solvent interactions or intermolecular interactions gained via crystal packing. Thus, arguably, for a series of systems, binding energies might be best used to qualitatively rank a series of monomers in terms of the likelihood of catenane preference over monomeric forms. We further note that for some systems there would be significant, and even potentially prohibitive, sampling requirements when looking for low energy catenated forms. These would arise from the requirement to consider both different mechanical interlocking arrangements and also configurational sampling for each interlocked form. Finally, reliable binding energies across a series of different systems are likely to require DFT calculations, rather than computationally cheaper forcefield calculations.

Results

The tritopic + ditopic family: CC3

For the tritopic + ditopic family, we investigated the outcome of combining 1,3,5-triformylbenzene (1 in Fig. 15) and (R,R)-1,2-diaminocyclohexane (2 in Fig. 15), used in the synthesis of CC3, a Tri⁴Di⁶ imine molecule.^27,80 The relative energies per [2 + 3] unit were compared for the lowest energy conformations of the Tri²Di³, Tri⁴Di⁶, Tri⁴₂Di⁶, Tri⁶Di⁹ and Tri⁸Di¹² topologies. The structures of the lowest energy open conformations are shown in Fig. 15, with the relative energies in Fig. 16 and Table 3. The Tri²Di³, Tri⁴Di⁶, and Tri⁴₂Di⁶ topologies exhibit shape persistency, whereas the larger Tri⁶Di⁹ and Tri⁸Di¹² topologies are more flexible and collapse under MD simulations, losing their internal cavities. Constrained MD simulations on the latter two topologies were able to find higher energy open metastable states, 4 and 2 kJ mol⁻¹ per [2 + 3] unit above the collapsed conformations for Tri⁶Di⁹ and Tri⁸Di¹² respectively. We compare the relative energies of the open conformations only, as these are likely the structures that initially form in the reaction solution, and therefore comparison between these structures is most pertinent for determining the likely reaction outcome.


	Fig. 15 Molecular structures of the lowest energy open conformations of the tritopic + ditopic topologies for the CC3 imine cage system, formed from the reaction of 1,3,5-triformylbenzene (1) and (R,R)-1,2-diaminocyclohexane (2). Tritopic BBs are in blue, ditopic precursors in purple, the imine bonds formed in the reaction are in grey.


	Fig. 16 The relative energies of the lowest energy conformations of the CC3-based tritopic + ditopic topologies, as calculated by M06-2X.

Table 3 Information on the lowest energy open conformations of the tritopic + ditopic topologies

	Synthetically realised?	Energy relative to Tri⁴Di⁶ per [2 + 3] unit (kJ mol⁻¹)	Shape persistent?	Void diameter (Å)
Tri ² Di ³	No	52	Yes	1.3
Tri ⁴ Di ⁶	Yes	0	Yes	5.5
Tri ⁴₂ Di ⁶	No	39	Yes	3.1
Tri ⁶ Di ⁹	No	14	No	5.2
Tri ⁸ Di ¹²	No	5	No	7.4

The Tri⁴Di⁶ molecule lies lowest in energy and is the synthetically realised topology, which can therefore be rationalised on the energy of the gas phase conformations alone. This is followed by the higher energy metastable inflated conformers of Tri⁸Di¹² (5 kJ mol⁻¹ per [2 + 3] unit) and Tri⁶Di⁹ (14 kJ mol⁻¹ per [2 + 3] unit). An alternative topology is available with a [4 + 6] reaction, in which precursors are assembled into a doubly-bonded ring to give Tri⁴₂Di⁶, however this is considerably higher in energy (39 kJ mol⁻¹ per [2 + 3] unit). The Tri²Di³, in which the precursors are assembled into a capsule, is the topology with the highest energy, 52 kJ mol⁻¹ per [2 + 3] unit higher, unsurprisingly the geometry is characterised by a high degree of strain. To compare not only the topological prediction, but also the structure prediction, we overlaid our predicted structure of the Tri⁴Di⁶CC3 molecule to that of the single crystal X-ray diffraction structure.²⁷ We found a Root Mean Square Deviation (RMSD) value of 0.107 Å, showing that the conformation is well reproduced, as shown in Fig. S1-A.†

The tritopic + ditopic family: CC5 and CC8

For the tritopic + ditopic family, we investigated a further case that has been reported as an example of emergent behaviour. When tris(4-formylphenyl)amine (3 in Fig. 17) and (1S,2S)-cyclopentanediamine (4 in Fig. 17) are combined, they produce the porous Tri⁴Di⁶CC5 molecule, but changing the diamine to (R,R)-1,2-diaminocyclohexene (5 in Fig. 17), adding only an additional CH₂ group, results in a Tri⁸Di¹²CC8 molecule, which is non-porous and lacks shape persistence.^33,81 The relative energies per [2 + 3] unit and structures for the lowest energy conformations of the Tri²Di³, Tri⁴Di⁶, and Tri⁸Di¹² topologies were compared. The structures are shown in Fig. 17 and the energies in Fig. 18 and Table 4. The Tri²Di³ topology is too small to contain a cavity, whereas the Tri⁴Di⁶ molecules are shape persistent with cavities ∼7 Å in diameter. However, both Tri⁸Di¹² structures collapse, as is known experimentally for the CC8 molecule, losing their internal cavities. Constrained MD simulations on the Tri⁸Di¹² structures were used to find higher energy open metastable states, 21 and 28 kJ mol⁻¹ per [2 + 3] unit above the collapsed conformations for CC5 and CC8 respectively.


	Fig. 17 Molecular structures of the lowest energy open conformations of the ditopic + tritopic topologies for the CC5 and CC8 imine cage systems, formed from the reaction of 1,3,5-triformylbenzene (3) and (1S,2S)-cyclopentanediamine (4) for CC5 and (R,R)-1,2-diaminocyclohexene (5) for CC8. Tritopic BBs are in blue, ditopic precursors in purple, the imine bonds formed in the reaction are in grey.


	Fig. 18 The relative energies of the lowest energy conformations of the CC5 and CC8-based ditopic + tritopic topologies, as calculated by M06-2X.

Table 4 Information on the lowest energy open conformations of the CC5 and CC8-based tritopic + ditopic topologies

	Synthetically realised?	Energy relative to Tri⁸Di¹² per [2 + 3] unit (kJ mol⁻¹)	Shape persistent?	Void diameter (Å)
CC5 Tri ² Di ³	No	87	—	0.4
CC5 Tri ⁴ Di ⁶	Yes	−1	Yes	7.3
CC5 Tri ⁸ Di ¹²	No	0	No	10.1
CC8 Tri ² Di ³	No	67	—	1.6
CC8 Tri ⁴ Di ⁶	No	25	Yes	6.5
CC8 Tri ⁸ Di ¹²	Yes	0	No	9.2

For both cage families, the Tri²Di³ molecule, with only two carbon atoms bridging the tritopic BBs, is strained and consequently high in energy compared to the other topological possibilities in all cases (by >40 kJ mol⁻¹). It is not surprising therefore that this topology is not experimentally observed. For the CC8 molecule, the lowest energy conformation has the large Tri⁸Di¹² topology, with an approximately octahedral symmetry. This corresponds to the experimental reports of this molecule formed with the diaminocyclohexene group.³³ As we previously reported,⁷¹ the computed open conformation for CC8 is a good match to the single crystal X-ray diffraction structure of the solvate (with an RMSD value of 1.672 Å, see Fig. S2-A†), which improves slightly when minimising the single crystal structure (with the RMSD value moving to 0.629 Å, see Fig. S2-B†), suggesting some influence of the solvent molecules in opening the vertices slightly outwards.

For the CC5 molecule, there was a very different result to the CC8 molecule; now there is no clear preference between the Tri⁴Di⁶ and Tri⁸Di¹² molecules, with a relative energy difference of only 1 kJ mol⁻¹ per [2 + 3] unit. This molecule has in fact been synthetically realised only as a Tri⁴Di⁶ topology, which had a BET SA of 1333 m² g⁻¹.⁸¹ An overlay of the computed and single crystal X-ray diffraction structure of the solvate (see Fig. S3†) finds a good match, with an RMSD of 0.706 Å. Again, this match improves when the solvate conformation is geometry optimised (RMSD of 0.449 Å), which contracts the molecule, suggesting a scaffolding effect of the solvent that is lost in the calculations when the conformation is completely energy minimised after our artificial inflation procedure. In this case therefore, the calculations would not have been able to successfully distinguish between whether a Tri⁴Di⁶ or Tri⁸Di¹² topology was formed, although it was clearly a different scenario to the CC8 topological energy landscape. We attribute this to our not being able to consider all solvent effects in our simulations (as evidenced by the overlays). Further, as discussed in the Computational challenges section, if the entropic contribution to the free energies was able to be considered, this can be expected to destabilise the larger Tri⁸Di¹² topology relative to the Tri⁴Di⁶.

The tritopic + tritopic family

For the tritopic + tritopic family, we investigated the reaction of tris(2-aminoethyl)amine (6 in Fig. 19) and tris(4-formylphenyl)aldehyde (7 in Fig. 19), used in the synthesis of CC11, a tetrapodal-shape Tri⁴Tri⁴ molecule.⁶⁷ The relative energies per [1 + 1] unit were compared for the lowest energy conformations of the Tri¹Tri¹, Tri²Tri² and Tri⁴Tri⁴ topologies. The structures of the lowest energy conformations are shown in Fig. 19, with the relative energies shown in Fig. 20 and Table 5. Both Tri¹Tri¹ and Tri²Tri² present very strained geometries and therefore their energy is hundreds of kJ mol⁻¹ higher than that of Tri⁴Tri⁴ (376 and 123 kJ mol⁻¹ per [1 + 1] unit, respectively). This is in agreement with the experimental result, where it is the Tri⁴Tri⁴ topology with a tetrapodal shape that is formed. When we compared it with the single crystal X-ray diffraction structure, we find an RMSD of 0.548 Å, with a good match of our predicted structure to the experimentally reported one (see Fig. S1-B†).


	Fig. 19 Molecular structures of the lowest energy conformations of the tritopic + tritopic topologies for the CC11 imine cage system, formed from the reaction of tris(2-aminoethyl)amine (6) and tris(4-formylphenyl)aldehyde (7). Tritopic BBs of 3 are in blue, tritopic BBs of 4 are in teal, the imine bonds formed in the reaction are in grey.


	Fig. 20 The relative energies of the lowest energy conformations of the CC11-based tritopic + tritopic topologies, as calculated by M06-2X.

Table 5 Information on the lowest energy open conformations of the CC11-based tritopic + tritopic topologies

	Synthetically realised?	Energy relative to Tri⁴Tri⁴ per [1 + 1] unit (kJ mol⁻¹)	Shape persistent?	Void diameter (Å)
Tri ¹ Tri ¹	No	376	—	1.2
Tri ² Tri ²	No	123	—	0.4
Tri ⁴ Tri ⁴	Yes	0	Yes	3.2

The tetratopic + ditopic family

For the tetratopic + ditopic family, we investigated the reaction of the tetraformylcavitand (8 in Fig. 21) and ethylene-1,2-diamine (9 in Fig. 21), reported by Warmuth and co-workers to produce a mixture of Tet⁴₂Di⁸, Tet⁶Di¹² and Tet⁸Di¹⁶ topology molecules, dependent upon the solvent used.³⁵ Five different molecules were generated, Tet²Di⁴, Tet³Di⁶, Tet⁴₂Di⁸, Tet⁶Di¹² and Tet⁸Di¹⁶. In the MD simulations, Tet²Di⁴, Tet³₃Di⁶ and Tet⁴₂Di⁸ were found to be shape persistent, whereas the larger Tet⁶Di¹² and Tet⁸Di¹⁶ topologies were not. Constrained MD simulations on the latter two were able to locate metastable, higher energy open conformations. The structures of the lowest energy open conformations are shown in Fig. 21, with the relative energies shown in Fig. 22 and reported in Table 6.


	Fig. 21 Molecular structures of the lowest energy open conformations of the tetratopic + ditopic topologies, formed from the reaction of tetraformyl cavitand (8) and ethylene-1,2-diamine (9). Also shown with the Tet²Di⁴ topology, are the lowest energy conformations with m-phenylenediamine (10) and p-xylylenediamine (11) as the ditopic BBs. Tetratopic precursors are in orange, ditopic precursors in purple, the imine bonds formed in the reaction are in grey.


	Fig. 22 The relative energies of the lowest energy conformations of the Warmuth tetratopic + tritopic topologies, as calculated by M06-2X. The error bar on point Tet⁸Di¹⁶ shows the range of energies covered by conformations with differing degrees of inflation (see Table S1 and Fig. S4†), with more open conformers displaying higher energies.

Table 6 Information on the lowest energy open conformations of the Warmuth tetratopic + ditopic topologies

	Synthetically realised?	Energy relative to Tet⁴Di⁸ per [1 + 2] unit (kJ mol⁻¹)	Relative formation energy per bond formed (kJ mol⁻¹)	Shape persistent?	Void diameter (Å)
Tet ² Di ⁴ -A	No	27	8	Yes	5.8
Tet ² Di ⁴ -B	Yes	—	6	Yes	5.1
Tet ² Di ⁴ -C	Yes	—	0	Yes	4.6
Tet ³ Di ⁶	No	17	—	Yes	3.1
Tet ⁴₂ Di ⁸	Yes	0	—	Yes	6.3
Tet ⁶ Di ¹²	Yes	1	—	No	12.3
Tet ⁸ Di ¹⁶	Yes	Range from −21 to 17	—	No	8.4–10.4

The cage with the lowest energy was a partially collapsed conformer of Tet⁸Di¹⁶. The cage was not found to be shape persistent after high temperature MD, and the inflating procedure provided us with a very large number of higher energy metastable conformers, some of which were partially open, like the lowest energy one reported, and others containing a larger pore size (as shown in Fig. S4†). As no crystal structure was available and it was not possible to be certain whether a single conformation represented the open structure, we plot the range of energies found for conformations with varying pore sizes in Fig. 22 and show the conformation with the widest pore diameter (10.4 Å) in Fig. 21. This more open conformation is located 38 kJ mol⁻¹ above the less inflated lower energy conformer, which has a smaller pore diameter of 8.4 Å. The degree of strain of the structure increases with the degree of inflation.

Experimentally, the formation of Tet⁸Di¹⁶ was obtained using CH₂Cl₂ as a solvent. We carried out a structural analysis on the resulting cage, in order to understand whether it was possible to relate the solvent used during the synthesis to the final topology. Tet⁸Di¹⁶ has 10 windows, with diameters that range from 4–6 Å for the 8 triangular windows and from 6–9 Å for the 2 square windows, depending on the degree of inflation (see Table S1†). About 22 and 21 kJ mol⁻¹ above the lowest energy conformer of Tet⁸Di¹⁶ are Tet⁶Di¹² and Tet⁴₂Di⁸, that were respectively found experimentally in THF and chloroform. We analysed their windows’ diameters (an average of 5.2 Å for the 8 triangular windows of the octahedral Tet⁶Di¹², and an average of 5.6 Å for the 4 biggest windows of Tet⁴₂Di⁸), but we were not able to find any geometric correlation between the solvent used for the synthesis and the cage topologies. We leave this particular aspect to further studies.

The hypothetical topologies we generated, Tet³₃Di⁶ and Tet²Di⁴, are more strained and thus higher in energy, explaining the fact that they have not been experimentally observed. They are located respectively 17 and 27 kJ mol⁻¹ higher than Tet⁴₂Di⁸. We then investigated the effect of changing the ditopic BB's geometry on the topological landscape. The study carried out by Warmuth and co-workers suggested that it was not possible to synthesise a cage with the shape of a Tet²Di⁴ capsule when using ethylene-1,2-diamine, which is rationalisable by the fact that it lies 27 kJ mol⁻¹ higher in energy per [1 + 2] unit than the observed Tet⁴₂Di⁸ topology.³⁵ However, they did find that the Tet²Di⁴ capsule could be formed by using different ditopic BBs, benzenes amino-functionalised in the meta or para position. We therefore generated two capsules in a [2 + 4] ratio, Tet²Di⁴-B and Tet²Di⁴-C, using m-phenylenediamine (10) and the chiral precursor p-xylylenediamine (11) respectively. As these molecules contain different BBs, we cannot directly compare their relative energies per formula unit, and therefore instead compare the relative formation energies of the three capsular Tet²Di⁴ molecules (see Table 6). In agreement with experimental results, both cages B and C are lower in energy than A, in particular Tet²Di⁴-C is 6 kJ mol⁻¹ per imine bond lower than Tet²Di⁴-B, and Tet²Di⁴-C is 8 kJ mol⁻¹ lower than Tet²Di⁴-A. This is because kinked or twisted diamines allow the two cavitands to align in a more stable and less hindered conformation than when using BB 9. The latter should adopt a very strained gauche conformation to permit the cavitands to align in the coplanar conformation required to reduce the strain.

The tetratopic + tritopic family

Finally, for the (tetratopic + tritopic) family, we investigated the reaction of the tetraformylcavitand (8 in Fig. 23) and 1,3,5-tris(4-aminophenyl)benzene (12 in Fig. 23) to form a Tet⁶Tri⁸ molecule, as previously reported by Warmuth and co-workers.³² We constructed models with the BBs mixed in both a [6 + 8] ratio and in a [2 + 4] ratio; both topologies were observed depending upon whether a 3 aldehyde [thin space (1/6-em)]

4 amine stoichiometric ratio was used experimentally (resulting in Tet⁶Tri⁸) or an excess of the amine was used (resulting in a capsule Tet²Tri⁴ with 3 unreacted amine groups). The lowest energy conformations are shown in Fig. 23. The relative formation energies of the two topologies were compared (reported in Table 7) and found to be very similar (within 1 kJ mol⁻¹). Such a small difference in energy would explain why both topologies were experimentally observed in solution. Although the presence of the molecules in solutions was identified by Warmuth and coworkers with different techniques such as ¹H NMR, ¹³C NMR and MALDI-TOF, the structures were never isolated from the solvent and neither solvated or desolvated crystal structures are currently available, therefore a direct geometric comparison between experimental and computed structures is not currently possible. Nevertheless, the calculations seem to suggest that the lowest energy conformers of both the capsule and rhombic dodecahedron cage are shape persistent and porous, therefore of synthetic interest. The void dimensions of both molecules were calculated and reported in Table 7. Tet²Tri⁴ shows a void diameter of 9.9 Å, and Tet⁶Tri⁸ of 17.8 Å, which is very close to the 20.6 Å void diameter of the boronate cube synthesised by Mastalerz and coworkers, which currently holds the record for the molecule with the biggest intrinsic pore synthesised.


	Fig. 23 Molecular structures of the lowest energy conformations of the tetratopic + tritopic topologies formed from the reaction of tetraformylcavitand (8) and 1,3,5-tris(4-aminophenyl)benzene (12). Also shown with the Tet⁶Tri⁸ topology are the lowest energy conformations with 1,3,5-triaminobenzene (13) and 4,4′,4-triaminotriphenylamine (14) as the tritopic BBs. Tetratopic precursors are in orange, tritopic precursors in blue, the imine bonds formed in the reaction are in grey.

Table 7 Information on the lowest energy open conformations of the Warmuth tetratopic + tritopic topologies

	Synthetically realised?	Relative formation energy per bond formed (kJ mol⁻¹)	Shape persistent?	Void diameter (Å)
Tet ² Tri ⁴	Yes	3	Yes	9.9
Tet ⁶ Tri ⁸ -A	Yes	4	Yes	17.8
Tet ⁶ Tri ⁸ -B	No	3	Yes	12.2
Tet ⁶ Tri ⁸ -C	No	0	Yes	15.5

Considering the possible interest in a porous molecule of this cavity volume, the effect of the size of the triamine BB was investigated and two similar molecules with the Tet⁶Tri⁸ topology were generated. Cavitand 8 was mixed with 1,3,5-triaminobenzene (13) to give Tet⁶Tri⁸-B and with 4,4′,4-triaminotriphenylamine (14) to obtain Tet⁶Tri⁸-C. Both molecules were found to be shape persistent and therefore the formation energies per bond were calculated and compared to that of Tet⁶Tri⁸-A (see Table 7). Tet⁶Tri⁸-B was found to have a formation energy ∼1 kJ mol⁻¹ lower than Tet⁶Tri⁸-A, thus is potentially equally likely to form. Tet⁶Tri⁸-C shows a less strained geometry; we attribute this to the central nitrogen atom on each tritopic BB allowing rotational freedom, bringing the formation energy 4 kJ mol⁻¹ per imine bond lower than Tet⁶Tri⁸-A. This suggests that Tet⁶Tri⁸-C could be a particularly promising synthetic target. Both molecules are porous, showing respectively a shape persistent pore diameter of 12.2 Å and 15.5 Å.

Conclusions

We have enumerated the 20 possible topologies that can be used as underlying structures for the design and the synthesis of new porous organic molecules. We have suggested a naming convention that relies on the type, number and multiplicity of connectivity of the BB and propose that this will cause less confusion than naming based on relationships to Platonic or Archimedean solids. Whilst 12 of these topologies have already been synthetically realised, several interesting target candidates have also been identified.

We have tested to what extent calculations focusing on relative internal energies can assist in the prediction of topological outcomes for a given DCC reaction that should afford the thermodynamic product. This approach was successful in the majority of the BB combinations we examined, including for CC3, CC5, CC11, the (tetratopic + ditopic) family from Warmuth and the (tetratopic + tritopic) family from Warmuth. However, for the case of the emergent behaviour in the CC5 and CC8 systems, whilst we can correctly identify that the cyclohexane BB leads to a Tri⁸Di¹² topology, for the cyclopentane BB, we could not distinguish between the relative energies of Tri⁴Di⁶ and Tri⁸Di¹², thus not correctly identifying the experimental outcome of a Tri⁴Di⁶ topology. We believe that this result is likely due to a combination of the influence of the solvent in disturbing the energy landscape and an entropic contribution disfavouring the larger topology, which we were not able to directly include in our simulations. For the Warmuth (tetratopic + ditopic) family, where the topology is known to be influenced by solvent choice, we were not able to find any obvious geometric correlation between the structures and the successful solvent, thus this remains the subject of future research.

We highlight the design opportunity behind using simulations to consider the thermodynamic viability of a given target, which, together with chemical intuition, can provide new developments for this emerging field. The natural evolution of this project would be to explore the topological outcome for a much wider library of precursors with different topicity, length, angles, and rigidity, with the goal being to rationalise which criteria would favour the formation of promising porous organic cages. However, we have shown here computational challenges with modelling a relatively small set of organic precursors, suggesting that the brute force screening of large libraries is not currently computationally feasible. Instead, we would suggest it is better to focus on answering specific questions with smaller library subsets, such as: how does variation in the BB geometry influence topological outcome? Which BBs provide the most promising route to a desired topology? Which BB features are most critical to shape persistence in a desired topology?

Acknowledgements

We acknowledge a Royal Society University Research Fellowship (KEJ), the ERC (ADG-2012-321156-ROBOT), and the EPSRC (EP/M017257/1 and EP/N004884/1) for funding and ARCHER time through the Materials Chemistry Consortium (EP/L000202).

References

O. M. Yaghi, M. O'Keeffe and M. Kanatzidis, J. Solid State Chem., 2000, 152, 1–2 CrossRef CAS.
M. O'Keeffe and O. M. Yaghi, Chem. Rev., 2012, 112, 675–702 CrossRef PubMed.
O. Delgado-Friedrichs, M. D. Foster, M. O'Keeffe, D. M. Proserpio, M. M. J. Treacy and O. M. Yaghi, J. Solid State Chem., 2005, 178, 2533–2554 CrossRef CAS.
M. O'Keeffe, M. A. Peskov, S. J. Ramsden and O. M. Yaghi, Acc. Chem. Res., 2008, 41, 1782–1789 CrossRef PubMed.
E. V. Alexandrov, V. A. Blatov, A. V. Kochetkov and D. M. Proserpio, CrystEngComm, 2011, 13, 3947–3958 RSC.
O. M. Yaghi, M. O'Keeffe, N. W. Ockwig, H. K. Chae, M. Eddaoudi and J. Kim, Nat. Commun., 2003, 423, 705–714 CrossRef CAS PubMed.
R. S. Forgan, J.-P. Sauvage and J. F. Stoddart, Chem. Rev, 2011, 111, 5434–5464 CrossRef CAS PubMed.
R. Chakrabarty, P. S. Mukherjee and P. J. Stang, Chem. Rev., 2011, 111, 6810–6918 CrossRef CAS PubMed.
G. Zhang and M. Mastalerz, Chem. Soc. Rev., 2014, 43, 1934–1947 RSC.
M. Mastalerz, Angew. Chem., Int. Ed., 2010, 49, 5042–5053 CrossRef CAS PubMed.
J. R. Holst, A. Trewin and A. I. Cooper, Nat. Chem., 2010, 2, 915–920 CrossRef CAS PubMed.
J. Tian, P. K. Thallapally and B. P. McGrail, CrystEngComm, 2012, 14, 1909–1919 RSC.
T. Hasell and A. I. Cooper, Nat. Rev. Mater., 2016, 1, 1–15 CrossRef.
B. Dietrich, J. M. Lehn and J. P. Sauvage, Tetrahedron Lett., 1969, 10, 2889–2892 CrossRef.
D. J. Cram, S. Karbach, Y. H. Kim, L. Baczynskyj and G. W. Kallemeyn, J. Am. Chem. Soc., 1985, 107, 2575–2576 CrossRef CAS.
P. Stutte, W. Kiggen and F. Vögtle, Tetrahedron, 1987, 43, 2065–2074 CrossRef CAS.
M. Yoshizawa, J. K. Klosterman and M. Fujita, Angew. Chem., Int. Ed., 2009, 48, 3418–3438 CrossRef CAS PubMed.
T.-C. Lee, E. Kalenius, A. I. Lazar, K. I. Assaf, N. Kuhnert, C. H. Grün, J. Jänis, O. A. Scherman and W. M. Nau, Nat. Chem., 2013, 5, 376–382 CrossRef CAS PubMed.
A. Kewley, A. Stephenson, L. Chen, M. E. Briggs, T. Hasell and A. I. Cooper, Chem. Mater., 2015, 27, 3207–3210 CrossRef CAS.
T. Mitra, K. E. Jelfs, M. Schmidtmann, A. Ahmed, S. Y. Chong, D. J. Adams and A. I. Cooper, Nat. Chem., 2013, 5, 276–281 CrossRef CAS PubMed.
T. Hasell, M. Miklitz, A. Stephenson, M. A. Little, S. Y. Chong, R. Clowes, L. Chen, D. Holden, G. A. Tribello, K. E. Jelfs and A. I. Cooper, J. Am. Chem. Soc., 2016, 138, 1653–1659 CrossRef CAS PubMed.
L. Chen, P. S. Reiss, S. Y. Chong, D. Holden, K. E. Jelfs, T. Hasell, M. A. Little, A. Kewley, M. E. Briggs, A. Stephenson, K. M. Thomas, J. A. Armstrong, J. Bell, J. Busto, R. Noel, J. Liu, D. M. Strachan, P. K. Thallapally and A. I. Cooper, Nat. Mater., 2014, 13, 954–960 CrossRef CAS PubMed.
M. Brutschy, M. W. Schneider, M. Mastalerz and S. R. Waldvogel, Adv. Mater, 2012, 24, 6049–6052 CrossRef CAS PubMed.
K. Acharyya and P. S. Mukherjee, Chem. Commun, 2014, 50, 15788–15791 RSC.
N. Giri, M. G. Del Pópolo, G. Melaugh, R. L. Greenaway, K. Rätzke, T. Koschine, L. Pison, M. F. C. Gomes, A. I. Cooper and S. L. James, Nat. Commun., 2015, 527, 216–220 CrossRef CAS PubMed.
S. Lee, A. Yang, T. P. Moneypenny II and J. S. Moore, J. Am. Chem. Soc., 2016, 138, 2182–2185 CrossRef CAS PubMed.
T. Tozawa, J. T. A. Jones, S. I. Swamy, S. Jiang, D. J. Adams, S. Shakespeare, R. Clowes, D. Bradshaw, T. Hasell, S. Y. Chong, C. Tang, S. Thompson, J. Parker, A. Trewin, J. Bacsa, A. M. Z. Slawin, A. Steiner and A. I. Cooper, Nat. Mater., 2009, 8, 973–978 CrossRef CAS PubMed.
G. O. Brunner, Z. Kristallogr., 1981, 156, 295–303 CAS.
N. J. Young and B. P. Hay, Chem. Commun., 2013, 49, 1354–1379 RSC.
D. Xu and R. Warmuth, J. Am. Chem. Soc., 2008, 130, 7520–7521 CrossRef CAS PubMed.
S. Klotzbach, T. Scherpf and F. Beuerle, Chem. Commun., 2014, 50, 12454–12457 RSC.
Y. Liu, X. Liu and R. Warmuth, Chem. – Eur. J., 2007, 13, 8953–8959 CrossRef CAS PubMed.
K. E. Jelfs, X. Wu, M. Schmidtmann, J. T. A. Jones, J. E. Warren, D. J. Adams and A. I. Cooper, Angew. Chem., Int. Ed., 2011, 50, 10653–10656 CrossRef CAS PubMed.
Q. F. Sun, J. Iwasa, D. Ogawa, Y. Ishido, S. Sato, T. Ozeki, Y. Sei, K. Yamaguchi and M. Fujita, Science, 2010, 328, 1144–1147 CrossRef CAS PubMed.
X. Liu and R. Warmuth, J. Am. Chem. Soc., 2006, 128, 14120–14127 CrossRef CAS PubMed.
E. L. Elliott, C. S. Hartley and J. S. Moore, Chem. Commun., 2011, 47, 5028–5024 RSC.
T. Hasell, X. Wu, J. T. A. Jones, J. Bacsa, A. Steiner, T. Mitra, A. Trewin, D. J. Adams and A. I. Cooper, Nat. Chem., 2010, 2, 750–755 CrossRef CAS PubMed.
G. Zhang, O. Presly, F. White, I. M. Oppel and M. Mastalerz, Angew. Chem., Int. Ed., 2014, 53, 5126–5130 CAS.
D. P. Lydon, N. L. Campbell, D. J. Adams and A. I. Cooper, Synth. Commun., 2011, 41, 2146–2151 CrossRef CAS.
D. J. Tranchemontagne, Z. Ni, M. O'Keeffe and O. M. Yaghi, Angew. Chem., Int. Ed., 2008, 47, 5136–5147 CrossRef CAS PubMed.
S. K. Lin, J. Chem. Inf. Model., 1996, 36, 367–376 CrossRef CAS.
M. K. Gilson and K. K. Irikura, J. Phys. Chem. B, 2010, 114, 16304–16317 CrossRef CAS PubMed.
G. Ercolani, C. Piguet, M. Borkovec and J. Hamacek, J. Phys. Chem. B, 2007, 111, 12195–12203 CrossRef CAS PubMed.
P. Skowronek, B. Warzajtis, U. Rychlewska and J. Gawronski, Chem. Commun., 2013, 49, 2524–2526 RSC.
D. B. Amabilino, Template Strategies in Self-Assembly, John Wiley & Sons, Ltd, Chichester, UK, 2012 Search PubMed.
M. Rekharsky and Y. Inoue, Solvation Effects in Supramolecular Recognition, John Wiley & Sons, Ltd, Chichester, UK, 2012 Search PubMed.
Y. Jin, A. Jin, R. McCaffrey, H. Long and W. Zhang, J. Org. Chem., 2012, 77, 7392–7400 CrossRef CAS PubMed.
M. E. Briggs, A. G. Slater, N. Lunt, S. Jiang, M. A. Little, R. L. Greenaway, T. Hasell, C. Battilocchio, S. V. Ley and A. I. Cooper, Chem. Commun., 2015, 51, 17390–17393 RSC.
M. Kitchin, K. Konstas, C. J. Sumby, M. L. Czyz, P. Valente, M. R. Hill, A. Polyzos and C. J. Doonan, Chem. Commun, 2015, 51, 14231–14234 RSC.
B. Içli, N. Christinat, J. Tönnemann, C. Schüttler, R. Scopelliti and K. Severin, J. Am. Chem. Soc., 2009, 131, 3154–3155 CrossRef PubMed.
D. Barth, O. David, F. Quessette, V. Reinhard, Y. Strozecki and S. Vial, Lect. Notes Comput. Sci., 2015, 9125, 235–246 Search PubMed.
K. D. Okochi, G. S. Han, I. M. Aldridge, Y. Liu and W. Zhang, Org. Lett., 2013, 15, 4296–4299 CrossRef CAS PubMed.
A. Dhara and F. Beuerle, Chem. – Eur. J., 2015, 21, 17391–17396 CrossRef CAS PubMed.
S. Klotzbach and F. Beuerle, Angew. Chem., Int. Ed., 2015, 127, 10497–10502 CrossRef.
S. Jiang, J. T. A. Jones, T. Hasell, D. Bléger, C. E. Blythe, D. J. Adams, A. Trewin and A. I. Cooper, Nat. Commun., 2011, 2, 1–7 CAS.
N. Christinat, R. Scopelliti and K. Severin, Angew. Chem., Int. Ed., 2008, 47, 1848–1852 CrossRef CAS PubMed.
P. Kieryk, J. Janczak, J. Panek, M. Miklitz and J. Lisowski, Org. Lett., 2016, 18, 12–15 CrossRef CAS PubMed.
H. Takahagi, S. Fujibe and N. Iwasawa, Chem. – Eur. J., 2009, 15, 13327–13330 CrossRef CAS PubMed.
M. W. Schneider, I. M. Oppel and M. Mastalerz, Chem. – Eur. J., 2012, 18, 4156–4160 CrossRef CAS PubMed.
H. Ding, Y. Yang, B. Li, F. Pan, G. Zhu, M. Zeller, D. Yuan and C. Wang, Chem. Commun., 2015, 51, 1976–1979 RSC.
M. Mastalerz, Chem. Commun., 2008, 4756–4758 RSC.
G. Zhang, O. Presly, F. White, I. M. Oppel and M. Mastalerz, Angew. Chem., Int. Ed., 2014, 53, 1516–1520 CrossRef CAS PubMed.
R. Warmuth, E. F. Maverick, C. B. Knobler and D. J. Cram, J. Org. Chem., 2003, 68, 2077–2088 CrossRef CAS PubMed.
V. Barba and I. Betanzos, J. Organomet. Chem., 2007, 692, 4903–4908 CrossRef CAS.
J. Sun, J. Bennett, R. Emge and R. Warmuth, J. Am. Chem. Soc., 2011, 133, 3268–3271 CrossRef CAS PubMed.
K. Kataoka, T. D. James and Y. Kubo, J. Am. Chem. Soc., 2007, 129, 15126–15127 CrossRef CAS PubMed.
M. E. Briggs, K. E. Jelfs, S. Y. Chong, C. Lester, M. Schmidtmann, D. J. Adams and A. I. Cooper, Cryst. Growth Des., 2013, 13, 4993–5000 CAS.
S. Hong, M. R. Rohman, J. Jia, Y. Kim, D. Moon, Y. Kim, Y. H. Ko, E. Lee and K. Kim, Angew. Chem., Int. Ed., 2015, 13241–13244 CrossRef CAS PubMed.
Q. Wang, C. Zhang, B. C. Noll, H. Long, Y. Jin and W. Zhang, Angew. Chem., Int. Ed., 2014, 53, 10663–10667 CrossRef CAS PubMed.
Y. Jin, B. A. Voss, R. D. Noble and W. Zhang, Angew. Chem., Int. Ed., 2010, 49, 6348–6351 CrossRef CAS PubMed.
V. Santolini, G. A. Tribello and K. E. Jelfs, Chem. Commun., 2015, 51, 15542–15545 RSC.
B. Olenyuk, M. D. Levin, J. A. Whiteford, J. E. Shield and P. J. Stang, J. Am. Chem. Soc., 1999, 121, 10434–10435 CrossRef CAS.
M. Fujita, K. Umemoto, M. Yoshizawa, N. Fujita, T. Kusukawa and K. Biradha, Chem. Commun., 2001, 509–518 RSC.
S. M. Elbert, F. Rominger and M. Mastalerz, Chem. – Eur. J., 2014, 20, 16707–16720 CrossRef CAS PubMed.
K. E. Jelfs and A. I. Cooper, Curr. Opin. Solid State Mater. Sci., 2013, 17, 19–30 CrossRef CAS.
E. Harder, W. Damm, J. Maple, C. Wu, M. Reboul, J. Y. Xiang, L. Wang, D. Lupyan, M. K. Dahlgren, J. L. Knight, J. W. Kaus, D. S. Cerutti, G. Krilov, W. L. Jorgensen, R. Abel and R. A. Friesner, J. Chem. Theory Comput., 2016, 12, 281–296 CrossRef CAS PubMed.
J. Perdew, K. Burke and M. Ernzerhof, Phys. Rev. Lett., 1996, 77, 3865–3868 CrossRef CAS PubMed.
Y. Zhao and D. G. Truhlar, Theor. Chem. Acc., 2007, 120, 215–241 CrossRef.
M. A. Zwijnenburg, E. Berardo, W. J. Peveler and K. E. Jelfs, J. Phys. Chem. B, 2016, 120, 5063–5072 CrossRef CAS PubMed.
P. Skowronek and J. Gawronski, Org. Lett., 2008, 10, 4755–4758 CrossRef CAS PubMed.
J. T. A. Jones, T. Hasell, X. Wu, J. Bacsa, K. E. Jelfs, M. Schmidtmann, S. Y. Chong, D. J. Adams, A. Trewin, F. Schiffman, F. Cora, B. Slater, A. Steiner, G. M. Day and A. I. Cooper, Nature, 2011, 474, 367–371 CrossRef CAS PubMed.

Footnote

† Electronic supplementary information (ESI) available: Additional computational details, results, figures and structure files. See DOI: 10.1039/C7NR00703E