Open Access Article
William M.
Jacobs
*a and
Daan
Frenkel
*b
aDepartment of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02139, USA. E-mail: wjacobs@fas.harvard.edu
bDepartment of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK. E-mail: df246@cam.ac.uk
First published on 15th September 2015
Assembling molecular superstructures with many distinct components will allow unprecedented control over morphology at the nanoscale. Recently, this approach has been used to assemble periodic structures with precisely defined features, such as repeating arrays of pores and channels, using a large number of building blocks. Here we propose a predictive tool that allows us to optimize the nucleation and growth of unbounded, ordered structures. In what follows, we call these structures ‘crystals’, even though they may only be periodic in one or two dimensions. We find that the nucleation barriers and growth pathways for crystals consisting of many components exhibit generic features that are very different from those of simple crystals. To illustrate the very non-classical nature of the nucleation and growth of such structures, we study the formation of one and two-dimensional crystals with multicomponent unit cells. We find that, whilst the boundaries in the non-periodic dimensions significantly affect the stabilities of these crystals, the nucleation barriers are largely determined by the local connectivity of the associated bulk crystal and are independent of the number of distinct components in the unit cell. We predict that the self-assembly of crystals with complex morphologies can be made to follow specific pathways toward the target structure that successively incorporate key features of the three-dimensional target structure. In contrast with simple crystals, it is possible to tune the kinetics of nucleation and growth separately, thus minimizing defect formation. We show how control over self-assembly pathways can be used to optimize the kinetics of formation of extended structures with arbitrary nanoscale patterns.
Addressable crystals are an exciting prospect because the use of many components offers direct control over the mechanism of self-assembly. In conventional crystals, thermodynamic stability and the kinetics of nucleation and growth are inextricably linked. This relationship between stability and kinetics is customarily described by the classical theory of nucleation, which predicts a single free-energy barrier separating a disordered fluid phase from a thermodynamically favored crystal phase.15–17 Under conditions where the fluid phase is only mildly supersaturated, the rate-limiting free-energy barrier may result in nucleation that is impractically slow. Typically the only means of speeding up nucleation is to increase the supersaturation of the fluid, but this strategy often results in crystals with many defects.5,18 Such limited control over the self-assembly pathway is one fundamental reason why growing high-quality crystals of large particles, such as proteins19,20 and functionalized colloids,21 is notoriously difficult. However, with the introduction of heterogeneous building blocks, it is now possible to decouple the nucleation and growth kinetics while tuning the stability of the crystal phase independently. The ability to optimize the kinetics at all stages of assembly is crucial for achieving high yields of intricate multicomponent crystals.
The recent experiments on DNA-brick crystals indicate that self-assembly only occurs in a narrow temperature window11 and that it appears to be non-hierarchical. One might be tempted to assume that the nucleation of these extended structures is qualitatively similar to the (classical) nucleation of simple crystals, and therefore different from the highly non-classical nucleation that we reported in model studies of finite-sized addressable structures.22–24 However, as we show below, the nucleation of addressable crystals can also be surprisingly non-classical. Moreover, the compositional heterogeneity of the multicomponent unit cell has profound effects on the dominant self-assembly pathways well into the growth phase. We predict non-hierarchical self-assembly at the level of the simplest periodic structures, but our study also shows that we can design hierarchical pathways that achieve controlled multi-step nucleation on the way to the completion of the entire crystal. Understanding how to design such pathways will allow us to engineer increasingly complex crystals using a wider variety of molecular building blocks.25–27
In order for a specific three-dimensional crystal to be thermodynamically stable, the bonds between neighboring particles must be significantly stronger than any off-target interactions between subunits. We therefore consider a model crystal in which the building blocks only interact through ‘designed’ bonds. A simple example of a target structure is shown in Fig. 1a, where the designed bonds are indicated by lines connecting particles whose various colors indicate distinct component types. A single unit cell, which contains exactly one particle of each component, is highlighted. The connectivity of all particles in the perfectly formed crystal can be uniquely represented by a unit-cell graph, G, in which each vertex indicates a distinct component. The edges of G represent the designed bonds, either between particles that are physically adjacent within a single unit cell (solid lines in Fig. 1b) or between a particle and the periodic image of its neighbor in an adjacent unit cell (dotted lines). This distinction between types of edges depends only on how the graph is drawn and does not affect the thermodynamic properties of the crystal. Importantly, all topological properties of the infinite crystal are encoded in this unit cell.
The free energy per unit cell of the perfect crystal, relative to a dilute solution of free monomers, can be determined directly from the unit cell graph,
![]() | (1) |
and
are the edge and vertex sets, respectively, of the unit cell, β ≡ 1/kBT is the inverse temperature and kB is the Boltzmann constant. The chemical potentials of the components, {μv}, and the magnitudes of the designed bond energies, {εb}, may, in principle, be different for each component v or bond b. The final term accounts for the loss of rotational entropy in a completely assembled crystal, where qr is the rotational partition function of a monomer. Eqn (1) follows from the fact that the free energy of a macroscopic crystal must be extensive in the number of unit cells, and the equation ΔFG = 0 defines the phase boundary at which the defect-free crystal becomes stable. Although this suggests that self-assembly will only occur if ΔFG < 0, we shall see that it is possible to assemble stable, correctly bonded substructures under conditions where ΔFG > 0. In this regime, a structure with many missing components is substantially more stable than the defect-free target crystal. This entropic effect is much larger than in the case of simple crystals since not all subunits form the same number of designed bonds and, depending on the designed interactions, some particles may be bound much less strongly than others. However, this property of addressable structures is a strength, not a weakness: it means that, depending on the experimental conditions such as temperature and monomer concentrations, a given combination of building blocks can form a variety of structures with the same periodicity, but with very different patterns.
While a thermodynamic stability analysis is useful for estimating the phase boundaries of a multicomponent crystal, it does not predict whether a structure will assemble robustly. Thus, in order to study the kinetics of self-assembly, we calculate a free-energy landscape that takes into account all possible partial structures of the target crystal. These partial structures correspond to the connected clusters of the infinite target crystal. In a dilute solution, we can assume that the solution of correctly bonded clusters is ideal. The fugacity of a cluster represented by a connectivity graph g is then
![]() | (2) |
In order to predict the free-energy barrier for nucleation, we must account for all distinct clusters of the infinite target crystal. This includes clusters that span multiple unit cells and thus contain multiple particles of the same type. Because we can only explicitly consider finite unit cells in our calculations, we define a ‘super’ cell, G′, that consists of multiple adjacent copies of the fundamental unit-cell, G. However, we must take care to exclude unphysically bonded clusters that are introduced by edges between periodic images of neighboring particles. We may thus only examine clusters with fewer edges than Emax, the length of the shortest path connecting a particle with its periodic image in a crystal of super cells. For example, in the nine-unit-cell super cell shown in Fig. 1c, some connected subgraphs with Emax = 12 percolate through the super cell and are therefore unphysical.
Because the total number of distinct clusters grows exponentially with Emax, directly enumerating these clusters quickly becomes intractable. We instead group the clusters into sets with the same numbers of edges, E, and vertices, V, since these topological properties are the primary determinants of the cluster fugacity given in eqn (2). We determine the number of clusters in each (E,V)-set and then estimate 〈exp[kB−1ΔSr(g)]〉E,V in order to calculate the average fugacity of each set of clusters. (See ESI,† Section S1B for further details of these calculations.) Finally, we calculate the free energy of a cluster composed of a given number of particles, regardless of the identities of the particles involved,
![]() | (3) |
Besides providing a practical algorithm for calculating the free-energy landscape of an addressable crystal, our approach provides an important insight into the thermodynamics of these structures. By computing the total number of distinct connected clusters in a super cell of size |G′|, we are over-counting every distinct cluster by a factor |G′|/|G|. It is then clear from eqn (3) that all free-energy differences are in fact independent of our definition of the unit cell, assuming that the unit cell does not possess any rotational symmetries (see ESI,† Section S1C). We therefore find that the number of distinct components in the unit cell does not affect the thermodynamics of self-assembly, including both the height of the nucleation barrier, ΔF‡ ≡ max[F(V)] − F(1), and the unit-cell stability, ΔFG. All effects related to the use multiple distinct components instead originate from the non-periodic dimensions or any spatially heterogeneous motifs within the unit cell, as we shall see in the following sections. We discuss some practical aspects of this observation in Section IV.
ρ, where the per-component number density in the reference fluid is ρ = 1 × 10−5. While the results that we present here are specific to model DNA-brick crystals, these examples highlight many generic features of addressable multicomponent crystals.
The free-energy landscapes shown in Fig. 2c and d indicate that the kinetics of nucleation and growth are affected differently by the design of the unit cell. We note that for every cluster size V > 1, the cluster free energy of a semi-infinite structure, eqn (3), is determined by a subset of the connected clusters of the bulk lattice. In particular, the removal of certain maximally connected substructures, which form the greatest number of bonds for a given number of particles and are associated with the local minima of F(V), drastically alters the landscape. This effect can be seen in the divergence of the free-energy profiles of the l = 2a and z = 2a structures from the bulk profile in Fig. 2c and d. A more extreme example is the z = a slab (Fig. 2d), which has a planar connectivity graph, and, as a result, is missing the strongly connected substructures that give rise to the step-like behavior of the bulk free-energy profile. These examples show that the thermodynamic driving force for self-assembly, ∂F/∂V, can be significantly weaker for semi-infinite structures than for the bulk crystal under equivalent conditions. However, the free energies of small clusters, which determine the nucleation barrier, converge much more rapidly to the bulk values (Fig. 2b). This is because the non-classical nucleation barriers shown in Fig. 2c and d are primarily determined by the appearance of certain stable motifs, which may span multiple adjacent unit cells and are only weakly affected by the non-periodic boundaries of a semi-infinite structure. These observations imply that semi-infinite structures can be self-assembled at lower supersaturation than the corresponding bulk crystal without significantly affecting the nucleation rate. Simple crystals contain fewer defects when they are grown at low supersaturation, and recent simulation results5 have demonstrated that the same holds true for the assembly of addressable structures. The free-energy landscapes of semi-infinite structures are thus more favorable than those of bulk crystals for achieving defect-free self-assembly. In Section IIIC, we shall show how this feature can be exploited in the design of a hierarchical self-assembly pathway.
Our calculations show that the probability of completing each mesoscale feature increases sharply beyond a particular cluster size. Strikingly, we find that the dominant near-equilibrium assembly pathways are highly reproducible, despite the enormous number of potential on-pathway clusters. Furthermore, in both examples, the completion of these features occurs far beyond the initial nucleation barrier and deep into the growth phase of the crystal (i.e., the regime where the free energy is steadily decreasing with increasing cluster size). There is also a significant change in the slope of F(V) accompanying the completion of these features, confirming that the enclosing of a channel or a pore significantly enhances the thermodynamic driving force for subsequent growth. We stress that the cluster size at which a channel or pore is enclosed is unrelated to the number of components in the unit cell, and is in fact is considerably larger than the number of particles contained in a single unit cell. This example illustrates that, in order to predict the dominant growth pathway, it is necessary to consider maximally connected substructures that extend over multiple adjacent unit cells.
For both of the structures examined in the previous example, we choose a subset of bonds that stabilize maximally connected substructures, which correspond roughly to the local minima in homogeneous-bond-energy landscapes. These bonds, which are highlighted in Fig. 4a and b, are assigned bond energies −εB, while all other bonds are assigned −εA. We apply eqn (1) to predict the phase boundaries separating stable substructures, and then use the complete free-energy landscape to map the detailed phase diagrams shown in Fig. 4e and f. (For details, see ESI,† Section S4.)
![]() | ||
| Fig. 4 Hierarchical self-assembly pathways can be designed by selectively strengthening bonds in an addressable crystal. (a and b) Connectivity graphs representing the structures examined in Fig. 3 with specific substructures highlighted; unterminated edges continue into adjacent unit cells. Darkened edges are assigned bond energies −εB, while all other edges are assigned bond energies −εA. (c and d) Representative free-energy landscapes showing both hierarchical and non-hierarchical assembly pathways, corresponding to the labeled points in the phase diagrams below. Depending on the bond energies, a secondary nucleation barrier, ΔF‡‡, may appear following the assembly of the stabilized substructure. (e and f) Phase diagrams showing the conditions under which various substructures are globally stable, and the predictions of a simple stability analysis (dashed lines). Lines of constant nucleation-barrier height are shown in red. In (e), the phases are as follows: I, unassembled fluid; II, infinite cluster with type-B bonds; III, complete crystal; and IV, infinite cluster with type-A bonds. In (f), the phases are: I, unassembled fluid; II, 27-particle cluster; III, 30-particle cluster; IV, complete crystal; and V, mixed clusters with fewer than 24 particles. | ||
In the case of the self-assembling column (Fig. 4a, c and e), assembly may proceed via compositionally distinct phases. By choosing εB to be significantly larger than εA, we assemble a structure consisting primarily of type-B bonds that does not enclose the interior channel (phase II). On the other hand, we predict that crystals assembled in the hatched region of the phase diagram will enclose this channel, as shown previously in the case εA = εB (phase III). We also note that the height of the dominant nucleation barrier depends primarily on the strongest bonds, whereas the phase boundaries depend in general on both εA and εB. This means that rapid nucleation occurs closer to the phase boundary, i.e., at lower supersaturation, in phase II, where only part of the complete structure is stable. We therefore expect to assemble the target crystal with fewer defects by following an annealing protocol that passes slowly through phase II before entering phase III.
In the case of the perforated slab (Fig. 4b, d and f), a large difference in the two bond energies leads to multiple nucleation barriers, as well as phases in which finite-sized clusters are the lowest-free-energy structures. In the region marked phase III, we find that a fluid of 30-particle substructures is thermodynamically stable. This phase is separated from the completely unassembled fluid (phase I) by a narrow region of stability (phase II) of 27-particles clusters, which correspond to 30-particle clusters missing one of their four ‘arms’. Upon equilibration in both phases II and III, the free-energy landscapes indicate that a proliferation of clusters with type-B bonds will coexist with a mostly monomeric solution of the remaining unassembled components. In the hatched region of phase IV, the complete crystal is globally stable, although the first 30-particle substructure is kinetically separated from further growth by a large secondary nucleation barrier, ΔF‡‡. As a result, we predict that assembly in this region will proceed via a series of nucleation events, each having a well defined nucleation barrier. If this region is entered from phase III, then these nucleation events will correspond to the assembly of type-A ‘linker’ substructures between pre-assembled type-B clusters. It is important to point out that comparable behavior is not observed upon stabilizing the complementary substructure, such that εA > εB. Instead of monodisperse 24-particle clusters connected by type-A bonds, we find a stable phase (phase V) corresponding to a mixture of various substructures, including some that incorporate type-B bonds. This mixed phase arises because, in this case, the selected substructure is poorly stabilized by a low density of designed interactions, and, as a result, corresponds to a low-probability cluster on the homogeneous-bond-energy landscape. Thus, while it is in principle possible to stabilize any part of the unit cell with a judicious choice of bond energies, maintaining control over a multistep assembly pathway requires attention to the intrinsic stabilities of competing substructures.
• We find that classical nucleation theory fails to describe the self-assembly of addressable multicomponent crystals. In particular, in addressable structures, nucleation and growth can be tuned independently.
• Magic-number clusters play a key role in the nucleation of addressable structures and can even be made stable.
• In addressable crystals, the thermodynamics of self-assembly is insensitive to the number of distinct components used to construct a periodically repeating structure, provided that this number is large enough to specify the geometry of the fundamental unit cell.
• Depending on the conditions, the same set of building blocks can form a variety of patterns that are substructures of the complete lattice. Periodic structures of incompletely formed unit cells may be entropically favored over ‘perfect’ lattices.
• None of these findings are specific to structures with DNA-mediated bonds. These observations apply equally to addressable crystals that are periodic in all three dimensions, such as multicomponent zeolite-like structures.
Experimentally, this strategy for hierarchical assembly is currently realizable using DNA bricks. In this system, the hybridization free energies between subunits can be specified independently by rationally designing the complementary DNA sequences. Unlike the single optimal assembly temperature reported in ref. 11, we expect that an assembly protocol that passes through a stable cluster phase before forming the complete crystal would exhibit two distinct transition temperatures. Furthermore, the strategies described here for lowering the nucleation barrier near the fluid-crystal phase boundary are expected to reduce the hysteresis in the observed freezing and melting temperatures. As a result, we expect that such strategies will broaden the temperature range over which high quality crystals can be assembled. These predictions can also be tested with physically realistic Monte Carlo simulations using the model described in ref. 22 and 24. Through the analysis of dynamic assembly trajectories, simulations of this type could be used to validate the predicted growth pathways in structures with complex morphologies.
Finally, we stress once more that the general principles described in this paper are in no way limited to DNA bricks. These principles could also apply to other forms of multicomponent molecular assembly, or to the assembly of multicomponent colloidal structures, provided that the building blocks have been designed to reproduce the functionality assumed in our model.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c5sm01841b |
| This journal is © The Royal Society of Chemistry 2015 |