Hannah K.
Wayment-Steele†
,
Daan
Frenkel
and
Aleks
Reinhardt
*
Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK. E-mail: ar732@cam.ac.uk
First published on 31st January 2017
In the standard DNA brick set-up, distinct 32-nucleotide strands of single-stranded DNA are each designed to bind specifically to four other such molecules. Experimentally, it has been demonstrated that the overall yield is increased if certain bricks which occur on the outer faces of target structures are merged with adjacent bricks. However, it is not well understood by what mechanism such ‘boundary bricks’ increase the yield, as they likely influence both the nucleation process and the final stability of the target structure. Here, we use Monte Carlo simulations with a patchy particle model of DNA bricks to investigate the role of boundary bricks in the self-assembly of complex multicomponent target structures. We demonstrate that boundary bricks lower the free-energy barrier to nucleation and that boundary bricks on edges stabilize the final structure. However, boundary bricks are also more prone to aggregation, as they can stabilize partially assembled intermediates. We explore some design strategies that permit us to benefit from the stabilizing role of boundary bricks whilst minimizing their ability to hinder assembly; in particular, we show that maximizing the total number of boundary bricks is not an optimal strategy.
DNA brick self-assembly is also perhaps the best example of a viable addressable8 self-assembled system: each subunit in the target structure is unique, and knowing the identity of a particle therefore means knowing its location, and vice versa. Systems with addressable complexity and their potential for designing structures with arbitrary shape and complexity have great promise in fields ranging from medical applications to nanoelectronics.9
A recent application of DNA bricks as a nano-breadboard for chromophore-based excitonic gates10 exemplifies the benefits of DNA bricks over DNA origami. In excitonic devices, where the FRET radius is less than 5 nm, it is necessary to have nanometre-scale control over the placement of chromophores. DNA bricks have an advantage over DNA origami by having twice the spatial resolution: it is difficult to functionalize the scaffold strand in DNA origami, and thus only the staple strands, one out of two strands in any helix, are available for functionalization.10 By contrast, in DNA brick structures, all strands are available for functionalization. In such technologies, the excitonic transmission behaviour is challenging to predict and the modular nature of DNA bricks allows for straightforward modification of structures, permitting a number of possible layouts to be tested and screened rapidly.10 Moreover, these benefits may prove useful for other applications as well, for instance in scaffolding for multi-enzyme complexes for single-molecule reactions,11 molecular rulers,12 inorganic nanoparticle synthesis13 and nano-robots.14
In DNA brick structures, the final structure is designed to be the thermodynamic product, a benefit over folding assembly structures, where it is difficult to predict if the designed target structure is the preferred equilibrium structure.9,15 However, DNA brick assembly has a much more complex pathway to assembly.16–18 Because there are a vast number of intermediate states that all have similar energies, DNA bricks are very prone to kinetic traps.9,19 This is a disadvantage in comparison to folding assembly, where the constraint offered by having all interacting particles on a backbone offers more direction to the final assembled state. Experimentally, typical DNA brick yields range from a few per cent to 30 per cent,2 whereas yields for some DNA origami structures are approaching 99%.20,21 However, because we have control over interactions between DNA brick subunits, it should be possible to design interactions that can direct the assembly to avoid kinetic traps.
One design strategy implemented for increasing the yield in experiment was to include larger bricks at the surfaces of target structures.2 Because bricks are staggered in the xz and yz planes of the structures, at the faces of a structure, in alternating rows a brick must be bisected (Fig. 1(a)), leaving behind 16-nucleotide half-bricks in every other row (Fig. 1(b)). These half-bricks were then connected to the bricks in the row preceding them to form larger 48-nucleotide bricks, termed ‘boundary bricks’ (BBs) (Fig. 1(c)). The use of BBs was shown to increase the yield by a factor of 1.4 and was implemented for all subsequent structures in the experiments of Ke et al.2
Despite the effectiveness of BBs in increasing the yield, the cause of this observed effect has not been well studied. There are two principal mechanisms one can envisage by which BBs could lead to an increase of the yield. Firstly, because they are larger and have more interaction domains than regular bricks, they could serve as a larger seed particle to promote nucleation. Secondly, they may stabilize the final structure by binding the edge half-bricks that have fewer interaction points to bind to the rest of the structure.
To investigate the effects of BBs on the DNA brick nucleation and assembly process, in this work we extend a simulation-based model previously used with success to describe DNA bricks16,17,22–24 to include BBs. With this model, we use Monte Carlo simulations to show that depending on the location, BBs may differ in their contribution to increasing the nucleation rate or stabilizing the final structure. We also demonstrate that BBs are more prone to aggregation than regular bricks, and we suggest a method to overcome this whilst still benefiting from the stabilizing effect of incorporating BBs.
In an extension of the previous model,16 we model boundary bricks as dimers of these particles, i.e. as two patchy particles that are connected by a rigid bond of length corresponding to their distance in the target structure (Fig. 2), with (at least) two patches on one of the particles given a poly-T sequence27 to passivate them (cf.Fig. 1(a)).28 The two particles connected in this way are also fixed in their orientation with respect to one another.29
To investigate the effect of BBs on structure nucleation and final stability, we perform simulations of a DNA brick structure with 374 bricks in the canonical ensemble (i.e. at a constant number of particles, volume and temperature). In the majority of simulations reported here, the density of each type of brick was set to 1/(62a)3, where a is the lattice grid parameter, and a single copy of each brick that appears in the target system was present in the simulation box. The simulated structures contain up to 84 boundary bricks on the faces in the xz and yz planes (Fig. 3), as described below. Simulations were run for structures with DNA-specific interactions between bricks. For each set of bonded patches in the final target structure, complementary sequences were randomly generated and assigned. The interaction strength of hybridization between two complementary DNA strands largely depends on the proportion of paired guanine/cytosine (GC) pairs in the sequence. The average GC content for the structure studied was 44.6%, with standard deviation 14.2%. Each patch can interact with every other patch provided the patches point at each other and the corresponding particles are diagonally adjacent to each other, and the energy of interaction corresponds to the hybridization free energy obtained from the SantaLucia thermodynamic model.30
In simulations investigating whether BBs in different locations of the final structure had differing impacts on structure nucleation and final stability, we chose to include various sets of BBs out of the total possible 84 BBs for this structure. One subset consisted of 26 BBs on the edges of the cubic structure, henceforth referred to as the ‘edge-BB structure’, shown in red in Fig. 3. The other subset included 26 BBs at the centre of the faces of the cubic structure, henceforth referred to as the ‘face-BB structure’, with BBs shown in green in Fig. 3. The structure with all possible BBs31 (red, green and blue) will be referred to as the ‘all-BB structure’. In the original experimental design, bricks with sequence domains on the external faces of the structure had those domains either removed or replaced with non-interacting poly-T sequences.2 Our model similarly passivates patches of bricks that are on the external face by assigning them a poly-T sequence. We ensured that there is no significant difference in the number of passivated patches on BBs between the edge and face conditions.
In addition to simulations of structures with specific DNA interactions, we have also run simulations of structures with designed interactions all having the same fixed interaction energy. This simplification still retains the specificity of interactions required for addressable self-assembly, since patches still have a unique identity and only interact with specific patches with which they were designed to bond in the target structure, but removes variation in interaction strengths that arises from DNA sequence dependence. In such simulations, all designed interactions were assigned a fixed interaction energy of ε/kB = −4000 K, which corresponds roughly to an average sequence interaction strength at 320 K.17 All other (‘incidental’) patch–patch interactions were set to zero.
We can estimate the effect of BBs on increasing the nucleation rate by measuring the mean first-passage time (MFPT). This approach is commonly used when computing nucleation rates in molecular dynamics simulations;32 the mean first-passage time corresponds to the time needed on average for a stochastic process to reach a certain state for the first time. Although MC simulations do not faithfully reproduce the various time scales that may be involved in a dynamic process, we can nevertheless estimate the relative effect of BBs on the nucleation rate by calculating the MFPT for the different BB structures, using MC steps as a time step approximation.
In order to compute the MFPT for nucleation in our simulations, we recorded the number of MC steps required for the size of the largest correctly assembled cluster in the system to reach 8 particles and large-scale growth to begin. This cluster size was chosen since we have previously shown that the critical cluster at temperatures at which DNA bricks can nucleate usually comprises 8 bricks,17 and the largest cluster in the system typically grows rather than shrinks once clusters grow beyond this size. The reason for this well-defined critical cluster size is that 9 bricks are required to form a subunit with two closed ‘cycles’ (i.e. closed loops of particles that are bonded to one another), and, as a monomer comes in to close a cycle, two bonds are formed simultaneously, energetically stabilizing the resulting structure at roughly the same entropic cost. The critical cluster structure comprises one brick less than this stabilized bicyclic motif.17
The difference in the free-energy barrier height relative to the system with no boundary bricks, , is calculated from the ratios of the average nucleation rate R, which is the reciprocal of the MFPT. We use the classical nucleation theory relation33
R = NSZj![]() | (1) |
![]() | (2) |
The relative changes in the free-energy barriers are in agreement with the trends observed qualitatively from monitoring the largest cluster over time, with the edge BBs having the smallest reduction in the free-energy barrier, followed by face BBs, and finally followed by the system with all possible BBs. Although the latter system has 3.2 times as many boundary bricks as do the edge and face-BB systems, there is only a relatively small decrease in the free-energy barrier from the edge and face-BB systems to the all-BB system. This is perhaps not particularly surprising, since the reduction of the free-energy barrier is affected principally by the bricks first involved in nucleation, not their overall number.
The difference in the MFPT between the face-BB structures and edge-BB structures is interesting, as in many ways besides the obvious difference in the location of the BBs, the structures are identical. Both have 26 BBs, and they have no significant difference in the number of interacting patches or GC content. The same trend is also observed in FDI simulations, in which all designed interactions have a fixed interaction energy, indicating this is not an effect of GC content in the structures. The increased nucleation rate in face-BB structures likely arises because there are more interactions between face BBs and non-face bricks than there are interactions between edge BBs and non-edge bricks, since the latter have fewer neighbouring molecules.
We first consider the frequencies of bricks in initial nucleation clusters for simulations of structures with assigned DNA sequences, where every pair of patches can interact with an interaction energy based on the SantaLucia thermodynamic model. Intuitively, we would expect that the stronger the bonding of a particle's patches is, the more likely it is for a particle to be found in the initial nucleus. Indeed, this is what is largely observed (Fig. 5(i)(a)): bricks with a higher average GC content (and hence stronger bonding) are more likely to be present in the critical nucleus. Intriguingly, it is not the bricks with the highest GC content that dominate; instead, nucleation tends to occur in regions where several neighbouring bricks have a high GC content. In other words, it appears that designing preferential nucleation pathways would require a careful analysis of not only the bonding strength of individual particles, but how they come together in the final structure, making it a more difficult task than it might first appear. We propose to investigate this interplay of factors more systematically in future work.
However, boundary bricks have a dominant effect as far as nucleation is concerned, and as soon as boundary bricks are added to a system, the small random variations in GC content that seem to determine the nucleation behaviour for the system with no BBs (Fig. 5(i)(a)) no longer play any significant role in determining a particle's nucleation propensity. For the structure with DNA-specific interactions and all 84 BBs present (Fig. 5(i)(b)), BBs are predominant in the initial nucleation cluster: the increased number of interactions per BB when compared to a ‘monomer’ brick favours boundary bricks as preferred sites for nucleation to occur.
The same observation holds for the edge-BB and face-BB structures (Fig. 5(i)(c) and Fig. 5(i)(d)). These simulations demonstrate that it is possible to tune the nucleation site to different parts of the structure depending on where the BBs are located. The edge-BB structure nucleates essentially only at the edge BBs, and the face-BB structure nucleates only at the face BBs. The locations with the highest nucleation frequencies did contain the single BB among each subset with the highest GC content; however, beyond this, there was no significant correlation between nucleation frequency and GC content, perhaps indicating that the BB with the highest GC content is fastest to nucleate.
The nucleation location in structures with SantaLucia interactions is driven by both the location of BBs and the location of bricks with high GC content. BBs have a stronger proclivity for nucleation than monomers, and by selecting which bricks are bonded to others as BBs, we are able to control the nucleation site on the structure. For both the case without BBs and with BBs, bricks with higher GC content are involved in nucleation, though the exact effect of GC content on nucleation cannot be well understood from only one structure. Nevertheless, this finding could be a very useful tool in the rational design of self-assembly pathways of DNA brick structures.
We can investigate the underlying behaviour that is solely due to boundary bricks notwithstanding the effect of having varying interaction strengths across the system by considering the frequencies of bricks in initial nucleation clusters for the case of fixed designed interactions (Fig. 5(ii)). This system allows us to focus on only the size and geometry effects of BBs on nucleation, without the complication of non-uniform interaction strengths of DNA sequences. For the FDI structure with no BBs (Fig. 5(ii)(a)), nucleation appears to be dispersed throughout the volume of the structure, with bricks on the faces somewhat less likely to be involved in nucleation clusters. Since such bricks have non-interacting patches on the outside and therefore fewer possibilities for bonding, this behaviour is entirely consistent with expectations.
For the FDI structure with all 84 BBs present (Fig. 5(ii)(b)), nucleation is again largely confined to the faces and edges of the cube, where the BBs are located. Notably, nucleation essentially never occurs in the body of the structure. The presence of BBs, because they are larger units with more interaction sites per unit than regular bricks, causes nucleation to shift to the outer regions of the cube. This is further demonstrated by the FDI structures with only edge BBs (Fig. 5(ii)(c)) and only face BBs (Fig. 5(ii)(d)), where nucleation occurs primarily on the edges and faces, respectively, and is consistent with the behaviour seen in simulations with full sequence-dependent interactions: the nucleation propensities shown in Fig. 5(i)(b–d) for full sequence-dependent interaction simulations largely correspond to those of Fig. 5(ii)(b–d) of the analogous FDI simulations, demonstrating that the influence of GC content on the nucleation location is minimal as soon as boundary bricks are included.
Since the entropic cost of binding a boundary brick is comparable to that of binding a non-brick monomer, but the degree of bonding can be greater, we expect that boundary bricks will stabilize the target structure in the sense that it can grow to a larger size even with a fixed-temperature growth protocol. In single-target simulations, the boundary bricks behave largely in the way we would expect them to: the all-BB structure grows to the largest final size in both fixed-temperature and gradually cooled set-ups (Fig. 6), as it has the largest number of boundary bricks to stabilize it.
However, the all-BB structure initially nucleates at very high temperatures, and it is only at approximately 318 K that the target structure is nearly complete with few errors. Below this temperature, monomer nucleation is evidently too facile, which prevents successful assembly later on in the self-assembly process, as the probability of exactly aligning and forming all the right bonds to connect two larger clusters is prohibitively low. At low temperatures, self-assembly in simulations with all brick types becomes less and less favourable.
Interestingly, even though the edge-BB structures nucleate less rapidly than the face-BB structures, edge-BB structures had the second highest degree of final assembly across all optimal assembly temperatures (Fig. 6). This suggests that edge BBs are able to stabilize the final structure and bind bricks on the edge that would be entropically favoured to be unbound,17 and are able to do so more effectively than face BBs. In Fig. 6(a), we show a typical example of a large correctly assembled cluster for each of the BB structures formed in constant-temperature simulations. In particular, the face-BB structure shown is missing all four edges. Of course, this is not wholly surprising, since the edge monomers typically have fewer bonds than the face monomers, and boundary bricks therefore play a much more significant role by comparison. This observation is supported by the fact that the face-BB structures, which only have normal monomer bricks on their edges, have nearly identical assembly sizes as the no-BB structures once this size has reached a plateau in constant-temperature simulations.
![]() | ||
Fig. 7 Snapshots of typical configurations with multiple target structures in the simulation box, for systems with different boundary bricks at different temperatures (as labelled). These configurations are taken from constant-temperature simulations once the cluster sizes have stabilized over time. Clusters of particles bonded in the same way as in the target structures are shown in the same colour; different colours are used for clusters that are not connected to each other. Boundary bricks are connected by violet bonds, as in Fig. 2. |
System | T/K | Number of all clusters | Number of large clusters | Mean size of large clusters | Median size of large clusters |
---|---|---|---|---|---|
(a) Face BBs | 322 | 3.4 | 1.2 | 268 | 268 |
(b) Face BBs | 319 | 6.2 | 4.3 | 272 | 285 |
(c) Edge BBs | 319 | 4.5 | 2.5 | 314 | 314 |
(d) All BBs | 327 | 11.5 | 3.5 | 204 | 195 |
Of the systems studied, simulations with face-BB structures exhibit nucleation and growth over the largest range of temperatures. The temperature largely controls the number of large nuclei in the system: at 322 K, only a single structure grows to an appreciable size (Fig. 7(a)), whilst at 320 K, up to 4 nearly complete structures self-assemble. At 319 K, many simulations result in the successful self-assembly of roughly the same number of clusters (Fig. 7(b)), but in a number of cases, these clusters merge incorrectly, and so the resulting structure can be classed as a kinetic aggregate. The behaviour of systems with edge-BB structures is similar, and, in keeping with the monomer results (see e.g.Fig. 4(c) and 6(b)), nucleation occurs at somewhat lower temperatures. However, aggregation is not shifted by the same amount in temperature, and so the range over which self-assembly occurs is narrower (roughly 319 K to 320 K), and the number of large structures that grow successfully at this lower temperature is also smaller (typically only 2 at 319 K, Fig. 7(c)). As with single-target simulations, the protocol used for self-assembly is important: although clusters grow to medium sizes in multiple-target simulations after successful nucleation has occurred, the largest clusters can be made to grow essentially to completion if the system is subsequently cooled.
Finally, in simulations with all possible BBs, nucleation begins at very high temperatures (∼330 K), consistent with single-target simulations. However, the clusters do not grow significantly at such high temperatures. As the temperature is decreased, the self-assembly process becomes very error-prone; whilst a single target structure typically grows much larger than the remaining structures, it often lacks the necessary components that have been used up to form other, smaller clusters already (see Table 1), and so structures grow with large sections missing. For example, in Fig. 7(d), showing simulation results at 327 K, several of the walls have nucleated separately from the rest of the target structure, making further growth very difficult. At 326 K, several simulations resulted in the successful growth of a single target structure (out of a possible 6 that could grow from the monomers), but in a similar number of simulations, no single target structure grew to a large size. Of course, since there are many monomers in solution, it is in some sense easier to assemble a single copy of the target structure than in single-target simulations; however, assembling multiple target structures simultaneously is difficult, since too many clusters nucleate and it is not straightforward then for them to meet in the correct geometry to form larger structures, and it appears Ostwald ripening is also not particularly fast. The choice of which boundary bricks to include when assembling a given target structure therefore appears to be a very important consideration in DNA brick self-assembly, and it appears from our simulations that opting for all possible boundary bricks is not the most favourable design choice.
One possible way of reducing the propensity for nucleation when many boundary bricks are included is to reduce their concentration relative to the remaining monomer bricks. To a first approximation, it is reasonable to assume that the chemical potential of a species appears in the same place in the hamiltonian as the binding energies. Increasing or decreasing the chemical potential of a species, for example by changing the species concentration, is thus effectively equivalent to shifting the strength of all the interactions of that species. We have run simulations of the all-BB system with only half the boundary bricks present. In keeping with expectations, the point at which nucleation occurs in brute-force simulations shifts to lower temperatures, and, with fewer clusters forming, multiple clusters can grow to larger sizes. Choosing an appropriate ratio of initial concentrations is therefore a further control parameter that can be tuned to improve assembly yields.
In our simulations, we observe both point defects as well as larger misbonded aggregates and missing features in the target structure. As far as we are aware, experiments on DNA brick systems have not thus far focussed on characterizing the nature of assembly errors in self-assembly, and indeed such defects may be rather difficult to probe experimentally; however, if we wish to ensure a faithful assembly of the complete target structure, this issue may be of great importance for the future of the field.
It is important to bear in mind that our results correspond to a simple ‘toy model’ of DNA bricks. In reality, many effects that we have neglected may also be important, yet the system sizes involved are such that they make simulations with a more realistic potential intractable at present. However, the simplicity of our model suggests that our findings reflect the fundamental underlying physics of addressable self-assembly.
By simulating structures with varying placement of BBs in the canonical ensemble, we have shown that BBs located on the faces of the cubic target structure were primarily responsible for increasing the nucleation rate, whilst BBs located on the edges of the structure were primarily responsible for the stability of the final target structure. However, we have also found that structures that included BBs on all four possible faces were prone to misassembly, particularly in multiple-target simulations, as nucleation is too facile and multiple competing nuclei can grow and are subsequently unable to come together in the correct manner. This indicates that a strategy where all possible DNA strands that can be fused into boundary bricks actually are, as was done in previous experimental work, may not in fact be the optimal choice; a more careful consideration of the possible mechanisms of assembly and misassembly is warranted.
While the self-assembly pathways behave in fairly predictable ways in simulations where all patch–patch interactions are of the same strength, further complications arise when DNA sequences are taken into account, since the dominant nucleation locations depend on the strengths of the nearby interactions. We have briefly investigated this effect by examining the GC content of the brick structures, and we found that it was regions with a higher than average GC content that were most likely to nucleate first, rather than necessarily single bricks with an especially high GC content. It would be particularly interesting to investigate this behaviour further and determine whether any simple rules that govern the nucleation location as a function of interaction strength can be identified. However, although the GC content seems to play a significant role for structures without boundary bricks, as soon as BBs are included, the nucleation location is almost completely dominated by the BBs: in structures where BBs are localized to the edges, the edges were involved in nucleation, whilst in structures where BBs are localized to the faces, nucleation occurred primarily on the faces.
However, we have shown that BBs affect more than just nucleation. Since they entail the formation of more bonds, the bonding of a BB to the growing structure results in a more favourable enthalpic contribution to the free energy than a monomer brick would give, whilst the loss of entropy is only marginally more disfavourable. This means that target structures can grow larger at a given temperature than they would for a system without boundary bricks. In particular, boundary bricks can stabilize any ‘fine structure’ on the surface of the target structure, which could be especially important for those practical applications for which the assembled structure must be as perfect as possible.
Although boundary bricks do allow us to construct structures that are more ‘perfect’ in their final assembled state, including them can be something of a double-edged sword, since they not only stabilize the final structure, but are also easier to nucleate, which means they are more prone to misassembly and aggregation. This may to a significant extent negate the benefit of brick self-assembly being a nucleation-initiated process. In practice, it might be necessary to balance the two effects. For example, it may be possible to increase the yield by keeping the concentration of BBs lower than that of the monomer bricks or keeping the temperature higher for longer in order to keep nucleation a sufficiently rare event. It could also be possible to make the average bonding strength in boundary bricks weaker than that of the remaining monomer bricks, reducing the propensity for premature nucleation, whilst still allowing a degree of stabilization of the final structure. However, we have found that in our simulations, even when multiple structures were allowed to compete with one another in the same simulation box, there was a range of temperatures at which nucleation was rate limiting, but nevertheless sufficiently common for multiple target structures to grow essentially to completion even when boundary bricks were included.
Including all possible bricks was not particularly advantageous for assembly in multiple-target simulations, and including only face or only edge BBs resulted in self-assembly that was much less error prone. We envisage that a careful consideration of the types of boundary brick to include to maximize the yield and the quality of the target structures will be particularly important when looking at more complex structures than the ones we considered here. We have only looked at a cubic target structure in this work, as such a system is easiest to study systematically. When target structures include a complex array of peaks and troughs, the choice of the types of boundary brick which will stabilize the target structure whilst minimizing misassembly is considerably less straightforward. Having a clear design strategy is even more important for such systems, but intuition alone may not be enough; indeed, a simple simulation with our coarse-grained potential may well provide a convenient design tool for this purpose. Our simple coarse-grained potential may provide a useful first approximation when faced with a realistic design problem involving DNA bricks, and we hope that our work will provide useful insight to experimentalists interested in the practical applications of such systems.
Footnote |
† Present address: Department of Chemistry, Stanford University, Stanford, California 94305, USA. |
This journal is © The Royal Society of Chemistry 2017 |