Open Access Article
Beth Hsiao-Yen
Wei†
a,
C. Levi
Petix†
b,
Qizan
Chen
a,
Michael P.
Howard
*b and
Jeetain
Mittal
*acd
aArtie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, USA. E-mail: jeetain@tamu.edu
bDepartment of Chemical Engineering, Auburn University, Auburn, AL 36849, USA. E-mail: mphoward@auburn.edu
cDepartment of Chemistry, Texas A&M University, College Station, USA
dInterdisciplinary Graduate Program in Genetics and Genomics, Texas A&M University, College Station, USA
First published on 22nd December 2025
Computational methods for designing interactions between colloidal particles that induce self-assembly have received much attention for their promise to discover tailored materials. However, it often remains a challenge to translate computationally designed interactions to experiments because they may have features that are too complex, or even infeasible, to physically realize. Toward bridging this gap, we leverage relative-entropy minimization to design pair potentials for binary mixtures of colloidal particles that assemble crystal superlattices. We reduce the dimensionality and extent of the interaction design space by enforcing constraints on the form and parametrization of the pair potentials that are physically motivated by DNA-functionalized nanoparticles. We show that several two- and three-dimensional lattices, including honeycomb and cubic diamond, can be assembled using simple interactions despite their complex structures. We also find that the initial conditions used for the designed parameters as well as the assembly protocol play important roles in determining the outcome and success of the assembly process.
Design, System, ApplicationWe present a constrained inverse design strategy that uses relative-entropy minimization to discover simple, experimentally relevant pair potentials for binary colloidal mixtures. By imposing physically motivated constraints inspired by DNA-functionalized nanoparticles, the design space is reduced to parameters directly linked to experimentally tunable variables, enabling the assembly of complex two- and three-dimensional superlattices, including honeycomb and cubic diamond. The system studied, binary colloidal particles with tunable attractions and repulsions, is broadly important for engineering materials with tailored optical, photonic, and catalytic properties. By focusing on experimentally realizable interaction forms rather than highly flexible spline potentials, we show that even low-coordination lattices can be assembled robustly while maintaining a clear path toward physical implementation. This work bridges the gap between computational inverse design and experimental colloidal self-assembly, providing a practical framework to design and realize functional materials with complex architectures. |
While inverse design has had considerable computational success13,33 in identifying interactions for specific target structures—avoiding exhaustive parameter sweeps34,35 that identify all structures in a space and become prohibitive in high-dimensional space—translation to experiment remains far more limited.36–39 One major challenge is that pairwise potential energy functions, representing the effective interaction between two particles, are typically designed computationally, but creating particles that will produce these interactions can be difficult if the pair potentials do not have physically interpretable parameters. Pair potentials having no experimental realization may even be designed if physical requirements are not enforced in the optimization. For example, relative-entropy minimization has been successfully applied many times to design pair potentials represented using Akima splines.20–23,26,40 Splines have great flexibility in the functions they can represent and so are useful for finding a feasible design, but this flexibility may produce features such as high-frequency oscillations or plateaus that do not readily map to a known particle surface chemistry. Spline potentials can be constrained20 or filtered40 to remove some of these features, but the resulting interactions are still typically challenging to realize. Beyond spline representations, functional forms with many adjustable parameters can also yield interactions that are hard to realize experimentally.41,42 Using multiple types of particles has also been proposed as a strategy to simplify interaction complexity at the expense of compositional complexity;43 however, spline potentials designed for binary mixtures of particles using relative-entropy minimization were not necessarily simpler than those designed for equivalent one-component assemblies.23
We postulate that this challenge is not caused by a nonexistence of experimentally producible designs. The loss function guiding the design may have local minima and/or regions with low curvature near a global minimum, suggesting that other, simpler interactions may produce acceptable or near-optimal structures. Indeed, forward modeling has shown that colloidal particles interacting through simple, often experimentally-motivated, pair potentials can assemble a variety of structures.44–46 However, assembly of open lattices with such potentials remains difficult; cubic diamond has remained particularly challenging despite great interest because of its desirable for photonics properties.47,48 Additionally, these simple interactions may not be selected by inverse methods if they are less optimal than other, more complex interactions that are admissible. Inverse-design methods can be forced to consider only these simpler interactions by designing physically motivated pair potentials with parameters that are directly related to experimentally controllable variables.49–51 The tradeoff is that it may now be more difficult, or even impossible, to produce the desired structure than with a spline potential because the particle interactions are more constrained.
At the same time, DNA-functionalized particles (DFPs) are known to be a versatile material platform of particular interest for colloidal self-assembly. Simple pair potentials,52–54 including Fermi–Jagla55,56 and Lennard-Jones-nm57 potentials, have been developed to capture the effective interactions between DFPs, such as hybridization-driven attraction and repulsion due to chain overlaps, and have been experimentally validated.58–63 While experimental self-assembly of single-component crystals has mostly been limited to simple close-packed structures,62,64,65 binary mixtures of DFPs offer a richer design space because stoichiometry and differences in particle interactions can be exploited to stabilize structures that would be inaccessible with only a single type of DFP.62,66
In this work, we use relative-entropy minimization to optimize pair potentials for binary mixtures of colloidal particles under physical constraints that reflect known features of DFP interactions, namely tunable repulsion ranges and attraction strengths, as well as symmetries of the superlattice. Our goal is to show that simple interactions can be designed to assemble complex crystal superlattices for binary particle mixtures, even though they should not form with only one particle type. We target four lattices of varying complexity: square and honeycomb in two dimensions as well as body-centered cubic (BCC) and cubic diamond in three dimensions. We demonstrate that all these superlattices can be successfully designed for using a bulk isochoric temperature cycling protocol; however, we find that only some designs subsequently reliably self-assemble under bulk isothermal compression. We show that one reason for this behavior may be sensitivity to the initial choice of parameters for optimization, consistent with our postulation about the viability of multiple designs. We additionally probe how our designed interactions perform under dilute conditions typically used in experiments, highlighting the important role of protocol in both the design and assembly processes.
![]() | (1) |
![]() | (2) |
Our target crystal superlattices (Table S1) are formed from binary mixtures containing particles of types A and B, so in principle, 9 parameters are needed to specify the potential energy function corresponding to eqn (1). However, we were able to reduce the dimensionality and size of this design space using physical insights, which is an advantage of performing inverse design using a physically motivated pair potential. Specifically, we assumed that all particles had the same repulsion strength, coming primarily from the core nanoparticle and grafted DNA chains, at the design temperature, so εAA = εAB = εBB = ε, where ε can be considered the unit of energy. Further, the symmetry of the 4 lattices we designed (Fig. 1, Table S1) implies that σAA = σBB and λAA = λBB. Hence, only σAA, σAB, λAA, and λAB needed to be treated as independent parameters. In preliminary tests, we found σij to be numerically challenging to optimize concurrently with λij because eqn (1) is a highly nonlinear function of σij. We accordingly considered 4 parameters θ = (σ6AA, σ6AB, λAA, λAB) for optimization. Based on forward modeling and experiments,57,69,70 we constrained this parameter space to be consistent with 0.5σ ≤ σij ≤ 2σ and 0 ≤ λij ≤ 1, where σ is the unit of length.
![]() | ||
| Fig. 1 Designed pair potentials and representative snapshots for each crystal (top row) along with g(ij) measured between like (middle row, filled line) and unlike (bottom row, filled line) particles, compared to the target g(ij)0 (dashed line). The four crystal structures are: (a)–(c) square, (d)–(f) honeycomb, (g)–(i) BCC, and (j)–(l) cubic diamond. Snapshots of the target structures (inset) were rendered using VMD (version 1.9.4).77 | ||
To design the parameters θ that self-assembled a specific lattice, we minimized the relative entropy,22,71,72
![]() | (3) |
![]() | (4) |
To evaluate g(ij)0, initial configurations that contained at least 1000 total particles were generated at a number density and stoichiometry consistent with the target structure (Table S1). Each particle was tethered to a lattice site by a harmonic potential ϕ(r) = kr2/2, where r is the distance of the center of the particle from the lattice site and k is a spring constant. A target ensemble of particle configurations was then simulated by drawing an independent random displacement for each particle from a Gaussian distribution with zero mean and variance (βk)−1. We chose a spring constant k = 1000ε/σ2, which we found to produce well-defined but numerically integrable peaks in g(ij)0, and used 1000 configurations to compute g(ij)0 up to a distance of 5σ with a bin spacing of 0.05σ.
Relative-entropy minimization was carried out with relentless73 (version 0.1.1 with operations modified to support volume resizing) using HOOMD-blue75 (version 2.9.7). Langevin dynamics simulations were performed to evaluate g(ij) in eqn (4). The simulation timestep was 0.001τ and the particle friction factor was 0.1m/τ, where m is the mass of a particle (taken to be the same for both types) and
is the unit of time. The particles were initialized in the target structure, then equilibrated for 3 × 104τ at temperature T = 3ε/kB to attempt to melt the crystal. We found this initialization protocol to be necessary because it was often not possible to place the particles in randomized or fluid-like configurations without overlap for suboptimal values of σij. The temperature was then slowly quenched from 3ε/kB to 1ε/kB over 2 × 104τ at a constant rate. The radial distribution function g(ij) was finally sampled every 10τ during a 104τ production period with the same maximum distance and bin spacing as was used for g(ij)0. The parameters θ were then adjusted to attempt to minimize Srel using gradient descent with a step size chosen for each parameter such that the change in σ6ij was approximately 0.25σ6 and the change in λij was approximately 0.05 for the first iteration. We initialized σ6ij to be consistent with the position of the first peak in g(ij)0 based on physical knowledge that it relates to the size of the particle and DNA corona, while λAA and λAB were initialized to 0 and 1, respectively, based on previous work.43,57 The optimization was considered converged when the absolute values of all components of the gradient were less than 0.01 in their respective units or further progress could not be made due to the box constraints on the parameters.
Encouraged by this success, we proceeded to design the remaining lattices that we anticipated to be more challenging than square: honeycomb (Fig. 1d–f) is a low-coordinated two-dimensional lattice, BCC (Fig. 1g–i) is a three-dimensional lattice, and cubic diamond (Fig. 1j–l) is a low-coordinated three-dimensional lattice. As for the square lattice, σAA typically remained close to its initial value, set by the first peak in g(AA)0, but σAB tended to decrease for all three structures (Fig. S1). For honeycomb, λAB departed from its initial value but quickly returned; the same happened with λAA for BCC. Hence, the interactions for both honeycomb and BCC were purely repulsive between like particles and a Lennard-Jones attraction between unlike particles. Interestingly, the cubic diamond was found to have a nonzero λAA when the optimization converged, corresponding to a weak attraction between like particles in addition to a Lennard-Jones attraction between unlike particles. Such attraction can be achieved by mixing two different DNA strands on each particle78,79 or via linker-mediated binding.79
Overall, our designed interactions are consistent with a pattern observed in a forward modeling study of DFPs:57 the denser crystals (square and BCC) were assembled by competing repulsive and attractive interactions that were closer in range than those of the less dense crystals (honeycomb and cubic diamond). For the less dense crystals, such long-ranged repulsions between like particles can be attained in experiments by shifting the location of the complementary “sticky” DNA sequence from the tail to the middle of the molecule to enhance shell overlap.57 The difference in ranges can be approximately quantified through σAB/σAA, which was about 1.5 (square) and 1.2 (BCC) for the more dense crystal structures and about 1.9 for the less dense crystals (Table S2). This finding is analogous to previous studies that demonstrated the influence of interaction ranges on DFP self-assembly, for instance, different DNA-linker to particle size ratios producing different structures.62,80,81
We further note that it appeared that some of the interaction parameters would slowly evolve with more iterations of gradient descent, particularly for challenging designs like cubic diamond (Fig. S7); however, the optimization stopped because the convergence criteria were satisfied. To investigate whether continued optimization would improve the design, we performed additional iterations with an order of magnitude stricter tolerance. As expected, the parameters continued to evolve but eventually converged (Fig. S2). Counterintuitively, these converged variables performed worse during isothermal compression (see section III.B), producing more polymorphic crystals with greater variability between independent simulations compared to the original convergence criterion (Fig. S3). This behavior reflects our operational definition of convergence, under which the parameters are considered converged even though small changes may continue with further iterations. The target g(ij) were also well-matched at both convergence thresholds, supporting the notion that multiple acceptable designs exist within a relatively shallow region of the relative entropy function landscape and that some designs may be more robust under different assembly protocols depending on the specific numerical details of the optimization procedure. The design that is found depends on the optimization method, convergence criteria, and initial guess. We will discuss some of these considerations later.
We hence simulated another assembly protocol based on isothermal compression to validate the interactions we designed using isochoric temperature cycling. We placed the same number of particles quasi-randomly in a simulation box whose edge lengths were three times longer than that of the target, then compressed the box isothermally at T = 1ε/kB to its target size by reducing the edge lengths at a constant rate over a period of 5 × 104τ. The particles in the expanded box were gas-like, so they were required to assemble in the validation simulations even if the crystal did not melt in the design simulations (Fig. 2). We then simulated a production period of 104τ, sampling particle configurations every 102τ for structural analysis. We repeated this procedure 5 times starting from different initial configurations to probe variability.
![]() | ||
| Fig. 2 Snapshots from the first validation (isothermal compression) simulation using the potential designed during isochoric temperature cycling for square, honeycomb, BCC and, cubic diamond. | ||
We are using binary mixtures to assemble the targeted superlattices because the simple interactions we considered are not expected to form them with one particle type; however, from a practical perspective, we may not require perfect compositional order in the final assembled structure. For example, two types of DFPs may have the same nanoparticle core and differ only in their DNA functionalization, making them essentially interchangeable in the lattice for properties that depend only on the arrangement of cores. Hence, we employed two different structural metrics that were agnostic to composition to assess if the designed interactions assembled the target structure. The first was based on the radial distribution function g ignoring particle type, which we calculated from the recorded particle configurations up to distance R = 5σ using bin width 0.05σ. We then evaluated the mean squared error (MSE) in g relative to the type-agnostic radial distribution function g0 for the target,
![]() | (5) |
We found that square, honeycomb, and BCC all reliably self-assembled in our validation simulations, with at least 80% of particles being classified as belonging to the target lattice on average (Fig. 3). The MSE in g was consistently larger in the validation simulations than in the design simulations. There was also significant variability in g MSE, but not as much in the fraction of particles classified as being in the correct lattice (Fig. S5). In contrast, there was essentially no variability in either quantity between 3 independent simulations performed using the design protocol, with essentially all particles being classified as the target structure. To better understand the assemblies formed using the validation protocol, we inspected the type-agnostic g for the different crystals (Fig. S4). We found that although their peaks were typically correctly located, they often had different heights and widths, particularly at longer distances. This behavior is consistent with the presence of defects disrupting long-ranged ordering, which were also visually apparent in particle configurations colored by the structural classifier (Fig. 2).
Cubic diamond, on the other hand, had a large increase in g MSE and decrease in fraction of particles classified (only about 27%) in the validation simulations. There was also significant variability between the 5 independent simulations. The radial distribution functions all showed a lack of long-ranged order, but some also had a few additional small peaks at shorter separation distances that were not present in the target (Fig. S4d). The structural classification method we used for cubic diamond identified a significant fraction of particles as being hexagonal diamond, a competing polymorph, in some simulations (Fig. S6). Despite their substantial structural differences—75% of rings are boat-like in hexagonal diamond rather than chair-like in cubic diamond—cubic and hexagonal diamond have similar free energies86,87 and subtly different radial distribution functions, making it difficult to select for only one. Overall, our results indicate substantial polymorphism in the diamond structures produced under bulk isothermal compression that was absent under isochoric temperature cycling, likely due to lack of melting. We confirmed that this polymorphism tended to set in early during compression (Fig. 4a) and persist.
![]() | ||
| Fig. 4 Representative particle snapshots and fractions of particles classified as cubic or hexagonal diamond as a function of (a) number density ρ during bulk isothermal compression and (b) temperature T during isochoric cooling of an initially dilute suspension. In (b), the grey line is the largest cluster size, indicating that essentially all particles have incorporated into the final crystallite. The inset particle snapshots show the transition from an amorphous cluster to a crystallite with diamond structure. Snapshots here, in Fig. 5, and in the supplementary information were rendered using OVITO (version 3.12.0).83 | ||
We were curious whether less polymorphism and/or more consistent crystallization might be achievable for cubic diamond by assembling under conditions that more easily allowed for particle rearrangement. In experiments, self-assembly often occurs from a dilute suspension through a gas–solid-like phase transition.88 Drawing inspiration from this procedure, we initialized simulations in the large simulation box used to start the bulk isothermal compression; however, we now isochorically cooled from T = 1ε/kB to 0.01ε/kB at a constant rate over a period of 105τ. This slow cooling should drive the particles to self-assemble finite-size crystallites. Indeed, we found that a single crystallite formed at around T = 0.18ε/kB by a previously identified two-step mechanism,57 in which the particles first aggregated into an amorphous cluster then crystallized (Fig. 4b). Like the bulk compression simulations, the final crystallite contained both cubic and hexagonal diamond structures; however, unlike the bulk compression simulations (Fig. S6), there was significantly less variability in the fraction of each structure between independent simulations (Fig. S12). The similar free energies of cubic and hexagonal diamond make it challenging to thermodynamically favor one over the other.57 We suspect that our isothermal compression protocol inhibited particles' ability to rearrange after nucleation as they continued to densify, leading to more variability than under dilute assembly conditions.
The −+ design produced crystals of similar quality as the ++ design under bulk isothermal compression (Fig. 5a and b). The −+ design had a slightly larger deviation from the target g, but there was less variability between independent simulations (Fig. 5a and S8). Both designs had nearly all particles classified as being in the target structure (Fig. 5b). However, there were significant differences in assembly from a dilute suspension (Fig. S9). The −+ design required significantly colder temperature to assemble than the ++ design (Fig. 5c and d). Assembly occurred in two steps for the −+ design, with string-like structures forming before honeycomb, whereas honeycomb directly nucleated for the ++ design. The assembled structures for the −+ design also tended to be less compact than for the ++ design (Fig. S9). The structural classifier identified more particles as being honeycomb for the ++ design than for the −+ design; however, this may be partially due to the larger number of interfacial particles in the −+ assembly that are more challenging to classify.
The string-like structures for the −+ design may be a kinetic effect because the particle dynamics slow considerably as T decreases and assemblies form. To probe this possibility, we ran an additional dilute isochoric simulation where the particles were initialized in the crystallite assembled by the ++ design, and the temperature was held constant at T = 0.0755ε/kB. This temperature is near the nominal melting point for the −+ design estimated from a trace of the potential energy during cooling. We ran a long simulation (duration 105τ), but the honeycomb crystallite remained stable and strings did not form. We then ran similar simulations at 0.1 ≤ kBT/ε ≤ 0.5 at increments of 0.1 (Fig. S13). We found for the −+ potential that strings started to detach from the crystallite surface at T = 0.1ε/kB, and the crystal completely melted at higher temperatures. We performed the same test using the ++ potential and found that it stabilized the crystallite up to at least T = 0.3ε/kB. At T = 0.4ε/kB, there was a coexistence between the crystal and a dilute phase, before complete melting at T = 0.5ε/kB. This behavior is consistent with the crystallization curves observed in cooling simulations (Fig. 5d).
It has been shown that pair potentials designed for two-dimensional structures can also stabilize their three-dimensional analogs, e.g., a potential designed to self-assemble a square lattice in two dimensions may also self-assemble a simple cubic lattice in three dimensions.89 We accordingly tested the ability of both our −+ and ++ designs for honeycomb, performed in two dimensions, to form diamond from a dilute suspension in three dimensions. Consistent with our results in two dimensions, we found that the two designs formed distinct structures in three dimensions. In particular, a multilayered graphene-like structure that was not stacked in the manner of graphite was assembled by the −+ design (Fig. S10), while the ++ design assembled mostly cubic and hexagonal diamond as well as their closest neighbors (Fig. S11). Interestingly for the −+ design, the particles first formed strings, as in two dimensions (Fig. 5c), before the final structure. We suspect that this structure may also be a kinetic effect for similar reasons as in two dimensions. Overall, the difference in transferability to three dimensions demonstrates another practical difference between the two designs, even though both were considered converged and behaved similarly in bulk simulations.
A key observation from this work is that multiple viable designs may be identified depending on the initial parameter guess, but these designs may have rather different behavior under other assembly conditions. We demonstrated this sensitivity by testing several initial values of λij in the two-dimensional honeycomb lattice, which showed distinct assembly behavior during validation despite similar performance in the design protocol. While the present study focuses on stoichiometrically balanced binary mixtures, exploring compositional complexity may offer additional routes to stabilize or tune targeted structures.90 Nevertheless, especially complex lattices may require additional design elements beyond composition and isotropic interactions; directional (e.g., patchy) interactions represent one promising approach that can be systematically parameterized and connected to experiments.91–93 To enhance robustness and practical translation, it may then be important to consider the assembly protocol that will be used in experiments when configuring the computational design protocol. For example, dilute simulations of finite-size crystallites might be used for design rather than simulations of bulk crystals. Moreover, while our design strategy assumes equilibrium thermodynamics, designing for non-equilibrium assembly or processing conditions may also be fruitful.94,95
Supplementary information is available. See DOI: https://doi.org/10.1039/d5me00129c.
Footnote |
| † These authors contributed equally. |
| This journal is © The Royal Society of Chemistry 2026 |