Karim R.
Gadelrab
a,
Adam F.
Hannon†
*ab,
Caroline A.
Ross
a and
Alfredo
Alexander-Katz
*a
aDepartment of Materials Science and Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA. E-mail: aalexand@mit.edu
bMaterials Science and Engineering Division, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD 20899, USA
First published on 9th October 2017
Recent success of inverse design methodologies in the realm of self-assembled materials has allowed us to envision an inverse path of discovery where we go from a desired target function to building blocks. In this review we examine recent advances of such inverse design methods in soft materials containing block copolymers, colloids, or DNA. By combining well-developed theoretical models with advanced inverse search algorithms, the design of such systems has been dramatically enhanced over the past decade. Advantages and disadvantages of the most prominent inverse search algorithms are discussed in the context of block copolymer directed self-assembly inverse design. The success of these methodologies in such systems shows great promise for the future of self-assembling materials, particularly for applications where the desired structure and properties of the system needed for a functional device are known.
Design, System, ApplicationThe progress in understanding the underlying physics governing materials behavior enables designing materials with new and improved properties. This advancement relies on a strong theoretical and computational effort to make informed experimental decisions. While the theoretical and computational investigations systematically explore a parameter space, this parameter space is guided by what is experimentally known in what can be considered a forward type of analysis. However, the ideal role of theoretical and computational work is to provide scientists and engineers with the right starting conditions that would lead to a particular outcome. This inverse approach resembles an optimization problem that needs to be efficiently solved. We discuss several areas within the self-assembly materials community where inverse design strategies have been successfully applied with emphasis on directed self-assembly of block copolymers using topographical template. These strategies share a common theme of defining a target property or structure, translating this output into an objective function, selecting a computational technique that can probe this output for a given parameter space, and finally running an inverse optimization scheme that finds the best set of parameters to yield the desired output. We believe that the pressing need of new advanced materials will push the field of materials design to rely on such inverse design approaches as the parameter space become practically unsearchable. |
A general flow diagram of how the material design process takes place is shown in Fig. 1. The forward design approach starts from hypothesizing candidate building blocks (chemistry, molecular architecture, guiding template, etc.) that are expected to result in target functional properties (structural, physical, mechanical, etc.). The dependence of the outcome on these input blocks can be quickly assessed in a direct fashion using computer simulations. Hence, the target properties are realized through an exhaustive search of the input parameter space. On the contrary, the inverse design method sets the target property as a goal that needs to be achieved through search and optimization techniques. This method's success depends on finding the right set of input design parameters that would produce the closest output to the target goal. In particular, the target application is used to define an objective function of a metric to be optimized by the inverse methodology. Simulations are then performed, seeded with random parameters based on the experimental bounds of the problem, using a given structure model for the system. The structure model simulation results are used with the objective function to compute a goodness of fit for the target metric. The model parameters are updated using an inverse search algorithm or optimization routine of choice. The new parameters are then used to perform new forward simulations and the process is reiterated until a satisfactory solution is found. Using this general methodology, a given problem can be solved by defining the simulation technique and structure model, objective function and data model, and inverse search algorithm. The objective function is usually dependent on the data used for the target metric and can take many forms such as a simple mathematical function from statistical theory (χ2-function), mean-absolute error function, a log-normal function, or, if possible, can be derived from the Hamiltonian describing the physics of the system.9 Particularly, in the field of block copolymer self-assembly, a number of increasingly sophisticated algorithms and strategies have now been proposed for inverse design, starting from pioneering work using simple Monte-Carlo based optimization processes,10,11 to evolutionary computation9,12–15 and swarm-based approaches.16,17
Inverse design has gained much attention in the field of materials self-assembly.15,18 Self-assembly is the process through which a collection of simple units spontaneously form larger complex structures.19,20 The inherent ability of these units to morph into specific patterns provides a valuable tool for materials fabrication, as it has the potential to produce intricate features at the smallest of length scales.21–27 The possible outcomes from a variety of self-assembly processes have been explored experimentally. For example, block copolymers (BCPs) can self-assemble into various periodic structures (spheres, cylinders, lamellae, etc.) that can be used for surface patterning,22–29 battery electrolytes,30 light polarizers,31 photonic crystals,32,33 and optoelectronics.34 DNA origami uses the information encoded via DNA synthesis for self-assembly into almost arbitrarily programmable shapes.35–38 Furthermore, in DNA-mediated assembly, colloidal nanoparticles functionalized with single strand DNA can be used to self-assemble into a variety of crystal structures.39–41
The theoretical perspective for the problem of self-assembly depends mostly on the system of interest. In the case of colloids, the shape of the force profile surrounding a suspended particle has been studied to correlate the self-assembled pattern of the colloid with the strength of the attraction and repulsion components of the field.42–44 Attempts to solve the colloid inverse problem were based on statistical mechanics techniques where a functional form was proposed for the isotropic pair potential. The parameters of the function that gave the desired crystal structure were estimated through an optimization process for an objective function describing structural stability.45–47 A counterintuitive outcome of these studies is that radially symmetric force fields can produce ground state crystals with low symmetry. Low symmetry structures such as 2D rectangular,48,49 hexagonal,49 and diamond crystal43 were achieved through monotonic and convex-repulsive potential expressions that could be experimentally realized.42,43
Inverse search approaches are found in other disciplines including spectroscopy and scattering where instead of trying to design a target structure, an intensity measurement is made by a technique such as transmission scattering,50 reflectivity,51,52 or spectroscopic ellipsometry53 and the resulting data are fit using an inverse algorithm. In this case, the measured data set is an intensity signal that is produced from the path taken by the radiation used to interact with the material medium. This path is generally dependent on the initial conditions of the radiation and the sample material properties. Many of the radiation characteristics are lost in the intensity measurement such as the phase of the radiation. Thus, while there are many processes where the forward calculation of the measured intensity is well understood, in practice one is always solving the inverse problem starting with the measured intensity and having to calculate the material structure and the conditions of radiation. A general inverse methodology for finding the material structure required to produce a given X-ray scattering pattern for periodic nanogratings was shown by Hannon et al.54 using a shape based model and by Khaira et al.55 using a physics-based model. These studies both use a similar inverse methodology analogous to a general inverse materials design approach where the measured intensity data replace the target structure metric. The methodology by Hannon et al. obtained the shape profile of a periodic nanograting from scattered X-ray intensity data via a trapezoid stack model and compared three different inverse search algorithms: Markov chain Monte Carlo, differential evolution, and covariance matrix adaptation evolutionary strategy (CMA-ES). The results of that study suggested that CMA-ES is one of the best inverse search methods for such scattering data and shape models in terms of both speed and consistent convergence to the global minimum solution. The study by Khaira et al. used the CMA-ES algorithm with a more complex physics-based model that not only gives the resulting periodic structure from the scattering data, but also thermodynamic processing and property information of the system studied. Future work in materials inverse design should explore new avenues using this inverse search paradigm with indirect experimental measuring techniques to access the real material structure and properties of the system with physics-based models.56
One of the most prominent self-assembling soft materials that inverse design has recently been applied to are BCPs, which self-assemble through a free energy minimization process known as microphase separation.57 The immiscibility of the constituent polymers in each block promotes phase separation that reduces interfacial contact between the blocks, while the arrangement of different blocks within the same molecule limits the extent of the spatial separation resulting in a periodic pattern rather than macroscale phase separation.58 The critical dimensions of the ordered morphology are on the order of a few nm to 100 s of nm depending on the chain length and chemistry which makes BCPs a very attractive tool for producing complex nanoscale structures, for example in microelectronic device fabrication.58–62 Functional devices have been fabricated successfully through patterning based on BCP thin films, such as quantum dots,63,64 FinFET logic circuits,65 memory devices,66 magnetic storage media,67,68 and semiconductor capacitors.69,70
The spontaneous microphase separation of BCPs typically produces regions (grains) of microdomains with short range order resembling a polycrystalline structure, each grain having a different orientation and localized defects.71 However, many device applications of BCPs require defect-free extended areas of periodic BCP microdomains, in some cases with controlled aperiodic features. This can be accomplished by guiding the self-assembly of a BCP film through a chemical and/or topographical template formed on the substrate, a process known as directed self-assembly (DSA).22–29,31,72–74 Defect annealing and pattern alignment are defined by the template based on factors such as preferential surface wetting and the commensurability between the BCP period and the period of the template.75
Computational modeling of BCP DSA has proved valuable in determining the effects of different parameters such as film thickness, polymer–substrate interactions, and geometrical confinement have on the self-assembly process.76,77 However, transitioning BCP DSA to a commercial manufacturing technique requires implementation of an inverse design method to predict the structure of a template that will yield a desired self-assembled BCP pattern. In applications where a complex pattern needs to be manufactured, the role of inverse design is to start from the target structure pattern and determine the optimum directing template design as well as the polymer composition and processing conditions.8–11,13–16,78
Liu and Lai et al.79,80 incorporated DSA with computational lithography (CL) to implement patterns relevant to circuit design. The inverse design flow works iteratively to optimize the photo-mask for lithography. The role of BCP DSA can be handled through models of varying complexity such as theoretically informed coarse-grained (TICG) Monte Carlo (MC),81 dissipative particle dynamics (DPD),82 and self-consistent field theory (SCFT).83 A phenomenological 2D Cahn–Hilliard (CH) model proved to be a fast and practical alternative from full field analysis.80 Furthermore, a very fast DSA model (orders of magnitude faster compared to 3D MC simulations) was developed specifically to target vertical via interconnects where it was very suitable for inverse analysis and optimizing guiding patterns.80 While the details of this simulator were not explicitly revealed, it is described to be empirical, requiring calibration when processing conditions change, and is not transferable to other DSA processes. Nevertheless, the framework was successfully verified on designing FinFET logic circuit elements and a complex set of via patterns. The integrated process of DSA-CL achieved a high enough computation speed to be incorporated in full scale lithography mask design.79,80,84
In order to achieve a reliable inverse design of BCP patterns through DSA, a significantly large parameter space has to be explored. In addition to materials related parameters, DSA process related ones such as the number, size, and shape of guiding elements, element locations, and surface functionalization also have to be defined.10,11 Optimum guiding templates should generally be sparse (i.e. fewer patterned features per unit area than the target BCP pattern), taking advantage of feature density multiplication in order to achieve short fabrication time and high cost efficiency.22–24 Hence, the inverse design problem is generally ill-posed, and reaching the optimum set of parameters demands an efficient optimization algorithm.9,12–17,85
Meta-heuristic algorithms86 provide an attractive approach to solve the inverse problem. They have been used to obtain global or near-global optimum solution due to their capability of finding promising regions in the search space within a reasonable computational time87 by combining rules and randomness to imitate nature.88 Paradiso et al.17 employed the particle swarm optimization (PSO) to generate elaborate structures of multiblock copolymer blends using SCFT (see Fig. 2a). In the implementation of PSO, a set of randomly generated solutions (swarm) propagates in the design space towards the optimum solution over a number of iterations based on the information gathered from the design space and shared by all the members of the swarm.89 The moves of the swarm are controlled by inertia and drag forces to minimize a simple error function. The algorithm successfully optimized the blend composition, and cell size. The adaptation of PSO for BCP inverse design is in its early stage, however, the technique has the potential to comply with more demanding applications.17
Fig. 2 Inverse design problems of BCPs implemented using meta-heuristic algorithms (a) convergence of a multiblock blend formulation designed to stabilize coexistence of a line with spot features using the PSO algorithm. The fitness of the best agent in the swarm is plotted for each iteration with four solutions highlighted as red circles to illustrate evolutionary milestones compared to the target pattern.17 (b) Evolution of the objective function of a target letter “I” pattern using the CMA-ES optimization combined with CH model. The optimal morphology and spot positions are shown on the lower left corner.13 (c) The magnitude of the objective function corresponding to the best offspring as a function of CMA-ES iterations combined with coarse-grained many-body model of BCP for 3× density multiplication of a lamellar structure on a chemical pattern. Adaptation of the search step size is shown in the inset.14 |
Evolutionary algorithms widely known as genetic algorithms,12,90 genetic programming,91,92 evolution strategies,93 and evolutionary programming,94,95 have proven successful across various fields for problems with high dimensional combinatorial or continuous search spaces.96 Qin et al.13 utilized a combination of the CH equation and the CMA-ES for the determination of equilibrium BCP morphologies under chemical templating conditions.97 With this BCP model, it was possible to identify the optimal chemical pattern to assemble the BCP into non-regular morphologies. Polymer volume fraction and polymer interaction parameter are initially set to generate a lamellar structure for reference. A predetermined set of chemical anchoring spots are then allowed to move in order to determine their optimum locations within a population of parameter sets. The optimization algorithm iteratively minimizes an objective function that relates the generated topography to the target one. The solution converges when the objective function is minimized. During each generation, the chemical templating points are repositioned through the CMA-ES, and the equilibrium morphology of the polymer is determined using the CH equation as shown in Fig. 2b. The objective function is calculated for the population to allow the CMA-ES to propose a new population of locations of the chemical templating spots. The algorithm converged irrespective of the initial spot arrangements. The optimization technique was later combined with a more sophisticated model that includes molecular details and thermal fluctuations in full 3D simulations.14 In this system, a triblock copolymer system that assembles into lamellar structures was simulated on a periodic chemical pattern to give an approximately 3 to 4 density multiplied pattern (see Fig. 2c). The results were experimentally verified for a P2VP-b-PS-b-P2VP (polystyrene-b-poly-2-vinylpyridine) BCP on a silicon substrate. Even for such level of complexity, the CMA-ES is able to find the optimal set of parameters to produce a given target structure within only NGen ≈ 50 generations using a population of size NPop = 31 and NParam = 3 optimized parameters. Fig. 2c shows a plot of the objective function as a function of the number of generations taken from their study demonstrating this rapid convergence. This corresponds to NGenNPop = 1550 independent simulations. For a traditional forward approach, these parameters would have been divided into a search vector of length NBin that would require (NBin)NParam independent forward simulations to be performed. A typical search vector of NBin = 16 searching NParam = 3 parameters would require 163 = 4096 simulations. The number of forward simulations will rapidly increase when exploring a larger parameter space NParam, compared to the CMA-ES algorithm, assuming NGen stays roughly constant.
Inverse design of topographical templating has also proven to be successful in predicting considerably complex patterns with bends, terminations, and junctions, for open and closed domains.10 The topographical templates consist of posts with specific surface wetting condition that can be placed at different locations to control the BCP pattern that forms. Hannon et al.11 introduced the combination of a Monte-Carlo-like optimization search algorithm with an SCFT simulation to determine the optimum post locations. A schematic comparing the forward design methods used in prior work77,98 with these inverse methods in shown in Fig. 3. In this formalism, the target density profile is synthesized using a tiling technique of simple segments. The target density is held constant, while a fixed number of posts are allowed to be relocated. For every step, a single post is moved in a series of random walks to a new position and the potential fields are allowed to evolve to a steady state. The energy of the system is calculated and the post move is accepted if it results in an overall reduction to the system energy, otherwise after failing a certain number of steps, the post is returned to its original location. The formalism was further developed to include the polymer volume fraction and the number of posts as variables.10 A fidelity factor relating the density distribution in the generated and the target polymer densities is defined to gauge the optimum selected parameters. This fidelity factor is essentially the reciprocal inverse of an objective function which is maximized rather than minimized. The optimized post number and locations were verified experimentally for a PS-b-PDMS (polystyrene-b-polydimethylsiloxane) system where electron-beam lithography was employed to generate posts from HSQ (hydrogen silsesquioxane, an electron-beam resist) that were further functionalized to preferentially attract PDMS. It was interesting to note that the 2D SCFT simulations were sufficient to predict experimental results since the interactions are primarily constrained by the in-plane topography. Such work can be extended to find optimum interaction parameters χ and degree of polymerization N to further explore the possibility of different chemistries and chain architectures to produce a given target structure.
Fig. 3 (a) Schematic showing from left to right how the general forward design process occurred in modeling the directed self-assembly of BCPs.76,98 (b) Schematic showing how the inverse design methodology works for the directed self-assembly of BCPs.10,77 |
The BCPs discussed in the context of inverse DSA design so far have generally been simple diblocks modeled using Gaussian chain statistics (i.e., coil–coil BCPs). The formalism provided can easily be expanded to more complicated systems where the chains are more rod or wormlike99,100 in behavior including rod–coil, rod–rod, and bottlebrush copolymers (where the backbone chain will likely exhibit rod/wormlike behavior). Using such models simply changes the structure model used as diagramed in Fig. 3b, and changes the total number of parameters optimized by the inverse search algorithm. As an example, rod–coil systems have been studied using SCFT models101–109 with the rod block modeled as a wormlike chain. These models add the additional degrees of freedom in the rod persistence length λ, the relative length of the rod segments to the Gaussian coil radius of gyration β, and Maier–Saupe interaction parameter μ to model the dipole–dipole interactions between neighboring rods.99,100 In principle, these parameters can all be optimized just as χN, f, and the DSA feature positions by algorithms such as the CMA-ES. Additionally, more complicated objective functions may be necessary to target not only the local density field profile desired for the constituent species, but also include a target orientation field for the rod/wormlike segments. To our knowledge, no study using such models has been published.
In order to implement the inverse design process with ever more increasingly complex physics-based models, more advanced inverse search algorithms need to be developed. Of the approaches discussed in this review, the CMA-ES algorithm appears to be one of the most efficient algorithms based on the prior studies as well as our own examination of it relative to other algorithms due to its speed of convergence and self-tuning nature.54 The algorithm does have an inherent disadvantage in not necessarily converging initially to the global objective function minima with the propensity for this to occur increasing with a larger parameter search space. Methods such as the PSO method17 may be able to overcome these limitations, but there are still inherent biases in the search in getting trapped in local minima, especially when there is high density of metastable states around the true energy minimum as in the case of large 3D simulations of bulk BCPs.12,16 Stochastic based methods such as Markov chain Monte Carlo will eventually always find the global minima by randomly walking through the search space with an optimized step size, but this requires very long calculations that do not gain much search speed over just doing a complete forward search parameter sweep. A possible approach to alleviate the inherent issues with the CMA-ES and PSO approaches in getting trapped in local minima solutions is to have a wrapper algorithm around the inverse search algorithm that resets the algorithm when the objective function saturates for an extended number of generations but keeps the knowledge of the best solution found, thus being able to search other parameter combinations that may have been missed due to a rapid convergence into a local minimum.
Some interesting new areas of exploration in terms of inverse design would be a hybrid experimental–computational framework in which one could have direct in situ experimental input into the computation to adjust parameters on the fly during the optimization processes. While ambitious, this would provide a complete inverse design platform where features from the physical system as well as those from simulations are captured simultaneously.8,54,55,111 Both macroscopic physical features and computational molecular features are complementary, so it is foreseeable that we will learn much of the physics of these assembly processes by understanding their underlying correlations. Such correlations will become clearly apparent as one optimizes the building blocks to obtain a target system. In particular, the time coordinate does not seem to enter such protocols, but experimentally it is a very important variable. By controlling processing pathways, it is possible to frustrate the materials assembly and force the system to arrange into a particular metastable structure.112 This might be a new avenue in inverse design that could be quite useful and could also provide us with controlled “synthesis” routes and robust assembly pathways.
The inverse design paradigm has been successful for BCPs and for colloid assembly, but the methodology is general enough to be applied to other soft material systems where models are well developed for the given system of interest. Some examples are DNA nanotechnology and de novo protein design which have gathered much recent interest. It is without a doubt that optimization in these areas will yield also optimized sequences for given DNA and protein folding structures.113,114 Overall, it is foreseeable that the future of materials science research, and in particular self-assembling materials, will rely more extensively on inverse design methods to obtain tangible solutions because the parameter search spaces for new systems become more complex and practically unsearchable under traditional methods.
Footnote |
† Guest researcher. |
This journal is © The Royal Society of Chemistry 2017 |