Kazuki Ishizakia,
Toru Asahia and
Takuya Taniguchi
*b
aDepartment of Advanced Science and Engineering, Graduate School of Advanced Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
bCenter for Data Science, Waseda University, 1-6-1 Nishiwaseda, Shinjuku-ku, Tokyo 169-8050, Japan. E-mail: takuya.taniguchi@aoni.waseda.jp
First published on 20th March 2025
Photo-actuated organic crystals that can be remotely controlled by light are gaining attention as next-generation actuator materials. In the practical application of actuator materials, the mode of deformation and the output force are important properties. Since the output force depends on the crystal properties and experimental conditions, it is necessary to explore the optimal conditions from a vast parameter space. In this study, we employed two types of machine learning for molecular design and experimental optimization to maximize the blocking force. Machine learning in molecular design led to the creation of a material pool of salicylideneamine derivatives. Bayesian optimization was used for efficient sampling from the material pool for force measurements in the real world, achieving a maximum blocking force of 37.0 mN. This method was at least 73 times more efficient than the grid search approach.
Among actuator materials, photomechanical crystals deform upon light irradiation, making them attractive as lightweight and remotely controllable actuators. Previous studies have primarily reported on the free deformation of these crystals, revealing various deformation behaviors.4–10 When deformation occurs to an object, it can exert force and perform work.11–15 The maximum force is generated when the free deformation is perfectly blocked, which is defined as the blocking force, a key performance indicator for actuators. However, blocking force measurements for photomechanical crystals are rare due to the small size of the crystals and difficulty in constructing the experimental setup. Previously, we developed an experimental system that allowed us to directly measure the blocking force of photomechanical crystals.10,16
Controlling the blocking force of photomechanical crystals is crucial for their practical applications. While generating small forces can be readily achieved by adjusting the light intensity, increasing the maximum force remains challenging as it depends on multiple factors including crystal properties, crystal sizes, and experimental conditions. The relationship between these parameters and the generated force is not yet fully understood. Expanding the achievable maximum force could significantly broaden the potential applications of photomechanical crystals. Therefore, a comprehensive investigation of crystal properties and experimental parameters is essential. However, conducting a systematic grid search is impractical due to extensive time and resources required for the experiments. Machine learning approaches have recently proven effective in designing molecular materials and predicting crystal properties17–23 and offer a promising alternative. When predicting physical properties, striking the right balance between predictive performance and model interpretability is crucial, with the choice of model depending on the specific objectives. While Bayesian optimization is effective in optimizing the experimental conditions across various fields,24–27 its application to photomechanical crystals remains unexplored.
In this study, we employed two types of machine learning in the molecular design and experimental optimization of photomechanical crystals, successfully maximizing the blocking force (Fig. 1). First, the least absolute shrinkage and selection operator (LASSO) regression was used for molecular design of photomechanical crystals with different Young's moduli. Crystals of several compounds were prepared based on the molecular design, creating a material pool with varying Young's moduli and crystal sizes. Then, we experimentally measured the blocking force using Bayesian optimization for efficient sampling from the material pool, aiming to maximize the output force. These findings broaden the applicability of photomechanical molecular crystals and help optimize other properties of functional crystals.
UCB(x) = μ(x) + κσ(x) |
We set the value range for each parameter. The measured Young's modulus was used as an identifier for the crystals and we used the values from the bending tests. Crystal sizes were defined within 0–3000 μm length, 200–2000 μm width, and 50–500 μm thickness. Irradiation utilized UV light at 365 nm, with an experimental setup capable of producing an intensity of 2080 mW cm−2 at 100% output. We conducted experiments with four additional light intensity conditions besides 365 nm since changing light intensity is easier than changing crystals. The four conditions were selected from 10, 25, 50, 75, and 100%, excluding the percentage closest to the Bayesian-suggested value. The explanatory variables were standardized and iterative regression was performed.
We first measured 10 data from 5 light conditions of 2 crystals. The blocking force was measured with a universal testing machine (RTG-1210, A&D). The jig approached a sample and stopped when the initial load was confirmed. Then, the crystal was irradiated with UV light at 365 nm (8332A, CCS Inc.) for 5 min at 20 °C. As we aimed to maximize the blocking force rather than investigate the cause of deformation, the sample was irradiated from top and bottom, adopting the larger force value. In the sampling by Bayesian optimization, the crystal closest to the suggested crystal sizes was manually selected. When crystals of the suggested size were unavailable, we either selected alternative crystals of compound (S)-4 or (S)-5 to update the surrogate function. Since adjusting the light intensity was relatively simple, four additional light intensities were tested outside of the suggested conditions.
In the prepared dataset containing 393 data points for machine learning, Young's modulus values distributed with a maximum value of 76.86 GPa, an average of 11.51 GPa, and a median of 9.36 GPa (Fig. 2b and ESI data†). We encoded the molecules to 85 dimensional vectors by the number of substructures implemented in the RDKit library. There are many ways to vectorize molecular structures, but we chose this molecular representation because it provides direct insights into the molecular structure of salicylideneamine reactants. After removing variables with high correlation among explanatory variables, 18 and 27 substructures showed positive and negative correlation with Young's modulus, respectively (ESI Fig. 1†). Substructures that form hydrogen bonds show a high positive correlation with Young's modulus. This is consistent with previous findings.33 Substructures with strong negative correlations should form relatively weak intermolecular interactions in crystals.
Our study focuses on regression analysis to gain insights into molecular design rather than merely improving predictive accuracy. Thus, enhancing interpretability aligns better with our analytical objectives. For this reason, we applied LASSO regression, a linear model that also enables variable selection. After hyperparameter optimization (Fig. 2c), we evaluated the accuracy of LASSO regression. The mean absolute error (MAE) for the training data was 6.0 GPa, while the MAE for the test data was 5.5 GPa (Fig. 2d). This metric is better than the reference metric of the mean model (MAE = 6.9 GPa), which assumes that there is no relationship between the input structure and output. Although basing the analysis solely on functional groups may not encapsulate all relevant factors (e.g., those modelled by graph neural networks), we believe this method is more suitable for identifying reactant compounds in organic synthesis. Furthermore, our dataset comprises experimentally measured values of Young's modulus, which naturally include measurement errors stemming from various factors, such as the crystal surface flatness and the specific conditions used for nanoindentation. As a result, it is not surprising that our prediction errors exceed those typically observed when working with purely computational databases such as the Materials Project or QM9. Nevertheless, we uncovered the relationship between molecular structure and Young's modulus and established design guidelines for salicylideneamine molecules.
The LASSO model retained seven variables with positive and negative coefficients (Fig. 2e). Even with slight changes in α, the LASSO regression coefficients remained almost stable, and the selected functional groups did not change. This demonstrates the robustness of variable selection (ESI Fig. 2†). Four variables were extracted as substructures that are likely to increase Young's modulus, while three variables were extracted as substructures that are likely to decrease Young's modulus. As mentioned earlier, substructures with positive coefficients tend to form hydrogen bonds in crystals, which is consistent with the increase in Young's modulus. Substructures with negative coefficients should be related with the flexibility of the crystal. Crystals containing benzene rings often form layered structures with weak interlayer interactions. Many elastically bendable crystals have this arrangement, which is consistent with a lower Young's modulus.34,35 Crystals forming halogen–halogen interactions are known for their plastic deformation,36 which is included in the training dataset. It is thought that the halogen substructure remained in the variable selection of LASSO regression because crystals that undergo plastic deformation have a small Young's modulus. The counterintuitive finding is regarding aromatic OH. In the dataset, methyl gallate has the highest number of aromatic OH groups, with three OH groups forming hydrogen bonds in the crystal. However, its Young's modulus is reportedly 9.38 GPa,37 which does not reach the average value. The existence of such data likely explains why aromatic OH showed a negative coefficient.
Based on the relationship obtained from LASSO regression, we chose commercially available reactants and synthesized salicylideneamine molecules (Fig. 3). We used reactants with nitro and carboxylic acid groups for likely positive fragments. We did not select aliphatic hydroxyl groups because the reactant already has an aromatic ring. We also excluded amino groups to prevent side reactions since salicylideneamine are synthesized by dehydration reactions between amino and carbonyl groups. For likely negative fragments, we used reactants with benzene rings and halogens. Since the reactant already has a phenyl ring, reactants with a naphthyl ring were used to verify the induction effect of the benzene fragment. Considering these to be sufficient, we did not use reactants with aromatic OH.
In salicylideneamine crystals, the dihedral angle between the salicyl plane and the adjacent phenyl (or naphthyl) plane is an indicator of photoisomerization. Most of the synthesized crystals exhibited dihedral angles ranging from 20–50° (Table 1). An established empirical rule indicates that salicylideneamine crystals with dihedral angles exceeding 30° show higher propensity for photoisomerization.38 UV-vis absorption spectra of the crystalline powders showed absorption bands between 300–600 nm prior to UV irradiation, followed by enhanced absorption in the 400–600 nm range due to enol–keto photoisomerization (ESI Fig. 3†). Compounds 2 and 7 containing planar structures with dihedral angles of 8.48° and 21.24°, respectively, displayed minimal absorption changes.
Compound | Dihedral angle (°) | Longitudinal axis | Largest face | Eindent (GPa) | Ebending (GPa) |
---|---|---|---|---|---|
a The average value of two independent molecules with slightly different conformations.b The crystal of compound 6 was not measured due to its small size. | |||||
1 | 36.21 | b-Axis | (001) | 7.32 (0.74) | 0.60 (0.25) |
2 | 8.48 | a-Axis | (010) | 8.54 (0.19) | 1.92 (0.33) |
3 | 51.58 | a-Axis | (001) | 6.04 (0.32) | 0.68 (0.17) |
(S)-4 | 46.61a | b-Axis | (001) | 3.73 (0.07) | 1.83 (1.11) |
(S)-5 | 36.99 | a-Axis | (001) | 4.75 (0.16) | 1.25 (1.81) |
6 | 46.84 | b-Axis | (001) | 3.13 (0.52) | —b |
7 | 21.24 | c-Axis | (100) | 12.88 (0.87) | 1.83 (0.37) |
8 | 28.54 | a-Axis | (001) | 6.20 (0.53) | 0.66 (0.53) |
9 | 27.51 | a-Axis | (001) | 7.58 (0.12) | 2.29 (1.62) |
10 | 24.88 | a-Axis | (001) | 1.99 (0.24) | 3.55 (2.84) |
It is important to identify Young's modulus, which is a parameter of force. Young's modulus was measured from different directions to characterize the anisotropy. First, nanoindentation was performed on the wide face of all plate-shaped crystals, measuring the Young's modulus in the thickness direction (Table 1 and ESI Fig. 4†). Nanoindentation is the most widespread and reliable method to measure Young's modulus of the molecular crystal by indenting the crystal surface up to hundreds of nm. Young's modulus of crystalline compounds 1–3 was larger than that of compounds 4–6 (Fig. 5a). This result is consistent with the prediction by LASSO because most of the dataset used for LASSO regression consisted of nanoindentation results and these crystals are thought to reflect the intermolecular interactions included in that dataset. However, halogenated crystals (compounds 7–10) did not align with predictions. Crystals of compounds 7–9 had relatively large Young's moduli. This is because the dataset includes halogenated crystals that form halogen bonds with relatively small Young's moduli, while halogenated crystals in this study do not form halogen bonds. For example, hexachlorobenzene, which is known for plastic deformation,36 has a relatively small Young's modulus in the dataset and contributes to a negative relationship in the regression because it has six halogen fragments.
Next, cantilever bending tests were conducted to measure Young's modulus in the longitudinal direction because it is difficult to measure the modulus of this direction by nanoindentation. Since Young's modulus measured by the bending test is a small fraction in the dataset, measurement results may differ from the LASSO trends. Crystals with larger Young's moduli in the thickness direction tended to have smaller Young's moduli in the length direction (Fig. 5b). This can be attributed to the anisotropy of organic crystals. A large Young's modulus in the thickness direction indicates strong intermolecular interactions in that direction. When strong interactions form in one direction, weaker interactions such as CH–π interactions and van der Waals forces become the main intermolecular interactions in other directions given that similar molecules are used in all crystals, explaining the smaller Young's modulus. The standard deviation of the bending test is larger than that of nanoindentation because the bending tests calculate Young's modulus from macroscopic deformation against load.
In Bayesian optimization, the acquisition function used for sampling is crucial, and we employed upper confidence bound (UCB). Through pre-simulation, we set the hyperparameter of UCB to be κ = 5 for the optimal balance between exploration and exploitation (ESI Fig. 5†). Initially, we measured 10 points (five light conditions of two crystals), with the maximum value among these being Fmax = 9.8 mN (Fig. 6b). Here, Fmax is defined as the maximum blocking force minus the initial load. Then, we performed crystal sampling using Bayesian optimization and measured Fmax iteratively. Fmax increased progressively from 9.8 mN to 16.0 mN, then to 26.8 mN, 34.4 mN, and finally to 37.0 mN over the course of 100 experiments (Fig. 6c and ESI Table 2†). The decision to terminate the optimization process after 110 experiments was based on the convergence behaviour of Fmax. The final three updates of Fmax showed progressively smaller increments: from 26.8 mN to 34.4 mN (an increase of 7.6 mN), and then to 37.0 mN (an increase of only 2.6 mN). This diminishing rate of improvement, coupled with the fact that the last three maximum values were achieved using crystals of compound 3 under similar conditions, strongly suggests that the optimization was approaching the global maximum within our parameter space. The crystals of compound 3 were selected the most in the optimization process (Fig. 6d). While continuing the experiments might yield marginally higher forces, the decreasing magnitude of improvements indicated that we had likely identified the region of optimal conditions.
This study compares the force values achieved with those from previous research. The force of photomechanical crystals has been previously evaluated by the displacement of objects with known weights. The force achieved by photomechanical crystals is typically in the μN–mN range, with a maximum of around 10 mN.11–16 While our initial data also showed this value, we achieved a greater force than previously known by actively seeking to maximize it. On the other hand, the force density, which is the force divided by the crystal volume, reached a maximum of 1.7 × 108 N m−3, which is comparable to existing photomechanical crystals (107–109). Force density cancels out size effects, indicating that the light absorption properties and intermolecular interactions of our crystals are similar to those of other photomechanical crystals. Conversely, the significance of this study lies in demonstrating that larger forces can be achieved by sampling experimental conditions. The success of Bayesian optimization is evidenced by its ability to identify optimal combinations of crystal properties and sizes, rather than simply selecting the largest available crystals (Fig. 6e). If larger crystal volume automatically corresponded to greater force generation, a simple maximization of crystal sizes would suffice. However, the optimization process converged on specific combinations of crystal properties and dimensions, indicating that there is an optimal balance between crystal characteristics and size for maximizing the blocking force.
It is also important to consider the efficiency of sampling through Bayesian optimization. The material pool used for this force measurement included eight types of crystals. Although the ranges for length, width, and thickness differ, a grid search would require a total of 8000 crystal samplings if we sampled 10 discrete points along each size dimension. Assuming that the maximum value obtained through Bayesian optimization is the global maximum of the search space and the light conditions are the same, this method reduced the number of experiments to 110 out of 8000. Consequently, we have improved the experimental efficiency by at least 73 times, depending on the discretization. If we take finer intervals in the grid search, the total number of experiments increases exponentially, making the efficiency of Bayesian optimization stand out.
Finally, we discuss the mechanism behind the enhanced force generation. The superior performance of compound 3 can be attributed to its structural features: the large dihedral angle between the salicyl and phenyl planes (51.58°) enables efficient photoisomerization, while the nitro group enhances photoresponsivity. The molecular layers stacked along the thickness direction facilitate anisotropic thermal expansion.39 When exposed to light, compound 3 exhibits enol–keto photoisomerization and thermal effects. In thin crystals, photoisomerization causes bending away from the light source, while thermal expansion dominates in thick crystals.39 The maximum force generation observed during top illumination suggests significant thermal expansion in the thickness direction. The crystal's negative thermal expansion coefficient along the length direction near room temperature likely contributes to the overall mechanical response. The force generation increased with crystal size in regions of sufficient light intensity (ESI Fig. 6†).
Footnote |
† Electronic supplementary information (ESI) available: Details of the dataset, additional results and experimental results. CCDC 2391778–2391786. For ESI and crystallographic data in CIF or other electronic format see DOI: https://doi.org/10.1039/d4dd00380b |
This journal is © The Royal Society of Chemistry 2025 |