Cesare
Roncaglia
a,
Daniele
Rapetti
a and
Riccardo
Ferrando
*b
aDipartimento di Fisica dell'Università di Genova, via Dodecaneso 33, Genova 16146, Italy
bDipartimento di Fisica dell'Università di Genova, via Dodecaneso 33, Genova 16146, Italy. E-mail: ferrando@fisica.unige.it
First published on 11th October 2021
The lowest-energy structures of AgCu nanoalloys are searched for by global optimization algorithms for sizes 100 and 200 atoms depending on composition. Even though the AgCu system is very weakly miscible in macroscopic samples, the mixing energy for these nanoalloys turns out to be clearly negative for both sizes, a result which is attributed to the stabilization of non-crystalline Cu@Ag core–shell structures at the nanoscale. The mixing energy is a quantity nowadays unknown in its functional form, so that its prediction may take advantage of machine learning techniques. A support vector regressor is then implemented to successfully predict the mixing energy of AgCu nanoalloys of both sizes. Moreover, with the help of unsupervised learning algorithms, it is shown that the automatic classification of such nanoalloys into different physically meaningful structural families is indeed possible. Finally, thanks to the harmonic superposition approximation, the temperature-dependent probabilities of such structural families are calculated.
Only in the past few years, also Machine Learning has been proposed as an effective tool to investigate and solve some problems related to the modeling of nanoparticles,10 such as energy landscape exploration11,12 and dynamic predictions via force field reconstruction,13 but also binding energy14 and free energy15 predictions, and atom classification.16 In a very broad general depiction, machine learning algorithms can be divided in two large classes: supervised and unsupervised learning algorithms. The first class deals with data sets consisting of both input and output data, and tries to establish a relation between these sets for fitting or classification purposes. The second class deals with unlabelled data, i.e. only input data, and tries to work out a specific task such as clustering the data into separate sets, feature selection or dimensionality reduction.
Here in particular we decided to implement both supervised and unsupervised learning algorithms, showing indeed their ability to capture different interesting properties of our data set. It is in fact known that a regression algorithm is capable of detecting subtle relationships between input and output data, whereas a clustering algorithm can identify subgroups of an ensemble without any prior knowledge. Our data sets were collected by global optimization searches of the lowest-energy structures of AgCu nanoalloys of sizes N = 100 and N = 200 atoms, for different compositions. Machine learning algorithms were then trained in order to make predictions of the mixing energy, a quantity not known in its functional form, and also to find structural families without any previous classification. Finally, we calculated probabilities for such different structural families as a function of temperature by means of the Harmonic Superposition Approximation (HSA).17–19
AgCu nanoalloys are important for a different number of remarkable reasons. In general, the bi-metallic nature of nanoalloys allows to enrich their range of applicability in real life situations, since the chemical ordering of equilibrium and out of equilibrium structures will induce different desirable properties, missing most of the times in their single metal counterparts. For example, AgCu nanoalloys have shown interesting plasmonic,20 electrical,21 antibacterial22 and catalytic23 properties, but also applications in corrosion resistance24 and solar cells.25 The interest in the theoretical modeling of such nanoalloys is therefore motivated by this impressive variety of experimental results.
AgCu is a weakly miscible system in bulk samples,26 with positive (endothermic) mixing energy. However, global optimization studies of small nanoalloys (of 34 and 38 atoms)27 showed that the mixing energy at the nanoscale is negative in the whole composition range. It is therefore interesting to check whether this behaviour persists at larger sizes such as those considered in this article (100 and 200 atoms).
The material is organized as follows. The second section includes a brief explanation of the theoretical methods, including Machine Learning (ML) and global optimizations algorithms, as well as more specific topics such as Common Neighbor Analysis (CNA) and Harmonic Superposition Approximation (HSA). The third section is entirely dedicated to the results of the applications of such methods to AgCu nanoalloys of 100 and 200 atoms. Finally in the fourth and last section, the conclusions can be found.
A note on terminology. In order to avoid confusion, aggregates of atoms will be referred to as nanoparticles or nanoalloys. On the other hand, the term cluster will be used to denote a set of nanoparticle structures which are grouped together by a clustering algorithm working in the space of suitable order parameters (i.e. of structural descriptors).
![]() | (1) |
![]() | (2) |
![]() | (3) |
p | q | A (eV) | ξ (eV) | r 0 (Å) | r c1 (Å) | r c2 (Å) | |
---|---|---|---|---|---|---|---|
Ag–Ag | 10.85 | 3.18 | 0.1031 | 1.1895 | 2.89 | 4.08707719 | 5.00562683 |
Cu–Cu | 10.55 | 2.43 | 0.0894 | 1.2799 | 2.56 | 3.62038672 | 4.43405007 |
Ag–Cu | 10.70 | 2.805 | 0.0977 | 1.2275 | 2.725 | 4.08707719 | 4.43405007 |
This interaction potential has been used previously to model the structures of AgCu nanoalloys and compared to Density Functional Theory (DFT) calculations and experimental results,32–35 obtaining quite good agreement with DFT data on the behaviour of the composition-dependent excess energy and on the structures of lowest-energy magic polyicosahedra of sizes 34, 38 and 45,32 on the energetics of the placement of Cu impurities in Ag icosahedra and truncated octahedra (see ref. 7, Tables 5.1 and 5.3), on the relative stability of Mackay, anti-Mackay and chiral Cu@Ag icosahedral structures (see ref. 33). Moreover, the agreement between the predictions of this model and the experimentally observed structures is quite good34 as discussed in ref. 35. Therefore we believe that this model, although approximate, is able to catch the relevant structural aspects of AgCu nanoalloys, with the advantage of allowing such a thorough exploration of the nanoalloy energy landscape that would be unfeasible by DFT calculations.
The mixing energy Emix7 was used to analyze the energetic stability of the nanoalloys depending on composition. Emix is defined as follows
![]() | (4) |
NE(m, n) < mE(N, 0) + nE(0, N) | (5) |
Each nanoparticle structure was described with the aid of two other parameters, coming from the Common Neighbor Analysis40 (CNA). For each pair of nearest-neighbor atoms, the CNA defines a signature consisting of a triplet of integer numbers rst:
• r – the number of common nearest neighbors of the pair.
• s – the number of bonds between those r atoms.
• t – the length of the longest chain of bonds that can be made out of the s bonds present.
For a given nanoalloy and signature, we define its signature order parameter as the number of nearest-neighbor pairs presenting such signature divided by the total number of nearest-neighbor pairs in the nanoalloy. Some typical signatures are shown in Fig. 1. In particular, here we decided to use first a two-dimensional description based on the (555, 422) pair of signatures, and the to compare the results to those obtained by means of different choices of the variables.
![]() | (6) |
![]() | (7) |
![]() | (8) |
![]() | (9) |
![]() | (10) |
BIC = log(n)p − 2![]() ![]() ![]() | (11) |
![]() | (12) |
![]() | (13) |
![]() | (14) |
![]() | (15) |
![]() | (16) |
In Section 3.3, we discuss different possible choices of the order parameters for the application of the clustering algorithms. Finally, in Section 3.4, we consider a specific composition, Ag64Cu36. For that composition we consider a set of low-energy local minima collected by our BH searches, we use the clustering algorithm to separate these structures into families, and we calculate the temperature-dependent probability of the families by means of the HSA.
![]() | ||
Fig. 2 (A) Mixing energies calculated after global optimization. (B) SVR (20% test) predictions for every composition, along with mixing energies calculated from global optimization. |
For a selected set of compositions, we checked the behavior of the mixing energy by DFT calculations. The DFT results (reported in the ESI†) are in good agreement with those of the Gupta potential, confirming the overall behavior of the mixing energy, and even giving somewhat more negative values. The negative values of the mixing energy in nanoalloys such as AgCu, AgNi, AgCo, AuCo and others was attributed to the efficient stress relaxation achieved by core@shell structures in which the element with lower surface energy and larger atomic size is segregating at the surface.35 The DFT data confirm this result and show that, at least for AgCu, the electronic effects not included in the Gupta model, such as directional terms in bonding between atoms or charge transfer, are likely to be of minor importance.
For this size, the mixing energy is calculated for every possible composition, so that in principle there is a complete knowledge about this function. However, for larger nanoalloys (as we shall see later) it can be difficult and rather expensive to compute the mixing energy for every possible composition, since it requires a full global optimization for each composition, so that in principle it is useful to have a method capable of making predictions where they are needed, i.e. on “unseen” data.
To make such predictions, we trained and tested a SVR splitting randomly the data set into an 80% training set and a 20% test set, using the number of silver atoms m as the only variable. In order to avoid overfitting, hyperparameters were tuned from the following ensemble, which spans different orders of magnitudes, after 5-fold cross validation:
Rtraining2 = 0.999825![]() ![]() ![]() ![]() |
Rtraining2 = 0.999875 Rtest2 = 0.999468 |
When the data set is represented in the two dimensional space given by the 422 and 555 CNA signatures, all nanoalloys automatically separate into different groups. However, it is still not clear at this stage how many are there, as shown in Fig. 3A. After K-means is implemented, and the silhouette score is plotted as a function of the number of clusters K, one obtains the plot given in Fig. 4A. The optimal number of clusters, according to this criterion, is then K = 7. The Voronoi tessellation, along with each cluster center, is shown in Fig. 3B. A representative structure for each family can be found in Fig. 4B. Further details and images are given in the ESI.† Here we give a description of the different structural families found, specifying the number of silver atoms m.
![]() | ||
Fig. 4 (A) Silhouette score as a function of K for the case N = 100. (B) Seven representative structures, one for each family. Labels are referred to Fig. 3B. The seven structures have 2, 82, 62, 23, 72, 56 and 45 silver atoms respectively. |
• (1) m = 0–4, m = 100
These are the only six structures with Marks decahedral55 symmetry.
• (2) m = 79–83
Five asymmetric icosahedral structures.
• (3) m = 59–66
These eight nanoalloys have a 55-atom perfect Mackay icosahedron covered by an incomplete Mackay crust, which is slightly distorted and presents a small rotation.
• (4) m = 5–34, m = 84–99
These forty six structures, which compose the largest cluster, have icosahedral symmetry. They are basically composed of a 55-atom perfect Mackay icosahedron covered with an incomplete Mackay icosahedral shell (which is part of the surface of the 147-atom icosahedron). However, even if they share this common feature, there are two subgroups that can be identified. The first subgroup is that of m = 5–34, where the silver atoms are at the nanoalloy surface, typically occupying icosahedral vertices and the external shell. The second subgroup can be identified with the remaining nanoalloys. Those structures have all of their copper atoms in the core, typically forming part of the 13-atom perfect icosahedron.
• (5) m = 67–78
These twelve structures are polyicosahedra32 resulting from three joined 55-atom icosahedra sharing some atoms. The third icosahedron is incomplete because of an insufficient total number of atoms.
• (6) m = 54–58
These five structures are again core–shell as those in group 4. However their crust is of anti-Mackay type.33 Moreover, they have part of such crust removed at the boundary and some silver atoms are placed at the fivefold vertices. We remark that this small deviation (their 422 signature differs 0.0262 from the seventh group) was detected by the clustering algorithm.
• (7) m = 35–53
All nineteen these structures are similar for their icosahedral symmetry to those already described in group 4 and 6, however they have the 55 perfect Mackay icosahedron covered with a partial anti-Mackay crust to form a ball-and-cup structure.56 Perfect symmetry is achieved when m = 45.
With very few exceptions, the global minima of 100-atom particles belong to some type of icosahedral family. Exceptions are found at a few extreme Cu-rich compositions and for pure Ag, where the best structures are decahedral. All structures with the lowest mixing energy are icosahedral, and the best ones (belonging to clusters 3, 6 and 7) present a 55-atom Mackay icosahedron covered either by a Mackay or an anti-Mackay incomplete shell. These results show that icosahedral and polyicosahedral structures take advantage from stress relaxation in Cu@Ag structures much more efficiently than decahedral ones.
We note also that the transition between neighboring clusters of structures often corresponds to a change in slope or to an inflexion point in the mixing energy curve (see the ESI,† Fig. S2).
Rtraining2 = 0.999013 Rtest2 = 0.997736 |
Rtraining2 = 0.999591 Rtest2 = 0.998537 |
![]() | ||
Fig. 5 Mixing energy profile for AgCu nanoalloys with N = 200 atoms in total, and SVR fit for the case of 80–20 split. |
The representation of these nanoalloys in the same two dimensional space described by the 422 and 55 CNA signatures is given in Fig. 6A.
When K-means is implemented, and the silhouette score is plotted as a function of the number of clusters K, one obtains the plot given in Fig. 7A. The optimal number of clusters, according to this criterion, is then K = 5. The Voronoi tessellation, along with each cluster center, is shown in Fig. 6B. A representative structure for each family is shown in Fig. 7B. Further details and images are given in the ESI.† The different structural families can be described as follows
![]() | ||
Fig. 7 (A) Silhouette score as a function of K for the case N = 200. (B) Five representative structures, one for each family. Labels are referred to Fig. 6B. The five structures have 10, 106, 168, 42 and 80 silver atoms respectively. |
• (1) m = 0–30, m = 196–200
Marks decahedra. In Cu-rich decahedra, Ag atoms fill the vertices first, then they start filling (100) facets. In Ag-rich decahedra, the few Cu are atoms placed in the fivefold axis.
• (2) m = 104–108
Core–shell FCC–HCP fragments.
• (3) m = 160–174
Core–shell asymmetric icosahedra.
• (4) m = 32–54, m = 102, m = 110–152, m = 176–194
Incomplete core–shell structures, with a Cu icosahedron of N = 147 atoms covered by a Mackay Ag crust. Ag atoms initially fill the vertices, and later the edges. Finally they become part of the surface of the core 147 icosahedron. Again as in the case with N = 100, this is the largest cluster of structures.
• (5) m = 56–100, m = 154–158
As group (4), but with an anti-Mackay crust.
Also for size 200, a vast majority of global minima is of icosahedral structure. The exceptions are the decahedra found for Ag-rich and Cu-rich compositions and few FCC–HCP structures at intermediate compositions. These results confirm the key role of stress relaxation in determining the most stable structures also at this larger size of 200 atoms.
First of all, we applied the clustering algorithms in the three-dimensional (3D) space of (555, 422, 421) signatures. Then we applied clustering in the spaces of two signatures at a time: (555, 421) and (422, 421) besides (555, 422), and finally to 1D spaces of one signature at a time, i.e. (555), (422) and (421) separately. We compared all clustering choices finding the following results:
– For size 100, only the pair (555, 422) was able to reproduce exactly the same clustering as the 3D description, while (555, 421) and (422, 421) were giving somewhat different clusters. On the other hand, the clustering obtained by 1D descriptions was significantly different.
– For size 200, only the pair (422, 421) was able to reproduce the same clustering as the 3D description. At variance with the (555, 422) description, (422, 421) gave six clusters instead of five, due to the splitting of cluster 4 of Section 3.2 into two new clusters corresponding to Cu-rich and Ag-rich parts. These two parts differ by a small distortion of the Ag crust in the Ag-rich part which eliminates the few 421 signatures that are present in Cu-rich nanoparticles.
From these results it turns out that (555, 422) and (422, 421) are the best choices to capture the physically relevant clusters of the full 3D description. However we note that the (422, 421) pair would be unable to separate well nanoparticles with fivefold symmetries and nanoparticles with extended hcp domains or several stacking faults. The latter were not present in our samples of global minima for sizes 100 and 200, but they may appear in samples including also higher energy isomers. For this reason we prefer the (555, 422) pair as the most useful 2D description. We note that this procedure of selecting the minimal set of variables for clustering of structures can be generalized to multi-dimensional cases including many CNA signatures and also other variables. Work in this direction is in progress.
![]() | ||
Fig. 8 (A) Data set representation in the two dimensional space of 422 and 555 order parameters. (B) Gaussian mixture model plot with centers and contours for K = 7 distributions. |
![]() | ||
Fig. 9 (A) BIC and (B) silhouette scores for both full and reduced data sets. In (A) the curve for the reduced data set has the y-axis range given on the right part of the plot. |
If we consider only structures which differ at most 0.5 eV from the global minimum, we obtain another representation of such nanoalloys, as shown in Fig. 10A.
With such restriction, the amount of structures reduces to 90. BIC and silhouette score suggest (as expected) four clusters to be the optimal subdivision of the reduced data set, as shown in Fig. 9(A) and (B). A representative structure for each family is instead depicted in Fig. 11.
![]() | ||
Fig. 11 Representative structures for the four clusters of Fig. 10B. Top left is an icosahedron with a Mackay crust (black cluster), top right is an icosahedron with mixed Mackay and anti-Mackay crust (yellow cluster), bottom left is an icosahedron with full anti-Mackay crust (purple cluster) and finally bottom right is a polyicosahedron (red cluster). |
When the HSA is implemented, the probability for these four structural families as a function of temperature can be calculated, as shown in Fig. 12. Specifically, the relative probability P of family is given by
![]() | (17) |
![]() | ||
Fig. 12 Probability P of the different structural families as a function of temperature in the range 200–600 K. P is calculated by eqn (17). The curves refer to icosahedra with Mackay crust (black), icosahedra with mixed Mackay and anti-Mackay crust (yellow), icosahedra with full anti-Mackay crust (purple) and polyicosahedra (red). The colors correspond to those of the clusters of Fig. 10B, while representative structures are given in Fig. 11. |
The mixing energy was found to be negative, for all compositions at size 100 and for the vast majority of compositions of size 200, with very few slightly positive values in the extreme Ag-rich limit. We attribute the negative values of the mixing energy to the possibility of forming non-crystalline structures with core shell Cu@Ag chemical ordering, which is the optimal chemical ordering due to the larger size and lower surface energy of Ag compared to Cu. This chemical ordering, which is possible at the nanoscale with no counterpart in the bulk limit, allows an efficient stress relaxation which helps in stabilizing nanoalloy structures with respect to elemental nanoparticles.32,35 In fact, the structures with the lowest mixing energy are found in the composition range where the inner part and the surface layer are almost completely made of Cu and Ag atoms, respectively. The most efficient stress relaxation is achieved by structures of the icosahedral or polyicosahedral families, even for sizes, such as 100 and 200, that are far from icosahedral magic numbers. The stabilization of icosahedra by stress relaxation was already found for nanoparticles of sizes below 50 atoms, in several systems (AgCo, AgNi, AgCu and others).27,32,35 Here we have shown that stress relaxation is still very effective in stabilizing icosahedral structures at significantly larger sizes, and for wide composition intervals.
The global optimization data were used as a benchmark for the use of machine-learning techniques with the aim of answering to the following questions:
1. Is it possible to accurately fit the mixing energy by a suitable expression?
2. Is it possible to automatically group the nanoalloy structures into physically meaningful families?
As regards question 1, our calculations showed that the SVR algorithm is able to produce very accurate fits, for both N = 100 and N = 200. These results indicate that the same procedure is likely to produce accurate functions for interpolating the mixing energy also in cases in which the search of the global minima for all compositions might be very cumbersome (think for example about nanoalloys of sizes 500 or 1000), so that interpolating on the basis of the results of a limited set of compositions may be quite useful. We note also that the interpolation procedure by the SVR method includes a cross-validation of the result of the fitting that allows to determine the reliability of the fit itself.
As regards question 2, thanks to the clustering algorithms, we successfully divided each data set in distinct groups, thus underlying the variety of structural families present in AgCu nanoalloys of size N = 100 and N = 200 atoms. In particular we found that even without a previous manual classification of such nanoalloys, it is still possible to recover a physically meaningful as well as detailed separation into different clusters of structures. This result was achieved also thanks to the clever description given by the CNA signatures, which made possible the success of the two algorithms used, in a two dimensional space. We remark the importance of our choice of using unsupervised learning algorithms, for it would be surely possible to train a supervised classification model on a manually labelled data set, but more time consuming and, at some point, subjective. In order to cluster the data sets of the global minima for N = 100 and 200, we used K-means which produced a hard partition (Voronoi tessellation) of the two dimensional space of the two order parameters relative to 422 and 555 signatures. The classification obtained by the (555, 422) pair was compared also to those obtained by using other CNA sets of variables, discussing how the classification depended on the choice of the set.
A different algorithm, the Gaussian mixture model, was instead used to perform the same task on a slightly more complicated case, that is the one offered by the two dimensional representation of some of the local minima found in the global minimization of Ag64Cu36. Here the superior flexibility of such algorithm with respect to K-means was crucial to find a good separation into clusters.
The usefulness of the clustering of the minima of Ag64Cu36 was then demonstrated by calculating the equilibrium probabilities of the different structural families depending on temperature. Finally, we note that the approaches developed in this work do not rely on any specific feature of the AgCu system, so that they can be easily used to treat other nanoalloys.
Footnote |
† Electronic supplementary information (ESI) available: Parameters of the machine learning fit. Details on nanoalloy structural families. See DOI: 10.1039/d1cp02143e |
This journal is © the Owner Societies 2021 |