A. J. C.
Varandas
abc
aSchool of Physics and Physical Engineering, Qufu Normal University, 273165 Qufu, China
bDepartment of Physics, Universidade Federal do Esp rito Santo, 29075-910 Vitória, Brazil
cDepartment of Chemistry, and Chemistry Centre, University of Coimbra, 3004-535 Coimbra, Portugal. E-mail: varandas@uc.pt
First published on 28th November 2025
Quasi-molecule theory and ‘tiles’ embedded on the parent molecules are used to investigate whether calcium hexa-carbides can attain linearity irrespective of their size. By assuming that some small carbon clusters may behave pseudo-atomic-like, the approach is generalized to explore ‘pseudo-linearity’ among the calcium hexa-carbides. It is tentatively conveyed that the atomic basis sets used for studying families of related molecules, rather than individual ones, should be maintained constant for the whole set such as to warrant that any differences in the observed trends are not driven by computational approximations or numerical inaccuracies. In the absence of experimental data, low-hierarchy (often subminimal) basis-set calculations are employed for probing the predictions by quasi-molecule theory for the parent molecules, while augmented correlation-consistent basis sets can efficiently be used for accurate correlated calculations in the tiles they embed. Besides hexa-carbides, other calcium carbides, planar and volumetric, are also studied.
One may start by questioning whether a molecule needs to be strictly (pseudo-)linear or if it suffices being quasilinear (mildly nonlinear). The answer depends on the context and property under consideration. Quasi-linearity may suffice for many purposes, provided the deviations do not drastically alter the molecular properties of interest. Nevertheless, when precision is key such as in high-resolution spectroscopy or specific reaction pathways, being linear or not can be a critical issue.
Assuming atoms as pointwise entities, quasi-molecule theory5–13 can decipher linearity and planarity or otherwise of molecules (called parents) at their equilibrium geometries from sets of atoms (‘tiles’) that are embedded on them. Stretching the concept, one wonders in the current work whether small molecules, rather than single atoms, can be conceived themselves as pointwise, thus allowing the generalization of quasi-molecule theory to predict the shape of pseudo-linear (although not tested yet, a similar extension is conceivable to handle pseudo-planar) parents. Pseudo-linearity is discussed here by focusing on alkaline-earth-metal carbides, so far little studied in the literature, specifically calcium carbides. In the absence of experimental data, ab initio and DFT calculations must unavoidably be performed to probe the predictions made for the members of the CaC6-family, all neutral systems. For a broader perspective, related calcium carbides of other shapes will also be studied here.
Materials may not exist in isolation. However, if they are large enough, their boundaries can be assumed to not have a significant effect on their electronic states. Periodic boundary conditions14 are then commonly assumed in computer simulations, and mathematical models used for approximating a large (even infinite) system by using just a small part of it called a unit cell. Although the geometry of such a cell commonly satisfies perfect two-dimensional tiling, three-dimensional periodic boundary conditions are also useful for approximating the behavior of macro-scale systems of gases, liquids, and solids. In turn, one-dimensional tiling is used in the treatment of quantum wires (ref. 15, and references therein). However, rather than hypothesising and performing simulations, it may be sensible to ask can unit cell-like clusters and even the full 2D materials be themselves predicted?
Regarding calcium carbides, CaC and CaC2 are formed under laboratory conditions and their existence is likely in primordial clouds such as molecular clouds or interstellar clouds. Because CaC is less stable than CaC2, it may require high-energy conditions for its formation, such as a high-temperature environment or pressure conditions. In turn, CaC2 has already been identified in the IRC+10216 nebula,16 and reaction dynamics studies on its potential energy surface (PES) were recently carried out17 using the internally contracted multireference configuration-interaction (MRCI+Q) method and augmented correlation consistent basis sets.18,19 Moreover, a three-dimensional PES has been reported for the c-CaC2–He van der Waals complex at the explicitly correlated coupled-cluster level, and close-coupling calculations were performed on it to determine cross-sections for rotational transitions through He collisions with c-CaC2.20
In turn, the C6 molecule is deemed relatively abundant in the circumstellar shells of carbon stars thanks to the favorable conditions for formation of carbon chains. Although the presence of metals like calcium is scarce in such environments (and possibly predominant as ionized calcium Ca+ or Ca++ due to high-energy UV radiation and stellar winds), it is conceivable that it may interact directly with carbon to form compounds like CaC6. To our knowledge, the existence of the CaC6 molecule has not yet been reported, with most studies and syntheses involving solid-state forms where calcium is bonded to a network of carbon atoms.21–23 Its gas-phase formation is then challenging mostly due to the reactive nature of calcium and its tendency to form stable ionic compounds or solid-state carbides. Nonetheless, even the latter are not widely encountered outside of research environments, and their properties and practical applications are still the subject of scientific investigation. In this work, we will study the family of gas-phase CaC6 molecules using ab initio molecular orbital (MO) methods and density functional theory (DFT). Besides offering a relevant system for theoretical work, it may help in elucidating how to avoid the above-mentioned issues in electronic structure calculations affect the rigorous study of large molecules and molecular materials.
Consider the following question: can the geometry of a molecule be rationalized as pseudo-linear or otherwise from the geometries of the quasi-pseudo-triatomic tiles (pseudo-tiles or just tiles for simplicity) it embeds? The answer is positive as implied by the lemma: If all triads of points (atoms or pseudo-atoms) in a set are on a line, they are all on the same line.
This lemma is valid for atoms (recall, not themselves true points) and, in the present work, under the assumption that C6 is considered a pseudo-atom. Of course, such an approach is best judged by the results one obtains for pseudo-linear molecules like the calcium-hexacarbides studied here. Recently reported5 as Lemma 1 (Lemma 2 is for planarity,5,6,8,13 with the tiles being real tetratomics in this case; see also later), the reader is addressed elsewhere10 for details.
Rather than rationalizing the results for the title systems from specific optimizations for the parent, the aim is to provide a background theory to predict molecular geometries without resorting to direct calculations or experimental data. Of course, in the absence of experimental information, direct ab initio or DFT calculations must be done to probe the predictions made from the present approach for the parent molecule, named quasi-molecule theory. One may argue that it is sufficient, for example for a molecule with D∞h symmetry, that a non-vanishing dipole moment arises to conclude whether the employed method is unsatisfactory. This is certainly the case, but how to proceed when the molecule is: (i) less symmetrical, such as in this case with pseudo-linearity; (ii) experimentally unknown; (iii) not amenable to sufficiently accurate electronic structure calculations?
Although the structure of a stable molecule should be minimum, one wonders whether predicting a saddle point may suffice. The answer is positive, yet uncommon to some extent. Recall that in the absence of a push along the out-of-linearity coordinate associated with the imaginary frequency, one cannot distinguish the two (minimum versus saddle point) stationary points, since the Lemma involves no forces. Of course, a single point calculation along the bending (or another convenient) coordinate may suffice to disclose its nature at the chosen level of theory. Recall further that a saddle point may still behave as a minimum if vibrational stabilization is brought into play.24,25 This can only be certified if a wider portion of the PES is calculated, which will not be attempted here. Another remark to note that one can disclose the nature of stationary points in quasi-molecule theory by employing a bisection method reported elsewhere,5,6,8 thus avoiding to perform ab initio or DFT calculations for all possible tiles (triatomics, for linearity) embedded on the parent molecule. This can greatly facilitate things, since the combinatorial law could make the task tedious and eventually unaffordable.
![]() | ||
| Fig. 1 Planar and linear structures of ground singlet C6: top and bottom panels, respectively. Panels (a) on left-hand-side are with the 3-21G basis set, those on right with AVTZ. Energies and bond distances (in Eh and Å, except when specified otherwise) are indicated top-down for DFT using the B3LYP and M06-2X functionals with the AVTZ basis set and, for CCSD(T) with the VTZ basis set, where signaled with an asterisk. The stabilities of the linear isomers relative to the planar congeners are shown in parenthesis. For convenience, the total energies along the work are given as the Molpro37 output. | ||
Panel (a) in the left-hand-side of Fig. 2 shows CaC6 at the DFT B3LYP34–36 level with the 3-21G basis set, where the ring-shaped C6 is viewed as a pseudo-atom. In turn, the right-hand-side panels (b) and (c) show Ca(C6)2 and Ca2C6 optimized at the same level of theory. Both are minima, and hence can be viewed as pseudo-linear triatomics.
Naturally, such molecules reveal shapes that depend somewhat on the level of theory. This is evinced in Fig. 1, which illustrates the actual shapes of ground-singlet C6 at the DFT level of theory using both the 3-21G and the recommended AVTZ basis sets, the first of which is the only one affordable for the larger clusters considered in the present work. Corresponding structures have been obtained for Ca(C6)2 and Ca2C6 at B3LYP and M06-2X38,39 levels of theory with AVTZ basis sets; see the top and bottom panels of Fig. 3, respectively. However, no converged structure has been found for CaC6Ca with the M06-2X functional. All attempts for CaC6Ca with B3LYP and M06-2X and AVTZ basis sets tend to yield structures with lower symmetry compared to the 3-21G structure as well as broken forms of C6 followed by formation of a Ca–Ca bond. Conversely, results similar to described above with the 3-21G basis set are obtained when using the same functionals and the MINI40 basis set. Moreover, the above findings hold when Grimmes's correction41 for the dispersion interaction is used, i.e., when Grimme's original DFT-D3 is employed in the current work. Of course, other stationary points may be expected, as noted in previous work,5,7–10,13 such a possibility should not invalidate the results reported here.
The tendency to form a Ca–Ca bond is actually corroborated in Fig. 4, which shows the easy to converge Ca2C6 minimum structure at the B3LYP/AVTZ level of theory. It turns out to be predicted as a saddle point at this level of theory, with a small imaginary frequency (−19.3 cm−1) for bending. This is not surprising since the inverse sandwich with C6 between the two Ca atoms is predicted to be a minimum at the B3LYP/3-21G level, with a more negative energy than CaCaC6; cf.Fig. 2(c) and 4(a). This pattern is maintained when including the dispersion correction, and in more elaborate calculations with the AVTZ basis set; see Fig. 2 and 4 for the corresponding energies.
![]() | ||
| Fig. 4 Pseudo-linear triatomics CaCaC6 at B3LYP/3-21G [panels (a) and (c)] and B3LYP/AVTZ [(b) and (d)] levels of DFT. Side views are shown in the lhs, while top views are shown in the rhs. | ||
One should now address the intriguing fact in Fig. 2 related to the six bonds apparently imparted by the Ca atom in CaC6, a number that raises to 12 in Fig. 3 for Ca(C6)2. Such plots are obtained with the WXMACPLOT package by setting to default the standard criterion for lengths and normal bonding tolerance, as well as guessing the bond order and assigning hydrogen bonds. In fact, such a criterium is followed in drawing all stick-and-ball plots in the present work. Naturally, one cannot assign a single-bond to every stick; see also later. In fact, no obvious atomic hybridization can be invoked when knowing that calcium is an alkaline-metal atom typically with only s and p valence orbitals, thence electronic configuration 1s22s22p63s23p64s2. It should also be recalled that C6 can assume other isomeric forms, in particular the linear one in Fig. 1.
Known to vary in size and level of sophistication, modest basis sets (sub-minimal28 such as Pople's STO-nG46 and split-valence 3-21G,26 Huzinaga's40 MINI, or minimal like Dunning's18 correlation consistent cc-pVDZ) may allow for a uniform computational approach across a wider portion of the target family of molecules selected here. This is important for a comparative study, since the consistent use of the same basis set helps to mitigate the risk of introducing errors or inconsistencies due to varying the atomic bases. Indeed, switching atomic basis sets may lead to unexpected or non-systematic shifts in the calculated properties as implied by some of the results reported here. Because the molecular orbitals are approximated by the atomic basis sets, the results are likely to be comparable only when the latter are the same for all molecules. Keeping the atomic basis set constant will then help to maintain the uniformity of the comparison while ensuring that any differences in the observed trends are not driven by differences in computational approximations, thence are expected to be the intrinsic properties of the molecules rather than possible artifacts of the employed methodology.
If the approach described in the previous paragraph is adopted, the affordable basis set should be kept constant or vary in size in some (unknown) compensating ways within the range of studied molecular sizes. For the singlet ground state of C3, the detailed results of Table 1 (also Fig. 5) show that keeping the basis small enough can be a promising way to proceed if some loss of accuracy in the energetics can be tolerated. Can this support the fact that the use of a diffuse basis set, a curse for sparsity,4 can be problematic and hence should be avoided when aiming at an accurate geometry at a moderate cost? In fact, the results in Table 1 even appear to promote the joint use of a small basis set with the cost-effective DFT method. Conversely, the CCSD(T) results in Fig. 5 cast doubt on its use even with the AVDZ and AVTZ basis sets. In fact, this extends to the use of the latter with the M06-2X functional. It should be recalled that (“exact”) full configuration interaction (FCI) calculations with the MINI basis set predict a minimum linear structure10 in good agreement with the experimental data.47,48 Of course, the flat nature of the ground-state C3 PES in the region of linearity is well established49 (cf. also attributes at the B3LYP/AVDZ level of theory in Table 1, where both a saddle point and a minimum are scrutinized close to each other in energy), and the minimum known to occur at linear geometries.47,48 Also, the decreasing pattern of the imaginary frequency at the M06-2X level of theory is observed with enhancement of the basis set, which suggests that a real frequency for bending could even arise if the basis-set enhancement was pursued.
| System | Energy/Eh | R/A | Angle/deg | ω 1 /cm−1 | ω 2/cm−1 | ω 3/cm−1 |
|---|---|---|---|---|---|---|
| a For further comparisons with CCSD(T) calculations using larger basis sets, see Fig. 5. b Double-degenerate. | ||||||
| M06-2X/3-21G | −113.35296850 | 1.2961 | 180.0 | 178.2 | 1250.4 | 2184.5 |
| B3LYP/3-21G | −113.34470023 | 1.2978 | 180.0 | 164.6 | 1220.9 | 2152.7 |
| M06-2X/VDZ | −114.00965632 | 1.2982 | 180.0 | 115.6 | 1267.8 | 2185.3 |
| B3LYP/VDZ | −113.99039153 | 1.3012 | 180.0 | 167.8 | 1240.3 | 2155.7 |
| M06-2X/AVDZ | −114.01376535 | 1.2982 | 180.0 | −151.3 | 1262.2 | 2171.4 |
| B3LYP/AVDZ | −113.99539701 | 1.3014 | 180.0 | −117.4 | 1235.5 | 2142.9 |
| B3LYP/AVDZ | −113.99550276 | 1.3013 | 171.7 | 119.9 | 1258.5 | 2128.9 |
| M06-2X/VTZ | −114.03328782 | 1.2866 | 180.0 | −43.3 | 1258.6 | 2164.6 |
| B3LYP/VTZ | −114.01954681 | 1.2879 | 180.0 | 105.6 | 1239.0 | 2145.6 |
| M06-2X/AVTZ | −114.03432259 | 1.2861 | 180.0 | −49.1 | 1257.6 | 2160.4 |
| B3LYP/AVTZ | −114.02074756 | 1.2873 | 180.0 | 102.5 | 1238.3 | 2142.4 |
| M06-2X/VQZ | −114.03898463 | 1.2854 | 180.0 | −34.1 | 1259.4 | 2159.9 |
| B3LYP/VQZ | −114.02681463 | 1.2868 | 180.0 | 98.3 | 1238.8 | 2139.9 |
| M06-2X/AVQZ | −114.03942668 | 1.2853 | 180.0 | −17.5 | 1259.4 | 2159.4 |
| B3LYP/AVQZ | −114.02735668 | 1.2866 | 180.0 | 91.8 | 1238.8 | 2139.4 |
| CCSD(T)/VDZ | −113.74247109 | 1.3206 | 180.0 | 108.3 | 1184.8 | 2084.6 |
| MRCI/CBS49 | 1.3092 | 180.0 | 56.1 | 1171.0 | 2060.9 | |
| Exp.47,48 | 1.2970 | 180.0 | 63.42 | 1224.20 | 2040.02 | |
Often known from experiments that molecular geometries are frequently unveiled via ab initio calculations and correspond to minima on the Born–Oppenheimer (BO) PES.50 Of course, as it is already clear, variability in the results may arise from the level of theory and inclusion of electron correlation effects. Being a basic enterprise in chemistry,51,52 the common dialog is MO theory versus DFT, which implies solving the electronic Schrödinger equation using either the molecular orbital wave function1,2 or the electronic density,53,54 respectively. Both yield approximate solutions: in MO theory, because of the cost-scaling law of such calculations and inability to saturate the basis set used to represent the molecular orbitals;29,30 in DFT, due to the lack of a rigorous density functional.55,56 Indeed, although Kohn–Sham (KS) DFT bears an exact formulation of quantum mechanical electronic structure theory, it relies on approximate exchange–correlation functionals,57 with one out of the many functionals being the best for one system but not for another.58 As already noted, the common belief that DFT is more trustable than MO is actually due to the fact that its adoption offers an excellent trade-off of accuracy versus cost. Specifically, we have chosen B3LYP and M06-2X, two of the most extensively benchmarked and validated functionals in quantum chemistry. Besides good reliability, B3LYP demonstrates a balanced accuracy versus cost, and includes a fraction (20%) of Hartree–Fock exact exchange; thence, it is a practical choice for medium- to large-sized molecules. In turn, with a distinct philosophical design, M06-2X has a high HF exchange (54%), hopefully reducing self-interaction errors, and often giving better reaction barriers, noncovalent interactions, and thermochemistry.
Still, DFT tends to overshoot the effect of correlation at the expense of exchange by predicting homogenized electronic structures with delocalized electrons59,60 (cumulenic rather than alternating bond length or bond angle structures). However, KS DFT routinely provides structures and energies for up to a few hundred atoms, with recent efforts61–63 (and references therein) directed toward the development of even lower-cost DFT formulations. In particular, the semiempirical tight-binding model63 has been primarily designed for the fast calculation of structures and noncovalent interaction energies for molecular systems with roughly 1000 atoms. Its performance is illustrated in Section 3 through comparisons with conventional DFT results reported here.
As it is well established, modern electronic structure calculations employ mostly Gaussian orbitals for the wavefunction or the density. Despite their advantages due to the analytic solution of the involved integrals, there are disadvantages due to the large number of basis functions required to accurately represent the nuclear cusps in the wave function and lack of orthonormality: in particular, convergence to the complete basis set (CBS) limit cannot be necessarily achieved by adding more basis functions.4,64 Since the goal here is the shape of the molecule at its equilibrium geometry, it is hoped that large basis sets can be avoided, and that their sizes can be reduced to a minimum trading some accuracy in the energetics for increased locality4 (parsity in the 1-particle density matrix, 1-PDM) and computational efficiency.
Employing standard codes for the calculations of a variety of systems belonging to a family, one wonders whether their intrinsic default parameters are the best to use for the purpose. Because the target is geometry optimization rather than energetics along a pathway, it appears reasonable to trust defaults as a safe approach, although not necessarily optimal.65 However, there are a few high-leverage defaults that are worth overriding. This is here illustrated for the prototypical CaC6 molecule by considering a few grid checks in the involved numerical integrations. Briefly, to assess the grid sensitivity of default settings, CaC6 was re-optimized with MOLPRO37 at B3LYP-D3/3-21G and aug-cc-pVTZ basis-set levels using both the default pruned grid and a dense unpruned grid. The differences for the default 3-21G were within ≤0.004 kcal mol−1, ≤0.001 Å and ≤0.1 deg and, for AVTZ, within ≤0.001 kcal mol−1, ≤0.0005 Å and ≤0.1 deg. This supports defaults to be adequate, besides less time consuming, for the broader set. They are then employed in all DFT calculations here reported, except for convergence where tight criteria are often employed.
| Can(C6)m(1A) → Can−1(C6)m−2(1A) + Ca(C6)2(1A) | (1) |
| → Can−2(C6)m−1(1A) + Ca2C6(1A) | (2) |
Before addressing eqn (1) and (2), consider first the cases up to CaC6, where the linear C6 is treated as usual. One has:
| CaC6(1A) → C6−m1CaC6−m2(1A) + Cm1+m2(1A) (2 ≤ m1 + m2 ≤ 6) | (3) |
Interestingly, all cases in eqn (3) with m1 = 2 and an even m2 turn out to be saddle points either when using the B3LYP or M06-2X functionals and the 3-21G or AVTZ basis sets. Note that CaC2 is predicted to have an imaginary bending frequency of −130.6 cm−1 (−163.6 cm−1) when calculated at the B3LYP (M06-2X) level of theory with the AVTZ basis set. In fact, the same is predicted at the CCSD(T)/AVTZ level but with an imaginary frequency for bending of −131.8 cm−1; this saddle point connects the two isosceles triangular geometries (C2v symmetry) where the C–C and two equal Ca–C bond lengths are, respectively, 1.2771 Å and 2.2723 Å, see Fig. 6. If CCSD(T)/AVTZ* (CCSD(T)/AVQZ*) is used (the asterisk signals the use of the cc-pCVXZ basis68 for Ca), the energy of CaC2 is −752.74818245Eh (−752.76724578Eh), and the above distances read 1.2771 Å (1.2730 Å) and 2.2680 A (2.2652 Å), in the same order. These results show satisfactory agreement both with the AVTZ ones and those reported17 at the MRCI+Q/AVQZ level of theory with V(Q+d)Z42,68 and AV(Q+d)Z18,69 basis sets for Ca and C, respectively, 1.268 Å and 2.233 Å. They also compare well with previous CCSD(T) results for the pair (RCC, RCaC2), which include extrapolations from up to quintuple-zeta basis sets:16 (1.2685, 2.1078) versus the current (1.277, 2.180) values, all in Å. The calculated CCSD(T)/AVTZ coordinates and frequencies are given in the SI.
Regarding CaC3, the two possible linear structures in Fig. 7 are predicted to be minima with B3LYP/3-21G, while CCaC2 is a saddle point when using the AVTZ basis. Similar considerations apply to CaC4, although showing a small bending frequency of −30.5 cm−1 (2.8 cm−1) at the B3LYP (M06-2X) level of theory with the AVTZ basis set. Also, note that CaC4 is predicted to be a saddle point with B3LYP/AVTZ but a minimum when employing the M06-2X functional with the same basis set. It turns out to be a minimum also at the CCSD(T)/AVTZ level of theory, with a low-vibrational frequency for bending of 28.0 cm−1. Even if repeatedly stated, recall that the specific nature of the above stationary points is not crucial to feed, via Lemma, the chain that leads to other stationary points up to m1 + m2 = 6 in eqn (3).
Notably, except for linear C2CaC2 for which both B3LYP and M06-2X functionals predict saddle points with the 3-21G and AVTZ basis sets, the predictions tend to be minima with B3LYP while a few are predicted to be saddle points with M06-2X as summarized in Fig. 7. This occurs mostly for Cm1CaCm2 when m2 is even: m2 = 2, 4, 6. In particular, when m2 = 2, the tendency of that side of the molecule is to form a CaC2 triangle-shaped end following the pattern for the pristine triatomic. In fact, the absolute minimum of CaC3 at the B3LYP level shows the Ca atom bonded to the three C atoms in a planar arrangement with two equal C–C bonds of 1.311 Å and three Ca–C ones close to 2.372 Å, see Fig. 6. Because the cost-expensive harmonic vibrational frequencies have not been reported for such calcium carbides, they are gathered in Table 3. As shown, the DFT results show fair agreement with the more expensive CCSD(T) ones, which extends to cases where the 3-21G basis set is employed. Fig. 6 shows also good agreement between the predicted C–C bond distances with the 3-21G basis set and the cost-expensive CCSD(T)/AVTZ values. This may help trusting the results for larger molecules, where subminimal bases are the only affordable.
In the above context, one may ask whether DFT works better with small atomic basis sets than do MO methods. The answer is positive, because DFT focuses on the electron density, and hence is less sensitive to the exact form of the wavefunction. In fact, DFT methods seem to capture important electron correlation effects with smaller basis sets compared to wavefunction-based ones, which require more computationally expensive post-Hartree–Fock treatments (MP2, CCSD, etc.) to include correlation accurately. This is illustrated in Fig. 6 and Table 2, where DFT methods are shown to efficiently incorporate electron correlation and even achieve satisfactory accuracy with basis sets as small as 3-21G. Conversely, MO-based methods require larger basis sets to describe accurately the electronic structure, especially when accounting for electron correlation. This is particularly advantageous when studying large molecules (or periodic systems), an appreciation often taken for granted in the literature.
| System | 3-21G | VTZ or AVTZ | ||||
|---|---|---|---|---|---|---|
| E | ZPE | ΔEb | E | ZPE | ΔEb | |
| a The energies of Ca are −674.14453689, −677.51058418 and −677.51068053Eh with the 3-21G, VTZ and AVTZ basis sets, respectively. For linear C6, with 3-21G, VTZ and AVTZ basis sets, they are −226.83519419, −228.18423529 and −228.18646661Eh, with ZPEs of 0.02404867, 0.02312140 and 0.02301653Eh, in the same order. For the cyclic C6, the corresponding values are −226.82853636, −228.19555210 and −228.19764711Eh, with ZPEs of 0.02744775, 0.02743928 and 0.02739578Eh; see also Fig. 1. b See eqn (4). c With the AVTZ basis set. d With B3LYP-D3 and default convergence. e Not done: too computationally expensive. f At the B3LYP/AVTZ level of theory, the pseudo-linear CaC6Ca structure is a saddle point with two nearly degenerate frequencies for bending of −9.0 cm−1 and 12.6 cm−1. | ||||||
| Linear | ||||||
| CaC6 | −901.06858985 | 0.03094199 | −51.434 | −905.78306096c | 0.02739578c | −51.164 |
| Ca(C6)2 | −1128.06787510 | 0.05850426 | −152.198 | −1134.14819453c | 0.05045526c | −154.716 |
| Ca2C6 | −1575.18279586 | 0.03469092 | −30.049 | −1583.32865881c | 0.02643229c | −69.411 |
| Ca2(C6)2 | −1802.27545063 | 0.06534889 | −187.460 | −1811.78196277d | 0.05349781d | −241.634 |
| Ca2(C6)3 | −2029.35873151 | 0.09266875 | −341.084 | −2040.13333320d | 0.07755946d | −345.923 |
| Ca3(C6)2 | −2476.58775192 | 0.06859120 | −290.699 | —e | —e | —e |
| Ca3(C6)3 | −2703.59447314 | 0.09854201 | −394.631 | —e | —e | —e |
| Ca3(C6)4 | −2929.15863530 | 0.12353628 | 403.547 | —e | —e | —e |
| Ca4(C6)3 | −3377.84402899 | 0.10333983 | −457.520 | —e | —e | —e |
| Ca4(C6)4 | −3603.57559328 | 0.13078367 | 237.148 | —e | —e | —e |
| Ca4(C6)5 | −3831.87307989 | 0.16173424 | −676.123 | —e | —e | —e |
| Ca5(C6)4 | −4278.00083235 | 0.13783533 | 65.430 | —e | —e | —e |
| Pseudo-linear | ||||||
| CaC6 | −900.98570246 | 0.02151673 | −11.647 | −905.78306096c | 0.02285039c | −49.748 |
| Ca(C6)2 | −1127.90706435 | 0.04901827 | −69.862 | −1134.04040821c | 0.05124572c | −86.583 |
| Ca2C6 | −1575.19079086 | 0.02207718 | −49.292 | −1811.78348033c | 0.02346122cf | −60.476 |
| Ca2(C6)2 | −1802.03737204 | 0.04397965 | −64.095 | —e | —e | —e |
| Ca2(C6)3 | −2029.00409051 | 0.07181339 | −150.563 | —e | —e | —e |
| Ca3(C6)3 | −2703.22506643 | 0.06812440 | −200.844 | —e | —e | —e |
| Ca3(C6)4 | −2930.10424764 | 0.09467593 | −233.187 | —e | —e | —e |
| Ca4(C6)3 | −3377.38252624 | 0.06795410 | −209.060 | —e | —e | —e |
| Ca4(C6)4 | −3604.29718599 | 0.09241064 | −264.980 | —e | —e | —e |
| Ca4(C6)5 | −3831.20564924 | 0.11819916 | −316.177 | —e | —e | —e |
Corroborating the above, coupled cluster calculations show problematic convergence for CaC2 with the 3-21G basis set, as evinced in Table 3. This is likely due to the complex charge distribution in CaC2 due to the ionic bonding nature between calcium (Ca2+) and carbide (C22−) which, if the electron density is not well-behaved or is poorly initialized, can make the algorithm used in the calculation struggle to converge to a stable solution. Conversely, DFT appears to be less sensitive both with B3LYP and M06-2X, and possibly even with other functionals.
| System | ω 1 | ω 2 | ω 3 | ω 4 | ω 5 | ω 6 | ω 7 | ω 8 | ω 9 |
|---|---|---|---|---|---|---|---|---|---|
| a Empty spaces are not relevant: there are 3n − 6 vibrational frequencies where n is the number of atoms. b Not converged. c For the asterisk, see Fig. 5 and text. | |||||||||
| CaC2 | |||||||||
| M06-2X/3-21G | 346.2 | 469.9 | 1778.8 | ||||||
| B3LYP/3-21G | 302.2 | 431.6 | 1749.6 | ||||||
| CCSD(T)/3-21G | —b | —b | —b | ||||||
| M06-2X/AVTZ | 421.9 | 570.2 | 1865.2 | ||||||
| B3LYP/AVTZ | 384.6 | 558.2 | 1826.7 | ||||||
| CCSD(T)/AVTZ | 405.1 | 529.8 | 1757.5 | ||||||
| CCSD(T)/AVTZ*c | 407.2 | 530.5 | 1758.1 | ||||||
| CCSD(T)/AVQZ*c | 407.9 | 534.6 | 1768.3 | ||||||
| CaC3 | |||||||||
| M06-2X/3-21G | 241.7 | 273.0 | 327.5 | 600.4 | 1093.8 | 1573.6 | |||
| B3LYP/3-21G | 212.7 | 263.6 | 323.5 | 580.8 | 1062.8 | 1579.2 | |||
| CCSD(T)/3-21G | 67.0i | 230.4 | 234.6 | 500.2 | 1009.3 | 1500.3 | |||
| M06-2X/AVTZ | 183.0 | 300.0 | 460.6 | 717.1 | 1223.0 | 1506.4 | |||
| B3LYP/AVTZ | 201.8 | 288.9 | 453.4 | 689.7 | 1206.7 | 1497.8 | |||
| CCSD(T)/AVTZ | 159.4 | 297.0 | 446.3 | 689.8 | 1151.1 | 1407.9 | |||
| CaC4 | |||||||||
| M06-2X/3-21G | 232.9 | 328.4 | 352.8 | 398.8 | 493.5 | 576.7 | 942.1 | 1956.4 | 2095.8 |
| B3LYP/3-21G | 233.9 | 320.2 | 347.3 | 378.6 | 428.0 | 523.7 | 945.9 | 1894.1 | 2038.0 |
| CCSD(T)/3-21G | 219.9 | 326.4 | 345.3 | 380.8 | 398.0 | 509.2 | 892.0 | 1810.7 | 1930.2 |
| M06-2X/AVTZ | 277.2 | 342.0 | 415.5 | 454.5 | 504.3 | 620.0 | 1002.1 | 1942.9 | 2070.6 |
| B3LYP/AVTZ | 277.6 | 320.0 | 402.3 | 444.2 | 489.3 | 600.5 | 1003.8 | 1885.1 | 2010.7 |
| CCSD(T)/AVTZ | 258.6 | 335.5 | 400.9 | 431.1 | 455.0 | 571.0 | 960.4 | 1818.4 | 1951.4 |
We now turn to Fig. 8 where linear C6 is considered as a pseudo-atom. The results are mostly predicted as linear minima at the 3-21G level up to Ca4(C6)5 and Ca5(C6)4. Whether such a pattern is observed with larger basis sets can only be tested at an affordable cost for the few smaller members of the series. This is done here for the first four members with the non-augmented VTZ basis set, specifically Ca2C6, Ca(C6)2, Ca2(C6)3 and (CaC6)2, see Fig. 9 and Table 2. For CaC6Ca, the results with the B3LYP-D3 functional predict a saddle point with an imaginary frequency for bending (−83.8 cm−1), while a stable species with a low bending harmonic vibrational frequency (37.2 cm−1) is predicted with M06-2X-D3.
![]() | ||
| Fig. 8 Large linear triatomics at the B3LYP/3-21G level of theory. All are predicted as minima; for the energies, see Table 1. | ||
The Ca + C6 break-up energies are also reported in Table 2:
| ΔEsystem = Esystem + ZPEsystem − xECa − y(EC6 + ZPEC6) | (4) |
Finally, Fig. 10 shows the parent molecules when the pseudo-atoms are planar C6. Due to being computationally expensive, only a few test cases were considered at the B3LYP/AVTZ level of theory. Converged typically to <10−4Eha0−1, the results are gathered in Fig. 3 and 4. The same general pattern is observed: while the optimization with small basis sets (3-21G or MINI) tends to predict minima, the larger correlation consistent basis sets often predict the optimized structures as saddle points for bending (signaled by sp in red) or for rotation (sp in blue) of the planar C6 around the axis orthogonal to their plane. As noted above, calculations have also been performed with the ORCA43–45 code using the broadly parametrized self-consistent tight-binding (GFN2-xTB) model Hamiltonian recently introduced by Grimme and coworkers.63,70 The stationary points so obtained employ a minimal valence basis set of atom-centered contracted-Gaussian functions, which approximate Slater functions (STO-nG46). Polarization functions for most main group elements (typically second row or higher) are also employed, while hydrogen is generally assigned a single 1s function. The nature of the optimized stationary points is indicated in the rhs-most column of Fig. 10. They differ for two systems. One, Ca3C12, predicted at the conventional B3LYP/3-21G level to be a saddle-point for bending (with a double-degenerate imaginary harmonic vibrational frequency of −44 cm−1), while being predicted a minimum with GFN2-xTB. The other, Ca4C30, which yields Ca4C24 plus C6, located non-collinearly with both fragments assuming structures similar to their own in isolation. All other molecules in Fig. 10 are predicted as minima if rotations of C6 around the linear axis are ignored (the corresponding imaginary frequencies are typically of a few cm−1). In an attempt for clarification, additional optimizations were done for Ca5C30, Ca5C36 and Ca6C36, the latter involving 336 electrons. All yielded pseudo-linear stationary points of the saddle point type with one or a few minor imaginary harmonic vibrational frequencies for rotation of the C6 units, see Table 4. This may be understood from the orientation of those units relative to each other as suggested by their optimized structures. Note that the determination of the optimum relative orientation of such units would imply even tighter convergence requirements than the ones already employed, a procedure not deemed justified since the involved normal modes do not lead to bending (nonlinearity) of the optimized structures. Note further that the deviations from collinearity between the Ca atoms is typically ≤1 deg with GFN2-xTB. For example, the angle formed by the middle and terminal Ca atoms in Ca5C36 is ∠7,22,35 = 179.4 deg with B3LYP/3-21G and 178.2 deg with GFN2-xTB, values that are deemed as acceptable in view of the involved numerical intricacies. Similarly, for Ca6C36, the angle ∠35,7,42 is 179.8 deg and 179.8 deg with the two above methods, in the same order, with the distances between the terminal Ca atoms being 24.4 Å and 16.9 Å, respectively. The two structures so obtained are displayed in panel (a) of Fig. 11, top and bottom entries, respectively. From the trajectory of the optimization procedure, which leads from the conventional DFT to the extended-tight-binding (xTB) structure, it is visible that the C6 units tend to loose planarity particularly in the middle of the Ca6C36 structure. To some extent, this significant difference could be expected from the corresponding results for CaC6Ca, where the Ca–Ca distances are 5.01 Å and 4.57 Å, in the same order (note though that both methods predict a minimum pseudo-linear structure). Even more noticeable are the differences for Ca7C36 where the top structure shows a near D∞h symmetry whereas the bottom one is C∞h or even C1 like. Suffice it to note that the full length separating the two terminal Ca atoms in Ca7C36 is 19.8 Å with xTB rather than 29.3 Å with conventional DFT at the B3LYP/3-21G level. The difference is still large when comparing with the conventional DFT B3LYP/STO-3G value: 27.6 Å. In summary, despite significant differences in the predicted structures, partly to be ascribed to the basis sets, it is notable the celerity of the GFN2-xTB method.63 This is remarkable when compared with conventional DFT (even using the 3-21G or STO-3G basis sets), since it is well established71 that the cost of diagonalization can be overwhelming: it ranges from O(n3) to higher, depending on the sizes of the molecule and the basis set, and the algorithm.
![]() | ||
| Fig. 10 Large pseudo-linear molecules at the B3LYP/3-21G level of DFT; for the energies, see Table 2. Indicated as “min” is a minimum while “sp” in red indicates a saddle point for bending, and in blue for rotation of C6; the left column refers to B3LYP/3-21G while in the rhs column is for the semi-empirical GFN2-xTB method at the B3LYP/VTZ level. | ||
| System | 3-21G | VTZ or AVTZ |
|---|---|---|
| a With AVTZ. b With VTZ. c Not done. d For CaCaC6. e It shows a tiny imaginary frequency of −4.2 cm−1 for rotation of the middle C6. f As in (e) but −5.2 cm−1 for rotation of the middle C6. g As in (e) but −1.4 cm−1 for rotation of the middle C6. h As in (e) but two imaginary frequencies of −24 cm−1 and −15 cm−1 for rotations of the C6s. i As in (e) but one imaginary frequency of −4.4 cm−1. j As in (e) but three imaginary frequencies of −6.3 cm−1, −5.3 cm−1 and −1.9 cm−1. k As in (e) but two imaginary frequencies of −8.9 cm−1 and −5.4 cm−1. l No imaginary frequencies. m As in (e) but one imaginary frequency of −2.2 cm−1 (−2.6 cm−1 with B3LYP/STO-3G, and dipole moment norm 0.004 D). | ||
| Linear | ||
| CaC6 | 27.244 | 27.166a |
| Ca(C6)2 | 18.022 | 0.003b |
| Ca2C6 | 0.002 | 0.230b |
| Ca2(C6)2 | 75.095 | 70.698b |
| Ca2(C6)3 | 0.015 | 0.013b |
| Ca3(C6)2 | 0.005 | —c |
| Ca3(C6)3 | 40.466 | —c |
| Ca3(C6)4 | 31.820 | —c |
| Ca4(C6)3 | 0.026 | —c |
| Ca4(C6)4 | 66.040 | —c |
| Ca4(C6)5 | 0.106 | —c |
| Ca5(C6)4 | 55.914 | —c |
| Pseudo-linear | ||
| CaC6 | 13.892 | 10.914a |
| Ca(C6)2 | 0.009 | 0.016a |
| Ca2C6 | 0.044 | 16.798ad |
| Ca2(C6)2 | 34.224 | —c |
| Ca2(C6)3 | 0.027e | —c |
| Ca3(C6)3 | 5.753f | —c |
| Ca3(C6)4 | 0.039g | —c |
| Ca4(C6)3 | 0.006 | —c |
| Ca4(C6)4 | 17.145 | —c |
| Ca4(C6)5 | 0.026h | —c |
| Ca5(C6)4 | 0.019i | —c |
| Ca5(C6)5 | 5.845j | —c |
| Ca5(C6)6 | 0.058k | —c |
| Ca6(C6)6 | 5.857l | —c |
| Ca7(C6)6 | 0.017m | —c |
![]() | ||
| Fig. 11 Ca6C36 and Ca7C36 in panels (a) and (b), respectively, as optimized with conventional B3LYP/3-21G (top entry) and GFN2-xTB (bottom entry). The Ca–Ca bond distances are indicated (in A). | ||
Considered next are the dipole moments of the calcium hexacarbides here discussed, which can assume large magnitudes. Since molecules with large dipole moments are overall important in both understanding and harnessing a wide range of scientific phenomena, they can eventually stimulate some interest for both fundamental research and practical applications, particularly in areas where polar interactions are important. Obviously, if symmetrical, their dipole moments should vanish or be very small, except if some lack of symmetry occurs. Conversely, large non-symmetric linear or pseudo-linear ones can show huge dipole moments as gathered in Table 4 for up to Ca7(C6)6. Naturally, any differences in the molecular structure should reflect on the calculated dipole moments. For example, the dipole moments of Can(C6)6 (n = 5–7) are predicted with GFN2-xTB to be 0.173 D, 168.311 D, and 216.456 D which, for the last two, show drastic differences from the values calculated with the conventional B3LYP/3-21G. However, as noted by the authors,63 no atomic partial charges or molecular dipole moments have been used in the parameterization of the GFN2-xTB method or its predecessors, with the parameters fitted to yield reasonable structures, vibrational frequencies, and noncovalent interactions while focused on organic, organometallic, and biochemical systems of up to a few thousand atoms. Although performing well for relatively small dipole moments,63 it would be interesting to test how the method does for larger magnitude ones such as those here reported. Unfortunately, experimental data are nonexistent for comparison in the present case.
A further remark is that can the many types of calcium carbides be formed by adding atoms of Ca to a plasma of carbon? A positive answer is plausible, depending on the specific conditions of the plasma, such as temperature, pressure, and the ratio of Ca to C atoms. In fact, it is well known that calcium carbides may form in different types based on their stoichiometry and crystal structures. One of the most common is CaC2, where the carbide ion is present. This should typically form at high temperatures, where calcium can react with carbon in a ratio of 1
:
2, producing a solid carbide. Another is CaC6: commonly associated with graphite-like structures, where the calcium atoms are intercalated between carbon layers in a graphite-like lattice, it tends to form under different regimes often involving higher carbon content and specific plasma conditions. Thence, depending on the temperature and density of the plasma, the Ca and C atoms could bond in various stoichiometries. Despite apparently bearing little relevance for the studies here performed, the latter may hopefully show relevance when diluted calcium regimes in carbon vapour are examined at as low realistic temperatures as possible. We focus next on other possible calcium carbides.
Besides CaC12, CaC7 is predicted with default convergence at the B3LYP/VDZ level of theory to be a quasi-planar saddle point with the geometry shown in panel (c) of Fig. 12 and a low imaginary vibrational frequency of −35.8 cm−1. However, when tightly converged at the same level, a minimum (−943.85108164Eh) is found with a distorted structure similar to panel (c″) of Fig. 12 and a difficult to notice off-planarity in some torsion angles: ∠1,2,3,4 = −24.2, ∠4,2,3,5 = −10.1, ∠4,2,1,6 = 14.7, ∠2,1,6,7 = −35.6 and ∠2,3,5,8 = 16.2, all in deg. In turn, when tightly B3LYP/AVTZ converged, it yields the perfectly planar minimum shown in panel (c′) of the same plot, thence similar to panel (c) of Fig. 12 at the B3LYP/3-21G level of theory. Also found at this level of theory is a slightly non-planar minimum (−943.93608026Eh) resembling the one in panel (c″) with the torsion angles now reading: −23.6, −10.4,15.6, −37.3 and 17.6 deg, in the same order. With the energy difference between the two minima of only 0.572 kcal mol−1, this reveals the high planarity of the corresponding PES in that region of configuration space. In fact, similar topographical features are observed at the B3LYP/VTZ level, with the deepest minimum [−943.93264515Eh, similar to panel (c″)] predicted to be 0.588 kcal mol−1 lower than the one in panel (c). Moreover, DFT calculations carried out with the M06-2X functional and the VDZ basis set predict a perfectly planar saddle point (−31.9 cm−1 for out-of-plane motion); see the top entry in panel (c″). This has been confirmed at the tightly converged M06-2X/AVTZ level, which yields an imaginary frequency of −20.3 cm−1. Clearly, this points to a slightly non-planar minimum at this level of theory, a search that we have not deemed justified to pursue in the current work.
Another issue is the fact that the minima reported in this section are deep (eventually absolute) minima in the corresponding PESs. Still, this does not invalidate the existence of further minima, sufficing to recall that many stationary points previously determined turned out to be minima at the DFT/3-21G level with both B3LYP and M06-2X functionals. Of course, such basis sets may not yield the best energetics, but this handicap should not reflect in the geometrical parameters evinced by Fig. 6 and Tables 2 and 3. A minor point to emphasize that the apparent single and double bonds drawn in the various plots shown in the present work are based on the default parameter set in WXMACPLOT for the bond length, and hence are mostly pictorial. Indeed, if the IBOVIEW package is employed to draw the highest occupied molecular orbital of CaC13, one obtains Fig. 13 where no sticks are drawn for the interactions of the Ca and C atoms. As expected, the HOMO of CaC13 in Fig. 13 is likely formed from the 4s orbital of the Ca atom interacting with the carbon 2p (or sp2 hybridized) orbitals. The electronic structure exhibits a delocalized bonding interaction in C13, meaning that the HOMO has likely contributions from multiple bonding and anti-bonding interactions across the carbon cluster. In CaC12, the structure should be similar to CaC13 but with one less carbon atoms. This should not influence the bonding except for the fact that the Ca atom lies outside the plane of the carbon, but still benefits of the extensive delocalization of the electrons.
![]() | ||
| Fig. 13 Highest occupied molecular orbital of CaC13 as tightly converged at the B3LYP/VDZ level of theory. | ||
Of course, one may ask why does WXMACPLOT show 13 single bonds. Possibly, if maximum overlap hybrids72,73 could be constructed, they might help to obtain a more realistic view, an issue that will not be here pursued. In its absence, it appears justified to recall that bonding occurs as long as the orbitals of the Ca and Cn overlap each other, probably about equally irrespective of the atom of the cyclic Cn structure. Since this may very much depending on the method and the basis set, one further wonders whether an atomic basis set optimized from the minimization of the energy (variational principle) is necessarily better suited for predicting the correct molecular structure than a poorer (less negative energy) atomic basis set. The answer is “not necessarily” because the atomic orbitals optimized for isolated atoms do not account for changes in the electron density and potential due to molecular interactions. They must therefore require re-optimization or adaptation to accurately describe the interactions in the molecular system. In fact, such densities may turn out to be optimal when large atomic basis sets are used, but this will not preclude issues of basis overcompletness besides being unaffordable costwise: their validity is then largely validated only by reasons of the variational principle, although this applies only to the energy at a selected geometry.
Although tetrads of Ca and C atoms or of C atoms alone tend to be planar10 (see also Fig. 6), it should be recalled that volumetric clusters can also be formed as long as the Ca atom fits within the Cn cluster or bonds externally to it. An example is the CaC20 cluster shown in Fig. 14 which has been optimized at the conventional B3LYP level of theory with the 3-21G and VDZ basis sets. Recall that Lemma 25 predicts a planar stationary point for the parent whenever the tiles embedded on it are planar. Of course, this does not prevent the occurrence of non-planar stationary points: it simply warrants that a planar stationary points exists whenever the Lemma is satisfied. In fact, given the apparent flatness of the PESs of some of the species here studied, one cannot exclude that other minima exist besides the ones here reported. In particular, recall that C20 is itself planar.9 Indeed, the distances between atom numbers 21 and 19 or 21 and 20 in CaC20 (respectively 2.748 Å and 2.742 Å) differ little from the average Ca–C distance in CaC13 at the same level of theory: (2.717 ± 0.040) Å [(2.693 ± 0.002) Å at the VTZ level]. A further remark to note that the planar structure in panel (d) of Fig. 14 is a saddle point of index 3 at the B3LYP/3-21G level with a frequency of about −150 cm−1 for off-planar vibration. Still, it has a lower energy than its fullerenic analog, which is predicted to be a minimum (possibly among others, since many are known to exist for other fullerenic-type species74).
The results here reported with subminimal basis sets (3-21G, MINI and STO-3G) predict the title hexacarbides to grow to large sizes. This cannot be probed from a comparison with experimental data (absent), but raises the question: can such clusters be formed when Ca atoms are added to a plasma of carbon at relatively low temperatures? The answer is likely affirmative, but cannot be definite prior to a detailed study of their stabilities (energetics, thence ab initio calculations with large basis sets!). Although the good aspects evinced by DFT can ameliorate difficulties, experimental work would be most valuable. The reported trends may also be of relevance for studies with other alkaline-earth metal elements of group 2 of the periodic table. There are no reasons of principle though for not being applicable even to other molecules. Finally, one may wonder whether quasi-molecule theory and a subminimal basis set is all that is required to decipher linearity, planarity or otherwise of a molecule. Moreover, can other pseudo-atoms be considered and tiling generalized to quasi-planarity, thence to (not strictly planar) 2D materials? Further studies are required before well-founded answers can be given.
| This journal is © the Owner Societies 2026 |