Open Access Article
Zihui Song
and
Ka Un Lao
*
Department of Chemistry, Virginia Commonwealth University, Richmond, VA, USA. E-mail: laoku@vcu.edu
First published on 16th June 2026
Geometry optimization in electronic structure theory requires repeated self-consistent field (SCF) calculations at every optimization step, making SCF convergence one of the primary computational bottlenecks in quantum chemistry simulations. Consequently, generating superior SCF initial guesses that reduce the number of required SCF iterations provides an effective strategy for significantly accelerating geometry optimization calculations. In this work, we develop a Grassmann extrapolation framework combined with direct inversion in the iterative subspace (G-Ext-DIIS) to generate improved density matrix initial guesses throughout geometry optimization. The proposed approach exploits the geometric structure of density matrices on the nonlinear Grassmann manifold, enabling physically consistent extrapolation while rigorously preserving the fundamental physical constraints of the density matrix. The method was systematically evaluated through calibration studies and benchmark calculations on diverse covalently bonded organic molecules and flexible noncovalent molecular clusters. Using either the overlap matrix or the core Hamiltonian matrix as the molecular descriptor, G-Ext-DIIS consistently reduces the total number of SCF iterations during geometry optimization. The performance of G-Ext-DIIS was found to be largely insensitive to the choice of density functional and basis set. For flexible cluster systems, reductions of approximately 20–30% in SCF iterations are routinely achieved, including a reduction of nearly 2000 SCF cycles for a hydroquinone clathrate cluster containing 129 atoms. Importantly, the computational overhead associated with the Grassmann extrapolation procedure is negligible compared with that of a single SCF iteration. These results demonstrate that G-Ext-DIIS provides an efficient, transferable, black-box, and computationally inexpensive strategy for accelerating geometry optimization calculations in large-scale quantum chemical simulations, making it a promising default SCF initialization scheme for future quantum chemistry software implementations.
At the heart of this process is the minimization of the total electronic energy on the potential energy surface (PES) with respect to nuclear coordinates. Due to its favorable balance between accuracy and computational cost, density functional theory (DFT) has become the standard method for such evaluations. However, this optimization is inherently iterative; each step requires a full electronic structure calculation to achieve self-consistent field (SCF) convergence, followed by the evaluation of energy gradients—and occasionally the Hessian—to guide the structural update. While robust quasi-Newton algorithms like Broyden–Fletcher–Goldfarb–Shanno (BFGS)3 are ubiquitous in modern quantum chemistry packages, the primary computational bottleneck remains the sheer number of required DFT evaluations. Since each evaluation is computationally demanding, particularly for large-scale or condensed-phase systems, the total wall-clock time is directly governed by the number of iterative cycles. Therefore, developing advanced techniques to minimize these expensive DFT evaluations is essential for accelerating the discovery of stationary points on the PES.
To accelerate SCF convergence, Pulay's direct inversion in the iterative subspace (DIIS) method4,5 remains the most prevalent technique and serves as the default in many modern quantum chemistry packages. Beyond acceleration algorithms, the quality of the initial guess is equally critical because it determines whether the SCF procedure converges to the desired electronic state and dictates the overall time-to-solution by influencing the number of iterations. Various strategies have been developed to generate high-quality initial guesses, ranging from standard approaches, such as the superposition of atomic densities (SAD)6 and basis set projection (BSP), to sophisticated methods, including molecular fragmentation7–10 and, more recently, machine learning-based models.11–13
In the context of PES scans, geometry optimizations, and ab initio molecular dynamics (AIMD) simulations, efficiency is further enhanced by utilizing the converged density matrix or molecular orbital coefficients from the preceding configuration as the initial guess. More advanced schemes extrapolate the density matrix from multiple prior steps to provide a superior estimate for the new geometry,14 effectively bridging the gap between successive electronic structure calculations. However, direct density matrix extrapolation is challenging. Since the density matrix in an orthonormal basis must satisfy the physical constraints of Hermiticity, particle number conservation, and idempotency, it is constrained to a nonlinear Grassmann manifold rather than a linear vector space. Consequently, as a linear combination of valid density matrices generally fails to preserve these physical constraints,15 direct extrapolation is both physically and mathematically ill-defined.
To overcome these limitations, an indirect approach has been proposed for generating accurate density matrices by mapping them onto the tangent space of a Grassmann manifold.15–23 Because this tangent space functions as a vector space, it is closed under both addition and scalar multiplication, which allows linear operations to be performed straightforwardly. The results are subsequently mapped back to the Grassmann manifold, ensuring the resulting density matrices remain valid and adhere to all physical constraints. These Grassmann interpolation and extrapolation schemes not only provide high quality initial guesses for SCF iterations but also facilitate the direct evaluation of expectation values for one-particle operators, such as atomic charges.21
Recently, our group introduced an approach combining a Grassmann extrapolation method with DIIS (G-Ext-DIIS) to accurately and efficiently extrapolate density matrices in electronic structure calculations.23 Unlike Tikhonov-regularized G-Ext,16,18–20 G-Ext-DIIS requires no tuning of regularization parameters. Moreover, its DIIS subspace is compact, numerically stable, and independent of descriptor dimensionality, system size, or basis set, ensuring both robustness and computational efficiency. This technique has been applied to electronic structure calculations during PES scans, where it significantly reduces the number of SCF iterations required for convergence or enables accurate energy predictions without the need for further SCF cycles, achieving sub-millihartree accuracy across extrapolation ranges that exceed typical geometry optimization steps.23 Finally, G-Ext-DIIS demonstrates superior accuracy, variational consistency, and reliability across various basis sets compared to direct extrapolation methods and Löwdin extrapolation from nearby geometries.23
Given that computational chemistry frequently involves repeated quantum mechanical calculations on the same system, such as in geometry optimizations and AIMD simulations, we extend this robust, efficient, and transferable G-Ext-DIIS approach to geometry optimization in the present work. Specifically, accurate density matrices generated by G-Ext-DIIS are employed to initialize the SCF iterations at each optimization step. This initialization strategy significantly accelerates convergence, thereby decreasing the total number of costly SCF cycles and reducing the overall wall clock time required for geometry optimization.
![]() | (1) |
At each step of the geometry optimization procedure, an electronic structure calculation is performed to determine the total energy and its analytical derivatives. For mean-field methods such as DFT, this requires solving the nonlinear SCF equations, typically initialized using an initial guess of either the molecular orbital (MO) coefficients or the density matrix (P). In an orthonormal basis, the density matrix
is derived from the occupied MO coefficient Cocc as follows:
= S1/2Cocc(Cocc)TS1/2,
| (2) |
satisfies the aforementioned symmetry, idempotency, and trace properties, which constrains it to lie on the Grassmann manifold
. However, the Grassmann manifold does not satisfy the closure property under linear addition; consequently, a linear combination of such density matrices generally does not result in another element of the Grassmann manifold.15
Rather than performing operations directly on the Grassmann manifold, a more effective approach involves projecting known density matrices, {
k}, onto the tangent space
at a reference point r0. By mapping these matrices to tangent vectors {Γk} via the Grassmann logarithm,24 we transform complex geometric problems into simple vector space operations. Interpolation or extrapolation can then be executed to estimate the target vector Γtarget as a linear combination of the sampled vectors:
![]() | (3) |
Crucially, the accuracy of the resulting Γtarget is largely invariant to the choice of reference point.15 The coefficients {ck} can be determined using a variety of schemes, including Lagrange interpolation,15,16,21,22 Tikhonov-regularized least-squares,17–20 and our recently developed DIIS-based minimization framework.23 The latter determines the extrapolation coefficients within a compact residual subspace whose dimensionality depends only on the number of sampling points n, leading to a numerically stable and computationally efficient formulation. Specifically, the DIIS minimization leads to a compact system of n + 1 linear equations,
![]() | (4) |
. The elements of the symmetric error matrix B are defined as| Bij = Bji = Tr((di − dtarget)T (dj − dtarget)), | (5) |
It is worth emphasizing that both the Tikhonov-regularized G-Ext formulation and the present DIIS-based minimization framework rely on the same central approximation: the extrapolation coefficients determined from the molecular descriptors are assumed to be transferable to the tangent-space representation. Consequently, the choice of molecular descriptor plays a critical role in determining the quality of the extrapolation coefficients, the accuracy of the resulting tangent vector, and ultimately the performance of the G-Ext procedure, as demonstrated in our previous study.23
Subsequently, the resulting Γtarget is mapped back to the Grassmann manifold via the Grassmann exponential,24 recovering the orthonormal density matrix
target. This geometric transformation ensures that all intrinsic physical constraints and geometric properties are rigorously preserved, as detailed in our previous G-Ext-DIIS study.23 Finally, the orthonormal density matrix is transformed back to the nonorthogonal basis, yielding Ptarget. This matrix can then be utilized either as a robust initial guess for subsequent SCF calculations or as a direct basis for predicting molecular properties.
Conventional software packages typically initialize each step of a molecular geometry optimization by reusing the converged electronic structure, such as MO coefficients, from the preceding geometry. To improve upon this practice, the recently developed Tikhonov-regularized G-Ext method predicts the current density matrix based on data from previous steps,20 providing a more robust initial guess for the SCF process. However, solving the resulting least squares problem requires a regularization parameter that depends on the SCF convergence threshold,18 as well as the specific molecular systems and descriptors used.20 Recently, we introduced a stable and computationally efficient DIIS-based technique to determine the interpolation/extrapolation coefficients ck, eliminating the need for manual parameter tuning. In this study, G-Ext-DIIS is applied for the first time to generate SCF initial guesses throughout optimization cycles. By leveraging the geometry optimization history at marginal computational cost, this approach significantly reduces the number of SCF cycles per step and minimizes overall wall time.
The sequence of density matrices {Pk} corresponding to the nuclear coordinates {Rk} is collected throughout the geometry optimization process. Subsequently, the G-Ext-DIIS technique is applied to generate an extrapolated density matrix Pk+1 based on this historical data, which serves as the SCF initial guess for the next geometry Rk+1. To determine the extrapolation coefficients ck in eqn (3), the overlap matrix (S), core Hamiltonian matrix (H), and Coulomb matrix (CM) are utilized as molecular descriptors. Notably, the S and H descriptors share the same symmetry properties as the density matrix, being invariant to translations and equivariant under rotations. In contrast, the CM descriptor is invariant to both translations and rotations. As demonstrated in our earlier study, descriptors that align with the symmetry of the density matrix produce more accurate coefficients, resulting in enhanced initial guesses.23 In this study, all three descriptors are evaluated within the G-Ext-DIIS framework during geometry optimization. The integrated procedure for accelerating geometry optimization via G-Ext-DIIS is depicted in Fig. 1. We systematically investigate the effects of the length of the initial standard optimization phase (i) required to accumulate sufficient data for initiating G-Ext-DIIS, and the number of historical density matrices (k) utilized in the extrapolation procedure on the overall accuracy, robustness, and stability of the G-Ext-DIIS scheme. Furthermore, we compare the performance of G-Ext-DIIS with the conventional strategy of using the converged MO coefficients from the previous optimization step as the initial SCF guess for the current geometry.
All methods were implemented in a locally modified version of Q-Chem.25 In all calculations, the DIIS SCF algorithm was employed with an SCF convergence threshold of τSCF = 10−8 a.u. and an integral screening threshold of τints = 10−14 a.u. Geometry optimizations were carried out using the default convergence criteria in Q-Chem, including a gradient threshold of 300 × 10−6 a.u., a displacement threshold of 1200 × 10−6 a.u., and an energy threshold of 100 × 10−8 a.u.
The geometry immediately preceding the target geometry is selected as the reference point for extrapolation in G-Ext-DIIS during geometry optimization. It was confirmed that the numbers of geometry optimization cycles are identical between the standard and G-Ext-DIIS-accelerated geometry optimization calculations for each system. All systems considered in this work are neutral closed-shell singlet systems.
![]() | ||
| Fig. 2 Initial molecular structures of (a) phenol(H2O)10 and (b) (H2O)20 used to investigate the optimal G-Ext-DIIS settings for geometry optimization. | ||
To determine the optimal parameters for each type of descriptor, we evaluate two strategies using these molecular systems. In the first strategy, G-Ext-DIIS is applied after i initial conventional geometry optimization steps, using a fixed number k of previous sampling points to construct the density matrix at the current geometry; if fewer than k sampling points are available, all previous points are used. This approach is denoted as (i, k). In the second strategy, all available sampling points are employed in the G-Ext-DIIS extrapolation, denoted as (i, all). The resulting reduction in SCF iterations at the B3LYP/def2-SVP level is compared to that obtained with the standard geometry optimization protocol, in which the MO coefficients from the preceding step are used as the initial guess for the current SCF calculation. The results, obtained using the molecular descriptor S, are summarized in Table 1.
| System | Standard | i | k | G-Ext-DIIS | SCF |
|---|---|---|---|---|---|
| OPT | OPT | PR (%) | |||
| Strategy-I | |||||
| Phenol(H2O)10 | 2199 | 25 | 25 | 1750 | −20.4 |
| 20 | 20 | 1754 | −20.2 | ||
| 20 | 15 | 1781 | −19.8 | ||
| 20 | 10 | 1829 | −16.8 | ||
| 20 | 8 | 1859 | −16.0 | ||
| 20 | 5 | 1924 | −12.6 | ||
| 20 | 2 | 2063 | −6.2 | ||
| 10 | 10 | 1824 | −17.1 | ||
| 10 | 8 | 1847 | −16.7 | ||
| 10 | 5 | 1911 | −13.1 | ||
| 10 | 2 | 2050 | −6.8 | ||
| (H2O)20 | 2075 | 25 | 25 | 1606 | −22.7 |
| 20 | 20 | 1603 | −22.7 | ||
| 20 | 15 | 1618 | −22.1 | ||
| 20 | 10 | 1658 | −20.1 | ||
| 20 | 8 | 1701 | −19.2 | ||
| 20 | 5 | 1769 | −14.9 | ||
| 20 | 2 | 1954 | −5.9 | ||
| 10 | 10 | 1666 | −19.7 | ||
| 10 | 8 | 1656 | −20.2 | ||
| 10 | 5 | 1761 | −15.1 | ||
| 10 | 2 | 1950 | −6.0 | ||
| Strategy-II | |||||
| Phenol(H2O)10 | 2199 | 25 | all | 1628 | −26.0 |
| 20 | all | 1617 | −26.5 | ||
| 10 | all | 1603 | −26.9 | ||
| 8 | all | 1605 | −27.0 | ||
| 5 | all | 1606 | −27.1 | ||
| 2 | all | 1605 | −27.2 | ||
| (H2O)20 | 2075 | 25 | all | 1478 | −28.8 |
| 20 | all | 1464 | −29.5 | ||
| 10 | all | 1448 | −30.2 | ||
| 8 | all | 1447 | −30.3 | ||
| 5 | all | 1443 | −30.5 | ||
| 2 | all | 1437 | −30.8 | ||
In the first strategy, multiple sets of sampling parameters were examined. The number of initial optimized points was chosen as i = 10, 20, and 25, while the number of historical points was varied as k = 2, 5, 8, 10, 15, 20, and 25. Different molecular descriptors (S, H, and CM) were then applied to evaluate their impact on the total number of SCF iterations, which serves as the primary metric for assessing the performance of G-Ext-DIIS in accelerating geometry optimization (OPT). Table 1 compares the total SCF iterations for standard OPT, where previous MO coefficients are used as the initial guess, with those from G-Ext-DIIS-accelerated OPT employing the overlap matrix S as the descriptor. During the initial sampling stage, conventional OPT is carried out using the previous MO coefficients as the initial guess. Once G-Ext-DIIS is activated, the density matrix generated by G-Ext-DIIS is employed as the initial guess for subsequent SCF cycles. The total number of SCF cycles reported for G-Ext-DIIS in Table 1 includes those incurred during the initial sampling stage, thereby representing the full SCF cost required to complete the geometry optimization. Overall, G-Ext-DIIS consistently reduces the total number of SCF iterations across all tested parameter combinations, achieving reductions of 6–20% for phenol(H2O)10 and 6–23% for (H2O)20. These results indicate that greater savings in SCF cycles can be obtained when a larger number of historical points is included in constructing the initial guess density.
Using the core Hamiltonian H as the descriptor yields comparable performance, with reductions of 6–20% for phenol(H2O)10 and 6–23% for (H2O)20, as reported in Table S1 of the supplementary information (SI). This is consistent with our previous observation that S and H perform similarly when combined with G-Ext-DIIS for generating density matrices.23 In contrast, the Coulomb matrix descriptor CM provides only marginal improvement (approximately 1%) and, in some cases, even increases the number of SCF iterations by about 1%.
In the second strategy, the number of initial optimized points is varied as i = 2, 5, 8, 10, 20, and 25, while all available historical points are used to construct the extrapolated density matrix for the current geometry. Incorporating the full set of historical points significantly enhances the performance of G-Ext-DIIS in geometry optimization, leading to reductions of 26–27% for phenol(H2O)10 and 29–31% for (H2O)20. Notably, the best performance is achieved when the smallest number of initial sampling points (i = 2) is combined with the use of all historical points. This result indicates that the benefits of G-Ext-DIIS can be realized at a very early stage of the optimization, with improvements already evident from the third optimization step under the (2, all) setting.
A similar level of reduction is observed when using the core Hamiltonian H as the descriptor, yielding decreases of approximately 26–27% for phenol(H2O)10 and 29–30% for (H2O)20, as reported in Table S1. In contrast, the Coulomb matrix descriptor CM again shows no significant improvement in SCF convergence, with the maximum reduction limited to about 1%, and in some cases even leading to an increase of approximately 1% in the number of SCF cycles, as presented in Table S1. Thus, the largest reduction in SCF iterations is achieved when the first two optimization steps are performed conventionally, followed by the application of G-Ext-DIIS from the third step onward using all available historical points, with either S or H as the descriptor.
To further assess the robustness of this approach, we examined a different functional and a larger diffuse basis set. Specifically, G-Ext-DIIS was tested using various values of i combined with all available historical points (k = all), as summarized in Table S2 at the levels of PBE/def2-SVP and B3LYP/def2-TZVPPD. A similar trend in reducing the total number of SCF iterations is consistently observed. For PBE/def2-SVP, G-Ext-DIIS with S yields reductions of 24–25% for phenol(H2O)10 and 26–28% for (H2O)20, while using H results in comparable reductions of 23–24% and 26–27%, respectively. At the B3LYP/def2-TZVPPD level, G-Ext-DIIS with S achieves reductions of 23–24% for phenol(H2O)10 and 24–28% for (H2O)20, whereas H yields reductions of 23–25% and 20–28%, respectively. In contrast, the CM descriptor consistently increases the total number of SCF cycles across all tested cases at both levels of theory. Notably, for both PBE/def2-SVP and B3LYP/def2-TZVPPD, the largest reduction in SCF iterations is again obtained using the (2, all) setting, with improvements emerging from the third optimization step onward.
Overall, these results demonstrate that the effectiveness of G-Ext-DIIS in accelerating OPT is largely insensitive to the choice of functional and basis set. Moreover, initiating the procedure early and incorporating all available historical density matrices generally leads to more accurate and stable predictions, thereby reducing the total number of SCF iterations required for convergence.
![]() | ||
| Fig. 3 Initial molecular cluster structures used to assess the performance of G-Ext-DIIS in geometry optimization. | ||
The performance of G-Ext-DIIS-accelerated geometry optimization using S as the descriptor with the (2, all) setting at the B3LYP/def2-SVP level, evaluated in terms of the reduction and percentage reduction in total SCF iterations relative to the standard optimization scheme employing the MO coefficients from the preceding step as the initial guess, is shown in Fig. 4a for organic molecules and Fig. 4b for molecular clusters, including phenol(H2O)10 and (H2O)20. The corresponding results obtained using H as the descriptor are presented in Fig. S2. Complete details of the SCF iteration counts and the percentage reduction in SCF cycles achieved by G-Ext-DIIS for all systems using the (2, all) setting are provided in Table S3.
For the eight covalently bonded organic molecules shown in Fig. 4a, G-Ext-DIIS using S as the descriptor consistently lowers the total number of SCF iterations for all systems, with reductions ranging from 3 to 32 cycles, corresponding to 2–12%, and an average reduction of 6.6%. Similar performance is obtained when H is employed as the descriptor, yielding an average SCF reduction of 6.7%, as shown in Fig. S2. In contrast, the use of CM leads to increased SCF iterations for every system examined, with increases ranging from 2 to 13% and an average increase of 5.9%, as reported in Table S3, further demonstrating that CM is not an appropriate descriptor for G-Ext-DIIS. The comparatively modest reductions obtained for these organic molecules, relative to the approximately 30% reduction observed for phenol(H2O)10 and (H2O)20, are likely attributable to two factors. First, these molecular geometries had already been optimized, albeit at a different level of theory. Second, the covalently bonded organic molecules considered here are generally less flexible and exhibit fewer degrees of freedom for structural rearrangement than the molecular clusters investigated in this work.
For the 13 noncovalently bonded clusters shown in Fig. 4b, the performance improvement becomes substantially more pronounced. Using S as the descriptor, G-Ext-DIIS reduces the total number of SCF iterations for all systems by 31 to 1931 cycles, corresponding to reductions of 9–29% and an average reduction of 21.5%. The largest absolute reduction is observed for (hydroquinone)9CO2, where the total number of SCF iterations decreases from 7144 to 5213, corresponding to a reduction of 1931 cycles (27.3%) during geometry optimization. The largest percentage reduction is obtained for alanine(H2O)10, where the SCF iterations decrease from 2596 to 1835, yielding a reduction of 761 cycles (29.3%). Overall, the SCF savings achieved for molecular clusters are approximately three times larger than those observed for the covalently bonded organic molecules, primarily because these cluster systems require substantially more geometry optimization steps to reach the minimum-energy structure. Using H as the descriptor again provides comparable performance, with an average SCF reduction of 21.8%, as shown in Fig. S2. The largest absolute reduction is again found for (hydroquinone)9CO2 (1936 cycles), whereas the largest percentage reduction is observed for phenol(H2O)10 (30.0%). In contrast, G-Ext-DIIS using CM performs significantly worse: it reduces SCF iterations for only four systems, with a maximum reduction of 6% for (H2O)6, while increasing the number of SCF iterations for the remaining systems by as much as 10% for ethanol(H2O)10. Interestingly, although G-Ext-DIIS using S or H decreases the SCF iterations for (hydroquinone)9CO2 by approximately 2000 cycles, the use of CM instead increases the total SCF iterations by about 200 cycles.
For completeness, Table S4 also compares the performance of G-Ext-DIIS against geometry optimization calculations that employ the superposition of atomic densities (SAD) initial guess at every optimization step. Relative to this SAD-based reference, the conventional strategy of reusing the converged MOs from the preceding optimization step reduces the total number of SCF iterations by approximately 17–28% for covalently bonded organic molecules and 11–35% for noncovalently bonded clusters. This reduction reflects the well-established benefit of orbital recycling during geometry optimization. The G-Ext-DIIS approach provides a further reduction beyond this conventional baseline. Relative to the SAD reference, the total number of SCF iterations is reduced by approximately 22–35% and 23–35% when using the S and H descriptors, respectively, for covalently bonded organic molecules, and by approximately 30–52% for noncovalently bonded clusters using either descriptor. These results demonstrate that the performance gains achieved by G-Ext-DIIS arise not merely from reusing information from previous optimization steps, but also from the descriptor-driven extrapolation of tangent vectors within the Grassmann manifold framework, which yields improved initial density matrix guesses for subsequent SCF calculations.
Overall, these results demonstrate that G-Ext-DIIS, using either S or H as the descriptor, consistently accelerates geometry optimization by reducing the total number of SCF iterations by approximately 7% on average for covalently bonded organic molecules and 22% on average for flexible noncovalent molecular clusters, relative to the default approach of reusing the converged MOs from the preceding optimization step. The results further establish that S and H serve as robust and transferable descriptors for G-Ext-DIIS, whereas CM is less suitable for practical geometry optimization applications. This trend is further illustrated in Fig. 5, which presents the correlation map of total SCF iterations obtained from standard geometry optimization and G-Ext-DIIS-accelerated geometry optimization using the S, H, and CM descriptors for all 21 systems considered in this study. In particular, G-Ext-DIIS employing either S or H provides the largest reductions for systems requiring a greater number of geometry optimization steps to converge.
This behavior is consistent with our previous findings, which identified S and H as more effective descriptors for G-Ext-based extrapolation than CM.23 The Coulomb matrix is constructed directly from the molecular geometry and therefore provides only an indirect representation of the underlying electronic structure. In contrast, the overlap matrix and core Hamiltonian arise directly from the electronic structure problem and consequently track changes in the self-consistent density matrix more closely throughout the optimization process. As a result, S and H provide descriptor spaces that are more strongly correlated with density matrix evolution, leading to more reliable extrapolation coefficients. Moreover, the overlap matrix, core Hamiltonian, and density matrix share the same symmetry properties, being invariant under translations and equivariant under rotations, whereas the Coulomb matrix does not exhibit the same transformation behavior. Since G-Ext-DIIS relies on the transferability of extrapolation coefficients determined from molecular descriptors to the tangent-space representation, descriptors whose symmetry properties and evolution more closely resemble those of the density matrix are expected to yield more transferable extrapolation coefficients. These factors collectively explain the superior performance and robustness of the S and H descriptors relative to CM.
It should be noted that the density matrix generated by G-Ext-DIIS is intended to provide a high-quality initial guess for the SCF procedure rather than the exact self-consistent solution at the target geometry. Although the extrapolated density matrix is generally very close to the converged solution, with typical DIIS errors of 10−5–10−6 a.u. for covalently bonded organic molecules and 10−6–10−7 a.u. for noncovalently bonded clusters, a small number of additional SCF iterations is still required to achieve full convergence. This is partly due to the stringent SCF convergence threshold of 10−8 a.u. employed in the present work. Furthermore, the nonlinear nature of the DIIS-SCF procedure can give rise to residual oscillatory behavior, even when the initial density matrix is already very close to the converged solution. Consequently, several SCF iterations remain necessary to obtain the final self-consistent density matrix.
Finally, the computational cost associated with generating density matrices through G-Ext-DIIS is negligible compared with that of a single SCF iteration. For example, the extrapolation procedure requires only approximately 10−4 s for covalently bonded organic molecules and 10−2 s for noncovalent molecular clusters. In addition, the associated storage requirements remain modest, even for the largest system considered in this study (1320 basis functions) when all available historical density matrices are retained. The present study focuses primarily on reductions in SCF iteration counts, which provide a hardware-independent measure of the effectiveness of the proposed extrapolation procedure and facilitate direct comparisons across different molecular systems and computational environments. While direct wall-clock benchmarks would offer additional practical insight, particularly for larger systems and parallel computing environments where implementation-specific factors such as memory management, communication overhead, and I/O operations may become increasingly important, the negligible computational cost of the extrapolation step suggests that the overall efficiency gains are dominated by the reduction in SCF iterations. A systematic investigation of wall-clock performance and scaling behavior is beyond the scope of the present work and will be pursued in future studies.
Taken together, these results demonstrate that G-Ext-DIIS provides an efficient ab initio framework for substantially reducing the total number of SCF iterations during geometry optimization while introducing only minimal additional computational cost. Given the negligible overhead associated with the extrapolation procedure, the resulting computational savings are expected to be primarily determined by the reduction in SCF iterations. Consequently, G-Ext-DIIS represents a promising general strategy for generating initial density matrix guesses in geometry optimization calculations, and implementations for additional quantum chemistry software packages are currently under development in our group.
Comprehensive benchmark calculations demonstrate that G-Ext-DIIS can substantially reduce the computational effort associated with SCF convergence during geometry optimization across a diverse range of molecular systems. The optimal protocol was found to employ the first two optimization steps for data accumulation and to initiate G-Ext-DIIS extrapolation from the third step onward using all previously available density matrices. Using either the overlap matrix S or the core Hamiltonian matrix H as the molecular descriptor, the method consistently accelerates geometry optimization for both covalently bonded organic molecules and flexible noncovalent molecular clusters. The largest performance gains are observed for structurally flexible molecular clusters requiring many optimization steps to converge, where G-Ext-DIIS routinely reduces the total number of SCF iterations by approximately 20–30% relative to the conventional strategy of reusing the converged MOs from the preceding optimization step. For the largest system considered, (hydroquinone)9CO2, G-Ext-DIIS reduces the total SCF iteration count by nearly 2000 cycles, corresponding to a reduction of approximately 27%. The advantages of G-Ext-DIIS are even more evident when compared with the SAD-based reference, yielding reductions in total SCF iterations of up to 52% across the tested systems. In contrast, the Coulomb matrix descriptor CM was found to be significantly less effective, highlighting the importance of descriptors that accurately capture the evolution of the electronic structure while sharing symmetry properties with the density matrix in Grassmann-based extrapolation schemes. Finally, the performance of G-Ext-DIIS is largely insensitive to the choice of density functional and basis set, demonstrating the robustness, transferability, and broad applicability of the proposed framework.
Importantly, the computational overhead associated with G-Ext-DIIS is negligible compared with that of a single SCF iteration, making the method particularly attractive for large-scale quantum chemical calculations. Because the approach relies exclusively on information already generated during the geometry optimization procedure, it requires no additional electronic structure calculations while providing a simple, transferable, fully ab initio, and computationally efficient strategy for improving SCF convergence. Consequently, G-Ext-DIIS offers a practical and broadly applicable framework for reducing the overall wall time of geometry optimization calculations with essentially no additional computational cost.
Overall, these results establish G-Ext-DIIS as a practical and robust framework for accelerating geometry optimization in electronic structure theory. Owing to its black-box nature, broad transferability, and negligible computational overhead, this method is particularly well suited for large molecular clusters, biomolecular systems, and condensed-phase simulations, where repeated SCF calculations constitute a significant fraction of the overall computational cost. As a fully ab initio and computationally inexpensive approach, G-Ext-DIIS represents a promising default SCF initialization scheme for future quantum chemistry software implementations and large-scale electronic structure simulations.
While the present study focuses on neutral closed-shell molecular systems to establish and validate the G-Ext-DIIS framework, the methodology itself is not inherently restricted to this class of problems. Future work will extend the benchmarking and application scope to electronically more challenging systems, including charged species, zwitterions, transition-state structures, systems with diffuse electronic densities, and open-shell molecules. Such systems often exhibit more complex electronic structure variations and more challenging SCF convergence behavior, providing a rigorous test of the robustness and transferability of the proposed approach. Beyond geometry optimization, future developments will focus on extending G-Ext-DIIS to AIMD simulations, periodic electronic structure calculations, and implementations in additional quantum chemistry software packages, thereby broadening the applicability and impact of the methodology.
| This journal is © the Owner Societies 2026 |