‘Diet GMTKN55’ offers accelerated benchmarking through a representative subset approach†
Abstract
The GMTKN55 benchmarking protocol introduced by [Goerigk et al., Phys. Chem. Chem. Phys., 2017, 19, 32184] allows comprehensive analysis and ranking of density functional approximations with diverse chemical behaviours. But this comprehensiveness comes at a cost: GMTKN55's 1500 benchmarking values require energies for around 2500 systems to be calculated, making it a costly exercise. This manuscript introduces three subsets of GMTKN55, consisting of 30, 100 and 150 systems, as ‘diet’ substitutes for the full database. The subsets are chosen via a stochastic genetic approach, and consequently can reproduce key results of the full GMTKN55 database, including ranking of approximations. Some results are also included for the recent MGCDB84 database.