Publication date: Nov 15, 2023
In the past decade, quantum diffusion Monte Carlo (DMC) has been demonstrated to successfully predict the energetics and properties of a wide range of molecules and solids by numerically solving the electronic many-body Schrödinger equation. We show that when coupled with quantum machine learning (QML) based surrogate methods the computational burden can be alleviated such that QMC shows clear potential to undergird the formation of high quality descriptions across chemical space. We discuss three crucial approximations necessary to accomplish this: The fixed node approximation, universal and accurate references for chemical bond dissociation energies, and scalable minimal amons set based QML (AQML) models. Numerical evidence presented includes converged DMC results for over one thousand small organic molecules with up to 5 heavy atoms used as amons, and 50 medium sized organic molecules with 9 heavy atoms to validate the AQML predictions. Numerical evidence collected for 𝛥-AQML models suggests that already modestly sized QMC training data sets of amons suffice to predict total energies with near chemical accuracy throughout chemical space. In this archive, we present DMC energies for over one thousand small organic molecules with up to 5 heavy atoms, and 50 medium sized organic molecules with 9 heavy atoms, as well as energies computed at cheaper levels of theory such as HF, DFT, MP2 and CCSD(T).
No Explore or Discover sections associated with this archive record.
|845.9 KiB||This archive includes the geometries and energies for atom-in-molecule based fragments (aka. amons) of the 50 test molecules (see targets.zip). Geometries were calculated at the level of B3LYP, while energies (in Hartree) were calculated at various levels of theory, including HF, DFT and post-HF (MP2 & CCSD(T)) levels of theory, all with cc-pVTZ basis.|
|23.6 KiB||This archive includes the B3LYP/cc-pVTZ geometries and energies (in Hartree, computed at QMC and various cheaper levels of theory shown above) of 50 test molecules randomly drawn from the QM9 dataset.|
|878.9 KiB||This archive includes the geometries and energies for amons of the QM9 dataset, made up of at most 5 heavy atoms. Geometries were calculated at the B3LYP/cc-pVTZ level, while energies (in kcal/mol) were calculated at the QMC level, as well as the other cheaper levels of theory.|