There is a newer version of the record available.

Published December 19, 2022 | Version v1
Dataset Open

Fixed node diffusion Monte Carlo energies for over one thousand small organic molecules

  • 1. Faculty of Physics, University of Vienna, 1090 Wien, Austria
  • 2. Departments of Chemistry, Materials Science and Engineering, and Physics, University of Toronto, St. George Campus, Toronto, ON, Canada
  • 3. Machine Learning Group, Technische Universit\"at Berlin and Institute for the Foundations of Learning and Data, 10587 Berlin, Germany
  • 4. Materials Science and Technology Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, United States
  • 5. Computational Science Division, Argonne National Laboratory, Argonne, IL 60439, United States

* Contact person

Description

In the past decade, quantum diffusion Monte Carlo (DMC) has been demonstrated to successfully predict the energetics and properties of a wide range of molecules and solids by numerically solving the electronic many-body Schrödinger equation. We show that when coupled with quantum machine learning (QML) based surrogate methods the computational burden can be alleviated such that QMC shows clear potential to undergird the formation of high quality descriptions across chemical space. We discuss three crucial approximations necessary to accomplish this: The fixed node approximation, universal and accurate references for chemical bond dissociation energies, and scalable minimal amons set based QML (AQML) models. Numerical evidence presented includes converged DMC results for over one thousand small organic molecules with up to 5 heavy atoms used as amons, and 50 medium sized organic molecules with 9 heavy atoms to validate the AQML predictions. Numerical evidence collected for 𝛥-AQML models suggests that already modestly sized QMC training data sets of amons suffice to predict total energies with near chemical accuracy throughout chemical space. In this archive, we present DMC energies for over one thousand small organic molecules with up to 5 heavy atoms, and 50 medium sized organic molecules with 9 heavy atoms, as well as energies computed at cheaper levels of theory such as HF, DFT, MP2 and CCSD(T).

Files

File preview

files_description.md

All files

Files (1.7 MiB)

Name Size
md5:c73e624f2fcc874ae7efd2ed3842d55e
968 Bytes Preview Download
md5:250058b60c5f8f300d7b93cac0c3adee
860.3 KiB Preview Download
md5:d4f8b7cd9f71bf26158e5baa513767b4
845.9 KiB Preview Download
md5:605d8e3cf6edd4127146d9d7abf532e7
23.6 KiB Preview Download

References

Preprint
Huang, Bing, O. Anatole von Lilienfeld, Jaron T. Krogel, and Anouar Benali. "Towards DMC accuracy across chemical space with scalable $\Delta $-QML." arXiv preprint arXiv:2210.06430 (2022)., doi: 10.48550/arXiv.2210.06430