Published December 9, 2021 | Version v1
Dataset Open

Equivariant representations for molecular Hamiltonians

  • 1. Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
  • 2. National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland

* Contact person

Description

The application of machine learning to the modeling of materials and molecules has proven to be extremely successful in accelerating the understanding, design, and characterization of materials. A major factor in this success has been the development of representations of atomic structures that reflect physics-based symmetries of the underlying interactions. Most of the descriptions of atomic properties or even global observables rely on decompositions into atomic contributions that are subsequently learnt in an atom-centered framework. However, many quantities associated with quantum mechanical calculations, such as the single-particle Hamiltonian matrices written in an atomic-orbital basis, are associated with multiple atom-centers. Following the introduction of equivariant N-center structural descriptors, in the reference below, that generalize the very successful atom-centered density correlation features to the problem of learning properties indexed by N atoms, we present benchmarks showing how the construction can be applied to efficiently learn the matrix elements of the (effective) single-particle Hamiltonian in an atom-centered orbital basis. In this record, we include the dataset comprising the Fock and overlap matrices in the def2-SVP of 1000 distorted water molecules, up to 4500 ethanol structures, and a subset of QM7-CHNO molecules. We also provide scripts to generate the two-center representations and fit linear and sparse kernel models for the Hamiltonians.

Files

File preview

files_description.md

All files

Files (7.4 GiB)

Name Size
md5:2a473be05ff36adabefd0a9d5ef18d6e
368 Bytes Preview Download
md5:f3029a4c81963b173b9de4b872686e39
7.4 GiB Preview Download
md5:f0504fdd55d22b894af8474560270245
889 Bytes Preview Download

References

Journal reference (Manuscript where the data and methods are described)
J. Nigam, M.J. Willatt, M. Ceriotti, J. Chem. Phys. 156, 014115 (2022), doi: 10.1063/5.0072784

Journal reference (Manuscript where the data and methods are described)
J. Nigam, M.J. Willatt, M. Ceriotti, J. Chem. Phys. 156, 014115 (2022)