Published September 27, 2021 | Version v1
Dataset Open

eQM7: a dataset for small molecules with Foster-Boys centers

  • 1. Ghent University, Center for Molecular Modeling, Technologiepark-Zwijnaarde 46, Gent, B-9052, Belgium
  • 2. Ghent University - imec, IDLab, Electronics and Information Systems Department, Technologiepark-Zwijnaarde 126, Gent, B-9052, Belgium

* Contact person

Description

The electron QM7 (eQM7) dataset is created with the purpose of training and validating polarizable (machine learning) force fields on non-equilibrium configurations of small molecules. It contains 6868 molecules with hydrogen, carbon, nitrogen and oxygen. For each molecule, 500 perturbations are constructed using normal mode sampling, torsion sampling, dimer sampling and homogeneous electric fields. Energies, forces and Foster-Boys centers are computed using density functional theory (DFT) with the PBE0 functional, Aug-cc-pVTZ basis set in the ab-initio quantum chemistry code Psi4.

Files

File preview

files_description.md

All files

Files (2.8 GiB)

Name Size
md5:9ee5f042c4fa40c46ad19333a7eeb4e0
647 Bytes Preview Download
md5:5a1ab9299cb1e5914c11874c1b076139
2.7 GiB Download
md5:60906e6b4f53e161adea3d1c0cf4f068
91.3 MiB Download
md5:076917784fac1bc744d7a379cf1342dd
3.8 KiB Preview Download
md5:9b04a40ed2f9aabf1a8d618aa848b7ce
5.1 KiB Download

References

Journal reference (Paper in which the dataset is described)
M. Cools-Ceuppens, J. Dambre, T. Verstraelen, J. Chem. Theory Comput. 18 (3), 1672–1691 (2022), doi: 10.1021/acs.jctc.1c00978