Published May 12, 2023 | Version v1
Dataset Open

Incompleteness of graph neural networks for points clouds in three dimensions

  • 1. Institute of Materials, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland

* Contact person

Description

Graph neural networks are a popular deep-learning architecture in applications to materials and molecules, and the most widespread implementations rely on interatomic distances as geometric descriptors. Unfortunately, GNNs based on distances are not complete, i.e. there are geometries, corresponding to molecules and/or periodic structures, that are indistinguishable by the GNN. For these, the corresponding machine-learning models will be unable to learn differences in the properties of the "degenerate" structures. This dataset contains a collection of molecular and solid structures that cannot be discriminated by distance-based graph neural networks, together with example code showing how to parse them and use to demonstrate the shortcomings of this class of machine-learning algorithms.

Files

File preview

files_description.md

All files

Files (8.3 MiB)

Name Size
md5:ccebfa892ed6965ab660c63e911525a3
298 Bytes Preview Download
md5:c987bec26ce8c6055363dce34ff60bef
8.3 MiB Preview Download

References

Journal reference (Paper describing the construction of this class of counterexamples)
S. N. Pozdnyakov and M. Ceriotti, Mach. Learn.: Sci. Technol. 3(4), 045020 (2022)., doi: 10.1088/2632-2153/aca1f8

Journal reference (Paper describing the construction of this class of counterexamples)
S. N. Pozdnyakov and M. Ceriotti, Mach. Learn.: Sci. Technol. 3(4), 045020 (2022).