Published March 17, 2026 | Version v2
Dataset Open

Benchmarking physics-inspired machine learning models for transition metal complexes with diverse charge and spin states

  • 1. ROR icon École Polytechnique Fédérale de Lausanne

* Contact person

Description

Physics-inspired machine learning (ML) models can be categorized into two classes: those relying solely on three-dimensional structure and those incorporating electronic information. In this work, we benchmark both classes for predicting quantum-chemical properties of transition metal complexes with diverse charge and spin states, using three complementary datasets. The evaluated methods include molecular representations (SLATM, FCHL, SOAP, and SPAHM family) combined with kernel ridge regression, as well as geometric deep learning models (MACE and 3DMol). We examine how the inclusion of electronic information affects predictive accuracy across datasets and target properties. Models that incorporate electronic information consistently outperform purely structure-based models for properties whose distributions are strongly governed by electronic characters, such as spin-splitting energies and frontier orbital energies. In contrast, structure-only models perform well for predicting the HOMO–LUMO gap and dipole moment magnitude, whose distributions are relatively insensitive to electronic characteristics. Geometric deep learning models with charge and spin embeddings (MACE-QS and 3DMol-QS) show the highest overall accuracy, with 3DMol offering the best computational efficiency among the tested models. These results clarify when geometric information is sufficient and when incorporating electronic information becomes essential, providing practical guidance for selecting effective physics-based ML models for transition metal complexes.

Files

File preview

All files

Files (9.4 GiB)

Name Size
md5:84da153421559d0a9d57c8b9bd8bf501
1.0 GiB Download
md5:c705a45c653cb945f4edd064d2a8bf6f
112.7 MiB Download
md5:061408b9c69a4eba954711dad0dc9004
5.2 GiB Download
md5:42173375d0ccaa187f83dce32741a604
2.9 KiB Preview Download
md5:d514b53bd17f46eda3dbdad21abd3e41
4.4 KiB Preview Download
md5:baf653eeef412edb784d9338bd3148b3
1.1 GiB Download
md5:9ed02efa81ffcb520496bdf845c71fbf
714.0 MiB Download
md5:d803f33405083bae5d3ce857cebdbb00
1.3 GiB Download

References

Preprint
Y. Cho, K. R. Briling, Y. Calvino Alonso, R. Laplaza, and C. Corminboeuf, ChemRxiv (2025), doi: 10.26434/chemrxiv-2025-j38bv