Using collective knowledge to assign oxidation states

doi:10.24435/materialscloud:2019.0085/v1

materialscloud:2019.0085/v1

Published December 11, 2019 | Version v1

Dataset Open

Using collective knowledge to assign oxidation states

Jablonka, Kevin Maik¹

Ongari, Daniele¹

Moosavi, Seyed Mohamad¹

Smit, Berend¹

*

1. Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingenierie Chimiques (ISIC), École Polytechnique Fédérale de Lausanne (EPFL), Sion, VS, Switzerland

* Contact person

Knowledge of the oxidation state of a metal centre in a material is essential to understand its properties. Chemists have developed several theories to predict the oxidation state on the basis of the chemical formula. These methods are quite successful for simple compounds but often fail to describe the oxidation states of more complex systems, such as metal-organic frameworks. In this work, we present a data-driven approach to automatically assign oxidation states, using a machine learning algorithm trained on the assignments by chemists encoded in the chemical names in the Cambridge Crystallographic Database. Our approach only considers the immediate local chemical environment around a metal centre and, in this way, is robust to most of the experimental uncertainties in these structures (like incorrect protonation or unbound solvents). We find such excellent accuracy (>98%) in our predictions that we can use our method to identify a large number of incorrect assignments in the database. The predictions of our model follow chemical intuition, without explicitly having taught the model those heuristics. This work nicely illustrates how powerful the collective knowledge of chemists actually is. Machine learning can harvest this knowledge and convert it into a useful tool for chemists.

Files

File preview

files_description.md

All files

Files (100.7 MiB)

Name	Size
files_description.md md5:3fee9ae5e9b5c045433911af95b1df1d	304 Bytes	Preview Download
datapackage.zip md5:06c443c12aaf2584dbdcadaf2144d13a	100.7 MiB	Preview Download
README.txt md5:c43b268e6071f1793417b125060f5b93	2.1 KiB	Preview Download

References

Journal reference
K. M. Jablonka, D. Ongari, S. M. Moosavi, B. Smit, submitted, 2019.

Software (Code that can be used to generate the feature matrix.)
K. M. Jablonka, D. Ongari, S. M. Moosavi, B. Smit, Zenodo, 2019., doi: 10.5281/zenodo.3567274

Software (Software that implements the code to train and test the models.)
K. M. Jablonka, D. Ongari, S. M. Moosavi, B. Smit, Zenodo, 2019., doi: 10.5281/zenodo.3567011

	All versions	This version
Views	1,230	624
Downloads	255	118
Data volume	86.6 GiB	7.0 GiB

Using collective knowledge to assign oxidation states

Creators

Description

Files

File preview

files_description.md

All files

Files (100.7 MiB)

References