Bias free multiobjective active learning for materials design and discovery

doi:10.24435/materialscloud:8m-6d

materialscloud:2021.34

Published February 22, 2021 | Version v1

Dataset Open

Bias free multiobjective active learning for materials design and discovery

Jablonka, Kevin Maik¹

*

Melpatti Jothiappan, Giriprasad²

Wang, Shefang²

Smit, Berend¹

*

Yoo, Brian²

*

1. Laboratory of Molecular Simulation, Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), CH-1951 Sion, Valais, Switzerland
2. BASF Corporation, 540 White Plains Road, Tarrytown, New York, 10591, USA

* Contact person

The design rules for materials are clear for applications with a single objective. For most applications, however, there are often multiple, sometimes competing objectives where there is no single best material, and the design rules change to finding the set of Pareto optimal materials. In this work, we introduce an active learning algorithm that directly uses the Pareto dominance relation to compute the set of Pareto optimal materials with desirable accuracy. We apply our algorithm to de novo polymer design with a prohibitively large search space. Using molecular simulations, we compute key descriptors for dispersant applications and reduce the number of materials that need to be evaluated to reconstruct the Pareto front with a desired confidence by over 98% compared to random search. This work showcases how simulation and machine learning techniques can be coupled to discover materials within a design space that would be intractable using conventional screening approaches.

Files

File preview

files_description.md

All files

Files (7.3 GiB)

Name	Size
files_description.md md5:e494984c4449fe604fa5a6b3254a8b2d	561 Bytes	Preview Download
adsorption_runs.zip md5:b8f36997b0b3fbf0898110c30c927d75	3.4 GiB	Preview Download
dimer_runs2.zip md5:2b7b37bbb2623a6f94469b2a0e20a88d	3.4 GiB	Preview Download
ml_data.zip md5:9df8b9ec233d1f551e1e35e21399cf27	430.5 KiB	Preview Download
README.txt md5:4d7e5be16e378137f0aab6f62194f620	1.1 KiB	Preview Download
rg_runs.zip md5:41825511b8ef9210b91814f3b376f810	416.6 MiB	Preview Download

References

Software (Script that can be used to reproduce the main results.)
K. M. Jablonka, M. J. Giriprasad, S. Wang, B. Smit, and B. Yoo, dispersant_screening_PAL (2020).

Software (General-purpose implementation of the active learning algorithm.)
K. M. Jablonka, M. J. Giriprasad, S. Wang, B. Smit, and B. Yoo, PyPAL (2020).

Preprint (Preprint where the data is discussed.)
K. M. Jablonka, M. J. Giriprasad, S. Wang, B. Smit, and B. Yoo, Chemrxiv (2020)., doi: 10.26434/chemrxiv.13200197.v1

Journal reference
K. M. Jablonka, G. M. Jothiappan, S. Wang, B. Smit, B. Yoo, Nature Communications 12, 1-10 (2021), doi: 10.1038/s41467-021-22437-0

	All versions	This version
Views	46	46
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Bias free multiobjective active learning for materials design and discovery

Creators

Description

Files

File preview

files_description.md

All files

Files (7.3 GiB)

References