Published February 22, 2021 | Version v1
Dataset Open

Bias free multiobjective active learning for materials design and discovery

  • 1. Laboratory of Molecular Simulation, Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), CH-1951 Sion, Valais, Switzerland
  • 2. BASF Corporation, 540 White Plains Road, Tarrytown, New York, 10591, USA

* Contact person

Description

The design rules for materials are clear for applications with a single objective. For most applications, however, there are often multiple, sometimes competing objectives where there is no single best material, and the design rules change to finding the set of Pareto optimal materials. In this work, we introduce an active learning algorithm that directly uses the Pareto dominance relation to compute the set of Pareto optimal materials with desirable accuracy. We apply our algorithm to de novo polymer design with a prohibitively large search space. Using molecular simulations, we compute key descriptors for dispersant applications and reduce the number of materials that need to be evaluated to reconstruct the Pareto front with a desired confidence by over 98% compared to random search. This work showcases how simulation and machine learning techniques can be coupled to discover materials within a design space that would be intractable using conventional screening approaches.

Files

File preview

files_description.md

All files

Files (7.3 GiB)

Name Size
md5:e494984c4449fe604fa5a6b3254a8b2d
561 Bytes Preview Download
md5:b8f36997b0b3fbf0898110c30c927d75
3.4 GiB Preview Download
md5:2b7b37bbb2623a6f94469b2a0e20a88d
3.4 GiB Preview Download
md5:9df8b9ec233d1f551e1e35e21399cf27
430.5 KiB Preview Download
md5:4d7e5be16e378137f0aab6f62194f620
1.1 KiB Preview Download
md5:41825511b8ef9210b91814f3b376f810
416.6 MiB Preview Download

References

Software (Script that can be used to reproduce the main results.)
K. M. Jablonka, M. J. Giriprasad, S. Wang, B. Smit, and B. Yoo, dispersant_screening_PAL (2020).

Software (General-purpose implementation of the active learning algorithm.)
K. M. Jablonka, M. J. Giriprasad, S. Wang, B. Smit, and B. Yoo, PyPAL (2020).

Preprint (Preprint where the data is discussed.)
K. M. Jablonka, M. J. Giriprasad, S. Wang, B. Smit, and B. Yoo, Chemrxiv (2020)., doi: 10.26434/chemrxiv.13200197.v1

Journal reference
K. M. Jablonka, G. M. Jothiappan, S. Wang, B. Smit, B. Yoo, Nature Communications 12, 1-10 (2021), doi: 10.1038/s41467-021-22437-0