Published October 25, 2022 | Version v2
Dataset Open

Ranking the synthesizability of hypothetical zeolites with the sorting hat

  • 1. Laboratory of Computational Science and Modeling, Institut des Matériaux, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
  • 2. PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 Rue Lhomond, 75005 Paris, France
  • 3. ICGM, Université de Montpellier, CNRS, ENSCM, Montpellier, France
  • 4. Department of Chemistry and Department of Chemical Engineering, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, USA

* Contact person

Description

Zeolites are nanoporous alumino-silicate frameworks widely used as catalysts and adsorbents. Even though millions of siliceous networks can be generated by computer-aided searches, no new hypothetical framework has yet been synthesized. The needle-in-a-haystack problem of finding promising candidates among large databases of predicted structures has intrigued materials scientists for decades; yet, most work to date on the zeolite problem has been limited to intuitive structural descriptors. Here, we tackle this problem through a rigorous data science scheme—the "zeolite sorting hat"—that exploits interatomic correlations to discriminate between real and hypothetical zeolites and to partition real zeolites into compositional classes that guide synthetic strategies for a given hypothetical framework. We find that, regardless of the structural descriptor used by the zeolite sorting hat, there remain hypothetical frameworks that are incorrectly classified as real ones, suggesting that they might be good candidates for synthesis. We seek to minimize the number of such misclassified frameworks by using as complete a structural descriptor as possible, thus focusing on truly viable synthetic targets, while discovering structural features that distinguish real and hypothetical frameworks as an output of the zeolite sorting hat. Further ranking of the candidates can be achieved based on thermodynamic stability and/or their suitability for the desired applications. Based on this workflow, we propose three hypothetical frameworks differing in their molar volume range as the top targets for synthesis, each with a composition suggested by the zeolite sorting hat. Finally, we analyze the behavior of the zeolite sorting hat with a hierarchy of structural descriptors including intuitive descriptors reported in previous studies, finding that intuitive descriptors produce significantly more misclassified hypothetical frameworks, and that more rigorous interatomic correlations point to second-neighbor Si-O distances around 3.2–3.4 Å as the key discriminatory factor.

Files

File preview

files_description.md

All files

Files (638.4 MiB)

Name Apps Size
md5:25257250764cdc46076d88990df6f6b9
365 Bytes Preview Download
md5:3b94a4086f429e8a1c1bc9fecfb4ff36
631.0 MiB Download
md5:5ccb6e3e9b1e8e574f90080d9f85b478
7.4 MiB Download

References

Journal reference (Paper for which the data were generated)
B. A. Helfrecht, G. Pireddu, R. Semino, S. M. Auerbach, M. Ceriotti, Digital Discovery 1, 779-789 (2022), doi: 10.1039/D2DD00056C

Journal reference (Paper for which the data were generated)
B. A. Helfrecht, G. Pireddu, R. Semino, S. M. Auerbach, M. Ceriotti, Digital Discovery 1, 779-789 (2022)

Journal reference (Paper in which the subset of 10,000 structures from the Deem database was originally described)
B. A. Helfrecht, R. Semino, G. Pireddu, S. M. Auerbach, M. Ceriotti, J. Chem. Phys. 151, 154112 (2019), doi: 10.1063/1.5119751

Journal reference (Paper in which the Deem database of hypothetical zeolites was originally created, described, and used)
R. Pophale, P. A. Cheeseman, M. W. Deem, Phys. Chem. Chem. Phys. 13, 12407-12412 (2011)., doi: 10.1039/C0CP02255A

Website (Archive of the original Deem database of hypothetical zeolite structures)
M. W. Deem, Michael Deem's PCOD and PCOD2 databases of zeolitic structures [Zenodo data set] (2020)., doi: 10.5281/zenodo.4030232