Simulating solvation and acidity in complex mixtures with first-principles accuracy: the case of CH₃SO₃H and H₂O₂ in phenol


JSON Export

{
  "revision": 7, 
  "id": "781", 
  "created": "2021-03-19T17:39:20.419226+00:00", 
  "metadata": {
    "doi": "10.24435/materialscloud:2x-7x", 
    "status": "published", 
    "title": "Simulating solvation and acidity in complex mixtures with first-principles accuracy: the case of CH\u2083SO\u2083H and H\u2082O\u2082 in phenol", 
    "mcid": "2021.50", 
    "license_addendum": null, 
    "_files": [
      {
        "description": "i-pi input to run basic MD (https://github.com/cosmo-epfl/i-pi)", 
        "key": "input.xml", 
        "size": 3779, 
        "checksum": "md5:012f438cd8a0d09e1ae32cd583c7a09a"
      }, 
      {
        "description": "example PBE input for DFT calculations (https://www.cp2k.org/)", 
        "key": "pbe.cp2k", 
        "size": 1587, 
        "checksum": "md5:08e632a33d5afb551b9d5f704ceced55"
      }, 
      {
        "description": "exemple PBE0 input for DFT calculations (https://www.cp2k.org/)", 
        "key": "pbe0.cp2k", 
        "size": 3058, 
        "checksum": "md5:9a47c035c868b0ccb78729729345949c"
      }, 
      {
        "description": "example DFTB+ input for DFTB calculations (https://www.dftbplus.org/)", 
        "key": "dftb_in.hsd", 
        "size": 1222, 
        "checksum": "md5:8e253196041e84221c7cde407c6c5656"
      }, 
      {
        "description": "example input.nn input to train and use a neural network for force and energy predictions (https://github.com/CompPhysVienna/n2p2)", 
        "key": "input.nn", 
        "size": 23151, 
        "checksum": "md5:e7acab8f9218b083a66d28a0bcda31a1"
      }, 
      {
        "description": "example LAMMPS input for MD calculations via i-pi and using neural network potentials (https://lammps.sandia.gov/)", 
        "key": "lmp1.in", 
        "size": 3547, 
        "checksum": "md5:ba8eae7e733c2cd14244540b26535f8e"
      }, 
      {
        "description": "Dataset for learning DFT energies and forces (input.data format as used in N2P2)", 
        "key": "input.data.direct.gz", 
        "size": 35960401, 
        "checksum": "md5:36254b945cfaf68c226e49644ce5e96f"
      }, 
      {
        "description": "Dataset for learning DFTB-baselined energies and forces (input.data format as used in N2P2)", 
        "key": "input.data.delta.gz", 
        "size": 37003974, 
        "checksum": "md5:d6b5b6ed0368757315593302a3df17ff"
      }, 
      {
        "description": "nn weights for direct predictions of forces and energies", 
        "key": "direct.tar.gz", 
        "size": 146607, 
        "checksum": "md5:8475cf5afa4b6543e57b845c96882a14"
      }, 
      {
        "description": "nn weights for DFTB-baselined predictions of forces and energies", 
        "key": "delta.tar.gz", 
        "size": 730654, 
        "checksum": "md5:68cde22ad92072c0452c7941ce792bb0"
      }, 
      {
        "description": "README", 
        "key": "README.txt", 
        "size": 375, 
        "checksum": "md5:9dc6a1ab9a65080329ee332892c0460f"
      }
    ], 
    "owner": 132, 
    "_oai": {
      "id": "oai:materialscloud.org:781"
    }, 
    "keywords": [
      "machine learning", 
      "solution chemistry", 
      "acid homogeneous catalysis", 
      "catalysis", 
      "acid", 
      "artificial intelligence", 
      "reaction", 
      "CH3SO3H", 
      "H2O2", 
      "MARVEL"
    ], 
    "conceptrecid": "433", 
    "is_last": false, 
    "references": [
      {
        "type": "Journal reference", 
        "doi": "https://doi.org/10.1021/acs.jctc.0c00362", 
        "url": "https://pubs.acs.org/doi/abs/10.1021/acs.jctc.0c00362", 
        "comment": "Paper reference", 
        "citation": "K. Rossi, V. Jur\u00e1skov\u00e1, R. Wischert, L. Garel, C. Corminb\u0153uf, M. Ceriotti, J. Chem. Theory Comput., 16, 8, 5139\u20135149 (2020)"
      }
    ], 
    "publication_date": "Mar 26, 2021, 18:24:05", 
    "license": "Creative Commons Attribution 4.0 International", 
    "id": "781", 
    "description": "Set of inputs to perform the calculations reported in the paper.\nThe i-pi input enables to perform molecular dynamics / metadynamics / REMD / PIMD simulations, with adequate thermostats.\nThe DFTB and LAMMPS input respectively enable to calculate force and energies within the DFTB and Neural Network Forcefield frameworks.\nThe CP2K input files enable to calculate force and energies at PBE and PBE0 level. The latter is used as the reference to train the neural network correction on top of DFTB.\n\nBrief description of the work: We present a generally-applicable computational framework for the efficient and accurate characterization of molecular structural patterns and acid properties in explicit solvent using H\u2082O\u2082 and CH\u2083SO\u2083H in phenol as an example. In order to address the challenges posed by the complexity of the problem, we resort to a set of data-driven methods and enhanced sampling algorithms. The synergistic application of these techniques makes the first-principle estimation of the chemical properties feasible without renouncing to the use of explicit solvation, involving extensive statistical sampling. Ensembles of neural network potentials are trained on a set of configurations carefully selected out of preliminary simulations performed at a low-cost density-functional tight-binding (DFTB) level. Energy and forces of these configurations are then recomputed at the hybrid density functional theory (DFT) level and used to train the neural networks. The stability of the NN model is enhanced by using DFTB energetics as a baseline, but the efficiency of the direct NN (i.e., baseline-free) is exploited via a multiple-time step integrator. The neural network potentials are combined with enhanced sampling techniques, such as replica exchange and metadynamics, and used to characterize the relevant protonated species and dominant non-covalent interactions in the mixture, also considering nuclear quantum effects.", 
    "version": 2, 
    "contributors": [
      {
        "email": "kevin.rossi@epfl.ch", 
        "affiliations": [
          "Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, \u00c9cole Polytechnique F\u00e9d\u00e9rale de Lausanne (EPFL), Lausanne, 1015, Switzerland"
        ], 
        "familyname": "Rossi", 
        "givennames": "Kevin"
      }, 
      {
        "email": "veronika.juraskova@epfl.ch", 
        "affiliations": [
          "Laboratory for Computational Molecular Design (LCMD), Institute of Chemical Sciences and Engineering, \u00c9cole Polytechnique F\u00e9d\u00e9rale de Lausanne (EPFL), Lausanne, 1015, Switzerland"
        ], 
        "familyname": "Juraskova", 
        "givennames": "Veronika"
      }, 
      {
        "affiliations": [
          "Eco-Efficient Products and Processes Laboratory, Solvay, RIC Shanghai, China"
        ], 
        "familyname": "Wischert", 
        "givennames": "Raphael"
      }, 
      {
        "affiliations": [
          "Aroma Performance Laboratory, Solvay, RIC Lyon, France"
        ], 
        "familyname": "Garel", 
        "givennames": "Laurent"
      }, 
      {
        "email": "clemence.corminboeuf@epfl.ch", 
        "affiliations": [
          "Laboratory for Computational Molecular Design (LCMD), Institute of Chemical Sciences and Engineering, \u00c9cole Polytechnique F\u00e9d\u00e9rale de Lausanne (EPFL), Lausanne, 1015, Switzerland"
        ], 
        "familyname": "Corminboeuf", 
        "givennames": "Clemence"
      }, 
      {
        "email": "michele.ceriotti@epfl.ch", 
        "affiliations": [
          "Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, \u00c9cole Polytechnique F\u00e9d\u00e9rale de Lausanne (EPFL), Lausanne, 1015, Switzerland"
        ], 
        "familyname": "Ceriotti", 
        "givennames": "Michele"
      }
    ], 
    "edited_by": 100
  }, 
  "updated": "2021-05-20T09:18:17.088002+00:00"
}