<?xml version='1.0' encoding='utf-8'?> <oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"> <dc:creator>Rossi, Kevin</dc:creator> <dc:creator>Juraskova, Veronika</dc:creator> <dc:creator>Wischert, Raphael</dc:creator> <dc:creator>Garel, Laurent</dc:creator> <dc:creator>Corminboeuf, Clemence</dc:creator> <dc:creator>Ceriotti, Michele</dc:creator> <dc:date>2020-06-22</dc:date> <dc:description>Set of inputs to perform the calculations reported in the paper. The i-pi input enables to perform molecular dynamics / metadynamics / REMD / PIMD simulations, with adequate thermostats. The DFTB and LAMMPS input respectively enable to calculate force and energies within the DFTB and Neural Network Forcefield frameworks. The CP2K input files enable to calculate force and energies at PBE and PBE0 level. The latter is used as the reference to train the neural network correction on top of DFTB. Brief description of the work: We present a generally-applicable computational framework for the efficient and accurate characterization of molecular structural patterns and acid properties in explicit solvent using H₂O₂ and CH₃SO₃H in phenol as an example. In order to address the challenges posed by the complexity of the problem, we resort to a set of data-driven methods and enhanced sampling algorithms. The synergistic application of these techniques makes the first-principle estimation of the chemical properties feasible without renouncing to the use of explicit solvation, involving extensive statistical sampling. Ensembles of neural network potentials are trained on a set of configurations carefully selected out of preliminary simulations performed at a low-cost density-functional tight-binding (DFTB) level. Energy and forces of these configurations are then recomputed at the hybrid density functional theory (DFT) level and used to train the neural networks. The stability of the NN model is enhanced by using DFTB energetics as a baseline, but the efficiency of the direct NN (i.e., baseline-free) is exploited via a multiple-time step integrator. The neural network potentials are combined with enhanced sampling techniques, such as replica exchange and metadynamics, and used to characterize the relevant protonated species and dominant non-covalent interactions in the mixture, also considering nuclear quantum effects.</dc:description> <dc:identifier>https://archive.materialscloud.org/record/2020.64</dc:identifier> <dc:identifier>doi:10.24435/materialscloud:z9-zr</dc:identifier> <dc:identifier>mcid:2020.64</dc:identifier> <dc:identifier>oai:materialscloud.org:434</dc:identifier> <dc:language>en</dc:language> <dc:publisher>Materials Cloud</dc:publisher> <dc:rights>info:eu-repo/semantics/openAccess</dc:rights> <dc:rights>Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights> <dc:subject>machine learning</dc:subject> <dc:subject>solution chemistry</dc:subject> <dc:subject>acid homogeneous catalysis</dc:subject> <dc:subject>catalysis</dc:subject> <dc:subject>acid</dc:subject> <dc:subject>artificial intelligence</dc:subject> <dc:subject>reaction</dc:subject> <dc:title>Simulating solvation and acidity in complex mixtures with first-principles accuracy: the case of CH₃SO₃H and H₂O₂ in phenol</dc:title> <dc:type>Dataset</dc:type> </oai_dc:dc>