<?xml version='1.0' encoding='utf-8'?> <oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"> <dc:creator>Pozdnyakov, Sergey</dc:creator> <dc:creator>Willat, Michael</dc:creator> <dc:creator>Ceriotti, Michele</dc:creator> <dc:date>2020-09-04</dc:date> <dc:description>Most of the datasets to benchmark machine-learning models contain minimum-energy structures, or small fluctuations around stable geometries, and focus on the diversity of chemical compositions, or the presence of different phases. This dataset provides a large number (7732489) configurations for a simple CH4 composition, that are generated in an almost completely unbiased fashion. Hydrogen atoms are randomly distributed in a 3A sphere centered around the carbon atom, and the only structures that are discarded are those with atoms that are closer than 0.5A, or such that the reference DFT calculation does not converge. This dataset is ideal to benchmark structural representations and regression algorithms, verifying whether they allow reaching arbitrary accuracy in the data rich regime.</dc:description> <dc:identifier>https://archive.materialscloud.org/record/2020.105</dc:identifier> <dc:identifier>doi:10.24435/materialscloud:s6-nq</dc:identifier> <dc:identifier>mcid:2020.105</dc:identifier> <dc:identifier>oai:materialscloud.org:507</dc:identifier> <dc:language>en</dc:language> <dc:publisher>Materials Cloud</dc:publisher> <dc:rights>info:eu-repo/semantics/openAccess</dc:rights> <dc:rights>Creative Commons Attribution Non Commercial 4.0 International https://creativecommons.org/licenses/by-nc/4.0/legalcode</dc:rights> <dc:subject>dataset</dc:subject> <dc:subject>DFT</dc:subject> <dc:subject>Methane</dc:subject> <dc:subject>atomistic machine learning</dc:subject> <dc:subject>SNSF</dc:subject> <dc:subject>MARVEL</dc:subject> <dc:title>Randomly-displaced methane configurations</dc:title> <dc:type>Dataset</dc:type> </oai_dc:dc>