## Latest records Upload

### Free-Energy Surface Prediction by Flying Gaussian Method

[materialscloud:2019.0032] Last version: 18 June 2019

Molecular simulations are computationally expensive, especially in systems with multiple free energy minima. To address this problem many enhanced sampling techniques have been developed. Metadynamics uses a bias potential defined as a sum of Gaussian hills in space of few (one or two) collective variables. This bias potential disfavors states that have been visited since the beginning of the simulation. Multiple walker metadynamics simulates the system in multiple parallel replicas. The bias potential disfavors states that have been visited since the beginning of the simulation in any replica. Flying Gaussian method presented here also simulates the system in multiple parallel replicas. It disfavors states that are, at certain moment, similar in two or more replicas. It was demonstrated on Alanine Dipeptide in vacuum and water, cis/trans-isomerisation of Proline-containing peptides and Met-enkephalin.

### Reaction pathway of oxygen evolution on Pt(1 1 1) revealed through constant Fermi level molecular dynamics

[materialscloud:2019.0031] Last version: 18 June 2019

The pathway of the oxygen evolution reaction at the Pt(1 1 1)/water interface is disclosed through constant Fermi level molecular dynamics. Upon the application of a positive bias potential H2O_ads and O_ads adsorbates are found to arrange in a hexagonal lattice with an irregular alternation. Increasing further the electrode potential then induces the oxygen evolution reaction, which is found to proceed through a hydrogen peroxide intermediate. Calculation of the associated overpotential shows a reduction of 0.2 eV compared to the associative mechanism. This result highlights the forcefullness of the applied scheme in exploring catalytic reactions in an unbiased way.

### Reducing the Number of Mean-Square Deviation Calculations with Floating Close Structure in Metadynamics

[materialscloud:2019.0021] Last version: 17 June 2019

Biomolecular simulations are computationally expensive. This limits their application in protein folding simulations, protein engineering, drug design and related fields. Enhanced sampling techniques such as metadynamics accelerates slow events in molecular simulation. This and other method apply artificial forces in directions of collective degrees of freedom (aka collective variables). Path collective variables and Property Map collective variables are defined using a series of reference structures of the studied molecular system. They require a huge number of mean square deviation calculations along the simulation. Close Structure algorithm reduces the number of these operations.

### Multithermal-multibaric molecular simulations from a variational principle

[materialscloud:2019.0016] Last version: 12 June 2019

We present a method for performing multithermal-multibaric molecular dynamics simulations that sample entire regions of the temperature-pressure (TP) phase diagram. The method uses a variational principle [Valsson and Parrinello, Phys. Rev. Lett. 113, 090601 (2014)] in order to construct a bias that leads to a uniform sampling in energy and volume. The intervals of temperature and pressure are taken as inputs and the relevant energy and volume regions are determined on the fly. In this way the method guarantees adequate statistics for the chosen TP region. We show that our multithermal-multibaric simulations can be used to calculate all static physical quantities for all temperatures and pressures in the targeted region of the TP plane. We illustrate our approach by studying the density anomaly of TIP4P/Ice water. This record includes input and output files, and Jupyter Notebooks describing the analysis of the simulations and the creation of the figures for the paper.

### MD trajectories of semiconductor-water interfaces and relaxed atomic structures of semiconductor surfaces

[materialscloud:2019.0029] Last version: 02 June 2019

This entry includes the MD trajectories of several semiconductor-water interfaces generated with ab initio molecular dynamics using the rVV10 density functional at the temperature of 350 K. Eight semiconductor surfaces are considered, namely GaAs(110), GaP(110), GaN(10-10), CdS(10-10), ZnO(10-10), SnO2(110), rutile TiO2(110) and anatase TiO2(101). For GaAs, GaP and anatase TiO2, the trajectories for the interfaces with both the molecular and the dissociative adsorption mode of water are provided. In addition, the relaxed atomic structures of the semiconductor surfaces used to calculate the ionization potential (IP) reported in [Chem. Mater. 2018, 30, 94−111] are added.

### MD trajectories of bulk water and of the water-vacuum interface

[materialscloud:2019.0030] Last version: 01 June 2019

This entry provides MD trajectories for bulk water and the water-vacuum interface generated with ab initio molecular dynamics using rVV10 density functional at the temperature of 350 K. In the rVV10 functional, the parameter b is set to 9.3.

### Determining interface structures in vertically aligned nanocomposite films

[materialscloud:2019.0025] Last version: 31 May 2019

Vertically aligned nanocomposites (VANs) films have self-assembled pillar-matrix nanostructures. Owing to their large area-to-volume ratios, interfaces in VAN films are expected to play key roles in inducing functional properties, but our understanding is hindered by limited knowledge about their structures. Motivated by the lack of definitive explanation for the experimentally-found enhanced ionic conductivity in Sm-doped-CeO2/SrTiO3 VAN films, we determine the structure at vertical interfaces using random structure searching and explore how it can affect ionic conduction. This record contains the candidate structures and provenance of the DFT validation calculations. Previously unknown interface structures are found, with lower energy than that of an optimized hand-built model. We find a strongly distorted oxygen sub-lattice which gives a complex landscape of vacancy energies. The cation lattice remains similar to the bulk phase but has a localized strain field. The excess ...

### Intrinsic defects in amorphous TiO2

[materialscloud:2019.0028] Last version: 30 May 2019

This entry provides the atomic structures of three bulk amorphous TiO2 models generated through the melt-and-quench method with different cooling rates and of ten O-O peroxy linkages obtained by adding two holes to the bulk model constructed with the lowest cooling rate.

### Extrinsic Defects in Amorphous Oxides: Hydrogen, Carbon, and Nitrogen Impurities in Alumina

[materialscloud:2019.0027] Last version: 30 May 2019

This entry provides the most stable defect configurations of hydrogen, carbon, and nitrogen impurities in alumina, which are identified through ab initio molecular dynamics in various charge states and structural relaxations with the PBE functional. The structural configurations related to carbon and nitrogen impurities are found to depend on the total charge set in the simulation cell.

### Oxygen defects in amorphous Al2O3

[materialscloud:2019.0026] Last version: 30 May 2019

The electronic properties of the oxygen vacancy and interstitial in amorphous Al2O3 are studied via ab initio molecular dynamics simulations and hybrid functional calculations. Our results indicate that these defects do not occur in amorphous Al2O3, due to structural rearrangements which assimilate the defect structure and cause a delocalization of the associated defect levels. The imbalance of oxygen leads to a nonstoichiometric compound in which the oxygen occurs in the form of O2– ions. Intrinsic oxygen defects are found to be unable to trap excess electrons. For low Fermi energies, the formation of peroxy linkages is found to be favored leading to the capture of holes. The relative +2/0 defect levels occur at 2.5 eV from the valence band.

### Applicability of tail-corrections in the molecular simulations of porous materials

[materialscloud:2019.0024] Last version: 29 May 2019

Molecular simulations with periodic boundary conditions require to define a certain cutoff distance beyond which pairwise dispersion interactions are neglected. For the simulation of homogeneous phases it is well-established to use tail-corrections, that can remedy this truncation of the potential. These corrections are built under the assumption that beyond the cutoff the radial distribution function is equal to one. In this work we shed some light on the discussion whether or not tail corrections should be used in the modelling of heterogeneous systems. We show that for the adsorption of gasses in a diverse set nanoporous crystalline materials (zeolites, Covalent Organic Frameworks (COFs), and Metal Organic Frameworks (MOFs)), tail-corrections are an appropriate choice with which the results are much less sensitive to the details of the truncation.

### Chemical Shifts in Molecular Solids by Machine Learning Datasets

[materialscloud:2019.0023] Last version: 27 May 2019

We present a database of energy and NMR chemical shifts DFT calculations of 2500 crystal organic solids. The structures contain only H/C/N/O atoms and were subject to all-atoms geometry optimisation. Calculations were carried out using Quantum Espresso and GIPAW.

### Dynamics of the Bulk Hydrated Electron from Many‐Body Wave‐Function Theory

[materialscloud:2019.0022] Last version: 27 May 2019

Trajectories and spin densities for the bulk hydrated electron at the MP2 level of theory. The data represent the first ab initio molecular dynamics study of the hydrated electron in the bulk using many-body wave function theory.

### Making the best of a bad situation: a multiscale approach to free energy calculation

[materialscloud:2019.0004] Last version: 21 May 2019

Many enhanced sampling techniques rely on the identification of a number of collective variables that describe all the slow modes of the system. By constructing a bias potential in this reduced space one is then able to sample efficiently and reconstruct the free energy landscape. In methods like metadynamics, the quality of these collective variables plays a key role in convergence efficiency. Unfortunately in many systems of interest it is not possible to identify an optimal collective variable, and one must deal with the non-ideal situation of a system in which some slow modes are not accelerated. We propose a two-step approach in which, by taking into account the residual multiscale nature of the problem, one is able to significantly speed up convergence. To do so, we combine an exploratory metadynamics run with an optimization of the free energy difference between metastable states, based on the recently proposed variationally enhanced sampling method. This new method is ...

### TopoMat: a database of high-throughput first-principles calculations of topological materials

[materialscloud:2019.0019] Last version: 21 May 2019

We present a database of topological materials predicted from high-throughput first-principles calculations. The database contains electronic band structures and topological indices of 13628 materials calculated on experimental crystal structures taken from the Inorganic Crystal Structure Database (ICSD) and the Crystallography Open Database (COD). The calculations have been performed on non-magnetic phases taking into account the spin-orbit interactions using the Quantum ESPRESSO package. The Fu-Kane method and the Wannier charge center method implemented in the Z2pack code have been utilized to calculate the Z2 topological numbers of centrosymmetric and non-centrosymmetric materials, respectively. Over 4000 topologically non-trivial materials have been identified.

### Data-driven studies of magnetic two-dimensional materials

[materialscloud:2019.0020] Last version: 20 May 2019

We use a data-driven approach to study the magnetic and thermodynamic properties of van der Waals (vdW) layered materials. We investigate monolayers of the form A2B2X6, based on the known material Cr2Ge2Te6, using density functional theory (DFT) calculations and determine their magnetic properties, such as magnetic order and magnetic moment. We also examine formation energies and use them as a proxy for chemical stability.

### MgTa2N3: A reference Dirac semimetal

[materialscloud:2019.0018] Last version: 13 May 2019

MgTa2N3 is predicted to host the topological Dirac semimetal phase. This archive includes input data necessary for reproducing first-principles calculation described in the publication.

### Graph Dynamical Networks for Unsupervised Learning of Atomic Scale Dynamics in Materials

[materialscloud:2019.0017] Last version: 09 May 2019

Understanding the dynamical processes that govern the performance of functional materials is essential for the design of next generation materials to tackle global energy and environmental challenges. Many of these processes involve the dynamics of individual atoms or small molecules in condensed phases, e.g. lithium ions in electrolytes, water molecules in membranes, molten atoms at interfaces, etc., which are difficult to understand due to the complexity of local environments. We develop graph dynamical networks, an unsupervised learning approach for understanding atomic scale dynamics in arbitrary phases and environments from molecular dynamics simulations. We show that important dynamical information can be learned for various multi-component amorphous material systems, which is difficult to obtain otherwise. We develop a software package "gdynet" at https://github.com/txie-93/gdynet which implements the graph dynamical networks algorithm. This record contains the MD ...

### Coupled-Cluster Polarizabilities in the QM7b and a Showcase Database

[materialscloud:2019.0002] Last version: 04 May 2019

Dipole polarizabilities, computed using linear response coupled cluster theory and density functional theory (using d-aug-cc-pVDZ basis set), for 7211 molecules from the QM7b dataset of small molecules and for 52 molecules from a showcase dataset.

### High-throughput calculations of catalytic properties of bimetallic alloy surfaces

[materialscloud:2019.0015] Last version: 01 May 2019

We present a large dataset of adsorption of H, C, N, O and S onto more than 2,000 metallic and bimetallic alloy surfaces, consisting of approximately 90,000 DFT calculations performed in Quantum Espresso. The alloys are constructed from all possible combinations of 37 metals into AB and A3B stoichiometries, in the L1_0 and L1_2 structures respectively, where the 37 metals in the A1 structure are included as well. Slabs are cleaved along the 111 facet for A1 and L1_2 and along the 101 facet for L1_0, and all possible adsorption sites are sampled. In addition to the monoatomic adsorbates, adsorption of CH, CH2, CH3, NH, NH2, OH, H2O and SH is sampled for a smaller subset of alloys.

### Approximation of Collective Variables by anncolvar

[materialscloud:2019.0014] Last version: 23 April 2019

Biomolecular simulations are computationally expensive. This limits their application in drug or protein design and related fields. Several methods have been developed to address this problem. These methods often use an artificial force or potential acting on selected degrees of freedom known as collective variables. This requires explicit calculation of a collective variable (and its derivatives) from molecular structure. For collective variables that cannot be calculated explicitly or such calculations is slow we developed anncolvar package (https://github.com/spiwokv/anncolvar). This package approximates collective variables using artificial neural networks. It was tested on Isomap low dimensional representation of cyclooctane derivative or solvent-accessible surface area of Trp-cage miniprotein.

### Free-Energy Surface Prediction by Flying Gaussian Method: Numerical Proof

[materialscloud:2019.0013] Last version: 17 April 2019

Biomolecular simulations have a great potential in protein engineering, drug discovery and many other fields. Unfortunately, this method is computationally expensive, so many interesting processes cannot be routinely studied. In order to address this problem we developed Flying Gaussian method [Journal of Chemical Theory and Computation 12, 4644-4650 (2016)]. This method simultaneously simulates multiple replicas of the studied system and disfavor replicas with similar structures by artificial bias potential. The question arises how to calculate an unbiased free energy surface from a biased simulation. This dataset demonstrates together with mathematical arguments supports application of Umbrella Sampling reweighing method, despite the fact that this method is designed for methods with a time-independent bias potential.

### Unified theory of thermal transport in crystals and disordered solids

[materialscloud:2019.0001] Last version: 08 April 2019

Crystals and glasses exhibit fundamentally different heat conduction mechanisms: the periodicity of crystals allows for the excitation of propagating vibrational waves that carry heat, as first discussed by Peierls; in glasses, the lack of periodicity breaks Peierls' picture and heat is mainly carried by the coupling of vibrational modes, often described by a harmonic theory introduced by Allen and Feldman. Anharmonicity or disorder are thus the limiting factors for thermal conductivity in crystals or glasses; hitherto, no transport equation has been able to account for both. In the paper https://arxiv.org/abs/1901.01964, we derive such equation, resulting in a thermal conductivity that reduces to the Peierls and Allen-Feldman limits, respectively, in anharmonic-and-ordered or harmonic-and-disordered solids, while also covering the intermediate regimes where both effects are relevant. This approach also solves the long-standing problem of accurately predicting the thermal ...

### Tailoring Bond Topologies in Open-Shell Graphene Nanostructures

[materialscloud:2019.0012] Last version: 07 April 2019

The data contained in this record, raw data of images and input files to reproduce calculations, support our recent report for the on-surface synthesis and characterization of two ultralow-gap open-shell molecules, namely peri-tetracene, a benzenoid graphene fragment with zigzag edge topology, and dibenzo[a,m]dicyclohepta[bcde,nopq]rubicene, a nonbenzenoid nonalternant structural isomer of peri-tetracene with two embedded azulene units. Our results provide an understanding of the ramifications of altered bond topologies at the single-molecule scale, with the prospect of designing functionalities in carbon-based nanostructures via engineering of bond topology

### Two-dimensional materials from high-throughput computational exfoliation of experimentally known compounds

[materialscloud:2017.0008] Last version: 03 April 2019

Two-dimensional (2D) materials have emerged as promising candidates for next-generation electronic and optoelectronic applications. Yet, only a few dozens of 2D materials have been successfully synthesized or exfoliated. Here, we search for novel 2D materials that can be easily exfoliated from their parent compounds. Starting from 108423 unique, experimentally known three-dimensional compounds we identify a subset of 5619 that appear layered according to robust geometric and bonding criteria. High-throughput calculations using van-der-Waals density-functional theory, validated against experimental structural data and calculated random-phase-approximation binding energies, allow to identify 1825 compounds that are either easily or potentially exfoliable. In particular, the subset of 1036 easily exfoliable cases provides novel structural prototypes and simple ternary compounds as well as a large portfolio of materials to search from for optimal properties. For a subset of 258 ...

### TuTraST data for methane diffusion in IZA structures

[materialscloud:2019.0011] Last version: 11 March 2019

Data for the case study of TuTraSt on methane diffusion in zeolites, using a standard kinetic Monte Carlo simulation based on the output of our grid analysis. We find that it is accurate, fast, and rigorous without limitations to the geometries of the diffusion tunnels or transition states.

### Capturing chemical intuition in synthesis of metal-organic frameworks

[materialscloud:2018.0011] Last version: 03 March 2019

We report a methodology using machine learning to capture chemical intuition from a set of (partially) failed attempts to synthesize a metal organic framework. We define chemical intuition as the collection of unwritten guidelines used by synthetic chemists to find the right synthesis conditions. As (partially) failed experiments usually remain unreported, we have reconstructed a typical track of failed experiments in a successful search for finding the optimal synthesis conditions that yields HKUST-1 with the highest surface area reported to date. We illustrate the importance of quantifying this chemical intuition for the synthesis of novel materials.

### Enhanced sampling of transition states

[materialscloud:2019.0010] Last version: 28 February 2019

The free energy landscapes of several fundamental processes are characterized by high barriers separating long-lived metastable states. In order to explore these type of landscapes enhanced sampling methods are used. While many such methods are able to obtain sufficient sampling in order to draw the free energy, the transition states are often sparsely sampled. We propose an approach based on the Variationally Enhanced Sampling Method to enhance sampling in the transition region. To this effect, we introduce a dynamic target distribution which uses the derivative of the instantaneous free energy surface to locate the transition regions on the fly and modulate the probability of sampling different regions. Finally, we exemplify the effectiveness of this approach in enriching the number of configurations in the transition state region in the cases of a chemical reaction and of a nucleation process.

### Mining the C-C Cross-Coupling Genome using Machine Learning

[materialscloud:2019.0007] Last version: 23 February 2019

Applications of machine-learning (ML) techniques to the study of catalytic processes have begun to appear in the literature with increasing frequency. The computational speed up provided by ML allows the properties and energetics of thousands of prospective catalysts to be rapidly assessed. These results, once compiled into a database containing different properties, can be mined with the goal of establishing relationships between the intrinsic chemical properties of different catalysts and their overall catalytic performance. Previously, we applied ML models to predict the performance of 18,000 prospective catalysts for a Suzuki coupling reaction using molecular volcano plots. Here, we expand on our earlier work by examining a larger section of the C-C cross-coupling genome by using a dimensionality-reducing data-clustering algorithms (i.e., sketch-map) to, first, identify the compatibility of each catalyst with different C-C cross-coupling variants (e.g., Suzuki, Kumada, ...

### Energetics of the coupled electronic–structural transition in the rare-earth nickelates

[materialscloud:2019.0009] Last version: 21 February 2019

Rare-earth nickelates exhibit a metal–insulator transition accompanied by a structural distortion that breaks the symmetry between formerly equivalent Ni sites. The quantitative theoretical description of this coupled electronic–structural instability is extremely challenging. Here, we address this issue by simultaneously taking into account both structural and electronic degrees of freedom using a charge self-consistent combination of density functional theory and dynamical mean-field theory, together with screened interaction parameters obtained from the constrained random phase approximation. Our total energy calculations show that the coupling to an electronic instability toward a charge disproportionated insulating state is crucial to stabilize the structural distortion, leading to a clear first order character of the coupled transition. The decreasing octahedral rotations across the series suppress this electronic instability and simultaneously increase the screening of the ...

### Unsupervised landmark analysis for jump detection in molecular dynamics simulations

[materialscloud:2019.0008] Last version: 12 February 2019

Molecular dynamics is a versatile and powerful method to study diffusion in solid-state ionic conductors, requiring minimal prior knowledge of equilibrium or transition states of the system's free energy surface. However, the analysis of trajectories for relevant but rare events, such as a jump of the diffusing mobile ion, is still rather cumbersome, requiring prior knowledge of the diffusive process in order to get meaningful results. In this work we present a novel approach to detect the relevant events in a diffusive system without assuming prior information regarding the underlying process. We start from a projection of the atomic coordinates into a landmark basis to identify the dominant features in a mobile ion's environment. Subsequent clustering in landmark space enables a discretization of any trajectory into a sequence of distinct states. As a final step, the use of the Smooth Overlap of Atomic Positions descriptor allows distinguishing between different ...

### DORI reveals the influence of non-covalent interactions on covalent bonding patterns in molecular crystals under pressure

[materialscloud:2019.0006] Last version: 06 February 2019

The study of organic molecular crystals under high pressure provides fundamental insight into crystal packing distortions and reveals mechanisms of phase transitions and the crystallization of polymorphs. These solid state transformations can be monitored directly by analyzing electron charge densities that are experimentally obtained at high pressure. However, restricting the analysis to the featureless electron density does not reveal the chemical bonding nature and the existence of intermolecular interactions. This shortcoming can be resolved by the use of the DORI (Density Overlap Region Indicator) descriptor, which is capable of detecting simultaneously both covalent patterns and non-covalent interactions from electron density and its derivatives. Using the biscarbonyl[14]annulene crystal under pressure as an example, we demonstrate how DORI can be exploited on experimental electron densities to reveal and monitor changes in electronic structure patterns resulting from ...

### Double helix PBDT polymer - Submitted manuscript, simulations and other source data

[materialscloud:2019.0005] Last version: 03 February 2019

We describe a double helical conformation in the densely charged aromatic polyamide poly(2,2’- disulfonyl-4,4’-benzidine terephthalamide) or PBDT. This double helix macromolecule represents one of the most rigid simple molecular structures known, exhibiting an extremely high axial persistence length (~ 1 micrometer).

### Emergence of hidden phases of methylammonium lead-iodide (CH3NH3PbI) upon compression

[materialscloud:2019.0003] Last version: 23 January 2019

We perform a thorough structural search with the minima hopping method (MHM) to explore low-energy structures of methylammonium lead iodide. By combining the MHM with a forcefield, we efficiently screen vast portions of the configurational space with large simulation cells containing up to 96 atoms. Our search reveals two structures of methylammonium iodide perovskite (MAPI) that are substantially lower in energy than the well-studied experimentally observed low-temperature orthorhombic phase. The data set containing approximately ~180,000 crystal structures is provided.

### Few layer 2D pnictogens catalyze the alkylation of soft nucleophiles with esters.

[materialscloud:2018.0021] Last version: 14 January 2019

Group 15 elements in zero oxidation state (P, As, Sb and Bi), also called pnictogens, are rarely used in catalysis due to the difficulties associated in preparing well–structured and stable materials. Here, we report on the synthesis of highly exfoliated, few layer 2D phosphorene and antimonene in zero oxidation state, suspended in an ionic liquid, with the native atoms ready to interact with external reagents while avoiding aerobic or aqueous decomposition pathways, and on their use as efficient catalysts for the alkylation of nucleophiles with esters. The few layer pnictogen material circumvents the extremely harsh reaction conditions associated to previous superacid–catalyzed alkylations, by enabling an alternative mechanism on surface, protected from the water and air by the ionic liquid. These 2D catalysts allow the alkylation of a variety of acid–sensitive organic molecules and giving synthetic relevancy to the use of simple esters as alkylating agents.

### Ab initio electronic structure of liquid water: Molecular dynamics snapshots

[materialscloud:2018.0023] Last version: 10 December 2018

This entry provides the snapshots of liquid water generated with ab initio molecular dynamics using rVV10 density functional at room temperature. Nuclear quantum effects are taken into account through path-integral molecular dynamics simulations.

### Atomic structures of semiconductor surfaces

[materialscloud:2018.0022] Last version: 10 December 2018

This entry includes the surface structures of some prototypical semiconductors obtained via structural optimizations using the PBE density functional. The structures were initially used for benchmarking ionization potentials calculated with hybrid density functionals and GW approximation. Seven semiconductor surfaces are provided in the form of Quantum ESPRESSO input: Si(111), C(111), GaAs(110), GaP(110), ZnSe(110), ZnO(10-10), and TiO2(110).

### Ab initio thermodynamics of liquid and solid water: supplemental materials

[materialscloud:2018.0020] Last version: 04 December 2018

Thermodynamic properties of liquid water as well as hexagonal (Ih) and cubic (Ic) ice are predicted based on density functional theory at the hybrid-functional level, rigorously taking into account quantum nuclear motion, anharmonic fluctuations and proton disorder. This is made possible by combining advanced free energy methods and state-of-the-art machine learning techniques. The ab initio description leads to structural properties in excellent agreement with experiments, and reliable estimates of the melting points of light and heavy water. We observe that nuclear quantum effects contribute a crucial 0.2 meV/H2O to the stability of ice Ih, making it more stable than ice Ic. Our computational approach is general and transferable, providing a comprehensive framework for quantitative predictions of ab initio thermodynamic properties using machine learning potentials as an intermediate step. In this set of supplemental materials, we have included the neural network potential for ...

### Special quasi-random structures for the 6-component high entropy alloys

[materialscloud:2018.0019] Last version: 03 December 2018

We propose a general method to calculate the average misfit volumes of atoms in any random alloy via DFT calculations. The method is validated with an example of a 6-component equi-composition high entropy alloy. The special quasi-random structures (SQSs) used in our work are reported here.

### Evaluating charge equilibration methods to generate electrostatic fields in nanoporous materials

[materialscloud:2018.0017] Last version: 29 November 2018

Charge equilibration (Qeq) methods can estimate the electrostatic potential of molecules and periodic frameworks by assigning point charges to each atom, using only a small fraction of the resources needed to compute density functional (DFT)-derived charges. This makes possible, for example, the computational screening of thousands of microporous structures to assess their performance for the adsorption of polar molecules. Recently, different variants of the original Qeq scheme were proposed to improve the quality of the computed point charges. One focus of this research was to improve the gas adsorption predictions in Metal Organic Frameworks (MOFs), for which many different structures are available. In this work, we review the evolution of the method from the original Qeq scheme, understanding the role of the different modifications on the final output. We evaluated the result of combining different protocols and set of parameters, by comparing the Qeq charges with high quality ...

### Data-driven design and synthesis of metal-organic frameworks for wet flue gas CO2 capture

[materialscloud:2018.0016] Last version: 25 November 2018

In this entry is a database of 324,426 hypothetical Metal-Organic Frameworks (MOFs) that were used in a study to screen potential carbon dioxide scrubbers. Using a method to assemble these materials with topological blueprints, we only selected materials that could be accurately represented with the MEPO-QEq charge generation method. By ensuring that the electrostatic potential is accurately represented in these materials, screening for CO2 adsorption properties would result very few false positives/negatives. The atom-centered charges reported in the CIF file for each MOF were derived from the MEPO-QEq method, which can be found under the '_atom_type_partial_charge' column in each CIF file. The relevant data for each MOF is reported in accompanying .csv files. Post-combustion flue gas was simulated at a temperature of both 298K and 0.15 bar CO2, and 313K and 0.15 bar CO2. Mixture adsorption was simulated with the conditions 298K and 0.15:0.85 CO2/N2 with a total ...

### Hidden Beneath the Surface: Origin of the Observed Enantioselective Adsorption on PdGa(111)

[materialscloud:2018.0018] Last version: 23 November 2018

We provide the input files to reproduce the data presented in the work: Hidden Beneath the Surface: Origin of the Observed Enantioselective Adsorption on PdGa(111) The files are subdivided in directories named after the figures/table of the manuscript A. V. Yakutovich, J. Hoja, D. Passerone, Alexandre Tkatchenko, C. A. Pignedoli J. Am. Chem. Soc., 140, 1401-1408 (2018) DOI: 10.1021/jacs.7b10980 In the work, we unravel the origin of the recently observed striking enantioselectivity of the PdGa(111) surface with respect to the adsorption of a small organic molecule, 9-ethynylphenanthrene, using first-principles calculations. It turns out that the key ingredient to understand the experimental evidence is the appropriate description of van der Waals interactions beyond the widely employed atomic pairwise approximation. A recently developed van der Waals-inclusive density functional method, which encompasses dielectric screening effects, reveals the origin of the experimentally ...

### High-throughput computational screening of nanoporous adsorbents for CO 2 capture from natural gas

[materialscloud:2018.0005] Last version: 14 November 2018

With the growth of natural gas as an energy source, upgrading CO2-contaminated supplies has become increasingly important. Here we develop a single metric that captures how well an adsorbent performs the separation of CH4 and CO2, and we then use this metric to computationally screen tens of thousands of all-silica zeolites. We show that the most important predictors of separation performance are the CO2 heat of adsorption (Qst, CO2) and the CO2 saturation loading capacity. We find that a higher-performing material results when the absolute value of the CH4 heat of adsorption (Qst, CH4) is decreased independently of Qst, CO2, but a correlation that exists between Qst, CH4 and Qst, CO2 in all-silica zeolites leads to incongruity between the objectives of optimizing Qst, CO2 and minimizing Qst, CH4, rendering Qst, CH4 nonpredictive of separation performance. We also conduct a large-scale analysis of ideal adsorbed solution theory (IAST) by comparing results obtained using ...

### Mail-order metal-organic frameworks (MOFs): designing isoreticular MOF-5 analogues comprising commercially available organic molecules

[materialscloud:2018.0007] Last version: 14 November 2018

Metal–organic frameworks (MOFs), a class of porous materials, are of particular interest in gas storage and separation applications due largely to their high internal surface areas and tunable structures. MOF-5 is perhaps the archetypal MOF; in particular, many isoreticular analogues of MOF-5 have been synthesized, comprising alternative dicarboxylic acid ligands. In this contribution we introduce a new set of hypothesized MOF-5 analogues, constructed from commercially available organic molecules. We describe our automated procedure for hypothetical MOF design, comprising selection of appropriate ligands, construction of 3D structure models, and structure relaxation methods. 116 MOF-5 analogues were designed and characterized in terms of geometric properties and simulated methane uptake at conditions relevant to vehicular storage applications. A strength of the presented approach is that all of the hypothesized MOFs are designed to be synthesizable utilizing ligands purchasable online. Version 2 includes the structures in CIF format.

### The Influence of Intrinsic Framework Flexibility on Adsorption in Nanoporous Materials (Data Download)

[materialscloud:2017.0003] Last version: 10 November 2018

Project Abstract: For applications of metal-organic frameworks (MOFs) such as gas storage and separation, flexibility is often seen as a parameter that can tune material performance. In this work we aim to determine the optimal flexibility for the shape selective separation of similarly sized molecules (e.g., Xe/Kr mixtures). To obtain systematic insight into how the flexibility impacts this type of separation we develop a simple analytical model that predicts a material's Henry regime adsorption and selectivity as a function of flexibility. We elucidate the complex dependence of selectivity on a framework's intrinsic flexibility whereby performance is either improved or reduced with increasing flexibility, depending on the material's pore size characteristics. However, the selectivity of a material with the pore size and chemistry that already maximizes selectivity in the rigid approximation is continuously diminished with increasing flexibility, demonstrating that ...

### A Standard Solid State Pseudopotentials (SSSP) library optimized for precision and efficiency (Version 1.1, data download)

[materialscloud:2018.0001] Last version: 08 November 2018

Despite the enormous success and popularity of density functional theory, systematic verification and validation studies are still very limited both in number and scope. Here, we propose a universal standard protocol to verify publicly available pseudopotential libraries, based on several independent criteria including verification against all-electron equations of state and plane-wave convergence tests for phonon frequencies, band structure, cohesive energy and pressure. Adopting these criteria we obtain two optimal pseudopotential sets, namely the Standard Solid State Pseudopotential (SSSP) efficiency and precision libraries, tailored for high-throughput materials screening and high-precision materials modelling. As of today, the SSSP precision library is the most accurate open-source pseudopotential library available. This archive entry contains the database of calculations (phonons, cohesive energy, equation of state, band structure, pressure, etc.) together with the provenance of all data and calculations as stored by AiiDA.

### In Silico Design of 2D and 3D Covalent Organic Frameworks for Methane Storage Applications

[materialscloud:2018.0003] Last version: 05 October 2018

Here we present 69,840 covalent organic frameworks (COFs) assembled in silico from a set of 666 distinct organic linkers into 2D-layered and 3D configurations. We investigate the feasibility of using these frameworks for methane storage by using grand-canonical Monte Carlo (GCMC) simulations to calculate their deliverable capacities (DCs). From these calculations, we predict that the best structure in the database is linker91_C_linker91_C_tbd, a structure composed of carbon-carbon bonded triazine linkers in the tbd topology. This structure has a predicted 65-bar DC of 216 v STP/v, greater than that of the best current methane storage material. We also predict other top performing materials, with 305 structures having DCs of over 190 v STP/v, and 34 of these having DCs of over 200 v STP/v. This archive entry contains the database of assembled COF structures (in CIF file format) together with all of their properties, which can be explored using interactive figures. Among the ...

### Toward GW Calculations on Thousands of Atoms

[materialscloud:2018.0015] Last version: 28 September 2018

We provide the input files needed to reproduce the results of the article Toward GW Calculations on Thousands of Atoms J. Wilhelm, D. Golze, L. Talirz, J. Hutter, C. A. Pignedoli J. Phys. Chem. Lett. 9, 306–312 (2018) DOI:10.1021/acs.jpclett.7b02740 The GW approximation of many-body perturbation theory is an accurate method for computing electron addition and removal energies of molecules and solids. In a canonical implementation, however, its computational cost is in the system size N, which prohibits its application to many systems of interest. We present a full-frequency GW algorithm in a Gaussian-type basis, whose computational cost scales with N2 to N3. The implementation is optimized for massively parallel execution on state-of-the-art supercomputers and is suitable for nanostructures and molecules in the gas, liquid or condensed phase, using either pseudopotentials or all electrons. We validate the accuracy of the algorithm on the GW100 molecular test ...

### The geometric blueprint of perovskites

[materialscloud:2018.0012] Last version: 03 September 2018

Perovskite minerals form an essential component of the Earth’s mantle, and synthetic crystals are ubiquitous in electronics, photonics, and energy technology. The extraordinary chemical diversity of these crystals raises the question of how many and which perovskites are yet to be discovered. Here we show that the “no-rattling” principle postulated by Goldschmidt in 1926, describing the geometric conditions under which a perovskite can form, is much more effective than previously thought and allows us to predict perovskites with a fidelity of 80%. By supplementing this principle with inferential statistics and internet data mining we establish that currently known perovskites are only the tip of the iceberg, and we enumerate 90,000 hitherto-unknown compounds awaiting to be studied. Our results suggest that geometric blueprints may enable the systematic screening of millions of compounds and offer untapped opportunities in structure prediction and materials design.

### Machine learning meets volcano plots: Computational discovery of cross-coupling catalysts

[materialscloud:2018.0014] Last version: 01 August 2018

The application of modern machine learning to challenges in atomistic simulation is gaining attraction. We present new machine learning models that can predict the energy of the oxidative addition process between a transition metal complex and a substrate for C-C cross-coupling reaction. In turn, this quantity can be used as a descriptor to estimate the activity of homogeneous catalysts using molecular volcano plots. The versatility of this approach is illustrated for vast libraries of organometallic catalysts based on Pt, Pd, Ni, Cu, Ag, and Au combined with 91 ligands. Out-of-sample machine learning predictions were made on a total of 18,062 compounds leading to 557 catalyst candidates falling into the ideal thermodynamic window. This number was further refined by searching for candidates with an estimated price lower than 10 US$/mmol. The 37 catalyst finalists are dominated by palladium phosphine ligand combinations but also include earth abundant (Cu) transition metal with ...

### Generating carbon schwarzites via zeolite-templating

[materialscloud:2018.0013] Last version: 31 July 2018

Zeolite-templated carbons (ZTCs) comprise a relatively recent material class synthesized via the chemical vapor deposition of a carbon-containing precursor on a zeolite template, followed by the removal of the template. We have developed a theoretical framework to generate a ZTC model from any given zeolite structure, which we show can successfully predict the structure of known ZTCs. We use our method to generate a library of ZTCs from all known zeolites, to establish criteria for which zeolites can produce experimentally accessible ZTCs, and to identify over 10 ZTCs that have never before been synthesized. We show that ZTCs partition space into two disjoint labyrinths that can be described by a pair of interpenetrating nets. Since such a pair of nets also describes a triply periodic minimal surface (TPMS), our results establish the relationship between ZTCs and schwarzites—carbon materials with negative Gaussian curvature that resemble TPMSs—linking the research topics and ...

### Mapping uncharted territory in ice from zeolite networks to ice structures

[materialscloud:2018.0010] Last version: 19 May 2018

We report a large-scale density-functional-theory study of the configuration space of water ice. We geometry optimise 74,963 ice structures, which are selected and constructed from over five million tetrahedral networks listed in the databases of Treacy and Deem, and the International Zeolite Association database. All prior knowledge of ice is set aside and we introduce generalised convex hulls to identify configurations stabilised by appropriate thermodynamic constraints. We thereby rediscover all known phases (I to XVII, i, 0 and the quartz phase) except the metastable ice IV. Crucially, we also find promising candidates for ices XVIII through LI. Using the sketch-map dimensionality-reduction algorithm we construct an a priori, navigable map of configuration space, which reproduces similarity relations between structures and highlights the novel candidates. By relating the known phases to the tractably small, yet structurally diverse set of synthesisable candidate structures, we ...

### Symmetry-Adapted Machine Learning for Tensorial Properties of Atomistic Systems

[materialscloud:2018.0009] Last version: 19 May 2018

Here we present 1,000 structures each of a water monomer, water dimer, Zundel cation and bulk water used to train tensorial machine-learning models in Phys. Rev. Lett. 120, 036002 (2018). The archive entry contains files in extended-XYZ format including the structures and several tensorial properties: for the monomer, dimer and Zundel cation, the dipole moment, polarizability and first hyperpolarizability are included, and for bulk water the dipole moment, polarizability and dielectric tensor are given.

### In Silico Design of Porous Polymer Networks: High Throughput Screening for Methane Storage Materials

[materialscloud:2018.0008] Last version: 15 May 2018

Porous polymer networks (PPNs) are a class of advanced porous materials that combine the advantages of cheap and stable polymers with the high surface areas and tunable chemistry of metal–organic frameworks. They are of particular interest for gas separation or storage applications, for instance, as methane adsorbents for a vehicular natural gas tank or other portable applications. PPNs are self-assembled from distinct building units; here, we utilize commercially available chemical fragments and two experimentally known synthetic routes to design in silico a large database of synthetically realistic PPN materials. All structures from our database of 18,000 materials have been relaxed with semiempirical electronic structure methods and characterized with Grand-canonical Monte Carlo simulations for methane uptake and deliverable (working) capacity. A number of novel structure–property relationships that govern methane storage performance were identified. The relationships are ...

### Improving the Mechanical Stability of Metal-Organic Frameworks Using Chemical Caryatids

[materialscloud:2018.0004] Last version: 17 April 2018

Metal-organic frameworks (MOFs) have emerged as versatile materials for applications ranging from gas separation and storage, catalysis, and sensing. The attractive feature of MOFs is that by changing the ligand and/or metal, they can be chemically tuned to perform optimally for a given application. In most, if not all, of these applications one also needs a material that has a sufficient mechanical stability, but our understanding of how changes in the chemical structure influence mechanical stability is limited. In this work, we rationalize how the mechanical properties of MOFs are related to framework bonding topology and ligand structure. We illustrate that the functional groups on the organic ligands can either enhance the mechanical stability through formation of a secondary network of non-bonded interactions, or soften the material by destabilizing the bonded network of a MOF. In addition, we show that synergistic effect of the bonding network of the material and the ...

### Adatom-Induced Local Melting

[materialscloud:2018.0002] Last version: 11 February 2018

We introduce and discuss the phenomenon of adatom-induced surface local melting, using extensive first-principles molecular dynamics simulations of Al(100) taken as a paradigmatic case of a non-premelting surface that nevertheless displays facile adatom diffusion with single and multiple exchange pathways. Here, a single adatom deposited on the surface is sufficient to nucleate a localized and diffusing liquid-like region that remains confined to the surface layer, but with an area that increases with temperature; in the absence of the adatom, the surface instead remains crystalline until reaching the bulk melting temperature.

### Isobaric-Isothermal Monte Carlo Simulations of Bulk Liquid Water from MP2 and RPA Theory (MC Trajectories Data Download)

[materialscloud:2017.0007] Last version: 28 November 2017

Methods based on the second order Møller–Plesset perturbation theory (MP2) and the Random Phase Approximation (RPA) have emerged as practicable and reliable approaches to improve the accuracy of density functional approximations for first principle atomistic simulations. Such approaches are in fact capable to account ab-initio for non-local dynamical electron correlation effects, which play a fundamental role, for example, in the description of non-bonded interactions. To assess the performance of MP2 and RPA for real applications, isobaric-isothermal Monte Carlo simulations have been performed to study the structural properties of bulk liquid water under ambient conditions. The choice of bulk liquid water as benchmark system is motivated by the complicated nature of the intermolecular interactions, where repulsion, polarization, hydrogen bonding and van der Waals forces play an important role and are particularly difficult to reproduce accurately in atomistic models. The results ...

### Gaussian Approximation Potentials for iron from extended first-principles database (Data Download)

[materialscloud:2017.0006] Last version: 06 November 2017

Interatomic potentials are often necessary to describe complex realistic systems that would be too costly to study from first-principles. Commonly, interatomic potentials are designed using functional forms driven by physical intuition and fitted to experimental or computational data. The moderate flexibility of these functional forms limits their ability to be systematically improved by increasing the fitting datasets; on the other hand, their qualitative description of the essential physical interactions ensures a modicum degree of transferability. Recently, a novel trend has emerged where potential-energy surfaces are represented by neural networks fitted on large numbers of first-principles calculations, thus maximizing flexibility but requiring extensive datasets to ensure transferability. Gaussian Approximation Potentials in particular are a novel class of potentials based on non-linear, non-parametric Gaussian-process regression. Here we generate a Gaussian Approximation ...

### Accurate Characterization of the Pore Volume in Microporous Crystalline Materials (Data Download)

[materialscloud:2017.0005] Last version: 18 May 2017

Project Abstract: Pore volume is one of the main properties for the characterization of microporous crystals. It is experimentally measurable and it can also be obtained from the refined unit cell by a number of computational techniques. In this work we assess the accuracy and the discrepancies between the different computational methods which are commonly used for this purpose, i.e, geometric, helium and probe center pore volume, by studying a database of more than 5000 frameworks. We developed a new technique to fully characterize the internal void of a microporous material and to compute the probe accessible and occupiable pore volume. We show that unlike the other definitions of pore volume, the occupiable pore volume can be directly related to the experimentally measured pore volumes from nitrogen isotherms.

### Predicting Product Distribution of Propene Dimerization in Nanoporous Materials (Data Download)

[materialscloud:2017.0004] Last version: 05 May 2017

Project abstract: In this work, a theoretical framework is developed to explain and predict changes in the product distribution of the propene dimerization reaction, which yields a mixture of C_{6} olefin isomers, resulting from the use of different porous materials as catalysts. The MOF-74 class of materials has shown promise in catalyzing the dimerization of propene with high selectivity for valuable linear olefin products. We show that experimentally observed changes in the product distribution can be explained in terms of the contribution of the pores to the free energy of formation, which are directly computed using molecular simulation. Our model is used to screen a library of 118 existing and hypothetical MOF and zeolite structures to study how product distribution can be tuned by changing pore size, shape, and composition of porous materials. Using these molecular descriptors, catalyst properties are identified that increase the selective reaction of linear olefin ...

### Barcodes for nanoporous materials

[materialscloud:2017.0001] Last version: 14 March 2017

In most applications of nanoporous materials the pore structure is as important as the chemical composition as a determinant of performance. For example, one can alter performance in applications like carbon capture or methane storage by orders of magnitude by only modifying the pore structure. For these applications it is therefore important to identify the optimal pore geometry and use this information to find similar materials. However, the mathematical language and tools to identify materials with similar pore structures, but different composition, has been lacking. Recently, we developed a pore recognition approach to quantify similarity of pore structures using topological data analysis. Barcodes generated with using this approach allow us to identify materials with similar pore geometries, and to screen for materials that are similar to given top-performing structures. This database has barcodes for zeolites, metal organic frameworks, and zeolitic imidazolate frameworks.