Aluminum alloy composition and property data


Abstract: 

Researchers continue to explore and develop aluminum alloys with new compositions and improved performance characteristics. An understanding of the current design space can help accelerate the discovery of new alloys. We present two datasets: 1) chemical composition, and 2) mechanical properties for predominantly wrought aluminum alloys. The first dataset contains 14,884 entries on aluminum alloy compositions extracted from academic literature and US patents using text processing techniques, including 550 wrought aluminum alloys which are already registered with the Aluminum Association. The second dataset contains 1,278 entries on mechanical properties for aluminum alloys, where each entry is associated with a particular wrought series designation, extracted from tables in academic literature. 

————————————————————————————————————————————————

Data Summary:
- property.csv
	- Each row in the dataset represents an aluminum alloy 
	- Number of attributes: 11 (5 descriptive headers, 4 property-related headers, 2 ‘flag’ headers)

- composition.csv
	- Each row in the dataset represents an aluminum alloy 
	- Number of attributes: 74 (5 descriptive headers, and 69 element headers) 

————————————————————————————————————————————————

Variable Information:
Given is the attribute name, datatype, and a brief description. 


Property Dataset:

‘doi’ — String — Digital Object Identifier of the journal article
  
‘name’ — String — Original table row name

‘series’ — Integer (class) — Aluminum alloy series designation, one of (1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000) Based on ‘AA_des’ when available; when not available, search in abstract for ‘XXXX’ or composition string and assign by most prominent alloying element

‘caption’ — String — Original source table caption

‘AA_des’ — Integer — Aluminum alloy designation code; extracted from original source row name or table caption via text matching where available

‘YS’ — Decimal — Yield strength (MPa)

‘UTS’ — Decimal — Ultimate tensile strength (MPa)

‘temper’ — String — Temper designation of the alloy

‘elong’ — Decimal — Percent elongation

‘flag’ — True/False — Alloy undergoes special processing

‘flag_note’ — String — Reason for flag (e.g., equal angular extrusion) 




Composition Dataset: 

‘source’ — String (class) — The original source of composition information, one of: (named, full text, table, patent)

‘ft_doi_list’ — String (list format) —  List of unique Digital Object Identifiers of the full text ('ft') journal articles associated with the given extracted composition.

‘table_doi’ — String —  Digital Object Identifier of the journal article for compositions extracted from tables 
 
‘name’ — String —  Determined by 'source' as follows:  ('named': Series designation by Al Association (see Ref. [1]), 'full text': N/A,  'table': Original source table row name,  'patent': patent publication number)

‘AA_des’ — Integer — Aluminum alloy series designation, extracted from original source table row name or table caption

<element> — Decimal — Percent weight of this <element> within the Al alloy


————————————————————————————————————————————————


References

[1] The Aluminum Association. International Alloy Designations and Chemical Composition Limits for Wrought Aluminum and Wrought Aluminum Alloys. (2018).