"Machine Learning for Complex Materials Science" by Guannan Liu

Machine Learning for Complex Materials Science

Date of Award

Spring 2024

Document Type

Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Mechanical Engineering & Materials Science (ENAS)

First Advisor

Schroers, Jan

Abstract

Complex materials science problems, such as metallic glass formation, originate from the collective behavior of many atoms and are by many orders of magnitude too large to be solved by first-principles calculations. The successful application of machine learning (ML) in various fields outside of materials science suggests its potential in addressing such complex problems. To test its efficacy, we attempt to predict bulk metallic glass formation using ML. Surprisingly, we find that a general-material ML model with 201 alloy features, constructed through simple statistical functions from 31 elemental features, is indistinguishable from models that are unphysical or do not consider any features when the prediction accuracy is tested in an interpolation manner. Only when significant separation of training and testing data is carried out does the general-material model perform better in this extrapolation mode than the unphysical or composition-only models, yet significantly worse than a human learning based 3-(alloy)feature model. We attribute the limited performance of the general-material model to its general inability to accurately represent alloy features through elemental features. As the potential data space is too large to determine a representative fraction that would allow ML models to work well even with poorly representative features, complex material science problems like metallic glass formation require physical insights about mixing behavior to develop effective ML models.Another challenge of applying ML to complex materials science problems is that the underlying mechanisms may vary within the considered problem space. To quantify this, we divide alloy data into subgroups and construct ML models to predict metallic glass formation. We discover that subgrouping guided by physical insights into the problem leads to significantly higher prediction accuracy. Specifically, when applying Inoue’s subgrouping which is based on the constituent elements of alloys, models specific to subgroups outperform those trained on the entire dataset. Furthermore, our results highlight the potential of ML strategies to quantitatively assess guiding principles utilized in materials science, allowing for a rigorous evaluation of empirical rules like Inoue's. Additionally, our analysis uncovers distinct mechanisms governing the glass-forming ability (GFA) within different subgroups, shedding light on the diverse nature of this phenomenon. Statistical methods for subgrouping prove less effective compared to physics-informed subgrouping. Lastly, recognizing the limitations such as data representation bias and reliance on binary labeling, we curate a dataset of binary alloy libraries fabricated using combinatorial co-sputtering and characterized through high-throughput strategies to achieve a better representation of the GFA in binary alloys. By employing a continuous labeling of critical cooling rates and physics-informed alloy features, the model displays robust extrapolation capabilities, successfully predicting the GFA of new alloy systems unseen by the model. This thesis demonstrates the significance of physics-informed insight into the problem to be addressed to build effective ML models. Such an integrated approach has the potential to unlock new insights and accelerate materials discovery in a wide range of applications beyond metallic glass formation.

This document is currently not available here.

Share

COinS