Machine learning for material property prediction
Submission navigation links for Project
Submission information
Submission Number: 17
Submission ID: 34
Submission UUID: ff4a13e3-3528-40bd-931f-af61c546c22a
Submission URI: /form/project
Created: Tue, 09/03/2019 - 13:23
Completed: Tue, 09/03/2019 - 13:26
Changed: Thu, 04/28/2022 - 13:11
Remote IP address: 130.215.55.243
Submitted by: Northeast Cyberteam
Language: English
Is draft: No
Webform: Project
Machine learning for material property prediction

Complete
Project Leader
Project Personnel
Project Information
Density Functional Theory based methods for calculating material properties from first principles require large computation facilities and significant computation time. This project aims to develop novel machine learning models and workflows in order to better predict material properties in a fraction of the computation the time that current techniques require.
---
Traditionally, the physical properties of solids are calculated from solving the many-body Schrödinger Equation, using atomic compositions as inputs. Such methods are often computationally demanding and the data generated in this way are disconnected from each other. Although massive amounts of physical property data of materials have been accumulated, the calculation of properties for a new compound does not benefit from them.
This CITeam project is aimed at developing machine learning models, software, and workflows that can be used in the development of a material descriptor, which can be used to accurately predict the properties of novel materials. The development of such software is a key first step in high-throughput discovery of novel energy materials.
This research integrates theory, computation, and data mining. The results and machine learning workflows generated under this project will benefit researchers in multiple disciplines such as physics, chemistry, material science, and computational science.
This project will require significant computation time and hardware resources and thus benefit greatly from having access to the Umaine super computer.
---
Traditionally, the physical properties of solids are calculated from solving the many-body Schrödinger Equation, using atomic compositions as inputs. Such methods are often computationally demanding and the data generated in this way are disconnected from each other. Although massive amounts of physical property data of materials have been accumulated, the calculation of properties for a new compound does not benefit from them.
This CITeam project is aimed at developing machine learning models, software, and workflows that can be used in the development of a material descriptor, which can be used to accurately predict the properties of novel materials. The development of such software is a key first step in high-throughput discovery of novel energy materials.
This research integrates theory, computation, and data mining. The results and machine learning workflows generated under this project will benefit researchers in multiple disciplines such as physics, chemistry, material science, and computational science.
This project will require significant computation time and hardware resources and thus benefit greatly from having access to the Umaine super computer.
Project Information Subsection
1) Software:
a) A main script from which specified ML models and material descriptors can be deployed on the computing clster. This script will parallelize the training of the ML models and marshal the necessary compute resources.
b) Individual functions/scripts/classes implementing each ML model
c) Scripts to parse material structure files and produce material descriptors
d) Scripts to compare ML model and descriptor pairs and analyze their efficacy
2) A report evaluating the efficacy of a variety of machine learning models and machine descriptors in predicting new material properties.
a) A main script from which specified ML models and material descriptors can be deployed on the computing clster. This script will parallelize the training of the ML models and marshal the necessary compute resources.
b) Individual functions/scripts/classes implementing each ML model
c) Scripts to parse material structure files and produce material descriptors
d) Scripts to compare ML model and descriptor pairs and analyze their efficacy
2) A report evaluating the efficacy of a variety of machine learning models and machine descriptors in predicting new material properties.
{Empty}
Graduate student studying physics with further background in computer science, machine learning, and high performance computing. (Michael Butler, UMaine Orono)
{Empty}
Practical applications
{Empty}
University of Maine Orono
105 Bennett Hall
Orono, Maine. 04469
Orono, Maine. 04469
NE-University of Maine
01/01/2019
No
Already behind3Start date is flexible
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
A module on implementing machine learning models on a HPC cluster.
The discovery of a novel, improved material descriptor or machine learning model would have a very high probability of being published.
The student will learn how to build and efficiently train large, robust machine learning models in a distributed HPC environment.
{Empty}
The Cyberteam will learn how to better support a distributed machine learning environment so that future researchers will have an easier time writing and deploying such software.
Training models will require tens - hundreds of hours of HPC time. Further validation of those models should take a comparable amount of time to that of training. CUDA capable nodes would be helpful, but are not necessary.
{Empty}
Final Report
This project helped make advancements in predicting material properties using neural networks, impacting physics
The project helped make advancements in computer science by utilizing neural networks in a novel way.
{Empty}
The faculty researcher formed a human regional network through the ERN as a result of this work.
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}