Materials Synthesis Insights from Scientific Literature via Text Extraction and Machine Learning

In the past several years, Materials Genome Initiative (MGI) efforts have produced myriad examples of computationally designed materials in the fields of energy storage, catalysis, thermoelectrics, and hydrogen storage as well as large data resources that are used to screen for potentially transform...

Full description

Bibliographic Details
Main Authors: Kim, Edward (Author), Huang, Kevin Joon-Ming (Author), Saunders, Adam (Author), McCallum, Andrew (Author), Ceder, Gerbrand (Author), Olivetti, Elsa A. (Author)
Other Authors: Massachusetts Institute of Technology. Department of Materials Science and Engineering (Contributor), Massachusetts Institute of Technology. Institute for Data, Systems, and Society (Contributor)
Format: Article
Language:English
Published: American Chemical Society (ACS), 2021-01-22T22:30:19Z.
Subjects:
Online Access:Get fulltext
LEADER 02182 am a22002533u 4500
001 129530
042 |a dc 
100 1 0 |a Kim, Edward  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Materials Science and Engineering  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Institute for Data, Systems, and Society  |e contributor 
700 1 0 |a Huang, Kevin Joon-Ming  |e author 
700 1 0 |a Saunders, Adam  |e author 
700 1 0 |a McCallum, Andrew  |e author 
700 1 0 |a Ceder, Gerbrand  |e author 
700 1 0 |a Olivetti, Elsa A.  |e author 
245 0 0 |a Materials Synthesis Insights from Scientific Literature via Text Extraction and Machine Learning 
260 |b American Chemical Society (ACS),   |c 2021-01-22T22:30:19Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/129530 
520 |a In the past several years, Materials Genome Initiative (MGI) efforts have produced myriad examples of computationally designed materials in the fields of energy storage, catalysis, thermoelectrics, and hydrogen storage as well as large data resources that are used to screen for potentially transformative compounds. The bottleneck in high-Throughput materials design has thus shifted to materials synthesis, which motivates our development of a methodology to automatically compile materials synthesis parameters across tens of thousands of scholarly publications using natural language processing techniques. To demonstrate our framework's capabilities, we examine the synthesis conditions for various metal oxides across more than 12 thousand manuscripts. We then apply machine learning methods to predict the critical parameters needed to synthesize titania nanotubes via hydrothermal methods and verify this result against known mechanisms. Finally, we demonstrate the capacity for transfer learning by using machine learning models to predict synthesis outcomes on materials systems not included in the training set and thereby outperform heuristic strategies. 
520 |a National Science Foundation (Award 1534340) 
520 |a Office of Naval Research (Contract N00014-16-1- 2432) 
546 |a en 
655 7 |a Article 
773 |t Chemistry of Materials