New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production

The success of machine learning (ML) techniques implemented in different industries heavily rely on operator expertise and domain knowledge, which is used in manually choosing an algorithm and setting up the specific algorithm parameters for a problem. Due to the manual nature of model selection and...

Full description

Bibliographic Details
Main Authors: Vuong Van Pham, Ebrahim Fathi, Fatemeh Belyadi
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Fuels
Subjects:
Online Access:https://www.mdpi.com/2673-3994/2/3/17
id doaj-b562cc4c76864692877967c894349fad
record_format Article
spelling doaj-b562cc4c76864692877967c894349fad2021-09-26T00:11:33ZengMDPI AGFuels2673-39942021-07-0121728630310.3390/fuels2030017New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas ProductionVuong Van Pham0Ebrahim Fathi1Fatemeh Belyadi2Department of Chemical & Petroleum Engineering, University of Kansas, Lawrence, KS 66045, USAPetroleum and Natural Gas Engineering Department, West Virginia University, Morgantown, WV 26505, USAIntegrated Smart Data Solutions LLC, Morgantown, WV 26505, USAThe success of machine learning (ML) techniques implemented in different industries heavily rely on operator expertise and domain knowledge, which is used in manually choosing an algorithm and setting up the specific algorithm parameters for a problem. Due to the manual nature of model selection and parameter tuning, it is impossible to quantify or evaluate the quality of this manual process, which in turn limits the ability to perform comparison studies between different algorithms. In this study, we propose a new hybrid approach for developing machine learning workflows to help automated algorithm selection and hyperparameter optimization. The proposed approach provides a robust, reproducible, and unbiased workflow that can be quantified and validated using different scoring metrics. We have used the most common workflows implemented in the application of artificial intelligence (AI) and ML in engineering problems including grid/random search, Bayesian search and optimization, genetic programming, and compared that with our new hybrid approach that includes the integration of Tree-based Pipeline Optimization Tool (TPOT) and Bayesian optimization. The performance of each workflow is quantified using different scoring metrics such as Pearson correlation (i.e., R<sup>2</sup> correlation) and Mean Square Error (i.e., MSE). For this purpose, actual field data obtained from 1567 gas wells in Marcellus Shale, with 121 features from reservoir, drilling, completion, stimulation, and operation is tested using different proposed workflows. A proposed new hybrid workflow is then used to evaluate the type well used for evaluation of Marcellus shale gas production. In conclusion, our automated hybrid approach showed significant improvement in comparison to other proposed workflows using both scoring matrices. The new hybrid approach provides a practical tool that supports the automated model and hyperparameter selection, which is tested using real field data that can be implemented in solving different engineering problems using artificial intelligence and machine learning. The new hybrid model is tested in a real field and compared with conventional type wells developed by field engineers. It is found that the type well of the field is very close to P50 predictions of the field, which shows great success in the completion design of the field performed by field engineers. It also shows that the field average production could have been improved by 8% if shorter cluster spacing and higher proppant loading per cluster were used during the frac jobs.https://www.mdpi.com/2673-3994/2/3/17shale reservoirscompletion design optimizationautomated machine learningMarcellus Shale
collection DOAJ
language English
format Article
sources DOAJ
author Vuong Van Pham
Ebrahim Fathi
Fatemeh Belyadi
spellingShingle Vuong Van Pham
Ebrahim Fathi
Fatemeh Belyadi
New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production
Fuels
shale reservoirs
completion design optimization
automated machine learning
Marcellus Shale
author_facet Vuong Van Pham
Ebrahim Fathi
Fatemeh Belyadi
author_sort Vuong Van Pham
title New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production
title_short New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production
title_full New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production
title_fullStr New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production
title_full_unstemmed New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production
title_sort new hybrid approach for developing automated machine learning workflows: a real case application in evaluation of marcellus shale gas production
publisher MDPI AG
series Fuels
issn 2673-3994
publishDate 2021-07-01
description The success of machine learning (ML) techniques implemented in different industries heavily rely on operator expertise and domain knowledge, which is used in manually choosing an algorithm and setting up the specific algorithm parameters for a problem. Due to the manual nature of model selection and parameter tuning, it is impossible to quantify or evaluate the quality of this manual process, which in turn limits the ability to perform comparison studies between different algorithms. In this study, we propose a new hybrid approach for developing machine learning workflows to help automated algorithm selection and hyperparameter optimization. The proposed approach provides a robust, reproducible, and unbiased workflow that can be quantified and validated using different scoring metrics. We have used the most common workflows implemented in the application of artificial intelligence (AI) and ML in engineering problems including grid/random search, Bayesian search and optimization, genetic programming, and compared that with our new hybrid approach that includes the integration of Tree-based Pipeline Optimization Tool (TPOT) and Bayesian optimization. The performance of each workflow is quantified using different scoring metrics such as Pearson correlation (i.e., R<sup>2</sup> correlation) and Mean Square Error (i.e., MSE). For this purpose, actual field data obtained from 1567 gas wells in Marcellus Shale, with 121 features from reservoir, drilling, completion, stimulation, and operation is tested using different proposed workflows. A proposed new hybrid workflow is then used to evaluate the type well used for evaluation of Marcellus shale gas production. In conclusion, our automated hybrid approach showed significant improvement in comparison to other proposed workflows using both scoring matrices. The new hybrid approach provides a practical tool that supports the automated model and hyperparameter selection, which is tested using real field data that can be implemented in solving different engineering problems using artificial intelligence and machine learning. The new hybrid model is tested in a real field and compared with conventional type wells developed by field engineers. It is found that the type well of the field is very close to P50 predictions of the field, which shows great success in the completion design of the field performed by field engineers. It also shows that the field average production could have been improved by 8% if shorter cluster spacing and higher proppant loading per cluster were used during the frac jobs.
topic shale reservoirs
completion design optimization
automated machine learning
Marcellus Shale
url https://www.mdpi.com/2673-3994/2/3/17
work_keys_str_mv AT vuongvanpham newhybridapproachfordevelopingautomatedmachinelearningworkflowsarealcaseapplicationinevaluationofmarcellusshalegasproduction
AT ebrahimfathi newhybridapproachfordevelopingautomatedmachinelearningworkflowsarealcaseapplicationinevaluationofmarcellusshalegasproduction
AT fatemehbelyadi newhybridapproachfordevelopingautomatedmachinelearningworkflowsarealcaseapplicationinevaluationofmarcellusshalegasproduction
_version_ 1717366792226078720