Machine learning and artificial intelligence approaches have rapidly gained popularity for use in many subsurface energy applications.They are seen as novel methods that may enhance existing capabilities,providing for...Machine learning and artificial intelligence approaches have rapidly gained popularity for use in many subsurface energy applications.They are seen as novel methods that may enhance existing capabilities,providing for improved efficiency in exploration and production operations.Furthermore,their inte-gration into reservoir management workflows may shape the future landscape of the energy industry.This study implements a framework that generates predictive models using multiple machine learning classification-based algorithms which can identify specific stratigraphic units(i.e.,formations)as a function of total vertical depth and spatial positioning.The framework is applied in a case study to 13 specific formations of interest(Upper Spraberry through Atoka/Morrow reservoirs)in the Midland Basin,West Texas,United States;a prominent hydrocarbon producing sub-basin of the larger Permian Basin.The study dataset consists of over 275,000 records and includes data fields like formation iden-tifier,true vertical depth(in feet)of formations observed,and latitude and longitude coordinates(in decimal degrees).A subset of 134,374 data records were relevant to the 13 distinct formations of interest and were extracted and used for machine learning model training,validation,and testing.Four super-vised learning approaches including random forest(RF),gradient boosting(GB),support vector machine(SVM),and multilayer perceptron neural network(MLP)were evaluated and their prediction accuracy compared.The best performing model was ultimately built on the RF algorithm and is capable of an overall prediction accuracy of 93 percent on holdout data.The RF-based model demonstrated high prediction accuracy for major oil and gas producing zones including the San Andres,Upper Spraberry,Lower Spraberry,Clearfork,and Wolfcamp at 98,94,89,94,and 94 percent respectively.Overall,the resulting data-driven model provides a robust,cost-effective approach which can complement contemporary reservoir management approaches for multiple subsurface energy applications.展开更多
文摘Machine learning and artificial intelligence approaches have rapidly gained popularity for use in many subsurface energy applications.They are seen as novel methods that may enhance existing capabilities,providing for improved efficiency in exploration and production operations.Furthermore,their inte-gration into reservoir management workflows may shape the future landscape of the energy industry.This study implements a framework that generates predictive models using multiple machine learning classification-based algorithms which can identify specific stratigraphic units(i.e.,formations)as a function of total vertical depth and spatial positioning.The framework is applied in a case study to 13 specific formations of interest(Upper Spraberry through Atoka/Morrow reservoirs)in the Midland Basin,West Texas,United States;a prominent hydrocarbon producing sub-basin of the larger Permian Basin.The study dataset consists of over 275,000 records and includes data fields like formation iden-tifier,true vertical depth(in feet)of formations observed,and latitude and longitude coordinates(in decimal degrees).A subset of 134,374 data records were relevant to the 13 distinct formations of interest and were extracted and used for machine learning model training,validation,and testing.Four super-vised learning approaches including random forest(RF),gradient boosting(GB),support vector machine(SVM),and multilayer perceptron neural network(MLP)were evaluated and their prediction accuracy compared.The best performing model was ultimately built on the RF algorithm and is capable of an overall prediction accuracy of 93 percent on holdout data.The RF-based model demonstrated high prediction accuracy for major oil and gas producing zones including the San Andres,Upper Spraberry,Lower Spraberry,Clearfork,and Wolfcamp at 98,94,89,94,and 94 percent respectively.Overall,the resulting data-driven model provides a robust,cost-effective approach which can complement contemporary reservoir management approaches for multiple subsurface energy applications.