How to improve machine learning models for lithofacies identification by practical and novel ensemble strategy and principles

下载PDF

导出

摘要 Typically, relationship between well logs and lithofacies is complex, which leads to low accuracy of lithofacies identification. Machine learning (ML) methods are often applied to identify lithofacies using logs labelled by rock cores. However, these methods have accuracy limits to some extent. To further improve their accuracies, practical and novel ensemble learning strategy and principles are proposed in this work, which allows geologists not familiar with ML to establish a good ML lithofacies identification model and help geologists familiar with ML further improve accuracy of lithofacies identification. The ensemble learning strategy combines ML methods as sub-classifiers to generate a comprehensive lithofacies identification model, which aims to reduce the variance errors in prediction. Each sub-classifier is trained by randomly sampled labelled data with random features. The novelty of this work lies in the ensemble principles making sub-classifiers just overfitting by algorithm parameter setting and sub-dataset sampling. The principles can help reduce the bias errors in the prediction. Two issues are discussed, videlicet (1) whether only a relatively simple single-classifier method can be as sub-classifiers and how to select proper ML methods as sub-classifiers;(2) whether different kinds of ML methods can be combined as sub-classifiers. If yes, how to determine a proper combination. In order to test the effectiveness of the ensemble strategy and principles for lithofacies identification, different kinds of machine learning algorithms are selected as sub-classifiers, including regular classifiers (LDA, NB, KNN, ID3 tree and CART), kernel method (SVM), and ensemble learning algorithms (RF, AdaBoost, XGBoost and LightGBM). In this work, the experiments used a published dataset of lithofacies from Daniudi gas field (DGF) in Ordes Basin, China. Based on a series of comparisons between ML algorithms and their corresponding ensemble models using the ensemble strategy and principles, conclusions are drawn: (1) not only decision tree but also other single-classifiers and ensemble-learning-classifiers can be used as sub-classifiers of homogeneous ensemble learning and the ensemble can improve the accuracy of the original classifiers;(2) the ensemble principles for the introduced homogeneous and heterogeneous ensemble strategy are effective in promoting ML in lithofacies identification;(3) in practice, heterogeneous ensemble is more suitable for building a more powerful lithofacies identification model, though it is complex.

作者 Shao-Qun Dong Yan-Ming Sun Tao Xu Lian-Bo Zeng Xiang-Yi Du Xu Yang Yu Liang

机构地区 State Key Laboratory of Petroleum Resources and Prospecting College of Science College of Geoscience

出处《Petroleum Science》 SCIE EI CAS CSCD 2023年第2期733-752,共20页 石油科学（英文版）

基金 financially supported by the National Natural Science Foundation of China(Grant No.42002134) China Postdoctoral Science Foundation(Grant No.2021T140735) Science Foundation of China University of Petroleum,Beijing(Grant Nos.2462020XKJS02 and 2462020YXZZ004).

关键词 Lithofacies identification Machine learning Ensemble learning strategy Ensemble principle Homogeneous ensemble Heterogeneous ensemble

分类号 P631.4 [天文地球—地质矿产勘探]

引文网络
相关文献

1Longwei Qiu,Shengchao Yang,Changsheng Qu,Ningning Xu,Qingsong Gao,Xiangjin Zhang,Xugang Liu,Donghui Wang.A Comprehensive Porosity Prediction Model for the Upper Paleozoic Tight Sandstone Reservoir in the Daniudi Gas Field, Ordos Basin[J].Journal of Earth Science,2017,28(6):1086-1094. 被引量：4
2杜明婧,孙宝军,凯歌.Adaptive multi-step piecewise interpolation reproducing kernel method for solving the nonlinear time-fractional partial differential equation arising from financial economics[J].Chinese Physics B,2023,32(3):53-57. 被引量：1
3Muhammad Zia Ur Rehman,Jawad Ahmad,Emad Sami Jaha,Abdullah Marish Ali,Mohammed A.Alzain,Faisal Saeed.An Efficient Automated Technique for Classification of Breast Cancer Using Deep Ensemble Model[J].Computer Systems Science & Engineering,2023,46(7):897-911.
4Xinjie Xiao,Yuanhong Ren,Zhiwei Li,Nannan Zhang,Wuneng Zhou.Self‑supervised zero‑shot dehazing network based on dark channel prior[J].Frontiers of Optoelectronics,2023,16(1):95-108.
5Ling-Si KONG,Xian-Chun TAN,Bai-He GU,Hong-Shuo YAN.Significance of achieving carbon neutrality by 2060 on China's energy transition pathway: A multi-model comparison analysis[J].Advances in Climate Change Research,2023,14(1):32-42. 被引量：5
6Niharika Gupta,Baijnath Kaushik,Mohammad Khalid Imam Rahmani,Saima Anwar Lashari.Performance Evaluation of Deep Dense Layer Neural Network for Diabetes Prediction[J].Computers, Materials & Continua,2023(7):347-366.
7Zhedong Xu,Yongbo Su,Fang Yang,Ming Zhang.A Whale Optimization Algorithm with Distributed Collaboration and Reverse Learning Ability[J].Computers, Materials & Continua,2023(6):5965-5986. 被引量：2
8Huaxiang Song.FST-EfficientNetV2:Exceptional Image Classification for Remote Sensing[J].Computer Systems Science & Engineering,2023,46(9):3959-3978.
9本刊编辑部.英语教学术语选摘(137)[J].基础教育外语教学研究,2023(6):80-84.
10G.Anurekha,P.Geetha.An Intelligent Hybrid Ensemble Gene Selection Model for Autism Using DNN[J].Intelligent Automation & Soft Computing,2023(3):3049-3064.

Petroleum Science

2023年第2期

浏览历史

内容加载中请稍等...

How to improve machine learning models for lithofacies identification by practical and novel ensemble strategy and principles

相关作者

相关机构

相关主题

浏览历史