期刊文献+

基于改进XGBoost的地震多属性地质构造识别方法

Seismic Multi-attribute Geological Structure Identification Method Based on Improved XGBoost
下载PDF
导出
摘要 地震属性可以用来解释与预测地质构造,因此地震属性被广泛地运用在煤矿地质构造的识别。但一般情况下,勘探区中无构造区域与有构造区域分布不均衡,无构造区域远远多于有构造区域。机器学习中,传统的分类器更习惯于偏向多数类,这使得如何有效地识别出构造体成为一个难题。为了解决这一问题,提出了一种针对不平衡数据集的改进极限梯度提升(extreme gradient boosting,XGBoost)构造识别方法。该方法的原理是,首先,以基于三维地震勘探成果数据体提取的12种地震属性为数据集特征,以实际揭露后的地质构造为数据集标签构建多属性数据集,然后以特征对标签的相关性为标准,过滤掉冗余的特征;其次,将边界样本分类算法(boundary sample classification,BSC)与合成少数类过采样技术(synthetic minority over-sampling technique,SMOTE)相结合形成BSC-SMOTE算法。用BSC-SMOTE算法对原始数据集进行平衡,再利用平衡后的数据集训练XGBoost分类器,并用贝叶斯优化(Bayesian optimization,BO)对该分类器进行超参数寻优,最后用此分类器预测构造。以山西新元煤矿责任有限公司东翼矿区为研究区域,实验结果显示,改进XGBoost算法模型的预测精确度为0.95,比未改进的XGBoost算法提高了0.16,比K-近邻(K-nearest neighbor,KNN)、随机森林和支持向量机(support vector machine,SVM)等传统算法提高了0.15以上。改进XGBoost模型的预测结果经可视化后与实际揭露构造对比基本吻合,说明该模型能够有效地识别出地质构造体。 Seismic attributes can be used to interpret and predict geological structures,and therefore are widely used in the identification of coal mine geological structures.However,in general,the distribution of regions without structures and regions with structures in the exploration area is unbalanced,with many more regions without structures than with structures.In machine learning,traditional classifiers tend to be biased towards the majority class,making it difficult to effectively identify structures.To solve this problem,an improved extreme gradient boosting(XGBoost) construction recognition method for imbalanced datasets was proposed.Firstly,twelve seismic attributes extracted from a three-dimensional seismic exploration dataset were used as dataset features and actual disclosed geological structures as dataset labels to construct a multi-attribute dataset.Then,redundant features were filtered based on the correlation between features and labels.Next,the boundary sample classification(BSC) algorithm was combined with the synthetic minority over-sampling technique(SMOTE) to form the BSC-SMOTE algorithm.The original dataset was balanced using the BSC-SMOTE algorithm,and the balanced dataset was then used to train the XGBoost classifier.The classifier was further optimized using Bayesian optimization(BO) to search for hyperparameters.Finally,the classifier was used to predict structures.Taking the Dongyi mining area of Shanxi Xinyuan Coal Mine Co.,Ltd.as the research area,the experimental results show that the prediction accuracy of the improved XGBoost algorithm model is 0.95,which is 0.16 higher than the original XGBoost algorithm,and more than 0.15 higher than the traditional algorithms such as KNN,random forest and SVM.The prediction results of the improved XGBoost model are basically consistent with the actual exposed structure after visualization,which shows that the model can effectively identify geological structures.
作者 杨楚龙 王怀秀 刘最亮 YANG Chu-long;WANG Huai-xiu;LIU Zui-liang(School of Electrical and Information Engineering,Beijing University of Civil Engineering and Architecture,Beijing 102616,China;Huayang New Material Technology Group Co.,Ltd.,Yangquan 045000,China)
出处 《科学技术与工程》 北大核心 2023年第29期12442-12450,共9页 Science Technology and Engineering
基金 国家重点研发计划(2018YFC0807806) 北京建筑大学2022年度研究生创新项目(PG2022090)。
关键词 地震属性融合 地质构造识别 不平衡数据 机器学习 seismic attribute fusion geological structure identification unbalanced data machine learning
  • 相关文献

参考文献16

二级参考文献169

共引文献154

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部