期刊文献+

大数据与数学地球科学研究进展——大数据与数学地球科学专题代序 被引量:84

Advances and prospects of big data and mathematical geoscience
下载PDF
导出
摘要 大数据与数学地球科学的核心应用技术包括高维数据降维、图像数据处理、无限数据流挖掘、机器学习、关联规则算法与推荐系统算法等。人工智能地质学,包括大数据-智能矿床成因模型与找矿模型的构建,是具有重要价值的研究方向。高维数据降维旨在从初始高维特征集合中选出低维特征集合,有效地消除无关和冗余特征,增强学习结果的易理解性。哈希算法、聚类分析、主成分分析等是较常用的数学降维工具。机器学习是人工智能的核心,是使计算机具有智能的根本途径。机器学习与人工智能各种基础问题的统一性观点正在形成。深度学习的训练模型往往需要海量数据作为支撑,因此迁移学习方法日益受到重视。图像模式识别是大数据挖掘的重要技术。网络中的社区结构识别对理解整个网络的结构和功能有重要价值,可帮助分析、预测网络各元素间的交互关系。沉浸式虚拟现实技术是实现大数据可视化的重要方向,对具有多元、异构、时空性、非线性、多尺度地质矿产勘查数据的展示要求有特别的价值。引入VR技术进行矿产地质大数据的可视化,可实现大数据时代矿产勘查数据的新认知。无限数据流在地质、地球化学、地球物理监测中大量存在,甚至可以持续自动产生。对数据流数据的计算包括对点查询、范围查询、内积查询、分位数计算、频繁项计算等。关联规则和推荐系统算法是大数据挖掘中的重要算法,其应用范围越来越广泛。贝叶斯原理在大数据时代有独特的价值,贝叶斯网络是成因建模的一个革命性工具。智能地质学研究刚刚起步,构建大数据-智能矿床成因模型与找矿模型是智能地质学研究的重要内容。矿床模型研究方式的变革,将出现于互联网、云计算技术环境下全球各地的矿床研究团队的共同参与。 Dimensionality reduction, graph data processing, stream data mining, machine learning, association rule algorithm and recommendation system are included in the core technologies of big data and mathematical geoscience. Intelligent geology, including construction of big data-based intelligent metallogenetic and prospecting models, is a highly valuable research direction. Dimensionality reduction aims at extracting low dimensional feature sets out of initial high dimensional feature ones, which can effectively eliminate irrelevant and redundant features, and enhancing the comprehensibility of learning results. Hash algorithm, clustering and PCA are frequently used as tool of dimensionality reduction. Machine learning is the core of artificial intelligence and the fundamental way to endow computer with intelligence. Unity for machine learning and artificial intelligence is emerging. The training model of deep learning often needs huge amounts of data, leading to the raising attention of transfer learning. Graph pattern recognition is an important technology of data mining. Community structure identification has great value to understand the structure and function of the entire network. It can help analyze and predict the interaction between different elements in the network. Immersive virtual reality (VR) technology is another important direction to achieve the visualization of big data. It is of special value in demonstrating mineral resource exploration data characterized by multivariate, heterogeneous, time-spatial, nonlinear, and multi-scale features. Utilizing VR technology to visualize geology and mineral data can result in new insight into mineral exploration under the background of big data era. Infinite data streams widely exist, and even may be automatically and continuously generated in many geological, geochemical, and geophysical monitoring projects. Point query, range query, inner product query, quantile calculation, frequent item-set computing and the like are included in data stream mining. Association rules and recommendation systems, as essential algorithms in data mining, are seeing an expanding application scope. Bayes' theorem has unique value in the era of big data. The Bayesian Network is a revolutionary tool for genesis modelling. Intelligent Geology (IG) is still at its primary stage. The construction of big data-based intelligent metallogenetic and mineral prospecting models is part of IG. The revolution of research mode of the metallogenetic and mineral prospecting model will emerge with the worldwide participation of teams together with the help of internet and cloud computing technologies.
出处 《岩石学报》 SCIE EI CAS CSCD 北大核心 2018年第2期255-263,共9页 Acta Petrologica Sinica
基金 国家重点研发计划项目(2016YFC0600506) 国家自然科学基金项目(41273040) 中国地质调查局项目(12120113067600) 高校基本科研业务费中山大学科研助手资助计划联合资助
关键词 大数据挖掘 高维数据降维 图像数据处理 无限数据流挖掘 机器学习 关联规则 人工智能地质学 智能矿床模型 贝叶斯网络 Big Data Mining Dimensionality Reduction Graph Data Processing Infinite Data Stream Machine Learning Association Rule Intelligent Geology Artificial Intelligent metallogenetic Model The Bayesian Network
  • 相关文献

参考文献17

二级参考文献263

共引文献486

同被引文献1184

引证文献84

二级引证文献467

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部