摘要
地质文献资料包含了矿床成因、地质过程、矿产预测等多方面信息,从结构化和非结构化数据中抽取总结矿床特征,构建知识库对于研究和分析成矿规律,进行知识计算具有重要意义。因此,本文总结了钨矿知识库的要素模型,基于潜力评价数据和文献资料构建了钨矿知识库。以钨矿知识库为数据源,基于python的字符串模糊匹配算法实现了数据分类和相似度计算。结果表明该算法可以很好的识别和区分不同预测类型。
Geological literature includes information of genesis,process and prediction of ore deposits.It is important to extract and summarize the characteristics of ore deposits from structured and unstructured data.Therefore,this paper summarizes the feature model of the tungsten ore knowledge base,and constructs the tungsten mineral knowledge base based on the potential evaluation data and literature data.Based on python s string fuzzy matching algorithm,data classification and similarity calculation are realized by using tungsten ore knowledge base as data source.The results show that the algorithm can identify and distinguish different prediction types.
作者
常力恒
朱月琴
汪新庆
张旋
刘雨江
吴硕
CHANG Liheng;ZHU Yueqin;WANG Xinqing;ZHANG Xuan;LIU Yujiang;WU Shuo(Faculty of Earth Resources,China University of Geosciences(Wuhan),Wuhan 430074,China;Key Laboratory of Geological Information Technology,Ministry of Natural Resources,Beijing 100037,China;Development and Research Center,China Geological Survey,Beijing 100037,China;University of Chinese Academy of Sciences,Beijing 100049,China;Beijing Language and Culture University Press,Beijing 100083,China)
出处
《中国矿业》
北大核心
2018年第9期93-96,108,共5页
China Mining Magazine
基金
国土资源部公益性行业科研专项项目资助(编号:201511079)
自然资源部地质信息技术重点实验室开放课题资助(编号:2017020058)
关键词
大数据
知识库
数据分类
字符串模糊匹配
big data
knowledge base
data classification
string fuzzy matching