摘要
湿地作为地表覆盖类型的一种,对于生物多样性与气候变化有着重要的意义,也是人类的基本生存环境之一。为更好地理解和表达湿地知识及分类间关系,本文提出了一种基于本体的湿地知识图谱构建方法。首先,利用GlobeLand30数据、生态地理分区数据,围绕湿地类型、特征分布等要素分析湿地领域知识,提取知识间的语义关系,通过本体建模形成湿地知识图谱的概念框架;其次,融合百度百科数据等进行湿地实体的提取、属性信息抽取,丰富湿地知识图谱的数据层;最后,使用图数据库Neo4j存储实体关系和实体属性,实现了湿地知识图谱构建。本文构建的知识图谱扩充了湿地实体的概念描述信息,探索了顾及时空特征的湿地知识表示方法,为地表覆盖领域的知识图谱构建提供了一个应用范例。
Wetland is of great significance to biodiversity and climate change,and it is also one of the basic living environments of human beings.In order to better understand and express wetland knowledge and the relationship between classifications,this paper proposes an ontology-based wetland knowledge graph construction method.Based on the land cover classification system of GlobeLand 30,this paper establishes the conceptual structure of wetland data and the rich semantic relationship between the elements around wetland type definition,spatial pattern,case distribution,and trend change.Firstly,based on the prior knowledge of wetlands,taking the wetland types in the GlobeLand 30 classification system as an example,we analyze the wetland domain knowledge around the wetland types,feature distribution,and other elements,extract the semantic relationship between knowledge,and construct the ontology database of wetland knowledge by combining topdown and bottom-up methods.The conceptual framework of wetland knowledge graph is formed through ontology modeling.Secondly,based on the wetland knowledge automatically extracted from the technical specification text and encyclopedia website,the extracted conceptual knowledge is stored in the model layer,and the data layer is constructed from bottom to top.The main contents include knowledge acquisition and knowledge fusion.According to the concepts contained in wetland knowledge,the relationship extraction of wetland knowledge is carried out,mainly including attribute relationship,spatial relationship,and temporal relationship.Using the wetland directory crawled from the wetland China website,the wetland entity name and knowledge are directly extracted from Baidu Encyclopedia by means of web crawler to form a triple.Finally,Through the above construction processes of wetland knowledge graph,the wetland related data with different structures are transformed into structured knowledge triple data,and the graph database Neo4j is used for semantic relationship storage with the"node relationship"storage model.Knowledge graph provides a new idea for the study of rich knowledge representation and storage in the field of land cover.It is a bridge between the basic geographic data of surface coverage and spatial knowledge service.It is of great significance to promote the sharing and reasoning analysis of surface coverage data.Taking the wetland land cover type as the research example,the knowledge graph constructed in this paper expands the conceptual description information of wetland entities,explores the wetland knowledge representation method by considering the temporal and spatial characteristics,and provides a new perspective and application demonstration for the expression of land cover knowledge.
作者
杨玉莹
赵学胜
刘会园
彭舒
吕源鑫
YANG Yuying;ZHAO Xuesheng;LIU Huiyuan;PENG Shu;LV Yuanxin(College of Geosciences and Surveying Engineering,China University of Mining and Technology(Beijing),Beijing 100083,China;National Geomatics Center of China,Beijing 100830,China)
出处
《地球信息科学学报》
EI
CSCD
北大核心
2023年第6期1240-1251,共12页
Journal of Geo-information Science
基金
国家自然科学基金项目(41631178、41930650)。
关键词
本体
地表覆盖
湿地
知识图谱
知识融合
百度百科
地理实体
图数据库
ontology
land cover
wetland
knowledge graph
knowledge fusion
Baidu encyclopedia
geographical entities
graph database