Linked Data is known as one of the best solutions for multisource and heterogeneous web data integration and discovery in this era of Big Data.However,data interlinking,which is the most valuable contribution of Linke...Linked Data is known as one of the best solutions for multisource and heterogeneous web data integration and discovery in this era of Big Data.However,data interlinking,which is the most valuable contribution of Linked Data,remains incomplete and inaccurate.This study proposes a multidimensional and quantitative interlinking approach for Linked Data in the geospatial domain.According to the characteristics and roles of geospatial data in data discovery,eight elementary data characteristics are adopted as data interlinking types.These elementary characteristics are further combined to form compound and overall data interlinking types.Each data interlinking type possesses one specific predicate to indicate the actual relationship of Linked Data and uses data similarity to represent the correlation degree quantitatively.Therefore,geospatial data interlinking can be expressed by a directed edge associated with a relation predicate and a similarity value.The approach transforms existing simple and qualitative geospatial data interlinking into complete and quantitative interlinking and promotes the establishment of high-quality and trusted Linked Geospatial Data.The approach is applied to build data intra-links in the Chinese National Earth System Scientific Data Sharing Network(NSTI-GEO)and data-links in NSTI-GEO with the Chinese Meteorological Data Network and National Population and Health Scientific Data Sharing Platform.展开更多
基金Thiswork was supported by the National Natural Science Foundation of China[grant number 41371381],[grant number 41431177]Natural Science Research Program of Jiangsu[grant number 14KJA170001]+4 种基金National Special Program on Basic Works for Science and Technology of China[grant number 2013FY110900]National Key Technology Innovation Project for Water Pollution Control and Remediation[grant number 2013ZX07103006]National Basic Research Program of China[grant number 2015CB954102]GuiZhou Welfare and Basic Geological Research Program of China[grant number 201423]China Scholarship Council[grant number 201504910358].
文摘Linked Data is known as one of the best solutions for multisource and heterogeneous web data integration and discovery in this era of Big Data.However,data interlinking,which is the most valuable contribution of Linked Data,remains incomplete and inaccurate.This study proposes a multidimensional and quantitative interlinking approach for Linked Data in the geospatial domain.According to the characteristics and roles of geospatial data in data discovery,eight elementary data characteristics are adopted as data interlinking types.These elementary characteristics are further combined to form compound and overall data interlinking types.Each data interlinking type possesses one specific predicate to indicate the actual relationship of Linked Data and uses data similarity to represent the correlation degree quantitatively.Therefore,geospatial data interlinking can be expressed by a directed edge associated with a relation predicate and a similarity value.The approach transforms existing simple and qualitative geospatial data interlinking into complete and quantitative interlinking and promotes the establishment of high-quality and trusted Linked Geospatial Data.The approach is applied to build data intra-links in the Chinese National Earth System Scientific Data Sharing Network(NSTI-GEO)and data-links in NSTI-GEO with the Chinese Meteorological Data Network and National Population and Health Scientific Data Sharing Platform.