期刊文献+

顾及环境相似的多特征组合实体匹配方法 被引量:7

Entity Matching Methods Based on Combining Multi-Similarity-Characteristics Considering Environment Similarity
下载PDF
导出
摘要 实体匹配是指通过相似性指标识别出不同来源地图数据中同名实体并建立实体对应关系的过程,它是地图数据融合与更新中的关键技术和难点之一,匹配效果的好坏直接关系到后续应用的正确性。现有的实体匹配方法大多只利用实体自身的几何或属性特征进行匹配,很少顾及实体周边环境的相似性,使得一些匹配实例不足以得到正确的匹配结果。针对该点不足,首次引入点、线实体环境相似度,并提出顾及环境相似的多特征组合匹配方法,有效地提高了实体匹配精度;通过双向匹配与聚类合并策略,解决了部分线实体一对多、多对多的匹配问题。 The entity′s geometric and attribute characteristics are used in existing matching methods,and the environment similarity of entities is seldom considered,it leads to wrong matching result for some matching instances.According to this deficiency,the environment similarity measures for point and line entity are brought forward,where point entity environment similarity measure is formed as follows:Firstly,2×2 grid around entity is constructed and its environment characteristic is calculated,which assumes that four grids are described respectively by LT,RT,RB and LB,then environment characteristics of point entity can be described by four-unit group(ρLT,ρRT,ρRB,ρLB),ρ denotes point density of a grid region.Secondly,the size of corresponding items are compared in four-unit group for matching entities,if their sizes are equal,the count variable will be added 1,otherwise count variable is unchanged.Lastly,environment similarity measure is equal to 1/4 of count variable.Environment similarity measure of line entity is determined by the similarity degree of adjoining graph structure of line entities,adjoining graph is composed of current entity and all entities that directly connect to current entity.The structure relationship between current entity and it′s adjoining entity is calculated,and the structure relationship can be described with the dual group(distance,angle),where distance denotes the normalized path length from connection point to start point of current line entity,angle denotes normalized azimuth of line segment in adjoining entity which connects to current entity.Then all structure relationships of matching pairs are matched.Last,the environment similarity measure is calculated according to matching result of structure relationships.Based on introducing environment similarity measure,matching methods are put forward based on combining multi-similarity-characteristics considering environment similarity,it improves effectively the entity matching precision,through the matching strategy of bidirectional matching and clustering combination,and resolves some entity matching problems of one-to-many and many-to-many.
作者 吴建华
出处 《地理与地理信息科学》 CSSCI CSCD 北大核心 2010年第4期1-6,共6页 Geography and Geo-Information Science
基金 江西师范大学博士启动基金(2644)
关键词 实体相似度 匹配策略 数据融合 GIS entity similarity matching strategy data fusion GIS
  • 相关文献

参考文献17

  • 1FU Z L,WU J H.Entity matching in vector data[C].21th SPRS2008,2008.1467-1472.
  • 2CATRIEL B,YARON K,ELIYAHU S,et al.Object fusion in geographic information systems[A].Proceedings of the 30th VLDB Conference[C].Toronto,2004.816-827.
  • 3COBB M,CHUNG M,FOL EY H.A rule-based approach for the conflation of attributed vector data[J].Geo Informatica,1998,2(1):7-35.
  • 4HAUNERT J-H.Link based conflation of geographic datasets[A].8th ICA Workshop on Generalisation and Multiple Representation[C].Spain,2005.1-7.
  • 5SAALFELD A.Conflation:Automated map compilation[J].Geographical Information Systems,1988,2(3):217-228.
  • 6SAALFELD A.Automated Map Conflation[D].Washington D C:University of Maryland,1993.1-10.
  • 7WALTER V,FRITSCH D.Matching spatial data sets a statistical approach[J].Geographical Information Science,1999,13(5):445-473.
  • 8SAMAL A,SETH S,CUETO K.A feature-based approach to conflation of geospatial sources[J].Geographical Information Science,2004,18(5):459-489.
  • 9DOYTSHER Y,FILIN S,EZRA E.Transformation of datasets in a linear-based map conflation framework[J].Surveying and Land Information Systems,2001,61(3):165-175.
  • 10ZhangQiaopin,LIDeren,等.Shape Similarity Measures of Linear Entities[J].Geo-Spatial Information Science,2002,5(2):62-67. 被引量:1

二级参考文献64

  • 1郭庆胜,杜晓初,刘浩.空间拓扑关系定量描述与抽象方法研究[J].测绘学报,2005,34(2):123-128. 被引量:32
  • 2[1]COBB M,CHUNG M,FOLEY H.A Rule-based Approach for the Conflation of Attributed Vector Data[J].GeoInformatica,1998,2(1):7-35.
  • 3[2]LYNCH M, SAALFELD A. Conflation:Automated Map Compilation-A Video Game Approach[A].Proceedings of 7th International Symposium on Computer Assisted Cartography(AutoCarto 7)[C].Washington:[s.n.],1985.
  • 4[3]DENERS M N. Fundamentals of Geographic Information Systems[M]. New York:John Wiley &Sons,1996.
  • 5[4]WALTER V, FRITSH D.Matching Spatial Data Sets: A Statical Approach[J].IJGIS,1999,13(5):445-473.
  • 6[5]LEMARIE C, RAYNAL L.Geographic Data Matching:First Investigations for a Generic Tool[A].Proceedings of the GIS/LIS'96 Annual Convention[C].Denver:[s.n.],1996.
  • 7[6]SAALFELD A. Spatial Data Acquisition and Integration[EB/OL]. http://www.ncgia.ucsb.edu/other/ucgis/research_priorities/old3%20%287-07-97%29/acquisition. html,1996.
  • 8[7]GOODCHILD M F.Conflation:Combining Geographical Information[EB/OL]. http:// www.ncgia.ucsb.edu/research/ucgis/proposal.html,1996.
  • 9[8]COWEN D J,SHIRLEY L,JENSEN J.The Update and Maintenance of TIGER Files from Other Data Sources[EB/OL].http://www.geog.buffalo.edu/ucgis/nres_pri/socar_a.html,1996.
  • 10[1]Cobb M.,Chung M.,Foley H.(1998) A rule-based approach for the conflation of attributed vector data.GeoInformatica,2(1):7-35

共引文献200

同被引文献75

引证文献7

二级引证文献59

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部