摘要
分析数据演化下的基于相关性聚类的增量实体解析机制。针对增量实体解析过程展开分析和研究,首先探讨静态数据集中的实体解析、然后分析基于相关性聚类的解析机制,最后研究面向数据演化的实体解析过程。基于相关性聚类的增量实体解析技术能很好地运用于频繁更新的数据环境中。仅从聚类技术角度分析了面向数据演化的增量实体解析技术现状,未给出该技术的详细算法描述。有助于较全面系统地理解面向数据演化的实体解析过程及其内在的相关技术难点。
To analyse the mechanism of incremental entity resolution based on correlation clustering on data evolution. This paper analyses and studies the process of entity resolution, first discusses the entity resolution of static data collection, then analyses the mechanism of entity resolution based on Correlation Clustering, finally studies the process of entity evolution on data evolution. Incremental entity resolution based on Correlation Clustering can be used in frequently updated data environment. Only studies the technical situation of incremental entity evolution on data evolution in terms of clustering technique, not gives a detailed arithmetic statements of the technology. This paper will facilitate to give an comprehensive overview of the process of entity resolution on data evolution and its inherent technical difficulties.
出处
《情报学报》
CSSCI
北大核心
2016年第3期326-336,共11页
Journal of the China Society for Scientific and Technical Information
基金
广西文科中心"科学研究工程"项目"基于Web Services的异构平台数据集成模型研究"(项目编号:ZX2011044)的研究成果之一
关键词
数据演化
相关性聚类
增量实体解析
data evolution, correlation clustering, incremental entity resolution