摘要
如何找出异构数据库间相同的实体,特别是当现实生活中的同一实体在不同的应用环境中用不同的标识符表示时,如何根据已知描述实体的相同属性的信息,进行实体匹配,解决实体异构问题,是实现数据库间互操作至关重要的因素。针对该问题,文章给出了一种基于属性信息熵的实体匹配方法。具体数据的实验结果显示该方法是很有效的。
One main problem encountered constantly in heterogeneous databases is to identify corresponding entities, which arises when the same real-word entity type is represented using different identifiers in different applications. In order to make interoperability in multiple heterogeneous databases, identifying the heterogeneous entities and resolving entity heterogeneity are critical. The paper proposes an approach for entities matching based on attribute information entropy. The experimental results on real-world data show the proposed approach is very effective.
出处
《计算机工程》
EI
CAS
CSCD
北大核心
2005年第21期31-33,共3页
Computer Engineering
基金
国家自然科学基金资助项目(60073047)
关键词
实体匹配
属性信息熵
实体异构
异构数据库
Entity matching
Attribute information entropy
Entity heterogeneity: Heterogeneous databases