摘要
首先简单介绍了相关规则及其并行开采算法的一些基本情况,然后指出了现有算法在分布式异构数据库中不能有效利用计算资源和造成信息丢失的问题.在证明了一个基本的定理之后,提出了基于HDDMiner模型的异步并行算法,并就其中的一些问题作了说明.最后,介绍了分布式异构数据库中数据开采的并行算法中一些仍需继续研究的问题.
The problem of mining association rules and its relative parallel mining algorithms is presented. The problems that existing algorithms can cause low efficiency and information lost in heterogeneous distributed databases is pointed out. An asynchronous parallel algorithm based on our HDDMiner system for mining association rules in heterogeneous distributed databases is given after a basic theory is proved. Some problems involved are discussed in detail as well. At the end of this paper, several key issues in the research of parallel algorithms in heterogeneous distributed databases are introduced.
出处
《武汉大学学报(自然科学版)》
CSCD
1999年第5期649-653,共5页
Journal of Wuhan University(Natural Science Edition)
基金
湖北省自然科学基金
关键词
数据库
相关规则
分布式
异构数据库
并行开采
knowledge discovery in database
association rule
heterogeneous distributed databases
parallel mining algorithms