期刊文献+

基于假设检验的多数据源知识发现研究

Hypothesis Testing Based Knowledge Discovery in Distributed Multi-Data Sources
原文传递
导出
摘要 现行的数据挖掘算法大多是针对单一数据源进行挖掘,多数据源挖掘是网络分布式状况下KDD所面临的新问题,是解决基于全局数据分布状态下知识发现问题的有效技术。本文提出了一种多数据源知识发现新方法,该方法通过共享从其它数据源中发现的知识模式,采用抽样检验的方法来判断知识在本地数据源的有效性,大大提高了知识发现的效率。实验结果表明了该方法的有效性,该方法可以进一步推广,作为对已知模式的高效知识发现方法,并可应用于增量式知识发现。 Nowadays, the techniques of data mining focus on single data source. Mining from multi-data sources is a new problem in Web environment and is also an efficient technique for solving knowledge discovery in distributed databases. A new method for mining multi-data sources is presented in this paper. By sharing knowledge patterns discovered in other similar data sources, hypothesis testing is employed for verifying whether the patterns are also suitable for local data source. The efficiency of KDD can be improved greatly. Finally, the effectiveness of this method is analyzed and experimental result is given. This method can be extended as an efficient data mining algorithm in case of apriori hypothesizes are provided. And it can be also used for incremental data mining.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2005年第5期564-568,共5页 Pattern Recognition and Artificial Intelligence
基金 国家863计划资助项目(No.2003AA118070)
关键词 多数据源 假设检验 知识共享 知识发现 Multi-Data Sources, Hypothesis Testing, Knowledge Sharing, Knowledge Discovery
  • 相关文献

参考文献13

  • 1Kargupta H, Park B H, Hershberger D, Johnson E. Collective Data Mining: A New Perspective towards Distributed Data Mining. In: Kargupta H, Chan P, eds. Advances in Distributed Data Mining. Menlo Park, USA: AAAI Press, 2000, 131-178,216.
  • 2蒋良孝,蔡之华.基于数据仓库的数据挖掘研究[J].计算技术与自动化,2003,22(3):102-105. 被引量:9
  • 3Goulbourne G, Coenen F, Leng P. Algorithms for Computing Assoeiation Rules Using a Partial-Support Tree. Knowledge-Based Systems, 2000, 13(2-3): 141-149.
  • 4唐懿芳,牛力,张师超.多数据源关联规则挖掘算法研究[J].广西师范大学学报(自然科学版),2002,20(4):27-31. 被引量:14
  • 5Wei D, Gagan A. Distributed Data Mining Implementations for a Grid Environment. In: Proc of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid. Berlin,Germany, 2002, 1-2.
  • 6Zhang S C, Wu X D, Zhang C Q. Multi-Database Mining. IEEE Computational Intelligence Bulletin, 2003, 2 (1) : 5 - 13.
  • 7Zhong N, Yao Y Y. Interestingness, Peculiarity, and Multi-Database Mining. In: Proc of the IEEE International Conference on Data Mining. Silicon Valley, USA, 2001, 1-8.
  • 8Zaidi S Z H, Abidi S S R, Manickam S. Distributed Data Mining from Heterogeneous Healthcare Data Repositories: Towards an Intelligent Agent-Based Framework. ln~ Proc of the 15th IEEE Symposium on Computer-Based Medical Systems. Maribor, Slovenia, 2002, 339-342.
  • 9Gorodetsky V, Karsaeyv O, Samollov V. Software Tool for Agent-Based Distributed Data Mining. In: Proc of International Conference on Integration of Knowledge Intensive Multi-Agent Systems. Boston, USA, 2003, 710-715.
  • 10Toivonen H. Sampling Large Databases for Association Rules. In: Proc of the 22nd International Conference on Very Large Data Bases. Bombay, India, 1996, 134-145.

二级参考文献11

  • 1苏毅娟,严小卫.一种改进的频繁集挖掘方法[J].广西师范大学学报(自然科学版),2001,19(3):22-26. 被引量:10
  • 2Park J S,Proc of the Fourth Int’ l Conf on Knowledge Discovery andData Mining,1998年
  • 3Chan P,Ph D dissertation,1996年
  • 4Cheung D W,Proc 1996Int’ l Conf Parallel and Distribut-ed Information Systems,1996年,31页
  • 5Park J,Proc ACM SIGMOD Int Conf on Management of Data,1995年,175页
  • 6Agrawal R,Imielinski T,Swami A. Mining associations between sets of items in large databases[A]. Proceeding of the 1993 ACM-SIGMOD international conference on management of data[C]. Washington:Springer-Verlag,, 1993.207-216.
  • 7Shichao Zhang. Aggregation and maintenance for databases mining,intelligent data analysis:an international journal[J]. Elesvia, 1999,3 (6): 475- 490.
  • 8Zhong N,Yao Y,Ohsuga S. Peculiarity oritented multi-database mining[A]]. Proceedings of PKDD'99[C]. Washington: Springer-Verlag, 1999.251- 254.
  • 9唐懿芳 牛力 严小卫 等.数据挖掘存在问题的探讨[J].计算机应用研究,2002,:60-62.
  • 10樊玮.数据仓库与数据挖掘[J].中国民航学院学报,1999,17(5):51-54. 被引量:6

共引文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部