期刊文献+

Apriori-sort算法研究 被引量:4

Apriori-sort algorithm
下载PDF
导出
摘要 Apriori算法虽然在候选集的产生时利用了剪支技术,但每次扫描数据库时都必须扫描整个数据库,因此扫描的数据量大,速度较慢。Apriori-sort算法是在Apriori算法基础上的改进,基本思想是把事务数据库变为以度表示的事务度数据库,并对事务度数据库进行排序。Apriori-sort算法查找频繁项集时,只扫描数据库Dd中满足d(Ck)芨d(Ti)的事务。对扫描数据库进行了有效剪支,因此Apriori-sort算法的计算效率高。并用仿真数据对Apriori-sort算法和Apriori算法进行了仿真对比实验,实验结果证明了新算法的高效性。 Although Apriori algorithm uses cut-technology when it generate item sets of candidates,it has to scan the entire database while scanning the transaction database each time.The scanning speed is very slow for its large amount of data.The Apriori-sort algorithm is improved from the Apriori algorithm.Its basic idea is to transform a transaction database into a transaction degree database,and sort the transaction-degree database.When the Apriori-sort algorithm searches the frequent item sets,it only scans the transactions in the database Dd that appeases d(Ck)≤d(Ti).h scans the database with an effective cut.There- fore the Apriori-sort algorithm's computing time is very fast.The results of simulation experiments for the Apriori-sort algorithm and the Apriori algorithm show the new algorithm's efficiency.
出处 《计算机工程与应用》 CSCD 北大核心 2008年第36期156-159,共4页 Computer Engineering and Applications
基金 西南大学荣昌校区科研项目(No.2007S10)
关键词 排序 Aprior 关联规则 购物篮分析 数据库知识发现 sort Apriori association rule market basket analysis Knowledge Discovery in Database(KDD)
  • 相关文献

参考文献8

二级参考文献26

  • 1王创新.关联规则提取中对Apriori算法的一种改进[J].计算机工程与应用,2004,40(34):183-185. 被引量:32
  • 2李清峰,杨路明,张晓峰,龙艳军.数据挖掘中关联规则的一种高效Apriori算法[J].计算机应用与软件,2004,21(12):84-86. 被引量:29
  • 3徐章艳,刘美玲,张师超,卢景丽,区玉明.Apriori算法的三种优化方法[J].计算机工程与应用,2004,40(36):190-192. 被引量:71
  • 4耿素云 等.离散数学[M].清华大学出版社,2001-03..
  • 5Agarwal R, Imielinski T, Swami A. Mining Associaiton Rules Between Sots of Items in Large Databttscs [A]. In: Proceeding of 1993 SIGMOD International Conterence on Management of Data [C].New York: ACM Press, 1993:207-216.
  • 6Park J S, Chen M S, Yu P S. Using a Hash-based Method with Transaction Trimming for Mining Associations Rules[J]. IEEE Transactions on Knowledge and Data Engineering, 1997, (9):813-825.
  • 7Agarwal R, Agarwal C. A Tree Projection Algorithm for Generation of Frequent Itemsets[J]. Journal of Parallel and Distributed Compuling,2001, (Special Issue on High Performance Data Mining): 1-23.
  • 8Han Jiawei, Pei Ham Yin Yiwen. Mining Frequent Patterns Without Candidate Generation[A]. In: Proceeding of 2000 ACM SIGMOID International Conference on Management of Data [C]. New York:ACM Press, 2000: 1-12.
  • 9HanJiawei KamberM.Data Mining:Concepts and Techniques[M].北京:高等教育出版社,2001..
  • 10Berry M, Linoff G. Data Mining Techniques[M]. John Wiler, 1997.

共引文献66

同被引文献48

引证文献4

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部