摘要
为了提高经典关联规则Apriori算法的挖掘效率,针对Apriori算法的瓶颈问题,提出了一种链式结构存储频繁项目集并生成最大频繁项目集的关联规则算法。该算法采用比特向量方式存储事务,生成频繁项目集的同时,把包含此频繁项目的事务作为链表连接到频繁项目之后,生成最大频繁项目集。该算法能够减小扫描事物数据库的次数和生成候选项目集的数量,从而减少了生成最大频繁项目集的时间,实验结果表明,该算法提高了运算效率。
In order to improve the Apriori algorithm mining efficiency, a mining algorithm is presented to use a chain structure to store frequent itemsets and generate maximum frequent itemsets for the bottleneck problem of the classic Apriori algorithm. In this algorithm, the transaction is stored with a bit vector, when frequent itemset is generated, the transaction including this frequent itemset as a linked list connect to frequent itemset, at end, it make maximum frequent itemsets. This algorithm reduce the number of scanning transaction database and the amount of generating candidate itemsets, thus reduce time of generated maximum frequent itemsets, experimental results show that this algorithm improve the operation efficiency.
出处
《计算机工程与设计》
CSCD
北大核心
2012年第3期1002-1007,共6页
Computer Engineering and Design