期刊文献+

关联规则试验数据的人工合成

Synthetic Data Generation on Association Rules
下载PDF
导出
摘要 给出了一种实验人工数据合成的算法。该算法利用多种概率模型模拟事务长度、潜在强项集长度、项集频度等特征数据 ,模拟大型超市的事务数据库 ,生成不同规模、不同特点的数据 ,用以测试算法的时间曲线及可伸缩性。 This paper develops algorithms on synthetic data generation based on the mathematic model presented by IBM Almaden Center.Synthetic data generaion is the test foundation of association rule research.To verify that the new algorithm is advanced to the existings on performances and scalibility,the authors need to compare existing algorithms with new algorithms on various kinds of datasets different in size and potentially strong item size.The research makes it releasable and the result is relatively satisfactory.
出处 《吉林工业大学自然科学学报》 CSCD 2000年第3期47-50,共4页 Natural Science Journal of Jilin University of Technology
基金 国家自然科学基金!资助项目 (6 98730 19)
关键词 数据挖掘 关联规则 人工数据合成 合成算法 data mining association rules synthetic data generation
  • 相关文献

参考文献3

  • 1[1]Agrawal R,Imielinski T,Swami A.Mining association rules between sets of items in large databases[Z].In Proc.ACM SIGMOD,Washington,D.C.,1993.
  • 2[2]Agrawal R,Srikant R.Fast algorithms for mining association rules in large databases[Z].In Proc.20th Int′l Conf.Very Large Data Bases,Santiago,Chile,1994.
  • 3[3]Agrawal R,Imielinski T,Swami A.Database mining:a performance perspective[J].IEEE Trans.Knowledge and Data Eng,1993,12:914~925.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部