
Sequence clustering based on discovered sequential patterns

Clustering sequences using sequential patterns
Abstract: This paper studies how the results of sequential pattern mining can be used for further knowledge discovery in a sequence database, and proposes SPSC, a method that clusters the data sequences in a database using the sequential patterns already discovered. Using these patterns, the method defines a similarity function between data sequences and the mean of a group of data sequences, so that the classical k-means clustering method can be applied to sequence data and data sequences containing similar patterns are clustered together. Theoretical analysis and experiments indicate that, compared with existing sequence clustering methods, the proposed method produces better clusters and is more efficient.
Source: Journal of Hefei University of Technology: Natural Science, 2008, No. 1, pp. 9-12 (4 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: Natural Science Foundation of Anhui Province (050420207); Hefei University of Technology Research Development Fund (050504F)
Keywords: data mining; sequential pattern; clustering
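
The abstract describes applying k-means to sequence data by defining a similarity function and a cluster mean in terms of discovered sequential patterns, but it does not give either definition. The following is a minimal sketch of that general idea under stated assumptions, not the SPSC method itself: each sequence is a list of single items, a sequence is encoded as the set of patterns it contains as subsequences, similarity is the Jaccard coefficient of those sets, and the cluster "mean" is the set of patterns contained in at least half of the members. The names `contains`, `encode`, `jaccard`, `centroid`, and `cluster` are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Sequence, Tuple

Pattern = Tuple[str, ...]


def contains(seq: Sequence[str], pattern: Pattern) -> bool:
    """True if `pattern` occurs in `seq` as a (possibly non-contiguous) subsequence."""
    it = iter(seq)
    # `item in it` consumes the iterator, so items must appear in order.
    return all(item in it for item in pattern)


def encode(seq: Sequence[str], patterns: List[Pattern]) -> FrozenSet[int]:
    """Indices of the discovered patterns that `seq` contains."""
    return frozenset(i for i, p in enumerate(patterns) if contains(seq, p))


def jaccard(a: FrozenSet[int], b: FrozenSet[int]) -> float:
    """Assumed similarity: shared patterns divided by all patterns in either set."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def centroid(members: List[FrozenSet[int]]) -> FrozenSet[int]:
    """Assumed cluster 'mean': patterns contained in at least half of the members."""
    counts: Dict[int, int] = {}
    for m in members:
        for p in m:
            counts[p] = counts.get(p, 0) + 1
    return frozenset(p for p, c in counts.items() if 2 * c >= len(members))


def cluster(sequences: List[Sequence[str]], patterns: List[Pattern],
            k: int, max_iter: int = 20) -> List[int]:
    """k-means-style loop: assign to the most similar centre, then recompute centres."""
    encoded = [encode(s, patterns) for s in sequences]
    centers = encoded[:k]  # naive initialisation: the first k sequences
    labels = [-1] * len(encoded)
    for _ in range(max_iter):
        new_labels = [max(range(k), key=lambda c: jaccard(e, centers[c]))
                      for e in encoded]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        for c in range(k):
            members = [e for e, lab in zip(encoded, labels) if lab == c]
            if members:  # keep the old centre if a cluster becomes empty
                centers[c] = centroid(members)
    return labels


patterns = [("a", "b"), ("b", "c"), ("x", "y"), ("y", "z")]
sequences = [
    ["a", "b", "c"],
    ["x", "y", "z"],
    ["a", "d", "b", "c"],
    ["x", "q", "y", "z"],
    ["a", "b"],
]
labels = cluster(sequences, patterns, k=2)
```

In this toy input the sequences built around a, b, c end up in one cluster and those built around x, y, z in the other. Taking the first k sequences as initial centres keeps the sketch deterministic; a real run would need a more careful initialisation, and sequences of itemsets would need a different containment test.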

