
Study on Projected-Cluster-Based Web Data Mining (基于映射簇的Web数据挖掘研究)
Abstract: Traditional feature selection algorithms tend to break down on high-dimensional Web data because of the inherent sparsity of the data objects. In typical high-dimensional Web data mining applications, different sets of points may cluster better on different subsets of dimensions, and the number of dimensions in each cluster-specific subspace may also vary. It may therefore be impossible to find a single small subset of dimensions that suits all clusters. This paper uses the concept of a projected cluster to make the relation between a cluster and its dimensions explicit, and recasts clustering of high-dimensional data as a projected-cluster problem, which simplifies computation and improves mining efficiency. Finally, a corresponding fast algorithm based on projected clusters is given.
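Authors: Chen Xiaohong (陈晓红), Qin Yang (秦杨)
Affiliation: School of Business, Central South University (中南大学商学院)
Source: Systems Engineering (《系统工程》), indexed in CSCD and the Peking University Core Journals list; 2004, No. 7, pp. 80-83 (4 pages)
Funding: National Science Fund for Distinguished Young Scholars, National Natural Science Foundation of China (grant 70125002)
Keywords: high-dimensional Web data; Web data mining; clustering; projected cluster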
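
The abstract's central idea, that each cluster is best described in its own subset of dimensions, can be illustrated with a minimal Python sketch. This is not the authors' algorithm, which this record does not reproduce. It only assumes one common way of pairing a cluster with its dimensions: pick the dimensions along which the cluster's points have the smallest average spread, then measure distance only in those dimensions. The function names and the sample points are hypothetical.

```python
from statistics import mean


def select_dimensions(points, k):
    """Return the k dimensions along which the points are most tightly packed."""
    n_dims = len(points[0])
    centroid = [mean(p[d] for p in points) for d in range(n_dims)]
    # Average absolute deviation from the centroid, computed per dimension.
    spread = [
        mean(abs(p[d] - centroid[d]) for p in points) for d in range(n_dims)
    ]
    tightest = sorted(range(n_dims), key=lambda d: spread[d])[:k]
    return sorted(tightest)


def segmental_distance(a, b, dims):
    """Manhattan distance restricted to dims, averaged over the number of dims."""
    return sum(abs(a[d] - b[d]) for d in dims) / len(dims)


# Hypothetical 4-dimensional data: each group is tight in a different subspace.
group_a = [(1.0, 1.1, 9.0, 0.2), (1.1, 0.9, 2.0, 7.5), (0.9, 1.0, 5.5, 3.1)]
group_b = [(8.0, 0.5, 4.0, 4.1), (1.5, 6.0, 4.1, 3.9), (4.2, 9.3, 3.9, 4.0)]

dims_a = select_dimensions(group_a, 2)
dims_b = select_dimensions(group_b, 2)
print("group_a dimensions:", dims_a)
print("group_b dimensions:", dims_b)
```

With these sample points the first group is paired with dimensions 0 and 1 and the second with dimensions 2 and 3, so no single small dimension subset serves both groups. Averaging the restricted distance over the number of selected dimensions keeps distances comparable between clusters whose subspaces differ in size.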

