Study on the Projected Cluster Based Web Data Mining (基于映射簇的Web数据挖掘研究)

Abstract: Traditional feature selection algorithms tend to break down in high-dimensional Web spaces because of the inherent sparsity of the data objects. In typical high-dimensional Web data mining applications, different sets of points may cluster better for different subsets of dimensions, and the number of dimensions in each such cluster-specific subspace may also vary. In fact, it may be impossible to find a single small subset of dimensions that works for all clusters. This paper uses the concept of the projected cluster to make the relationship between a cluster and its dimensions explicit, and recasts clustering in high-dimensional data as a projected cluster problem, which simplifies computation and improves mining efficiency. Finally, a corresponding fast algorithm based on projected clusters is presented.
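The central idea of the abstract, that each cluster is tied to its own subset of dimensions, can be illustrated with a minimal Python sketch. This is not the algorithm developed in the paper, which this record does not reproduce. The function names, the choice of mean absolute deviation as the spread measure, the fixed number of dimensions per cluster, and the toy data are all assumptions made for illustration.

```python
from statistics import mean


def select_cluster_dimensions(points, labels, dims_per_cluster):
    """Hypothetical helper: for each cluster, pick the dimensions along
    which its members are most tightly grouped around the centroid."""
    # Group the points by their (already known) cluster label.
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)

    selected = {}
    for label, members in clusters.items():
        n_dims = len(members[0])
        centroid = [mean(p[d] for p in members) for d in range(n_dims)]
        # Mean absolute deviation from the centroid, per dimension.
        spread = [
            mean(abs(p[d] - centroid[d]) for p in members)
            for d in range(n_dims)
        ]
        # Keep the dimensions with the smallest spread.
        ranked = sorted(range(n_dims), key=lambda d: spread[d])
        selected[label] = sorted(ranked[:dims_per_cluster])
    return selected


def projected_distance(a, b, dims):
    """Manhattan distance restricted to `dims`, averaged over their count."""
    return sum(abs(a[d] - b[d]) for d in dims) / len(dims)


# Toy data (assumed): cluster 0 is tight in dimensions 0 and 1,
# cluster 1 is tight in dimensions 1 and 2.
points = [
    (1.0, 1.1, 9.0), (1.1, 0.9, 2.0), (0.9, 1.0, 5.5),
    (7.0, 4.0, 3.0), (2.0, 4.1, 3.1), (5.0, 3.9, 2.9),
]
labels = [0, 0, 0, 1, 1, 1]

dims = select_cluster_dimensions(points, labels, dims_per_cluster=2)
print(dims)
```

On this toy data the two clusters receive different dimension subsets, so no single subset would describe both. Measuring distance only over a cluster's own dimensions, as `projected_distance` does, keeps the noisy dimensions from dominating the comparison.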

Authors: Chen Xiaohong (陈晓红), Qin Yang (秦杨)

Affiliation: School of Business, Central South University (中南大学商学院)

Source: Systems Engineering (《系统工程》), indexed in CSCD and the Peking University Core Journals list, 2004, No. 7, pp. 80-83 (4 pages)

Funding: National Science Fund for Distinguished Young Scholars, National Natural Science Foundation of China (70125002)

Keywords: High Dimensional Web Data; Web Data Mining; Clustering; Projected Cluster

Classification number: TP311.13 [Automation and Computer Technology: Computer Software and Theory]
