一种基于网格索引的数据聚类算法被引量：1

Data clustering algorithm based on index of gridding

下载PDF

导出

摘要为了提高基于密度聚类算法的效率,避免算法在执行过程中的多余搜索,提出了一种基于DBSCAN算法的改进的空间数据聚类算法。该算法采用对象邻域空间进行划分的方法,将网格索引结构应用于该算法。在核心对象的邻域内选择八个方向上未标记且距离核心对象最边缘的对象来扩展种子对象,减少查询次数,降低聚类的时间复杂度。在实验中,利用海量数据集对算法进行测试,测试结果证明新算法在保证聚类精度的情况下时间效率显著高于DBSCAN算法。 In order to improve the efficiency of clustering algorithm based on density and avoid redundant search in processing, the paper puts forward an improved spatial data clustering algorithm based on DBSCAN.The algorithm uses the method of object＇s neighborhood-spatial segmentation,and makes use of index of gridding structure.In core points＇ neighborhood,the objects without mark which lie in eight aspects and have the biggest distance from core objects are chose to expand seed objects.In the case,the times of query is decreased,and the time complexity of clustering is reduced.In experiment,mass data is used to test the algorithm, which proves that the new algorithm＇s time efficiency is much better than DBSCAN in the same clustering precision.

作者李筠宋凯姜学军

机构地区沈阳理工大学信息科学与工程学院

出处《计算机工程与应用》 CSCD 北大核心 2008年第16期139-141,共3页 Computer Engineering and Applications

基金国家高技术研究发展计划(863)(the National High-Tech Research and Development Plan of China under Grant No.2003AA41250) 辽宁省教育厅A类基金(No.20243303)

关键词 DBSCAN 网格索引空间数据聚类 Density Based Spatial Clustering of Application with Noise（DBSCAN） index of gridding spatial data clustering

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献5

1Ester M,Kriegel H P,Sander J,et al.A Density-based algorithm for discovering clusters in large spatial databases with noise[C]// Proe 2nd Int Conf on Knowledge Discovery and Data Mining. Portland, Oregon : AAAI Press, 1996:226-231.
2马帅,王腾蛟,唐世渭,杨冬青,高军.一种基于参考点和密度的快速聚类算法[J].软件学报,2003,14(6):1089-1095. 被引量：108
3Chen M S,Han J,Yu P S.Data mining:an overview from a database perspective[J].IEEE Transactions on Knowledge and Data Engineering, 1996,8(6) : 866-883.
4蔡伟杰,张晓辉,朱建秋,朱扬勇.关联规则挖掘综述[J].计算机工程,2001,27(5):31-33. 被引量：134
5刘红岩,陈剑,陈国青.数据挖掘中的数据分类算法综述[J].清华大学学报（自然科学版）,2002,42(6):727-730. 被引量：168

二级参考文献11

1刘红岩.可扩展的快速分类算法的研究与实现[M].北京:清华大学出版社,2000..
2Han JW, Kambr M. Data Mining Concepts and Techniques. Beijing: Higher Education Press, 2001. 145-176.
3Kaufan L, Rousseeuw PJ. Finding Groups in Data: an Introduction to Cluster Analysis. New York: John Wiley & Sons, 1990.
4Ester M, Kriegel HP, Sander J, Xu X. A density based algorithm for discovering clusters in large spatial databases with noise. In:Simoudis E, Han JW, Fayyad UM, eds. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining.Portland: AAAI Press, 1996. 226-231.
5Guha S, Rastogi R, Shim K. CURE: an efficient clustering algorithm for large databases. In: Haas LM, Tiwary A, eds. Proceedings of the ACM SIGMOD International Conference on Management of Data. Seattle: ACM Press, 1998. "73-84.
6Agrawal R, Gehrke J, Gunopolos D, Raghavan P. Automatic subspace clustering of high dimensional data for data mining application. In: Haas LM, Tiwary A, eds. Proceedings of the ACM SIGMOD International Conference on Management of Data.Seattle: ACM Press, 1998.94-105.
7Alexandros N, Yannis T,Yannis M. C^2P: clustering based on closest pairs. In: Apers PMG, Atzeni P, Ceri S, Paraboschi S,Ramamohanarao K, Snodgrass RT, eds. Proceedings of the 27th International Conference on Very Large Data Bases. Roma:Morgan Kaufmann Publishers, 2001. 331-340.
8Berchtold S, Bohm C, Kriegel H-P. The pyramid-technique: towards breaking the curse of dimensionality. In: Haas LM, Tiwary A,eds. Proceedings of the ACM SIGMOD International Conference on Management of Data. Seattle: ACM Press, 1998. 142- 153.
9Yu C, Ooi BC, Tan K-L, Jagadish HV. Indexing the distance: an efficient method to KNN processing. In: Apers PMG, Atzeni P,Ceri S, Paraboschi S, Ramamohanarao K, Snodgrass RT, eds. Proceedings of the 27th International Conference on Very Large Data Bases. Roma: Morgan Kaufmann Publishers, 2001. 421--430.
10Han J，Proc 2000 ACMSIGMOD Int Conf Management of Data（SIGMOD 2000），2000年

共引文献404

1罗航,余利娟,张康.移动端考研产品的春天真的到来了吗?[J].广东经济,2017,0(7X):157-157.
2王晓燕,程志梅.数据挖掘技术在高校学生管理中的应用[J].电脑知识与技术（过刊）,2007(18):1725-1726.
3韩奎国,龚卫国,李伟红,马任飞,史澜.基于CRM的大型商场POS-MIS系统的设计开发[J].仪器仪表学报,2005,26(z2):337-340.
4刘洪婧,邓芬.关联规则Apriori算法的一种优化与实现[J].计算机时代,2009(3):62-64. 被引量：2
5王洪云.加强教学档案管理为提高教学质量服务[J].黑龙江档案,2006(1):28-28.
6李霞,王秋云,董健康.关联规则挖掘算法[J].科技经济市场,2006(12):285-286.
7董云龙 ,何友 ,谢曦鹏 .网络入侵检测技术研究[J].海军航空工程学院学报,2004,19(4):491-494.
8李玉鑑.自适应K-均值聚类算法[J].计算机研究与发展,2007,44(z2):100-104. 被引量：5
9马猛,唐理兵,李学俊.基于OLAP的关联规则的挖掘[J].宿州学院学报,2004,19(5):77-78.
10朱倩.略论高校教学管理中数据挖掘技术的应用[J].硅谷,2009,2(4). 被引量：6

同被引文献4

1高春矿.煤矿安全监控系统现状与发展前景[J].煤炭技术,2004,23(11):65-66. 被引量：47
2李杰,贾瑞玉,张璐璐.一个改进的基于DBSCAN的空间聚类算法研究[J].计算机技术与发展,2007,17(1):114-116. 被引量：13
3高滢,刘大有,徐益.一种特征加权的聚类算法框架[J].计算机科学,2008,35(10):152-154. 被引量：6
4周天沛,孙伟.基于蚁群-模糊聚类算法的井下工作面瓦斯突出预测[J].工矿自动化,2012,38(10):42-46. 被引量：8

引证文献1

1董萍.改进的空间聚类算法在煤矿瓦斯监测系统中的应用研究[J].煤炭技术,2014,33(2):84-86. 被引量：2

二级引证文献2

1陈佳,石林.数据挖掘中模糊C聚类算法的寻优能力优化[J].科技通报,2015,31(9):208-211. 被引量：2
2武珍珍.数据挖掘在煤矿瓦斯监测系统中的研究[J].煤炭技术,2017,36(9):258-260. 被引量：6

1张斌,孟凡荣,闫秋艳.基于网格和队列触发的多维空间Skyline查询算法[J].微电子学与计算机,2010,27(8):108-111.
2姜婷.基于改进离散蜂群算法的车辆路径问题求解[J].湖北文理学院学报,2016,37(2):9-14. 被引量：2
3姜婷.求解配送中心选址问题的改进人工蜂群算法[J].四川理工学院学报（自然科学版）,2016,29(1):24-28. 被引量：4
4张凤斌,杨泽,葛海洋.基于聚类的邻域检测器生成算法[J].计算机工程,2016,42(2):131-136. 被引量：2
5胡瑞飞,殷国富,谭颖.一种混合聚类算法及其应用[J].四川大学学报（工程科学版）,2006,38(5):156-161. 被引量：2
6陈平,李毅红.基于线阵CCD的小物体掉落自动检测系统[J].制造业自动化,2013,35(4):45-49. 被引量：6
7李团结,曹玉岩,孙国鼎.动态改变邻域空间和搜索步的自由搜索算法[J].西安电子科技大学学报,2010,37(4):737-742. 被引量：5
8信俊昌,王培,王之琼,王国仁,郭欣宇.基于连接操作的反轮廓查询处理算法[J].小型微型计算机系统,2014,35(10):2249-2255.
9周水庚,周傲英,曹晶,胡运发.一种基于密度的快速聚类算法[J].计算机研究与发展,2000,37(11):1287-1292. 被引量：89
10曾毅,朱旭生,廖国勇.一种基于邻域空间的混合粒子群优化算法[J].华东交通大学学报,2013,30(3):44-49. 被引量：6

计算机工程与应用

2008年第16期

浏览历史

内容加载中请稍等...

一种基于网格索引的数据聚类算法被引量：1

参考文献5

二级参考文献11

共引文献404

同被引文献4

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种基于网格索引的数据聚类算法 被引量：1

参考文献5

二级参考文献11

共引文献404

同被引文献4

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种基于网格索引的数据聚类算法被引量：1