期刊文献+

一种基于动态SOM的增量中文文本聚类方法 被引量:2

An Incremental Clustering Method of Chinese Documents Based on DASOM
原文传递
导出
摘要 对一种基于动态可调自组织神经网络(the dynamic adaptive self-organizing map neural network,简称DASOM)的增量中文文本聚类方法进行研究,认为其只需处理更新数据,提高聚类速度,并能自动抽取SOM聚类结果;DASOM模型具有动态的结构,通过数值实验表明该方法对中文文本增量聚类具有有效性。 This paper studies an incremental clustering method of Chinese documents based on dynamic adaptive self-organizing map neural network (in short DASOM),which can speed up clustering because it only recalculate the update datas. Features of DASOM are dynamic structure and organizing SOM's clusters results automatically. Numerical experiment shows that the method is efficient for clustering Chinese documents.
作者 朱红灿 唐毅
出处 《图书情报工作》 CSSCI 北大核心 2007年第6期116-119,126,共5页 Library and Information Service
关键词 增量聚类 文本聚类 自组织神经网络(SOM) 向量空间模型 incremental clustering documents clustering self-organizing map VSM
  • 相关文献

参考文献4

二级参考文献16

  • 1刘立平,易华容,何文斌.一种基于向量空间模型的文本聚类方法[J].株洲师范高等专科学校学报,2004,9(5):23-25. 被引量:4
  • 2[1]Agrawal R,Imielinsky T,Swami A.Mining Association Rules Between Sets of Items in Large Databases.In:Proceedings of the 1993ACM SIGMOD Conference,Washington DC.USA.1993-05
  • 3[2]Agrawal R,Srikant R.Fast Algorithms for Mining Generalized Association.VLDB94,Santiago,Chile,1994-09
  • 4[3]World Wide Web Consortium.Httpd-log Files.In:http://lists.w3.org/Archives,1998
  • 5[4]Toivonen H.Sampling Large Databases for Association Rules.In:Proceedings of the 22nd VLDB Mumbai (Bombay),India,1996-09
  • 6[5]Agrawal R,Srikant R.Fast Algorithms for Mining Association Rules.In:Proceedings of the ACM SIGMOD International Conference.1994-05
  • 7[6]Cooley R,Mobasher B,Srivastava J.Web Mining: Information and Pattern Discovery on the World Wide Web.In:Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence(ICTAI97),1997-11
  • 8[7]Masseglia F,Poncelet P,Han E,et al.Web Mining:Pattern Discobery from World Wide Web Transactions.Technical Report TR-96-050,Department of C.S.,University of Minnesota,1996
  • 9[8]Zaiane O,Xin M,Han J.Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs.In:Proceedings on Advances in Digital Libraries Conference (ADL98),Santa Barbara,CA,1998-04
  • 10[9]Lin M Y,Lee S Y.Incremental Update on Sequential Patterns in Large Databases.In:Proceedings of the Tools for Artificial Intelligence Conference (TAI98) ,1998-05

共引文献35

同被引文献25

  • 1徐建锁,王正欧.基于LSI和自组织神经网络的高效文本聚类方法[J].天津大学学报(自然科学与工程技术版),2004,37(11):1026-1030. 被引量:7
  • 2徐海霞.聚类分析在Web文本挖掘中的应用[J].情报杂志,2004,23(12):99-101. 被引量:4
  • 3宋韶旭,李春平.基于非对称相似度的文本聚类方法[J].清华大学学报(自然科学版),2006,46(7):1325-1328. 被引量:7
  • 4彭京,杨冬青,唐世渭,付艳,蒋汉奎.一种基于语义内积空间模型的文本聚类算法[J].计算机学报,2007,30(8):1354-1363. 被引量:44
  • 5朱红灿,唐毅.一种基于DASOM的两阶段中文文本聚类方法[J].情报杂志,2007,26(9):101-104. 被引量:3
  • 6Daigle R J, Arnold V. An analysis of the research pro- ductivity of AIS faculty [J].International Journal of Ac- counting Information Systems, 2000, 1(2): 106 - 122.
  • 7Reid E F, Chen H. Mapping the contemporary terror- ism research domain [J]. International Journal of Hu- man-Computer Studies, 2007, 65(1): 42-56.
  • 8Huang Z, Chen H, Chen Z K, Roco M C. International nanotechnology development in 2003: Country, institu- tion, and technology field analysis based on USPTOpatent database [J]. Journal of Nanoparticle Research, 2004, 6 (4), 325 - 354.
  • 9Shen J, Yao L, Li Y, Clarke M, Gan Q, Fan Y, Zhong D, Li Y, Gou Y, Wang L. Visualization studies on evi- dence-based medicine domain knowledge (series 2): Structural diagrams of author networks [J]. Journal of Evidence-Based Medicine, 2011, 4(2): 85 -95.research sub- jects [J]. Journal of Evidence-Based Medicine, 2011, 4(2): 73 - 84.
  • 10Shen J, Yao L, Li Y, Clarke M, Gan Q, Fan Y, Zhong D, Li Y, Gou Y, Wang L. Visualization studies on evi- dence-based medicine domain knowledge (Series 1): Mapping of evidence-based medicine.

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部