期刊文献+

一种统计降维和Kohonen网络相结合的文本聚类方法 被引量:3

New text clustering method based on statistical reduction dimension and Kohonen network
下载PDF
导出
摘要 提出了一种基于词条互信息(WM I)值的统计降维和Kohonen网络(SOFM网)相结合的文本聚类方法,WM I值的方法侧重考虑文本特征项之间的互信息进行降维,可提高特征选择的效率,并使其更趋实用化。采用Kohonen网络进行文本聚类,其学习率函数是随时间单调下降的退火函数,实验结果表明了这种结合方法较一般的降维方法得到的聚类结果具有较高的聚类精度。 A new text clustering method based on WMI(words mutual information) statistical reduction dimension approach and Kohonen network( SOFM network) was proposed. This method considered the mutual information between text features and improved the efficiency of reduction dimension largely. Text clustering used Kohonen network whose learning rate was dropped with time. The experiments indicate that the clustering result obtained using this combined method has much higher precision than that using general reduction dimension approach.
出处 《计算机应用》 CSCD 北大核心 2005年第10期2328-2330,2333,共4页 journal of Computer Applications
基金 国家自然科学基金资助项目(60275020)
关键词 词条互信息 统计降维 KOHONEN网络 文本聚类 WMI statistical reduction dimension Kohonen network text clustering
  • 相关文献

参考文献8

  • 1PAL SK, et al. Web mining in soft computing framework: relevance, state of the art and future directions [ J]. IEEE Transactions on Neural Networks, 2002, 13(5): 1163 - 1177.
  • 2KASKI S. Dimensionality Reduction by Random Mapping: Fast similarity Computation for Clustering[ A]. Proceedings of international Joint Conferfence on Neural Networks(IJCNN98) [ C]. IEEE Service Center, Piscataway, NJ, 1998. 413 -418.
  • 3KURIMO M. Fast latent semantic indexing of spoken documents by using self-organizing maps[ A]. IEEE international Conference on Acoustics, Speech and signal Processing(ICASSP00) [ C]. Istanbul,Turkey, 2000. 2425 - 2428.
  • 4钱晓东,王正欧.基于SOM网络的随机映射文本降维方法[J].计算机应用,2004,24(5):56-58. 被引量:2
  • 5DAGAN I, MARCUS S. Contextual word similarity and estimation from sparse data[ J] . Computer Speech and Language, 1995, 9:123-152.
  • 6SAHAMI M. Using machine learning to information access[ D]. A Dissertation of Standford university , 1998.
  • 7KOHONEN T. self-organizing maps[ M]. Springer, Berlin, 1995.
  • 8王明春,王正欧,张楷,郝玺龙.一种基于CHI值特征选取的粗糙集文本分类规则抽取方法[J].计算机应用,2005,25(5):1026-1028. 被引量:8

二级参考文献11

  • 1SHEN Q. Alexios chouchoulas. A rough-fuzzy approach for generating classification rules[J]. Pattern Recogonition, 2002,(35):2425-2438.
  • 2CHEN YQ.Implementing the k-nearest neighbour rule via a neural network[A]. IEEE International Conference on Neural Networks[C], 1995, vol.1.136-140.
  • 3PAWLAK Z, GRAYMALA-BAUSSE J,Slowinski R. Rough sets[J]. Communications of the ACM, 1995,38(11):89-95.
  • 4HAN J, KAMBR M.DATA MINING: Concepts and techniques[M].Beijing: Higher Education Press, 2001.
  • 5Kurimo M. Fast latent semantic indexing of spoken documents by using self-organizing maps[A]. IEEE International Conference on Acoustics,Speech,and Signal Processing(ICASSP′00)[C]. Istanbul,Turkey,2000. 2425-2428.
  • 6Kaski S. Dimensionality Reduction by Random Mapping:Fast similarity Computation for Clustering[A]. Proceedings of International Joint Conference on Neural Networks(IJCNN′98)[C]. IEEE Service Center,Piscataway,NJ,1998. 413-418.
  • 7Hecht-Nielsen R. Context vectors:general purpose approximate meaning representations self-organized from raw data[A]. Computational Intelligence:Imitating Life[C]. IEEE Press,Piscataway,NJ,1994. 43-65.
  • 8Inien Syu,Lang SD,Deo N. A Neural Network Model for Information Retrieval Using LatentSemantic Indexing[A]. IEEE International Conference on Neural Networks[C]. 1996. 1318-1323.
  • 9Kaski S. Fast winner search for SOM-based monitoring and retrieval of high- dimensional data[A]. Proceedingsof the 9th International Conference on Artificial Neural Networks(ICANN′99)[C]. London,UK,1999. 940-945
  • 10常犁云,263.net,王国胤,263.net,吴渝,263.net.一种基于Rough Set理论的属性约简及规则提取方法[J].软件学报,1999,10(11):1206-1211. 被引量:285

共引文献8

同被引文献42

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部