期刊文献+

模糊聚类在中文文本分类中的应用研究 被引量:5

Study on Application of Fuzzy Clustering in Chinese Text Categorization
下载PDF
导出
摘要 将基于等价关系的模糊聚类技术应用于中文文本分类,提出了基于模糊聚类的中文文本分类算法ATCFC。该算法利用基于二级字索引的正向最大匹配算法对文本分词,建立模糊特征向量空间模型,使用贴近度法刻划文本间的相似度。利用算法ATCFC对文本集合进行动态聚类实验,实验结果表明算法ATCFC对于中文文本分类是可行、有效的。 This paper studies Chinese text categorization with the technique of fuzzy clustering based on equivalence relation and proposes an algorithm(ATCFC) for Chinese text categorization based on fuzzy clustering, This algorithm uses forward maximum match algorithm based on two-level word-index to segment Chinese text,creates fuzzy feature vector space model and describes similarity degree among texts using the method of close degree.Algorithm ATCFC is used to conduct a dynamic clustering experiment on a text set and the experimental results demonstrate that algorithm ATCFC is feasible and effective for Chinese text categorization.
出处 《计算机工程与应用》 CSCD 北大核心 2006年第8期170-172,177,共4页 Computer Engineering and Applications
基金 江苏省重点实验室开放基金资助项目(编号:KJS03064)
关键词 模糊聚类 文本分类 贴近度 模糊等价矩阵 fuzzy clustering, text categorization, close degree, fuzzy equivalence matrix
  • 相关文献

参考文献5

  • 1W Lam,C Y Ho.Using a generalized instance set for automatic text catego-rization[C].In:Proceedings of the 21th Ann Int ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR ' 98),1998:81~89
  • 2A McCallum,K Nigam.A comparison of event models for naive bayes text cla-ssification.AAAI-98 Workshop on Learning for text categorization,1998
  • 3Thorsten Jocachims.Text Categorization with Support Vector Machine:Learning with Many Relevant Features[C].In:European Conference on Machine Learning (ECML'97),1997:170~179
  • 4周水庚,关佶红,俞红奇,胡运发.基于Ngram信息的中文文档分类研究[J].中文信息学报,2001,15(1):34-39. 被引量:23
  • 5解冲锋,李 星.基于序列的文本自动分类算法[J].软件学报,2002,13(4):783-789. 被引量:35

二级参考文献10

  • 1Xiang,Jing-cheng,Wang Yi-qing.Singal Detection and Estimation.Beijing: Electronics Industry Press,1994.165~166 (in Chinese).
  • 2Lam,W.,Ruiz,M.,Srinivasan,P.Automatic text categorization and its application to text retrieval.IEEE Transactions on Knowledge and Data Engineering,1999,11(6):865~879.
  • 3Chute,C.G.An example based mapping method for text categorization and retrieval.ACM Transactions on Information System,1994,12(3):252~277.
  • 4Cohen,W.W.,Singer,Y.Context-Sensitive learning methods for text categorization.ACM Transactions on Information System,1999,17(2):141~173.
  • 5Turle,H.,Croft,B.Evaluation of an inference network net-based retrieval model.ACM Transactions on Information System,1991,9(3):187~222.
  • 6Apte,C.,Damerau,F.Automated learning of decision rules for text categorization.ACM Transactions on Information System,1994,12(3):233~251.
  • 7Belkin,N.J.,Croft,W.B.Information filtering and information retrieval: two sides of the same coin? Communications of the ACM,1994,35(12):29~38.
  • 8黄萱菁,吴立德.基于向量空间模型的文档分类系统[J].模式识别与人工智能,1998,11(2):147-153. 被引量:24
  • 9邹涛,王继成,黄源,张福炎.中文文档自动分类系统的设计与实现[J].中文信息学报,1999,13(3):26-32. 被引量:45
  • 10战学刚,林鸿飞,姚天顺.中文文献的层次分类方法[J].中文信息学报,1999,13(6):20-25. 被引量:22

共引文献55

同被引文献30

引证文献5

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部