期刊文献+

不确定性推理在文本分类上的应用研究 被引量:3

Application Research of Uncertainty Reasoning on Text Categorization
下载PDF
导出
摘要 在文本分类中k-NN分类方法简洁而有效,但在多类分类问题中,由于类的重叠和属性的不足导致训练文本和类边界出现不确定性,而传统k-NN分类方法无法处理这种不确定性.该文借助于几种经典的不确定性推理方法:DS证据理论、模糊集理论、模糊-粗糙集理论,来改进传统k-NN文本分类方法,实验表明基于不确定性推理的方法能够提高文本分类的精度和召回率. The k-nearest neighbors( k-NN ) categorization method is simple and effective in text categorization. The uncertainty of training documents and classes border would appear in multi-class categorization, because of the overlapping of classes and the lack of features. But the conventional k-NN method is unsuitable to deal with this uncertainty. In this paper, the several classical uncertainty reasoning methods, such as DS evidence theory, fuzzy..set theory, fuzzy-rough set theory, is applied to modify the conventional k-NN categorization method. The experimental test shows that these methods based on uncertainty reasoning improve the efficiency of text categorization.
出处 《江西师范大学学报(自然科学版)》 CAS 北大核心 2007年第4期383-386,共4页 Journal of Jiangxi Normal University(Natural Science Edition)
基金 国家自然科学基金(60663007)资助项目
关键词 不确定性推理 K近邻法 文本分类 Uncertainty Reasoning k-NN Text Categorization
  • 相关文献

参考文献10

  • 1Dubois D,Prade H.Putting rough sets and fuzzy sets together[A].Intelligent Decision Support:Handbook of Applications and Advanced of the Rough Set Theory[C].Boston:Slowinski R ED,Kluwer Academic Publishers,1992:203-222.
  • 2Yao,Y Y.A comparative study of fuzzy sets and rough sets[J].Information Sciences,1998,109(4):227-242.
  • 3Keller J M,Gray M R,Givens J A.A fuzzy K-nearest neighbor algorithm[J].IEEE Transactions on System Man and Cybernetics,1985,15(4):580-585.
  • 4Denoeux T.A k-nearest neighbor classification rule based on Dempster-Shafer theory[J].IEEE Transactions on System,Man and Cybernetics,SMC,1995,25(5):804-813.
  • 5Yang Yiming,Pederen Jan O.A comparative study on feature selection in text categorization[A].Proceeding of the fourteenth international conference on machine learning (CML'97)[C].Boston:kluwer Academic Publishers,1997:121-129.
  • 6Francois J,Grandvalet Y,Denoeux T,et al.Resample and Combine:An Approach to Improving Uncertainty Representation in Evidential Pattern Classification[J].Information Fusion,2003 (4):75-85.
  • 7Ruthven I,Lalmas M.Using Dempster-Shafer's theory of evidence to combine aspects of information use[J].Journal of Intelligent Information Systems,2003,19(3):267-301.
  • 8张文修 吴伟志 梁吉业.粗糙集理论与方法[M].北京:科学出版社,2003.107-112.
  • 9单松巍,冯是聪,李晓明.几种典型特征选取方法在中文网页分类上的效果比较[J].计算机工程与应用,2003,39(22):146-148. 被引量:76
  • 10付雪峰,王明文.基于模糊-粗糙集的文本分类方法[J].华南理工大学学报(自然科学版),2004,32(z1):73-76. 被引量:8

二级参考文献12

  • 1曾黄麟.粗集理论及其应用--关于数据推理的新方法[M].重庆:重庆大学出版社,1998..
  • 2冯是聪 单松巍 张志刚 等.一个中文网页数据集及其分类体系[A]..海峡两岸技术交流会[C].南京,2002-10.121-129.
  • 3[1]Dubois D,Prade H. Putting rough sets and fuzzy sets together [A]. Intelligent Decision Support: Handbook of Applications and Advanced of the Rough Set Theory [C].Boston: Slowinski R ED, Kluwer Academic Publishers, 1992. 203 - 222.
  • 4[2]Yao Y Y. A comparative study of fuzzy sets and rough sets [J]. Information Sciences, 1998,109 (1-4): 227 -242.
  • 5[4]Keller J M, Gray M R, Givens J A. A fuzzy k-nearest neighbor algorithm [J]. IEEE Transactions on System Man and Cybernetics, 1985,15 (4) :580 - 585.
  • 6[5]Yang Y,Pederen J P. A comparative study on feature selection in text categorization [A]. Proceeding of the Fourteenth International Conference on Machine Learning (ICML97) [C]. Nashville Tennessee USA :Morgan Kaufmann, 1997.412 - 420.
  • 7[7]Denoeux T. A k-nearest neighbor classification rule based on Dempster-Shafer theory [J]. IEEE Transactions on System Man and Cybernetics, 1995,25(5):804 -813.
  • 8[8]Francois J, Grandvalet Y, Denoeux T, et al. Resample and combine:An approach to improving uncertainty representation in evidential pattern classification [J]. Information Fusion,2003 (4) :75 -85.
  • 9Yiming Yang,Jan O Pedersen.A comparative Study on Feature Selection in Text Categorization[C].In :Proceedings of the Fourteenth International Conference on Machine Leaming(ICML'97), 1997.
  • 10Yiming Yang,Xin Liu.A re-examination of text categorization methods[C].In:Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval SIGIR'99,1999:42---49.

共引文献184

同被引文献31

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部