期刊文献+

一个基于加权和组合降维的web文本分类系统

A Web Page Categorization System Based on Combined Reduction Methods and Authority
下载PDF
导出
摘要 设计了一个web文本分类系统,采用了基于统计分析和粗糙集组合的方法进行降维;降维时考虑了属性的位置信息,采用加权方式标注属性的不同重要性,以达到提高分类速度和分类准确度的目的。 This paper designs a web page categorization system, which uses SVM classifier. It reduces attributes through a method, which combines statistic method and Rough set. At the same time, the system endues different attribute with different authority based on the location of attributes in a web page. The system aims to improve the classifier's rate and result.
作者 张东娜 刘博 ZHANG Dong-na,LIU Bo (1.Dept. of Computer, Zhuhai College of Jilin University,Zhuhai 519004,China;2.TEC Avei Electric (Zhuhai) Co., Ltd.Zhuhai 519040, China)
出处 《电脑知识与技术》 2008年第3期1234-1235,1278,共3页 Computer Knowledge and Technology
关键词 约简 CHI 粗糙集 Reduction CHI Rough set
  • 引文网络
  • 相关文献

参考文献4

二级参考文献22

  • 1冯是聪 单松巍 张志刚 等.一个中文网页数据集及其分类体系[A]..海峡两岸技术交流会[C].南京,2002-10.121-129.
  • 2Yiming Yang,Jan O Pedersen.A comparative Study on Feature Selection in Text Categorization[C].In :Proceedings of the Fourteenth International Conference on Machine Leaming(ICML'97), 1997.
  • 3Yiming Yang,Xin Liu.A re-examination of text categorization methods[C].In:Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval SIGIR'99,1999:42---49.
  • 4Yiming Yang.A study on thresholding strategies for text categorization[C].In:Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR'01),2001.
  • 5KONT KANEN P, MYLLYMAKI P, SILANDER T, et al.BYDA: software for Bayesian classification and feature selection[A]. AGRAWAL R, STOLORZ P E, PIATETSKY- SHAPIRO G, eds. Processdings of the 4th International Conference on Knowledge Discovery and Data Mining (KDD'98) [C]. Menlo Park: AAAI Press, 1998,254-258.
  • 6YANG Y. Expert network: Effective and efficient learning from human decisions in text categorization and retrieval[ A]. Proc .Seventeenth International ACM SIGIR Conference on Research and Developmentin Information Retrieval[ C ]. Dublin, 1994.
  • 7APTE C, DAMERAU F, WEISS S. Automated learning of decision rules for text categorization[ J]. ACM Transactions on Information System ,1994, 12 (3) : 233 - 251.
  • 8SALTON G, WONG YAND C S. A Vector space model for automatic indexing[ J]. Communications of ADC, 1975, 18(11) : 613-620.
  • 9SALTON G. Introduction to Modem Information Retrieval [M]. New York : Mc Graw - Hill Book Company, 1983.
  • 10PAWLAK Z. Rough Sets - Theoretical Aspects of Reasoning About Data[M]. Kluwer Academic Pub, 1991.

共引文献156

;
使用帮助 返回顶部