期刊文献+

An Effective Concept Extraction Method for Improving Text Classification Performance

An Effective Concept Extraction Method for Improving Text Classification Performance
下载PDF
导出
摘要 This paper presents anew way to extract concept that can beused to improve text classification per-formance (precision and recall). Thecomputational measure will be dividedinto two layers. The bottom layercalled document layer is concernedwith extracting the concepts of parti-cular document and the upper layercalled category layer is with findingthe description and subject concepts ofparticular category. The relevant im-plementation algorithm that dramatic-ally decreases the search space is dis-cussed in detail. The experiment basedon real-world data collected from Info-Bank shows that the approach is supe-rior to the traditional ones. This paper presents a new way to extract concept that can be used to improvetext classification performance (precision and recall). The computational measure will be dividedinto two layers. The bottom layer called document layer is concerned with extracting the concepts ofparticular document and the upper layer called category layer is. with finding the description andsubject concepts of particular category. The relevant implementation algorithm that dramaticallydecreases the search space is discussed in detail. The experiment based on real-world data collectedfrom Info-Bank shows that the approach is superior to the traditional ones.
机构地区 不详 lecturer
出处 《Geo-Spatial Information Science》 2003年第4期66-72,共7页 地球空间信息科学学报(英文)
基金 Project supported by the National Natural Science Foundation of China (No. 60082003) and the National High Technology Research and Development Program of China (N0.863-306-ZD03-04-1).
关键词 text classification concept extraction characteristic term associationrule ALGORITHM 概念 计算方法 运算法则 正文 分类 有效性 实用性
  • 相关文献

参考文献6

  • 1[1]Tan A H (2001) Predictive self-organizing networks for text categorization. The 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining,Hong Kong.
  • 2[2]Sebastiani F (2003) Machine learning in automated text categorization. ACM Computing Surveys. http://www. cvc. uab. es/shared/teach/a20368/ACMCS00. pdf.
  • 3[3]Lewis D D (1992) Feature selection and feature extraction for text categorization. Speech and Natural Language Workshop, San Francsico.
  • 4[4]Han J W, Kamber M (2001) Data mining: concepts and techniques. California: Morgan Kaufmann.
  • 5[5]Li C, Luo Z S, Li Y H (2002) Research on automatic classification of documents based on concept attributes. 2002 IEEE International Conference on Systems, Man and Cybernetics.
  • 6[6]Bakus J, Kamel M, Carey T (2002) Extraction of text phrases using hierarchical grammar. The Fifteenth Canadian Conference on Artificial Intelligence (AI'2002) ,Ottawa.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部