An Effective Concept Extraction Method for Improving Text Classification Performance

Abstract: This paper presents a new way to extract concepts that can be used to improve text classification performance (precision and recall). The computational measure is divided into two layers. The bottom layer, called the document layer, is concerned with extracting the concepts of a particular document, and the upper layer, called the category layer, is concerned with finding the description and subject concepts of a particular category. The relevant implementation algorithm, which dramatically decreases the search space, is discussed in detail. An experiment based on real-world data collected from Info-Bank shows that the approach is superior to traditional ones.

Authors: ZHANG Yuntao, GONG Ling, WANG Yongcheng, YIN Zhonghang

Affiliation: not specified (lecturer)

Source: Geo-Spatial Information Science (地球空间信息科学学报, English edition), 2003, Issue 4, pp. 66-72 (7 pages)

Funding: Project supported by the National Natural Science Foundation of China (No. 60082003) and the National High Technology Research and Development Program of China (No. 863-306-ZD03-04-1).

Keywords: text classification; concept extraction; characteristic term; association rule; algorithm; concept; computational method; effectiveness; practicality

Classification code: O29 [Science: Applied Mathematics]
