期刊文献+

Application of the probability-based covering algorithm model in text classification

Application of the probability-based covering algorithm model in text classification
下载PDF
导出
摘要 The probability-based covering algorithm(PBCA) is a new algorithm based on probability distribution. It decides, by voting, the class of the tested samples on the border of the coverage area, based on the probability of training samples. When using the original covering algorithm(CA), many tested samples that are located on the border of the coverage cannot be classified by the spherical neighborhood gained. The network structure of PBCA is a mixed structure composed of both a feed-forward network and a feedback network. By using this method of adding some heterogeneous samples and enlarging the coverage radius,it is possible to decrease the number of rejected samples and improve the rate of recognition accuracy. Relevant computer experiments indicate that the algorithm improves the study precision and achieves reasonably good results in text classification. The probability-based covering algorithm(PBCA) is a new algorithm based on probability distribution. It decides, by voting, the class of the tested samples on the border of the coverage area, based on the probability of training samples. When using the original covering algorithm(CA), many tested samples that are located on the border of the coverage cannot be classified by the spherical neighborhood gained. The network structure of PBCA is a mixed structure composed of both a feed-forward network and a feedback network. By using this method of adding some heterogeneous samples and enlarging the coverage radius,it is possible to decrease the number of rejected samples and improve the rate of recognition accuracy. Relevant computer experiments indicate that the algorithm improves the study precision and achieves reasonably good results in text classification.
作者 ZHOU Ying
出处 《Chinese Journal of Library and Information Science》 2009年第4期1-17,共17页 中国文献情报(英文版)
基金 supported by the Fund for Philosophy and Social Science of Anhui Province the Fund for Human and Art Social Science of the Education Department of Anhui Province(Grant Nos.AHSKF0708D13 and 2009sk038)
关键词 Probability-based covering algorithm Structural training algorithm PROBABILITY Text classification Probability-based covering algorithm Structural training algorithm Probability Text classification
  • 相关文献

参考文献6

二级参考文献16

  • 1贾银山,贾传荧.一种加权支持向量机分类算法[J].计算机工程,2005,31(12):23-25. 被引量:20
  • 2Chen Q C,Neural Networks,1994年,5卷,7期,1477页
  • 3Baum E B,Neural Information Processing,1991年,904页
  • 4B. Liu, W. Hsu, and Y. Ma. Integrating Classification and Association Rule Mining [C]. KDD - 98, New York,1998.
  • 5Wenmin Li, Jiawei Han, JianPei. CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules [C] .ICDM2001, Silicon Valley, Ca, Nov 2001:369- 376.
  • 6Maria-Luiza Antonie, Osmar R. Zaiane. Text Document Categorization by Term Association [C]. In: Proc of the IEEE International Conference on Data Mining (ICDM 2002), Maebashi City, Japan: 19 - 26.
  • 7Mohammed J. Zaki, Charu C. Aggarwal. XRules: An Effective Structural Classifier for XML Data [C]. The Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(SIGKDD). Washington, DC,USA, 2003.
  • 8Yiming Yang, Jan O. Pederson. A Comparative Study on Feature Selection in Text Categorization [C]. International Conference on Machine Learning, Nashville, TN, July 1997.
  • 9https://securesite.chireader.com/Archive/stopwords.txt.
  • 10http://www. in2in. com/download. htm.

共引文献171

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部