期刊文献+

基于随机关键词技术的文本特征降维

Stochastic Keyword In Text Feature Dimension Reduction
下载PDF
导出
摘要 运用随机关键词产生技术将带有关键词的文本表示为一组关键词的条件概率向量,对文本特征空间进行转换,既有效的实现了一个不同于传统的特征降维过程,新的文本特征空间又能很好的覆盖到文本关键词的信息。 A text with keywords may be presented a conditional probability value vector of a set of keywords by using stochastic keyword generation, implementing text Feature space transformation. It is a process that differs from the traditional text Feature dimension reduction,and the new text Feature space can also override the text keywords efficiently.
作者 刘颖
出处 《电脑与信息技术》 2008年第4期43-45,共3页 Computer and Information Technology
关键词 特征降维 随机关键词产生 关键词 feature dimension reduction stochastic keyword generation keyword
  • 相关文献

参考文献5

  • 1Fabrizio Sebastiani.Machine learning in automated text categorization [J]. ACM Computing Surveys, 2002, 34(1):147.
  • 2Yiming Yang and Jan O. Pedersen.A comparative study on feature selection in text categorization [C].In Douglas H. Fisher, editor, Proceedings of ICML-97, 14th International Conference on Machine Learning, pages 412-420, Nashville, US, 1997.Morgan Kaufmann Publishers, San Francisco, US.
  • 3S. Deerwester, S. T. Dumais, G. W. Fumas, T. K. Landauer, and R. Harshman.Indexing by latent semantic indexing [J].Joumal of the American Society for Information Science, 1999, 41(6):391-407.
  • 4陈文亮,朱靖波,朱慕华,姚天顺.基于领域词典的文本特征表示[J].计算机研究与发展,2005,42(12):2155-2160. 被引量:22
  • 5苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:386

二级参考文献15

  • 1王建会,王洪伟,申展,胡运发.一种实用高效的文本分类算法[J].计算机研究与发展,2005,42(1):85-93. 被引量:20
  • 2李荣陆,王建会,陈晓云,陶晓鹏,胡运发.使用最大熵模型进行中文文本分类[J].计算机研究与发展,2005,42(1):94-101. 被引量:95
  • 3Fabrizio Sebastiani. Machine learning in automated text categorization. ACM Computing Surveys, 2002, 34(1): 1- 47.
  • 4D. Lewis, Ringuette. A comparison of two learning algorithms for text categorization. Symposium on Document Analysis and IR,Las Vegas, 1994.
  • 5Yiming Yang, Xin Liu. A re-examination of text categorization methods. In: Proc. 22nd Annual Int'l ACM SIGIR Conf.Research and Development in Information Retrieval. New York:ACM Press, 1999. 42-49.
  • 6Scott, Sam, Stan Matwin. Text classification using WordNet hypernyms. The COLING/ACL Workshop on Usage of WordNet in Natural Language Processing Systems, Montreal, 1998.
  • 7L.D. Baker, A. K. MCallum. Distributional clustering of words for text classification. In: Proc. 21st Annual Int'l ACM SIGIR Conf. Research and Development in Information Retrieval. New York: ACM Press, 1998. 96- 103.
  • 8Sangkon Lee, Masami Shishibori. Passage segmentation based on topic matter. Computer Processing of Oriental Languages, 2002,15(3): 305-340.
  • 9Chen Wenliang, Chang Xingzhi, Wang Huizhen, et al.Automatic word clustering for text categorization using global information. AIRS2004, Beijing, 2004.
  • 10Yiming Yang.A comparative study on feature selection in text categorization.The 14th Int'l Conf.Machine Learning.Nashville,1997.

共引文献405

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部