期刊文献+

—种新的中文层次化文本分类规则设计

下载PDF
导出
摘要 在信息爆炸时代,其中存在大量的中文文本,并且文本之间存在层次关系,为了从中及时的获取有用的信息,需要进行有效的组织和管理。本文通过文本分类的方法,设计了“全路径+自底向上”的层次化分类规则,可以缓解自顶向下分类的阻塞,同时兼顾解决多标签和中间节点分类问题。首先使用BR方法即二元关系法把多标签转化为单标签统一处理,为除根节点外的每个节点构建一个二元分类器,使得可以在中间节点和叶子节点进行分类,然后利用节点及其祖先节点的关系从底向上对分类结果进行筛选过滤,以减少错分现象。实验表明采用该方法比常规自顶向下的方法在宏平均F1和微平均F1有3%到6%的提升。
机构地区 不详
出处 《电信技术研究》 2019年第2期16-21,共6页 Research on telecommunication technology
  • 相关文献

参考文献2

二级参考文献26

  • 1Xue Gui-Rong, Xing Di-Kan, Yang Qiang, et al. Deep classification in large- scale text hierarchies/ /Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Singapore, 2008: 619-626.
  • 2Dh H, Choi y, Myaeng S. Combining global and local information for enhanced deep classification/ /Proceedings of the 25th ACM SIGAPP Symposium on Applied Computing. Sierre , Switzerland, 2010: 1760-1767.
  • 3Malik H. Improving hierarchical SVMs by hierarchy flattening and lazy classification/ /Proceedings of the Large-Scale Hierarchical Classification Workshop in 32nd European Conference on Information Retrial. Milton Keynes, UK, 2010: 1-12.
  • 4Han Xiaogang , Liu Iunfa , Shen Zhiqi , et al. An optimized K -nearest neighbor algorithm for large scale hierarchical text classification/ /Proceedings of the 2011 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Athens, Greece, 2011: 2-12.
  • 5Xing Di-Kan , Xue Gui-Rong, Yang Qiang , et al. Deep classifier: Automatically categorizing search results into largescale hierarchies/ /Proceedings of the 1 st ACM International Conference On Web Search and Data Mining. New York, USA, 2008: 139-148.
  • 6Malik H, Fradkin D, Moerchen F. Single pass text classification by direct feature weighting. Knowledge and Information Systems, 2011, 28(1): 79-98.
  • 7Guan Hu , Zhou Iingyu , Guo Minyi. A class-feature-centroid classifier for text categorization/ /Proceedings of the 18th International Conference on World Wide Web. Madrid, Spain, 2009: 201-210.
  • 8Ceci M, Malerba D. Classifying web documents in a hierarchy of categories: A comprehensive study. Journal of Intelligent Information Systems, 2007, 28(1): 37-78.
  • 9Liu T y, Yang v , Wan H, et al. Support vector machines classification with a very large- scale taxonomy. ACM SIGKDD Explorations Newsletter, 2005, 7 (1): 36-43.
  • 10Cai L, Hofmann T. Hierarchical document categorization with support vector machines/ /Proceedings of the 13th ACM International Conference on Information and Knowledge Management. ACM, 2004: 78-87.

共引文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部