期刊文献+

本体驱动的文本虚拟样本构造方法研究 被引量:4

Research on Ontology-driven Text Virtual Sample Constructing
下载PDF
导出
摘要 构造虚拟样本能够为机器学习中的训练集融入先验知识,从而改善标注瓶颈问题。提出了一种本体驱动的文本虚拟样本构造方法。在确保类别不变性的前提下,该方法依据领域相关本体所明晰表达的领域知识,基于本体树的点、边、子树,从同义、父子、语义同构的多个词义关系角度实现了文本虚拟样本的构造。初步实验表明,该方法与原分类及类似方法相比具有更好的分类精度和推广能力。 Constructing virtual examples can incorporate prior knowledge into training set in machine learning, so as to alleviate the labeling bottleneck. An Ontology-driven scheme to construct text virtual sample is proposed. Under the precondition of label invariability, the proposal constructs virtual samples according to the domain knowledge explicitly formalized by domain-specific Ontology. Based on the different Ontology tree structures, namely nodes, edges, and sub-trees, various lexical-semantic relations, including synonymy, paternity, and semantic isomorphs, are applied into text virtual example constructing. The primary experimental results show the scheme outperforms original text catego- rizations and other similar ones in precision and generalization ability.
出处 《计算机科学》 CSCD 北大核心 2008年第3期142-145,共4页 Computer Science
基金 国家自然科学基金资助项目(60675015)
关键词 虚拟样本 文本分类 本体 本体树 领域知识 Virtual example, Text categorization, Ontology, Ontology tree, Domain knowledge
  • 相关文献

参考文献9

  • 1苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:383
  • 2Niyogi P, Girosi F, Poggio T. Incorporating prior information on machine learning by creating virtual examples [J]. Proe. IEEE, 1998, 86(11): 2196-2209
  • 3Poggio T, Vetter T. Recognition and structure from one 2D model view: observations on prototypes, object classes, and symmetries[C]. A.I. Memo No. 1347, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 1992
  • 4Scholkopf B, Simard P, Smola A, et al. Prior knowledge in support vector kernels [C]. Advances in Neural Information Processing Systems. MIT Press, 1998
  • 5李辉,史忠植,许卓群.运用文本领域的常识改善基于支撑向量机的文本分类器性能[J].中文信息学报,2002,16(2):7-13. 被引量:16
  • 6Sassano M. Virtual examples for text classification with support vector machines[C]. In: Proceedings of 2003 Conference on Empirical Methods in Natural Language Processing,2003. 208-215
  • 7Bhogal J, Macfarlane A, Smith P. A review of ontology based query expansion [J ]. Information Processing and Management, 2007,43(4) :866-886
  • 8Latifur R K, Mcleod D. Ontology-based information selection [D]. California:University of Southern California, 2000
  • 9Chih-Chung Chang and Chih-Jen LirL LIBSVM: a library for support vector machines[CP], 2001. Software available at http:// www. csie.ntu. edu. tw/-cjlin/libsvm.

二级参考文献5

共引文献396

同被引文献51

引证文献4

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部