期刊文献+

中文文本分类中利用依存关系的实验研究 被引量:16

Study on Chinese text classification based on dependency relation
下载PDF
导出
摘要 为了利用依存关系进行短文本分类,研究了利用依存关系进行短文本分类存在的四个关键问题。分别在长文本语料集和两个短文本语料集上,抽取具有依存关系的词对,并利用这些词对作为特征进行分类实验。实验结果表明:依存关系能够作为有效的特征进行文本分类,并能够改善文本分类的性能;单独把依存关系作为特征,不能提高短文本的分类性能;可以利用依存关系作为特征扩充的手段,增加短文本的特征,增强短文本的描述能力,进而进行有效的短文本分类。 Four key issues of classifying Chinese short text using dependency relation are discussed to use dependency relation to classify Chinese short text effectively.This paper extracts the dependency relations between two words in a long-text corpus and two short-text corpuses,and uses these word-pairs to classify texts in order to analyze the role of dependency relation in short text classification.Experiments show that Using dependency relation to classify texts can improve the classification performance;Using dependency relation to classify short texts lonely can not improve the classification performance;dependency relation as means to expand features can increase features and enhance description ability of short text in order to classify short texts effectively.
作者 王鹏 樊兴华
出处 《计算机工程与应用》 CSCD 北大核心 2010年第3期131-133,141,共4页 Computer Engineering and Applications
基金 国家自然科学基金No.60703010 重庆市自然科学基金 No.2006BB2374 重庆市教委科学技术研究项目(No.KJ070519) 教育部回国留学人员启动基金(教外司留[2007]1109号~~
关键词 依存关系 短文本 文本分类 dependency relation short text text classification
  • 相关文献

参考文献6

  • 1Zelikovitz S,Marquez F.Transductive learning for short text classification problems using latent semantic indexing[J].International Journal of Pattern Reeognition and Artificial Intelligence,2005,19 (2): 143-163.
  • 2Zelikovitz S.Transductive LSI for short text classification problems[C]// Proceedings of the 17th International FLAIRS Conference,2004: 556-561.
  • 3Zelikovitz S,Hirsh H.Improving short text classification using unlabeled background knowledge to assess document similarity[C]// Proceedings of ICML-2002,2002:1183-1190.
  • 4Robinson J.Dependency structures and transformational rules[J]. Language, 1970,46( 2 ) : 259-285.
  • 5Gao jianfeng,Suzuki H.Unsupervised learning of dependency structure for language modeling[C]//ACI 2003:521-528.
  • 6Yuret D.Discovery of linguistic relations using texical attraction[D]. MIT, 1998.

同被引文献182

引证文献16

二级引证文献153

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部