期刊文献+

基于弱指导SVM的汉语动词次范畴化自动获取 被引量:2

Subcategorization Acquisition Based on Weakly Supervised SVM for Chinese Verbs
下载PDF
导出
摘要 动词次范畴化自动获取过程主要涉及到两个典型步骤一、依据启发性规则生成次范畴化假设;二、应用统计方法对假设集合进行过滤,选择可靠的次范畴化类型。此前改进获取性能的研究都集中在统计过滤阶段,并且相关实验的假设生成阶段都没有涉及到有指导的训练过程,因此所有这些方法都是无指导的。文章提出一种弱指导的汉语动词次范畴化自动获取方案,并应用SVM分类器取代原系统中的无指导假设生成模块。实验结果表明,最终获取性能有了统计意义上的改善。 Procedure of subcategorization acquisition mainly includes two typical steps :Subcategorization hypotheses are generated according to certain heuristic rules ;Hypotheses are filtered via statistical methods and reliable subcategorization types are selected.Previous efforts to improve the acquisition performance are focused on statistical filtering,and there is no supervised training for the generation of hypotheses in relevant experiments,Therefore,all these methods are unsupervised.This paper proposes a weakly supervised method for Chinese subcategorization acquisition, where the unsupervised hypothesis generator is replaced with a SVM classifier.Results of experiments indicate statistically significant improvement in the general acquisition performance.
出处 《计算机工程与应用》 CSCD 北大核心 2006年第28期9-11,27,共4页 Computer Engineering and Applications
基金 国家自然科学基金项目资助(编号:60373101)
关键词 汉语动词 次范畴化 弱指导 SVM Chinese verbs, subcategorization,weakly supervised, SVM
  • 相关文献

参考文献15

  • 1Chomsky N.Aspects of the Theory of Syntax[M].Cambridge:MIT Press,1965
  • 2Korhonen,Anna.Subcategorization Acquisition[D].Dissertation for Ph D.Trinity Hall University of Cambridge,2001:29~77
  • 3Briscoe E J,Carroll J:Automatic Extraction of Subcategorization from Corpora[C].In:Proceedings of the 5th ACL Conference on Applied Natural Language Processing,Washington DC,1997:356~363
  • 4Brent M.Automatic Acquisition of Subcategorization Frames from Untagged Text[C].In:Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics,Berkeley,CA,1991:209~214
  • 5Sabine Schulte im Walde,Helmut Schmid,Mats Rooth et al.Statistical Grammar Models and Lexicon Acquisition[C].In:Christian Rohrer,Antje RoBdeutrcher,Hans Kamp eds.Linguistic Form and its Computation,CSLI Publications,2001
  • 6A Sarkar,D Zeman.Automatic Extraction of Subcategorization Frames for Czech[C].In:Proceedings of the 19th International Conference on Computational Linguistics,Saarbrucken,Germany,2002
  • 7Grzegorz Chrupala.Acquiring Verb Subcategorization from Spanish Corpora[D].PhD Program"Cognitive Science and Language".Universitat de Barcelona,2003:5~71
  • 8Manolis Maragoudakis,Katia Lida Kermanidis,George Kokkinakis.Learning Subcategorization Frames from Corpora:A Case Study for Modern Greek.Wire Communications Laboratory,University of Patras 26500 Rio,Greece,2002
  • 9P Gamallo,A Agustini,P L Gabriel.Using co-composition for acquiring syntactic and semantic subcategorization-Unsupervised lexical acquisition[C].In:Proceedings of the Workshop of the ACL Special Interest Group on the Lexicon (SIGLEX),Philadelphia,2002:34~41
  • 10Han Xiwu,Tiejun Zhao,Muyun Yang.FML-Based SCF Predefinition Learning for Chinese Verbs[C].In:Proceedings of the International Joint Conference of NLP,2004:115~122.

同被引文献30

引证文献2

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部