期刊文献+

自动词性标注方法的比较 被引量:4

Analysis and Comparison of the Part-of-Speech Tagging Techniques
下载PDF
导出
摘要 对机器自动词性标注技术领域的三类主要理论方法(基于规则的方法、基于统计的方法和规则与统计相结合的方法)进行了研究分析和优缺点的对比,并在描述方式、标注依据、机器效率、鲁棒性、标注正确率和实用性等方面,对这三类方法进行认真的比较。比较结果显示规则与统计相结合的方法在各方面都占有较明显的优势,是目前最理想的标注方法。基于此类方法的自动词性标注技术可以较好地满足实际应用的要求。此外,本文还指出这类方法有待解决的三大难题。 With the development of the natural language processing technology, diverse techniques of part-of-speech tagging have got boost in recent years. After the elaborate study of those techniques, we find that the core methodology of them falls into three groups: rule-based, statistics-based and the combination of rule and statistics. In this paper, we put the main effort on the comparison of the three types of methods and point out the advantage, the disadvantage and some serious problems. Furthermore, the article concludes that the combinatory method achieves the best results and possesses the applicable value. However, the combinatory method also leaves some haunting problems as well.
作者 陈晓文
出处 《温州大学学报》 2006年第1期53-57,共5页
关键词 词性标注 规则 统计 概率 兼类词 Part-of-speech tagging Rule Statistics Probability Syntactic category
  • 相关文献

参考文献4

  • 1[1]刘颖.计算机语言学[M].北京:商务出版社,2000
  • 2[4]DeRose S.Grammatical Category Disambiguation by Statistical Optimization [J].Computational Linguistics.1998,(14):31-39
  • 3[5]Roger G,Leech G,Sampson G.The Computational Analysis of English:A Corpus-based Approach [M].London:Longman,1987
  • 4黄德根,张丽静,张艳丽,杨元生.规则与统计相结合的兼类词处理机制[J].小型微型计算机系统,2003,24(7):1252-1255. 被引量:6

二级参考文献3

共引文献5

同被引文献30

引证文献4

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部