A C# Implementation of a Primarily Rule-Based Method for English Sentence Boundary Detection

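Abstract: Using a list of period-bearing abbreviations extracted from English-Chinese dictionaries and a list of common sentence-initial words selected statistically from a corpus, the authors designed RCESBD, an English sentence boundary detection tool written in C#. Cross-checking the two tools against each other showed that RCESBD's accuracy is markedly higher than that of OpenNLP.
Source: Science & Technology Information (《科技信息》), 2014, No. 14, pp. 23-24, 29 (3 pages in total)
Funding: Supported by the 2012 Institute Research Fund of the PLA University of Foreign Languages and by the 2011 project "A Corpus-Based Comprehensive Study of Military English" (grant no. 11BYY126)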
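
The abstract names two resources, a list of abbreviations that end in a period and a list of frequent sentence-initial words, but does not spell out the rules that combine them. The Python sketch below is one plausible reading, not the authors' C# code: the tiny word lists, the names `ABBREVIATIONS`, `SENTENCE_STARTERS`, `is_boundary`, and `split_sentences`, and the decision rule itself are all assumptions made for illustration.

```python
# Hypothetical stand-ins for the paper's two lists; the real lists are
# not given in the abstract.
ABBREVIATIONS = {"mr.", "mrs.", "dr.", "prof.", "e.g.", "i.e.", "etc.", "u.s.", "p.m."}
SENTENCE_STARTERS = {"the", "he", "she", "it", "we", "they", "this", "in", "but"}

OPENERS = "\"'(["
CLOSERS = "\"')]"


def is_boundary(token, next_token):
    """Decide whether `token` ends a sentence (assumed rule, see lead-in)."""
    core = token.rstrip(CLOSERS)
    if not core or core[-1] not in ".!?":
        return False          # no terminal punctuation at all
    if next_token is None:
        return True           # end of text
    if core[-1] in "!?":
        return True           # '!' and '?' are treated as unambiguous
    nxt = next_token.lstrip(OPENERS)
    starts_upper = nxt[:1].isupper()
    if core.lstrip(OPENERS).lower() in ABBREVIATIONS:
        # After a known abbreviation, split only when the next word is a
        # capitalized, frequent sentence-initial word.
        word = nxt.rstrip(".,;:!?" + CLOSERS).lower()
        return starts_upper and word in SENTENCE_STARTERS
    # Ordinary period: split when the next token looks like a sentence start.
    return starts_upper or nxt[:1].isdigit()


def split_sentences(text):
    """Split whitespace-tokenized text into sentences using is_boundary."""
    tokens = text.split()
    sentences, current = [], []
    for i, tok in enumerate(tokens):
        current.append(tok)
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        if is_boundary(tok, nxt):
            sentences.append(" ".join(current))
            current = []
    if current:
        sentences.append(" ".join(current))
    return sentences


if __name__ == "__main__":
    sample = "Dr. Smith arrived at 5 p.m. He met Prof. Jones in the U.S. The talk went well."
    for sentence in split_sentences(sample):
        print(sentence)
```

In this sketch the abbreviation list suppresses splits after tokens such as "Dr." and "Prof.", while the sentence-initial list restores a split when an abbreviation such as "p.m." or "U.S." happens to close a sentence. A rule of this kind will still miss cases where an abbreviation is followed by a capitalized word outside the starter list.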