
Chinese Temporal Expression Recognition Algorithm Based on Optimization of Dictionary Features and Dependency Parsing (基于词典特征优化和依存关系的中文时间表达式识别)

Cited by: 4
Abstract: This paper proposes a Chinese temporal expression recognition method based on optimized dictionary features and dependency relations. First, to address inaccurate boundary detection and long-distance dependencies in Chinese temporal expressions, the traditional temporal dictionary feature is refined by splitting the temporal dictionary into a temporal word dictionary and a temporal unit dictionary. Second, because traditional machine-learning-based recognition methods ignore the internal structure of temporal expressions, dependency features are extracted on top of the optimized dictionary features to capture structural information. Finally, the basic features, dictionary features, and dependency features are combined, and recognition is performed with a conditional random field (CRF) model. Experiments on a Chinese corpus show that the method achieves good recognition results.
Source: Journal of Information Engineering University (《信息工程大学学报》), 2016, No. 4, pp. 490-495 (6 pages)
Funding: National Social Science Fund of China (14BXW028)
Keywords: temporal expression; temporal expression recognition; temporal dictionary; conditional random fields; dependency parsing
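
The feature design described in the abstract (a split dictionary feature plus dependency features, fed to a CRF) could be sketched roughly as follows. This is only an illustration under stated assumptions: the dictionary entries, the feature names, the `Token` type, and the hand-written dependency parse are all hypothetical, since the abstract does not give the paper's actual dictionaries or feature templates.

```python
from typing import Dict, List, NamedTuple

# Hypothetical dictionary entries, for illustration only; the paper's real
# temporal word and temporal unit dictionaries are not given in the abstract.
TIME_WORDS = {"今天", "明天", "昨天", "上午", "下午", "春节"}
TIME_UNITS = {"年", "月", "日", "时", "分", "秒", "周"}


class Token(NamedTuple):
    word: str
    pos: str     # part-of-speech tag
    head: int    # index of the dependency head, -1 for the root
    deprel: str  # dependency relation label to the head


def dict_tag(word: str) -> str:
    """Return which of the two dictionaries (if any) the word matches."""
    if word in TIME_WORDS:
        return "TIME_WORD"
    # Assumption: a numeral followed by a unit (e.g. "2016年") is a unit match.
    if word in TIME_UNITS or (
        len(word) > 1 and word[-1] in TIME_UNITS and word[:-1].isnumeric()
    ):
        return "TIME_UNIT"
    return "NONE"


def token_features(sent: List[Token], i: int) -> Dict[str, str]:
    """Build basic, dictionary, and dependency features for token i."""
    tok = sent[i]
    feats = {
        # Basic features
        "word": tok.word,
        "pos": tok.pos,
        # Dictionary feature, split into word vs. unit dictionary
        "dict": dict_tag(tok.word),
        # Dependency features: relation to the head and the head's properties
        "deprel": tok.deprel,
    }
    if tok.head >= 0:
        head = sent[tok.head]
        feats["head_word"] = head.word
        feats["head_dict"] = dict_tag(head.word)
    else:
        feats["head_word"] = "ROOT"
        feats["head_dict"] = "NONE"
    return feats


# Hand-written example parse (hypothetical labels): "他 2016年 4月 到达 北京"
sentence = [
    Token("他", "PN", 3, "nsubj"),
    Token("2016年", "NT", 2, "nn"),
    Token("4月", "NT", 3, "tmod"),
    Token("到达", "VV", -1, "root"),
    Token("北京", "NR", 3, "dobj"),
]
features = [token_features(sentence, i) for i in range(len(sentence))]
```

Each sentence becomes a list of per-token feature dictionaries, which is a common input shape for CRF sequence labelers. Exposing the head's dictionary tag lets the model see that "2016年" attaches to another temporal unit, which is one plausible way to use dependency structure for boundary decisions.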
