
Chinese Time Expression Recognition Based on Automatically Generated Basic-Time-Unit Rules (cited by: 16)
Abstract: This paper proposes a time expression recognition (TER) algorithm based on regular grammar. Rules are built from "basic time units", which improves recall. An error-driven rule-pruning algorithm then removes the noise introduced by the training corpus, which improves precision. Together, the two steps raise overall system performance. On the ACE07 Chinese corpus the method reaches an F-score of 89.9%, significantly better than previously reported results. The algorithm is general and extensible, and with further refinement it could be applied more broadly.
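The two steps named in the abstract (rules built from basic time units, then error-driven pruning) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the paper's implementation: the unit inventory `UNITS`, the helper names `abstract_rule`, `compile_rule`, `generate_rules`, and `prune_rules`, and the `min_precision` threshold are all hypothetical, since the abstract does not give the actual unit set or pruning criterion.

```python
import re

# Hypothetical basic time units (illustrative only, not the paper's inventory).
UNITS = [
    ("NUM", r"[0-9〇零一二三四五六七八九十两]+"),
    ("YEAR", r"年"),
    ("MONTH", r"月"),
    ("DAY", r"[日号]"),
]
TOKENIZER = re.compile("|".join(f"(?P<{name}>{pat})" for name, pat in UNITS))
FRAGMENT = dict(UNITS)


def abstract_rule(expression):
    """Map an annotated time expression to a tuple of unit names.

    Returns None if any character is not covered by a basic unit.
    """
    names, pos = [], 0
    while pos < len(expression):
        m = TOKENIZER.match(expression, pos)
        if m is None:
            return None
        names.append(m.lastgroup)
        pos = m.end()
    return tuple(names)


def compile_rule(rule):
    """Build a regular expression by concatenating the unit fragments."""
    return re.compile("".join(FRAGMENT[name] for name in rule))


def generate_rules(annotated_expressions):
    """Collect one abstract rule per distinct unit sequence in the training data."""
    rules = set()
    for expr in annotated_expressions:
        rule = abstract_rule(expr)
        if rule is not None:
            rules.add(rule)
    return rules


def prune_rules(rules, corpus, min_precision=0.5):
    """Error-driven pruning sketch: drop rules that mostly produce wrong matches.

    corpus is a list of (sentence, set_of_gold_expression_strings) pairs.
    Rules that never fire on the corpus are kept.
    """
    kept = set()
    for rule in rules:
        regex = compile_rule(rule)
        right = wrong = 0
        for sentence, gold in corpus:
            for m in regex.finditer(sentence):
                if m.group() in gold:
                    right += 1
                else:
                    wrong += 1
        if right + wrong == 0 or right / (right + wrong) >= min_precision:
            kept.add(rule)
    return kept


# Usage: learn rules from two annotated expressions, then prune on held-out text.
rules = generate_rules(["2007年", "5号"])
dev = [("2007年开幕", {"2007年"}), ("他穿5号球衣", set()), ("她住在3号楼", set())]
kept = prune_rules(rules, dev)
```

Because a rule is stored as a unit sequence rather than a literal string, the rule learned from "2007年" also matches the unseen "1998年", which is the recall effect the abstract describes. In this toy data the `("NUM", "DAY")` rule fires only on non-temporal text ("5号球衣", "3号楼") and is pruned, which is the precision effect.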
Source: Journal of Chinese Information Processing (《中文信息学报》), CSCD, Peking University Core Journal, 2010, No. 4, pp. 3-10 (8 pages)
Funding: National Natural Science Foundation of China (60503070)
Keywords: computer application; Chinese information processing; time expression recognition; basic time unit; Timex2; error-driven; regular expression

