Japanese Time Expression Recognition by Combining Rules with Statistics (original title: 规则与统计相结合的日语时间表达式识别)

Cited by: 3
Abstract: This paper proposes a method for recognizing Japanese time expressions that combines a rule set, derived from and strengthened by a custom knowledge base, with a statistical model. Building on a fine-grained classification of time expressions under the TIMEX2 standard, and taking the characteristics of Japanese time words into account, we progressively expand and restructure a knowledge base of Japanese time expressions. The rule set derived from the knowledge base is optimized and updated accordingly, with the aim of steadily improving recognition precision. In addition, a CRF statistical model is integrated to improve the generalization ability of Japanese time expression recognition. Experimental results show an F1 score of 0.8987 on the open test.
Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 6, pp. 192-200 (9 pages)
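Funding: National Natural Science Foundation of China (61370130); International Science and Technology Cooperation Program of the Ministry of Science and Technology (K11F100010); Fundamental Research Funds for the Central Universities (2010JBZ2007); Beijing Key Discipline Co-construction Project (Computer Application Technology); Open Project of the Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences (IIP2010-4); Beijing Jiaotong University Talent Fund (2011RC034)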
Keywords: knowledge base; rule set; statistical model
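
The abstract describes deriving rules from a knowledge base of time words and combining their output with a CRF model. The sketch below is only an illustration of that general idea and is not the authors' implementation: the `TIME_UNITS` list, the single regex rule, the feature names, and the helper functions are all assumptions made for the example. It shows how a rule match might be turned into per-character B/I/O tags that a CRF toolkit could consume as features, and how a span-level F1 score can be computed. No CRF is trained here.

```python
import re

# Hypothetical mini knowledge base of Japanese time units (illustrative only;
# the paper's actual knowledge base is not reproduced here).
TIME_UNITS = [
    r"\d{1,4}年", r"\d{1,2}月", r"\d{1,2}日",
    r"\d{1,2}時", r"\d{1,2}分",
    "午前", "午後", "今日", "明日", "昨日", "来週", "先月",
]

# One rule derived from the knowledge base: a run of adjacent time units.
TIME_RULE = re.compile("(?:" + "|".join(TIME_UNITS) + ")+")


def rule_spans(text):
    """Return (start, end) character spans matched by the rule."""
    return [m.span() for m in TIME_RULE.finditer(text)]


def char_features(text):
    """Per-character features, including the rule's B/I/O tag.

    A statistical sequence model such as a CRF could take these
    dictionaries as input; the feature set is an assumption.
    """
    tags = ["O"] * len(text)
    for start, end in rule_spans(text):
        tags[start] = "B"
        for i in range(start + 1, end):
            tags[i] = "I"
    return [
        {"char": ch, "is_digit": ch.isdigit(), "rule_tag": tag}
        for ch, tag in zip(text, tags)
    ]


def f1_score(gold, predicted):
    """Span-level F1 between two collections of (start, end) spans."""
    gold, predicted = set(gold), set(predicted)
    correct = len(gold & predicted)
    if correct == 0:
        return 0.0
    precision = correct / len(predicted)
    recall = correct / len(gold)
    return 2 * precision * recall / (precision + recall)


text = "会議は2013年6月5日午後3時に始まる"
print([text[s:e] for s, e in rule_spans(text)])
```

Expanding the knowledge base in this sketch amounts to adding entries to `TIME_UNITS` and recompiling the rule, which loosely mirrors the progressive knowledge-base expansion the abstract mentions.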