Japanese Time Expression Recognition by Combining Rules with Statistics

Cited by: 3

Abstract: This paper proposes a Japanese time expression recognition method that combines a rule set, acquired from and strengthened by a custom-defined knowledge base, with a statistical model. Starting from a fine-grained classification of time expressions under the TIMEX2 standard, we take the characteristics of Japanese time words into account and progressively expand and restructure a Japanese time expression knowledge base. The rule set acquired from the knowledge base is optimized and updated accordingly, with the aim of steadily improving recognition precision. In addition, a CRF (conditional random field) statistical model is incorporated to improve the generalization ability of Japanese time expression recognition. Experimental results show an F1 score of 0.8987 on the open test.
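The rule-based half of the approach described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the tiny `KNOWLEDGE_BASE` dictionary, its category names, and the helper functions `build_rule` and `find_time_expressions` are hypothetical and are not the authors' knowledge base or rule set. The CRF component is omitted entirely.

```python
import re

# Hypothetical miniature knowledge base (illustrative only, not the paper's resource).
KNOWLEDGE_BASE = {
    "era": ["平成", "昭和", "令和"],            # Japanese era names
    "relative_day": ["今日", "明日", "昨日"],    # today, tomorrow, yesterday
    "unit": ["年", "月", "日", "時", "分"],      # year, month, day, hour, minute
}


def build_rule(kb):
    """Compile a single regex rule from the knowledge base entries."""
    number = r"[0-9０-９]+"  # half-width or full-width digits
    era = "|".join(map(re.escape, kb["era"]))
    relative = "|".join(map(re.escape, kb["relative_day"]))
    units = "".join(kb["unit"])
    # Either a relative-day word, or an optional era name followed by
    # one or more number+unit pairs (for example 2013年6月).
    pattern = rf"(?:{relative})|(?:{era})?(?:{number}[{units}])+"
    return re.compile(pattern)


def find_time_expressions(text, rule):
    """Return every substring of text matched by the compiled rule."""
    return [m.group(0) for m in rule.finditer(text)]


rule = build_rule(KNOWLEDGE_BASE)
print(find_time_expressions("明日は平成25年6月1日です。", rule))
```

Because the rule is regenerated from the dictionary, adding an entry to `KNOWLEDGE_BASE` and calling `build_rule` again updates the rule, which loosely mirrors the idea of expanding the knowledge base and refreshing the rules derived from it.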

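Authors: ZHAO Ziyu (赵紫玉), XU Jin'an (徐金安), ZHANG Yujie (张玉洁), LIU Jiangming (刘江鸣)

Affiliation: School of Computer and Information Technology, Beijing Jiaotong University

Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 6, pp. 192-200 (9 pages)

Funding: National Natural Science Foundation of China (61370130); International Science and Technology Cooperation Program of the Ministry of Science and Technology (K11F100010); Fundamental Research Funds for the Central Universities (2010JBZ2007); Beijing Key Discipline Co-construction Project (Computer Application Technology); Open Project of the Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences (IIP2010-4); Beijing Jiaotong University Talent Fund (2011RC034)

Keywords: knowledge base; rule set; statistical model

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]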


