期刊文献+

中文文本的事件时空信息标注 被引量:12

Annotation of Spatial-Temporal Information of Event in Chinese Text
下载PDF
导出
摘要 基于文本数据源的地理空间信息解析研究侧重于地名实体、空间关系等空间语义角色的标注和抽取,忽略了丰富的时间信息、主题事件信息及其时空一体化信息。该文通过分析中文文本中事件信息描述的语言特点和事件的时空语义特征,基于地名实体和空间关系标注研究成果,制定了中文文本的事件时空信息标注体系和标注模式,并以GATE(General Architecture for Text Engineering)为标注平台,以网页文本为数据源,构建了事件时空信息标注语料库。研究成果为中文文本中地理信息的语义解析提供标准化的训练和测试数据。 Text has become an important data source of geo-spatial information. Currently, researches on structured geo-spatial information expression focused on extraction of spatial information,such as place names and spatial rela- tions in text. However, abundant temporal information, event information and spatial-temporal information are ig- nored. In this paper, annotation of spatial-temporal information of event in Chinese text is proposed. Firstly, the lin guistic characteristics of spatial-temporal information of event in Chinese text are analyzed. Then, an annotation schema is presented,and the annotation specification is decribed in detail. Finally, GATE (General Architecture for Text Engineering) is introduced as the annotation platform,and a large-scale annotated corpus based on the Web da ta source is developed and evaluated. This study effectively addresses the current lack of related specification and standard data for interpretation of event and spatial-temporal information in Chinese text.
出处 《中文信息学报》 CSCD 北大核心 2016年第3期213-222,共10页 Journal of Chinese Information Processing
基金 国家自然科学基金(41401451 40971231) 国家863项目(2012AA12A403-3) 中央高校基本科研业务项目(JZ2014HGBZ0064) 江苏省测绘地理信息科研项目(JSCHKY201502)
关键词 中文文本 时空信息 事件 标注体系 标注语料库 Chinese text spatial-temporal information event annotation schema annotated corpus
  • 相关文献

参考文献16

  • 1闾国年,袁林旺,俞肇元.GIS技术发展与社会化的困境与挑战[J].地球信息科学学报,2013,15(4):483-490. 被引量:24
  • 2Palkowsky B,MetaCarta I. A New Approach to Information Discovery-Geography Really Does Matter[C]//Proceedings of the SPE Annual Technical Conference and Exhibition,United States,2005: 3231-3234.
  • 3Goodchild M F. Twenty Years of Progress: GIScience in 2010[J]. Journal of Spatial Information Science,2013,1: 3-20.
  • 4俞士汶,朱学锋,段慧明.大规模现代汉语标注语料库的加工规范[J].中文信息学报,2000,14(6):58-64. 被引量:30
  • 5冯志伟.标准通用置标语言SGML及其在自然语言处理中的应用[J].当代语言学,1998(4):2-12. 被引量:8
  • 6俞士汶,段慧明,朱学锋,孙斌.北京大学现代汉语语料库基本加工规范[J].中文信息学报,2002,16(5):49-64. 被引量:126
  • 7Kim J D,Ohta T,Tsujii J I. Multilevel Annotation for Information Extraction Introduction to the GENIA Annotation[J].Linguistic Modeling of Information and Markup Languages,2010,41: 125-142.
  • 8Leidner J L. Toponym Resolution in Text: Annotation,Evaluation and Applications of Spatial Grounding of Place Names [D]. Edinburgh: University of Edinburgh,2008.
  • 9Blaylock N,Swain B,Allen J. TESLA: A Tool for Annotating Geospatial Language Corpora[C]//Proceedings of the 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics,2009: 45-28.
  • 10Leidner J L. Toponym Resolution in Text: Annotation,Evaluation and Applications of Spatial Grounding of Place Names[J]. University of Edinburgh,2007,41(2): 124-126.

二级参考文献112

共引文献390

同被引文献161

引证文献12

二级引证文献145

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部