期刊文献+

一个中文实体链接语料库的建设 被引量:7

Construction of a Chinese Entity Linking Corpus
下载PDF
导出
摘要 鉴于现有中文实体链接基准语料库的缺乏,在ACE2005中文语料库和中文维基百科的基础上,通过自动构造和人工标注的方法,构建一个中文实体链接语料库及其相关的中文知识库。与传统的英文实体链接语料库不同,构造的中文实体链接语料库是基于实体而非单个实体指称(Mention)。中文实体链接语料库的构建,将为中文实体链接研究提供一个可用的基准平台。 In view of the lack of Chinese entity linking benchmark corpus, the methodology of automatic construction and manual annotation was applied to build a Chinese entity linking corpus as well as its related Chinese knowledge base derived from the ACE2005 Chinese corpus and the Chinese Wikipedia resource. Contrary to traditional English entity linking corpus, this corpus is based on entities rather than individual entity mentions. The construction of Chinese entity linking corpus provides a benchmark platform to the Chinese entity linking research community.
出处 《北京大学学报(自然科学版)》 EI CAS CSCD 北大核心 2015年第2期321-327,共7页 Acta Scientiarum Naturalium Universitatis Pekinensis
基金 国家自然科学基金(61373096 90920004) 江苏省高校自然科学重大项目(11KJA520003)资助
关键词 中文 实体链接 语料库 Chinese entity linking corpus
  • 相关文献

参考文献12

  • 1McNamee P, Simpson H, Dang H T, et al. Overview of the TAC 2009 knowledge base population track// Second Text Analysis Conference (TAC 2009). Gaith- ersburg, MD, 2009:111-113.
  • 2ACE (Automatic Content Extraction). Chinese Anno- tation Guidelines for Entities Version 5.5 [EB/OL]. (2005-05-05) [2014-05-20]. http://www.ldc.upenn. edu/Projects/ACE/.
  • 3Ji H, Grishman R, Dang H T, et al. Overview of the TAC 2010 knowledge base population track// Third Text Analysis Conference (TAC 2010). Gaithersburg, MD, 2010:141-165.
  • 4Ji H, Grishman R, Dang H T, et al. Overview of theTAC 2011 knowledge base population track//Fourth Text Analysis Conference (TAC 2011). Gaithersburg, MD, 2011:121-153.
  • 5TCCI.中文微博实体链接评测大纲[R/OL].(2013)【2014-05-20].http://tcci.ccf.org.cn/conference/2013/dldoc/ev04.pdf.
  • 6朱敏,贾真,左玲,吴安峻,陈方正,柏玉.中文微博实体链接研究[J].北京大学学报(自然科学版),2014,50(1):73-78. 被引量:12
  • 7Cucerzan S. Large-scale named entity disambiguation based on Wikipedia data // Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). Prague, 2007: 708-716.
  • 8Csomai A, Mihalcea R. Linking documents to encyclopedic knowledge. IEEE Intelligent Systems, 2008, 23(5): 34-41.
  • 9Milne D, Witten I H. Learning to link with Wikipedia //CIKM'08: Proceeding of the 17th ACM Conference on Information and Knowledge Management. Hong Kong, 2008:509-518.
  • 10Kulkarni S, Singh A, Ramakrishnan G, et al. Collective annotation of wikipedia entities in web text // KDD'09: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Paris, 2009:457-466.

二级参考文献6

共引文献11

同被引文献39

引证文献7

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部