Chinese Named Entity Recognition Based on Lexicon and Glyph Features

Cited by: 1
Abstract: Named entity recognition is a fundamental task in natural language processing. For Chinese named entity recognition, the mainstream approach is to use lexicon-based methods to enrich word-internal semantics and word boundary information. However, Chinese characters evolved from pictographs, and their glyphs carry rich entity information that is rarely exploited in this task. This paper proposes a Chinese named entity recognition model based on lexicon and glyph features that combines word information and structural information in a unified way, improving the accuracy of entity matching. The model first enriches semantic information with the SoftLexicon method and optimizes character representations with an improved radical-level embedding; it then strengthens the extraction of potential words and contextual information with a gated convolutional network. Experiments on four benchmark datasets show that the proposed model achieves significant performance improvements over both traditional and recent models.
Authors: YU Shujuan (于舒娟); MAO Xintao (毛新涛); ZHANG Yun (张昀); HUANG Liya (黄丽亚) (College of Electronic and Optical Engineering & College of Flexible Electronics (Future Technology), Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210023, China)
Source: Journal of Chinese Information Processing (《中文信息学报》), CSCD, Peking University Core Journal, 2023, No. 3, pp. 112-122 (11 pages)
Funding: National Natural Science Foundation of China (61977039)
Keywords: Chinese named entity recognition; lexicon; glyph features
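
The abstract names SoftLexicon as the lexicon component but gives no implementation details. As a rough illustration only, the sketch below follows the commonly described SoftLexicon idea of collecting, for each character, the lexicon words in which it appears at the Beginning, in the Middle, at the End, or as a Single-character word. The function name `bmes_word_sets`, the `max_len` parameter, and the toy lexicon are hypothetical and are not taken from the paper; the embedding, radical-level, and gated convolution stages are not covered.

```python
from typing import Dict, List, Set


def bmes_word_sets(
    sentence: str, lexicon: Set[str], max_len: int = 4
) -> List[Dict[str, Set[str]]]:
    """Collect B/M/E/S matched-word sets for every character (illustrative only).

    max_len is an assumed cap on the length of candidate words.
    """
    n = len(sentence)
    sets = [{"B": set(), "M": set(), "E": set(), "S": set()} for _ in range(n)]
    for start in range(n):
        # Try every substring that begins at `start` and is at most max_len long.
        for end in range(start + 1, min(n, start + max_len) + 1):
            word = sentence[start:end]
            if word not in lexicon:
                continue
            if len(word) == 1:
                sets[start]["S"].add(word)
            else:
                sets[start]["B"].add(word)      # first character of the word
                sets[end - 1]["E"].add(word)    # last character of the word
                for mid in range(start + 1, end - 1):
                    sets[mid]["M"].add(word)    # interior characters
    return sets


# Toy lexicon, invented for this example.
toy_lexicon = {"南京", "南京市", "市长", "长江", "长江大桥", "大桥", "江"}
toy_sets = bmes_word_sets("南京市长江大桥", toy_lexicon)
for char, groups in zip("南京市长江大桥", toy_sets):
    print(char, {tag: sorted(words) for tag, words in groups.items()})
```

In a full model, each of the four sets would typically be pooled into a fixed-size vector and concatenated with the character representation; that step is omitted here because the abstract does not specify it.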