期刊文献+

基于统计的中文机构名自动识别 被引量:1

Chinese organization automatic recognition based on statistical method
原文传递
导出
摘要 通过对中文机构名的语法语义特性进行分析,将中文机构名分成前部词和特征词,提出了一种基于统计的识别方法。使用成熟语料库的训练数据,计算候选机构名的特征词可信度、前部词首词可信度和前部词中部可信度,最终得到机构名构词可信度,并与给定阈值比较,实现了中文机构名识别,在开放性实验中,达到了85.57%的召回率和94.37%的准确率。 By analysing the syntactical and semantical characteristics of Chinese organization and dividing it into the forward word and the special word, an approach based on statistical method is put forward about Chinese organization automatic recognition. The credibilities of both the special word and the forward word for the candidate organization name are computed by using the data from the trained corpus to decide the final credibility of organization name. This final credibility is compared with the given threshold to decide whether it is an organization name. After the primary test, this method can get 85.57% recall, and 94.37% precision.
作者 夏赟 李志蜀
出处 《四川大学学报(自然科学版)》 CAS CSCD 北大核心 2009年第3期613-617,共5页 Journal of Sichuan University(Natural Science Edition)
关键词 自然语言处理 中文机构名识别 前部词 特征词 natural language processing, Chinese organization recognition, forward word, special word
  • 相关文献

参考文献4

二级参考文献52

  • 1刘群,张华平,俞鸿魁,程学旗.基于层叠隐马模型的汉语词法分析[J].计算机研究与发展,2004,41(8):1421-1429. 被引量:197
  • 2季姮,罗振声.基于统计和规则的中文姓名自动辨识[J].语言文字应用,2001(1):14-18. 被引量:13
  • 3罗智勇 宋柔.现代汉语自动分词中专名的一体化、快速识别方法[A]..ICCC,Singapore[C].,2001.11..
  • 4Sundheim B M. Named entity task definition, version 2.1. In:Proc. of the Sixth Message Understanding Conf. 1995. 319~332
  • 5Borthwick A. A Maximum Entropy Approach to Named Entity Recognition: [Ph. D]. New York University. Department of Computer Science, Courant Institute 1999
  • 6Humphreys K, Gaizauskas R, Azzam S, et al. Description of the LaSIE-Ⅱ system as used for MUC-7. In:Proc. of the 7th Message Understanding Conference (MUC-7), 1998
  • 7URL http://www. ltg. ed. ac. uk
  • 8Chen H H, Ding Y W, Tsai S C,et al. Description of the NTU System Used for MET2. In: Proc. of 7th Message Understanding Conference, 1998
  • 9Black W J, Rinaldi F,Mowatt D. Facile: Description of the NE System Used For MUC-7. In:Proc. of 7th Message Understanding Conf. 1998
  • 10Fukumoto J, Shimohata M, Masui F, Sasaki M. Oki Electric Industry: Description of the Oki System as Used for MET-2. In:Proc. of 7th Message Understanding Conf. 1998

共引文献237

同被引文献13

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部