Hybrid Method to Chinese Base Noun Phrase Recognition (混合的汉语基本名词短语识别方法)
Cited by: 7
Abstract: This paper proposes a hybrid model for recognizing Chinese base noun phrases (BaseNP) that combines grammar rules, statistical methods, and a combined classifier. The model uses the word information, part-of-speech information, and contextual syntactic information of BaseNPs to build the combined classifier and improve the accuracy of its decisions. In experiments on the Chinese Treebank (CTB5.0), the F-score reaches 90.09%, which shows that the method recognizes Chinese BaseNPs effectively.
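The abstract does not say how the component recognizers are combined or how the F-score is computed, so the following is only a minimal sketch under stated assumptions: each recognizer labels every token with a B/I/O tag, the combination is a simple per-token majority vote, and the F-score is computed over exactly matching phrase spans. The function names (`combine_by_vote`, `extract_chunks`, `f_score`) and the toy tag sequences are hypothetical and are not taken from the paper.

```python
from collections import Counter


def combine_by_vote(predictions):
    """Per-token majority vote over several B/I/O tag sequences.

    Assumption: ties are broken in favour of the earliest classifier.
    """
    combined = []
    for tags in zip(*predictions):
        counts = Counter(tags)
        best = max(counts.values())
        # Pick the first tag (in classifier order) that reaches the top count.
        combined.append(next(t for t in tags if counts[t] == best))
    return combined


def extract_chunks(tags):
    """Return the set of (start, end) spans, end exclusive, encoded by B/I/O tags."""
    chunks = set()
    start = None
    for i, tag in enumerate(tags):
        if tag == "B" or (tag == "I" and start is None):
            # A new phrase begins; close any phrase that is still open.
            if start is not None:
                chunks.add((start, i))
            start = i
        elif tag == "O" and start is not None:
            chunks.add((start, i))
            start = None
    if start is not None:
        chunks.add((start, len(tags)))
    return chunks


def f_score(gold_tags, pred_tags):
    """Phrase-level F-score: harmonic mean of precision and recall over exact spans."""
    gold = extract_chunks(gold_tags)
    pred = extract_chunks(pred_tags)
    correct = len(gold & pred)
    if correct == 0:
        return 0.0
    precision = correct / len(pred)
    recall = correct / len(gold)
    return 2 * precision * recall / (precision + recall)


# Toy example: three hypothetical recognizers tagging a five-token sentence.
gold = ["B", "I", "O", "B", "O"]
rule_based = ["B", "I", "O", "O", "O"]
statistical = ["B", "I", "O", "B", "O"]
third_model = ["B", "O", "O", "B", "O"]

combined = combine_by_vote([rule_based, statistical, third_model])
print(combined, f_score(gold, combined))
```

In this toy case each recognizer makes a different mistake, so the vote recovers the gold tagging even though two of the three inputs are wrong somewhere. Exact span matching is a strict criterion: a phrase with one wrong boundary counts as both a precision error and a recall error.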
Source: Computer Engineering (《计算机工程》), 2009, No. 20, pp. 199-201 (3 pages). Indexed in: CAS, CSCD, PKU Core (北大核心).
Funding: National Natural Science Foundation of China (0673041); National "863" Program (006AA01Z147).
Keywords: base noun phrase (BaseNP); rule templates; combined classifier
