期刊文献+

语义信息与CRF结合的汉语功能块自动识别 被引量:3

Chinese Functional Chunk Parsing Employing CRF and Semantic Information
下载PDF
导出
摘要 为了构建汉语功能块自动识别系统,该文利用条件随机域模型对经过正确词语切分和词性标注处理的汉语句子进行功能块边界识别和功能信息标注处理,通过在特征提取阶段优化组合丰富的上下文特征,得到功能块识别的精确率、召回率和F1-measure值分别为85.84%、85.07%和85.45%。在此基础上,该文引入由词义聚合关系将汉语单词组织起来的《同义词词林》作为语义资源,把其中的语义信息作为特征加入到功能块的识别过程,缓解了数据稀疏以及歧义问题对识别结果造成的影响,使得上述三个性能指标分别提高到86.21%、85.31%和85.76%。 We focus on building a system for labeling Chinese functional chunks automatically,through detecting the boundary of Chinese functional chunks and labeling the functional information in a sentence with correctly word segmenting and POS tagging.This paper proposes an approach that combines the feature template optimizing strategy with Conditional Random Field Model for labeling Chinese functional chunks automatically.On the testing data set,the precision,recall and F-1 measure of Chinese functional chunks reaches 85.84%,85.07% and 85.45% respectively.On the basis of that,existing language resources Chinese thesaurus "Tongyici Cilin" is introduced into the processing module,from which the semantic information will be added to the feature template to remit the effect of data sparseness and ambiguous problem.In this case,the three performance indexes are increased to 86.21%、85.31% and 85.76% respectively.
出处 《中文信息学报》 CSCD 北大核心 2011年第5期53-59,共7页 Journal of Chinese Information Processing
基金 中央高校基本科研业务费专项资金资助(DUT10RW202)
关键词 汉语功能块 条件随机域(CRFs) 语义信息 歧义结构 Chinese functional chunk Conditional Random Fields(CRFs) semantic information ambiguous structure
  • 相关文献

参考文献26

  • 1周强.汉语基本块描述体系[J].中文信息学报,2007,21(3):21-27. 被引量:25
  • 2周强,李玉梅.汉语块分析评测任务设计[J].中文信息学报,2010,24(1):123-128. 被引量:9
  • 3Steven Abney. Parsing by chunks [C]//Robert Betwick, Steven Abney and Carol Tenny (eds.). Principle-Based Parsing. Dordrecht: Kluwer Academic Publishers, 1991, 257-278.
  • 4李珩,杨峰,朱靖波,姚天顺.基于增益的隐马尔科夫模型的文本组块分析[J].计算机科学,2004,31(2):152-154. 被引量:9
  • 5李珩,朱靖波,姚天顺.基于SVM的中文组块分析[J].中文信息学报,2004,18(2):1-7. 被引量:50
  • 6李素建,刘群,杨志峰.基于最大熵模型的组块分析[J].计算机报,2003,1722-1727.
  • 7Fei Sha, Fernando Pereira. Shallow parsing with conditional random fields [C]//Proc. of Human Language Technology/North American chapter of the Association for Computational Linguistics annual meeting. Edmonton: 2003, 213-220.
  • 8Yongmei Tan, Tianshun Yao, Qing Chen and Jingbo Zhu. Applying conditional random fields to Chinese shallow parsing [C]//Proc. of CICLing-2005. Mexico: 2005, 167-176.
  • 9GuoDong Zhou, Jian Su, TongGuan Tey. Hybrid text chunking [C]//Proc. of CoNLL-2000 and LLL-2000, Lisbon, Portugal: 2000, 163-165.
  • 10Rob Koeling. Chunking with maximum entropy models [C]//Proc. of CoNLL-2000 and LLL-2000, Lisbon, Portugal: 2000, 139-141.

二级参考文献110

共引文献174

同被引文献34

引证文献3

二级引证文献38

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部