期刊文献+

基于用法的现代汉语连词结构短语识别研究 被引量:8

Modern Chinese Conjunction Phrase Recognition Based on Usage
下载PDF
导出
摘要 连词能够连接词语、短语、小句、句子乃至句群,连词结构短语是连词所连接对象的一种,不同的连词形成不同长度、不同关系的连词结构短语。该文根据虚词用法知识库中的连词用法,构建了连词结构短语识别规则,实现了基于规则的连词结构短语识别,并将连词用法作为特征采用条件随机场模型实现了基于统计的连词结构短语识别。实验结果表明,统计的识别效果高于规则的识别效果,连词用法能够较好地用于连词结构短语的识别中。 Conjunctions connect words, phrases, clauses, sentences and even sentence groups. The conjunction phrase is the words or phrases connected by conjuctions, bearing different lengths and relations. According to conjunction usage in the functional word usage knowledge base, the paper formulates a rule based method for the recognition of conjunction structure phrases. Meanwhile, the paper adopts the conditional random field to build a statistical model for the conjunction phrase recognition based on the conjunction usage. Results indicate that the statistical method performs better than the rule method, and conjunction usage is beneficial to the conjunction phrase recogni tion.
出处 《中文信息学报》 CSCD 北大核心 2012年第6期72-78,共7页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60970083) 模式识别国家重点实验室开放课题基金资助项目 河南省科技创新人才杰出青年基金资助项目(104100510026)
关键词 连词结构短语 连词用法 条件随机场 conjunction phrase conjunction usages conditional random fields
  • 相关文献

参考文献7

  • 1王东波,陈小荷,年洪东.基于条件随机场的有标记联合结构自动识别[J].中文信息学报,2008,22(6):3-7. 被引量:9
  • 2Dongbo Wang, Danhao Zhu, Xinning Su, et al. Automatic Identification of Parallel Structure Based on Conditional Random Field[C]//Proceedings of the 3rd International Symposium on Computer Science and Computational Technology ( ISCSCT '10), Jiaozuo, 2010 : 400-404.
  • 3Hongying Zan, Lijuan Zhou, Kunli Zhang. Studies on the Automatic Recognition of Modern Chinese Conjunction Usages[J]. Lecture Notes in Computer Science, 2011,6838:472-479.
  • 4Lafferty J, MeCallum A, Pereira F. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data[C]//Proceedings of the 18th ICML-01, Montreal, 2001:282-289.
  • 5Hai Zhao, Changning Huang, Mu Li. An Improved Chinese Word Segmentation System with Conditional Random Field[C]//Proeeedings of the 5th SIGHAN Workshop on Chinese Language Processing(SIGHAN- 5). Sydeny, 2006 : 162-165.
  • 6周俊生,戴新宇,尹存燕,陈家骏.基于层叠条件随机场模型的中文机构名自动识别[J].电子学报,2006,34(5):804-809. 被引量:112
  • 7丁德鑫,曲维光,徐涛,董宇.基于CRF模型的组合型歧义消解研究[J].南京师范大学学报(工程技术版),2008,8(4):73-76. 被引量:8

二级参考文献22

  • 1刘群,张华平,俞鸿魁,程学旗.基于层叠隐马模型的汉语词法分析[J].计算机研究与发展,2004,41(8):1421-1429. 被引量:198
  • 2周俊生,戴新宇,尹存燕,陈家骏.基于层叠条件随机场模型的中文机构名自动识别[J].电子学报,2006,34(5):804-809. 被引量:112
  • 3李双龙,刘群,王成耀.基于条件随机场的汉语分词系统[J].微计算机信息,2006,22(10S):178-180. 被引量:15
  • 4周强.汉语语料库的短语自动划分和标注研究[D].北京:北京大学,2002.
  • 5Fei Sha,Fernando Pereira. Shallow Parsing with Con ditional Random Fields [C]//Proceedings of Human Language Technology Conference and North Ameri- canChapter of the Association for Computational Linguistics (HLT-NAACL) ,2003,135 136.
  • 6[7]Luo Xiao,Sun Maosong,Tsou B K.Covering ambiguity resolution in Chinese word segmentation based on contextual information[C]// Proceedings of the 19th International Conference on Computational Linguistics.Taiwan:[s.n.],2002:598-604.
  • 7[10]John Lafferty,Andrew McCallum,Femando Pereira.Conditional random fields:Probabilistic models for segmenting and labeling sequence data[C]// Proceedings of the 18th ICML.San Francisco:Mogan Konfmann,2001:282-289.
  • 8Wang Houfeng,Shi Wuguang.A simple rule-based approach to organization name recognition in chinese text[A].Proc of 5th CICLing[C].LNCS 3406,Heidelberg,German:Springer-Verlag,2005.769-772.
  • 9Hongkui Yu,Huaping Zhang,Quan Liu.Recognition of Chinese organization name based role tagging[A].Proc of Advances in Computation of Oriental Languages[C].Beijing:Tsinghua University Press,2003.79-87.
  • 10McCallum A,Freitag D,Pereira F.Maximum entropy Markov models for information extraction and segmentation[A].Proc of 17th ICML[C].Stanford,California,USA:Morgan Kaufmann,2000.591-598.

共引文献126

同被引文献82

  • 1王东波.基于规则的单层单标记联合结构自动识别[J].文教资料,2008(9):29-31. 被引量:6
  • 2吴云芳.V+V形成的并列结构[J].语言研究,2004,24(3):45-51. 被引量:4
  • 3吴云芳.并列成分中心语语义相似性考察[J].当代语言学,2005,7(4):305-315. 被引量:15
  • 4刘锐 咎红英 张坤丽.现代汉语副词用法的自动识别研究.计算机科学,2008,(8):172-174.
  • 5袁应成,咎红英,张坤丽,等.基于规则的虚词用法自动标注算法设计与系统实现[c].苏州:第11届汉语词汇语意学会议论文集,2010:163-169.
  • 6周溢辉,昝红英,柴玉梅,等.基于主观认知的汉语助词和语气词区分问题研究[c].苏十1,1:第11届汉语词汇语意学会议论文集,2010:382-388.
  • 7昝红英,张坤丽,柴玉梅,俞士汶.现代汉语虚词知识库的研究[J].中文信息学报,2007,21(5):107-111. 被引量:27
  • 8董振东 董强.[EB/OL].知网http://www.keenage.com,2000.
  • 9俞士汶 朱学锋 刘云.现代汉语广义虚词知识库的建设.汉语语言与计算学报,2003,(1):89-98.
  • 10周丽娟,张坤丽,袁应成,等.基于规则的现代汉语连词用法自动识别研究[c]//武汉第五届全国青年计算语言学研讨会,2010:96-102.

引证文献8

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部