期刊文献+

韩国语名词短语结构特征分析及自动提取 被引量:6

Structure Characteristic Analysisand Automatic Extraction for Korean Noun Phrase
下载PDF
导出
摘要 名词短语作为语言中一种普遍的语法现象,在自然语言处理领域日益受到了研究人员的关注。目前,对其研究范围主要集中在边界识别、语法分析、语义分析及其分类等方面。该文通过研究分析韩国语书面语名词短语的左右边界规则,从大规模标注语料库中自动提取出名词短语。实验结果表明:语料中的高频名词短语相对集中于8个类型之中。根据提取结果分别建立不同类型的名词短语库,为进一步建立双语平行短语语料库打下基础,以便于以后的机器翻译、信息检索等自然语言信息处理工作。 These years, noun phrase,as a common grammatical phenomenon,has attracted eyes of many scholars in the field of language processing.At present,most researches on noun phrase lie in boundary identification grammatical analysis,semantic analysis,categorization and some other aspects.This thesis abstractsnoun phrases from a large-scale tagged corpus through studying and analyzing rules of left and right boundaries of noun phrases in written Korean.From the experimental result,we can see that high-frequency noun phrases mainly lie in 8 categories.Different kinds of corpus for noun phrases can be built according to the result of the abstract,which lays the foundation of building paralell corpus.It will also be convenient for machine translation,information retrieval and other work in language information processing in the future.
出处 《中文信息学报》 CSCD 北大核心 2013年第5期205-210,共6页 Journal of Chinese Information Processing
关键词 韩国语 名词短语 标注语料库 边界界定 自动提取 Korean noun phrase tagged corpus boundary identification automatic extraction
  • 相关文献

参考文献3

  • 1K W Church.A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Test[A].Proceedings of the Second Conference on Applied Natural Language Processing.1988:136-143.
  • 2赵军,黄昌宁.基于转换的汉语基本名词短语识别模型[J].中文信息学报,1999,13(2):1-7. 被引量:41
  • 3李基文[韩].东亚国语大辞典[D].斗山东亚,1997:754.

二级参考文献3

  • 1张卫国.三种定语、三类意义及三个槽位[J].中国人民大学学报,1996,(4):97-100.
  • 2张卫国,中国人民大学学报,1996年,4期,97页
  • 3梅家驹,同义词词林,1983年

共引文献40

同被引文献36

  • 1李文中.平行语料库设计及对应单位识别[J].当代外语研究,2010(9):22-27. 被引量:18
  • 2晋耀红.基于语境框架的文本相似度计算[J].计算机工程与应用,2004,40(16):36-39. 被引量:26
  • 3王荣波,池哲儒.基于词类串的汉语句子结构相似度计算方法[J].中文信息学报,2005,19(1):21-29. 被引量:28
  • 4张仕仁.汉语复句的结构分析[J].中文信息学报,1994,8(4):43-54. 被引量:13
  • 5Chruch K W. A Stochastic Parts Program and Noun Phrase for Unrestricted Test : proceedings of the 2nd Conference on Applied Natural Language Processing, Austin, TX [ C ]. USA : Kluwer Academic Publicshers, 1988 : 136-- 142.
  • 6Ramshaw L, Marcus M. Text Chunking Using Transfor- mation-Based Learning [C] //Proceedings of 3rd Work- shop on Very Large Corpora. Massachusetts : Association for Computational Linguistics, 1995 : 82--94.
  • 7K Uehimoto, et al. Named entity extraction based on a maximum entropy model and transformation rules [ C ] //Proceedings of the 38th Annual Meeting of the Associa- tion for Computational Linguistics, 2000 : 326-- 335.
  • 8Gulila Ahenbek, Ruina Sun. Kazakh Noun Phrase Ex- traction based on N-gram and Rules : 2010 International Conference on Asian Language Processing [C ]. Harbin, Heilongiiang, China: 1EEE computer society, 2010: 305-- 308.
  • 9Laffel~y J. et al. Conditional Random Fields : Proba- bilistic Models for Segmenting and Labeling Sequence Da- m [ C ]//Proceedings of the 18th International Conf on machineLeaming, 2001: 282--289.
  • 10S Lakshmana Pandian, T V Geetha. CRF Models for Tamil Part of Speech Tagging and Chunking [ C ]. In- ternational Conference on the Computer Processing of Ori- entalLanguages-ICCPOL, Hong Kong, 2009: 11 --22.

引证文献6

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部