摘要
要提高计算机的自然语言处理 (NaturalLanguageProcessing ,NLP)性能 ,首要的问题是加强语言知识库的建造。词是语言的基本层次单位 ,既涵盖着语言的静态范畴知识 ,也体现着语言的动态规则知识 ,所以词汇知识库的建造处于重中之重的位置。俄语词汇知识库是以词为基本操作单位和描述主题、以一定组织形式表示和存储词汇单位相关语言信息的仓库。建造过程中涉及几个关键性技术问题 ,即 :(1)条头筛选技术 ;(2 )条目中语义信息获取技术 ;(3)多义现象处理技术 ;(4 )固定词组处理技术 ;(5 )
Construction of a language knowledge database is critical for enhancement of the efficiency of an NLP system. Word as the basic language unit covers both static categorical knowledge of a language and its dynamic rule knowledge. The Russian language knowledge database under construction takes words as its basic operation units and description objects, and contains word-relevant information represented and stored according to some principles. The main technical problems in the process of construction involve: (1) selection of head words; (2) extraction of semantic information of head words; (3) processing of polysemy; (4) processing of idioms; (5) assignment of Chinese equivalents to Russian head words.
出处
《解放军外国语学院学报》
北大核心
2002年第5期28-32,共5页
Journal of PLA University of Foreign Languages
关键词
核心技术
俄语词汇
知识库
建造技术
Russian lexicon
knowledge database
construction techniques