摘要
维吾尔语动词的体范畴是维吾尔语动词语法范畴中极为复杂的范畴,也是维吾尔语信息处理中的难点问题之一,计算机对维吾尔语动词体范畴的处理是在对人称、时、否定等语法范畴处理之后才进行处理。但是难点就是体范畴重叠问题的解决。维吾尔语动词的体范畴词尾按照一定的规则连接在词干,这使得维吾尔语动词体范畴的重叠形式可用有限状态自动机形式化描述。因此它根据重叠规则构造从右向左的非确定自动机,之后把从右向左方向的自动机转换成从左向右的非确定自动机,最后把非确定自动机转换成确定自动机来实现维吾尔语动词体范畴的形式化描述。
The verb aspect category is one of the most complicated categories in Uighur language and,thus,remains as one of the hardest problems in Uyghur language processing.Computer processing of verb aspect category can only be done after resolving the grammatical categories such as tense,person,negative in Uighur language.But overlapping of verb aspect is hard to crack.The verb aspect suffixes of Uighur language are attached to the verb stem according to specific rules,which enables to describe the overlapping forms of Uyghur verb aspect in terms of finite state machine.An FSM can be firstly generated from right to left according to overlapping rules,then it can be transformed into DFA from left to right,during which the formal description of Uyghur verb aspect is realized.
出处
《中文信息学报》
CSCD
北大核心
2012年第4期61-65,84,共6页
Journal of Chinese Information Processing
基金
2011年度教育部人文社会科学青年基金资助项目(11YJC740001)
国家社会科学基金资助项目(10AYY006)
新疆维吾尔自治区普通高等学校人文社会科学重点研究基地基金资助项目(010812B04)
关键词
维吾尔语
动词
体范畴
有限状态自动机
形式化
Uyghur language
verb
aspect category,finite state machine,formalization