摘要
本文提出了一种基于汉语语法知识的汉语拼音自动分词的方法。文章描述了自动分词时,多义切分检测与处理策略,以及利用语法和语义知识实现多义切分纠错方法。本文方法已经在拼音汉字转换系统中应用。实际情况表明,本文提出的汉语拼音自动分词方法是可行的。
A knowledge-based method for automatically separating Chinese words from the text in hanyupinyin form is presented in the paper. The strategy for detecting and processing the multi-separations, and the way to distiguish these ambiguities by using the grarmmartical and semantical information are discripeed in detail. This method has already been applied in the system to convert Chinese texts from hanyupinyin form tl Chinese character form. It is shown that the method is very effective.
关键词
分词
汉语
拼音
知识
word-separator
hanyupiyin
text
knowledge-based