摘要
本文论述一个称之为“多词组一次性变换”的拼音·汉字变换技术。多词组一次性变换以拼音为基本单位,使拼音·汉字变换首次以任意的拼音列为单位进行一次性处理,实现了拼音输入中文中的自动分词。多词组一次性变换建立在一个完整的词法定义下。本文的多词组一次性变换机构,已在LUNA UNIX工作站上的中文输入系统cWnn上实现。取得了显著的效果。
This paper proposes a multiple Phrase pinyin\Hanzi (拼音\汉字) conversion mechanism. The mechanism Performs the pinyin\Hanzi conversion under welldefined gram-mer rules, which are implemented by using the concept of Chinese phrases. A. Chinese phrase is a sequence of Chinese words arranged according to a defined gramm-er. A Chinese word is a sequence of Hanzi, that is defined in one of the dictionaries in the system.Our mechanism has made it possible to accomplish the Pinyin\Hanzi conversion of an arbitrary Pinyin sequence. It is the first complete implementation of automatic Chinese word division (also called segment division) (自动分词) from an arbitrary Pinyin sequence.The proposed mechanism has already been developed in a Chinese input system called 'cWnn', which runs under X Window System and GMW Window System on the LUNA Unix workstation.
出处
《中文信息学报》
CSCD
1990年第2期55-64,共10页
Journal of Chinese Information Processing