摘要
汉字的“字”与汉语的“词”单位不一致,给人脑和电脑的理解都带来困难。汉语另外有三种可能的文本:“汉字词式书写”文本、“纯拼音词式书写”文本、“拼音夹用汉字”文本。“拼音夹用汉字”文本最有可能成为走出汉语自动理解困境的最佳路径。
The unit of Chinese Characters is inconsistent with the unit of Chinese word , which brings many difficulties to humen and computers. Chinese text provides three possible methods: writing the Chinese text word by word; writing it by Chinese Phonetic Alphabet purely ; writing by Chinese Phonetic Alphabet + Chinese Characters, in which the third method is the best way to free Chinese from the predicament of automatic comprehension.
出处
《北华大学学报(社会科学版)》
2006年第3期56-59,共4页
Journal of Beihua University(Social Sciences)
关键词
汉语
自动理解
词语切分
文本改进
Chinese
Automatic comprehension
Chinese segmentation
Text reform