Abstract
Treating the three stages of Chinese lexical analysis (word segmentation, part-of-speech tagging, and word sense tagging) as a whole, this paper presents an integrated lexical analysis model that combines statistical information from different lexical levels, such as word form, part of speech, and word sense. Based on this model, an automatic Chinese lexical analysis system for real running text is implemented using a heuristic A* decoding algorithm. The system was evaluated in preliminary closed and open tests. All the results show that the integrated approach is efficient.
Source
《高技术通讯》
EI
CAS
CSCD
1999, No. 12, pp. 6-10 (5 pages)
Chinese High Technology Letters
Funding
Supported by the 863 Program (863306ZT03023)
Keywords
Lexical analysis
Integrated Chinese processing
Chinese lexical analysis
Automatic analysis system
Lexical analysis, Word segmentation, Part of speech tagging, Word sense tagging