Abstract
Ambiguity resolution is one of the main problems in Chinese word segmentation. This paper presents a segmentation algorithm that combines omni-segmentation (enumerating every dictionary word that can start at each position) with statistics. A directed acyclic word graph is first constructed from a statistical dictionary, and the best segmentation path is then obtained with a dynamic programming algorithm. Experiments show that the system effectively improves both the accuracy of ambiguity resolution and the segmentation speed.
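The pipeline described in the abstract (word graph from a statistical dictionary, then dynamic programming over the graph) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the toy dictionary `FREQ`, its counts, the unigram log-probability score, and the function names `build_dag` and `segment` are all assumptions made for the example.

```python
import math

# Hypothetical toy dictionary: word -> count (illustrative numbers, not from the paper).
FREQ = {
    "研究": 50, "研究生": 20, "生命": 40, "命": 5, "起源": 30,
    "研": 2, "究": 2, "生": 10, "起": 8, "源": 4,
}
TOTAL = sum(FREQ.values())
MAX_WORD_LEN = max(len(w) for w in FREQ)


def build_dag(text):
    """Omni-segmentation: map each start index to every end index of a dictionary word."""
    dag = {}
    for i in range(len(text)):
        ends = [j for j in range(i + 1, min(len(text), i + MAX_WORD_LEN) + 1)
                if text[i:j] in FREQ]
        # Fall back to a single character so the graph stays connected.
        dag[i] = ends or [i + 1]
    return dag


def segment(text):
    """Pick the path through the word graph with the highest unigram log-probability."""
    dag = build_dag(text)
    n = len(text)
    # best[i] = (best log-probability of text[i:], end index of the first word on that path)
    best = {n: (0.0, n)}
    for i in range(n - 1, -1, -1):  # fill from the end of the sentence backwards
        best[i] = max(
            (math.log(FREQ.get(text[i:j], 1) / TOTAL) + best[j][0], j)
            for j in dag[i]
        )
    # Follow the stored end indices to recover the words.
    words, i = [], 0
    while i < n:
        j = best[i][1]
        words.append(text[i:j])
        i = j
    return words


print(segment("研究生命起源"))  # → ['研究', '生命', '起源']
```

The input contains an overlapping ambiguity: 研究生 / 命 competes with 研究 / 生命. Under the assumed counts the second reading scores higher, so the dynamic programming step selects it without enumerating every path explicitly.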
Source
《微电子学与计算机》
CSCD
Peking University Core Journal (北大核心)
2009, No. 5, pp. 68-70 (3 pages)
Microelectronics & Computer
Funding
National Defense "Eleventh Five-Year Plan" Pre-Research Project (513060601)
Keywords
Chinese word segmentation
omni-segmentation
statistical word segmentation
ambiguity resolution