Abstract
Building on an analysis of existing Chinese word segmentation algorithms, fast algorithms in particular, this paper proposes an efficient fast segmentation algorithm based on an improved dictionary (word list) data structure. Theoretical analysis indicates that the algorithm also performs well with a large dictionary.
Source
《计算机工程》
CAS
CSCD
Peking University Core Journals
2004, No. 19, pp. 119-120, 128 (3 pages)
Computer Engineering
Funding
Supported by the Ministry of Education Fund for Young Backbone Teachers (E200065)
Keywords
Word segmentation
Hash
Binary search