摘要
简要分析了当前自动答疑系统的缺陷及其重要性,设计了一个基于Lucene的自动答疑系统。该系统充分利用了Lucene强大的检索机制,设计了针对于本答疑系统的专业词典,采用了当前最流行的二级哈希词典存储结构,同时提出了一种优化的最大匹配中文分词算法并应用到Lucene当中,弥补了Lucene自带分词器的不足。
This paper begins with a brief analysis of the current automatic answering system's defects and its importance and designs automated answering system based on Lucene. The system makes a full use of powerful search mechanism of Lucene,designs the pro[essional dictionary for the answering system,and using the most popular dictionary of two hash storage structure. This article also puts forward an optimized maximum matching algorithm of Chinese word segmentation and applies it to the Lucene that makes up for Lcuene' lack of word segmentation.
出处
《电脑开发与应用》
2012年第4期32-34,37,共4页
Computer Development & Applications
关键词
LUCENE
中文分词
自动答疑
Lucene,chinese word segmentation,automatic question answering