
The Design of a Lucene-Based Automatic Answering System (cited by: 1)
Abstract: This paper briefly analyzes the importance of automatic answering systems and the shortcomings of current ones, and designs an automatic answering system based on Lucene. The system builds on Lucene's retrieval mechanism, defines a domain-specific dictionary for the answering system, and stores it in a two-level hash dictionary structure, which the authors describe as the most widely used structure at the time. The paper also proposes an optimized maximum-matching algorithm for Chinese word segmentation and integrates it into Lucene to compensate for the limitations of Lucene's built-in tokenizer.
Source: Computer Development & Applications (《电脑开发与应用》), 2012, No. 4, pp. 32-34, 37 (4 pages)
Keywords: Lucene; Chinese word segmentation; automatic question answering
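The abstract mentions a two-level hash dictionary and maximum-matching segmentation without giving details. As a rough illustration only, the sketch below shows plain forward maximum matching over a dictionary indexed first by a word's leading character and then by word length. It is not the paper's optimized algorithm and does not use the Lucene API. The names `TwoLevelHashDict`, `forward_max_match`, and `SAMPLE_WORDS`, as well as the choice of word length as the second-level key, are assumptions made for this example.

```python
from collections import defaultdict


class TwoLevelHashDict:
    """Hypothetical two-level hash dictionary (an assumption, not the paper's layout).

    Level 1 key: first character of the word.
    Level 2 key: word length, mapping to the set of words of that length.
    """

    def __init__(self, words):
        self._index = defaultdict(lambda: defaultdict(set))
        for word in words:
            if word:
                self._index[word[0]][len(word)].add(word)

    def contains(self, word):
        """Return True if the exact word is stored in the dictionary."""
        if not word:
            return False
        by_length = self._index.get(word[0])
        if by_length is None:
            return False
        return word in by_length.get(len(word), ())

    def lengths_for(self, first_char):
        """Candidate word lengths for a leading character, longest first."""
        by_length = self._index.get(first_char)
        return sorted(by_length, reverse=True) if by_length else []


def forward_max_match(text, dictionary):
    """Segment text left to right, always taking the longest dictionary word."""
    tokens = []
    i = 0
    while i < len(text):
        match = text[i]  # fall back to a single character if nothing matches
        for length in dictionary.lengths_for(text[i]):
            candidate = text[i:i + length]
            if len(candidate) == length and dictionary.contains(candidate):
                match = candidate
                break
        tokens.append(match)
        i += len(match)
    return tokens


# Illustrative word list; the paper's domain dictionary is not reproduced here.
SAMPLE_WORDS = ["自动", "答疑", "系统", "自动答疑", "中文", "分词", "中文分词"]
SAMPLE_DICT = TwoLevelHashDict(SAMPLE_WORDS)
```

Indexing by leading character first means the segmenter only tries lengths that actually occur for that character, rather than scanning every length up to a global maximum. Calling `forward_max_match("自动答疑系统", SAMPLE_DICT)` segments the string against the sample word list.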
