摘要
随着电子文档的出现和发展,电子图书馆在人们的生活中正起着越来越重要的作用,如何从大量文档中获取有用的信息,成为信息检索领域的关键技术。一种改进的文本检索算法使用目前较先进的小波变换算法,有效地结合传统的向量空间法和近似法的优点,并且利用CBW算法来进行加权,有效避免了交叉比较关键词的问题。此算法用以比较文档之间的相似性,实验表明是一套性能较好的文档相似性检索方法。
With the emergence and evolution of electronic documents, electronic library is playing an important role in people's life.How to retrieve useful information from a large amount of information has become a key technology in the information area. In this paper, we present an improved document intrieval algorithm. Using the advantage of vector space methods and proximity search, it is a document similar retrieval technique with a better performance which uses wavelet transform technique and CBW weighting and can be used to compare the similarity of documents.
关键词
小波变换
词信号
文本检索
幅值
零相位精准
wave transform
term signal
document retrieval
magnitude
zero pahse precision