期刊文献+

一种基于聚类的语义检索算法

Clustering-based Semantic Retrieval Algorithm
下载PDF
导出
摘要 潜在语义分析在进行大规模语义检索时计算效率较低、存储开销较大。针对该问题,提出一种基于聚类的潜在语义检索算法。通过文档之间的结构关系对文档进行聚类,利用簇代替文档分析潜在语义,以此减少处理文档的个数。实验结果表明,该算法能减少查询时间,且检索精确度较高。 Latent Semantic Analysis(LSA) lacks computation efficiency and has storage deficiencies when it is used in the large scale semantic retrieval.To solve this problem,this paper proposes a clustering-based semantic retrieval algorithm.This algorithm clusters the documents using their structural information,and applies the LSA process on those clusters to efficiently reduce the number of documents.Experimental results show that the algorithm can exponentially decrease the time of inquiring and get good retrieval accuracy.
出处 《计算机工程》 CAS CSCD 2012年第2期36-38,共3页 Computer Engineering
基金 国家自然科学基金资助项目(60703093)
关键词 潜在语义分析 信息检索 向量空间模型 图聚类算法 Latent Semantic Analysis(LSA) information retrieval vector space model graph clustering algorithm
  • 相关文献

参考文献5

  • 1Deerwester S, Dumais S T, Furnas G W, et al. Indexing by Latent Semantic Analysis[J]. Journal of the American Society for Information Science, 1990, 41(6): 391-407.
  • 2Hofmann T. Probabilistic Latent Semantic Indexing[C] //Proc. of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA: ACM Press, 1999.
  • 3王卫国,徐炜民.基于潜在语义分析的个性化查询扩展模型[J].计算机工程,2010,36(21):43-45. 被引量:13
  • 4Jeh G, Widom J. SimRank: A Measure of Structural-context Similarity[C] //Proc. of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM Press, 2002.
  • 5Yin Xiaoxin, Han Jiawei, Philip Y S. LinkClus: Efficient Clus- tering Via Heterogeneous Semantic Links[C] //Proc. of the 32nd International Conference on Very Large Data Bases. Seoul, Korea: [s. n.] , 2006.

二级参考文献6

共引文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部