
On building and updating distributed LSI for P2P systems
Abstract Taking into account the self-organizing, anonymous, and dynamic characteristics of peer-to-peer (P2P) systems, this paper proposes a P2P network model for building and updating latent semantic indexing (LSI) in a distributed environment, together with an algorithm for merging reduced-dimension representations (RDRs) of document matrices that is suited to P2P systems. Using the signal and noise subspace model, it gives a theoretical justification of the RDR-merging algorithm and states the preconditions the algorithm must satisfy. A test on the standard MED (Medlars collection) document set, run in Matlab 6.5, measures the effect of RDR merging on query precision. Theoretical analysis and numerical experiments both indicate that the algorithm addresses the construction and updating of distributed LSI in P2P systems: it lowers the network communication overhead and the computational cost of singular value decomposition (SVD) while keeping the loss in query precision within a tolerable range.
Source Journal of Southeast University (Natural Science Edition), 2006, No. 1, pp. 39-42 (4 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Keywords singular value decomposition; updating algorithm (updating problem); latent semantic indexing; peer-to-peer
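The abstract names an RDR-merging step but does not spell it out. The following is a minimal sketch of one common way to merge two truncated SVDs that share a term vocabulary, under the assumption that each peer holds a rank-k factorization of its own term-document matrix. It is not the authors' algorithm; the function names `truncated_svd` and `merge_rdrs` and the random test matrices are hypothetical and serve only as illustration.

```python
import numpy as np

def truncated_svd(A, k):
    """Rank-k reduced-dimension representation of A as (U_k, s_k, Vt_k)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]

def merge_rdrs(rdr_a, rdr_b, k):
    """Merge two RDRs built over the same terms (rows).

    Computes the rank-k SVD of [A_k, B_k] from the small factors only,
    without reassembling the full term-document matrices.
    """
    Ua, sa, Vta = rdr_a
    Ub, sb, Vtb = rdr_b
    ka = len(sa)
    # Columns of C span the dominant term subspaces of both document sets.
    C = np.hstack([Ua * sa, Ub * sb])
    U, s, Wt = np.linalg.svd(C, full_matrices=False)
    U, s, Wt = U[:, :k], s[:k], Wt[:k, :]
    # [A_k, B_k] = C @ blockdiag(Vta, Vtb), so map Wt through that block matrix.
    Vt = np.hstack([Wt[:, :ka] @ Vta, Wt[:, ka:] @ Vtb])
    return U, s, Vt

# Hypothetical data: two peers, 20 shared terms, exactly rank-3 document sets.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 3)) @ rng.standard_normal((3, 8))
B = rng.standard_normal((20, 3)) @ rng.standard_normal((3, 10))

U, s, Vt = merge_rdrs(truncated_svd(A, 3), truncated_svd(B, 3), k=6)
print(np.allclose((U * s) @ Vt, np.hstack([A, B])))
```

In this sketch only the factors of size terms × k (plus the small document-side factors) would need to travel between peers, which is the kind of saving in communication and SVD cost the abstract refers to. When the local matrices are not exactly low rank, the merged result is only an approximation of the SVD of the combined matrix.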

