
Content-based clustered P2P search model depending on set distance (cited by: 2)
Abstract: In content-based unstructured P2P search systems, two main problems directly affect query effectiveness and search cost: the complexity of computing document similarity in a high-dimensional semantic space, and the large number of redundant messages generated by broadcast algorithms. This paper proposes a content-based clustered P2P search model built on set distance to improve query efficiency and reduce redundant messages. The model defines document similarity through set distance, which keeps the similarity computation within linear time and thereby shortens query time. It also clusters peers by content using the set distance between nodes, which both lowers query time and cuts redundant messages. Simulation experiments show that the model achieves relatively high recall while reducing search cost and query time to about 40% and 30%, respectively, of those of the Gnutella system.
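Source: Journal of the Graduate School of the Chinese Academy of Sciences (CAS, CSCD), 2007, No. 2, pp. 241-247 (7 pages)
Funding: Supported by the National Natural Science Foundation of China (60673172) and the Next Generation Internet Demonstration Project, 2005 Research and Development, Industrialization and Application Trial Program (CNGI-04-15-7A)
Keywords: peer-to-peer network, Gnutella, distributed hash tables (DHT), set distance, vector space model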
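The abstract's central idea, measuring similarity through a set distance that can be computed in linear time and then grouping peers by content, can be illustrated with a minimal sketch. The record does not give the paper's actual definition of set distance, so the code below assumes a normalized symmetric difference (Jaccard distance) over term sets as a stand-in. The names `set_distance` and `closest_cluster`, and the sample terms, are hypothetical and are not taken from the paper.

```python
from typing import Dict, Set


def set_distance(a: Set[str], b: Set[str]) -> float:
    """Assumed stand-in for the paper's set distance: |A symdiff B| / |A union B|.

    With hash-based sets this takes expected time linear in len(a) + len(b).
    Returns 0.0 for identical sets and 1.0 for disjoint non-empty sets.
    """
    union_size = len(a | b)
    if union_size == 0:
        # Two empty term sets are treated as identical (an assumption).
        return 0.0
    return len(a ^ b) / union_size


def closest_cluster(peer_terms: Set[str], clusters: Dict[str, Set[str]]) -> str:
    """Return the name of the cluster whose term set is nearest to the peer's."""
    return min(clusters, key=lambda name: set_distance(peer_terms, clusters[name]))


if __name__ == "__main__":
    # Hypothetical term sets, for illustration only.
    clusters = {
        "networking": {"p2p", "routing", "dht"},
        "retrieval": {"recall", "query", "index"},
    }
    peer = {"p2p", "dht", "gnutella"}
    print(closest_cluster(peer, clusters))
```

Under these assumptions a peer would join the cluster at the smallest distance, so a query could be forwarded within one cluster instead of being broadcast to every neighbor. How the paper itself forms clusters and routes queries is not stated in this record.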