基于P-Rank的网络书店相似性搜索

Online bookstore similarity search based on P-Rank

下载PDF

导出

摘要为提高网络书店相似性搜索效率,降低时间和存储开销以适应大规模数据,提出一种基于P-Rank的相似性搜索优化算法ProductP-Rank。对相似性搜索算法进行分析和比较,指出相似性计算精确度和复杂度是现有算法所面临的难点;依据消费者与图书之间的购买关系构建购物网络,离线计算一步相似性矩阵,在线计算两步相似性矩阵。实验结果表明,该方法降低了相似性计算的存储和预计算时间的开销,具有较高精确度,能够快速响应查询请求。 To increase the efficiency of algorithms on online bookstore＇s similarity search,and reduce time and space cost to adapt to large information network,ProductP-Rank,an optimized similarity search method,was proposed based on the basic idea of P-Rank.The past algorithms for similarity search were analyzed and discussed and the accuracy and complexity problems in similarity search were pointed out.By building the customer-product network according to the co-purchasing relationship,for a given query,the 2-hop similarity matrix between query and each item was computed based on the pre-computed 1-hop similarity matrix.Experimental results show the space cost and pre-computation time cost of ProductP-Rank were evidently less than that of P-Rank with little effectiveness loss and low online-query time cost.

作者吕巍邬春学张明西钟聃

机构地区上海理工大学光电信息与计算机工程学院上海理工大学新闻出版总署重点实验室上海理工大学出版印刷与艺术设计学院中国民用航空西北地区空中交通管理局计保中心通信室

出处《计算机工程与设计》北大核心 2015年第10期2849-2855,共7页 Computer Engineering and Design

基金国家自然科学基金项目(61202376) 上海出版传媒研究院上海出版印刷高等专科学校招标课题基金项目(SAYB1410) 上海高校青年教师培养资助计划基金项目(ZZSLG14021) 上海市教育基金会晨光计划基金项目(10CG49) 上海市教委科研创新基金项目(13YZ075)

关键词相似性搜索 P-RANK 网络书店 “消费者-商品”关系网络信息检索 similarity search P-Rank online bookstore customer-product network information retrieval

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献14

1Sun Y,Yu Y,Han J.Ranking-based clustering of heterogeneous information networks with star network schema[C]//ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2009:797-806.
2David Carmel,Naama Zwerdling,Ido Guy,et al.Personalized social search based on the user’s social network[C]//International Conference on Information and Knowledge Management-CIKM,2009:1227-1236.
3Ido Guy,Naama Zwerdling,David Carmel,et al.Personalized recommendation of social software items based on social relations[C]//Proceedings of the 3rd ACM Conference on Recommender Systems,2009:53-60.
4Moricz M,Dosbayev Y,Berlyant M.Pymk:Friend recommendation at MySpace[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data,2010:999-1002.
5Zhao P,Han J,Sun Y.P-rank:A comprehensive structural similarity measure over information networks[C]//Proceedings of the 18th ACM Conference on Information and Knowledge Management,2009:553-562.
6Li C,Han J,He G,et al.Fast computation of SimRank for static and dynamic information networks[C]//13th International Conference on Extending Database Technology,2010.
7Cai Y,Cong G,Jia X,et al.Efficient algorithms for computing link based similarity in real world networks[C]//9th IEEE International Conference on Data Mining,2009:734-739.
8Cai Y,Liu H,He J,et al.An adaptive method for efficient similarity calculation[C]//Proc of DASFAA,2009:339-353.
9Li P,Liu H,Xu Yu J,et al.Fast single-pair simrank computation[C]//Proc of SDM,2010:571-582.
10Zhang M,He Z,Hu H,et al.E-rank:A structural-based similarity measure in social networks[C]//Proc of WI,2012:415-422.

二级参考文献16

1Jeh G,Widom J. SimRank: A Measure of Structural-contextSimilarity [ C ]// Prec. of SIGKDD,2002.
2Jeh G, Widom J. Scaling personalized web search [ C ]//Prec. of WWW, 2002.
3Small H G. Co-citation in the scientific literature: A new measure of the relationship between two documents [ J ]. Journal of the American Society for Information Science, 1973, 24(4) : 265 -269.
4Kessler M M. Bibliographic coupling between scientific papers [ J ]. A- merican Documentation, 1963, 14 : 10 - 25.
5Popescul A, Flake G, Lawrence S, et al. Clustering and identifying tem- poral trends in document databases[ C ]//Prec. of the IEEE Advances in Digital Libraries, 2000.
6Small H. Co-citation in the scientific literature : A new measure of the relationship between two documents [ J ]. Journal of the American Soci- ety for Information Science, 1973, 4:265 - 269.
7Larson R R. Bibliometrics of the World-Wide Web: An exploratory a- nalysis of the intellectual structure of cyberspace [ C ]//Prec. of the Annual Meeting of the American Society for Information Science, Balti- more, Maryland, October 1996.
8Pitkow J, Pirolli P. Life, death, and lawfulness on the electronic frontier [C]//Proc. of the Conference on Human Factors in Computing Sys- tems,Atlanta, Georgia, 1997.
9Lin Z, King I, Lyu M. R. Pagesim: A novel link-based similarity meas- ure for the world wide web [ C ]//Prec. of WI, 2006:687 - 693.
10Fogaras D, Racz B. Scaling link-based similarity search [ C ]//Prec. of WWW, 2005:641-650.

共引文献3

1刘萍,黄纯万.基于SimRank的作者相似度计算[J].情报理论与实践,2015,38(6):109-114. 被引量：10
2巨星海,周刚,王婧,张凤娟.用户画像构建技术研究[J].信息工程大学学报,2020,21(2):242-250. 被引量：4
3韦二龙,刘东,龙恩,王永安.基于用户画像的遥感信息精准服务系统设计[J].无线电工程,2021,51(8):720-724. 被引量：2

1冷泳林,申华,鲁富宇.基于P-Rank的RDF有向图的分布式存储[J].重庆理工大学学报（自然科学）,2015,29(1):91-95. 被引量：2
2冷泳林,鲁富宇.基于结构相似性的RDF数据聚类分割[J].信息技术,2015,39(6):63-65.
3王旭丛,李翠平,陈红.大数据下基于异步累积更新的高效P-Rank计算方法[J].软件学报,2014,25(9):2136-2148. 被引量：4
4Meng Chen,Xiaohui Yu,Yang Liu.Mining Object Similarity for Predicting Next Locations[J].Journal of Computer Science & Technology,2016,31(4):649-660.
5秦琦冰,谭龙.基于中医方剂数据库的Top-Rank-k频繁模式挖掘算法[J].计算机应用,2017,37(2):329-334. 被引量：1
6李彬.计算机在各行业中的应用[J].电脑迷（数码生活）（上旬刊）,2013(5):6-7.
7韩文静,李海峰,马琳.考虑情感程度相对顺序的维度语音情感识别[J].信号处理,2011,27(11):1658-1663. 被引量：2
8武森,魏桂英,白尘,张桂琼.分类属性高维数据基于集合差异度的聚类算法[J].北京科技大学学报,2010,32(8):1085-1089.
9韩富春,王英,张丽,张宁.人工神经网络在电力系统网损计算中的应用[J].太原理工大学学报,2004,35(6):664-666. 被引量：1
10陈德运,高明,李伟,王莉莉,王飞虎.新型ECT数据采集系统设计与实现[J].电机与控制学报,2013,17(5):87-92. 被引量：26

计算机工程与设计

2015年第10期

浏览历史

内容加载中请稍等...

基于P-Rank的网络书店相似性搜索

参考文献14

二级参考文献16

共引文献3

相关作者

相关机构

相关主题

浏览历史