期刊文献+

基于标签匹配的协同过滤推荐算法研究 被引量:2

Investigation on Collaborative Filtering Recommendation Algorithm with Tag Matching
下载PDF
导出
摘要 随着微博用户数量的上升,微博信息量成倍增长,基于冗杂的微博信息向微博用户快速推荐感兴趣的好友是不容回避的技术问题。针对这一问题,基于微博大数据,以Hadoop为平台,HBase为基础,MapReduce为编程框架,提出了基于Apriori算法与Item-based协同过滤算法的组合算法,并构建了推荐好友系统。该系统通过Apriori算法对冗杂的微博内容记录进行频繁项集的计算,得出能表达用户喜好的标签,以提升系统的时间性能;通过Item-based算法对标签进行匹配推荐,以缩短系统的推荐时间以及资源占用率。为了验证所构建系统的有效性和可靠性,分别进行了两组对比实验,第一组实验为添加了Apriori算法的协同过滤算法与传统协同过滤算法在时间性能方面的对比测试,第二组实验则为Apriori算法混合Item-based协同过滤算法与混合K-means算法的对比测试。实验结果表明,在庞大的微博容量下,与传统协同过滤算法相比,所提出算法的运行时间缩短了24%~44%;与混合K-means聚类算法相比,所提出算法在算法运行时间和CPU占用率均有1.2~1.5倍的提升。可见,提出的算法可显著缩短推荐时间,减少资源消耗率,提高推荐效率。 With the rising of micro-blogging users, microblog information capacity has grown rapidly. Fast recommendation of interested friends for micro-blogging users based on the jumbled microblog information becomes inevitable problem. Therefore faced with massive data of microblog, with Hadoop as platform and MapReduce as program frame and based on HBase, a hybrid algorithm of Apriori & Item -based collaborative filtering recommendation algorithm has been proposed and a recommended friends system has been established, in which system computation of frequent item set with massive microblog content records has been conducted to express users' favorites with tags for promotion of its time performances via Apriori algorithm and thus recommendation of tags has been matched via Item-based algorithm for decrease of recommendation time and occupancy rate of system resource. In order to verify its effectiveness and reliability, two groups of contrast experiments have been conducted, in which the first one involves contrast tests of time performances with collabo- rative filtering algorithm based on Apriori algorithm vs traditional collaborative filtering algorithm and the other one is composed of con- trast tests of hybrid algorithm combined Apriori algorithm with Item-based collaborative filtering algorithm vs hybrid K -means algo- rithm. The results of contrast experiments show that in large micro-blogging capacity, compared with hybrid K -means clustering algo- rithm ,the proposed algorithm has decreased the running time by 24% -44% and has lifted 1.2 - 1.5 times in operation time and CPU oc- cupancy rate. Obviously, the time and recommended resource consumption can be greatly reduced and efficiency recommended improved for proposed algorithm.
出处 《计算机技术与发展》 2017年第7期25-28,共4页 Computer Technology and Development
基金 国家自然科学基金资助项目(61562086 61462079 61363083 61262088) 新疆"万人计划"后备项目(wr2015bj01)
关键词 协同过滤算法 标签计算 HADOOP MAPREDUCE 标签匹配 collaborative filtering algorithm tag computing Hadoop MapReduce tag matching
  • 相关文献

参考文献7

二级参考文献147

  • 1邢春晓,高凤荣,战思南,周立柱.适应用户兴趣变化的协同过滤推荐算法[J].计算机研究与发展,2007,44(2):296-301. 被引量:148
  • 2陈健,印鉴.基于影响集的协作过滤推荐算法[J].软件学报,2007,18(7):1685-1694. 被引量:59
  • 3Shardanand U, Maes P. Social information filtering: Algorithms for automating "Word of Mouth". In: Proc. of the Conf. on Human Factors in Computing Systems. New York: ACM Press, 1995.210-217.
  • 4Hill W, Stead L, Rosenstein M, Furnas G. Recommending and evaluating choices in a virtual community of use. In: Proc. of the Conf. on Human Factors in Computing Systems. New York: ACM Press, 1995. 194-201.
  • 5Resnick P, Iakovou N, Sushak M, Bergstrom P, Riedl J. GroupLens: An open architecture for collaborative filtering of netnews. In: Proc. of the Computer Supported Cooperative Work Conf. New York: ACM Press, 1994. 175-186.
  • 6Baeza-Yates R, Ribeiro-Neto B. Modern Information Retrieval. New York: Addison-Wesley Publishing Co., 1999.
  • 7Murthi BPS, Sarkar S. The role of the management sciences in research on personalization. Management Science, 2003,49(10): 1344-1362.
  • 8Smith SM, Swinyard WR. Introduction to marketing models. 1999. http://marketing.byu.edu/htmlpages/courses/693r/modelsbook/ preface.html
  • 9Adomavicius G, Tuzhilin A. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. on Knowledge and Data Engineering, 2005,17(6):734-749.
  • 10Resnick P, Varian HR. Recommender systems. Communications of the ACM, 1997,40(3):56-58.

共引文献847

同被引文献7

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部