期刊文献+

基于在线百科全书的群体兴趣及其关联性挖掘 被引量:10

Group Interests and Their Correlations Mining Based on Wikipedia
下载PDF
导出
摘要 针对协同过滤、基于内容过滤等个性化推荐方法所存在的用户隐私数据收集、冷启动等问题,提出一种群体兴趣及其关联性的挖掘方法,并应用于推荐领域.以维基百科作为数据源,获取用户社团及其编辑的词条,设计了以词条及其所属类别为基础的泛树结构生长策略,使用泛树结构表征用户社团所对应的兴趣点.结合用户社团的结构特征和兴趣点的语义特征给出了用户社团对兴趣点的关注度及兴趣点间关联性的定义,用此群体兴趣取代个性化推荐方法中的个体兴趣,进行了人工直观评价、测试集对比以及视频点播中的新闻推荐等三种实验.结果表明,测试集上群体兴趣关联性的准确度达到了50%,高于基准协同推荐方法的准确度;新闻推荐实验中,本方法比按热度推荐方法获得了高出近一倍的点击率,验证了群体兴趣及其关联性的合理性. Personalized recommendation technologies,such as collaborative filtering and content based filtering,face some problems.The obvious ones are the privacy history data collection and cold start.In this paper,we suggest a group interests mining method from Wikipedia.We also apply the group interests into the recommendation system,which avoid the cold start,and don't need any privacy data.Here,the group interest replaces the personalized interest in the traditional personalized recommendation technologies.In detail,we first suggest a general tree structure and a growing strategy to denote the interest of a users group,which includes the semantic relationship of each interest.Then we define the group interest based on the structure of users groups.At last,we measure the correlations of interests according to the general tree structure of interests.We further design three types of experiment to evaluate the reasonability of group interests,which is manual evaluation,test set evaluation and a news recommendation experiment in video service.The results show that,the accuracy of correlation between group interests can be more than 50%,and the news hits rate on the recommendation from group interests is 2 times larger than that on the recommendation from news popularity.
出处 《计算机学报》 EI CSCD 北大核心 2011年第11期2234-2242,共9页 Chinese Journal of Computers
基金 国家自然科学基金(69120912,61035004) 国家“九七三”重点基础研究发展规划项目基金(2007CB310804) 中国博士后科学基金(20090460107,201003794)资助~~
关键词 群体兴趣 兴趣点泛树结构 协同推荐 维基百科 社会网挖掘 group interest general tree of interests collaborative recommendation Wikipedia social network mining
  • 相关文献

参考文献18

  • 1许海玲,吴潇,李晓东,阎保平.互联网推荐系统比较研究[J].软件学报,2009,20(2):350-362. 被引量:541
  • 2Stadnyk I, Kass R. Modeling users' interests in information filters. Communications of the ACM, 1992, 35(12): 49-50.
  • 3Halavais A, Lackaff D. An analysis of topical coverage of Wikipedia. Journal of Computer-Mediated Communication, 2008, 13(2): 429-440.
  • 4Kittur A, Chi E H, Suh B. What's in wikipedia: Mapping topics and conflict using socially annotated category struc- ture//Proceedings of the ACM Conference on Human Factors in Computing Systems. Boston, USA, 2009:1509-1512.
  • 5Lin Tsun-Chen, Liu Ru-Sheng, Chen Shu-Yuan, Liu Chen- Chung, Chen Chieh-Yu. Genetic algorithms and silhouette measures applied to microarray data classification//Proceed- ings of the 3rd Asia-Pacific Bioinformatics Conference. Singapore, 2005:229-238.
  • 6Plangprasopchok A, Lerman K, Getoor L. Growing a tree in the forest: Constructing folksonomies by integrating struc- tured metadata//Proceedings of the Conference on Knowl- edge Discovery and Data Mining. Washington, USA, 2010: 949-958.
  • 7Han Jia-Wei, Kamber M, Pei Jian. Data Mining: Concepts and Technologies. 3rd Edition. Massachusetts, USA: Morgan Kaufmann Publishers, 2011.
  • 8Segaran T. Programming Collective Intelligence: Building Smart Web 2.0 Applications. USA: O'Reilly Media, 2007.
  • 9郭岩,白硕,杨志峰,张凯.网络日志规模分析和用户兴趣挖掘[J].计算机学报,2005,28(9):1483-1496. 被引量:62
  • 10Brzozowski M J, Romero D M. Who should I follow? Recommending people in directed social networks//Proeeedings of the 5th International AAAI Conference on Weblogs and Social Media. Barcelona, Spain, 2011: 458-461.

二级参考文献79

  • 1郭岩.基于网络用户行为的搜索引擎系统SISI[J].计算机工程,2004,30(16):9-11. 被引量:1
  • 2Shardanand U, Maes P. Social information filtering: Algorithms for automating "Word of Mouth". In: Proc. of the Conf. on Human Factors in Computing Systems. New York: ACM Press, 1995.210-217.
  • 3Hill W, Stead L, Rosenstein M, Furnas G. Recommending and evaluating choices in a virtual community of use. In: Proc. of the Conf. on Human Factors in Computing Systems. New York: ACM Press, 1995. 194-201.
  • 4Resnick P, Iakovou N, Sushak M, Bergstrom P, Riedl J. GroupLens: An open architecture for collaborative filtering of netnews. In: Proc. of the Computer Supported Cooperative Work Conf. New York: ACM Press, 1994. 175-186.
  • 5Baeza-Yates R, Ribeiro-Neto B. Modern Information Retrieval. New York: Addison-Wesley Publishing Co., 1999.
  • 6Murthi BPS, Sarkar S. The role of the management sciences in research on personalization. Management Science, 2003,49(10): 1344-1362.
  • 7Smith SM, Swinyard WR. Introduction to marketing models. 1999. http://marketing.byu.edu/htmlpages/courses/693r/modelsbook/ preface.html
  • 8Adomavicius G, Tuzhilin A. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. on Knowledge and Data Engineering, 2005,17(6):734-749.
  • 9Resnick P, Varian HR. Recommender systems. Communications of the ACM, 1997,40(3):56-58.
  • 10Balabanovic M, Shoham Y. Fab: Content-Based, collaborative recommendation. Communications of the ACM, 1997,40(3):66-72.

共引文献600

同被引文献109

引证文献10

二级引证文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部