期刊文献+

Scientific articles recommendation with topic regression and relational matrix factorization

Scientific articles recommendation with topic regression and relational matrix factorization
原文传递
导出
摘要 In this paper we study the problem of recommending scientific articles to users in an online community with a new perspective of considering topic regression modeling and articles relational structure analysis simultaneously. First, we present a novel topic regression model, the topic regression matrix factorization (tr-MF), to solve the problem. The main idea of tr-MF lies in extending the matrix factorization with a probabilistic topic modeling. In particular, tr-MF introduces a regression model to regularize user factors through probabilistic topic modeling under the basic hypothesis that users share similar preferences if they rate similar sets of items. Consequently, tr-MF provides interpretable latent factors for users and items, and makes accurate predictions for community users. To incorporate the relational structure into the framework of tr-MF, we introduce relational matrix factorization. Through combining tr-MF with the relational matrix femtorization, we propose the topic regression collective matrix factorization (tr-CMF) model. In addition, we also present the collaborative topic regression model with relational matrix factorization (CTR-RMF) model, which combines the existing collaborative topic regression (CTR) model and relational matrix factorization (RMF). From this point of view, CTR-RMF can be considered as an appropriate baseline for tr-CMF. Further, we demonstrate the efficacy of the proposed models on a large subset of the data from CiteULike, a bibliography sharing service dataset. The proposed models outperform the state-of-the-art matrix factorization models with a significant margin. Specifically, the proposed models are effective in making predictions for users with only few ratings or even no ratings, and support tasks that are specific to a certain field, neither of which has been addressed in the existing literature. In this paper we study the problem of recommending scientific articles to users in an online community with a new perspective of considering topic regression modeling and articles relational structure analysis simultaneously. First, we present a novel topic regression model, the topic regression matrix factorization(tr-MF), to solve the problem. The main idea of tr-MF lies in extending the matrix factorization with a probabilistic topic modeling.In particular, tr-MF introduces a regression model to regularize user factors through probabilistic topic modeling under the basic hypothesis that users share similar preferences if they rate similar sets of items. Consequently,tr-MF provides interpretable latent factors for users and items, and makes accurate predictions for community users.To incorporate the relational structure into the framework of tr-MF, we introduce relational matrix factorization.Through combining tr-MF with the relational matrix factorization, we propose the topic regression collective matrix factorization(tr-CMF) model. In addition, we also present the collaborative topic regression model with relational matrix factorization(CTR-RMF) model, which combines the existing collaborative topic regression(CTR) model and relational matrix factorization(RMF). From this point of view, CTR-RMF can be considered as an appropriate baseline for tr-CMF. Further, we demonstrate the efficacy of the proposed models on a large subset of the data from CiteULike, a bibliography sharing service dataset. The proposed models outperform the state-of-the-art matrix factorization models with a significant margin. Specifically, the proposed models are effective in making predictions for users with only few ratings or even no ratings, and support tasks that are specific to a certain field, neither of which has been addressed in the existing literature.
出处 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2014年第11期984-998,共15页 浙江大学学报C辑(计算机与电子(英文版)
基金 Project supported by the National Basic Research Program(973) of China(No.2012CB316400) Zhejiang University–Alibaba Financial Joint Lab,Zhejiang Provincial Engineering Center on Media Data Cloud Processing and Analysis the National Science Foundation of the United States(Nos.IIS0812114 and CCF-1017828)
  • 相关文献

参考文献27

  • 1Agarwal, D., Chen, B.C., 2009. Regression-based latent factor models. Proc. 15th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.19-28.
  • 2Agarwal, D., Chen, B.C., 2010. fLDA: matrix factorization through latent dirichlet allocation. Proc. 3rd ACM Int. Conf. on Web Search and Data Mining, p.91-100.
  • 3Bell, R.M., Koren, Y., 2007. Scalable collaborative filtering with jointly derived neighborhood interpolation weights. Proc. 7th IEEE Int. Conf. on Data Mining, p.43-52.
  • 4Bell, R.M., Koren, Y., Volinsky, C., 2007. Modeling relationships at multiple scales to improve accuracy of large recommender systems. Proc. 13th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.95-104.
  • 5Blei, D.M., Ng, A.Y., Jordan, M.I., 2003. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993-1022.
  • 6Dempster, A.P., Laird, N.M., Rubin, D.B., 1977. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B-Stat. Methdol., 39(1):1-38.
  • 7Ge, Y., Liu, Q., Xiong, H., et al., 2011. Cost-aware travel tour recommendation. Proc. 17th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.983-991.
  • 8Hu, Y., Koren, Y., Volinsky, C., 2008. Collaborative filtering for implicit feedback datasets. Proc. 8th IEEE Int. Conf. on Data Mining, p.263-272. [doi:10.1109/ICDM.2008.22.
  • 9Jiang, M., Cui, P., Liu, R., et al., 2012. Social contextual recommendation. Proc. 21st ACM Int. Conf. on Information and Knowledge Management, p.45-54. [doi:10.1145/2396761.2396771.
  • 10Jin, X., Zhou, Y., Mobasher, B., 2005. A maximum entropy web recommendation system: combining collaborative and content features. Proc. 11th ACM SIGKDD Int. Conf. on Knowledge Discovery in Data Mining, p.612-617.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部