Abstract
Rating matrices are high-dimensional, sparse, and low-rank, and low-rank matrix recovery is the main approach to studying them. For these algorithms, the rank assumed for the rating matrix affects the recovery accuracy. However, there is currently no theory for estimating the rank of a rating matrix, which limits the application of these algorithms. This paper theoretically analyzes the relationship between the number of user clusters and the rank of the rating matrix, gives a method for computing the number of user clusters, and on this basis proposes a cluster-number-based rank-1 matrix recovery algorithm (Clusters Number Rank-1 Matrix Completion, CN-R1MC) to recover the rating matrix. Experiments on several recommender-system datasets show that the number of user clusters approximates the rank of the rating matrix well, which is important for improving the recovery accuracy of the rating matrix. The proposed algorithm has good application value.
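The link between the number of user clusters and the rank of the rating matrix can be illustrated with a toy example. The sketch below is not the CN-R1MC algorithm. It assumes an idealized, fully observed rating matrix in which all users in a cluster share one rating profile, and the names `build_clustered_ratings` and `numerical_rank` are hypothetical helpers introduced only for this illustration.

```python
import numpy as np


def build_clustered_ratings(n_users, n_items, n_clusters):
    """Toy rating matrix: every user in a cluster shares one rating profile."""
    # Assumption: each cluster rates "its" items 5 and all other items 1.
    item_block = np.arange(n_items) % n_clusters
    cluster_ids = np.arange(n_clusters)
    profiles = np.where(item_block[None, :] == cluster_ids[:, None], 5.0, 1.0)
    # Assign users to clusters round-robin so that no cluster is empty.
    user_cluster = np.arange(n_users) % n_clusters
    return profiles[user_cluster]


def numerical_rank(matrix, rel_tol=1e-8):
    """Count singular values that are non-negligible relative to the largest."""
    singular_values = np.linalg.svd(matrix, compute_uv=False)
    return int(np.sum(singular_values > rel_tol * singular_values[0]))


R = build_clustered_ratings(n_users=40, n_items=20, n_clusters=4)
rank_R = numerical_rank(R)
print(R.shape, rank_R)
```

In this construction the rank obtained from the singular value decomposition equals the number of clusters. Real rating matrices are sparse and noisy, so there the number of clusters can only approximate the rank, as the abstract states.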
Source
《计算机工程与应用》
CSCD
PKU Core (Peking University core journal list)
2015, No. 21, pp. 6-11, 47 (7 pages in total)
Computer Engineering and Applications
Funding
National Natural Science Foundation of China, Young Scientists Fund (No. 61402063)
Science and Technology Project of the Chongqing Municipal Education Commission (No. KJ1400612, No. KJ130709)
Chongqing Technology and Business University Project (No. 20135609)
Keywords
rating matrix
low-rank matrix completion
rank-one matrix
number of user clusters
singular value decomposition