摘要
协同过滤的推荐性能受限于评分矩阵中数据的稀疏性。针对这个问题,提出一种基于粗糙集的协同过滤方法,能在一定程度上缓解数据稀疏性的影响。首先进行数据预处理,即利用用户评分数量以及用户的评分值,作为用户进行分类的特征值,对用户进行分类;其次,利用粗糙集属性约简的方法剔除对用户分类影响较小的项目,生成更小的用户-项目评分矩阵,以降低数据的稀疏性和规模;最后基于约简后的数据集进行用户相似度的计算,获得目标用户真正近邻。实验结果表明,所提算法在常用评价推荐性能的指标MAE,COVERAGE,PRECISION和RECALL中显著优于COS,PCC,ADCOS,NHSM算法中的指标。
In collaborative filtering,the recommendation performance is limited by the data sparsity of rating matrix. To alleviate this problem,a rough set based collaborative filtering method is proposed,which is helpful to reduce the influence of the data sparsity to some extent. First,the procedure of data pre-processing is executed. As the eigenvalue for user classification,user’s ratings with the rating number are employed. Then,an attribute reduction approach of rough set is introduced to eliminate the item which has little effect with respect to user classification,and a smaller scale user-item rating matrix is generated to lower the data sparsity.Finally,the similarity between users is calculated based on the reducing attributes to obtain real neighbors of active users. The experimental results demonstrate that the proposed algorithm in MAE,COVERAGE,PRECISION and RECALL are significantly better than the ones of COS,PCC,ADCOS,NHSM.
作者
郑荔平
胡敏杰
杨红和
林耀进
ZHENG Li-ping;HU Min-jie;YANG Hong-he;LIN Yao-jin(School of Computer Science,Minnan Normal University,Zhangzhou 363000,Fujian,China)
出处
《山东大学学报(理学版)》
CAS
CSCD
北大核心
2019年第2期41-50,共10页
Journal of Shandong University(Natural Science)
基金
国家自然科学基金资助项目(61303131)
福建省教育厅科技资助项目(JAT170350)
福建省教育厅科技资助项目(JAT170347)
关键词
协同过滤
属性约简
数据预处理
分类标记
粗糙集
collaborative filtering
attribute reduction
data preprocess
classification label
rough set