摘要
由于旅游数据集具有隐式反馈和极度稀疏性特点,限制了已有旅游线路推荐算法的性能。为解决上述问题,提出基于嵌入表示的改进协同过滤旅游线路推荐算法。首先,利用词向量模型将每条旅游线路表示成低维向量,并根据游客参与过的线路得到游客兴趣的向量表示;其次,根据旅游线路间的相似性得到游客的共现线路集合,并根据其相似性利用改进协同过滤算法完成线路推荐;最后,经某真实旅游数据集验证,该算法可明显提高旅游线路推荐算法的性能。
The performance of the existing tourism recommendation methods are limited because of the implicit feedback and extreme sparsity of tourism data sets.To solve the problem,an improved collaborative filtering algorithm based on embedding is proposed.Firstly,every route is represented as a low dimensional vector by using Doc2vector and every tourist is represented as a low dimensional vector based on all routes he/she had taken.Secondly,the co-occurrence routes between two tourists are obtained on the similarity of routes.Then the similarities among tourists are calculated according to the set of co-occurrence routes among them.Then,the routes are recommended by the improved collaborative filtering algorithm.Finally,the proposed method is proved to be effective on a real tourism data set.
作者
王洪建
WANG Hongjian(Xiamen Airlines Co.,Ltd.,Fujian Xiamen 361006,China)
出处
《中国民航大学学报》
CAS
2021年第5期40-43,共4页
Journal of Civil Aviation University of China
关键词
词向量
改进协同过滤
相似性
稀疏性
word vector
improved collaborative filtering
similarity
data sparsity