摘要
时空数据具有多维关联特性,而深度学习恰恰因其能够对复杂高维数据进行高层抽象处理而备受关注.本文依据轨迹数据特征给出其形式化定义,并据此构建基于Word2vec的时空语义轨迹模型.通过模型网络训练位置特征向量,对不同时间粒度下的用户轨迹进行语义探究.实验中采取Top-K近邻预测和聚类分析等手段验证了轨迹模型在无监督式学习下输出的位置向量具备空间语义且定型良好.其结果也进一步检验了基于词向量的语言模型迁移至轨迹挖掘的研究具备可行性.
Spatial temporal data has associated multidimensional features. Deep learning has attracted much attention due to its ability to perform high-level abstraction of complex data. In this paper, we give the definition of the track-data based on its characteristics, and build a spatial temporal semantic trajectory model using Word2 vec as its foundation. We explore the semantics of different user-tracks under varying time periods by training position vectors in the model network. During the experiments, we use Top-K neighbor prediction and cluster analysis to verify that the position vector has both good semantic meaning and structure. The vector is derived from a trajectory model that employs unsupervised learning. The results also test a word-vector-based language model that can be applied to the study of trajectory mining.
出处
《中国科学:信息科学》
CSCD
北大核心
2017年第8期1051-1065,共15页
Scientia Sinica(Informationis)
基金
国家重点基础研究发展计划(973)(批准号:2015CB352502)
国家自然科学基金(批准号:61272092
61572289)
山东省自然科学基金(批准号:ZR2015FM002
ZR2016FB14)资助项目
关键词
社交网络
时空轨迹
语义分析
特征向量
深度学习
social networks
spatial and temporal trajectory
semantic analysis
feature vector
deep learning