期刊文献+

一种基于序列到序列时空注意力学习的交通流预测模型 被引量:34

A Sequence-to-Sequence Spatial-Temporal Attention Learning Model for Urban Traffic Flow Prediction
下载PDF
导出
摘要 城市交通流预测是研究交通时空序列数据的动态演化并预测未来交通情况的关键技术,对于智能交通预警及管理决策来讲至关重要.但是有效的交通流建模非常具有挑战性,因为它受到很多复杂因素的影响,例如交通网络的时空依赖性和序列突变性等问题.一些研究工作将卷积神经网络(convolutional neural networks, CNN)或循环神经网络(recurrent neural networks, RNN)用于交通流量预测建模.但是,直接使用经典的深度学习模型难以有效捕获与交通流相关的多通道多变量序列数据中的隐含时空依赖性特征.针对上述问题,提出了一种新的序列到序列时空注意力深度学习框架(spatial-temporal attention traffic forecasting, STATF)来处理城市交通流建模任务,它是一种基于卷积LSTM编码层和LSTM解码层,并辅助注意力机制的端到端深度学习模型,可以自适应地学习与城市交通流相关的多通道多变量时空序列数据中的时空依赖性和非线性相关性特征.基于3个真实的交通流数据集实验结果表明:不管是单步预测还是多步预测条件下,STATF模型都具有更优的预测性能. Urban traffic flow prediction is a key technology to study the behavior of traffic-related big data and predict future traffic flow, which is crucial to guide the early warning of traffic congestion in the intelligent transportation system. But effective traffic flow prediction is very challenging as it is affected by many complex factors, e.g. spatial-temporal dependency and temporal dynamics of traffic networks. In the literature, some research works applied convolutional neural networks(CNN) or recurrent neural networks(RNN) for traffic flow prediction. However, it is difficult for these models to capture the spatial-temporal correlation features of traffic flow related temporal data. In this paper, we propose a novel sequence-to-sequence spatial-temporal attention framework to deal with the urban traffic flow forecasting task. It is an end-to-end deep learning model which is based on convolutional LSTM layers and LSTM layers with attention mechanism to adaptively learn spatial-temporal dependency and non-linear correlation features of urban traffic flow related multivariate sequence data. Extensive experimental results based on three real-world traffic flow datasets show that our model has the best forecasting performance compared with state-of-the-art methods.
作者 杜圣东 李天瑞 杨燕 王浩 谢鹏 洪西进 Du Shengdong;Li Tianrui;Yang Yan;Wang Hao;Xie Peng;Horng Shi-Jinn(School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031;Department of Computer Science and Information Engineering,Taiwan University of Science and Technology,Taipei 10607)
出处 《计算机研究与发展》 EI CSCD 北大核心 2020年第8期1715-1728,共14页 Journal of Computer Research and Development
基金 国家重点研发计划项目(2019YFB2101801) 国家自然科学基金项目(61773324,61976247)。
关键词 交通流预测 长短时记忆网络 序列到序列学习 时空注意力 编码器-解码器 traffic flow prediction long short-term memory networks sequence-to-sequence learning spatial-temporal attention encoder-decoder
  • 相关文献

参考文献6

二级参考文献127

  • 1赵金山,狄增如,王大辉.北京市公共汽车交通网络几何性质的实证研究[J].复杂系统与复杂性科学,2005,2(2):45-48. 被引量:45
  • 2陆化普,石冶.Complexity of Public Transport Networks[J].Tsinghua Science and Technology,2007,12(2):204-213. 被引量:13
  • 3李英,周伟,郭世进.上海公共交通网络复杂性分析[J].系统工程,2007,25(1):38-41. 被引量:65
  • 4MarkoffJ. How many computers to identify a cat?[NJ The New York Times, 2012-06-25.
  • 5MarkoffJ. Scientists see promise in deep-learning programs[NJ. The New York Times, 2012-11-23.
  • 6李彦宏.2012百度年会主题报告:相信技术的力量[R].北京:百度,2013.
  • 710 Breakthrough Technologies 2013[N]. MIT Technology Review, 2013-04-23.
  • 8Rumelhart D, Hinton G, Williams R. Learning representations by back-propagating errors[J]. Nature. 1986, 323(6088): 533-536.
  • 9Hinton G, Salakhutdinov R. Reducing the dimensionality of data with neural networks[J]. Science. 2006, 313(504). Doi: 10. 1l26/science. 1127647.
  • 10Dahl G. Yu Dong, Deng u, et a1. Context-dependent pre?trained deep neural networks for large vocabulary speech recognition[J]. IEEE Trans on Audio, Speech, and Language Processing. 2012, 20 (1): 30-42.

共引文献1190

同被引文献250

引证文献34

二级引证文献118

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部