摘要
城市轨道交通站点受多层面因素交互作用而反映出异质性,为实现站点精细化分类,本文统筹考虑地铁刷卡数据、兴趣点数据和地铁网络数据,提取客流、土地利用和网络性质等特征,其中,客流层面考虑工作日、周末和节假日等不同日期类型下客流状态,土地利用层面考虑站点辐射区用地强度和均衡性,网络层面考虑节点自身特性和影响能力。构建基于主成分分析与K-means++算法的聚类模型,综合聚类评价指标确定簇数,辨析不同类型站点多维度特性,结合站区土地利用和站点网络特征探讨对出行活动的影响,并设计簇内联合预测和整体联合预测策略,采用3种多元时序预测方法探究站点聚类对预测性能的影响。研究结果表明:考虑全部客流特征时,划分为10簇,考虑工作日进站客流特征时,划分为5簇,充分挖掘客流时变特征能够获得更加精细化的聚类结果;各簇站点客流分布特征与其土地利用及网络特征间存在一定的反馈关系;相比于整体联合预测,通过聚类联合相关性强的站点进行预测,以间接捕获空间相关性的方式能有效提升预测性能,各模型均方根误差平均降低9.04%,平均绝对误差平均降低4.94%。研究结果为站点精细化管理和站区设施建设规划提供依据。
Urban rail transit stations have heterogeneity due to the interaction of multi-level factors.This paper classifies the rail transit stations using the passenger card data,interest point data and subway network data,and extracts passenger flow,land use and network properties.The passenger flow property considers the different status under weekdays,weekends and holidays.The land use property considers the land use intensity and balance of the radiation area of the station.The network property considers the characteristics of nodes and influence capabilities.The paper develops the clustering model of the K-means++algorithm and uses comprehensive clustering evaluation indicators to determine the number of clusters,distinguish and analyze the multi-dimensional characteristics of different types of stations.It combines the land use of the station area and the characteristics of the station network to analyze the impact on travel activities,and designs the joint prediction within the cluster and the overall joint.Three multivariate time series forecasting methods are used to evaluate the impact of site clustering on forecasting performance.The results show that the rail transit station can be divided into 10 clusters when consider the characteristics of all passenger flow.The rail transit stations can be divided into 5 clusters when consider the characteristics of inbound passenger flow on weekdays.Fully mining the time-varying characteristics of passenger flow can obtain more refined clustering results.There is a certain feedback relationship between passenger flow distribution characteristics and its land use and network characteristics.Compared to the overall joint prediction,the proposed method captures spatial correlation through clustering and joint prediction of sites with strong correlation and can effectively improve the prediction performance.The average reduction in root mean square error was 9.04%,and the average reduction in mean absolute error was 4.94%.The study results provide a basis for the refined management of the station and the planning of station facility construction.
作者
户佐安
邓锦程
杨江浩
赵妍
HU Zuo-an;DENG Jin-cheng;YANG Jiang-hao;ZHAO Yan(School of Transportation and Logistics,Southwest Jiaotong University,Chengdu 611756,China;National Engineering Laboratory of Integrated Transportation Big Data Application Technology,Southwest Jiaotong University,Chengdu 611756,China;National United Engineering Laboratory of Integrated and Intelligent Transportation,Southwest Jiaotong University,Chengdu 611756,China;College of Resources and Environment,Chengdu University of Information Technology,Chengdu 610225,China)
出处
《交通运输系统工程与信息》
EI
CSCD
北大核心
2023年第6期227-238,共12页
Journal of Transportation Systems Engineering and Information Technology
基金
国家自然科学基金(61104175)
四川省科技计划项目(2021YJ0067)。
关键词
城市交通
站点聚类
机器学习
轨道交通站点
多源数据
客流预测
urban traffic
station clustering
machine learning
rail transit station
multi-source data
passenger flow forecast