摘要
基于上海市出租车轨迹数据,使用Hadoop分布式计算框架对出租车GPS数据进行处理,以出租车为对象提取其轨迹,并计算出租车的载客平均距离、轨迹中心与城区中心距离与回转半径3个指标,采用k-means++模型分类并定义5种出租车运行模式。通过对比不同时间划分标准下各运行模式出租车的时间、里程载客率与收入情况,分析各类出租车在各时间段的运行效益与运行特征。
Based on the taxi trajectory data in Shanghai,this paper uses the Hadoop distributed computing framework to process the taxi GPS data,extracts the taxi trajectory as the object,and calculates the taxi’s average distance to carry passengers,the distance between the center of the trajectory and the center of the city and the radius of gyration,and uses the k-means++model to classify and define five taxi operation patterns.By comparing the operation time,spatial passenger load rate and revenue of each operation pattern taxis under different time division criteria,the operation efficiency and operation characteristics of each type of taxis in each time period are analyzed.
作者
蔡宇阳
李伯钊
牛彦芬
汪有为
CAI Yuyang;LI Bozhao;NIU Yanfen;WANG Youwei(School of Geography and Information Engineering,China University of Geosciences,Wuhan 430074,China;School of Resources and Environmental Sciences,Wuhan University,Wuhan 430079,China;Beijing PalmGo InfoTech Co.,Ltd.,Beijing 100022,China)
出处
《测绘地理信息》
CSCD
2023年第4期146-150,共5页
Journal of Geomatics
基金
国家重点研发计划(2018YFF0215006)。
关键词
出租车轨迹数据
运行模式识别
时空特征挖掘
出租车效益分析
Taxi trajectory data
operation pattern identification
spatio-temporal feature mining
Taxi efficiency analysis