摘要
为推进城际交通大数据的应用,需要补全出行目的信息,将团体旅客出行目的决策与文本主题生成类比,开发基于无监督学习框架的出行目的推断方法.提出嵌入出发时间生成模块的主题模型,以及团体旅客重建和语义化特征设计方法,并通过吉布斯采样估计参数.基于调查数据的模型对比研究发现,模型对一般私务辨识性能提升7.7%;基于票务数据的案例研究发现,模型对出发时间预测精度达到90.9%,间接验证了模型的可靠性.主题标注表明,模型不仅推断出4种与典型模式相符的出行目的,还辨识出既有认识外的非常规模式.对道路客运分析表明,出行目的构成呈现地区差异,高铁开通对不同出行目的出行量的负向影响程度不一.
To obtain trip purpose missing in big data derived from intercity transportation for deeper application,by drawing an analogy between the decision-making of trip purpose in group passengers and the generation of topics in texts,this study develops an approach for trip purpose inference under the unsupervised learning framework.First,a modified topic model embedded with the generation process of start time was proposed.Second,methods for reconstructing group passengers and designing semantic features were presented.Finally,the parameters were estimated using Gibbs sampling.Model comparison based on the survey data manifests that the performance of identifying personal affairs is raised by 7.7 percent using the proposed model;a case study based on the ticket sales data demonstrates that the precision of predicting start time is 90%,providing an indirect proof of its reliability.The topic annotation reveals that not only trip purpose corresponding to four typical patterns are inferred,but also anomalies beyond existing knowledge are recognized.In regard to the road passenger transport,trip purpose configuration shows a regional disparity,and whether high speed rail(HSR)has reached has diverse negative effects on the ridership of different trip purposes.
作者
钱剑培
邵春福
李军
蔡楠
黄士琛
QIAN Jian-pei;SHAO Chun-fu;LI Jun;CAI Nan;HUANG Shi-chen(Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport,Beijing Jiaotong University,Beijing 100044,China;Institute of Transportation Information Standardization,China Transport Telecommunications&Information Center,Beijing 100011,China;Nantong Urban Planning&Design Institute Co.,Ltd,Nantong 226004,Jiangsu,China)
出处
《交通运输系统工程与信息》
EI
CSCD
北大核心
2020年第6期99-105,共7页
Journal of Transportation Systems Engineering and Information Technology
基金
国家自然科学基金创新研究群体科学基金(71621001).
关键词
交通工程
出行目的推断
主题模型
面板回归模型
道路客运
票务数据
traffic engineering
trip purpose inference
topic model
panel regression model
road passenger transport
ticket sales data