期刊文献+

基于BTS数据集的航班延误分类和预测算法

Delay Classification and Prediction Algorithm Based on BTS Flight Data Set
下载PDF
导出
摘要 针对神经网络分类模型对美国联邦运输统计局(Bureau of Transportation Statistics,BTS)航班数据集中的不均衡数据预测误差较大的问题,采用自适应合成采样算法(adaptive synthetic sampling approach,ADASYN)和合成少数类过采样算法(synthetic minority over-sampling technique,SMOTE)对航班延误类别进行平衡处理,并用随机森林(random forest,RF)模型进行训练和贝叶斯调参。结果表明:与不经过平衡采样的方法比较,该方法在权重平均下的精确率、召回率和F 1评分分别提高了19%、8%和16%;分类预测准确率提升8.03%,模型拟合指数AUC(area under curve)提升5.4%。同时,采用多特征相融合的图神经网络模型Graph WaveNet对航班平均延误时间进行预测。实验结果表明:与单特征模型比较,该模型平均绝对误差和均方根误差分别降低了16%和12.45%。这些方法和结果对研究航班延误分类和预测算法研究具有参考价值。 To address the problem that the neural network classification model has a large prediction error on the unbalanced data in the Bureau of Transportation Statistics(BTS)flight dataset,the adaptive synthetic sampling approach(ADASYN)and the synthetic minority over-sampling technique(SMOTE)were used to balance the flight delay categories,ADASYN and synthetic minority over-sampling technique(SMOTE)were used to balance the flight delay categories,and the random forest(random forest,RF)model was used for training and Bayesian conditioning.The results show that compared with the method without balanced sampling,the accuracy,recall and F 1 score of the method under weight averaging are improved by 19%,8%and 16%,respectively.The classification prediction accuracy is improved by 8.03%and the model fit index area under curve(AUC)is improved by 5.4%.Meanwhile,Graph WaveNet,a multi-feature fusion graph neural network model,was used to predict the average flight delay time.The experimental results show that the average absolute error and root mean square error of the model are reduced by 16%and 12.45%,respectively,compared with the single-feature model.These methods and results are of reference value for studying flight delay classification and prediction algorithm research.
作者 郭海州 杨晶晶 吴季达 张彬 黄铭 GUO Hai-zhou;YANG Jing-jing;WU Ji-da;ZHANG Bin;HUANG Ming(School of Information Science and Engineering,Yunnan University,Kunming 650091,China;Radio Monitoring Center of Yunnan Province,Kunming 650228,China)
出处 《科学技术与工程》 北大核心 2023年第12期5304-5311,共8页 Science Technology and Engineering
基金 国家自然科学基金(61863035,62261059,61963037)。
关键词 不平衡分类数据 平衡采样算法 随机森林(RF)模型 图神经网络 特征融合 unbalanced classification data balanced sampling algorithm random forest(RF)model graph neural network feature fusion
  • 相关文献

参考文献9

二级参考文献50

共引文献140

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部