摘要
互联网上的有关交通的网页文本数据常常是非结构化、分散性的,面对不断增长的海量信息,如何从中提取出有用的交通信息具有一定难度.传统的信息采集处理方法很难高效准确地完成海量信息处理.由此,网络爬虫技术则显示出其优越性.文中介绍了网络爬虫技术的基本内容,总结了各类交通信息获取方法的研究,从不同方面综述了国内外应用网络爬虫技术解决交通信息获取问题的研究历史和现状,展望了网络爬虫技术在交通中的应用前景.
Web page text data about traffic on the Internet is often unstructured and scattered,so it is difficult to extract useful traffic information from the ever-increasing mass information.Traditional information collection and processing methods are difficult to process massive information efficiently and accurately,thus web crawler technology shows its superiority.This paper introduced the basic content of web crawler technology,and summarized the research of various traffic information acquisition methods.Moreover,the research history and present situation of applying web crawler technology to solve the problem of traffic information acquisition at home and abroad were summarized from different aspects,and the application prospect of web crawler technology in traffic was prospected.
作者
秦雅琴
马玲玲
QIN Yaqin;MA Lingling(Faculty of Transportation Engineering,Kunming University of Science and Technology,Kunming 650500,China)
出处
《武汉理工大学学报(交通科学与工程版)》
2020年第3期456-461,共6页
Journal of Wuhan University of Technology(Transportation Science & Engineering)
基金
国家自然科学基金项目资助(71861016)。
关键词
交通工程
交通信息
网络爬虫技术
综述
traffic engineering
traffic information
web crawler technology
review