期刊文献+
共找到1,269篇文章
< 1 2 64 >
每页显示 20 50 100
Accuracy Assessment and Guidelines for Manual Traffic Counts from Pre-Recorded Video Data
1
作者 Mishuk Majumder Chester Wilmot 《Journal of Transportation Technologies》 2023年第4期497-523,共27页
Traffic count is the fundamental data source for transportation planning, management, design, and effectiveness evaluation. Recording traffic flow and counting from the recorded videos are increasingly used due to con... Traffic count is the fundamental data source for transportation planning, management, design, and effectiveness evaluation. Recording traffic flow and counting from the recorded videos are increasingly used due to convenience, high accuracy, and cost-effectiveness. Manual counting from pre-recorded video footage can be prone to inconsistencies and errors, leading to inaccurate counts. Besides, there are no standard guidelines for collecting video data and conducting manual counts from the recorded videos. This paper aims to comprehensively assess the accuracy of manual counts from pre-recorded videos and introduces guidelines for efficiently collecting video data and conducting manual counts by trained individuals. The accuracy assessment of the manual counts was conducted based on repeated counts, and the guidelines were provided from the experience of conducting a traffic survey on forty strip mall access points in Baton Rouge, Louisiana, USA. The percentage of total error, classification error, and interval error were found to be 1.05 percent, 1.08 percent, and 1.29 percent, respectively. Besides, the percent root mean square errors (RMSE) were found to be 1.13 percent, 1.21 percent, and 1.48 percent, respectively. Guidelines were provided for selecting survey sites, instruments and timeframe, fieldwork, and manual counts for an efficient traffic data collection survey. 展开更多
关键词 Traffic Survey Counting Error Transportation Planning Total Error Collecting video data Classification Error Standard Guidelines Repeated Counts Interval Error
下载PDF
Using microscopic video data measures for driver behavior analysis during adverse winter weather:opportunities and challenges 被引量:1
2
作者 Ting Fu Sohail Zangenehpour +2 位作者 Paul St-Aubin Liping Fu Luis F.Miranda-Moreno 《Journal of Modern Transportation》 2015年第2期81-92,共12页
This paper presents a driver behavior analysis using microscopic video data measures including vehicle speed, lane-changing ratio, and time to collision. An analytical framework was developed to evaluate the effect of... This paper presents a driver behavior analysis using microscopic video data measures including vehicle speed, lane-changing ratio, and time to collision. An analytical framework was developed to evaluate the effect of adverse winter weather conditions on highway driving behavior based on automated (computer) and manual methods. The research was conducted through two case studies. The first case study was conducted to evaluate the feasibility of applying an au- tomated approach to extracting driver behavior data based on 15 video recordings obtained in the winter 2013 at three dif- ferent locations on the Don Valley Parkway in Toronto, Canada. A comparison was made between the automated approach and manual approach, and issues in collecting data using the automated approach under winter conditions were identified. The second case study was based on high quality data collected in the winter 2014, at a location on Highway 25 in Montreal, Canada. The results demonstrate the effectiveness of the automated analytical framework in analyzing driver behavior, as well as evaluating the impact of adverse winter weather conditions on driver behavior. This approach could be applied to evaluate winter maintenance strategies and crash risk on highways during adverse winter weather conditions. 展开更多
关键词 WINTER video data collection Issues Driver behavior Time to collision Winter roadmaintenance
下载PDF
A Novel Video Data-Source Authentication Model Based on Digital Watermarking and MAC in Multicast
3
作者 ZHAO Anjun LU Xiangli GUO Lei 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1257-1261,共5页
A novel video data authentication model based on digital video watermarking and MAC (message authentication code) in multicast protocol is proposed in this paper, The digital watermarking which composes of the MAC o... A novel video data authentication model based on digital video watermarking and MAC (message authentication code) in multicast protocol is proposed in this paper, The digital watermarking which composes of the MAC of the significant vid eo content, the key and instant authentication data is embedded into the insignificant video component by the MLUT (modified look-up table) video watermarking technology. We explain a method that does not require storage of each data packet for a time, thus making receiver not vulnerable to DOS (denial of service) attack. So the video packets can be authenticated instantly without large volume buffer in the receivers. TESLA (timed efficient stream loss tolerant authentication) does not explain how to select the suitable value for d, which is an important parameter in multicast source authentication. So we give a method to calculate the key disclosure delay (number of intervals). Simulation results show that the proposed algorithms improve the performance of data source authentication in multicast. 展开更多
关键词 video data authentication MULTICAST MAC(message authentication code) digital watermarking MLUT(modifled look-up table)
下载PDF
Optimization-Based Fragmental Transmission Method for Video Data in Opportunistic Networks 被引量:1
4
作者 Peng Li Xiaoming Wang +2 位作者 Junling Lu Lichen Zhang Zaobo He 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2017年第4期389-399,共11页
Multimedia data have become popularly transmitted content in opportunistic networks. A large amount of video data easily leads to a low delivery ratio. Breaking up these big data into small pieces or fragments is a re... Multimedia data have become popularly transmitted content in opportunistic networks. A large amount of video data easily leads to a low delivery ratio. Breaking up these big data into small pieces or fragments is a reasonable option. The size of the fragments is critical to transmission efficiency and should be adaptable to the communication capability of a network. We propose a novel communication capacity calculation model of opportunistic network based on the classical random direction mobile model, define the restrain facts model of overhead, and present an optimal fragment size algorithm. We also design and evaluate the methods and algorithms with video data fragments disseminated in a simulated environment. Experiment results verified the effectiveness of the network capability and the optimal fragment methods. 展开更多
关键词 opportunistic network communication capabilities fragment video data
原文传递
StreamTune: dynamic resource scheduling approach for workload skew in video data center
5
作者 Yihong GAO Huadong MA 《Frontiers of Computer Science》 SCIE EI CSCD 2018年第4期669-681,共13页
Video surveillance applications need video data center to provide elastic virtual machine (VM) provisioning. However, the workloads of the VMs are hardly to be predicted for online video surveillance service. The un... Video surveillance applications need video data center to provide elastic virtual machine (VM) provisioning. However, the workloads of the VMs are hardly to be predicted for online video surveillance service. The unknown arrival workloads easily lead to workload skew among VMs. In this paper, we study how to balance the workload skew on online video surveillance system. First, we design the system framework for online surveillance service which con- sists of video capturing and analysis tasks. Second, we propose StreamTune, an online resource scheduling approach for workload balancing, to deal with irregular video analysis workload with the minimum number of VMs. We aim at timely balancing the workload skew on video analyzers without depending on any workload prediction method. Furthermore, we evaluate the performance of the proposed approach using a traffic surveillance application. The experimental results show that our approach is well adaptive to the variation of workload and achieves workload balance with less VMs. 展开更多
关键词 video data center load balancing stream computing online video analysis scheduling algorithm
原文传递
Minimizing Resource Cost for Camera Stream Scheduling in Video Data Center
6
作者 Yi-Hong Gao Hua-Dong Ma Wu Liu 《Journal of Computer Science & Technology》 SCIE EI CSCD 2017年第3期555-570,共16页
Video surveillance service, which receives live streams from IP cameras and forwards the streams to end users, has become one of the most popular services of video data center. The video data center focuses on minimiz... Video surveillance service, which receives live streams from IP cameras and forwards the streams to end users, has become one of the most popular services of video data center. The video data center focuses on minimizing the resource cost during resource provisioning for the service. However, little of the previous work comprehensively considers the bandwidth cost optimization of both upload and forwarding streams, and the capacity of the media server. In this paper, we propose an efficient resource scheduling approach for online multi-camera video forwarding, which tries to optimize the resource sharing of media servers and the networks together. Firstly, we not only provide a fine-grained resource usage model for media servers, but also evaluate the bandwidth cost of both upload and forwarding streams. Without loss of generality, we utilize two resource pricing models with different resource cost functions to evaluate the resource cost: the linear cost function and the non-linear cost functions. Then, we formulate the cost minimization problem as a constrained integer programming problem. For the linear resource cost function, the drift-plus-penalty optimization method is exploited in our approach. For non-linear resource cost functions, the approach employs a heuristic method to reduce both media server cost and bandwidth cost. The experimental results demonstrate that our approach obviously reduces the total resource costs on both media servers and networks simultaneously. 展开更多
关键词 video data center resource scheduling video surveillance as a service multi-camera networking
原文传递
Human Stress Recognition by Correlating Vision and EEG Data
7
作者 S.Praveenkumar T.Karthick 《Computer Systems Science & Engineering》 SCIE EI 2023年第6期2417-2433,共17页
Because stress has such a powerful impact on human health,we must be able to identify it automatically in our everyday lives.The human activity recognition(HAR)system use data from several kinds of sensors to try to r... Because stress has such a powerful impact on human health,we must be able to identify it automatically in our everyday lives.The human activity recognition(HAR)system use data from several kinds of sensors to try to recognize and evaluate human actions automatically recognize and evaluate human actions.Using the multimodal dataset DEAP(Database for Emotion Analysis using Physiological Signals),this paper presents deep learning(DL)technique for effectively detecting human stress.The combination of vision-based and sensor-based approaches for recognizing human stress will help us achieve the increased efficiency of current stress recognition systems and predict probable actions in advance of when fatal.Based on visual and EEG(Electroencephalogram)data,this research aims to enhance the performance and extract the dominating characteristics of stress detection.For the stress identification test,we utilized the DEAP dataset,which included video and EEG data.We also demonstrate that combining video and EEG characteristics may increase overall performance,with the suggested stochastic features providing the most accurate results.In the first step,CNN(Convolutional Neural Network)extracts feature vectors from video frames and EEG data.Feature Level(FL)fusion that combines the features extracted from video and EEG data.We use XGBoost as our classifier model to predict stress,and we put it into action.The stress recognition accuracy of the proposed method is compared to existing methods of Decision Tree(DT),Random Forest(RF),AdaBoost,Linear Discriminant Analysis(LDA),and KNearest Neighborhood(KNN).When we compared our technique to existing state-of-the-art approaches,we found that the suggested DL methodology combining multimodal and heterogeneous inputs may improve stress identification. 展开更多
关键词 Mental stress physiological data XGBoost feature fusion DEAP video data EEG CNN HAR
下载PDF
Design and Implementation of GIS Data Production System for Expressway Video and Road Infostructure 被引量:2
8
作者 Yuan Liu Jianhua Liu Guoqiang Feng 《International Journal of Geosciences》 2021年第1期23-38,共16页
The expressway is necessary for the development of the modern transportation industry, and the level of expressway construction reflects the overall grade of national or regional economic development. In order to proc... The expressway is necessary for the development of the modern transportation industry, and the level of expressway construction reflects the overall grade of national or regional economic development. In order to process the expressway road property data information, based on the current mainstream Windows operating system, this study utilizes Geographic Information System (GIS) development technology, road video processing technology, and spatial data mining method to design and develop an expressway video and road infostructure GIS data production system. The system designs a multi-layer distributed application model in accordance with the ideas and methods of GIS engineering and the characteristics of road production data. In addition, according to the characteristics and specification requirements of basic geographic data, the road production database of spatial data and attribute data integrated storage is constructed by combining database and spatial data engine. Through the development of the GIS data production system for expressway video and road infostructure, various functions such as generation of road property data, dynamic management of road infostructure, and visualization of spatial information have been realized. The system focuses on improving the production efficiency and automation level of expressway production data and meet</span><span style="font-family:Verdana;">s</span><span style="font-family:Verdana;"> the construction requirements for modernization, informatization, and intelligence of expressways. 展开更多
关键词 EXPRESSWAY GIS Road Infostructure video Road Property data
下载PDF
Recommendations for Big Data in Online Video Quality of Experience Assessment
9
作者 Ethan Court Kapilan Radhakrishnan +1 位作者 Kemi Ademoye Stephen Hole 《Journal of Computer and Communications》 2016年第5期24-31,共8页
Real-time video application usage is increasing rapidly. Hence, accurate and efficient assessment of video Quality of Experience (QoE) is a crucial concern for end-users and communication service providers. After cons... Real-time video application usage is increasing rapidly. Hence, accurate and efficient assessment of video Quality of Experience (QoE) is a crucial concern for end-users and communication service providers. After considering the relevant literature on QoS, QoE and characteristics of video trans-missions, this paper investigates the role of big data in video QoE assessment. The impact of QoS parameters on video QoE are established based on test-bed experiments. Essentially big data is employed as a method to establish a sensible mapping between network QoS parameters and the resulting video QoE. Ultimately, based on the outcome of experiments, recommendations/re- quirements are made for a Big Data-driven QoE model. 展开更多
关键词 Quality of Experience QOE Big data ONLINE video TRAFFIC
下载PDF
短视频消费时长能够反映就业率吗?
10
作者 白小虎 程开明 叶子豪 《商业经济与管理》 CSSCI 北大核心 2024年第3期92-104,共13页
鉴于现行就业统计方式不能及时、准确反映经济运行形势,如何利用大数据对现有统计指标进行补充成为一个重要问题。鉴于短视频消费已经成为当今社会主要的休闲娱乐方式,文章将短视频消费纳入效用函数,通过失业人员与雇员在时间分配上的差... 鉴于现行就业统计方式不能及时、准确反映经济运行形势,如何利用大数据对现有统计指标进行补充成为一个重要问题。鉴于短视频消费已经成为当今社会主要的休闲娱乐方式,文章将短视频消费纳入效用函数,通过失业人员与雇员在时间分配上的差异,揭示全社会用于观看短视频的时间与就业率之间存在的理论联系。通过计量分析,研究发现基于抖音在线人数计算的估计值与描述失业者寻找工作行为的百度搜索指数呈现正相关关系,可以借助短视频观看时长变动说明失业者寻找工作行为的变化。基于抖音在线人数估计的劳动参与率与16—24岁城镇调查失业率存在显著负相关关系,两者的皮尔逊相关系数为-0.862,因此估计的劳动参与率可以作为城镇调查失业率,特别是16—24岁劳动力调查失业率的补充。 展开更多
关键词 就业 失业率 大数据 短视频 抖音
下载PDF
航空辐射数据热力图与视频融合方法
11
作者 杨金政 张文峰 +2 位作者 安政伟 刘学 刘林峰 《世界核地质科学》 CAS 2024年第5期1040-1048,共9页
随着核能技术的广泛应用,核安全与应急监测的重要性日益凸显,核应急航空监测成为国家核应急体系的重要组成部分。在核应急航空监测过程中,获取并分析航空辐射数据与视频数据对于快速监测辐射状况、圈定辐射污染区域具有重要意义。其中,... 随着核能技术的广泛应用,核安全与应急监测的重要性日益凸显,核应急航空监测成为国家核应急体系的重要组成部分。在核应急航空监测过程中,获取并分析航空辐射数据与视频数据对于快速监测辐射状况、圈定辐射污染区域具有重要意义。其中,航空监测视频数据可以全面直观地获取目标区域地面影像,并大范围展现目标区域地面场景实态,结合放射性异常时刻的视频帧可快速精确地分析异常的成因,具有高时效性的特点。尽管视频融合技术在城市安全、交通管控等领域得到广泛应用,但在核应急航空监测领域中,视频融合技术研究应用较少,存在航空辐射数据与视频之间关联性较差的问题。传统方法中,应急人员需要手动搜寻视频关键帧,分析地表地貌特征,效率较低。设计并实现将航空辐射数据以热力图的形式与视频进行融合的程序,总体上遵循“视频-图像-视频”变换过程。通过有效提取视频关键帧图像及关注数据信息,建立数值与色度映射关系,分析飞行方向,绘制热力图,并将热力图与视频帧融合显示,实现航空辐射数据热力图与视频的融合。这种融合方法的应用可以帮助技术人员更快速地识别和分析辐射热点及污染区域,为制定应急措施提供有力支持。因此,该研究对于提高航空辐射数据与视频数据的关联度,实现快速分析核事故影响具有重要意义。此外,所采用的方法也可应用到铀矿勘查、辐射环境航空调查等工作中。 展开更多
关键词 核应急航空监测 航空辐射数据 视频 融合 热力图
下载PDF
基于电力载波与云计算的视频监控系统设计
12
作者 罗启平 《科技创新与应用》 2024年第31期126-129,共4页
针对目前视频监控系统不断升级改造存在的布线工作量大、布线成本高、无法存储及计算海量视频文件的问题,该文提出基于电力载波技术和云计算框架的视频监控系统。视频数据经过调制解调、振荡、放大、耦合等过程,利用电力线网络传输到各... 针对目前视频监控系统不断升级改造存在的布线工作量大、布线成本高、无法存储及计算海量视频文件的问题,该文提出基于电力载波技术和云计算框架的视频监控系统。视频数据经过调制解调、振荡、放大、耦合等过程,利用电力线网络传输到各分控点,通过互联网传送到云端,利用云端强大的存储、计算能对视频数据进行各种处理。设计中硬件部分的ARM11微处理器主要负责视频数据的采集、编码并封包传送到PLC调制解调模块,完成数据收发控制和系统任务调度。软件部分主要包括开发系统搭建、USB摄像头驱动、视频数据采集、LCD实时显示、网络传输和双核数据处理等。该文的设计充分结合电力载波和云计算的特点,在视频监控领域中具有较高的应用价值。 展开更多
关键词 电力载波 云计算 视频监控 数据收发控制 系统任务调度
下载PDF
中国石油井下作业数字化监督发展问题与对策 被引量:2
13
作者 张绍辉 黄伟和 +5 位作者 毕国强 张晓辉 张建利 文虎成 李海军 田涛 《石油工业技术监督》 2024年第5期1-7,共7页
阐述了中国石油井下作业数字化监督在监督人员数字化管理、施工队伍数字化监督、施工作业数字化监督、井场环境数字化监督4个方面的发展现状。分析了井下作业数字化监督发展面临数字化监督管理制度不健全、监督数据标准不统一、数字化... 阐述了中国石油井下作业数字化监督在监督人员数字化管理、施工队伍数字化监督、施工作业数字化监督、井场环境数字化监督4个方面的发展现状。分析了井下作业数字化监督发展面临数字化监督管理制度不健全、监督数据标准不统一、数字化智能监督技术不完善、缺少统一的数字化智能监督平台、监督数据价值挖掘不深入5个方面的主要问题。提出了制定井下作业数字化监督管理制度、建立井下作业监督数据标准、攻关数字化智能监督关键技术、研发井下作业数字化智能监督平台、提升监督大数据分析应用水平5项数字化监督发展对策。展望了井下作业数字化监督建设发展后在提升监督人员履职效率、降低监督人力资源成本、提升工程质量、保障施工安全与环保4个方面的预期成效。 展开更多
关键词 井下作业 数字化监督 数据标准 修井机 视频监控 监督平台
下载PDF
基于视频AI的互联网数据中心机房智慧管理技术研究
14
作者 宗凌 《电信工程技术与标准化》 2024年第S01期369-374,共6页
人工智能正在重塑电信市场的竞争环境和盈利模式,电信运营商急需战略转型,通过优化价值链争取竞争优势以赶上数字经济发展的战略机遇。本文使用文献研究法和案例研究法研究了中国电信的互联网数据中心使用视频人工智能进行智慧化管理的... 人工智能正在重塑电信市场的竞争环境和盈利模式,电信运营商急需战略转型,通过优化价值链争取竞争优势以赶上数字经济发展的战略机遇。本文使用文献研究法和案例研究法研究了中国电信的互联网数据中心使用视频人工智能进行智慧化管理的情况,发现虚拟价值链逐步超越传统价值链,得出对人工智能技术的恰当运用有助于电信企业长远发展的结论,可供电信运营商在智能数字时代发展相关业务时参考。 展开更多
关键词 视频AI 互联网数据中心 机房管理
下载PDF
视频侦查技术关键及其发展展望 被引量:1
15
作者 赵秀萍 《辽宁警察学院学报》 2024年第2期79-87,共9页
视频侦查技术的核心是通过视频图像的提取、查看、分析和研判来获取侦查线索、固定涉案证据。多年来经过在实战应用中不断地发展创新,视频侦查技术形成了自己独特的关键技术体系:视频信息分析解读技术是侦查应用和证据固定的基础和核心... 视频侦查技术的核心是通过视频图像的提取、查看、分析和研判来获取侦查线索、固定涉案证据。多年来经过在实战应用中不断地发展创新,视频侦查技术形成了自己独特的关键技术体系:视频信息分析解读技术是侦查应用和证据固定的基础和核心;视频证据固定保全技术的规范是审判中心主义的客观要求,可以获取视频侦查记录报告、视频检验鉴定报告或视频数据关联报告;低质量视频图像的增强恢复技术专业性强,应用范围窄,技术成熟度高,然而它不断面临新的挑战。目前,视频数据的智能应用在大数据背景下变得越来越重要,仍需进一步突破视频自动识别技术的应用范畴,建立完善多层次的视频数据综合应用体系,打造适应不同业务需要的视频数据实战应用模型。 展开更多
关键词 视频侦查技术 视频解析 证据固定 图像处理 数据智能
下载PDF
涉Sora等文生视频危害行为的刑法规制
16
作者 刘宪权 余越洋 《华东师范大学学报(哲学社会科学版)》 CSSCI 北大核心 2024年第5期89-102,172,共15页
Sora等文生视频的应用将对刑法的法益理论产生重大影响。应将文生视频生成内容纳入作品范围。文生视频是其生成内容的作者但尚不是权利主体,应将相关著作财产权转移给使用人。涉文生视频强制猥亵、侮辱罪所侵害法益应当包括精神损害的... Sora等文生视频的应用将对刑法的法益理论产生重大影响。应将文生视频生成内容纳入作品范围。文生视频是其生成内容的作者但尚不是权利主体,应将相关著作财产权转移给使用人。涉文生视频强制猥亵、侮辱罪所侵害法益应当包括精神损害的内容。文生视频的应用改变了视频创作行业的社会关系,进而会改变相关犯罪行为法益侵害性程度的判断标准。涉文生视频犯罪主要涉及侵犯人身权利类犯罪、侵犯财产权利类犯罪以及寻衅滋事犯罪。司法解释对涉文生视频犯罪相关罪名构成要件的规定应当既有突破也有限制。我国刑法应设立数据安全责任事故罪以追究文生视频研发者、生产者的过失罪过下的刑事责任。应及时完善生成式人工智能产品应用的前置法规定。司法实践中应将文生视频的智能化程度区分为信息的整理、信息的基本加工、信息的深度加工三类,以此合理分配使用者、研发者与生产者所应当承担的涉文生视频犯罪的刑事责任。 展开更多
关键词 文生视频(Sora) 文生视频 生成内容 法益 数据安全责任事故罪
下载PDF
基于监控视频数据的城市轨道交通车站闸机设施处行人停滞行为谱特征研究
17
作者 石琦 方勇 +2 位作者 胡华 刘志钢 魏万旭 《城市轨道交通研究》 北大核心 2024年第3期150-154,159,共6页
[目的]为科学设置闸机,改善其实际通行能力,以及减少行人异常停滞行为带来的拥挤风险,需对城市轨道交通车站闸机设施处行人停滞行为谱特征进行研究。[方法]利用视频轨迹追踪技术,采集城市轨道交通车站闸机设施处监控视频中的行人运动参... [目的]为科学设置闸机,改善其实际通行能力,以及减少行人异常停滞行为带来的拥挤风险,需对城市轨道交通车站闸机设施处行人停滞行为谱特征进行研究。[方法]利用视频轨迹追踪技术,采集城市轨道交通车站闸机设施处监控视频中的行人运动参数;确定行人停滞行为表征指标,从闸机类型、行人流及个体停滞行为三类要素构建了城市轨道交通车站闸机设施处行人停滞行为谱,并采用四分位差法确定行人停滞行为谱表征指标上下限阈值;对行人停滞行为起点至闸翼/闸杆中心的纵向距离和横向距离,以及行人停滞范围和停滞时间等4个表征指标的差异性进行了分析,并全面解析了不同闸机类型和不同行人流率下的行人停滞行为谱特征。[结果及结论]行人易在闸机设施正前方停滞,行人停滞行为起点至闸翼/闸杆中心的纵向距离远大于其横向距离;当行人流率为>5~10人次/min时,行人停滞范围波动性最大;随着行人流量的增加,行人可用空间、停滞范围及停滞时间均减小。 展开更多
关键词 城市轨道交通 车站 闸机设施 行人停滞行为谱特征 监控视频数据
下载PDF
车路协同感知技术研究进展及展望 被引量:2
18
作者 伊笑莹 芮一康 +2 位作者 冉斌 罗开杰 孙虎成 《中国工程科学》 CSCD 北大核心 2024年第1期178-189,共12页
近年来,我国自动驾驶研究逐步从聚焦于单车智能技术向车路协同技术转变,为智能交通产业发展带来了重大机遇;我国在车路协同感知领域的研究虽处于起步阶段,但注重技术推动,未来发展前景广阔。本文致力于深入探讨车路协同感知技术的发展动... 近年来,我国自动驾驶研究逐步从聚焦于单车智能技术向车路协同技术转变,为智能交通产业发展带来了重大机遇;我国在车路协同感知领域的研究虽处于起步阶段,但注重技术推动,未来发展前景广阔。本文致力于深入探讨车路协同感知技术的发展动态,梳理了车路协同感知基础支撑技术的特性和发展现状,厘清了车路协同感知技术的研究进展,探讨了其技术发展趋势,并针对推动车路协同感知技术发展提出了一系列建议。研究表明,车路协同感知技术正朝着多源数据融合方向发展,主要集中在纯视觉协同感知技术优化、激光雷达点云处理技术升级、多传感器时空信息匹配与数据融合技术发展以及车路协同感知技术标准体系构建等方面。为进一步促进我国车路协同自动驾驶产业的迅速成长,研究建议,加大对多模态车路协同感知技术的研发投入、深化行业间的合作、制定统一的感知数据处理技术标准并加速技术应用普及,以期推动我国在全球自动驾驶竞争中赢得主动,推动自动驾驶行业稳定持续发展。 展开更多
关键词 自动驾驶 车路协同感知 多源数据 激光雷达 视频摄像机 标准体系
下载PDF
决策树码率自适应算法的无数据蒸馏框架 被引量:1
19
作者 黄天驰 李朝阳 +2 位作者 张睿霄 李文哲 孙立峰 《计算机学报》 EI CAS CSCD 北大核心 2024年第1期113-130,共18页
码率自适应(Adaptive Bit-Rate,ABR)算法是流媒体视频传输中至关重要的技术.该算法根据当前网络情况和播放状态等因素,为下一个视频块选择合适的码率,以确保用户获得良好的体验质量(QoE).其中,基于学习的ABR算法因其不依赖传统模型和从... 码率自适应(Adaptive Bit-Rate,ABR)算法是流媒体视频传输中至关重要的技术.该算法根据当前网络情况和播放状态等因素,为下一个视频块选择合适的码率,以确保用户获得良好的体验质量(QoE).其中,基于学习的ABR算法因其不依赖传统模型和从头学习策略的特点,表现出良好的性能,并逐渐取代需要繁琐调优的启发式ABR算法,成为研究领域的热点.然而,这些算法使用神经网络推理,导致模型参数较多,整体计算量较大,使得在实际场景中难以部署.因此,以往的研究提出了决策树蒸馏方案,即使用轻量级的决策树来提取基于学习的ABR算法的专家策略,并在线上部署这些决策树.然而,本文的实验结果表明,过去的蒸馏框架忽略了训练环境对蒸馏后策略的影响,导致策略的泛化能力较差.因此,本文提出了一种名为NIA(data-free Network-environmental Imitationbased rate Adaptation framework)的新型无数据蒸馏框架,用于生成具有更好泛化性能的决策树A BR算法.NIA通过网络环境生成模块构建多个人工网络环境,并在每次迭代训练前使用环境选择模块来选择适合的网络场景,然后与该场景进行交互,利用基于学生驱动的模仿学习算法完成决策树的蒸馏过程.本文还设计了完整的评测平台测试NIA的性能.实验表明,NIA在各种带宽数据集上展现出良好的QoE性能和泛化性能:(1)相较于启发式算法,在QoE指标上提升了1%~46%;(2)与以往的决策树蒸馏方案相比,在低带宽场景下表现相当,但在高带宽场景下提升了近1倍;(3)总体性能接近甚至超过基于学习的算法(即专家策略)的表现. 展开更多
关键词 流媒体 码率自适应算法 无数据蒸馏
下载PDF
基于多模态知识主动学习的视频问答方案
20
作者 刘明阳 王若梅 +1 位作者 周凡 林格 《计算机研究与发展》 EI CSCD 北大核心 2024年第4期889-902,共14页
视频问答是人工智能领域的一个热点研究问题.现有方法在特征提取方面缺乏针对视觉目标运动细节的获取,从而会导致错误因果关系的建立.此外,在数据融合与推理过程中,现有方法缺乏有效的主动学习能力,难以获取特征提取之外的先验知识,影... 视频问答是人工智能领域的一个热点研究问题.现有方法在特征提取方面缺乏针对视觉目标运动细节的获取,从而会导致错误因果关系的建立.此外,在数据融合与推理过程中,现有方法缺乏有效的主动学习能力,难以获取特征提取之外的先验知识,影响了模型对多模态内容的深度理解.针对这些问题,首先,设计了一种显性多模态特征提取模块,通过获取图像序列中视觉目标的语义关联以及与周围环境的动态关系来建立每个视觉目标的运动轨迹.进一步通过动态内容对静态内容的补充,为数据融合与推理提供了更加精准的视频特征表达.其次,提出了知识自增强多模态数据融合与推理模型,实现了多模态信息理解的自我完善和逻辑思维聚焦,增强了对多模态特征的深度理解,减少了对先验知识的依赖.最后,提出了一种基于多模态知识主动学习的视频问答方案.实验结果表明,该方案的性能优于现有最先进的视频问答算法,大量的消融和可视化实验也验证了方案的合理性. 展开更多
关键词 视频问答 数据融合与推理 多模态主动学习 视频细节描述提取 深度学习
下载PDF
上一页 1 2 64 下一页 到第
使用帮助 返回顶部