期刊文献+
共找到95篇文章
< 1 2 5 >
每页显示 20 50 100
AV-FDTI:Audio-visual fusion for drone threat identification
1
作者 Yizhuo Yang Shenghai Yuan +5 位作者 Jianfei Yang Thien Hoang Nguyen Muqing Cao Thien-Minh Nguyen Han Wang Lihua Xie 《Journal of Automation and Intelligence》 2024年第3期144-151,共8页
In response to the evolving challenges posed by small unmanned aerial vehicles(UAVs),which have the potential to transport harmful payloads or cause significant damage,we present AV-FDTI,an innovative Audio-Visual Fus... In response to the evolving challenges posed by small unmanned aerial vehicles(UAVs),which have the potential to transport harmful payloads or cause significant damage,we present AV-FDTI,an innovative Audio-Visual Fusion system designed for Drone Threat Identification.AV-FDTI leverages the fusion of audio and omnidirectional camera feature inputs,providing a comprehensive solution to enhance the precision and resilience of drone classification and 3D localization.Specifically,AV-FDTI employs a CRNN network to capture vital temporal dynamics within the audio domain and utilizes a pretrained ResNet50 model for image feature extraction.Furthermore,we adopt a visual information entropy and cross-attention-based mechanism to enhance the fusion of visual and audio data.Notably,our system is trained based on automated Leica tracking annotations,offering accurate ground truth data with millimeter-level accuracy.Comprehensive comparative evaluations demonstrate the superiority of our solution over the existing systems.In our commitment to advancing this field,we will release this work as open-source code and wearable AV-FDTI design,contributing valuable resources to the research community. 展开更多
关键词 audio-visual fusion Anti-UAV Multi-modal fusion Classification 3D localization Self-attention
下载PDF
Integrating Audio-Visual Features and Text Information for Story Segmentation of News Video 被引量:1
2
作者 Liu Hua-yong, Zhou Dong-ru School of Computer,Wuhan University,Wuhan 430072, Hubei, China 《Wuhan University Journal of Natural Sciences》 CAS 2003年第04A期1070-1074,共5页
Video data are composed of multimodal information streams including visual, auditory and textual streams, so an approach of story segmentation for news video using multimodal analysis is described in this paper. The p... Video data are composed of multimodal information streams including visual, auditory and textual streams, so an approach of story segmentation for news video using multimodal analysis is described in this paper. The proposed approach detects the topic-caption frames, and integrates them with silence clips detection results, as well as shot segmentation results to locate the news story boundaries. The integration of audio-visual features and text information overcomes the weakness of the approach using only image analysis techniques. On test data with 135 400 frames, when the boundaries between news stories are detected, the accuracy rate 85.8% and the recall rate 97.5% are obtained. The experimental results show the approach is valid and robust. 展开更多
关键词 news video story segmentation audio-visual features analysis text detection
下载PDF
A Review on Audio-visual Translation Studies
3
作者 李瑶 《语言与文化研究》 2008年第1期146-150,共5页
This paper is dedicated to a thorough review on the audio-visual related translations from both home and abroad.In reviewing the foreign achievements on this specific field of translation studies it can shed some ligh... This paper is dedicated to a thorough review on the audio-visual related translations from both home and abroad.In reviewing the foreign achievements on this specific field of translation studies it can shed some lights on our national audio-visual practice and research.The review on the Chinese scholars’ audio-visual translation studies is to offer the potential developing direction and guidelines to the studies and aspects neglected as well.Based on the summary of relevant studies,possible topics for further studies are proposed. 展开更多
关键词 audio-visual TRANSLATION SUBTITLING DUBBING
下载PDF
Audio-visual emotion recognition with multilayer boosted HMM
4
作者 吕坤 贾云得 张欣 《Journal of Beijing Institute of Technology》 EI CAS 2013年第1期89-93,共5页
Emotion recognition has become an important task of modern human-computer interac- tion. A multilayer boosted HMM ( MBHMM ) classifier for automatic audio-visual emotion recognition is presented in this paper. A mod... Emotion recognition has become an important task of modern human-computer interac- tion. A multilayer boosted HMM ( MBHMM ) classifier for automatic audio-visual emotion recognition is presented in this paper. A modified Baum-Welch algorithm is proposed for component HMM learn- ing and adaptive boosting (AdaBoost) is used to train ensemble classifiers for different layers (cues). Except for the first layer, the initial weights of training samples in current layer are decided by recognition results of the ensemble classifier in the upper layer. Thus the training procedure using current cue can focus more on the difficult samples according to the previous cue. Our MBHMM clas- sifier is combined by these ensemble classifiers and takes advantage of the complementary informa- tion from multiple cues and modalities. Experimental results on audio-visual emotion data collected in Wizard of Oz scenarios and labeled under two types of emotion category sets demonstrate that our approach is effective and promising. 展开更多
关键词 emotion recognition audio-visual fusion Baum-Welch algorithm multilayer boostedHMM Wizard of Oz scenario
下载PDF
The Audio-Visual Performance Highlighted Craze in Chicago During Chinese New Year
5
《China & The World Cultural Exchange》 2019年第2期38-39,共2页
February 10 (US Central Time), 2019, China National Peking Opera Company (CNPOC) and the Hubei Chime Bells National Chinese Orchestra presented a fantastic audio-visual performance of Chinese Peking Opera and Chinese ... February 10 (US Central Time), 2019, China National Peking Opera Company (CNPOC) and the Hubei Chime Bells National Chinese Orchestra presented a fantastic audio-visual performance of Chinese Peking Opera and Chinese chime bells for the American audience at the world s top-level Buntrock Hall at Symphony Center. 展开更多
关键词 audio-visual PERFORMANCE Chicago CHINESE New YEAR
下载PDF
Research on National Identity Based on National Audio-Visual Works: Taking Inner Mongolia as an Example
6
作者 LIU Haitao ZHANG Pei 《Cultural and Religious Studies》 2021年第8期391-396,共6页
Mongolian audio-visual works are an important carrier of exploring the true significance to this national culture.This paper believes that the Mongolian people in Inner Mongolia constantly enhance the individual sense... Mongolian audio-visual works are an important carrier of exploring the true significance to this national culture.This paper believes that the Mongolian people in Inner Mongolia constantly enhance the individual sense of identity to the overall ethnic group through the influence of film and television and music,and on this basis constantly evolve a new culture in line with modern and contemporary life to further enhance their sense of belonging to the ethnic nation. 展开更多
关键词 MONGOLIAN audio-visual works national identity
下载PDF
Application of Task-based Teaching Method to College Audio-visual English Teaching
7
作者 Liguo Shi 《International Journal of Technology Management》 2015年第9期65-67,共3页
Based on the current situation of college audio-visual English teaching in China, this article points out that the avoidance in class is a serious phenomenon in the process of college audio-visual English teaching. Af... Based on the current situation of college audio-visual English teaching in China, this article points out that the avoidance in class is a serious phenomenon in the process of college audio-visual English teaching. After further analysis and combination with the characteristics of college English audio-visual teaching in China, it puts forward the application of task-based teaching method to college audio-visual English teaching and its steps, attempting to alleviate the avoidance phenomenon in students through task-based teaching method. 展开更多
关键词 task-based teaching method college English audio-visual English teaching
下载PDF
Prioritized MPEG-4 Audio-Visual Objects Streaming over the DiffServ
8
作者 黄天云 郑婵 《Journal of Electronic Science and Technology of China》 2005年第4期314-320,共7页
The object-based scalable coding in MPEG-4 is investigated, and a prioritized transmission scheme of MPEG-4 audio-visual objects (AVOs) over the DiffServ network with the QoS guarantee is proposed. MPEG-4 AVOs are e... The object-based scalable coding in MPEG-4 is investigated, and a prioritized transmission scheme of MPEG-4 audio-visual objects (AVOs) over the DiffServ network with the QoS guarantee is proposed. MPEG-4 AVOs are extracted and classified into different groups according to their priority values and scalable layers (visual importance). These priority values are mapped to the 1P DiffServ per hop behaviors (PHB). This scheme can selectively discard packets with low importance, in order to avoid the network congestion. Simulation results show that the quality of received video can gracefully adapt to network state, as compared with the ‘best-effort' manner. Also, by allowing the content provider to define prioritization of each audio-visual object, the adaptive transmission of object-based scalable video can be customized based on the content. 展开更多
关键词 video streaming quality of service (QoS) MPEG-4 audio-visual objects (AVOs) DIFFSERV PRIORITIZATION
下载PDF
Integrating Zhuang Culture Into College English Audio-Visual Speaking Course:A Multicultural Perspective
9
作者 LUO Mei CHEN Yingzhu 《Cultural and Religious Studies》 2024年第12期801-805,共5页
Zhuang culture,a representative of the native ethnic culture of Guangxi,China,is of great significance to Chinese culture.In order to promote traditional culture,enrich the teaching content of College English Audio-Vi... Zhuang culture,a representative of the native ethnic culture of Guangxi,China,is of great significance to Chinese culture.In order to promote traditional culture,enrich the teaching content of College English Audio-Visual Speaking Course,and enhance the intercultural communication ability of college students,this paper,from a multicultural perspective,explores the classroom practices of integrating indigenous Zhuang cultural elements in College English Audio-Visual Speaking Course,providing new perspectives and reference for multicultural education in foreign languages. 展开更多
关键词 Zhuang culture College English audio-visual Speaking Course classroom practice multicultural perspective
下载PDF
基于ⅢF A/V规范和Avalon系统的大学图书馆视听数据库建设研究
10
作者 张毅 熊泽泉 +1 位作者 胡晓明 陈丹 《图书馆杂志》 CSSCI 北大核心 2024年第1期50-58,49,共10页
随着中国网络基础设施的不断改善,视听媒体在年轻一代中非常流行,给以文本资源为主的图书馆带来了挑战。本研究旨在探究国内外大学图书馆视听资源数据库建设的现状,借鉴ⅢF规范在图像资源管理方面的成功经验和各种视听保存社区的实践,... 随着中国网络基础设施的不断改善,视听媒体在年轻一代中非常流行,给以文本资源为主的图书馆带来了挑战。本研究旨在探究国内外大学图书馆视听资源数据库建设的现状,借鉴ⅢF规范在图像资源管理方面的成功经验和各种视听保存社区的实践,提出基于ⅢF A/V规范与开源软件的中国大学图书馆视听资源管理方法。通过分析华东师范大学图书馆在视听资源保存、流媒体发布、时间轴气泡注释、转录、视听结构化和开放共享方面的实践,进行实证研究。 展开更多
关键词 视听数据库 ⅢF A/V Avalon媒体系统 视听可视化
下载PDF
Self-supervised Learning for Speech Emotion Recognition Task Using Audio-visual Features and Distil Hubert Model on BAVED and RAVDESS Databases
11
作者 Karim Dabbabi Abdelkarim Mars 《Journal of Systems Science and Systems Engineering》 SCIE EI CSCD 2024年第5期576-606,共31页
Existing pre-trained models like Distil HuBERT excel at uncovering hidden patterns and facilitating accurate recognition across diverse data types, such as audio and visual information. We harnessed this capability to... Existing pre-trained models like Distil HuBERT excel at uncovering hidden patterns and facilitating accurate recognition across diverse data types, such as audio and visual information. We harnessed this capability to develop a deep learning model that utilizes Distil HuBERT for jointly learning these combined features in speech emotion recognition (SER). Our experiments highlight its distinct advantages: it significantly outperforms Wav2vec 2.0 in both offline and real-time accuracy on RAVDESS and BAVED datasets. Although slightly trailing HuBERT’s offline accuracy, Distil HuBERT shines with comparable performance at a fraction of the model size, making it an ideal choice for resource-constrained environments like mobile devices. This smaller size does come with a slight trade-off: Distil HuBERT achieved notable accuracy in offline evaluation, with 96.33% on the BAVED database and 87.01% on the RAVDESS database. In real-time evaluation, the accuracy decreased to 79.3% on the BAVED database and 77.87% on the RAVDESS database. This decrease is likely a result of the challenges associated with real-time processing, including latency and noise, but still demonstrates strong performance in practical scenarios. Therefore, Distil HuBERT emerges as a compelling choice for SER, especially when prioritizing accuracy over real-time processing. Its compact size further enhances its potential for resource-limited settings, making it a versatile tool for a wide range of applications. 展开更多
关键词 Wav2vec 2.0 Distil HuBERT HuBERT SER audio and audio-visual features
原文传递
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art
12
作者 Mengting Liu Ying Zhou +1 位作者 Yuwei Wu Feng Gao 《Machine Intelligence Research》 EI CSCD 2024年第1期4-28,共25页
In recent years,computing art has developed rapidly with the in-depth cross study of artificial intelligence generated con-tent(AIGC)and the main features of artworks.Audio-visual content generation has gradually been... In recent years,computing art has developed rapidly with the in-depth cross study of artificial intelligence generated con-tent(AIGC)and the main features of artworks.Audio-visual content generation has gradually been applied to various practical tasks,including video or game score,assisting artists in creation,art education and other aspects,which demonstrates a broad application pro-spect.In this paper,we introduce innovative achievements in audio-visual content generation from the perspective of visual art genera-tion and auditory art generation based on artificial intelligence(Al).We outline the development tendency of image and music datasets,visual and auditory content modelling,and related automatic generation systems.The objective and subjective evaluation of generated samples plays an important role in the measurement of algorithm performance.We provide a cogeneration mechanism of audio-visual content in multimodal tasks from image to music and display the construction of specific stylized datasets.There are still many new op-portunities and challenges in the field of audio-visual synesthesia generation,and we provide a comprehensive discussion on them. 展开更多
关键词 Artificial intelligence(AI)art audio-visual artificial intelligence generated content(AIGC) MULTIMODAL artistic evalu-ation
原文传递
云技术支持下视听新媒体监测监管系统的设计与实现
13
作者 徐松寅 《电视技术》 2024年第5期17-22,共6页
阐述云技术与视听新媒体检测监管的基本概念,分析当前视听新媒体监测监管面临的挑战。针对性提出基于云技术的视听新媒体监测监管系统设计方案,平台整体架构包含采集层、资源池、数据仓库、分析处理层和用户交互层。通过分析关键技术验... 阐述云技术与视听新媒体检测监管的基本概念,分析当前视听新媒体监测监管面临的挑战。针对性提出基于云技术的视听新媒体监测监管系统设计方案,平台整体架构包含采集层、资源池、数据仓库、分析处理层和用户交互层。通过分析关键技术验证了系统的可行性。 展开更多
关键词 云技术 视听新媒体 监测监管系统
下载PDF
体育赛事视听信息设权路径反思与法益体系保护—基于《体育法》第52条第2款的法教义学解读
14
作者 罗祥 《上海体育大学学报》 CSSCI 北大核心 2024年第10期83-96,共14页
《体育法》新增第52条第2款是否有意创设权利来保护体育赛事视听信息,学界对此尚存争议。现有设权论据之所以存在缺陷,表面上是因为设权逻辑不周延,实际上是形式上的设权期待与实质上薄弱的设权基础存在差距所致。《体育法》的基本法属... 《体育法》新增第52条第2款是否有意创设权利来保护体育赛事视听信息,学界对此尚存争议。现有设权论据之所以存在缺陷,表面上是因为设权逻辑不周延,实际上是形式上的设权期待与实质上薄弱的设权基础存在差距所致。《体育法》的基本法属性以及体系构造很难为体育赛事视听信息权预留充足的设权空间,相比之下,《体育法》确认体育赛事视听信息法益保护更具优势。结合《体育法》对体育赛事视听信息有关行为、主体和客体的态度,能够推定《体育法》第52条第2款的立法意图是确认法益保护而非创设权利。明确以《体育法》为基础法的规范统筹地位,构建体系协调的法律适用位阶,贯彻价值多元下实质正义的领域法规范体系,能够为体育赛事视听信息法益保护提供科学的指引,也为未来《体育法》的完善指明了方向。 展开更多
关键词 体育赛事 视听信息权 体育法 法益 领域法 体系
下载PDF
森林作业与规划动态仿真实验室建设 被引量:10
15
作者 巫志龙 周成军 +3 位作者 周新年 张正雄 沈嵘枫 李纲 《实验技术与管理》 CAS 北大核心 2013年第8期217-220,共4页
为培养大学生的创新意识、创新思维和创新精神,提高整体教学质量,针对我国南方林区特点,结合森林工程学科教学与科研特色,新建了森林作业与规划动态仿真实验室,包括电控动态仿真实物模型和配套影音教学系统,提升传统的实验教学内容和实... 为培养大学生的创新意识、创新思维和创新精神,提高整体教学质量,针对我国南方林区特点,结合森林工程学科教学与科研特色,新建了森林作业与规划动态仿真实验室,包括电控动态仿真实物模型和配套影音教学系统,提升传统的实验教学内容和实验技术方法。教学实践结果表明,有利于培养学生具备较高的现代化生态文明素质、雄厚的专业基础、宽广的专业知识面和较强的获取信息能力、实践能力和创新能力等,取得很好效果。同时,也可作为创新实验平台,为学生课外创新研究提供良好条件。 展开更多
关键词 森林作业与规划 动态仿真 影音系统 实验室建设
下载PDF
“工程索道”创新人才培养实践教学体系构建 被引量:2
16
作者 巫志龙 周成军 +3 位作者 周新年 张正雄 郑丽凤 沈嵘枫 《实验科学与技术》 2016年第5期178-182,189,共6页
为深化改革实践教学,提高实践教学质量和培养学生创新能力,通过师资培养、平台搭建和管理创新等,构建了"创新实践教学团队+创新实践实验室平台+创新实践成果平台+创新实践管理机制"的工程索道创新实践教学体系,并研制配套实... 为深化改革实践教学,提高实践教学质量和培养学生创新能力,通过师资培养、平台搭建和管理创新等,构建了"创新实践教学团队+创新实践实验室平台+创新实践成果平台+创新实践管理机制"的工程索道创新实践教学体系,并研制配套实践教学影音系统。实践表明,学生创新能力及解决实际工程问题能力显著增强,学生实践技能和创新能力培养方面特色鲜明。 展开更多
关键词 创新能力 实践教学体系 影音系统 工程索道
下载PDF
智能视频监控系统关键技术及算法研究 被引量:12
17
作者 李小斌 吴宏岐 +1 位作者 袁战军 王瑾 《控制工程》 CSCD 北大核心 2016年第S1期18-22,共5页
提出了一种基于PC机的智能视频监控系统设计方案。将基于直流力矩电动机的三环伺服系统应用于云台控制,有效解决了对高速运动目标的精确实时跟踪问题。系统采用声源定位技术辅助实现对监控目标的快速定位,利用相邻三帧差分法实现了运动... 提出了一种基于PC机的智能视频监控系统设计方案。将基于直流力矩电动机的三环伺服系统应用于云台控制,有效解决了对高速运动目标的精确实时跟踪问题。系统采用声源定位技术辅助实现对监控目标的快速定位,利用相邻三帧差分法实现了运动目标的提取,并且给出了运动目标的形心计算方法。系统根据运动目标的位置信息,实时控制云台摄像头姿态,使被监控目标始终处于监控画面中央。在MATLAB环境下,对系统各模块进行仿真,结果表明,系统能够实现对高速运动目标的精确跟踪。 展开更多
关键词 音视频融合 云台 声源定位 伺服系统 视频监控
下载PDF
利用多媒体实验室提高学生外语视听说能力 被引量:3
18
作者 黄岚 朱珏 《实验室研究与探索》 CAS 2004年第3期6-8,共3页
如何充分利用多媒体设备和网络教学系统提高学生的听说能力和影视鉴赏评论能力一直是我们搞视听说教学工作者所关心和探索的课题。本文根据笔者实际教学中多媒体设备和网络的应用,着重从日常生活对话,新闻听力训练和影视片鉴赏三个方面... 如何充分利用多媒体设备和网络教学系统提高学生的听说能力和影视鉴赏评论能力一直是我们搞视听说教学工作者所关心和探索的课题。本文根据笔者实际教学中多媒体设备和网络的应用,着重从日常生活对话,新闻听力训练和影视片鉴赏三个方面探讨了多媒体语言实验室如何导入外语视听说教学中才能充分利用学生的能动性、积极性、创造性,从而提高学生的英语视听说能力。 展开更多
关键词 英语教学 多媒体实验室 学生培养 视听说能力 学习兴趣 网络教学
下载PDF
国内外音像资源管理系统的发展与应用调查分析 被引量:1
19
作者 栾芳芳 蓝晓东 +1 位作者 郭南 韩全惜 《情报理论与实践》 CSSCI 北大核心 2008年第6期912-916,共5页
本文介绍了音像资源管理系统的基本概念、功能和发展。通过系统的调查研究,分析了音像资源管理系统的发展现状及应用情况,并对部分国内外具有代表性的音像资源管理的系统功能和关键技术进行比较分析,为利用音像资源管理系统技术构建高... 本文介绍了音像资源管理系统的基本概念、功能和发展。通过系统的调查研究,分析了音像资源管理系统的发展现状及应用情况,并对部分国内外具有代表性的音像资源管理的系统功能和关键技术进行比较分析,为利用音像资源管理系统技术构建高校及教育音像中心的音像资源管理系统提供有益的参考。 展开更多
关键词 音像资源 管理系统 发展趋势
下载PDF
基于二维条码技术的高校电教设备管理系统 被引量:12
20
作者 黄燕芬 陈雪娟 《安阳师范学院学报》 2011年第2期24-27,共4页
针对高校电教设备管理的特点和使用情况,提出电教设备管理系统的构建设想,通过唯一性标识对电教设备进行全程有效管理。选择二维条码作为电教设备信息的载体,将其运用于电教设备管理系统中,以达到学校电教设备管理信息化的目的。
关键词 二维条码 电教设备 管理系统
下载PDF
上一页 1 2 5 下一页 到第
使用帮助 返回顶部