期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
On‐device audio‐visual multi‐person wake word spotting
1
作者 Yidi Li Guoquan Wang +2 位作者 Zhan Chen Hao Tang Hong Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第4期1578-1589,共12页
Audio‐visual wake word spotting is a challenging multi‐modal task that exploits visual information of lip motion patterns to supplement acoustic speech to improve overall detection performance.However,most audio‐vi... Audio‐visual wake word spotting is a challenging multi‐modal task that exploits visual information of lip motion patterns to supplement acoustic speech to improve overall detection performance.However,most audio‐visual wake word spotting models are only suitable for simple single‐speaker scenarios and require high computational complexity.Further development is hindered by complex multi‐person scenarios and computational limitations in mobile environments.In this paper,a novel audio‐visual model is proposed for on‐device multi‐person wake word spotting.Firstly,an attention‐based audio‐visual voice activity detection module is presented,which generates an attention score matrix of audio and visual representations to derive active speaker representation.Secondly,the knowledge distillation method is introduced to transfer knowledge from the large model to the on‐device model to control the size of our model.Moreover,a new audio‐visual dataset,PKU‐KWS,is collected for sentence‐level multi‐person wake word spotting.Experimental results on the PKU‐KWS dataset show that this approach outperforms the previous state‐of‐the‐art methods. 展开更多
关键词 audio‐visual fusion human‐computer interfacing speech processing
下载PDF
The Effect of Audio Visual Entrainment on Pre-Attentive Dysfunctional Processing to Stressful Events in Anxious Individuals
2
作者 Guadalupe Villarreal Trevino Ernesto Octavio Lopez Ramirez +2 位作者 Guadalupe Elizabeth Morales Martinez Claudia Castro Campos Maria Elena Urdiales Ibarra 《Open Journal of Medical Psychology》 2014年第5期364-372,共9页
Experimental single case studies on automatic processing of emotion were carried on a sample of people with an anxiety disorder. Participants were required to take three Audio Visual Entrainment (AVE) sessions to test... Experimental single case studies on automatic processing of emotion were carried on a sample of people with an anxiety disorder. Participants were required to take three Audio Visual Entrainment (AVE) sessions to test for anxiety reduction as proclaimed by some academic research. Explicit reports were measured as well as pre-attentive bias to stressing information by using affective priming studies before and after AVE intervention. Group analysis shows that indeed AVEs program applications do reduce anxiety producing significant changes over explicit reports on anxiety levels and automatic processing bias of emotion. However, case by case analysis of six anxious participants shows that even when all of the participants report emotional improvement after intervention, not all of them reduce or eliminate dysfunctional bias to stressing information. Rather, they show a variety of processing styles due to intervention and some of them show no change at all. Implications of this differential effect to clinical sets are discussed. 展开更多
关键词 audio Visual Entrainment Anxiety Disorders Affective Priming Single Case Experimental Study
下载PDF
Video Games Localization into Arabic:Gamers’Reactions to Localizing PUBG and Free Fire
3
作者 Shatha Jarrah Saleh Al-Salman Ahmad S Haider 《Journal of Social Computing》 EI 2023年第1期74-93,共20页
The Middle East and North Africa(MENA)region has an active gaming community,with Arab gamers being reliant on games produced in Europe,America,and Japan due to the lack of significant game production companies in the ... The Middle East and North Africa(MENA)region has an active gaming community,with Arab gamers being reliant on games produced in Europe,America,and Japan due to the lack of significant game production companies in the MENA region.This study explores the gamers’reactions to the localization process of two video games,namely PUBG and Free Fire.For data collection purposes,a five-point Likert scale questionnaire that consisted of 18 items and six constructs,namely need for subtitled games,technical aspects,language issues,language preference,attitudes to game localization,and future actions and recommendations,was designed to elicit the reactions of 112 participants.Upon analyzing the responses,the findings showed that the better the technical aspects and language issues of the games’performance,the more positive participants’attitudes to game localization.The study recommends that further research could be conducted on the localization of video games with different themes into Arabic. 展开更多
关键词 LOCALIZATION video games PUBG Free Fire audio Visual Translation(AVT)
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部