期刊文献+
共找到10,956篇文章
< 1 2 250 >
每页显示 20 50 100
Research on the Audio Publishing of Classical Books Based on the Theory of Business Model Canvas: Taking Romance of the Three Kingdoms in Digital Audio Platforms as an Example
1
作者 Ding Qin Liu Mengzhi 《Contemporary Social Sciences》 2024年第1期58-74,共17页
Visual media have dominated sensory communications for decades,and the resulting“visual hegemony”leads to the call for the“auditory return”in order to achieve a holistic balance in cultural acceptance.Romance of t... Visual media have dominated sensory communications for decades,and the resulting“visual hegemony”leads to the call for the“auditory return”in order to achieve a holistic balance in cultural acceptance.Romance of the Three Kingdoms,a classic literary work in China,has received significant attention and promotion from leading audio platforms.However,the commercialization of digital audio publishing faces unprecedented challenges due to the mismatch between the dissemination of long-form content on digital audio platforms and the current trend of short and fast information reception.Drawing on the Business Model Canvas Theory and taking Romance of the Three Kingdoms as the main focus of analysis,this paper argues that the construction of a business model for the audio publishing of classical books should start from three aspects:the user evaluation of digital audio platforms,the establishment of value propositions based on the“creative transformation and innovative development”principle,and the improvement of the audio publishing infrastructure to ensure the healthy operation and development of the digital audio platforms and consequently improve their current state of development and expand the boundaries of cultural heritage. 展开更多
关键词 Romance of the Three Kingdoms audio publishing Business Model Canvas digital audio platforms
下载PDF
Audio2AB:Audio-driven collaborative generation of virtual character animation
2
作者 Lichao NIU Wenjun XIE +2 位作者 Dong WANG Zhongrui CAO Xiaoping LIU 《虚拟现实与智能硬件(中英文)》 EI 2024年第1期56-70,共15页
Background Considerable research has been conducted in the areas of audio-driven virtual character gestures and facial animation with some degree of success.However,few methods exist for generating full-body animation... Background Considerable research has been conducted in the areas of audio-driven virtual character gestures and facial animation with some degree of success.However,few methods exist for generating full-body animations,and the portability of virtual character gestures and facial animations has not received sufficient attention.Methods Therefore,we propose a deep-learning-based audio-to-animation-and-blendshape(Audio2AB)network that generates gesture animations and ARK it's 52 facial expression parameter blendshape weights based on audio,audio-corresponding text,emotion labels,and semantic relevance labels to generate parametric data for full-body animations.This parameterization method can be used to drive full-body animations of virtual characters and improve their portability.In the experiment,we first downsampled the gesture and facial data to achieve the same temporal resolution for the input,output,and facial data.The Audio2AB network then encoded the audio,audio-corresponding text,emotion labels,and semantic relevance labels,and then fused the text,emotion labels,and semantic relevance labels into the audio to obtain better audio features.Finally,we established links between the body,gestures,and facial decoders and generated the corresponding animation sequences through our proposed GAN-GF loss function.Results By using audio,audio-corresponding text,and emotional and semantic relevance labels as input,the trained Audio2AB network could generate gesture animation data containing blendshape weights.Therefore,different 3D virtual character animations could be created through parameterization.Conclusions The experimental results showed that the proposed method could generate significant gestures and facial animations. 展开更多
关键词 audio-driven Virtual character Full-body animation audio2AB Blendshape GAN-GF
下载PDF
Automatic recognition of depression based on audio and video:A review
3
作者 Meng-Meng Han Xing-Yun Li +4 位作者 Xin-Yu Yi Yun-Shao Zheng Wei-Li Xia Ya-Fei Liu Qing-Xiang Wang 《World Journal of Psychiatry》 SCIE 2024年第2期225-233,共9页
Depression is a common mental health disorder.With current depression detection methods,specialized physicians often engage in conversations and physiological examinations based on standardized scales as auxiliary mea... Depression is a common mental health disorder.With current depression detection methods,specialized physicians often engage in conversations and physiological examinations based on standardized scales as auxiliary measures for depression assessment.Non-biological markers-typically classified as verbal or non-verbal and deemed crucial evaluation criteria for depression-have not been effectively utilized.Specialized physicians usually require extensive training and experience to capture changes in these features.Advancements in deep learning technology have provided technical support for capturing non-biological markers.Several researchers have proposed automatic depression estimation(ADE)systems based on sounds and videos to assist physicians in capturing these features and conducting depression screening.This article summarizes commonly used public datasets and recent research on audio-and video-based ADE based on three perspectives:Datasets,deficiencies in existing research,and future development directions. 展开更多
关键词 Depression recognition Deep learning Automatic depression estimation System audio processing Image processing Feature fusion Future development
下载PDF
美观与好声音并不相撞 访问Pylon Audio亚洲销售代表Antoine Montana先生
4
作者 阿毕(图/文) 《视听前线》 2024年第1期82-83,共2页
在2023.11月的“广州国际音响唱片冬季特展”上,洪陆科技旗下代理的FEZZ斐驰、Pylon湃隆两大波兰品牌的功放和音箱产品到场展示,吸引了不少发烧友的关注。展会期间,FEZZ斐驰和Pylon湃隆的亚洲销售代表AntoineMontana先生也亲临现场,本... 在2023.11月的“广州国际音响唱片冬季特展”上,洪陆科技旗下代理的FEZZ斐驰、Pylon湃隆两大波兰品牌的功放和音箱产品到场展示,吸引了不少发烧友的关注。展会期间,FEZZ斐驰和Pylon湃隆的亚洲销售代表AntoineMontana先生也亲临现场,本刊记者也和他进行了详细的访谈,特别就最近颇受国内发烧友欢迎的PylonAudio进行访谈,深入了解Pylon Audio品牌的产品特色以及新品动向。 展开更多
关键词 audio 音箱 功放 销售
下载PDF
两大技术共冶一炉的新派发烧典范 Kudos Audio Titan系列
5
《视听前线》 2024年第3期74-76,共3页
英国Kudos Audio Titan系列的出现,可算是将等压式(Isobaric)与低音反射式(Bass Reflex)箱体优点共冶一炉的发烧级扬声器,既能为用家缔造快如闪电的敏捷瞬变,还具备得天独厚的大能量输出,优秀的下潜幅度远远超越了体积相若同类型产品所... 英国Kudos Audio Titan系列的出现,可算是将等压式(Isobaric)与低音反射式(Bass Reflex)箱体优点共冶一炉的发烧级扬声器,既能为用家缔造快如闪电的敏捷瞬变,还具备得天独厚的大能量输出,优秀的下潜幅度远远超越了体积相若同类型产品所制定的参考基准。 展开更多
关键词 audio 扬声器 参考基准 能量输出 KU
下载PDF
Cambridge Audio AXN10
6
《视听前线》 2024年第5期14-15,共2页
品牌1968年在英国剑桥,一批才华横溢的年轻毕业生开拓了以高科技研发和制作原型器材的业务,由此Cambridge Audio应运而生。成立当年便推出全球首款采用环形变压器的P40功放,呈现最自然的音乐重播。让不少录音室和制造商,也开始以“尽可... 品牌1968年在英国剑桥,一批才华横溢的年轻毕业生开拓了以高科技研发和制作原型器材的业务,由此Cambridge Audio应运而生。成立当年便推出全球首款采用环形变压器的P40功放,呈现最自然的音乐重播。让不少录音室和制造商,也开始以“尽可能自然地捕捉和重播音乐”为标准,研发相关设备和系统。 展开更多
关键词 audio 环形变压器 录音室 重播 功放
下载PDF
旧瓶装新酒?非也!这是一台全新设计的合并功放Gryphon Audio贵丰Diablo 333合并式功放
7
作者 阿毕(文/图) 《视听前线》 2024年第3期12-16,共5页
说起贵丰Gryphon,相信很多发烧友必定会想起它那让无数发烧友为之倾慕的“大菠萝”Diablo合并功放。笔者对“大菠萝”印象中最深刻的一次是在十多年前,试听KEF成立50周年时推出的纪念版音箱LS50.
关键词 KEF audio 纪念版 音箱 全新设计 合并功放 试听
下载PDF
复古中坚持现代要素 访Fyne Audio总经理Andrzej Sosna先生
8
作者 家祺 《视听前线》 2024年第2期81-82,共2页
FyneAudio成立于2017年,至今不到7年时间,已推出了超过10个各具特色、不同定位的音箱系列,展现了他们深厚的技术底蕴和强大的研发能力。在2023广州国际音响唱片展冬季特展上,小编得到了与FyneAudio总经理AndrzejSosna先生访谈的机会,了... FyneAudio成立于2017年,至今不到7年时间,已推出了超过10个各具特色、不同定位的音箱系列,展现了他们深厚的技术底蕴和强大的研发能力。在2023广州国际音响唱片展冬季特展上,小编得到了与FyneAudio总经理AndrzejSosna先生访谈的机会,了解FyneAudio最近的动向。 展开更多
关键词 audio 研发能力 音箱 音响
下载PDF
极其震撼的视觉和听觉体验--LINN&Vivid Audio旗舰新品发布会回顾
9
《视听前线》 2024年第4期62-63,共2页
3月16日(周六)上午,泽森音响在广州东方宾馆8楼东方厅举行了LINN&VividAudio新品发布会,LINN CEO Gilad Tiefenbrun和VIVID Audio总设计师LaurenceDickie亲临现场,为到场媒体和发烧友介绍了此次发布会的两款重磅产品:LINNKlimax Sol... 3月16日(周六)上午,泽森音响在广州东方宾馆8楼东方厅举行了LINN&VividAudio新品发布会,LINN CEO Gilad Tiefenbrun和VIVID Audio总设计师LaurenceDickie亲临现场,为到场媒体和发烧友介绍了此次发布会的两款重磅产品:LINNKlimax Solo800旗舰单声道后级、Vivid Audio全新旗舰音箱MoyaM1。 展开更多
关键词 audio 新品发布会 广州东方宾馆 听觉体验 音箱 LIN
下载PDF
Naim Audio 名 ND5 XS 2
10
《视听前线》 2024年第5期20-21,共2页
品牌屹立接近半个世纪的Naim Audio,成立于1973年,由Julian Vereker和Shirley Clarke联合创立,属于英国其中一所最老字号的音响品牌,其产品以浓厚的音乐味和强烈的感染力著称,除了传统合并式及前后级功放之外,NaimAudio炮制的串流播放... 品牌屹立接近半个世纪的Naim Audio,成立于1973年,由Julian Vereker和Shirley Clarke联合创立,属于英国其中一所最老字号的音响品牌,其产品以浓厚的音乐味和强烈的感染力著称,除了传统合并式及前后级功放之外,NaimAudio炮制的串流播放机及一体化产品,在世界各地同样深受用家欢迎。 展开更多
关键词 合并式 播放机 audio 串流 音响品牌 ND5 一体化产品 老字号
下载PDF
Audio Vivid标准关键技术研究及系统试验 被引量:1
11
作者 周芸 庞超 +1 位作者 王喆 郭晓强 《广播与电视技术》 2023年第7期35-42,共8页
本文在对三维声行业标准《三维声编解码及渲染》(Audio Vivid)深入研究的基础上,分析三维声编解码和渲染端到端技术框架,介绍基于神经网络的通用码率音频编码、元数据编码、扬声器渲染和双耳渲染等关键技术,给出卡塔尔世界杯期间Audio V... 本文在对三维声行业标准《三维声编解码及渲染》(Audio Vivid)深入研究的基础上,分析三维声编解码和渲染端到端技术框架,介绍基于神经网络的通用码率音频编码、元数据编码、扬声器渲染和双耳渲染等关键技术,给出卡塔尔世界杯期间Audio Vivid标准端到端技术试验情况,为Audio Vivid标准应用部署提供技术参考。 展开更多
关键词 audio Vivid 三维声 编解码 渲染 HOA空间编码 基于神经网络的音频编码
下载PDF
面向BLE Audio的改进DSDV路由算法
12
作者 陆晗 夏玮玮 何光栎 《移动通信》 2023年第10期71-77,92,共8页
在需要频繁传输音频的大规模蓝牙网络中,传统的DSDV路由算法会引入极高的时延和功耗。为了实现低功耗低时延的蓝牙网络音频传输,将BLE Audio的同步传输链路引入DSDV路由算法中,并结合根据节点状态自适应改变广播周期的机制,提出了一种面... 在需要频繁传输音频的大规模蓝牙网络中,传统的DSDV路由算法会引入极高的时延和功耗。为了实现低功耗低时延的蓝牙网络音频传输,将BLE Audio的同步传输链路引入DSDV路由算法中,并结合根据节点状态自适应改变广播周期的机制,提出了一种面向BLE Audio的改进DSDV路由算法。并且在BLE Audio硬件平台开发和实现了提出的路由算法。实际测试结果表明,该算法不仅能够降低节点功耗,同时能够极大降低音频数据包的端到端时延。 展开更多
关键词 低功耗蓝牙 BLE audio DSDV路由 同步传输
下载PDF
Cover Enhancement Method for Audio Steganography Based on Universal Adversarial Perturbations with Sample Diversification
13
作者 Jiangchuan Li Peisong He +2 位作者 Jiayong Liu Jie Luo Qiang Xia 《Computers, Materials & Continua》 SCIE EI 2023年第6期4893-4915,共23页
Steganography techniques,such as audio steganography,have been widely used in covert communication.However,the deep neural network,especially the convolutional neural network(CNN),has greatly threatened the security o... Steganography techniques,such as audio steganography,have been widely used in covert communication.However,the deep neural network,especially the convolutional neural network(CNN),has greatly threatened the security of audio steganography.Besides,existing adversarial attacks-based countermeasures cannot provide general perturbation,and the trans-ferability against unknown steganography detection methods is weak.This paper proposes a cover enhancement method for audio steganography based on universal adversarial perturbations with sample diversification to address these issues.Universal adversarial perturbation is constructed by iteratively optimizing adversarial perturbation,which applies adversarial attack tech-niques,such as Deepfool.Moreover,the sample diversification strategy is designed to improve the transferability of adversarial perturbations in black-box attack scenarios,where two types of common audio-processing operations are considered,including noise addition and moving picture experts group audio layer III(MP3)compression.Furthermore,the perturbation ensemble method is applied to further improve the attacks’transferability by integrating perturbations of different detection networks with heterogeneous architec-tures.Consequently,the single universal adversarial perturbation can enhance different cover audios against a CNN-based detection network.Extensive experiments have been conducted,and the results demonstrate that the average missed-detection probabilities of the proposed method are higher than those of the state-of-the-art methods by 7.3%and 16.6%for known and unknown detection networks,respectively.It verifies the efficiency and transferability of the proposed methods for the cover enhancement of audio steganography. 展开更多
关键词 audio steganography cover enhancement adversarial perturbations sample diversification
下载PDF
On‐device audio‐visual multi‐person wake word spotting
14
作者 Yidi Li Guoquan Wang +2 位作者 Zhan Chen Hao Tang Hong Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第4期1578-1589,共12页
Audio‐visual wake word spotting is a challenging multi‐modal task that exploits visual information of lip motion patterns to supplement acoustic speech to improve overall detection performance.However,most audio‐vi... Audio‐visual wake word spotting is a challenging multi‐modal task that exploits visual information of lip motion patterns to supplement acoustic speech to improve overall detection performance.However,most audio‐visual wake word spotting models are only suitable for simple single‐speaker scenarios and require high computational complexity.Further development is hindered by complex multi‐person scenarios and computational limitations in mobile environments.In this paper,a novel audio‐visual model is proposed for on‐device multi‐person wake word spotting.Firstly,an attention‐based audio‐visual voice activity detection module is presented,which generates an attention score matrix of audio and visual representations to derive active speaker representation.Secondly,the knowledge distillation method is introduced to transfer knowledge from the large model to the on‐device model to control the size of our model.Moreover,a new audio‐visual dataset,PKU‐KWS,is collected for sentence‐level multi‐person wake word spotting.Experimental results on the PKU‐KWS dataset show that this approach outperforms the previous state‐of‐the‐art methods. 展开更多
关键词 audio‐visual fusion human‐computer interfacing speech processing
下载PDF
Determined Reverberant Blind Source Separation of Audio Mixing Signals
15
作者 Senquan Yang Fan Ding +2 位作者 Jianjun Liu Pu Li Songxi Hu 《Intelligent Automation & Soft Computing》 SCIE 2023年第6期3309-3323,共15页
Audio signal separation is an open and challenging issue in the classical“Cocktail Party Problem”.Especially in a reverberation environment,the separation of mixed signals is more difficult separated due to the infl... Audio signal separation is an open and challenging issue in the classical“Cocktail Party Problem”.Especially in a reverberation environment,the separation of mixed signals is more difficult separated due to the influence of reverberation and echo.To solve the problem,we propose a determined reverberant blind source separation algorithm.The main innovation of the algorithm focuses on the estimation of the mixing matrix.A new cost function is built to obtain the accurate demixing matrix,which shows the gap between the prediction and the actual data.Then,the update rule of the demixing matrix is derived using Newton gradient descent method.The identity matrix is employed as the initial demixing matrix for avoiding local optima problem.Through the real-time iterative update of the demixing matrix,frequency-domain sources are obtained.Then,time-domain sources can be obtained using an inverse short-time Fourier transform.Experi-mental results based on a series of source separation of speech and music mixing signals demonstrate that the proposed algorithm achieves better separation performance than the state-of-the-art methods.In particular,it has much better superiority in the highly reverberant environment. 展开更多
关键词 Determined mixtures reverberant environment audio signal separation cocktail party problem
下载PDF
Design and Simulation of an Audio Signal Alerting and Automatic Control System
16
作者 Winfred Adjardjah John Awuah Addor +1 位作者 Wisdom Opare Isaac Mensah Ayipeh 《Communications and Network》 2023年第4期98-119,共22页
A large part of our daily lives is spent with audio information. Massive obstacles are frequently presented by the colossal amounts of acoustic information and the incredibly quick processing times. This results in th... A large part of our daily lives is spent with audio information. Massive obstacles are frequently presented by the colossal amounts of acoustic information and the incredibly quick processing times. This results in the need for applications and methodologies that are capable of automatically analyzing these contents. These technologies can be applied in automatic contentanalysis and emergency response systems. Breaks in manual communication usually occur in emergencies leading to accidents and equipment damage. The audio signal does a good job by sending a signal underground, which warrants action from an emergency management team at the surface. This paper, therefore, seeks to design and simulate an audio signal alerting and automatic control system using Unity Pro XL to substitute manual communication of emergencies and manual control of equipment. Sound data were trained using the neural network technique of machine learning. The metrics used are Fast Fourier transform magnitude, zero crossing rate, root mean square, and percentage error. Sounds were detected with an error of approximately 17%;thus, the system can detect sounds with an accuracy of 83%. With more data training, the system can detect sounds with minimal or no error. The paper, therefore, has critical policy implications about communication, safety, and health for underground mine. 展开更多
关键词 Emergency Response Emergency Management Team audio Signal Alerting Automatic Control System Uni Pro XL Manual Communication Fast Fourier Transform Magnitude Zero Crossing Rate Root Means Square
下载PDF
Audio Description for Educational Videos on COVID-19 Response:A Corpus-Based Study on Linguistic and Textual Idiosyncrasies
17
作者 XIONG Ling-song 《Journal of Literature and Art Studies》 2023年第4期276-285,共10页
Audio description(AD),unlike interlingual translation and interpretation,is subject to unique constraints as a spoken text.Facilitated by AD,educational videos on COVID-19 anti-virus measures are made accessible to th... Audio description(AD),unlike interlingual translation and interpretation,is subject to unique constraints as a spoken text.Facilitated by AD,educational videos on COVID-19 anti-virus measures are made accessible to the visually disadvantaged.In this study,a corpus of AD of COVID-19 educational videos is developed,named“Audio Description Corpus of COVID-19 Educational Videos”(ADCCEV).Drawing on the model of Textual and Linguistic Audio Description Matrix(TLADM),this paper aims to identify the linguistic and textual idiosyncrasies of AD themed on COVID-19 response released by the New Zealand Government.This study finds that linguistically,the AD script uses a mix of complete sentences and phrases,the majority being in Present Simple tense.Present participles and the“with”structure are used for brevity.Vocabulary is diverse,with simpler words for animated explainers.Third-person pronouns are common in educational videos.Color words are a salient feature of AD,where“yellow”denotes urgency,and“red”indicates importance,negativity,and hostility.On textual idiosyncrasies,coherence is achieved through intermodal components that align with the video’s mood and style.AD style varies depending on the video’s purpose,from informative to narrative or expressive. 展开更多
关键词 audio Description COVID-19 educational videos corpus-based study
下载PDF
BLE Audio技术及其在电视互动应用中的实时音频通信
18
作者 徐乐研 《电声技术》 2023年第4期142-146,共5页
随着智能电视的普及和电视观看体验的不断提升,用户对于更方便、更自由的音频传输方式的需求日益增加。蓝牙低功耗音频(Bluetooth Low Energy Audio,BLE Audio)技术是一种基于蓝牙低功耗(Bluetooth Low Energy,BLE)标准的无线音频传输技... 随着智能电视的普及和电视观看体验的不断提升,用户对于更方便、更自由的音频传输方式的需求日益增加。蓝牙低功耗音频(Bluetooth Low Energy Audio,BLE Audio)技术是一种基于蓝牙低功耗(Bluetooth Low Energy,BLE)标准的无线音频传输技术,由BLE技术升级而来,为音频设备提供高质量的无线音频传输,同时具有低功耗、低延迟和简单的设备连接特性。BLE Audio技术适用于各种应用场景,包括耳机、音箱、智能家居系统以及可穿戴设备等。基于此,详细分析BLE、BLE Audio技术以及BLE Audio技术在电视互动场景中的应用。 展开更多
关键词 蓝牙低功耗(BLE) BLE audio 无线音频传输
下载PDF
AudioTuning GMBH总裁Heinrich Lichtenegger先生谈Pro-Ject.Tone Factory和Musical Fidelity
19
作者 小路(文/图) 《视听前线》 2023年第6期79-82,共4页
今年3月,Cinemaster影音大师成为英国Musical Fidelity(音乐传真)、奥地利Tone Factory的中港澳独家总代理,加上此前代理的奥地利Pro-Ject(宝碟),目前影音大师代理Audio Tuning GMBH旗下的品牌达到三个。
关键词 audio GM 代理 影音
下载PDF
小投资大改善AudioQuest ThunderBird及Dragon Jumpers喇叭跳线
20
《视听前线》 2023年第2期68-68,共1页
以高质素喇叭跳线Jumpers,取代原本扬声器沿用的Bi-wire及Tri-wire的传导金属薄片,能直接提升音效,已属无庸置疑的事实。美国线圣AudioQuest全新推出雷鸟ThunderBird及龙Dragon Jumpers喇叭跳线,更可进一步改善现有系统两极伸延及动态表... 以高质素喇叭跳线Jumpers,取代原本扬声器沿用的Bi-wire及Tri-wire的传导金属薄片,能直接提升音效,已属无庸置疑的事实。美国线圣AudioQuest全新推出雷鸟ThunderBird及龙Dragon Jumpers喇叭跳线,更可进一步改善现有系统两极伸延及动态表现,将音场密度提升至更高层面。 展开更多
关键词 高质素 音场 audio 金属薄片 扬声器 跳线 QUEST
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部