期刊文献+
共找到6篇文章
< 1 >
每页显示 20 50 100
Automatic recognition of depression based on audio and video:A review
1
作者 Meng-Meng Han Xing-Yun Li +4 位作者 Xin-Yu Yi Yun-Shao Zheng Wei-Li Xia Ya-Fei Liu Qing-Xiang Wang 《World Journal of Psychiatry》 SCIE 2024年第2期225-233,共9页
Depression is a common mental health disorder.With current depression detection methods,specialized physicians often engage in conversations and physiological examinations based on standardized scales as auxiliary mea... Depression is a common mental health disorder.With current depression detection methods,specialized physicians often engage in conversations and physiological examinations based on standardized scales as auxiliary measures for depression assessment.Non-biological markers-typically classified as verbal or non-verbal and deemed crucial evaluation criteria for depression-have not been effectively utilized.Specialized physicians usually require extensive training and experience to capture changes in these features.Advancements in deep learning technology have provided technical support for capturing non-biological markers.Several researchers have proposed automatic depression estimation(ADE)systems based on sounds and videos to assist physicians in capturing these features and conducting depression screening.This article summarizes commonly used public datasets and recent research on audio-and video-based ADE based on three perspectives:Datasets,deficiencies in existing research,and future development directions. 展开更多
关键词 Depression recognition Deep learning Automatic depression estimation System audio processing Image processing Feature fusion Future development
下载PDF
Filter algorithm based on cochlear mechanics and neuron filter mechanism and application on enhancement of audio signals 被引量:1
2
作者 GAO Wa KAN Yue ZHA Fu-sheng 《Journal of Central South University》 SCIE EI CAS CSCD 2021年第6期1813-1828,共16页
A filter algorithm based on cochlear mechanics and neuron filter mechanism is proposed from the view point of vibration.It helps to solve the problem that the non-linear amplification is rarely considered in studying ... A filter algorithm based on cochlear mechanics and neuron filter mechanism is proposed from the view point of vibration.It helps to solve the problem that the non-linear amplification is rarely considered in studying the auditory filters.A cochlear mechanical transduction model is built to illustrate the audio signals processing procedure in cochlea,and then the neuron filter mechanism is modeled to indirectly obtain the outputs with the cochlear properties of frequency tuning and non-linear amplification.The mathematic description of the proposed algorithm is derived by the two models.The parameter space,the parameter selection rules and the error correction of the proposed algorithm are discussed.The unit impulse responses in the time domain and the frequency domain are simulated and compared to probe into the characteristics of the proposed algorithm.Then a 24-channel filter bank is built based on the proposed algorithm and applied to the enhancements of the audio signals.The experiments and comparisons verify that,the proposed algorithm can effectively divide the audio signals into different frequencies,significantly enhance the high frequency parts,and provide positive impacts on the performance of speech enhancement in different noise environments,especially for the babble noise and the volvo noise. 展开更多
关键词 COCHLEA neuron filter audio signal processing speech enhancement
下载PDF
Audio Mixing Inversion via Embodied Self-supervised Learning
3
作者 Haotian Zhou Feng Yu Xihong Wu 《Machine Intelligence Research》 EI CSCD 2024年第1期55-62,共8页
Audio mixing is a crucial part of music production.For analyzing or recreating audio mixing,it is of great importance to conduct research on estimating mixing parameters used to create mixdowns from music recordings,i... Audio mixing is a crucial part of music production.For analyzing or recreating audio mixing,it is of great importance to conduct research on estimating mixing parameters used to create mixdowns from music recordings,i.e.,audio mixing inversion.However,approaches of audio mixing inversion are rarely explored.A method of estimating mixing parameters from raw tracks and a stereo mixdown via embodied self-supervised learning is presented.In this work,several commonly used audio effects including gain,pan,equalization,reverb,and compression,are taken into consideration.This method is able to learn an inference neural network that takes a stereo mixdown and the raw audio sources as input and estimate mixing parameters used to create the mixdown by iteratively sampling and training.During the sampling step,the inference network predicts a set of mixing parameters,which is sampled and fed to an audio-processing framework to generate audio data for the training step.During the training step,the same network used in the sampling step is optimized with the sampled data generated from the sampling step.This method is able to explicitly model the mixing process in an interpretable way instead of using a black-box neural network model.A set of objective measures are used for evaluation.The experimental results show that this method has better performance than current state-of-the-art methods. 展开更多
关键词 audio mixing inversion intelligent audio mixing self-supervised learning audio signal processing deep learning
原文传递
BD512模拟延迟线的原理及其应用
4
作者 董小伍 《微电子学》 CAS 1988年第5期33-37,共5页
我所研制的BD512是一种新型集成电路,系集成戽链器件,它在音频处理方面有着广泛的用途。本文就BD512电路的原理和应用作了叙述。
关键词 Analog device Delay line Bucket brigade device audio processing
下载PDF
Identical-video retrieval using the low-peak feature of a video's audio information 被引量:2
5
作者 Myoung-beom CHUNG Il-ju KO 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2010年第3期151-159,共9页
The recognition and retrieval of identical videos by combing through entire video files requires a great deal of time and memory space. Therefore, most current video-matching methods analyze only a part of each video&... The recognition and retrieval of identical videos by combing through entire video files requires a great deal of time and memory space. Therefore, most current video-matching methods analyze only a part of each video's image frame information. All these methods, however, share the critical problem of erroneously categorizing identical videos as different if they have merely been altered in resolution or converted with a different codec. This paper deals instead with an identical-video-retrieval method using the low-peak feature of audio data. The low-peak feature remains relatively stable even with changes in bit-rate or codec. The proposed method showed a search success rate of 93.7% in a video matching experiment. This approach could provide a technique for recognizing identical content on video file share sites. 展开更多
关键词 Video retrieval Video DNA audio signal processing audio feature extraction
原文传递
An algorithm that minimizes audio fingerprints using the difference of Gaussians 被引量:1
6
作者 MyoungBeom CHUNG IlJu KO 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2011年第10期836-845,共10页
Recently,many audio search sites headed by Google have used audio fingerprinting technology to search for the same audio and protect the music copyright using one part of the audio data.However,if there are fingerprin... Recently,many audio search sites headed by Google have used audio fingerprinting technology to search for the same audio and protect the music copyright using one part of the audio data.However,if there are fingerprints per audio file,then the amount of query data for the audio search increases.In this paper,we propose a novel method that can reduce the number of fingerprints while providing a level of performance similar to that of existing methods.The proposed method uses the difference of Gaussians which is often used in feature extraction during image signal processing.In the experiment,we use the proposed method and dynamic time warping and undertake an experimental search for the same audio with a success rate of 90%.The proposed method,therefore,can be used for an effective audio search. 展开更多
关键词 audio retrieval audio fingerprint audio signal processing Difference of Gaussians (DOG)
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部