期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
Novel DTD and VAD assisted voice detection algorithm for VoIP systems
1
作者 Ming Meng Wang Ke Ji Hong 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2016年第4期9-16,76,共9页
Echo cancellation plays an important role in current Internet protocol(IP) based voice interactive systems. Voice state detection is an essential part in echo cancellation. It mainly comprises two parts: double tal... Echo cancellation plays an important role in current Internet protocol(IP) based voice interactive systems. Voice state detection is an essential part in echo cancellation. It mainly comprises two parts: double talk detection(DTD) and voice activity detection(VAD). DTD is used to detect doubletalk and prevent filter divergence in the presence of near-end speech, and VAD is used to determine the near-end voice activity and output silence indicator when near-end is silent. However, DTD straightforwardly proceeded may mistakenly declare double talk under double silent condition, coefficients update under the far-end silence condition may lead to filter divergence, and current VAD algorithms may misjudge the residual echo from the near end to be far-end voice. Therefore, a voice detection algorithm combining DTD and far-end VAD is proposed. DTD is implemented when VAD declares far-end speech, filtering and coefficients update will be halted when VAD declares far-end silence, and the far-end VAD adopted is multi-feature VAD based on short-time energy and correlation. The new algorithm can improve the accuracy of DTD, prevent filter divergence, and exclude the circumstance that far-end signal only contains residual echo from near end. Actual test results show that the voice state decision of the new algorithm is accurate, and the performance of echo cancellation is improved. 展开更多
关键词 echo cancellation double talk detection(DTD) voice activity detection(VAD) adaptive filter
原文传递
How to Improve the Voice Quality of VoIP
2
作者 林志洁 《四川大学学报(工程科学版)》 EI CAS CSCD 北大核心 2007年第S1期178-181,共4页
VoIP(voice over intemet protocol) has made great progress in Communication area in recent years.But a biggest pity is that the voice quality of VOIP can't satisfy users as traditional phone via PSTN does.In this p... VoIP(voice over intemet protocol) has made great progress in Communication area in recent years.But a biggest pity is that the voice quality of VOIP can't satisfy users as traditional phone via PSTN does.In this pa- per,the author analysis the reason and bring out some methods to improve the voice quality of VOIP that are utili- ring the bandwidth effectively to reduce delay;minishing the jitter to reduce packet lose and bit error;eliminating the echo.As the emphases,the author pointed out the specialty ... 展开更多
关键词 VOIP voice detection RSVP echo elimination
下载PDF
Audio-visual voice activity detection 被引量:1
3
作者 LIU Peng WANG Zuo-ying 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2006年第4期425-430,共6页
In speech signal processing systems,frame-energy based voice activity detection(VAD)method may be interfered with the background noise and non-stationary characteristic of the frame-energy in voice segment.The purpose... In speech signal processing systems,frame-energy based voice activity detection(VAD)method may be interfered with the background noise and non-stationary characteristic of the frame-energy in voice segment.The purpose of this paper is to improve the performance and robustness of VAD by introducing visual information.Meanwhile,data-driven linear transformation is adopted in visual feature extraction,and a general statistical VAD model is designed.Using the general model and a two-stage fusion strategy presented in this paper,a concrete multimodal VAD system is built.Experiments show that a 55.0%relative reduction in frame error rate and a 98.5%relative reduction in sentence-breaking error rate are obtained when using multimodal VAD,compared to frame-energy based audio VAD.The results show that using multimodal method,sentence-breaking errors are almost avoided,and frame-detection performance is clearly improved,which proves the effectiveness of the visual modal in VAD. 展开更多
关键词 speech recognition voice activity detection MULTIMODAL
原文传递
Fast Echo Canceller in IP Telephony Gateway
4
作者 黄永峰 李星 《Journal of Beijing Institute of Technology》 EI CAS 2003年第1期109-112,共4页
The length of the echo path in the IP telephony system is very long. Generally, the echo canceller is implemented on the IP telephony gateway which needs to perform concurrently multi-channel echo cancellation and voi... The length of the echo path in the IP telephony system is very long. Generally, the echo canceller is implemented on the IP telephony gateway which needs to perform concurrently multi-channel echo cancellation and voice compression. Hence, the most key technique to design the echo canceller is to reduce greatly the computational requirement. For this reason a number of innovative features to implement a fast echo canceller are presented. The key components of this canceller include: the separation of adaptive and cancel filters, non-real-time adaptation and real-time cancellation, sharing VAD algorithms with the speech codec, the incorporation of delay indexing with zero coefficients, and windowing the adaptive filter coefficients to reduce the cost of DSP during the cancellation. Finally, the performance of the echo canceller is summarized; the results of evaluation show that the performance gains for echo cancellation are significant. 展开更多
关键词 echo cancellation voice activity detection adaptive filter
下载PDF
Enhancing Parkinson's disease severity assessment through voice-based wavelet scattering,optimized model selection,and weighted majority voting
5
作者 Farhad Abedinzadeh Torghabeh Seyyed Abed Hosseini Elham Ahmadi Moghadam 《Medicine in Novel Technology and Devices》 2023年第4期51-63,共13页
Parkinson's disease(PD)is a neurodegenerative disorder characterized by motor and non-motor symptoms that significantly impact an individual's quality of life.Voice changes have shown promise as early indicato... Parkinson's disease(PD)is a neurodegenerative disorder characterized by motor and non-motor symptoms that significantly impact an individual's quality of life.Voice changes have shown promise as early indicators of PD,making voice analysis a valuable tool for early detection and intervention.This study aims to assess and detect the severity of PD through voice analysis using the mobile device voice recordings dataset.The dataset consisted of recordings from PD patients at different stages of the disease and healthy control subjects.A novel approach was employed,incorporating a voice activity detection algorithm for speech segmentation and the wavelet scattering transform for feature extraction.A Bayesian optimization technique is used to fine-tune the hyperparameters of seven commonly used classifiers and optimize the performance of machine learning classifiers for PD severity detection.AdaBoost and K-nearest neighbor consistently demonstrated superior performance across various evaluation metrics among the classifiers.Furthermore,a weighted majority voting(WMV)technique is implemented,leveraging the predictions of multiple models to achieve a near-perfect accuracy of 98.62%,improving classification accuracy.The results highlight the promising potential of voice analysis in PD diagnosis and monitoring.Integrating advanced signal processing techniques and machine learning models provides reliable and accessible tools for PD assessment,facilitating early intervention and improving patient outcomes.This study contributes to the field by demonstrating the effectiveness of the proposed methodology and the significant role of WMV in enhancing classification accuracy for PD severity detection. 展开更多
关键词 Parkinson's disease Speech impairment voice activity detection Model selection Bayesian optimization Weighted majority voting
原文传递
Real-Time Implementation of an Efficient Speech Enhancement Algorithm for Digital Hearing Aids
6
作者 高杰 张辉 胡广书 《Tsinghua Science and Technology》 SCIE EI CAS 2006年第4期475-480,共6页
In order to remove background noise and improve the quality of speech for digital hearing aids, a single-channel speech enhancement algorithm is proposed. The algorithm is implemented and assessed on a digital hearing... In order to remove background noise and improve the quality of speech for digital hearing aids, a single-channel speech enhancement algorithm is proposed. The algorithm is implemented and assessed on a digital hearing aid platform based on the TI DSP TMS320VC5502 chip. Assuming that background noise is stationary or varies slowly, an energy-based voice activity detection algorithm is adopted by adaptively tracking the minima and maxima of the power envelope in noisy speech. The target speech is then enhanced by using a Wiener filter, on the basis of a short-term power spectral estimation. To deal with the distracting musical noise of the processed speech, phase randomization, along with adjacent spectral averaging, is adopted. Objective measures and an informal hearing test both show an improved performance as well as obvious attenuation of residual noise. The low power consumption and high efficiency render the whole algorithm very applicable for use in digital hearing aids. 展开更多
关键词 voice activity detection power envelope Wiener filter speech enhancement
原文传递
Speech enhancement with a GSC-like structure employing sparse coding
7
作者 Li-chun YANG Yun-tao QIAN 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2014年第12期1154-1163,共10页
Speech communication is often influenced by various types of interfering signals. To improve the quality of the desired signal, a generalized sidelobe canceller(GSC), which uses a reference signal to estimate the inte... Speech communication is often influenced by various types of interfering signals. To improve the quality of the desired signal, a generalized sidelobe canceller(GSC), which uses a reference signal to estimate the interfering signal, is attracting attention of researchers. However, the interference suppression of GSC is limited since a little residual desired signal leaks into the reference signal. To overcome this problem, we use sparse coding to suppress the residual desired signal while preserving the reference signal. Sparse coding with the learned dictionary is usually used to reconstruct the desired signal. As the training samples of a desired signal for dictionary learning are not observable in the real environment, the reconstructed desired signal may contain a lot of residual interfering signal. In contrast,the training samples of the interfering signal during the absence of the desired signal for interferer dictionary learning can be achieved through voice activity detection(VAD). Since the reference signal of an interfering signal is coherent to the interferer dictionary, it can be well restructured by sparse coding, while the residual desired signal will be removed. The performance of GSC will be improved since the estimate of the interfering signal with the proposed reference signal is more accurate than ever. Simulation and experiments on a real acoustic environment show that our proposed method is effective in suppressing interfering signals. 展开更多
关键词 Generalized sidelobe canceller Speech enhancement voice activity detection Dictionary learning Sparse coding
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部