期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
基于轻量级卷积门控循环神经网络的语声增强方法 被引量:1
1
作者 王玫 李江和 +1 位作者 宋浠瑜 刘小娟 《应用声学》 CSCD 北大核心 2023年第3期652-658,共7页
针对在基于深度学习语声增强方法中因采用因果式的网络输入导致语声增强性能下降的问题,提出了一种基于轻量级卷积门控循环神经网络的语声增强方法。门控循环神经网络能够建模语声信号的时间相关性,但是其全连接结构忽略了语声信号的时... 针对在基于深度学习语声增强方法中因采用因果式的网络输入导致语声增强性能下降的问题,提出了一种基于轻量级卷积门控循环神经网络的语声增强方法。门控循环神经网络能够建模语声信号的时间相关性,但是其全连接结构忽略了语声信号的时频结构特征,并且参数数量庞大,不利于网络的训练。对此,该文采用卷积核替代门控循环神经网络中的全连接结构,在对语声信号时间相关性建模的同时保留了语声信号的时频结构特征,同时降低了网络的参数数量。为充分利用先前帧的特征信息,该网络单元当前时刻的输入融合了上一时刻的输入与输出。针对网络训练过程中容易产生过拟合的问题,该文采用了线性门控机制来控制信息的传输,这缓解了网络训练过程中的过拟合问题,提高了网络的语声增强性能。实验结果表明,该文所提出的网络结构在增强后的语声感知质量、语声短时客观可懂度、分段信噪比等指标上均优于传统的网络结构。 展开更多
关键词 卷积门控循环神经网络 固定时延 因果式语声增强 语声质量 语声可懂度
下载PDF
Whisper intelligibility enhancement based on noise robust feature and SVM 被引量:2
2
作者 周健 赵力 +1 位作者 梁瑞宇 方贤勇 《Journal of Southeast University(English Edition)》 EI CAS 2012年第3期261-265,共5页
A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize... A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize the enhanced whisper. A novel noise robust feature called Gammatone feature cosine coefficients (GFCCs) extracted by an auditory periphery model is derived and used for the binary mask estimation. The intelligibility performance of the proposed method is evaluated and compared with the traditional speech enhancement methods. Objective and subjective evaluation results indicate that the proposed method can effectively improve the intelligibility of whispered speech which is contaminated by noise. Compared with the power subtract algorithm and the log-MMSE algorithm, both of which do not improve the intelligibility in lower signal-to-noise ratio (SNR) environments, the proposed method has good performance in improving the intelligibility of noisy whisper. Additionally, the intelligibility of the enhanced whispered speech using the proposed method also outperforms that of the corresponding unprocessed noisy whispered speech. 展开更多
关键词 whispered speech intelligibility enhancement noise robust feature machine learning
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部