摘要
研究了帧长、帧移、信噪比、窗函数和语速对汉语音段反转言语掩蔽效率的影响.采用不同参数对目标语音进行处理,得到音段反转言语,其掩蔽效率由被掩蔽声的可懂度衡量,被掩蔽声的可懂度由语音质量感知评估(PESQ)指标评价得到.研究表明,汉语音段反转言语的掩蔽效率随帧长的增加、信噪比的降低或语速的升高而升高,帧移为帧长的1/2时掩蔽效率最高,窗函数对掩蔽效率没有影响.
This paper investigates the effects of parameters on masking efficiency of Chinese time- reversed speech. The parameters include frame length, frame shift, signal-to-noise ratio, time reversal window and speech rate. The time-reversed speeches with different parameters are derived from the target speech and the masking efficiency is shown by the speech intelligibility of the masked speech,which is evaluated by the perceptual evaluation of speech quality (PESQ) measure. Results show that the masking efficiency of time-reversed speech increases as the frame length increases, the signal-to-noise ratio decreases, or the speech rate increases. The making efficiency maintains unchanged with the time reversal window, and it presents the highest masking efficiency when the frame shift is half of the frame length. The results provide a good reference on the optimization design for Chinese time-reversed speech.
出处
《内蒙古师范大学学报(自然科学汉文版)》
CAS
北大核心
2017年第1期23-26,共4页
Journal of Inner Mongolia Normal University(Natural Science Edition)
基金
国家自然科学基金资助项目(11175257)
关键词
音段反转言语
语音质量感知评估(PESQ)
可懂度
掩蔽效率
time-reversed speech
perceptual evaluation of speech quality (PESQ)
speech intelligibility
masking efficiency