基于能量谱熵的英语摩擦音检测方法

An English Fricative Detection Method Based on Energy Spectrum Entropy

下载PDF

导出

摘要根据摩擦音发声时的频谱特点,提出一种基于能量谱熵的摩擦音检测方法.该方法首先利用不同音素的语谱能量特点检测出音素边界.然后计算每个语音段的能量谱熵,并将超过阈值的语音段作为候选.最后根据语音段的长度、开始结束时的能量突变等对特征候选语音段后处理,去除错误候选.实验表明,在干净环境中并且容错误差为20 ms时,摩擦音的检测率达到96.9%. According to the spectrum characteristics of fricatives, a fricative detection method based on the energy spectrum entropy is proposed. Firstly, phone boundaries are detected based on spectrum of different phonemes. Then, each spectrum entropy of speech segments is computed and the segments whose entropy exceeds the threshold are selected as candidates. Finally, post processing is conducted to remove the insertion errors according to parameters of segment length and the sudden changing of energy at segment starts and ends. The experimental results show that the accuracy of the proposed method is up to 96.9% in clean circumstance when the tolerance is 20 ms.

作者李立永张连海

机构地区解放军信息工程大学信息系统工程学院

出处《模式识别与人工智能》 EI CSCD 北大核心 2014年第6期554-560,共7页 Pattern Recognition and Artificial Intelligence

关键词能量谱熵摩擦音检测音素边界检测 Energy Spectrum Entropy, Fricative Detection, Phone Boundary Detection

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1Lee C. From Knowledge-Ignorant to Knowledge-Rich Modeling : A New Speech Research Paradigm for Next Generation AutomaticSpeech Recognition[EB/OL]. [2012-08-30] . http://slam. iis.sinica. edu. tw/NGASR/workshop/20041127-asat. pdf.
2Dusan S, Rabiner L R. On Integrating Insights from Human Speech Perception into Automatic Speech Recognition [ EB/OL]. [2012-09-01 ] . http://cronos. rutgers. edu/ ~ lrr/lrr%20papers/352_dr_euro2005c. pdf.
3Lee C H . An Overview on Automatic Speech Attribute Transcription(ASAT) // Proc of the 8 th Annual Conference of the InternationalSpeech Communication Association. Antwerp, Belgium, 2007 :1825-1828.
4Stevens K N. Toward a Model for Lexical Access Based on AcousticLandmarks and Distinctive Features.Journal of the Acoustical Socie-ty of America, 2002, 111(4) : 1872-1891.
5Liu S A. Landmark Detection for Distinctive Feature-Based SpeechRecognition. Journal of the Acoustical Society of America, 1996,100(5): 3417-3430.
6Park C. Consonant Landmark Detection for Speech Recognition.Ph. D Dissertation. Massachusetts, USA: Massachusetts Instituteof Technology,2008.
7陈斌,张连海,牛铜,王波.基于能量分布和共振峰结构的汉语鼻音检测[J].中文信息学报,2012,26(1):104-109. 被引量：1
8Wang Y. A Two-Stage Sample-Based Phone Boundary Detector Using Segmental Similarity Features // Proc of the 12th AnnualConference of the International Speech Communication Association.Florence, Italy, 2011 : 413-416.
9Quatieri T F. Discrete-Time Speech Signal Processing: Principlesand Practice. Upper Saddle River, USA: Prentice Hall, 2001.
10李朝晖,迟惠生.听觉外周计算模型研究进展[J].声学学报,2006,31(5):449-465. 被引量：22

二级参考文献161

1栗学丽,丁慧,徐柏龄.基于熵函数的耳语音声韵分割法[J].声学学报,2005,30(1):69-75. 被引量：34
2Chin-Hui. Lee. From knowledge-ignorant to knowl- edge-rich modeling: A new speech research paradigm for next generation automatic speech recognition[C]// Proceedings of ICSLP Keynote speech, 2004.
3S. R. Mahadeva Prasanna, B.V. Sandeep Reddy, P. Krishnamoorthy. Vowel onset point detection using source, spectral peaks and modulation spectrum ener- gies[J]. IEEE Transactions on Audio, Speech and Language Processing, 2009,17 (4): 556-565.
4Almpanidis G. , Kotti M. , Kotropoulos C.. Robust Detection of Phone Boundaries Using Model Selection Criteria With Few Observations [J]. IEEE Transac- tions on Audio, Speech, and Language Processing, 2009,17(2) .. 287-298.
5K.Y. Leung, M. Siu. Speech Recognition Using Combined Acoustic and Articulatory Information with Retraining of Acoustic Model Parameters[C]//Pro- ceedings of ICSLP 2002,3: 2117-2120.
6M. Hasegawa-Johnson, J. Baker, S. Borys, et. al. Landmark-based speech recognition: Report of the 2004 Johns Hopkins summer workshop[C]//Proeeedings of ICASSP,2005 : 213-216.
7J. Morris, E. Fosler-Lussier. Further experiments with detector-based conditional random fields in pho- netic recognition[C]//Proeeedings of ICASSP, April, 2007.
8Carla Lopes, Fernando Perdigao. A HierarchicalBroad-class Classification to Enhance Phoneme Recog- nition[C]//Proceedings of European Signal Processing Conference, 2009,1760-1764.
9Limin Du, Kenneth Noble Stevens. Automatic Detec- tion of Landmark for Nasal Consonants from Speech Waveform[C]//Proceedings of ICSLP 2006.
10Sarah E. Borys. An SVM Front-end Landmark Speech Recognition System[M]. University of Illinois, 2008.

共引文献26

1马元锋,陈克安,王娜,郑文.听觉模型输出谱特征在声目标识别中的应用[J].声学学报,2009,34(2):142-150. 被引量：20
2马元锋,陈克安,马苗,张成.一种新的可应用于声目标识别的倒谱系数[J].兵工学报,2009,30(11):1477-1483. 被引量：12
3MA Yuanfeng,CHEN Ke'an,SHI Fang.Application of auditory spectrum-based features into acoustic target recognition[J].Chinese Journal of Acoustics,2010,29(1):33-44.
4马元锋,陈克安,王云山,马苗.自适应听觉感知时频分析模型[J].声学学报,2010,35(4):393-402. 被引量：1
5刘辉,杨俊安,周志增.听觉模型倒谱系数及其在声目标识别中的应用[J].应用科学学报,2011,29(1):51-55. 被引量：1
6陈斌,张连海,王波,屈丹.基于Seneff听觉谱特征的汉语连续语音声韵母边界检测[J].声学学报,2012,37(1):104-112. 被引量：6
7李皓,唐朝京.采用损失函数和声学特征切分声韵母的方法[J].声学学报,2012,37(3):339-345. 被引量：3
8张连海,陈斌,屈丹.基于发音特性的摩擦音和塞擦音分类算法[J].计算机科学,2012,39(9):211-214. 被引量：1
9李允公,戴丽,张金萍.一种双耳听觉模型及其在轴心轨迹分析中的应用[J].振动与冲击,2012,31(18):46-49. 被引量：3
10胡峰松,曹孝玉.基于Gammatone滤波器组的听觉特征提取[J].计算机工程,2012,38(21):168-170. 被引量：29

1董胡,钱盛友.改进的能量谱熵端点检测算法[J].测控技术,2016,35(6):26-29. 被引量：14
2才溪,赵巍.Contourlet变换低通滤波器对图像融合算法影响的讨论[J].自动化学报,2009,35(3):258-266. 被引量：10
3赵欢,王纲金,胡炼,彭秀娟.车载环境下基于样本熵的语音端点检测方法[J].计算机研究与发展,2011,48(3):471-476. 被引量：7
4杨慧珍,李要球.噪声环境下无波前探测自适应光学系统扩展目标成像校正[J].红外与激光工程,2013,42(S01):133-138.
5赵欢,王纲金,赵丽霞.一种新的对数能量谱熵语音端点检测方法[J].湖南大学学报（自然科学版）,2010,37(7):72-77. 被引量：17
6唐贵基,邓飞跃,何玉灵,王晓龙.基于时间-小波能量谱熵的滚动轴承故障诊断研究[J].振动与冲击,2014,33(7):68-72. 被引量：17
7曾敏,王贤川,胡国南.嵌入式网络协议仿真实验系统的设计[J].计算机应用与软件,2011,28(8):274-278. 被引量：5
8林明.第5代Wi-Fi技术与标准化研究[J].电子科学技术,2016,3(4):429-433.
9杨鸿波,侯霞.基于局部谱能量自相似矩阵的纹理描述[J].计算机应用,2014,34(3):790-796.
10YANG Cui WEI Gang.Fast sinusoidal analysis algorithm based on energy of narrowband spectrum[J].Chinese Journal of Acoustics,2010,29(4):413-427.

模式识别与人工智能

2014年第6期

浏览历史

内容加载中请稍等...

基于能量谱熵的英语摩擦音检测方法

参考文献11

二级参考文献161

共引文献26

相关作者

相关机构

相关主题

浏览历史