Abstract
Phoneme segmentation is a major component of speech research and plays an important role in large-vocabulary continuous speech recognition and speech synthesis. This paper takes the Miao language of central Guizhou Province as its research object and performs feature extraction and phoneme boundary detection on it. The mean spectral energy of each recording is computed over low-, mid-, and high-frequency bands, and the abrupt change points in the waveform formed by each band's mean values are taken as candidate boundaries; boundaries with a width below 20 ms are removed. The remaining boundary points are then sorted and filtered once more so that only boundaries with a width above 20 ms are kept, which yields the final boundary points. Within a certain error tolerance, the accuracy reaches 83%.
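The procedure summarized in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the function names, the band edges (given as bin ranges), the jump threshold, and the frame hop are all hypothetical, and "width" is assumed here to mean the time gap between a boundary and the previously kept one. The accuracy measure (share of reference boundaries matched within a tolerance) is likewise an assumed reading of "within a certain error tolerance".

```python
from typing import List, Sequence, Tuple


def band_means(spectrogram: Sequence[Sequence[float]],
               bands: Sequence[Tuple[int, int]]) -> List[List[float]]:
    """Return one track per band: the mean energy of bins [lo, hi) in each frame."""
    return [[sum(frame[lo:hi]) / (hi - lo) for frame in spectrogram]
            for lo, hi in bands]


def abrupt_changes(track: Sequence[float], hop_ms: float,
                   threshold: float) -> List[float]:
    """Times (ms) at which the track jumps by more than `threshold` (assumed criterion)."""
    return [i * hop_ms for i in range(1, len(track))
            if abs(track[i] - track[i - 1]) > threshold]


def enforce_min_gap(times: Sequence[float], min_gap_ms: float = 20.0) -> List[float]:
    """Sort the boundaries and drop any that follow the last kept one by less than min_gap_ms."""
    kept: List[float] = []
    for t in sorted(times):
        if not kept or t - kept[-1] >= min_gap_ms:
            kept.append(t)
    return kept


def segment(spectrogram, bands, hop_ms, threshold, min_gap_ms=20.0) -> List[float]:
    """Per-band candidates, first 20 ms filter, then merge, sort, and filter again."""
    candidates: List[float] = []
    for track in band_means(spectrogram, bands):
        per_band = abrupt_changes(track, hop_ms, threshold)
        candidates.extend(enforce_min_gap(per_band, min_gap_ms))  # first pass
    return enforce_min_gap(candidates, min_gap_ms)                # second pass


def accuracy_within_tolerance(predicted, reference, tol_ms) -> float:
    """Fraction of reference boundaries with a predicted boundary within tol_ms (assumed metric)."""
    hits = sum(1 for r in reference if any(abs(p - r) <= tol_ms for p in predicted))
    return hits / len(reference)


# Toy input: 15 frames of 6 bins, 10 ms hop; energy moves low -> mid -> high band.
toy = [[1, 1, 0, 0, 0, 0]] * 5 + [[0, 0, 1, 1, 0, 0]] * 5 + [[0, 0, 0, 0, 1, 1]] * 5
toy_bands = [(0, 2), (2, 4), (4, 6)]
boundaries = segment(toy, toy_bands, hop_ms=10, threshold=0.5)
```

On the toy input, the low and mid bands both flag the change at frame 5 and the mid and high bands both flag frame 10; the second pass collapses these duplicates, which is why the merged list is filtered again after sorting.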
Authors
LI Xuelin (李学林)
ZHAO Dongmei (赵冬梅)
LIANG Mingxiu (梁明秀)
Guizhou Minzu University, Guiyang 550025, China
Source
Modern Information Technology (《现代信息科技》)
2020, No. 3, pp. 19-21 (3 pages)
Funding
Guizhou Minzu University university-level research project ([2018]5773-QN02).
Keywords
Miao speech
Praat annotation
spectrogram energy
speech segmentation