期刊文献+

病态嗓声识别特征参数的优化选择 被引量:1

Optimization and Selection of Feature Parameters in Dysphonia Recognition
下载PDF
导出
摘要 为提高病态嗓声识别效率,本研究首次采用主分量分析方法对目前常用的27个嗓声特征参数进行了优化分析,考察了仅用少数主分量参数的识别效果及其分量数对结果的影响;同时根据参数对病态嗓声信息敏感程度,使用正交试验法优选出9个较优特征参数,其识别率即可达到原27个参数的识别结果。经两种方法对参数进行优选后识别率分别达到97.23%和98.10%,显著高于未经优选、使用全部27个参数的92.10%识别率。研究结果表明:原27个参数中,2/3的参数不能很好地反映嗓声的病态变化特征,使用优选的含有大量病态嗓声信息的少量特征参数即可大幅提高病态嗓声识别效率。 In order to improve the performance of dysphonia recognition, 27 common used original acoustic feature parameters were optimized and analyzed by using the method of primary components analysis (PCA). The performances of PCA with only a few primary components and influenced results with changing component numbers were investigated. In this essay 9 optimum feature parameters were selected from 27 original parameters by using orthogonal layout based on their sensitivities in reflecting the voice disorder information, and then nearly the same corrective classification rate of dysphonla recognition was obtained. The optimized rates given by two methods of primary component analysis and orthogonal layout we are 97.23% and 98.10% respectively, much higher than the rate of 92.10% which was got by using the total 27 original acoustic feature parameters without optimization. The results showed that two thirds of 27 original feature parameters could not reflect the main voice disorder characteristics. The higher performance of dysphonia recognition was achieved by using fewer optimized feature parameters which contains the most information of voice disorders.
出处 《中国生物医学工程学报》 CAS CSCD 北大核心 2007年第5期675-679,共5页 Chinese Journal of Biomedical Engineering
基金 天津市重点学科建设基金(津教委高[2000]-31)资助。
关键词 病态嗓声识别 主分量分析 正交试验 支持向量机 正交表 dysphonia recognition primary components analysis orthogonal design experiment support vector machine orthogonal layout
  • 相关文献

参考文献10

  • 1Becker W, Naumann HH, Faltz CR. Ear, nose and throat Diseases[M]. (2nd. Edition). New York: Thieme Medical, 1994.
  • 2Qiu Qingjun, Xu Keyin, Yu Qilian, et al. Automatic quantitative study on the vibrational wave of human vocal folds via videokymography [J].Proc SPIE, Electronic Imaging and Multimedia Technology III, 2002, 4925:430 - 435.
  • 3Maguire C, Chazal P, Reilly RB, et al. Automatic classification of voice pathology using speech analysis[A]. In: Proc of the 3rd Intl[C]. Florence Itady: Workshop Models and Analysis of Vocal Emissions for Biomedical Applications, 2003. 259 - 262.
  • 4Moran R J, Reilly RB. Telephony-Based Voice Pathology Assessment Using Automated Speech Analysis[J]. IEEE Trans Biomed Eng, 2006, 53(3):468-477.
  • 5Dibazar AA, Narayanan S, Berger TW. Feature analysis for automatic detection of pathological speech [A]. In: Proc IEEE Joint, EMBS/BMES Conf[C]. Houston: Proceedings of the Second Joint, 2002. 1 : 182 - 184
  • 6杨大利,徐明星,吴文虎.语音识别特征参数选择方法研究[J].计算机研究与发展,2003,40(7):963-969. 被引量:21
  • 7陈魁.实验设计与分析[M].北京:清华大学出版社,1996.115-145.
  • 8Chang Chihchung, Lin Chihlin. LIBSVM: a library for support vector machines [ EB/OL]. http://www. csie. ntu. edu. tw/- cjlin/ libsvm, 2001/2006 - 10 - 08.
  • 9Lin Husantien, Lin Chihlin. A study on sigmoid kernels for SVM and the training of non-PSD kernels by SMO-type methods [ EB/ OL]. http://www.csie. ntu. edu. tw/- cjlin/papers. html, 2003 - 03/2006 - 10- 08.
  • 10Cehallos LG, Hansen HL. Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection [J].In IEEE Trans Biomed Eng, 1996, 43(4) :373 - 383.

二级参考文献6

  • 1陈魁.实验设计与分析[M].北京:清华大学出版社,1996,8.94.
  • 2O Viikki, K Laurila. Cepstral domain segmental feature vector normalization for noise robust speech recognition. Speech Communication, 1998, 25(1): 133--147.
  • 3Yang Dali, Xu Mingxing, Wu Wenhu. A novel feature selection method in speech recognition. Int' 1 Conf on Chinese Computing,Singapore, 2001.
  • 4K Paliwal. Study of line spectrum pair frequencies for vowel recognition. Speech Communication, 1989, 8(1): 27--33.
  • 5Hermansky, Hykek, Morgan Nelson. RASTA processing of speech. IEEE Trans on Speech and Audio Processing, 1994, 2(4) : 578--589.
  • 6C Emmanouilidis, A Hunter. Multiobjective evolutionary setting for feature selection and a commonality-based crossover operator.In: Proc of the IEEE Conf on Evolutionary Computation.Piscataway: Institute of Electrical and Electronic Engineers Inc,2000. 309--316.

共引文献47

同被引文献11

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部