基于优化检测网络和MLP特征改进发音错误检测的方法被引量：2

Mispronunciation detection with an optimized detection network and multi-layer perception based features

导出

摘要该文基于优化的检测网络和多层感知(multi-layerperception,MLP)特征,提出一种可以更加准确地检测出错误发音类型的方法。首先,从第二语言学习的语音库中提取出基本的发音规则以及组合的发音规则,并相应地计算它们发生的先验概率,再将这些具有先验概率的规则用于构建基于多发音的扩展检测网络。然后在检测过程中,引入基于发音特征的MLP特征来描述发音概率,替代了传统的语音声学特征。最后使用基于MLP特征的GMM-HMM框架从检测网络中识别出最可能的发音音素串。实验表明:该方法将音素识别正确率提高了3.11%,错误类型准确率提高了7.42%。 This paper describes an optimized detection network for multi-layer pereeptron （MLP） features to more accurately capture mispronunciations. First, the basic and combined phonological rules are extracted from the L2 speech corpus with computation of their prior probability of occurrence. The prior probability rules are then used to build a multiple pronunciation based extended detection network. Then, articulatory based MLP features are introduced to describe the pronunciation probability instead of the conventional speech acoustic features during detection. Finally, the GMM-HMM framework with MLP features is used to pick the most probable pronunciation phoneme sequences from the detection network. Tests show that this approach improves phoneme recognition accuracy by 3.11% and the mispronunciation type accuracy by 7.42%.

作者袁桦钱彦旻赵军红刘加

机构地区清华大学电子工程系中国科学院电子学研究所

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2012年第4期557-560,570,共5页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金资助项目(60931160443,90920302,N-CUHK414/09) 国家科技支撑计划项目(2009BAH41B01)

关键词发音错误检测发音规则多层感知(MLP) 发音特征 mispronunciation detection phonological rules multi-layerperceptron （MLP） articulatory feature

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献9

1Eskenazi M.An overview of spoken language technology foreducation[J].Speech Communication,2009,51(10):823-844.
2QIAN Xiaojun,Meng H,Soong F.Capturing L2segmentalmispronunciations with joint-sequence models inComputer-Aided Pronunciation Training(CAPT)[C]//Proceedings on 7th Chinese Spoken Language Processing(ISCSLP).Tainan,China:IEEE Press,2010:84-88.
3Yoon S Y,Hasegawa-Johnson M,Sproat R.Landmark-based automated pronunciation error detection[C]//Proceedings on Interspeech.Tokyo:InternationalSpeech Communication Association,2010:614-617.
4ZHANG Feng,HUANG Chao,Soong F,et al.Automaticmispronunciation detection for Mandarin[C]//Proceedingson ICASSP.Piscataway,USA:IEEE Press,2008:5077-5080.
5WEI Si,HUA Guoping,HU Yu,et al.A new method formispronunciation detection using Support Vector Machinebased on Pronunciation Space Models[J].SpeechCommunication,2009,51(10):896-905.
6Meng H,Lo Y Y,Wang L,et al.Deriving salient learners'mispronunciations from cross-language phonologicalcomparisons[C]//Proceedings on ASRU.Kyoto:IEEEPress,2007:437-442.
7Harrison A M,Lau W Y,Meng H,et al.Improvingmispronunciation detection and diagnosis of learners'speechwith context-sensitive phonological rules based on languagetransfer[C]//Proceedings on Interspeech.Brisbane:International Speech Communication Association,2008:2787-2790.
8Lo W K,ZHANG Shuang,Meng H.Automatic derivation ofphonological rules for mispronunciation detection in acomputer-assisted pronunciation training system[C]//Proceedings on Interspeech.Makuhari,Japan:InternationalSpeech Communication Association,2010:765-768.
9Allauzen C,Riley M,Schalkwyk J,et al.OpenFst:Ageneral and efficient weighted finite-state transducer library[J].Computer Science,2007,4783:11-23.

同被引文献14

1张海,陶晓宇,夏白桦.战术数据链网络设计优化方法[J].火力与指挥控制,2009,34(S1):108-111. 被引量：4
2贾春强,田树军,张宏,刘万辉.基于智能优化方法的插装阀液压集成块设计[J].计算机集成制造系统,2007,13(6):1041-1046. 被引量：8
3Wang L Y,Liu A,Sushil J. Using Attack Graphs for Correlating,Hypothesizing,and Predicting Intrusion alerts[J].Elsevier B V,2008,(05):21.
4刘澎;王宏远.基于混合遗传算法优化的MLP神经网络的调制方式识别[J]武汉大学学报,2010(02):98-102.
5赵宣;王伟平.入侵检测系统报警信息关联分析模型的设计与实[J]计算机与现代化,2010(04):87-90.
6刘澍,王宏远.基于混合遗传算法优化的MLP神经网络的调制方式识别[J].武汉大学学报（理学版）,2008,54(1):104-108. 被引量：7
7葛凤培,潘复平,董滨,颜永红.汉语发音质量评估的实验研究[J].声学学报,2010,35(2):261-266. 被引量：12
8郭健彬,曾声奎,陈云霞.稳健协同优化方法的改进和应用[J].火力与指挥控制,2010,35(4):32-35. 被引量：3
9安丽丽,吴延年,刘志,刘润生.一种基于检错音网络的发音错误检测新算法[J].电子与信息学报,2012,34(9):2085-2090. 被引量：1
10李志萍,汤晋瑄,强彦.智能家居报警系统的设计与实现[J].电脑开发与应用,2012,25(10):63-65. 被引量：7

引证文献2

1刘荷花.基于MLP优化网络的报警关联技术[J].火力与指挥控制,2013,38(2):9-13.
2柳宗铭,王丽,李军锋,张鹏远.声学发音模型辅助建模的发音错误检测与诊断[J].声学学报,2023,48(1):264-273.

1晁浩,宋成,彭维平.基于发音特征的声效相关鲁棒语音识别算法[J].计算机应用,2015,35(1):257-261. 被引量：8
2赵艳君.数字电路软错误防护方法研究[J].硅谷,2013,6(3):83-83.
3杨董玲.浅谈日语的声调[J].科技信息,2009(35):224-225. 被引量：2
4安丽丽,吴延年,刘志,刘润生.一种基于检错音网络的发音错误检测新算法[J].电子与信息学报,2012,34(9):2085-2090. 被引量：1
5陈雁翔,刘鸣.基于发音特征的音视频说话人识别鲁棒性的研究[J].电子学报,2010,38(12):2920-2924. 被引量：2
6刘明辉,黄中伟.结合高斯混合模型和VOT特征的音素发音错误检测[J].科学技术与工程,2013,21(7):1789-1793. 被引量：3
7张琰彬,呼月宁,初敏,黄超,梁满贵.汉语普通话声调发音错误检测[J].清华大学学报（自然科学版）,2008,48(S1):683-687. 被引量：1
8卢学燕.浅谈数字电路中软错误防护方法[J].装备制造,2014,0(S2):78-78.
9何桂清,虞厥邦,庞晓忠.一种新的神经网络均衡器:结构、算法与性能[J].信号处理,1994,10(2):81-86. 被引量：3
10陈晓红,沈东华.VB程序的调试技术及应用实例研究[J].无线互联科技,2016,13(24):135-137.

清华大学学报（自然科学版）

2012年第4期

浏览历史

内容加载中请稍等...

基于优化检测网络和MLP特征改进发音错误检测的方法被引量：2

参考文献9

同被引文献14

引证文献2

相关作者

相关机构

相关主题

浏览历史

基于优化检测网络和MLP特征改进发音错误检测的方法 被引量：2

参考文献9

同被引文献14

引证文献2

相关作者

相关机构

相关主题

浏览历史

基于优化检测网络和MLP特征改进发音错误检测的方法被引量：2