基于词片的语言模型及在汉语语音检索中的应用被引量：5

Study on performance optimization for Chinese speech retrieval

下载PDF

导出

摘要在汉语语音检索研究中,为充分利用汉语中音节相互搭配的语言学知识,提出了一种新的汉语语言模型构造基元——"词片"(word fragment),研究了最佳词片选择算法。汉语语音识别实验和语音检索实验表明,采用基于词片的语音模型后,音节正确率有所提高,并取得了更好的语音检索性能。 A new unit, named word fragment of language model was proposed to take full advantage of the Chinese linguistic information among adjacent syllables, and an algorithm for word fragment selection was studied. The experimental results show, with the language model based on word fragment, syllable accuracy for recognizer is improved and the speech retrieval system gives better performance than the one with only syllable based model.

作者郑铁然韩纪庆李海洋

机构地区哈尔滨工业大学计算机学院

出处《通信学报》 EI CSCD 北大核心 2009年第3期84-88,共5页 Journal on Communications

基金国家重点基础研究发展计划("973"计划)基金资助项目(2007CB311100) 国家自然科学基金资助项目(60575030)~~

关键词汉语语音检索语言模型词片互信息 Chinese speech retrieval language model word fragment lattice

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献12

1ABBERLEY D, RENALS S, COOK G. Retrieval of broadcast news documents with the THISL system[A]. Proc ICASSP98[C]. Seattle, 1998.3781-3784.
2NG K, ZUE V W. Subword-based approaches for spoken document retrieval[J]. Speech Communication, 2000, 32:157-186.
3BETH L, PEDRO M, OM D. Word and subword indexing approaches for reducing the effects of OOV queries on spoken audio[A]. Proc HLT2002[C]. San Diego, 2002.
4WECHSLER M, SCHAUBLE P. Speech retrieval based on automatic indexing[A]. Proc MIRO '95[C]. Glasgow, 1995.
5SEIDE F, et al. Vocabulary independent search in spontaneous speech[A]. Proc ICASSP'04[C]. Montreal, 2004.I253-I256.
6SARACLAR M, SPROAT R. Lattice-based search for spoken utterance retrieval[A]. Proc HLT-NAACL 2004[C]. Boston, Massachusetts, USA, 2004.129-136.
7HORI T, HETHERINGTON I L, HAZEN T J. Open-vocdalaryspoken utterance retrieval using confusion networks[A]. Proc ICASSP'07[C]. Honolulu, HI, USA, 2007.73-76.
8BAI B R, CHEN B L, WANG H M. Syllable based Chinese text/spoken document retrieval[J]. International Journal of Pattern Recognition and Artificial Intelligence, 2000, 14(5): 603-616.
9BAI B R, WANG H M, LEE L S. Discriminating capabilities of syllable based features and approaches of utilizing them for voice retrieval of speech information in mandarin Chinese[J]. IEEE Trans Speech Audio Processing, 2002, 10(5): 303-314.
10BEAUJARD C, JARDINO M. Language modeling based on automatic word concatenations[A]. Proceedings of European Conference on Speech Communication and Technology[C]. Budapest, Hungary, 1999.

同被引文献55

1李素建,王厚峰,俞士汶,辛乘胜.关键词自动标引的最大熵模型应用研究[J].计算机学报,2004,27(9):1192-1197. 被引量：93
2赵玉娟,水鹏朗,张凌霜.基于子空间匹配追踪的信号稀疏逼近[J].信号处理,2006,22(4):501-505. 被引量：9
3杨琳,张建平,颜永红.特定领域的汉语语言模型平滑算法比较研究[J].计算机工程与应用,2006,42(32):14-16. 被引量：5
4Good I J. The Population Frequencies of Species and the Estimation of Population Parameters. Biometrika, 1953, 40 (3/4) : 237 - 264.
5Katz S M. Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer. IEEE Trans on Acoustics, Speech, and Signal Processing, 1987, 35 ( 3 ) : 400 - 401.
6Gale W A, Sampson G. Good-Turing Frequency Estimation without Tears. Quantitative Linguistics, 1995, 2:217-237.
7John M, Mostefa Mesbah, Boualem Boashash. A new discrete analytic signal for reducing aliasing in the discrete wigner-ville distribution [J]. IEEE Transactions on Signal Proees-sing, 2008, 56 (11): 5427-5434.
8Peng Z K, Meng G, Chu F L, et al. Polynomial chirplet transform with application to instantaneous frequency estimation [J]. IEEE Transactions on Instrumentation and Measurement, 2011, 60 (9): 3222-3229.
9Fabien Millioz, Nadine Martin. Circularity of the STFT and spectral kurtosis for time-frequency segmentation in Gaussian environment[J].IEEE Transactions on Signal Proce-ssing, 2011, 59 (2): 515-523.
10Zakria Hussain, John Shawe-Taylor. Design and generalization analysis of orthogonal matching pursuit algorithms [J].IEEE Transactions on Information Theory, 2011, 57 ( 8 ): 5326-5340.

引证文献5

1张磊,陆冬,项学智.改进的Katz算法及其在基于Lattice识别系统中的应用[J].模式识别与人工智能,2011,24(2):249-254.
2董帅飞,于凤芹.基于Chirp原子改进的声母时频特征提取研究[J].计算机工程与设计,2013,34(3):1013-1017.
3陆明明,张连海,屈丹.基于子词PSPL的汉语语音文档索引[J].应用科学学报,2013,31(3):259-265.
4张力文,努尔麦麦提.尤鲁瓦斯,吾守尔.斯拉木.维吾尔语语音检索技术研究[J].中文信息学报,2014,28(5):182-186. 被引量：3
5范正光,屈丹,闫红刚,张文林.借助音频数据的发音字典新词学习方法[J].西安交通大学学报,2016,50(6):75-82. 被引量：1

二级引证文献4

1李如雄.基于语音分析的智能质检系统设计[J].自动化与仪器仪表,2017(6):114-116. 被引量：8
2呼媛玲,寇媛媛.基于音素的英文发音自动评测系统设计[J].自动化与仪器仪表,2018,0(11):160-163.
3苏立伟,刘振华,陈海燕.95598电力客服智能质检系统问题语音检出方法研究[J].微型电脑应用,2019,35(8):98-100. 被引量：7
4陈海燕,乔麟,苏立伟.基于语音分析的电力行业智能客服评分方法设计[J].微型电脑应用,2019,35(9):66-69. 被引量：7

1孟莎,刘加.汉语语音检索的集外词问题与两阶段检索方法[J].中文信息学报,2009,23(6):91-97. 被引量：8
2郑铁然,韩纪庆.基于后验概率的汉语语音检索方法研究[J].高技术通讯,2009,19(2):119-124. 被引量：1
3听写机及其语音模型[J].科技开发动态,2003(8):24-24.
4黄湘松,赵春晖,张磊,刘柏森.基于互信息置信度的网格连续汉语语音检索[J].计算机应用研究,2009,26(12):4607-4609. 被引量：1
5林晓钢,汪文林,何渝,郭永彩.一种高识别率的语音密码锁[J].重庆大学学报（自然科学版）,2008,31(3):307-310. 被引量：1
6WANG Yebin ZHAO Heming.Vocal tract resonances tracking by auxiliary vector particle filters[J].Chinese Journal of Acoustics,2011,30(1):105-114.
7沈鑫剡,俞海英,伍红兵.分组语音[J].中国数据通信,2001,3(4):59-61.
8田岚,白树忠,郑丽娜.基于多特征序贯判决的电话语音声纹鉴别方法研究[J].山东大学学报（工学版）,2003,33(6):648-651. 被引量：4
9薛丽萍,尹俊勋,纪震.基于粒子群优化-模糊聚类的说话人识别[J].深圳大学学报（理工版）,2008,25(2):178-183. 被引量：8
10叶虹.基于仿生模式识别的非特定人连续语音识别的研究[J].浙江工业大学学报,2006,34(4):433-435.

通信学报

2009年第3期

浏览历史

内容加载中请稍等...

基于词片的语言模型及在汉语语音检索中的应用被引量：5

参考文献12

同被引文献55

引证文献5

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

基于词片的语言模型及在汉语语音检索中的应用 被引量：5

参考文献12

同被引文献55

引证文献5

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

基于词片的语言模型及在汉语语音检索中的应用被引量：5