期刊文献+

基于互信息置信度的网格连续汉语语音检索 被引量:1

Lattice-based continuous Chinese speech indexing based on mutual information confidence measure
下载PDF
导出
摘要 针对目前生活中涌现的海量语音数据,人们对语音检索技术准确度的要求越来越高。主要研究了汉语连续语音检索任务中,基于转换音节网格的研究方法。针对语音检索系统中置信度计算的问题,提出了一种基于音节间互信息的置信度计算方法,并将其用于网格结构的语音检索系统中。该方法能够有效地利用上下文之间的互信息量,从而更准确、合理地描述汉语语言模型。实验结果表明,用提出的方法建立转换音节网格来进行语音检索,其检出率(FOM)比后验概率法和N-best法有较大幅度的提高。得到的汉语语音检索系统其FOM最高可以达到83.7%。 Nowadays, with the overwhelming amounts of speech data rushing in our life, higher and higher accuracy of speech indexing techniques is required. This paper mainly studied a converted syllable lattice-based approach in a Chinese continuous speech indexing task. Aiming at the computation of confidence measure in a speech indexing system, this paper proposed a confidence measure method based on mutual information between syllables, which was used in a lattice construction system for speech indexing. The method took full advantage of the context mutual information, which could describe Chinese language model more exactly and logically. The experiment results show that using the proposed method to build a converted syllable lattice in a speech indexing system, the FOM of which has great improvement comparing with posterior probability based method and N-best based method. This best system for Chinese speech indexing achieves a FOM of 83.7%.
出处 《计算机应用研究》 CSCD 北大核心 2009年第12期4607-4609,4616,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(60702053)
关键词 网格 互信息 语音检索 置信度 语言模型 lattice mutual information speech indexing confidence measure language model
  • 相关文献

参考文献11

  • 1TIERAN Z, JIQING H. Chinese spoken document retrieval based on syllable neighbor posterior probability matrix [ C ]//Proc of International Conference on Audio, Language and Image Processing. 2008 : 1209- 1213.
  • 2SILVA J, CHELBA C, ACERO A. Integration of metadata in spoken document search using position specific posterior latices [ C ]//Proc of Spoken Language Technology Workshop. 2006 : 46- 49.
  • 3MENG S, YU P, SEIDE F, LIU J. A study of lattice-based spoken term detection for Chinese spontaneous speech [ C ]//Proc of Automatic Speech Recognition and Understanding. 2007:635-640.
  • 4VIET-BAC L, SOPHEAP S, LAURENT B, et al. Word/sub-word lattices decomposition and combination for speech recognition [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. 2008 : 4321 - 4324.
  • 5MENG Sha, YU Peng, LIU Jia, et al. Fusing multiple systems into a compact lattice index for Chinese spoken term detection [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing.2008 :4345- 4348.
  • 6TAKAAKI H, LEE I H, TIMOTHY J H, et al. Open-vocabulary spoken utterance retrieval using confusion networks [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. 2007 : 73- 76.
  • 7CHAK S L, SHI B E. An one-pass strategy for keyword spotting and verification [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. 2001 : 377- 380.
  • 8LEE S W, TANAKA K, ITOH Y. Combining multiple subword representations for open-vocabulary spoken document retrieval[ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. Philadelphia : [ s. n. ], 2005.
  • 9YU D, DENG L, ACERO A. A lattice search technique for a longcontextual-span hidden trajectory model of speech [ J ]. Speech Communication, 2006,48(9) :1214- 1226.
  • 10TUR D H, BECHET F, RICCARDI G, et al. Beyond ASR 1-best: using word confusion networks in spoken language understanding [ J ]. Computer Speech and Language, 2006,20(4) : 495-514.

同被引文献15

  • 1冀常鹏;高茉.嘈杂环境下语音信号端点检测方法研究[J]{H}成都信息工程学院学报,2011(3):283-287.
  • 2郑成诗;胡笑浒;常道庆.语音倒谱系数均值的 DD 估计算法[J]{H}声学技术,2009(5):255-256.
  • 3李曜;刘加.基于汉语语音学决策树构建语音识别声学建模方法研究[A]2007年全国网络与信息安全技术研讨会,2007380-385.
  • 4Junfeng Sun,Nengheng Zheng,Xinlong Wang. Enhancement of Chi-nese speech based on nonlinear dynamics[J].{H}SIGNAL PROCESSING,2007,(10):2431-2445.
  • 5Gregory S Ezra,Holger Waaalkens,Stephen.Wiggins. Microcanonical rates,gap times,and phase space dividing surfaces[J].The Journal of chemical physics,2009,(16):118-164.
  • 6Takens F. Detecting strange attractors in turbulence[J].{H}Lecture Notes in Mathematics,1981.366-381.
  • 7Brown R,Bryant P. Computing the Lyapunov spectrum of a dynamical system from an observed time series[J].{H}Physical Review A,1991,(6):2787-2806.
  • 8Hao Bailin. Elementary.Symbolic Dynamics and Chaos in Dissipative System[M].World Scientific,New Jersey,1990.196-201.
  • 9Anonymous. Study results from University of Potsdam update under-standing of bifurcation and chaos[J].Journal of Technology & Sci-ence,2008.1944-1894.
  • 10Lim Sukbin,Goldman Mark S. Noise tolerance of attractor and feedfor-ward memory models[J].{H}Neural Computation,2012,(2).

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部