期刊文献+

A Novel Visualization Tool for Manual Annotation when Building Large Speech Corpora

A Novel Visualization Tool for Manual Annotation when Building Large Speech Corpora
下载PDF
导出
摘要 A novel visualized sound description, called sound dendrogram is proposed to make manual annotation easier when building large speech corpora. It is a lattice structure built from a group of "seed regions" and through an iterative procedure of mergence. A simple but reliable extraction method of "seed regkms" and advanced distance metric are adopted to construct the sound dendrogram, so that it can present speech's structure character ranging from coarse to fine in a visualized way. Tests show that all phonemic boundaries are contained in the lattice structure of sound dendrogram and very easy to identify. Sound dendrogram can be a powerful assistant tool during the process of speech corporals manual annotation. A novel visualized sound description, called sound dendrogram is proposed to make manual annotation easier when building large speech corpora. It is a lattice structure built from a group of "seed regions" and through an iterative procedure of mergence. A simple but reliable extraction method of "seed regkms" and advanced distance metric are adopted to construct the sound dendrogram, so that it can present speech's structure character ranging from coarse to fine in a visualized way. Tests show that all phonemic boundaries are contained in the lattice structure of sound dendrogram and very easy to identify. Sound dendrogram can be a powerful assistant tool during the process of speech corporals manual annotation.
出处 《Wuhan University Journal of Natural Sciences》 CAS 2006年第2期381-384,共4页 武汉大学学报(自然科学英文版)
基金 SupportedbytheNationalNaturalScienceFoundationofChina(50099620)andtheNationalHighTechnologyDevelopmentProgramofChina(2001AA132050)
关键词 sound dedrogram speech corpora manual annotation computer aid tool sound dedrogram speech corpora manual annotation computer aid tool
  • 相关文献

参考文献4

  • 1GlassJR.FindingAcousticRegularitiesinSpeech:Applica tiontoPhoneticRecognition[]..1988
  • 2SeneffS.AJointSynchrony/Mean RateModelofAuditory SpeechProcessing[].JournalofPhonetics.1988
  • 3TangM.LargeVocabularyContinuousSpeechRecognition UsingLinguisticFeaturesandConstraints[]..2005
  • 4RabinerL,JuangBH.FundamentalsofSpeechRecognition[]..1993

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部