A Novel Visualization Tool for Manual Annotation when Building Large Speech Corpora

A Novel Visualization Tool for Manual Annotation when Building Large Speech Corpora

下载PDF

导出

摘要 A novel visualized sound description, called sound dendrogram is proposed to make manual annotation easier when building large speech corpora. It is a lattice structure built from a group of ＂seed regions＂ and through an iterative procedure of mergence. A simple but reliable extraction method of ＂seed regkms＂ and advanced distance metric are adopted to construct the sound dendrogram, so that it can present speech＇s structure character ranging from coarse to fine in a visualized way. Tests show that all phonemic boundaries are contained in the lattice structure of sound dendrogram and very easy to identify. Sound dendrogram can be a powerful assistant tool during the process of speech corporals manual annotation. A novel visualized sound description, called sound dendrogram is proposed to make manual annotation easier when building large speech corpora. It is a lattice structure built from a group of ＂seed regions＂ and through an iterative procedure of mergence. A simple but reliable extraction method of ＂seed regkms＂ and advanced distance metric are adopted to construct the sound dendrogram, so that it can present speech＇s structure character ranging from coarse to fine in a visualized way. Tests show that all phonemic boundaries are contained in the lattice structure of sound dendrogram and very easy to identify. Sound dendrogram can be a powerful assistant tool during the process of speech corporals manual annotation.

作者 SHE Kun CHEN Shuzhen YANG Shen ZOU Lian

机构地区 School of Electronic Information

出处《Wuhan University Journal of Natural Sciences》 CAS 2006年第2期381-384,共4页 武汉大学学报（自然科学英文版）

基金 SupportedbytheNationalNaturalScienceFoundationofChina(50099620)andtheNationalHighTechnologyDevelopmentProgramofChina(2001AA132050)

关键词 sound dedrogram speech corpora manual annotation computer aid tool sound dedrogram speech corpora manual annotation computer aid tool

分类号 TP37 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献4

1GlassJR.FindingAcousticRegularitiesinSpeech:Applica tiontoPhoneticRecognition[]..1988
2SeneffS.AJointSynchrony/Mean RateModelofAuditory SpeechProcessing[].JournalofPhonetics.1988
3TangM.LargeVocabularyContinuousSpeechRecognition UsingLinguisticFeaturesandConstraints[]..2005
4RabinerL,JuangBH.FundamentalsofSpeechRecognition[]..1993

1Jie ZHOU,Bi-cheng LI,Gang CHEN.Automatically building large-scale named entity recognition corpora from Chinese Wikipedia[J].Frontiers of Information Technology & Electronic Engineering,2015,16(11):940-956.
2Li Weigang Liu Ting Li Sheng.BOOTSTRAPPING FOR EXTRACTING RELATIONS FROM LARGE CORPORA[J].Journal of Electronics(China),2008,25(1):89-96. 被引量：5
3Shangyi WU.On Application of Computer-based Corpora in Translation[J].International Journal of Technology Management,2015(2):1-3.
4康建初,韩秋菊,尹宝林.基于晶格结构的单晶硅异向腐蚀的计算机模拟[J].计算机工程与应用,2001,37(18):141-143. 被引量：1
5Niladri Sekhar Dash.English Language Corpora as a Secondary ELT Resource for Indian Learners[J].Sino-US English Teaching,2013,10(1):10-22.
6牛刚,林晓梅,白昱,李琳娜.基于VTK的医学图像三维重建系统的设计与实现[J].长春工业大学学报,2005,26(1):42-44. 被引量：4
7Casale Salvatore,Russo Alessandra,Serrano Salvatore.Multi Corpora Robustness Analysis of Attributes Selection Applied to Speech Emotion Classification[J].通讯和计算机（中英文版）,2011,8(10):877-894.
8王延华,洪飞,吴恩华.基于VTK库的医学图像处理子系统设计和实现[J].计算机工程与应用,2003,39(8):205-207. 被引量：20
9瑞士科学家发明超级存储器[J].科学咨询,2016,0(52):2-2.
10肖忠华.基于语料库的语言对比与翻译[J].国际学术动态,2009(5):3-4. 被引量：4

Wuhan University Journal of Natural Sciences

2006年第2期

浏览历史

内容加载中请稍等...

A Novel Visualization Tool for Manual Annotation when Building Large Speech Corpora

参考文献4

相关作者

相关机构

相关主题

浏览历史