Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition

下载PDF

导出

摘要 As one of Chinese minority languages,Tibetan speech recognition technology was not researched upon as extensively as Chinese and English were until recently.This,along with the relatively small Tibetan corpus,has resulted in an unsatisfying performance of Tibetan speech recognition based on an end-to-end model.This paper aims to achieve an accurate Tibetan speech recognition using a small amount of Tibetan training data.We demonstrate effective methods of Tibetan end-to-end speech recognition via cross-language transfer learning from three aspects:modeling unit selection,transfer learning method,and source language selection.Experimental results show that the Chinese-Tibetan multi-language learning method using multilanguage character set as the modeling unit yields the best performance on Tibetan Character Error Rate(CER)at 27.3%,which is reduced by 26.1%compared to the language-specific model.And our method also achieves the 2.2%higher accuracy using less amount of data compared with the method using Tibetan multi-dialect transfer learning under the same model structure and data set.

作者 Zhijie Wang Yue Zhao Licheng Wu Xiaojun Bi Zhuoma Dawa Qiang Ji

机构地区 School of Information Engineering School of Chinese Ethnic Minority Languages and Literatures Department of Electrical

出处《Computers, Materials & Continua》 SCIE EI 2022年第10期629-639,共11页 计算机、材料和连续体（英文）

基金 This work was supported by three projects.Zhao Y received the Grant with Nos.61976236 and 2020MDJC06 Bi X J received the Grant with No.20&ZD279.

关键词 Cross-language transfer learning low-resource language modeling unit Tibetan speech recognition

分类号 D67 [政治法律—中外政治制度]

引文网络
相关文献

参考文献4

1Muhammad Zia Ur Rehman,Fawad Ahmed,Muhammad Attique Khan,Usman Tariq,Sajjad Shaukat Jamal,Jawad Ahmad,Iqtadar Hussain.Classification of Citrus Plant Diseases Using Deep Transfer Learning[J].Computers, Materials & Continua,2022(1):1401-1417. 被引量：3
2Ahmed Reda,Sherif Barakat,Amira Rezk.A Transfer Learning-Enabled Optimized Extreme Deep Learning Paradigm for Diagnosis of COVID-19[J].Computers, Materials & Continua,2022(1):1381-1399. 被引量：1
3Zhangjie Fu,Yongjie Ding,Musaazi Godfrey.An LSTM-Based Malware Detection Using Transfer Learning[J].Journal of Cyber Security,2021,3(1):11-28. 被引量：1
4庄福振,罗平,何清,史忠植.迁移学习研究进展[J].软件学报,2015,26(1):26-39. 被引量：465

二级参考文献88

1Ben-David S,Blitzer J,Crammer K,Pereira F.Analysis of representations for domain adaptation.In:Platt JC,Koller D,Singer Y,Roweis ST,eds.Proc.of the Advances in Neural Information Processing Systems 19.Cambridge:MIT Press,2007.137-144.
2Blitzer J,McDonald R,Pereira F.Domain adaptation with structural correspondence learning.In:Jurafsky D,Gaussier E,eds.Proc.of the Int’l Conf.on Empirical Methods in Natural Language Processing.Stroudsburg PA:ACL,2006.120-128.
3Dai WY,Xue GR,Yang Q,Yu Y.Co-Clustering based classification for out-of-domain documents.In:Proc.of the 13th ACM Int’l Conf.on Knowledge Discovery and Data Mining.New York:ACM Press,2007.210-219.[doi:10.1145/1281192.1281218].
4Dai WY,Xue GR,Yang Q,Yu Y.Transferring naive Bayes classifiers for text classification.In:Proc.of the 22nd Conf.on Artificial Intelligence.AAAI Press,2007.540-545.
5Liao XJ,Xue Y,Carin L.Logistic regression with an auxiliary data source.In:Proc.of the 22nd lnt*I Conf.on Machine Learning.San Francisco:Morgan Kaufmann Publishers,2005.505-512.[doi:10.1145/1102351.1102415].
6Xing DK,Dai WY,Xue GR,Yu Y.Bridged refinement for transfer learning.In:Proc.of the Ilth European Conf.on Practice of Knowledge Discovery in Databases.Berlin:Springer-Verlag,2007.324-335.[doi:10.1007/978-3-540-74976-9_31].
7Mahmud MMH.On universal transfer learning.In:Proc.of the 18th Int’l Conf.on Algorithmic Learning Theory.Sendai,2007.135-149.[doi:10,1007/978-3-540-75225-7_14].
8Samarth S,Sylvian R.Cross domain knowledge transfer using structured representations.In:Proc.of the 21st Conf.on Artificial Intelligence.AAAI Press,2006.506-511.
9Bel N,Koster CHA,Villegas M.Cross-Lingual text categorization.In:Proc.of the European Conf.on Digital Libraries.Berlin:Springer-Verlag,2003.126-139.[doi:10.1007/978-3-540-45175-4_13].
10Zhai CX,Velivelli A,Yu B.A cross-collection mixture model for comparative text mining.In:Proc.of the 10th ACM SIGKDD Int’l Conf.on Knowledge Discovery and Data Mining.New York:ACM,2004.743-748.[doi:10.1145/1014052.1014150].

共引文献466

1康文杰,田苗,林岚,孙珅,吴水才.深度卷积生成对抗网络对神经影像通用数据特征的学习[J].智慧健康,2020(31):1-4. 被引量：2
2张政,严哲,顾汉明.基于残差网络与迁移学习的断层自动识别[J].石油地球物理勘探,2020(5):950-956. 被引量：23
3陈曙,叶俊民,刘童.一种基于领域适配的跨项目软件缺陷预测方法[J].软件学报,2020,31(2):266-281. 被引量：15
4吴锐帆,代海洋,杨坦,江颖,蔡志杰.直肠癌淋巴结转移的智能诊断研究[J].数学建模及其应用,2019,8(4):30-37. 被引量：2
5刘世晶,刘阳春,钱程,郑浩君,周捷,张成林.基于CycleGAN和注意力增强迁移学习的小样本鱼类识别[J].农业机械学报,2023,54(S01):296-302. 被引量：3
6张璐,黄琳,李备备,陈鑫,段青玲.基于多尺度融合与无锚点YOLO v3的鱼群计数方法[J].农业机械学报,2021,52(S01):237-244. 被引量：16
7张红洋,田瑞盟.基于SOLO分类理论的科学思维学业质量评价[J].湖南中学物理,2021(2):1-4. 被引量：1
8林峰,郭鹏,刘旭斌.基于叶片表面污垢预处理与CNN的风电机组叶片表面损伤识别[J].动力工程学报,2020(12):975-981. 被引量：5
9颜宏文,陈金鑫.基于改进YOLOv3的绝缘子串定位与状态识别方法[J].高电压技术,2020,46(2):423-432. 被引量：76
10何卫东,申佳红.基于SLE学习评价系统的深度学习初探[J].教育科学论坛,2020(22):75-77.

1曹明伦.论译者的学者意识[J].中国翻译,2022,43(1):175-179. 被引量：4
2Mieradilijiang Maimaiti,Yang Liu,Huanbo Luan,Maosong Sun.Enriching the Transfer Learning with Pre-Trained Lexicon Embedding for Low-Resource Neural Machine Translation[J].Tsinghua Science and Technology,2022,27(1):150-163. 被引量：5
3Hong Geun Ji,Soyoung Oh,Jina Kim,Seong Choi,Eunil Park.Integrating Deep Learning and Machine Translation for Understanding Unrefined Languages[J].Computers, Materials & Continua,2022(1):669-678.
4Ruigang Liang,Ying Cao,Peiwei Hu,Kai Chen.Neutron:an attention-based neural decompiler[J].Cybersecurity,2021,4(1):54-66.
5Shao-Hui Wang,Jing Qin,Xian-Li Meng,Yi Zhang.Research hotspots and trends in Chinese minority traditional medicine during 2021:a visual bibliometrics analysis[J].Traditional Medicine Research,2022,7(3):113-123.
6Yujie Wang,Kaiquan Li,Zonghai Chen.Battery Full Life Cycle Management and Health Prognosis Based on Cloud Service and Broad Learning[J].IEEE/CAA Journal of Automatica Sinica,2022,9(8):1540-1542. 被引量：2
7Yang Xu,Boming Xia,Yueliang Wan,Fan Zhang,Jiabo Xu,Huansheng Ning.CDCAT: A Multi-Language Cross-Document Entity and Event Coreference Annotation Tool[J].Tsinghua Science and Technology,2022,27(3):589-598.
8Muhammad Shahid Bhatti,Azmat Ullah,Rohaya Latip,Abid Sohail,Anum Riaz,Rohail Hassan.Benchmarking Performance of Document Level Classification and Topic Modeling[J].Computers, Materials & Continua,2022(4):125-141. 被引量：1
9Xiaodong Yan,Yiqin Wang,Wei Song,Xiaobing Zhao,A.Run,Yang Yanxing.Unsupervised Graph-Based Tibetan Multi-Document Summarization[J].Computers, Materials & Continua,2022(10):1769-1781.
10Yue Ming,Nannan Hu,Chunxiao Fan,Fan Feng,Jiangwan Zhou,Hui Yu.Visuals to Text:A Comprehensive Review on Automatic Image Captioning[J].IEEE/CAA Journal of Automatica Sinica,2022,9(8):1339-1365. 被引量：4

Computers, Materials & Continua

2022年第10期

浏览历史

内容加载中请稍等...

Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition

参考文献4

二级参考文献88

共引文献466

相关作者

相关机构

相关主题

浏览历史