基于无监督迁移分量分析的语种识别

Language recognition based on unsupervised transfer component analysis

导出

摘要训练数据和测试数据之间由于信道等差异而引起的不匹配会严重影响语种识别的性能。而在实际应用中,通常只能获得少量的和测试数据匹配的标注数据(目标域数据),以及大量的和测试数据不匹配的标注数据(源域数据)。该文利用迁移学习的方法,通过无监督迁移分量分析(unsupervised transfer component analysis,UTCA),可以合理利用上述两种数据寻找到一个低维子空间,在该空间中,源数据和目标数据之间的分布差异最小,而且数据中有利于分类的属性得以保留,从而提高系统识别性能。实验表明:相对于基线系统,该算法对30s和10s语音的识别性能分别有24.7%和8%的提高。 Distribution mismatches between training and test datasets can greatly reduce the performance of language recognition systems.The mismatch is typically due to variability from changes in the channel and other factors.Real-world applications often have many training samples from other source domains but only a limited number of labeled training samples from the target domain.This study uses transfer learning to find a low-dimensional subspace through unsupervised transfer component analysis（UTCA）.This space minimizes the distribution mismatch between the source and target domain samples while preserving the good data properties.Tests show that the UTCA gives 24.7% and 8% relative improvement at 30 s and 10 s durations over the baseline system.

作者徐嘉明张卫强刘加夏善红

机构地区中国科学院大学中国科学院电子学研究所清华大学电子工程系

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2013年第6期800-803,共4页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金资助项目(61273268 61005019)

关键词语种识别迁移学习迁移分量分析 language recognition transfer learning transfer component analysis

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献12

1Pan S J, Yang Q. A survey on transfer learning [J]. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(10): 1345-1359.
2DUAN Lixin, Tsang I W, XU Dong. Domain transfer multiple kernel learning [J]. IEEE Transaction on Pattern Analysis and Machine Intelligence, 2012, 34(3) : 465 -479.
3Campbell W M, Sturim D E, Reynolds D A, et al. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation [C]//IEEE International Conference on Acoustics, Speech and Signal Processing. Toulouse, France: IEEE, 2006: 97-100.
4Daum~ H, Marcu D. Domain adaptation for statistical classifiers [J]. Journal of Artificial Intelligence Research, 2006, 26(1): 101-126.
5Han Y, Wu F, Zhuang Y, et al. Multi label transfer learning with sparse representation [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2010, 20(8): 1110- 1121.
6Xiang E W, Cao B, Hu D H, et al. Bridging domains using world wide knowledge for transfer learning [J]. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(6) : 770 - 783.
7Sun Z, Chen Y, Qi J, et al. Adaptive localization through transfer learning in indoor Wi-Fi environment [C]// 7th International Conference on Machine Learning and Applications, ICMLA'08. San Diego, CA, USA: IEEE,2008:331 -336.
8Daume H, Marcu D. Frustratingly easy domain adaptation [C]//Annual Meeting: Association for Computational Linguistics. Prague, Czech, 2007:256-263.
9Chen B, Lam W, Tsang I, et al. Extracting discriminative concepts for domain adaptation in text mining [C]//Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Paris, France: ACM, 2009:179-188.
10Pan S J, Tsang I W, Kwok J T, et al. Domain adaptation via transfer component analysis [J]. IEEE Transactions on Neural Networks, 2011, 22(2) : 199 - 210.

1刘顺兰,张鹏.次分量分析恒模盲多用户检测算法[J].杭州电子科技大学学报（自然科学版）,2011,31(4):69-72.
2王淑艳,吴仁彪,石庆研.MCA-CMA次分量分析恒模算法[J].数据采集与处理,2008,23(3):270-272. 被引量：2
3周伟,杜玉晓,杨其宇,张育俊,曾浩.FPGA跨时钟域亚稳态研究[J].电子世界,2012(3):87-89. 被引量：12
4刘传辉.现代谱估计方法分析[J].科学时代,2010(5):192-193.
5陈骏龙,刘亚洲,唐晓晴.大数据环境下基于迁移学习的人体检测性能提升方法[J].现代电子技术,2015,38(14):1-5. 被引量：1
6Liu Hanyu (Dept of Electrical and Computer Engineering University of Manitoba Winnipeg, MB,Canada R3T 2N2) Tong Wen E I Plotkin (Dept of Electrical and Computer Engineering Concordia University Montreal, QUE, Canada H3G 1M8).A　Blind　Equalizer　Based　on　Unsupervised　Gaussian　Cluster　Formation　with　an　Adaptive　Non-Linearit[J].通信学报,1997,18(3):19-26.
7于晓平.彩电电视机谐波分量分析[J].电视技术,1999,23(8):39-40. 被引量：5
8杨青山,蔡敏.基于多时钟域的异步FIFO设计[J].中国集成电路,2007,16(9):36-39. 被引量：7
9John Donovan.低功耗模拟设计时代来临[J].中国电子商情,2014(12):27-29.
10吕乾坤,高勇.基于稀疏约束的PLCA和谱掩蔽的语音增强算法[J].电声技术,2014,38(12):50-54. 被引量：1

清华大学学报（自然科学版）

2013年第6期

浏览历史

内容加载中请稍等...

基于无监督迁移分量分析的语种识别

参考文献12

相关作者

相关机构

相关主题

浏览历史