期刊文献+

基于无监督迁移分量分析的语种识别

Language recognition based on unsupervised transfer component analysis
原文传递
导出
摘要 训练数据和测试数据之间由于信道等差异而引起的不匹配会严重影响语种识别的性能。而在实际应用中,通常只能获得少量的和测试数据匹配的标注数据(目标域数据),以及大量的和测试数据不匹配的标注数据(源域数据)。该文利用迁移学习的方法,通过无监督迁移分量分析(unsupervised transfer component analysis,UTCA),可以合理利用上述两种数据寻找到一个低维子空间,在该空间中,源数据和目标数据之间的分布差异最小,而且数据中有利于分类的属性得以保留,从而提高系统识别性能。实验表明:相对于基线系统,该算法对30s和10s语音的识别性能分别有24.7%和8%的提高。 Distribution mismatches between training and test datasets can greatly reduce the performance of language recognition systems.The mismatch is typically due to variability from changes in the channel and other factors.Real-world applications often have many training samples from other source domains but only a limited number of labeled training samples from the target domain.This study uses transfer learning to find a low-dimensional subspace through unsupervised transfer component analysis(UTCA).This space minimizes the distribution mismatch between the source and target domain samples while preserving the good data properties.Tests show that the UTCA gives 24.7% and 8% relative improvement at 30 s and 10 s durations over the baseline system.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2013年第6期800-803,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目(61273268 61005019)
关键词 语种识别 迁移学习 迁移分量分析 language recognition transfer learning transfer component analysis
  • 相关文献

参考文献12

  • 1Pan S J, Yang Q. A survey on transfer learning [J]. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(10): 1345-1359.
  • 2DUAN Lixin, Tsang I W, XU Dong. Domain transfer multiple kernel learning [J]. IEEE Transaction on Pattern Analysis and Machine Intelligence, 2012, 34(3) : 465 -479.
  • 3Campbell W M, Sturim D E, Reynolds D A, et al. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation [C]//IEEE International Conference on Acoustics, Speech and Signal Processing. Toulouse, France: IEEE, 2006: 97-100.
  • 4Daum~ H, Marcu D. Domain adaptation for statistical classifiers [J]. Journal of Artificial Intelligence Research, 2006, 26(1): 101-126.
  • 5Han Y, Wu F, Zhuang Y, et al. Multi label transfer learning with sparse representation [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2010, 20(8): 1110- 1121.
  • 6Xiang E W, Cao B, Hu D H, et al. Bridging domains using world wide knowledge for transfer learning [J]. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(6) : 770 - 783.
  • 7Sun Z, Chen Y, Qi J, et al. Adaptive localization through transfer learning in indoor Wi-Fi environment [C]// 7th International Conference on Machine Learning and Applications, ICMLA'08. San Diego, CA, USA: IEEE,2008:331 -336.
  • 8Daume H, Marcu D. Frustratingly easy domain adaptation [C]//Annual Meeting: Association for Computational Linguistics. Prague, Czech, 2007:256-263.
  • 9Chen B, Lam W, Tsang I, et al. Extracting discriminative concepts for domain adaptation in text mining [C]//Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Paris, France: ACM, 2009:179-188.
  • 10Pan S J, Tsang I W, Kwok J T, et al. Domain adaptation via transfer component analysis [J]. IEEE Transactions on Neural Networks, 2011, 22(2) : 199 - 210.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部