基于CFCC-PCA的说话人辨识方法

Speaker Identification Based on CFCC-PCA

下载PDF

导出

摘要针对说话人训练和识别时间长、噪音环境下识别率低的问题,提出一种CFCC-PCA特征参数的说话人辨识方法。首先提取具有听觉特性的CFCC特征参数,然后对其进行PCA变换,找出具有分辨能力的参数,最后再用这些参数在云服务器中训练和识别说话人。实验表明:该方法可以提高说话人辨识的鲁棒性和识别率,云服务可提高系统实时性。 Training speaker system and speaker identification need a long time, and in the noise environment, the recognition rate is very low, A CFCC-PCA characteristic parameter method is proposed.Firstly, the acoustic characteristics of CFCC characteristic parameters are extracted.Then, CFCC-PCA parameters are extracted by PCA transformation of CFCC characteristic parameters.Finally the speaker models are trained and recognized in cloud.Experiments show that the CFCC-PCA characteristic parameters can improve the robustness and recognition rate of the speaker, the cloud services with efficient processing ability to improve system real-time performance.

作者刘雪燕李明袁宝玲

机构地区中山火炬职业技术学院信息工程系兰州理工大学计算机与通信学院

出处《成都工业学院学报》 2015年第2期32-34,共3页 Journal of Chengdu Technological University

基金中山市科技发展专项基金项目"基于云计算的生物身份认证技术研究及应用"(2013A3FC0350) 中山市科技发展专项基金项目"基于中山地貌的最优化无线网络模型研究"(2013A3FC0318)

关键词 CFCC-PCA 说话人辨识支持向量机云服务器 CFCC-PCA speaker identification Support Vector Machine（ SVM） cloud server

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1JAIN A K,HONG L,KULKARNI Y A. Muhimodal biometric sys-tem using fingerprints, face and speech [ C ]//2nd Int'l Confcreneeon Audio-and Video-based Biometric Person Authentication, Washington I). C', !999 -182 -187:.
2曹洁,余丽珍.改进的说话人聚类初始化和GMM的多说话人识别[J].计算机应用研究,2012,29(2):590-593. 被引量：6
3GARAU G, DIELMANN A, BOURLARD H. Audio-visual synchroni- sation for speaker diarisation [ C ]// Proc of International Conference on Speech and language Processing. Makuhari, Chiba: [ s n. ] ,2010: 2654 - 2657.
4LI Q, HUANG Y. An Auditory-based robust speaker identification under feature extraction algorithm for mismatched conditions [ J ]. Audio, Speech, and Language Processing, IEEE Transactions on, 2010,19(6) : 1791 -1801.
5TSAIW H, CHHEN S S, WANG H M. Automatic speaker clutering using a voice characteristic reference space and maximum purity estination[ J ]. IEEE Transactions on Audio Speech and Languager Processing,2013,15 (4) : 1461 - 1471.
6LIUM H,XIEY L,YAO Z Q,et al. A new hybrid GMM /SVM for speaker verification [ C ]// The 18th International Conference on Pattern Recognition, Hang Kong: IEEE Press,2006:314 - 317.
7ZHANG W F,YANG Y C,WU Z H,Exploition PCA classifiers to speaker recognition [C ]//Proceddings of the International Joint Conference on the Neural Networks Portland IEEE Press,2003 (1):820- 823.
8BURGES C L C. A tutorial on support vector machines for pattern recognition [ J ]. Data Mining and Knowledge Discovery, 1998,2 ( 2 ) : 121 - 167.
9GAO Y,JIN L W,HE C ,et al. Handwriting character recognition as a service: a new handwriting recognition system based on cloud Computing[ C ]//Document Analysis and Recognition ( ICDAR ), 2011 International Conference on ,2011:885 - 889.
10罗希,刘锦高.基于NIOS的ANN语音识别系统[J].计算机系统应用,2009,18(12):144-146. 被引量：3

二级参考文献13

1邓菁.电话信道下多说话人识别研究[D].北京:清华大学,2007.
2Lee L, Rose RC. Speaker normalization using efficient frequency warping procedures. IEEE Int. Conf. on Acoustics, Speech and Signal Processing. Atlanta. 1996.353 - 356.
3Rabiner L, Juang BH. Fundamentals of Speech Recognition. Prientice Hall PTR, 1993.11 - 54.
4WOOTERS C, HUIJBREGTS M. The ICSI RT07s speaker diarization system[ J]. Multimodal Technologies for Perception of Humans, 2008,4625:509-519.
5GARAU G,BOURLARD H. Using audio and visual cues for speaker diarisation initialization [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. [ S. 1. ] :IEEE Signal Pro- cessin~ Society,2010:4942-4945.
6HUNG H,HUANG Yan, FRIEDLAND G, et al. Estimating the dom- inant person in multi-party conversations using speaker diarization strategies [ C ]//Proc of International Conference on Acoustics, Speech and Signal Processing. [ S. 1. ] : IEEE Press,2008:2197-2200.
7FRIEDLAND G, HUNG H, YEO C. Multi-modal speaker diarization of real-world meetings using compressed-domain video features[ C ]/! Proc of International Conference on Audio, Speech and Signal Proces- sing. [ S. 1. ] :IEEE Press,2009:4069-4072.
8HUNG H, FRIEDLAND G. Towards audio-visual on-line diarization of participants in group meetings[ C ]//Proc of Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications. Mar- seille : European Conference on Computer Vision,2008 : 1-12.
9HUNG H, HUANG Yan, FRIEDLAND G, et al. Estimating domi- nance in multi-party meetings using speaker diarization [ J ]. IEEE Yrans on Audio, Speech and Language Processing, 2010, 19 (4) :84?-860.
10NOULAS A, ENGLEBIENNE G, KROSE B. Multi-modal speaker di- arisation[ J]. IEEE Trans on Pattern Analysis and Machine In- telligence,2011,34( 1 ) :79-93.

共引文献7

1艾佳琪,左毅,刘君霞,贺培超,李铁山,陈俊龙.基于余弦相似度的动态语音特征提取算法[J].计算机应用研究,2020,37(S02):147-149. 被引量：10
2朱玉颖,程强.一种语音信号端点检测法的FPGA实现[J].软件导刊,2010,9(5):194-195.
3孙玉,郭宝增.基于SoPC的孤立词语音识别系统的设计[J].微型机与应用,2012,31(2):74-76.
4曹洁,余丽珍.基于MFCC和运动强度聚类初始化的多说话人识别[J].计算机应用研究,2012,29(9):3295-3298. 被引量：10
5汪洋,甘涛,向军.广播电视新闻中的主持人跟踪系统[J].计算机系统应用,2014,23(10):40-45.
6雷磊,佘堃.基于小波倒谱系数和概率神经网络的取证说话人识别模型[J].计算机应用研究,2018,35(4):978-981. 被引量：3
7朱必松,毛启容,高利剑,沈雅馨.基于时间分段和重组聚类的说话人日志方法[J].计算机应用研究,2024,41(9):2649-2654.

1任伟,田文德,杜廷召.基于C-PCA方法的化工过程故障诊断研究[J].计算机与应用化学,2010,27(8):1042-1044. 被引量：4
2张娜,亢军贤,王峰,王翔,孙锋.基于模糊核聚类和SVM的说话人辨识[J].电脑知识与技术,2007(10):227-228. 被引量：1
3王欢良,韩纪庆,郑贵滨.基于K-L散度模型聚类的快速说话人辨识方法[J].模式识别与人工智能,2010,23(6):856-861. 被引量：5
4吕茂成,刘群芳.关于噪声环境下遗传算法的改进[J].通讯世界（下半月）,2016,0(1):148-148.
5曹辉,曹礼刚,简兴祥.基于神经网络融合的语音人脸身份识别方法[J].计算机工程,2007,33(11):184-186. 被引量：4
6刘雪燕,袁宝玲,张娜.基于双约简的GMM说话人辨识[J].电脑知识与技术,2008,0(12X):2902-2903.
7骆瑞玲,李明,李睿.改进的PSO在说话人辨识中的应用[J].计算机工程与应用,2010,46(2):135-137.
8陈才扣,杨静宇,杨健.一种融合PCA和KFDA的人脸识别方法[J].控制与决策,2004,19(10):1147-1150. 被引量：5
9李邵梅,刘力雄,陈鸿昶.实时说话人辨识系统中改进的DTW算法[J].计算机工程,2008,34(4):218-219. 被引量：20
10骆瑞玲,李明.基于MRSVM的说话人辨识方法[J].计算机工程与设计,2009,30(19):4483-4486.

成都工业学院学报

2015年第2期

浏览历史

内容加载中请稍等...

基于CFCC-PCA的说话人辨识方法

参考文献10

二级参考文献13

共引文献7

相关作者

相关机构

相关主题

浏览历史