Speaker identification based on complete feature corpus and evaluation of mutual information 被引量：1

Speaker identification based on complete feature corpus and evaluation of mutual information

导出

摘要 A speaker model called complete feature corpus (CFC) and an evaluation algorithm of mutual information (MIE) are proposed for text-independent speaker identification. The CFC model represents the speech and pronunciation characteristics of speaker by a feature vector corpus which was trained from some typical speech samples. It hires multi-step mini-max search matching scheme for MIE algorithm to evaluate the similarity of speech features between input speech and the models in distance and information space. Maximum mutual information (MMI) decision criterion is used to decide the identity of speaker. Experiments on performance analysis with comparison to GMM method show that proposed model and evaluation algorithm are quite effective and presented a higher performance than ordinary GMM method. A speaker model called complete feature corpus (CFC) and an evaluation algorithm of mutual information (MIE) are proposed for text-independent speaker identification. The CFC model represents the speech and pronunciation characteristics of speaker by a feature vector corpus which was trained from some typical speech samples. It hires multi-step mini-max search matching scheme for MIE algorithm to evaluate the similarity of speech features between input speech and the models in distance and information space. Maximum mutual information (MMI) decision criterion is used to decide the identity of speaker. Experiments on performance analysis with comparison to GMM method show that proposed model and evaluation algorithm are quite effective and presented a higher performance than ordinary GMM method.

作者 YUYibiao WANGShuozhong

机构地区 SchoolofElectronicInformationEngineering SchoolofCommunicationandInformationEngineering

出处《Chinese Journal of Acoustics》 2005年第3期280-288,共9页 声学学报（英文版）

基金 The work is supported by the University Natural Science Fund (04KJA510133) of Jiangsu Province.

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献2

1俞一彪,王朔中.基于互信息匹配模型的说话人识别[J].声学学报,2004,29(5):462-466. 被引量：8
2岳喜才,伍晓宇,郑崇勋.用神经阵列网络进行文本无关的说话人识别[J].声学学报,2000,25(3):230-234. 被引量：14

二级参考文献12

1俞一彪,赵鹤鸣,周旭东.运用互信息匹配及关键词分析的语音对话系统[J].小型微型计算机系统,2003,24(1):147-150. 被引量：4
2Farrell K R,Mammone R J,Assaleh K T.Speaker recognition using neural networks and conventional classifiers[].IEEE Transactions on Speech and Audio Processing.1994
3Reynolds D A,Rose R.Robust text-independent speakeridentification using Gaussian mixture speaker models[].IEEE Transactions on Speech and Audio Processing.1995
4O’Shaughnessy D.Speaker recognition[].IEEE ASSP Magazine.1986
5Atal B.Effectiveness of linear predictive characteristics ofthe speech wave for automatic speaker identification andverification[].The Journal of The Acoustical Society of America.1974
6Hush D R,Home B G.Progress in supervised neural networks[].IEEE Signal Processing Magazine.1993
7岳喜才,伍晓宇,郑崇勋.用神经阵列网络进行文本无关的说话人识别[J].声学学报,2000,25(3):230-234. 被引量：14
8俞一彪,赵鹤鸣,周旭东.语音识别浏览器VoiceIE设计与实现[J].数据采集与处理,2002,17(1):95-99. 被引量：6
9侯风雷,王炳锡.基于支持向量机的说话人辨认研究[J].通信学报,2002,23(6):61-67. 被引量：17
10俞一彪,赵鹤鸣,周旭东.语音信号互信息估计的非线性搜索算法及识别应用[J].信号处理,2002,18(2):102-106. 被引量：9

共引文献18

1岳喜才,叶大田,管桦.多分类问题的RBF二叉神经树网络方法[J].空军工程大学学报（自然科学版）,2000,1(1):34-39. 被引量：1
2俞一彪,王朔中.基于互信息匹配模型的说话人识别[J].声学学报,2004,29(5):462-466. 被引量：8
3芮贤义,俞一彪.基于小波变换的鲁棒型特征提取及说话人识别[J].电路与系统学报,2005,10(5):129-132. 被引量：7
4俞一彪,王朔中.文本无关说话人识别的全特征矢量集模型及互信息评估方法[J].声学学报,2005,30(6):536-541. 被引量：7
5包永强,赵力,邹采荣.采用归一化补偿变换的与文本无关的说话人识别[J].声学学报,2006,31(1):55-60. 被引量：13
6芮贤义,俞一彪.噪声环境下说话人识别的组合特征提取方法[J].信号处理,2006,22(5):673-677. 被引量：12
7王书诏,邱天爽.说话人识别研究综述[J].电声技术,2007,31(1):51-55. 被引量：9
8俞一彪,许允喜,芮贤义.一种语音特征参数子分量分析与有效性评价的新方法[J].信号处理,2007,23(2):188-191. 被引量：3
9张飞云,蔡子亮,盛胜我.噪声环境下说话人识别性能的研究[J].电声技术,2007,31(6):41-43.
10李邵梅,刘力雄,陈鸿昶.实时说话人辨识系统中改进的DTW算法[J].计算机工程,2008,34(4):218-219. 被引量：20

同被引文献6

1俞一彪,王朔中.文本无关说话人识别的全特征矢量集模型及互信息评估方法[J].声学学报,2005,30(6):536-541. 被引量：7
2YU Yibiao YUAN Dongmei XUE Feng.A non-linear frequency transform and its application to speaker recognition[J].Chinese Journal of Acoustics,2009,28(3):280-288. 被引量：1
3陈存宝,赵力,邹采荣.基于极大似然线性回归的模型合成和特征映射进行说话人确认[J].声学学报,2011,36(1):81-87. 被引量：2
4梁春燕,张翔,杨琳,张建平,颜永红.最小方差无失真响应感知倒谱系数在说话人识别中的应用[J].声学学报,2012,37(6):673-678. 被引量：4
5栗志意,张卫强,何亮,刘加.基于总体变化子空间自适应的i-vector说话人识别系统研究[J].自动化学报,2014,40(8):1836-1840. 被引量：17
6梁春燕,杨琳,周若华,颜永红.韵律特征在概率线性判别分析说话人确认中的应用[J].声学学报,2015,40(1):28-33. 被引量：6

引证文献1

1仲伟峰,方祥,范存航,温正棋,陶建华.深浅层特征及模型融合的说话人识别[J].声学学报,2018,43(2):263-272. 被引量：11

二级引证文献11

1曹毅,黄子龙,张威,刘晨,李巍.N-DenseNet的城市声音事件分类模型[J].西安电子科技大学学报,2019,46(6):9-16. 被引量：6
2曾春艳,马超峰,王志锋,朱栋梁,赵楠,王娟,刘聪.深度学习框架下说话人识别研究综述[J].计算机工程与应用,2020,56(7):8-16. 被引量：9
3盛永健,黄子龙,刘晨,曹毅,张洪.基于改进卷积神经网络的燃气调压器故障识别研究[J].现代制造工程,2021(4):132-138. 被引量：2
4张兴明,杨凯.深度学习说话人识别中语音特征参数提取研究[J].现代计算机,2021,27(8):3-7. 被引量：2
5罗春梅,张风雷.基于均值特征和改进深度神经网络的说话人识别算法[J].声学技术,2021,40(4):503-507. 被引量：2
6陈志高,赵庆卫,王丽,王文超.融合分布对齐和对抗学习的无监督跨域声纹识别[J].声学学报,2021,46(5):767-774.
7柴庆凤,史霖炎,梅珊,熊海涛,贺惠新.基于人工特征和机器特征融合的科技文献知识元抽取[J].数据分析与知识发现,2021,5(8):132-143. 被引量：10
8罗春梅.基于改进MFCC与RCNN的说话人识别算法[J].数学的实践与认识,2021,51(17):102-110. 被引量：6
9赵宏,岳鲁鹏,常兆斌,王伟杰.基于多特征I-Vector的说话人识别算法[J].兰州理工大学学报,2021,47(5):93-98. 被引量：1
10刘臣,倪仁倢,周立欣,侯昌佑.多声学特征融合的语音自动剪辑深度学习模型[J].小型微型计算机系统,2023,44(8):1713-1719.

1王香娥,王德贵.不采用CFC—应选择什么[J].电子工艺简讯,1992(5):8-10.
2吴波,陈军.清洗设备的选择[J].电子工艺简讯,1995(6):5-8.
3Zeng Yumin,Wu Zhenyang.COMBINATION OF PITCH SYNCHRONOUS ANALYSIS AND FISHER CRITERION FOR SPEAKER IDENTIFICATION[J].Journal of Electronics(China),2007,24(6):828-834.
4张侠魂.表面贴装与CFC清洗取代[J].有线通信技术,1994(2):46-48.
5范国君.埋墙式音箱[J].实用影音技术,2003(11):32-35.
6王彦,曹鹏,齐伟,费元春.软件无线电技术发展综述[J].测控技术,2004,23(z1):139-141.
7XU Longting,YANG Zhen,SUN Linhui.Simplification of I-Vector Extraction for Speaker Identification[J].Chinese Journal of Electronics,2016,25(6):1121-1126. 被引量：4
8郭俊梅.浅谈CDMA无线系统网管的应用[J].通信世界,2002(28):31-32.
9龚俊杰.加快替代CFC步伐促进清洗工艺的发展：电子工艺清洗技术研讨会纪要[J].电子工艺简讯,1993(11):19-20. 被引量：1
10GU Xiaojiang ZHAO Heming Lu Gang.Whispered speaker identification based on feature and model hybrid compensation[J].Chinese Journal of Acoustics,2012,31(4):499-508. 被引量：1

Chinese Journal of Acoustics

2005年第3期

浏览历史

内容加载中请稍等...