
Sound Source Localisation Algorithm Based on Retaining Partial Mirror Components in the Frequency Domain
Abstract Conventional steered response power with phase transform (SRP-PHAT) sound source localisation degrades readily in the presence of noise. To address this, we propose an improved algorithm that zero-pads in the frequency domain while retaining part of the mirror components. The algorithm first transforms the received signals to the frequency domain with the fast Fourier transform (FFT), then pads zeros at the high-frequency end up to 20 times the frame length while preserving part of the mirror components. From this padded spectrum, the cross-power spectral density of each microphone pair is computed, and its inverse FFT (IFFT) gives the generalised cross-correlation with phase transform weighting (GCC-PHAT) function. The retained mirror components broaden the signal spectrum, so the peak of the GCC-PHAT function becomes sharper. The SRP-PHAT function, which accumulates the GCC-PHAT functions over all microphone pairs, therefore has a sharper spatial spectrum peak, and localisation performance improves. Experiments show that, compared with the conventional algorithm, the proposed algorithm considerably raises the localisation success rate.
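The pipeline in the abstract can be sketched for a single microphone pair as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how many mirror bins are retained or where they sit in the padded spectrum, so the sketch assumes that bins 0..N/2 plus the first `n_mirror` mirror bins keep their original indices and that all later bins are zero. The names `gcc_phat_mirror`, `estimate_delay`, `factor` and `n_mirror` are hypothetical, and a naive DFT stands in for the FFT so that only the Python standard library is needed.

```python
import cmath
import math
import random


def dft(x):
    """Naive O(N^2) DFT; stands in for an FFT to avoid third-party imports."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def gcc_phat_mirror(x1, x2, factor=20, n_mirror=8):
    """GCC-PHAT on a lag grid `factor` times finer than the sampling grid.

    Assumption (not stated in the abstract): bins 0..N/2 plus the first
    `n_mirror` mirror bins stay at their original indices, and every later
    bin of the factor*N-point spectrum is zero.
    """
    n = len(x1)
    m = factor * n                               # padded spectrum length
    keep = min(n, n // 2 + 1 + n_mirror)         # number of non-zero bins
    s1, s2 = dft(x1), dft(x2)
    phat = []
    for k in range(keep):
        g = s1[k].conjugate() * s2[k]            # cross-power spectrum
        phat.append(g / (abs(g) + 1e-12))        # PHAT weighting: phase only
    # Inverse DFT of the padded spectrum; only the kept bins contribute.
    return [sum(p * cmath.exp(2j * math.pi * k * lag / m)
                for k, p in enumerate(phat)).real / m
            for lag in range(m)]


def estimate_delay(x1, x2, factor=20, n_mirror=8):
    """Delay of x2 relative to x1, in (possibly fractional) samples."""
    r = gcc_phat_mirror(x1, x2, factor, n_mirror)
    m = len(r)
    peak = max(range(m), key=r.__getitem__)
    if peak > m // 2:                            # upper half holds negative lags
        peak -= m
    return peak / factor


random.seed(0)
x1 = [random.gauss(0.0, 1.0) for _ in range(64)]
x2 = x1[-3:] + x1[:-3]                           # x1 circularly delayed by 3 samples
print(estimate_delay(x1, x2))
```

Under this assumption, raising `n_mirror` widens the band of non-zero bins, which narrows the main lobe of the correlation peak. An SRP-PHAT search would then sum such GCC-PHAT values, read at the lags implied by each candidate source position, over all microphone pairs.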
Authors Cai Weiping (蔡卫平); Liu Ruijuan (刘瑞娟); Zhou Lin (周琳) (School of Electrical Engineering, Jiujiang Vocational and Technical College, Jiujiang 332007, Jiangxi, China; School of Information Science and Engineering, Southeast University, Nanjing 210096, Jiangsu, China)
Source Computer Applications and Software (《计算机应用与软件》), CSCD, 2016, No. 6, pp. 325-328 (4 pages)
Funding National Natural Science Foundation of China, Young Scientists Fund (61201345)
Keywords phase transform; sound source localisation; mirror components