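Abstract: Individualized head-related transfer functions (HRTFs) are essential for virtual auditory technology. In practice, however, measuring an individualized HRTF for every subject is laborious. This paper therefore proposes an algorithm based on weighted elastic net regression that obtains the individualized HRTF magnitude response from the subject's anthropometric (physiological) parameters alone. First, using the subject data in a database, weights for the anthropometric parameters are computed from the correlation between those parameters and the HRTF magnitude. The weights are then incorporated into an elastic net regression, which contains both an L1-norm and an L2-norm term, to obtain the sparse coefficients of the new subject's anthropometric parameters. Finally, combining these sparse coefficients with the HRTF magnitudes in the database yields the new subject's individualized magnitude response. The results show that the method synthesizes individualized HRTF magnitudes well, with particularly high accuracy in the low and middle frequency bands.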
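The regression step described in the abstract above can be illustrated with a minimal sketch. It assumes one common formulation: the new subject's anthropometric vector is approximated as a sparse combination of the database subjects' vectors, with per-parameter weights in the squared-error term. The abstract does not give the objective, the solver, or the weight formula, so the function names, the coordinate-descent solver, and the way the weights enter are assumptions made for illustration only. The weights are taken as given here rather than derived from correlations.

```python
def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the L1 proximal step)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def weighted_elastic_net(db_params, new_params, weights, l1, l2, sweeps=200):
    """Coordinate descent for the assumed objective
        0.5 * sum_i w_i * (y_i - sum_j A_ij * x_j)^2
        + l1 * ||x||_1 + 0.5 * l2 * ||x||_2^2
    db_params[j][i]: parameter i of database subject j (A_ij)
    new_params[i]:   parameter i of the new subject    (y_i)
    weights[i]:      weight of parameter i             (w_i)
    Returns one coefficient x_j per database subject.
    """
    n_subjects = len(db_params)
    n_params = len(new_params)
    coef = [0.0] * n_subjects
    residual = list(new_params)  # equals y - A x, with x = 0 initially
    for _ in range(sweeps):
        for j in range(n_subjects):
            col = db_params[j]
            # Remove subject j's current contribution from the fit.
            for i in range(n_params):
                residual[i] += col[i] * coef[j]
            rho = sum(weights[i] * col[i] * residual[i] for i in range(n_params))
            z = sum(weights[i] * col[i] ** 2 for i in range(n_params))
            denom = z + l2
            coef[j] = soft_threshold(rho, l1) / denom if denom > 0 else 0.0
            # Put the updated contribution back.
            for i in range(n_params):
                residual[i] -= col[i] * coef[j]
    return coef


def synthesize_magnitude(coef, db_magnitudes):
    """Combine database magnitudes db_magnitudes[j][k] using the coefficients."""
    n_bins = len(db_magnitudes[0])
    return [sum(c * mags[k] for c, mags in zip(coef, db_magnitudes))
            for k in range(n_bins)]
```

A larger `l1` drives more coefficients to exactly zero, while `l2` shrinks the remaining ones. In practice the anthropometric parameters would normally be normalized before fitting.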
Funding: Supported by the National Natural Science Foundation of China (Grant No. 10374031)
Abstract: Based on measurements from 52 Chinese subjects (26 males and 26 females), a high-spatial-resolution head-related transfer function (HRTF) database with corresponding anthropometric parameters is established. Using the database, cues relating to sound source localization, including interaural time difference (ITD), interaural level difference (ILD), and spectral features introduced by the pinna, are analyzed. Moreover, the statistical relationship between ITD and anthropometric parameters is estimated. The mean values of maximum ITD are shown to differ significantly between males and females, and also between Chinese and Western subjects. The difference in ITD is due to the difference in individual anthropometric parameters. It is further shown that the spectral features introduced by the pinna depend strongly on the individual, and that at high frequencies (f ≥ 5.5 kHz) HRTFs are left-right asymmetric. This work is instructive and helpful for research on binaural hearing and for future virtual auditory applications.
Funding: Supported by the National Natural Science Foundation of China (50938003, 10774049) and the State Key Lab of Subtropical Building Science, South China University of Technology
Abstract: A head-related transfer function (HRTF) model for fast, real-time synthesis of multiple virtual sound sources is proposed. A head-related impulse response (HRIR, the time-domain version of the HRTF) is first decomposed by a two-level wavelet packet and then represented by a model composed of subband filters and reconstruction filters. The coefficients of the subband filters are the zero-interpolated wavelet coefficients of the HRIR. The coefficients of the reconstruction filters can be calculated from the wavelet function. The model is simplified by applying a threshold method to reduce the number of wavelet coefficients. The calculated results indicate that for a model with 30 wavelet coefficients, the error of the reconstructed HRIR is about 1%. The result of a psychoacoustic test shows that a model with 35 wavelet coefficients is perceptually indistinguishable from the original HRIR. When multiple virtual sound sources are synthesized simultaneously, the computational cost of the proposed model is much lower than that of traditional HRTF filters.
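The decomposition and thresholding steps in the abstract above can be sketched as follows. The abstract does not name the wavelet or define its error measure, so this example assumes the Haar wavelet and a relative energy error. Both choices, and all function names, are hypothetical. The subband and reconstruction filter structure of the actual model is not reproduced here.

```python
import math

_S = 1.0 / math.sqrt(2.0)  # orthonormal Haar scaling factor


def haar_split(x):
    """One Haar analysis step: return (approximation, detail) at half length."""
    half = len(x) // 2
    approx = [(x[2 * i] + x[2 * i + 1]) * _S for i in range(half)]
    detail = [(x[2 * i] - x[2 * i + 1]) * _S for i in range(half)]
    return approx, detail


def haar_merge(approx, detail):
    """Inverse of haar_split."""
    x = []
    for a, d in zip(approx, detail):
        x.extend(((a + d) * _S, (a - d) * _S))
    return x


def decompose(hrir):
    """Two-level wavelet packet: split both branches again, giving 4 subbands."""
    if len(hrir) % 4:
        raise ValueError("length must be a multiple of 4")
    approx, detail = haar_split(hrir)
    return [*haar_split(approx), *haar_split(detail)]  # aa, ad, da, dd


def reconstruct(bands):
    """Rebuild the signal from the four subbands."""
    aa, ad, da, dd = bands
    return haar_merge(haar_merge(aa, ad), haar_merge(da, dd))


def keep_largest(bands, n_keep):
    """Zero all but the n_keep largest-magnitude coefficients."""
    ranked = sorted(
        ((abs(c), b, i) for b, band in enumerate(bands) for i, c in enumerate(band)),
        reverse=True,
    )
    kept = {(b, i) for _, b, i in ranked[:n_keep]}
    return [[c if (b, i) in kept else 0.0 for i, c in enumerate(band)]
            for b, band in enumerate(bands)]


def relative_error(hrir, n_keep):
    """Energy of the reconstruction error divided by the energy of the HRIR."""
    approx = reconstruct(keep_largest(decompose(hrir), n_keep))
    num = sum((h - a) ** 2 for h, a in zip(hrir, approx))
    den = sum(h * h for h in hrir)
    return num / den
```

Calling `relative_error(hrir, n_keep)` for increasing `n_keep` shows how quickly the error falls as more coefficients are retained, which is the trade-off the abstract reports for 30 and 35 coefficients.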
Funding: Supported by the National Natural Science Foundation of China (No. 10774049)
Abstract: A method for correcting measured head-related transfer functions (HRTFs) at low frequencies is proposed. Analysis of the HRTFs of a spherical head model at low frequencies shows that below 400 Hz the HRTF magnitude is nearly constant and the phase is a linear function of frequency, in both the far field and the near field. Therefore, if the HRTFs above 400 Hz are accurately measured, the low-frequency HRTFs can be corrected using the theoretical model. Calculation results and a subjective experiment demonstrate the feasibility of the proposed method.
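The two properties stated in the abstract above (nearly constant magnitude and linear phase below 400 Hz) suggest a simple correction, sketched here. The abstract does not say how the constant or the phase slope is obtained, so this example assumes both are taken from the first reliable bins at and above the cutoff. The function name, the `fit_bins` parameter, and the anchoring at the cutoff bin are illustrative assumptions, not the paper's procedure.

```python
import cmath


def correct_low_frequency(freqs_hz, hrtf, cutoff_hz=400.0, fit_bins=4):
    """Replace HRTF bins below cutoff_hz with constant magnitude and linear phase.

    freqs_hz: ascending bin frequencies in Hz
    hrtf:     complex HRTF values at those frequencies
    fit_bins: number of bin intervals above the cutoff used for the phase slope
    """
    start = next((i for i, f in enumerate(freqs_hz) if f >= cutoff_hz), None)
    if start is None:
        raise ValueError("no bins at or above the cutoff")
    stop = min(start + fit_bins, len(freqs_hz) - 1)
    if stop <= start:
        raise ValueError("need at least two bins at or above the cutoff")

    # Phase slope (rad/Hz) from adjacent-bin phase differences, which avoids
    # explicit phase unwrapping as long as the bin spacing is small.
    slopes = []
    for k in range(start, stop):
        dphi = cmath.phase(hrtf[k + 1] * hrtf[k].conjugate())
        slopes.append(dphi / (freqs_hz[k + 1] - freqs_hz[k]))
    slope = sum(slopes) / len(slopes)

    # Anchor magnitude and phase at the first reliable bin for continuity.
    ref_f = freqs_hz[start]
    ref_mag = abs(hrtf[start])
    ref_phase = cmath.phase(hrtf[start])

    corrected = list(hrtf)
    for i in range(start):
        phase = ref_phase + slope * (freqs_hz[i] - ref_f)
        corrected[i] = cmath.rect(ref_mag, phase)
    return corrected
```

Bins at and above the cutoff are returned unchanged, so the correction only affects the range where the measurement is assumed unreliable.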
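Abstract: Accurate and efficient modeling of head-related transfer functions (HRTFs) plays a key role in the analysis of spatial hearing and in the generation of virtual auditory space. This paper applies a new genetic algorithm (GA) designed for multi-objective parameter optimization to approximate HRTFs with the common-acoustical-pole and zero (CAPZ) model. Experimental results show that the GA achieves better results than the improved Prony design method.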
Abstract: To approximate head-related transfer functions (HRTFs), this paper employs and compares three kinds of one-input neural network models, namely multilayer perceptron (MLP) networks, radial basis function (RBF) networks, and wavelet neural networks (WNN), in order to select the best network model for further HRTF approximation. Experimental results demonstrate that wavelet neural networks are more efficient and useful.
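Abstract: To address the low accuracy and high complexity of existing sound source localization algorithms in digital hearing aids, a new binaural sound source localization algorithm is proposed. First, the captured binaural signals are decomposed into a number of subband signals by a Gammatone filterbank, and the data are compressed according to their energy. Then the binaural cues contained in the head-related transfer function (HRTF), namely the interaural time difference, the interaural level difference, and the interaural coherence, are used to extract features of the source position. Finally, the source position is identified by a Gaussian mixture model (GMM) classifier. Experimental results show that the proposed algorithm offers high accuracy, low complexity, and strong robustness.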
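The three binaural cues named in the abstract above can be estimated from a pair of signals with a cross-correlation search, as in the minimal sketch below. It handles a single band only. The Gammatone filterbank, the energy-based compression, and the GMM classifier are omitted. The function name, the sign convention (positive time difference means the right signal lags the left), and the use of the normalized correlation peak as the coherence measure are assumptions for illustration.

```python
import math


def binaural_cues(left, right, fs, max_lag):
    """Return (itd_seconds, ild_db, coherence) for one pair of band signals.

    left, right: sample sequences from the two ears
    fs:          sampling rate in Hz
    max_lag:     largest lag, in samples, searched in either direction
    """
    energy_left = sum(v * v for v in left)
    energy_right = sum(v * v for v in right)
    if energy_left == 0 or energy_right == 0:
        raise ValueError("both signals must have non-zero energy")

    best_lag, best_corr = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        # Cross-correlation at this lag, summed over the overlapping samples.
        corr = sum(left[n] * right[n + lag]
                   for n in range(len(left))
                   if 0 <= n + lag < len(right))
        if corr > best_corr:
            best_corr, best_lag = corr, lag

    itd = best_lag / fs  # positive: right lags left (assumed convention)
    ild = 10.0 * math.log10(energy_left / energy_right)
    coherence = best_corr / math.sqrt(energy_left * energy_right)
    return itd, ild, coherence
```

In a full system of the kind the abstract describes, these values would be computed per subband and stacked into the feature vector passed to the classifier.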