A new interpolation algorithm for Head-Related Transfer Function (HRTF) is proposed to realize 3D sound reproduction via headphones in arbitrary spatial direction. HRTFs are modeled as a weighted sum of spherical ha...A new interpolation algorithm for Head-Related Transfer Function (HRTF) is proposed to realize 3D sound reproduction via headphones in arbitrary spatial direction. HRTFs are modeled as a weighted sum of spherical harmonics on a spherical surface. Truncated Singular Value Decomposition (SVD) is adopted to calculate the weights of the model. The truncation number is chosen according to Frobenius norm ratio and the partial condition number. Compared with other interpolated methods, our proposed approach not only is continuous but exploits global information of available directions. The HRTF from any desired direction can be and interpolated results demonstrate that our obtained more accurately and robustly. Reconstructed proposed algorithm acquired better performance.展开更多
To accelerate head-related transfer functions(HRTFs)measurement,two or more independent sound sources are usually employed in the measurement system.However,the multiple scattering between adjacent sound sources may i...To accelerate head-related transfer functions(HRTFs)measurement,two or more independent sound sources are usually employed in the measurement system.However,the multiple scattering between adjacent sound sources may influence the accuracy of measurement.On the other hand,the directivity of sound source could induce measurement error.Therefore,a model consisting of two spherical sound sources with approximate omni-directivity and a rigid-spherical head is proposed to evaluate the errors in HRTF measurement caused by multiple scattering between sources.An example of analysis using multipole re-expansion indicates that the error of ipsilateral HRTFs are within the bound of±1.0 dB below a frequency of 20 kHz,provided that the sound source radius does not exceed 0.025 m,the source distance relative to head center is not less than 0.5 m,and the angular interval between two adjacent sources is not less than 20 degrees.Similar conclusions under different conditions can also be analyzed and discussed by using this calculation method.Furthermore,the results are verified by measurements of HRTFs for a rigid sphere and a KEMAR artificial head.展开更多
Based on the measurements from 52 Chinese subjects (26 males and 26 females), a high-spatial-resolution head-related transfer function (HRTF) database with corre- sponding anthropometric parameters is established. By ...Based on the measurements from 52 Chinese subjects (26 males and 26 females), a high-spatial-resolution head-related transfer function (HRTF) database with corre- sponding anthropometric parameters is established. By using the database, cues relating to sound source localization, including interaural time difference (ITD), interaural level difference (ILD), and spectral features introduced by pinna, are analyzed. Moreover, the statistical relationship between ITD and anthropometric parameters is estimated. It is proved that the mean values of maximum ITD for male and female are significantly different, so are those for Chinese and western sub- jects. The difference in ITD is due to the difference in individual anthropometric parameters. It is further proved that the spectral features introduced by pinna strongly depend on individual; while at high frequencies (f≥ 5.5 kHz), HRTFs are left-right asymmetric. This work is instructive and helpful for the research on bin- aural hearing and applications on virtual auditory in future.展开更多
In order to approach to head related transfer functions (HRTFs), this paper employs and compares three kinds of one input neural network models, namely, multi layer perceptron (MLP) networks, radial basis function ...In order to approach to head related transfer functions (HRTFs), this paper employs and compares three kinds of one input neural network models, namely, multi layer perceptron (MLP) networks, radial basis function (RBF) networks and wavelet neural networks (WNN) so as to select the best network model for further HRTFs approximation. Experimental results demonstrate that wavelet neural networks are more efficient and useful.展开更多
针对数字助听器中现存声源定位算法精确度低和算法复杂度高的问题,提出一种新的双耳声源定位算法.首先,采集到的双耳声源信号通过Gammatone滤波器分解为若干个子带信号,根据能量的大小对数据进行压缩.然后,利用头相关传递函数(head-rela...针对数字助听器中现存声源定位算法精确度低和算法复杂度高的问题,提出一种新的双耳声源定位算法.首先,采集到的双耳声源信号通过Gammatone滤波器分解为若干个子带信号,根据能量的大小对数据进行压缩.然后,利用头相关传递函数(head-related transfer function,HRTF)中包含的双耳线索,即双耳时间差、双耳声级差及耳间相关性,提取声源位置的特征.最后,声源的位置信息由高斯混合模型(Gaussian mixture model,GMM)分类器识别.实验结果表明,建议的算法具有高精确度、低复杂度及强鲁棒性.展开更多
A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decompose...A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decomposed by a two-level wavelet packet and then represented by a model composed of subband filters and reconstruction filters. The coefficients of the subband filters are the zero interpolation of the wavelet coefficients of the HRIR. The coefficients of the reconstruction filters can be calculated from the wavelet function. The model is simplified by applying a threshold method to reduce the wavelet coefficients. The calculated results indicate that for a model with 30 wavelet coefficients, the error of reconstructed HRIR is about 1%. And the result of a psychoacoustic test shows that a model with 35 wavelet coefficients is perceptually indistinguishable from the original HRIR. When multiple virtual sound sources are synthesized simultaneously, the computational cost of the proposed model is much less than the traditional HRTF filters.展开更多
A method to correct the measured head-related transfer functions (HRTFs) at low frequency was proposed. By analyzing the HRTFs from the spherical head model at low frequency, it is proved that below the frequency of...A method to correct the measured head-related transfer functions (HRTFs) at low frequency was proposed. By analyzing the HRTFs from the spherical head model at low frequency, it is proved that below the frequency of 400 Hz, magnitude of HRTF is nearly constant and the phase is a linear function of frequency both for the far and near field. Therefore, if the HRTFs above 400 Hz are accurately measured by experiment, it is able to correct the HRTFs at low frequency by the theoretical model. The results of calculation and subjective experiment show that the feasibility of the proposed method.展开更多
Near-field head-related transfer functions (HRTFs) are essential to scientific re- searches of binaural hearing and practical applications of virtual auditory display. High ef- ficiency, accuracy and repeatability a...Near-field head-related transfer functions (HRTFs) are essential to scientific re- searches of binaural hearing and practical applications of virtual auditory display. High ef- ficiency, accuracy and repeatability are required in a near-field HRTF measurement. Hence, there is no reference which intents on solving the measuring difficulties of near-field HRTF for human subjects. In present work, an efficient near-field HRTF measurement system based on computer control is designed and implemented, and a fast calibration method for the system is proposed to first solve the measurement of near-field HRTF for human subjects. The efficiency of measurement is enhanced by a comprehensive design on the acoustic, electronic and mechanical parts of the system. And the accuracy and repeatability of the measurement are greatly im- proved by carefully calibrating the positions of sound source, subject and binaural microphones. This system is suitable for near-field HRTF measurement at various source distances within 1.0 m, for both human subject and artificial head. The time costs of HRTF measurement at a single sound source distance and full directions has been reduced to less than 20 minutes. The measurement results indicate that the accuracy of the system satisfies the actual requirements. The system is applicable to scientific research and can be used to establish an individualized near-field HRTF database for human subjects.展开更多
本文设计实现了一个深度神经网络模型,根据人体生理参数及角度信息重建个性化头相关传递函数(Head Related Transfer Function,HRTF),仅需一次训练即可得到全部方向的预测HRTFs。网络模型由将人体测量参数作为输入的深度神经网络、将角...本文设计实现了一个深度神经网络模型,根据人体生理参数及角度信息重建个性化头相关传递函数(Head Related Transfer Function,HRTF),仅需一次训练即可得到全部方向的预测HRTFs。网络模型由将人体测量参数作为输入的深度神经网络、将角度信息作为输入的展开层以及将前两者的输出作为输入的深度神经网络组成。最后对所提出方法的整体性能进行了客观评价。展开更多
三维(Three-dimension,3D)多媒体技术,尤其是和3D视频相比有所差距的3D音频技术受到了广泛的关注。当前三维音频技术研究可分为基于物理声场重建的多声道音频技术和基于感知的声音场景重建的多声道音频技术两大类。物理声场重建技术的...三维(Three-dimension,3D)多媒体技术,尤其是和3D视频相比有所差距的3D音频技术受到了广泛的关注。当前三维音频技术研究可分为基于物理声场重建的多声道音频技术和基于感知的声音场景重建的多声道音频技术两大类。物理声场重建技术的重要代表是基于球谐分解的声重放技术和波场合成技术(Wave field synthesis,WFS),基于感知的声音场景重建技术主要包括幅度平移技术(Amplitude panning,AP)和基于头相关传输函数的双耳重建技术(Head related transfer function,HRTF)。本文对上述4类三维音频技术及其对应的典型系统进行了介绍及对比分析,并对三维音频技术当前3大主要研究热点:空间听觉机制、三维音频压缩编码以及三维音频系统精简的现状与前沿技术进行了介绍。展开更多
基金Supported by Shanghai Natural Science Foundation, Shanghai Leading Academic Discipline Project, and STCSM of China (No. 08ZR1408300, S30108, and 08DZ2231100)
文摘A new interpolation algorithm for Head-Related Transfer Function (HRTF) is proposed to realize 3D sound reproduction via headphones in arbitrary spatial direction. HRTFs are modeled as a weighted sum of spherical harmonics on a spherical surface. Truncated Singular Value Decomposition (SVD) is adopted to calculate the weights of the model. The truncation number is chosen according to Frobenius norm ratio and the partial condition number. Compared with other interpolated methods, our proposed approach not only is continuous but exploits global information of available directions. The HRTF from any desired direction can be and interpolated results demonstrate that our obtained more accurately and robustly. Reconstructed proposed algorithm acquired better performance.
基金funded by National Natural Science Foundation of China(No.11574090)Natural Science Foundation of Guangdong Province(No.2018B030311025).
文摘To accelerate head-related transfer functions(HRTFs)measurement,two or more independent sound sources are usually employed in the measurement system.However,the multiple scattering between adjacent sound sources may influence the accuracy of measurement.On the other hand,the directivity of sound source could induce measurement error.Therefore,a model consisting of two spherical sound sources with approximate omni-directivity and a rigid-spherical head is proposed to evaluate the errors in HRTF measurement caused by multiple scattering between sources.An example of analysis using multipole re-expansion indicates that the error of ipsilateral HRTFs are within the bound of±1.0 dB below a frequency of 20 kHz,provided that the sound source radius does not exceed 0.025 m,the source distance relative to head center is not less than 0.5 m,and the angular interval between two adjacent sources is not less than 20 degrees.Similar conclusions under different conditions can also be analyzed and discussed by using this calculation method.Furthermore,the results are verified by measurements of HRTFs for a rigid sphere and a KEMAR artificial head.
基金Supported by the National Natural Science Foundation of China (Grant No. 10374031)
文摘Based on the measurements from 52 Chinese subjects (26 males and 26 females), a high-spatial-resolution head-related transfer function (HRTF) database with corre- sponding anthropometric parameters is established. By using the database, cues relating to sound source localization, including interaural time difference (ITD), interaural level difference (ILD), and spectral features introduced by pinna, are analyzed. Moreover, the statistical relationship between ITD and anthropometric parameters is estimated. It is proved that the mean values of maximum ITD for male and female are significantly different, so are those for Chinese and western sub- jects. The difference in ITD is due to the difference in individual anthropometric parameters. It is further proved that the spectral features introduced by pinna strongly depend on individual; while at high frequencies (f≥ 5.5 kHz), HRTFs are left-right asymmetric. This work is instructive and helpful for the research on bin- aural hearing and applications on virtual auditory in future.
文摘In order to approach to head related transfer functions (HRTFs), this paper employs and compares three kinds of one input neural network models, namely, multi layer perceptron (MLP) networks, radial basis function (RBF) networks and wavelet neural networks (WNN) so as to select the best network model for further HRTFs approximation. Experimental results demonstrate that wavelet neural networks are more efficient and useful.
文摘针对数字助听器中现存声源定位算法精确度低和算法复杂度高的问题,提出一种新的双耳声源定位算法.首先,采集到的双耳声源信号通过Gammatone滤波器分解为若干个子带信号,根据能量的大小对数据进行压缩.然后,利用头相关传递函数(head-related transfer function,HRTF)中包含的双耳线索,即双耳时间差、双耳声级差及耳间相关性,提取声源位置的特征.最后,声源的位置信息由高斯混合模型(Gaussian mixture model,GMM)分类器识别.实验结果表明,建议的算法具有高精确度、低复杂度及强鲁棒性.
基金supported by the National Nature Science Fund of China(50938003,10774049)State Key Lab of Subtropical Building Science,South China University of Technology
文摘A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decomposed by a two-level wavelet packet and then represented by a model composed of subband filters and reconstruction filters. The coefficients of the subband filters are the zero interpolation of the wavelet coefficients of the HRIR. The coefficients of the reconstruction filters can be calculated from the wavelet function. The model is simplified by applying a threshold method to reduce the wavelet coefficients. The calculated results indicate that for a model with 30 wavelet coefficients, the error of reconstructed HRIR is about 1%. And the result of a psychoacoustic test shows that a model with 35 wavelet coefficients is perceptually indistinguishable from the original HRIR. When multiple virtual sound sources are synthesized simultaneously, the computational cost of the proposed model is much less than the traditional HRTF filters.
基金supported by the National Natural Science Foundation of China(No.10774049)
文摘A method to correct the measured head-related transfer functions (HRTFs) at low frequency was proposed. By analyzing the HRTFs from the spherical head model at low frequency, it is proved that below the frequency of 400 Hz, magnitude of HRTF is nearly constant and the phase is a linear function of frequency both for the far and near field. Therefore, if the HRTFs above 400 Hz are accurately measured by experiment, it is able to correct the HRTFs at low frequency by the theoretical model. The results of calculation and subjective experiment show that the feasibility of the proposed method.
基金supported by the National Natural Science Foundation of China(11104082,11574090)Fundamental Research Funds for the Central Universities of South China University of Technology(2015ZZ135)
文摘Near-field head-related transfer functions (HRTFs) are essential to scientific re- searches of binaural hearing and practical applications of virtual auditory display. High ef- ficiency, accuracy and repeatability are required in a near-field HRTF measurement. Hence, there is no reference which intents on solving the measuring difficulties of near-field HRTF for human subjects. In present work, an efficient near-field HRTF measurement system based on computer control is designed and implemented, and a fast calibration method for the system is proposed to first solve the measurement of near-field HRTF for human subjects. The efficiency of measurement is enhanced by a comprehensive design on the acoustic, electronic and mechanical parts of the system. And the accuracy and repeatability of the measurement are greatly im- proved by carefully calibrating the positions of sound source, subject and binaural microphones. This system is suitable for near-field HRTF measurement at various source distances within 1.0 m, for both human subject and artificial head. The time costs of HRTF measurement at a single sound source distance and full directions has been reduced to less than 20 minutes. The measurement results indicate that the accuracy of the system satisfies the actual requirements. The system is applicable to scientific research and can be used to establish an individualized near-field HRTF database for human subjects.
文摘本文设计实现了一个深度神经网络模型,根据人体生理参数及角度信息重建个性化头相关传递函数(Head Related Transfer Function,HRTF),仅需一次训练即可得到全部方向的预测HRTFs。网络模型由将人体测量参数作为输入的深度神经网络、将角度信息作为输入的展开层以及将前两者的输出作为输入的深度神经网络组成。最后对所提出方法的整体性能进行了客观评价。
文摘三维(Three-dimension,3D)多媒体技术,尤其是和3D视频相比有所差距的3D音频技术受到了广泛的关注。当前三维音频技术研究可分为基于物理声场重建的多声道音频技术和基于感知的声音场景重建的多声道音频技术两大类。物理声场重建技术的重要代表是基于球谐分解的声重放技术和波场合成技术(Wave field synthesis,WFS),基于感知的声音场景重建技术主要包括幅度平移技术(Amplitude panning,AP)和基于头相关传输函数的双耳重建技术(Head related transfer function,HRTF)。本文对上述4类三维音频技术及其对应的典型系统进行了介绍及对比分析,并对三维音频技术当前3大主要研究热点:空间听觉机制、三维音频压缩编码以及三维音频系统精简的现状与前沿技术进行了介绍。
基金This work was supported by National Natural Science Foundation of China(No.1037 4031)and Natural Science Foundation of the South China University of Technology(No.123-E4050600).