Based on the measurements from 52 Chinese subjects (26 males and 26 females), a high-spatial-resolution head-related transfer function (HRTF) database with corre- sponding anthropometric parameters is established. By ...Based on the measurements from 52 Chinese subjects (26 males and 26 females), a high-spatial-resolution head-related transfer function (HRTF) database with corre- sponding anthropometric parameters is established. By using the database, cues relating to sound source localization, including interaural time difference (ITD), interaural level difference (ILD), and spectral features introduced by pinna, are analyzed. Moreover, the statistical relationship between ITD and anthropometric parameters is estimated. It is proved that the mean values of maximum ITD for male and female are significantly different, so are those for Chinese and western sub- jects. The difference in ITD is due to the difference in individual anthropometric parameters. It is further proved that the spectral features introduced by pinna strongly depend on individual; while at high frequencies (f≥ 5.5 kHz), HRTFs are left-right asymmetric. This work is instructive and helpful for the research on bin- aural hearing and applications on virtual auditory in future.展开更多
在基于扬声器阵列的声回放技术中,相比于波场合成(Wave field synthesis,WFS)和矢量基幅度平移技术,以球谐分解为理论基础的Ambisonics回放系统拥有编解码相互独立以及可扩展等优势.基本的Ambisonics系统通过采集和回放一阶声场方位信息...在基于扬声器阵列的声回放技术中,相比于波场合成(Wave field synthesis,WFS)和矢量基幅度平移技术,以球谐分解为理论基础的Ambisonics回放系统拥有编解码相互独立以及可扩展等优势.基本的Ambisonics系统通过采集和回放一阶声场方位信息:全指向性(W)和双指向性(X,Y,Z)成分,即所谓的B-format重构声场.HOA(Higher order Ambisonics)基于声场球谐函数分解将B-format以更高空间分辨率进行了扩展.已有很多学者关注HOA的回放精度和使用限制,但综合考虑人头散射效应和人耳听觉特性效应的研究还十分缺乏.从低频人耳定位的重要因素——双耳时间差(Interaural time difference,ITD)的角度评价了不同阶数Ambisonis系统的最佳听音区域.将水平面各方向入射平面波编码为HOA分量,在此基础上计算了模拟人头(刚性球)在环形阵列内部移动时二维水平面的ITD波动,并通过ITD阈值确定最佳听音区域边界.仿真结果表明基于ITD的客观评价指标可以较好地体现不同阶数Ambisonics系统的声场回放性能:4阶Ambisoics系统能够使最佳听音区域达到20cm×14cm,而1,2阶系统在中心区域尚不能实现精确回放.因此,高阶Ambisoics系统拥有更好的声源定位性能.展开更多
基金Supported by the National Natural Science Foundation of China (Grant No. 10374031)
文摘Based on the measurements from 52 Chinese subjects (26 males and 26 females), a high-spatial-resolution head-related transfer function (HRTF) database with corre- sponding anthropometric parameters is established. By using the database, cues relating to sound source localization, including interaural time difference (ITD), interaural level difference (ILD), and spectral features introduced by pinna, are analyzed. Moreover, the statistical relationship between ITD and anthropometric parameters is estimated. It is proved that the mean values of maximum ITD for male and female are significantly different, so are those for Chinese and western sub- jects. The difference in ITD is due to the difference in individual anthropometric parameters. It is further proved that the spectral features introduced by pinna strongly depend on individual; while at high frequencies (f≥ 5.5 kHz), HRTFs are left-right asymmetric. This work is instructive and helpful for the research on bin- aural hearing and applications on virtual auditory in future.
文摘在基于扬声器阵列的声回放技术中,相比于波场合成(Wave field synthesis,WFS)和矢量基幅度平移技术,以球谐分解为理论基础的Ambisonics回放系统拥有编解码相互独立以及可扩展等优势.基本的Ambisonics系统通过采集和回放一阶声场方位信息:全指向性(W)和双指向性(X,Y,Z)成分,即所谓的B-format重构声场.HOA(Higher order Ambisonics)基于声场球谐函数分解将B-format以更高空间分辨率进行了扩展.已有很多学者关注HOA的回放精度和使用限制,但综合考虑人头散射效应和人耳听觉特性效应的研究还十分缺乏.从低频人耳定位的重要因素——双耳时间差(Interaural time difference,ITD)的角度评价了不同阶数Ambisonis系统的最佳听音区域.将水平面各方向入射平面波编码为HOA分量,在此基础上计算了模拟人头(刚性球)在环形阵列内部移动时二维水平面的ITD波动,并通过ITD阈值确定最佳听音区域边界.仿真结果表明基于ITD的客观评价指标可以较好地体现不同阶数Ambisonics系统的声场回放性能:4阶Ambisoics系统能够使最佳听音区域达到20cm×14cm,而1,2阶系统在中心区域尚不能实现精确回放.因此,高阶Ambisoics系统拥有更好的声源定位性能.