Abstract:
Head-related transfer functions (HRTFs) are important for spatial audio reproduction in immersive systems. Most existing data-driven methods focus on personalized HRTF es...Show MoreMetadata
Abstract:
Head-related transfer functions (HRTFs) are important for spatial audio reproduction in immersive systems. Most existing data-driven methods focus on personalized HRTF estimation of monaural spectral factors. These methods ignore the importance of binaural cues, which are essential for binaural reproduction and perception. Moreover, the significant differences among various HRTF datasets in aspects such as measurement setup limit the potential of data-driven methods. This paper proposes a binaural cue generation method (BiCG), which utilizes an implicit neural network (INN) to estimate interaural level differences (ILDs) and interaural time differences (ITDs). Experimental results show that our method outperforms existing neural field methods in terms of binaural cue generation quality across datasets. We also evaluate various data preprocessing methods, and experimental results show that extreme smoothing improves binaural cue generation performance across datasets. The work provides new insights into enhancing HRTF modeling.
Published in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date of Conference: 06-11 April 2025
Date Added to IEEE Xplore: 07 March 2025
ISBN Information: