Abstract:
Neural networks have been widely applied in direction-of-arrival (DOA) estimation and source tracking systems. In this paper, we introduce a spherical convolutional recur...Show MoreMetadata
Abstract:
Neural networks have been widely applied in direction-of-arrival (DOA) estimation and source tracking systems. In this paper, we introduce a spherical convolutional recurrent neural network that utilizes Deepsphere, a graph-based spherical convolutional neural network, employing the steered response power with phase transform (SRP-PHAT) power maps as input features for real-time robust sound source DOA estimation and tracking applications. The proposed method achieves a performance similar to that of state-of-the-art 3D convolutional neural networks (3D-CNNs) method and reduces the processing time by 88.6%, the parameter count by 85.5%, and the training memory usage by 54.0% respectively. The shallow structure of proposed network demonstrates effectiveness and efficiency.
Published in: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date of Conference: 23-27 May 2022
Date Added to IEEE Xplore: 27 April 2022
ISBN Information: