1. INTRODUCTION
Recent studies have shown that deep neural networks (DNNs) have become the state-of-the-art for sound source localization (SSL) and directions of arrival (DOA) estimation [1]–[8]. Although these DNN-based approaches outperform the classical signal processing-based techniques [9]–[11] under certain conditions, they suffer two major drawbacks.