Open-Vocabulary Sound Event Localization and Detection With Joint Learning of CLAP Embedding and Activity-Coupled Cartesian DOA Vector | IEEE Journals & Magazine | IEEE Xplore