Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs | IEEE Journals & Magazine | IEEE Xplore