Joint Speech-Text Embeddings with Disentangled Speaker Features | IEEE Conference Publication | IEEE Xplore