Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm | IEEE Conference Publication | IEEE Xplore