Learning from Video and Text via Large-Scale Discriminative Clustering | IEEE Conference Publication | IEEE Xplore