MAGVLT: Masked Generative Vision-and-Language Transformer | IEEE Conference Publication | IEEE Xplore