ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models | IEEE Conference Publication | IEEE Xplore