Fine-grained Controllable Video Generation via Object Appearance and Context | IEEE Conference Publication | IEEE Xplore