Object-Centric Representation Learning for Video Question Answering | IEEE Conference Publication | IEEE Xplore