Multi-modal Factorized Bilinear Pooling with Co-attention Learning for Visual Question Answering | IEEE Conference Publication | IEEE Xplore