Audio-Visual Emotion Recognition with Capsule-like Feature Representation and Model-Based Reinforcement Learning | IEEE Conference Publication | IEEE Xplore