Learning the fusion of audio and video aggression assessment by meta-information from human annotations | IEEE Conference Publication | IEEE Xplore