Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation | IEEE Conference Publication | IEEE Xplore