Vision-Based Human Activity Recognition Utilizing DCNN and Stacked Multi-stage Multimodal Fusion Strategy | IEEE Conference Publication | IEEE Xplore