Spatial Mask ConvLSTM Network and Intra-Class Joint Training Method for Human Action Recognition in Video | IEEE Conference Publication | IEEE Xplore