Implicit Temporal Modeling with Learnable Alignment for Video Recognition | IEEE Conference Publication | IEEE Xplore