A Supervised Approach With Transformer, CNN, Optical Flow, and Sliding Window for Temporal Video Action Segmentation | IEEE Conference Publication | IEEE Xplore