Large Vision Models: How Transformer-based Models excelled over Traditional Deep Learning Architectures in Video Processing | IEEE Conference Publication | IEEE Xplore