Optimized Vision Transformer Training using GPU and Multi-threading | IEEE Conference Publication | IEEE Xplore