SpecViT: A Custom Vision-Transformer based Approach for Audio Deepfake Detection | IEEE Conference Publication | IEEE Xplore