Pruning One More Token is Enough: Leveraging Latency-Workload Non-Linearities for Vision Transformers on the Edge | IEEE Conference Publication | IEEE Xplore