Co-Designing Transformer Architectures for Distributed Inference With Low Communication | IEEE Journals & Magazine | IEEE Xplore