Characterizing the Behavior and Impact of KV Caching on Transformer Inferences Under Concurrency | IEEE Conference Publication | IEEE Xplore