SHMT: An SRAM and HBM Hybrid Computing-in-Memory Architecture With Optimized KV Cache for Multimodal Transformer | IEEE Journals & Magazine | IEEE Xplore