sLLM: Accelerating LLM Inference using Semantic Load Balancing with Shared Memory Data Structures | IEEE Conference Publication | IEEE Xplore