EdgeShard: Efficient LLM Inference via Collaborative Edge Computing | IEEE Journals & Magazine | IEEE Xplore