Cambricon-LLM: A Chiplet-Based Hybrid Architecture for On-Device Inference of 70B LLM | IEEE Conference Publication | IEEE Xplore