SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models | IEEE Conference Publication | IEEE Xplore