Faster Speech-LLaMA Inference with Multi-token Prediction | IEEE Conference Publication | IEEE Xplore