Extending Large Language Models for Speech and Audio Captioning | IEEE Conference Publication | IEEE Xplore