AirGun: Adaptive Granularity Quantization for Accelerating Large Language Models | IEEE Conference Publication | IEEE Xplore