Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference | IEEE Conference Publication | IEEE Xplore