DuoQ: A DSP Utilization-aware and Outlier-free Quantization for FPGA-based LLMs Acceleration | IEEE Conference Publication | IEEE Xplore