A Scalable GPT-2 Inference Hardware Architecture on FPGA | IEEE Conference Publication | IEEE Xplore