METAL: A Memory-Efficient Transformer Architecture for Long-Context Inference on FPGA | IEEE Conference Publication | IEEE Xplore