MECLA: Memory-Compute-Efficient LLM Accelerator with Scaling Sub-matrix Partition | IEEE Conference Publication | IEEE Xplore