Ruby: Improving Hardware Efficiency for Tensor Algebra Accelerators Through Imperfect Factorization | IEEE Conference Publication | IEEE Xplore