Unlocking High Performance with Low-Bit NPUs and CPUs for Highly Optimized HPL-MxP on Cloud Brain II | IEEE Conference Publication | IEEE Xplore