A Scalable Architecture for Accelerating Multi-Operation and Continuous Floating-Point Matrix Computing on FPGAs | IEEE Journals & Magazine | IEEE Xplore