I. Introduction
Vector hardware accelerators are specialized computing units designed to process vector operations, optimizing the execution time in conjunction with technology scaling and energy efficiency demands of heavy computational workloads in Artificial Intelligence (AI) and High-Performance Computing (HPC) applications [1]–[3]. They are usually organized in multiple vector lanes equipped with arithmetic functional units that allow the parallel execution of the same instruction on different data in a Single Instruction Multiple Data (SIMD) fashion.