Architectural support for parallel reductions in scalable shared-memory multiprocessors | IEEE Conference Publication | IEEE Xplore