Improving Linear Algebra Computation on NUMA Platforms through Auto-tuned Nested Parallelism | IEEE Conference Publication | IEEE Xplore