Conferences >2018 30th International Sympo...

Variable-Size Batched Condition Number Calculation on GPUs

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We present a kernel that is designed to quickly compute the condition number of a large collection of tiny matrices on a graphics processing unit (GPU). The matrices can ...Show More

Metadata

Abstract:

We present a kernel that is designed to quickly compute the condition number of a large collection of tiny matrices on a graphics processing unit (GPU). The matrices can differ in size and the process integrates the use of pivoting to ensure a numerically-stable matrix inversion. The performance assessment reveals that, in double precision arithmetic, the new GPU kernel achieves up to 550 GFLOPs (billions of floating-point operations per second) and 800 GFLOPs on NVIDIA's P100 and V100 GPUs, respectively. The results also demonstrate a considerable speed-up with respect to a workflow that computes the condition number via launching a set of four batched kernels. In addition, we present a variable-size batched kernel for the computation of the matrix infinity norm. We show that this memory-bound kernel achieves up to 90% of the sustainable peak bandwidth.

Published in: 2018 30th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Date of Conference: 24-27 September 2018

Date Added to IEEE Xplore: 21 February 2019

ISBN Information:

Print on Demand(PoD) ISSN: 1550-6533

DOI: 10.1109/CAHPC.2018.8645907

Conference Location: Lyon, France

Contents

References is not available for this document.

Variable-Size Batched Condition Number Calculation on GPUs

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Variable-Size Batched Condition Number Calculation on GPUs

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?