Conferences >2017 International Conference...

DD-L1D: Improving the Decoupled L1D Efficiency for GPU Architecture

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

GPU L1 data cache contention, caused by a huge amount of concurrent threads, leads to insufficient cache utilization and poor performance, especially for cache unfriendly...Show More

Metadata

Abstract:

GPU L1 data cache contention, caused by a huge amount of concurrent threads, leads to insufficient cache utilization and poor performance, especially for cache unfriendly applications. Cache bypassing is a widely- used method to alleviate this problem, and Decoupled L1D (D-L1D) is a preventive bypassing scheme, which achieves performance improvement for cache unfriendly applications by considering the data locality of memory access streams. However, our experiments and analyses show that limited performance gain by D-L1D is attained due to the pre-defined locality threshold. To address this issue, we propose a novel bypassing scheme named as Dynamic D-L1D (DD-L1D) that directs the L1 data cache to the less contention by dynamically updating the locality threshold during runtime. We evaluate four metrics in DD-L1D to indicate the L1 cache bypassing state, and choose bypassing miss rate in our final configuration. The experimental results demonstrate that DD-L1D improves the baseline performance by 1.45X on average for cache unfriendly benchmarks. It also outperforms D-L1D and the state-of-the-art GPU cache bypassing schemes with lower hardware overhead and memory traffic.

Published in: 2017 International Conference on Networking, Architecture, and Storage (NAS)

Date of Conference: 07-09 August 2017

Date Added to IEEE Xplore: 07 September 2017

ISBN Information:

DOI: 10.1109/NAS.2017.8026851

Conference Location: Shenzhen, China

Contents

References is not available for this document.

DD-L1D: Improving the Decoupled L1D Efficiency for GPU Architecture

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

DD-L1D: Improving the Decoupled L1D Efficiency for GPU Architecture

Alerts

Abstract:

Metadata

Abstract:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?