File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

백웅기

Baek, Woongki
Intelligent System Software Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Quantifying the performance and energy efficiency of advanced cache indexing for GPGPU computing

Author(s)
Kim, Kyu YeunBaek, Woongki
Issued Date
2016-06
DOI
10.1016/j.micpro.2016.01.003
URI
https://scholarworks.unist.ac.kr/handle/201301/19984
Fulltext
http://www.sciencedirect.com/science/article/pii/S0141933116000053
Citation
MICROPROCESSORS AND MICROSYSTEMS, v.43, pp.81 - 94
Abstract
To achieve higher performance and energy efficiency, GPGPU architectures have recently begun to employ hardware caches. Adding caches to GPGPUs, however, does not always guarantee improved performance and energy efficiency due to the thrashing in small caches shared by thousands of threads. While prior work has proposed warp-scheduling and cache-bypassing techniques to address this issue, relatively little work has been done in the context of advanced cache indexing (ACI). To bridge this gap, this work investigates the effectiveness of ACI for high-performance and energy efficient GPGPU computing. We discuss the design and implementation of static and adaptive cache indexing schemes for GPGPUs. We then quantify the effectiveness of the ACI schemes based on a cycle accurate GPGPU simulator. Our quantitative evaluation demonstrates that the ACI schemes are effective in that they provide significant performance and energy-efficiency gains over the conventional indexing scheme. Further, we investigate the performance sensitivity of ACI to key architectural parameters (e.g., indexing latency and cache associativity). Our experimental results show that the ACI schemes are promising in that they continue to provide significant performance gains even when additional indexing latency occurs due to the hardware complexity and the baseline cache is enhanced with high associativity or large capacity. (C) 2016 Elsevier B.V. All rights reserved
Publisher
ELSEVIER SCIENCE BV
ISSN
0141-9331
Keyword (Author)
Advanced cache indexingGPGPU computingHigh performanceEnergy efficiency

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.