Related Researcher

이종은

Lee, Jongeun
Intelligent Computing and Codesign Lab.

Software-Managed Automatic Data Sharing for Coarse-Grained Reconfigurable Coprocessors

Author(s)
Mai, Toan X.; Lee, Jongeun
Issued Date
2012-12-10
DOI
10.1109/FPT.2012.6412148
URI
https://scholarworks.unist.ac.kr/handle/201301/37547
Fulltext
https://ieeexplore.ieee.org/document/6412148/
Citation
11th International Conference on Field-Programmable Technology (FPT), pp. 277-284
Abstract
A Coarse-Grained Reconfigurable Architecture (CGRA) in a hybrid system can significantly accelerate the compute-intensive kernels of applications. However, the data communication overhead between the main processor (MP) and the CGRA can be large enough to negate the CGRA's speedup. In this paper we address the problem of reducing this overhead by offering a partially automatic data sharing technique that uses a special shared memory called Configurable Range Memory (CRM). Unlike previous work, the CRM architecture we use here is based on comparators, which gives much higher flexibility in where an array can be placed within a CRM but makes runtime software management of the CRM much more challenging. We present an efficient runtime algorithm based on a first-fit heuristic. Our experimental results demonstrate that our CRM-based system can reduce the amount of data transfer between an MP and a CGRA by up to 89.5% compared to Scratch Pad Memory (SPM)-based systems, while the software management overhead is only 1.20% to 1.34% on average (depending on CRM architecture parameters) of the kernel cycles in MP-only execution. Overall, our CRM-based system achieves an average kernel speedup of 3.47 times over MP-only execution, which is about a 20% improvement over the SPM-based system.
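
The abstract names a first-fit heuristic for placing arrays in the CRM but gives no details. The following is only a generic sketch of first-fit placement over a list of free address ranges, not the paper's algorithm. The class `RangeMemory`, its methods, and the (start, size) representation are all hypothetical names chosen for illustration.

```python
class RangeMemory:
    """Toy model (an assumption, not the paper's design) of a shared memory
    managed as a sorted list of free (start, size) holes."""

    def __init__(self, capacity):
        self.free = [(0, capacity)]  # free holes, sorted by start address
        self.placed = {}             # array name -> (start, size)

    def allocate(self, name, size):
        """First fit: take the lowest-addressed hole that is large enough."""
        for i, (start, hole) in enumerate(self.free):
            if hole >= size:
                if hole == size:
                    del self.free[i]  # hole is consumed exactly
                else:
                    self.free[i] = (start + size, hole - size)  # shrink hole
                self.placed[name] = (start, size)
                return start
        return None  # no hole fits; handling this case is left to the caller

    def release(self, name):
        """Return an array's range to the free list and merge adjacent holes."""
        self.free.append(self.placed.pop(name))
        self.free.sort()
        merged = [self.free[0]]
        for start, size in self.free[1:]:
            last_start, last_size = merged[-1]
            if last_start + last_size == start:
                merged[-1] = (last_start, last_size + size)
            else:
                merged.append((start, size))
        self.free = merged
```

First fit scans holes in address order and stops at the first one that is big enough, which keeps each placement decision cheap; that low cost is the usual reason to prefer it when allocation happens at runtime.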
Publisher
IEEE
ISBN
978-1-4673-2846-3
