File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

이종은

Lee, Jongeun
Intelligent Computing and Codesign Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Double MAC: Doubling the Performance of Convolutional Neural Networks on Modern FPGAs

Author(s)
Nguyen, DongKim, DaewooLee, Jongeun
Issued Date
2017-03-27
DOI
10.23919/DATE.2017.7927113
URI
https://scholarworks.unist.ac.kr/handle/201301/32773
Fulltext
http://ieeexplore.ieee.org/document/7927113/
Citation
Design Automation and Test in Europe Conference, pp.890 - 893
Abstract
This paper presents a novel method to double the computation rate of convolutional neural network (CNN) accelerators by packing two multiply-and-accumulate (MAC) operations into one DSP block of off-the-shelf FPGAs (called Double MAC). While a general SIMD MAC using a single DSP block seems impossible, our solution is tailored for the kind of MAC operations required for a convolution layer. Our preliminary evaluation shows that not only can our Double MAC approach increase the computation throughput of a CNN layer by twice with essentially the same resource utilization, the network level performance can also be improved by 14∼84% over a highly optimized state-of-the-art accelerator solution depending on the CNN hyper-parameters.
Publisher
20th Design, Automation and Test in Europe, DATE 2017

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.