File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

이종은

Lee, Jongeun
Intelligent Computing and Codesign Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.conferencePlace GE -
dc.citation.conferencePlace International Congress Centre Dresden (ICC)Dresden -
dc.citation.endPage 1398 -
dc.citation.startPage 1393 -
dc.citation.title Design Automation and Test in Europe Conference -
dc.contributor.author Rahman, Atul -
dc.contributor.author Lee, Jongeun -
dc.contributor.author Choi, Kiyoung -
dc.date.accessioned 2023-12-19T21:08:05Z -
dc.date.available 2023-12-19T21:08:05Z -
dc.date.created 2016-07-25 -
dc.date.issued 2016-03-14 -
dc.description.abstract Convolutional Deep Neural Networks (DNNs) are reported to show outstanding recognition performance in many image-related machine learning tasks. DNNs have a very high computational requirement, making accelerators a very attractive option. These DNNs have many convolutional layers with different parameters in terms of input/output/kernel sizes as well as input stride. Design constraints usually require a single design for all layers of a given DNN. Thus a key challenge is how to design a common architecture that can perform well for all convolutional layers of a DNN, which can be quite diverse and complex. In this paper we present a flexible yet highly efficient 3D neuron array architecture that is a natural fit for convolutional layers. We also present our technique to optimize its parameters including onchip buffer sizes for a given set of resource constraint for modern FPGAs. Our experimental results targeting a Virtex-7 FPGA demonstrate that our proposed technique can generate DNN accelerators that can outperform the state-of-the-art solutions, by 22% for 32-bit floating-point MAC implementations, and are far more scalable in terms of compute resources and DNN size. -
dc.identifier.bibliographicCitation Design Automation and Test in Europe Conference, pp.1393 - 1398 -
dc.identifier.scopusid 2-s2.0-84973621831 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/32807 -
dc.identifier.url https://www.date-conference.com/date16 -
dc.language 영어 -
dc.publisher ACM/IEEE -
dc.title Efficient FPGA Acceleration of Convolutional Neural Networks Using Logical-3D Compute Array -
dc.type Conference Paper -
dc.date.conferenceDate 2016-03-14 -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.