Efficient FPGA Acceleration of Convolutional Neural Networks Using Logical-3D Compute Array

Rahman, Atul; Lee, Jongeun; Choi, Kiyoung

Scholarworks@UNIST

UNIST Library

File Download

There are no files associated with this item.

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

이종은

Lee, Jongeun: Intelligent Computing and Codesign Lab.

Read More

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.citation.conferencePlace	GE	-
dc.citation.conferencePlace	International Congress Centre Dresden (ICC)Dresden	-
dc.citation.endPage	1398	-
dc.citation.startPage	1393	-
dc.citation.title	Design Automation and Test in Europe Conference	-
dc.contributor.author	Rahman, Atul	-
dc.contributor.author	Lee, Jongeun	-
dc.contributor.author	Choi, Kiyoung	-
dc.date.accessioned	2023-12-19T21:08:05Z	-
dc.date.available	2023-12-19T21:08:05Z	-
dc.date.created	2016-07-25	-
dc.date.issued	2016-03-14	-
dc.description.abstract	Convolutional Deep Neural Networks (DNNs) are reported to show outstanding recognition performance in many image-related machine learning tasks. DNNs have a very high computational requirement, making accelerators a very attractive option. These DNNs have many convolutional layers with different parameters in terms of input/output/kernel sizes as well as input stride. Design constraints usually require a single design for all layers of a given DNN. Thus a key challenge is how to design a common architecture that can perform well for all convolutional layers of a DNN, which can be quite diverse and complex. In this paper we present a flexible yet highly efficient 3D neuron array architecture that is a natural fit for convolutional layers. We also present our technique to optimize its parameters including onchip buffer sizes for a given set of resource constraint for modern FPGAs. Our experimental results targeting a Virtex-7 FPGA demonstrate that our proposed technique can generate DNN accelerators that can outperform the state-of-the-art solutions, by 22% for 32-bit floating-point MAC implementations, and are far more scalable in terms of compute resources and DNN size.	-
dc.identifier.bibliographicCitation	Design Automation and Test in Europe Conference, pp.1393 - 1398	-
dc.identifier.scopusid	2-s2.0-84973621831	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/32807	-
dc.identifier.url	https://www.date-conference.com/date16	-
dc.language	영어	-
dc.publisher	ACM/IEEE	-
dc.title	Efficient FPGA Acceleration of Convolutional Neural Networks Using Logical-3D Compute Array	-
dc.type	Conference Paper	-
dc.date.conferenceDate	2016-03-14	-

Show Simple Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1403 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.