dc.citation.conferencePlace |
GE |
- |
dc.citation.conferencePlace |
International Congress Centre Dresden (ICC)Dresden |
- |
dc.citation.endPage |
1398 |
- |
dc.citation.startPage |
1393 |
- |
dc.citation.title |
Design Automation and Test in Europe Conference |
- |
dc.contributor.author |
Rahman, Atul |
- |
dc.contributor.author |
Lee, Jongeun |
- |
dc.contributor.author |
Choi, Kiyoung |
- |
dc.date.accessioned |
2023-12-19T21:08:05Z |
- |
dc.date.available |
2023-12-19T21:08:05Z |
- |
dc.date.created |
2016-07-25 |
- |
dc.date.issued |
2016-03-14 |
- |
dc.description.abstract |
Convolutional Deep Neural Networks (DNNs) are reported to show outstanding recognition performance in many image-related machine learning tasks. DNNs have a very high computational requirement, making accelerators a very attractive option. These DNNs have many convolutional layers with different parameters in terms of input/output/kernel sizes as well as input stride. Design constraints usually require a single design for all layers of a given DNN. Thus a key challenge is how to design a common architecture that can perform well for all convolutional layers of a DNN, which can be quite diverse and complex. In this paper we present a flexible yet highly efficient 3D neuron array architecture that is a natural fit for convolutional layers. We also present our technique to optimize its parameters including onchip buffer sizes for a given set of resource constraint for modern FPGAs. Our experimental results targeting a Virtex-7 FPGA demonstrate that our proposed technique can generate DNN accelerators that can outperform the state-of-the-art solutions, by 22% for 32-bit floating-point MAC implementations, and are far more scalable in terms of compute resources and DNN size. |
- |
dc.identifier.bibliographicCitation |
Design Automation and Test in Europe Conference, pp.1393 - 1398 |
- |
dc.identifier.scopusid |
2-s2.0-84973621831 |
- |
dc.identifier.uri |
https://scholarworks.unist.ac.kr/handle/201301/32807 |
- |
dc.identifier.url |
https://www.date-conference.com/date16 |
- |
dc.language |
영어 |
- |
dc.publisher |
ACM/IEEE |
- |
dc.title |
Efficient FPGA Acceleration of Convolutional Neural Networks Using Logical-3D Compute Array |
- |
dc.type |
Conference Paper |
- |
dc.date.conferenceDate |
2016-03-14 |
- |