File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

이종은

Lee, Jongeun
Intelligent Computing and Codesign Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Partial Sum Quantization for Reducing ADC Size in ReRAM-based Neural Network Accelerators

Author(s)
Azamat, AzatAsim, FaaizKim, JintaeLee, Jongeun
Issued Date
2023-12
DOI
10.1109/TCAD.2023.3294461
URI
https://scholarworks.unist.ac.kr/handle/201301/66315
Citation
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, v.42, no.12, pp.4897 - 4908
Abstract
While ReRAM (Resistive Random-Access Memory) crossbar arrays have the potential to significantly accelerate DNN (Deep Neural Network) training through fast and lowcost matrix-vector multiplication, peripheral circuits like ADCs (analog-to-digital converters) create a high overhead. These ADCs consume over half of the chip power and a considerable portion of the chip cost. To address this challenge, we propose advanced quantization techniques that can significantly reduce the ADC overhead of ReRAM crossbar arrays. Our methodology interprets ADC as a quantization mechanism, allowing us to scale the range of ADC input optimally along with the weight parameters of a DNN, resulting in multiple-bit reduction in ADC precision. This approach reduces ADC size and power consumption by several times, and it is applicable to any DNN type (binarized or multi-bit) and any ReRAM crossbar array size. Additionally, we propose ways to minimize the overhead of the digital scaler, which is an essential part of our scheme and sometimes required. Our experimental results using ResNet-18 on the ImageNet dataset demonstrate that our method can reduce the size of the ADC by 32 times compared to ISAAC with only a minimal accuracy loss degradation of 0.24 evaluation results in the presence of ReRAM non-ideality (such as stuck-at fault). IEEE
Publisher
Institute of Electrical and Electronics Engineers
ISSN
0278-0070
Keyword (Author)
analog-to-digital conversion (ADC)Artificial neural networksConvolutionconvolutional neural network (CNN)CostsHardwareIn-memory computing acceleratormemristorquantizationQuantization (signal)ThroughputTraining

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.