Deep convolutional neural network (CNN) inference on mobile devices is often desirable for user experience and privacy. However, running inference of computation- and memory-intensive CNNs on mobile devices is challenging because these devices are resource-limited and prolonging battery life is important. Because of a CNN's layer-wise structure, the blocks constituting a single CNN model can have different workload characteristics. To fully exploit DVFS for increasing the energy efficiency of mobile devices, it is important to understand the workload characteristics of each block in a model. In a preliminary study, we observe that the energy-delay trade-off of each block in MobileNet-V3 differs with the core frequency and is related to the memory access rate during inference. In this paper, we show that the memory access rate is affected by the structure of a block and the cache size of the device, and that scaling the core frequency and memory bandwidth differently according to the memory access rate can increase energy efficiency. Based on these findings, we propose NeuroValve, a fine-grained CPU clock frequency and memory bandwidth control system for running CNNs on mobile devices. We implement NeuroValve on the Pixel 3a and Pixel 3 and perform extensive evaluations over various state-of-the-art CNN models. Our results confirm that NeuroValve conserves energy while incurring only a slight delay increase compared to Android's default settings.
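The core idea — choosing a CPU frequency and memory bandwidth per block according to its memory access rate — can be sketched as follows. This is a minimal illustrative sketch, not the paper's actual implementation: the function name, the thresholds, and the operating-point tables are all assumptions made for illustration.

```python
# Hypothetical sketch of per-block frequency/bandwidth selection.
# All names, thresholds, and frequency tables here are illustrative
# assumptions, not NeuroValve's real policy or the Pixel 3/3a tables.

CPU_FREQS_KHZ = [825600, 1612800, 2208000]   # low, mid, high core clocks
MEM_BW_MHZ = [547, 1017, 1804]               # low, mid, high bus bandwidth

def select_operating_point(mem_access_rate):
    """Pick a (cpu_freq_khz, mem_bw_mhz) pair for one CNN block.

    mem_access_rate: memory accesses per instruction for the block,
    e.g. measured with hardware performance counters. A memory-bound
    block gains little from a fast core, so we lower the CPU clock
    and raise memory bandwidth; a compute-bound block gets the
    opposite treatment.
    """
    if mem_access_rate > 0.10:      # memory-bound block
        return CPU_FREQS_KHZ[0], MEM_BW_MHZ[2]
    elif mem_access_rate > 0.03:    # mixed block
        return CPU_FREQS_KHZ[1], MEM_BW_MHZ[1]
    else:                           # compute-bound block
        return CPU_FREQS_KHZ[2], MEM_BW_MHZ[0]
```

On a real device, the chosen pair would be applied before each block executes, e.g. through the Linux kernel's cpufreq interface for the core clock and a bus-bandwidth mechanism such as devfreq for memory.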
Publisher: Ulsan National Institute of Science and Technology (UNIST)