Designing Extremely Memory-Efficient CNNs for On-Device Vision Tasks

Lee, Jaewook; Park, Yoel; Lee, Seulki

doi:10.1007/978-981-96-0966-6_25

Scholarworks@UNIST

UNIST Library

File Download

There are no files associated with this item.

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

이슬기

Lee, Seulki: Embedded Artificial Intelligence Lab.

Read More

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.citation.conferencePlace	VN	-
dc.citation.conferencePlace	하노이, 베트남	-
dc.citation.endPage	435	-
dc.citation.startPage	419	-
dc.citation.title	Asian Conference on Computer Vision	-
dc.contributor.author	Lee, Jaewook	-
dc.contributor.author	Park, Yoel	-
dc.contributor.author	Lee, Seulki	-
dc.date.accessioned	2024-12-09T11:05:07Z	-
dc.date.available	2024-12-09T11:05:07Z	-
dc.date.created	2024-12-07	-
dc.date.issued	2024-12-11	-
dc.description.abstract	In this paper, we introduce a memory-efficient CNN (convolutional neural network), which enables resource-constrained low-end embedded and IoT devices to perform on-device vision tasks, such as image classification and object detection, using extremely low memory, i.e., only 63 KB on ImageNet classification. Based on the bottleneck block of MobileNet, we propose three design principles that significantly curtail the peak memory usage of a CNN so that it can fit the limited KB memory of the low-end device. First, `input segmentation' divides an input image into a set of patches, including the central patch overlapped with the others, reducing the size (and memory requirement) of a large input image. Second, `patch tunneling' builds independent tunnel-like paths consisting of multiple bottleneck blocks per patch, penetrating through the entire model from an input patch to the last layer of the network, maintaining lightweight memory usage throughout the whole network. Lastly, `bottleneck reordering' rearranges the execution order of convolution operations inside the bottleneck block such that the memory usage remains constant regardless of the size of the convolution output channels. The experiment result shows that the proposed network classifies ImageNet with extremely low memory (i.e., 63 KB) while achieving competitive top-1 accuracy (i.e., 61.58%). To the best of our knowledge, the memory usage of the proposed network is far smaller than state-of-the-art memory-efficient networks, i.e., up to 89x and 3.1x smaller than MobileNet (i.e., 5.6 MB) and MCUNet (i.e., 196 KB), respectively.	-
dc.identifier.bibliographicCitation	Asian Conference on Computer Vision, pp.419 - 435	-
dc.identifier.doi	10.1007/978-981-96-0966-6_25	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/84717	-
dc.identifier.url	https://openaccess.thecvf.com/content/ACCV2024/html/Lee_Designing_Extremely_Memory-Efficient_CNNs_for_On-device_Vision_Tasks_ACCV_2024_paper.html	-
dc.language	영어	-
dc.publisher	17th Asian Conference on Computer Vision (ACCV 2024)	-
dc.title	Designing Extremely Memory-Efficient CNNs for On-Device Vision Tasks	-
dc.type	Conference Paper	-
dc.date.conferenceDate	2024-12-08	-

Show Simple Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1403 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.