HERTI: A Reinforcement Learning-Augmented System for Efficient Real-Time Inference on Heterogeneous Embedded Systems

Han, Myeonggyun; Baek, Woongki

doi:10.1109/PACT52795.2021.00014.

Scholarworks@UNIST

UNIST Library

File Download

There are no files associated with this item.

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

백웅기

Baek, Woongki: Intelligent System Software Lab.

Read More

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.citation.conferencePlace	ZZ	-
dc.citation.conferencePlace	Online	-
dc.citation.endPage	102	-
dc.citation.startPage	90	-
dc.citation.title	International Conference on Parallel Architectures and Compilation Techniques	-
dc.contributor.author	Han, Myeonggyun	-
dc.contributor.author	Baek, Woongki	-
dc.date.accessioned	2024-01-31T21:36:42Z	-
dc.date.available	2024-01-31T21:36:42Z	-
dc.date.created	2021-12-07	-
dc.date.issued	2021-09-26	-
dc.description.abstract	Real-time inference is the key technology that enables a variety of latency-critical intelligent services such as autonomous driving and augmented reality. Heterogeneous embedded systems that consist of various computing devices with widely-different architectural and system-level characteristics are emerging as a promising solution for real-time inference. Despite extensive prior works, it still remains unexplored to design and implement a practical system that enables efficient real-time inference on heterogeneous embedded systems. To bridge this gap, we propose HERTI, a reinforcement learning-augmented system for efficient real-time inference on heterogeneous embedded systems. HERTI efficiently explores the state space and robustly finds an efficient state that significantly improves the efficiency of the target inference workload while satisfying its deadline constraint through reinforcement learning. Our quantitative evaluation conducted on a real heterogeneous embedded system demonstrates the effectiveness of HERTI in that HERTI achieves high inference efficiency in multiple metrics (i.e., energy and energy-delay product) with a strong deadline guarantee in contrast to the state-of-the-art techniques, delivers larger gains as the inference deadline and the system heterogeneity increase, provides strong generality for hyper-parameter tuning, and significantly reduces the training time through its estimation-based approach across all the evaluated inference workloads and scenarios.	-
dc.identifier.bibliographicCitation	International Conference on Parallel Architectures and Compilation Techniques, pp.90 - 102	-
dc.identifier.doi	10.1109/PACT52795.2021.00014.	-
dc.identifier.issn	1089-795X	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/77000	-
dc.identifier.url	https://ieeexplore.ieee.org/document/9563019/	-
dc.identifier.wosid	000758464500007	-
dc.language	영어	-
dc.publisher	IEEE COMPUTER SOC	-
dc.title	HERTI: A Reinforcement Learning-Augmented System for Efficient Real-Time Inference on Heterogeneous Embedded Systems	-
dc.type	Conference Paper	-
dc.date.conferenceDate	2021-09-26	-

Show Simple Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1404 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.