File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

백웅기

Baek, Woongki
Intelligent System Software Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Hotness- and Lifetime-Aware Data Placement and Migration for High-Performance Deep Learning on Heterogeneous Memory Systems

Author(s)
Han, MyeonggyunHyun, JihoonPark, SeongbeomBaek, Woongki
Issued Date
2020-03
DOI
10.1109/TC.2019.2949408
URI
https://scholarworks.unist.ac.kr/handle/201301/31857
Fulltext
https://ieeexplore.ieee.org/document/8883082
Citation
IEEE TRANSACTIONS ON COMPUTERS, v.69, no.3, pp.377 - 391
Abstract
Heterogeneous memory systems that comprise memory nodes with disparate architectural characteristics (e.g., DRAM and high-bandwidth memory (HBM)) have surfaced as a promising solution in a variety of computing domains ranging from embedded to high-performance computing. Since deep learning (DL) is one of the most widely-used workloads in various computing domains, it is crucial to explore efficient memory management techniques for DL applications that execute on heterogeneous memory systems. Despite extensive prior works on system software and architectural support for efficient DL, it still remains unexplored to investigate heterogeneity-aware memory management techniques for high-performance DL on heterogeneous memory systems. To bridge this gap, we analyze the characteristics of representative DL workloads on a real heterogeneous memory system. Guided by the characterization results, we propose HALO, hotness- and lifetime-aware data placement and migration for high-performance DL on heterogeneous memory systems. Through quantitative evaluation, we demonstrate the effectiveness of HALO in that it significantly outperforms various memory management policies (e.g., 28.2 percent higher performance than the HBM-Preferred policy) supported by the underlying system software and hardware, achieves the performance comparable to the ideal case with infinite HBM, incurs small performance overheads, and delivers high performance across a wide range of application working-set sizes.
Publisher
IEEE COMPUTER SOC
ISSN
0018-9340
Keyword (Author)
Memory managementNetwork architectureRandom access memoryHardwareSystem softwareHotnessand lifetime-aware data placement and migrationhigh-performance deep learningheterogeneous memory systems

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.