File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

최영리

Choi, Young-Ri
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Mitigating YARN Container Overhead with Input Splits

Author(s)
Kim, WonbaeChoi, Young-RiNam, Beomseok
Issued Date
2017-05-14
DOI
10.1109/CCGRID.2017.106
URI
https://scholarworks.unist.ac.kr/handle/201301/32766
Fulltext
http://ieeexplore.ieee.org/document/7973704/
Citation
IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, pp.204 - 207
Abstract
We analyze YARN container overhead and present early results of reducing its overhead by dynamically adjusting the input split size. YARN is designed as a generic resource manager that decouples programming models from resource management infrastructures. We demonstrate that YARN's generic design incurs significant overhead because each con- tainer must perform various initialization steps, including authentication. To reduce container overhead without changing the existing YARN framework significantly, we propose leverag- ing the input split, which is the logical representation of physical HDFS blocks. With input splits, we can combine multiple HDFS blocks and increase the input size of each container, thereby enabling a single map wave and reducing the number of containers and their initialization overhead. Experimental results shows that we can avoid recurring container overhead by selecting the right size for input splits and reducing the number of containers.
Publisher
IEEE

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.