Making nested parallel transactions practical using lightweight hardware support

Baek, Woongki; Bronson, Nathan; Kozyrakis, Christos; Olukotun, Kunle

doi:10.1145/1810085.1810097

Related Researcher

백웅기

Baek, Woongki: Intelligent System Software Lab.

Full metadata record

DC Field	Value	Language
dc.citation.conferencePlace	Japan	-
dc.citation.conferencePlace	Tsukuba	-
dc.citation.endPage	71	-
dc.citation.startPage	61	-
dc.citation.title	24th ACM International Conference on Supercomputing, ICS'10	-
dc.contributor.author	Baek, Woongki	-
dc.contributor.author	Bronson, Nathan	-
dc.contributor.author	Kozyrakis, Christos	-
dc.contributor.author	Olukotun, Kunle	-
dc.date.accessioned	2023-12-20T03:37:34Z	-
dc.date.available	2023-12-20T03:37:34Z	-
dc.date.created	2015-07-09	-
dc.date.issued	2010-06-02	-
dc.description.abstract	Transactional Memory (TM) simplifies parallel programming by supporting parallel tasks that execute in an atomic and isolated way. To achieve the best possible performance, TM must support the nested parallelism available in real-world applications and supported by popular programming models. A few recent papers have proposed support for nested parallelism in software TM (STM) and hardware TM (HTM). However, the proposed designs are still impractical, as they either introduce excessive runtime overheads or require complex hardware structures. This paper presents filter-accelerated, nested TM (FaNTM). We extend a hybrid TM based on hardware signatures to provide practical support for nested parallel transactions. In the FaNTM design, hardware filters provide continuous and nesting-aware conflict detection, which effectively eliminates the excessive overheads of software nested transactions. In contrast to a full HTM approach, FaNTM simplifies hardware by decoupling nested parallel transactions from caches using hardware filters. We also describe subtle correctness and liveness issues that do not exist in the non-nested baseline TM. We quantify the performance of FaNTM using STAMP applications and microbenchmarks that use concurrent data structures. First, we demonstrate that the runtime overhead of FaNTM is small (2.3% on average) when applications use only single-level parallelism. Second, we show that the incremental performance overhead of FaNTM is reasonable when the available parallelism is used in deeper nesting levels. We also demonstrate that nested parallel transactions on FaNTM run significantly faster (e.g., 12.4x) than those on a nested STM. Finally, we show how nested parallelism is used to improve the overall performance of a transactional microbenchmark. © 2010 ACM.	-
dc.identifier.bibliographicCitation	24th ACM International Conference on Supercomputing, ICS'10, pp.61 - 71	-
dc.identifier.doi	10.1145/1810085.1810097	-
dc.identifier.scopusid	2-s2.0-77954721421	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/46827	-
dc.identifier.url	http://dl.acm.org/citation.cfm?doid=1810085.1810097	-
dc.language	English	-
dc.publisher	24th ACM International Conference on Supercomputing, ICS'10	-
dc.title	Making nested parallel transactions practical using lightweight hardware support	-
dc.type	Conference Paper	-
dc.date.conferenceDate	2010-06-02	-