A Fast Discrete Wavelet Transform Using Hybrid Parallelism on GPUs

Tran Minh Quan; Jeong, Won-Ki

doi:10.1109/TPDS.2016.2536028

Scholarworks@UNIST

UNIST Library

File Download

There are no files associated with this item.

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

정원기

Jeong, Won-Ki

Read More

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.citation.endPage	3100	-
dc.citation.number	11	-
dc.citation.startPage	3088	-
dc.citation.title	IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS	-
dc.citation.volume	27	-
dc.contributor.author	Tran Minh Quan	-
dc.contributor.author	Jeong, Won-Ki	-
dc.date.accessioned	2023-12-21T23:08:01Z	-
dc.date.available	2023-12-21T23:08:01Z	-
dc.date.created	2016-11-18	-
dc.date.issued	2016-11	-
dc.description.abstract	Wavelet transform has been widely used in many signal and image processing applications. Due to its wide adoption for time-critical applications, such as streaming and real-time signal processing, many acceleration techniques were developed during the past decade. Recently, the graphics processing unit (GPU) has gained much attention for accelerating computationally-intensive problems and many solutions of GPU-based discrete wavelet transform (DWT) have been introduced, but most of them did not fully leverage the potential of the GPU. In this paper, we present various state-of-the-art GPU optimization strategies in DWT implementation, such as leveraging shared memory, registers, warp shuffling instructions, and thread-and instruction-level parallelism (TLP, ILP), and finally elaborate our hybrid approach to further boost up its performance. In addition, we introduce a novel mixed-band memory layout for Haar DWT, where multi-level transform can be carried out in a single fused kernel launch. As a result, unlike recent GPU DWT methods that focus mainly on maximizing ILP, we show that the optimal GPU DWT performance can be achieved by hybrid parallelism combining both TLP and ILP together in a mixed-band approach. We demonstrate the performance of our proposed method by comparison with other CPU and GPU DWTmethods.	-
dc.identifier.bibliographicCitation	IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, v.27, no.11, pp.3088 - 3100	-
dc.identifier.doi	10.1109/TPDS.2016.2536028	-
dc.identifier.issn	1045-9219	-
dc.identifier.scopusid	2-s2.0-84994432337	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/20725	-
dc.identifier.url	http://ieeexplore.ieee.org/document/7422119/	-
dc.identifier.wosid	000386247000001	-
dc.language	영어	-
dc.publisher	IEEE COMPUTER SOC	-
dc.title	A Fast Discrete Wavelet Transform Using Hybrid Parallelism on GPUs	-
dc.type	Article	-
dc.description.isOpenAccess	FALSE	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods; Engineering, Electrical & Electronic	-
dc.relation.journalResearchArea	Computer Science; Engineering	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordAuthor	Wavelet transform	-
dc.subject.keywordAuthor	hybrid parallelism	-
dc.subject.keywordAuthor	lifting scheme	-
dc.subject.keywordAuthor	bit rotation	-
dc.subject.keywordAuthor	GPU computing	-
dc.subject.keywordPlus	IMPLEMENTATION	-

Show Simple Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1403 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.