File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

정원기

Jeong, Won-Ki
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

A Fast Discrete Wavelet Transform Using Hybrid Parallelism on GPUs

Author(s)
Tran Minh QuanJeong, Won-Ki
Issued Date
2016-11
DOI
10.1109/TPDS.2016.2536028
URI
https://scholarworks.unist.ac.kr/handle/201301/20725
Fulltext
http://ieeexplore.ieee.org/document/7422119/
Citation
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, v.27, no.11, pp.3088 - 3100
Abstract
Wavelet transform has been widely used in many signal and image processing applications. Due to its wide adoption for time-critical applications, such as streaming and real-time signal processing, many acceleration techniques were developed during the past decade. Recently, the graphics processing unit (GPU) has gained much attention for accelerating computationally-intensive problems and many solutions of GPU-based discrete wavelet transform (DWT) have been introduced, but most of them did not fully leverage the potential of the GPU. In this paper, we present various state-of-the-art GPU optimization strategies in DWT implementation, such as leveraging shared memory, registers, warp shuffling instructions, and thread-and instruction-level parallelism (TLP, ILP), and finally elaborate our hybrid approach to further boost up its performance. In addition, we introduce a novel mixed-band memory layout for Haar DWT, where multi-level transform can be carried out in a single fused kernel launch. As a result, unlike recent GPU DWT methods that focus mainly on maximizing ILP, we show that the optimal GPU DWT performance can be achieved by hybrid parallelism combining both TLP and ILP together in a mixed-band approach. We demonstrate the performance of our proposed method by comparison with other CPU and GPU DWTmethods.
Publisher
IEEE COMPUTER SOC
ISSN
1045-9219
Keyword (Author)
Wavelet transformhybrid parallelismlifting schemebit rotationGPU computing
Keyword
IMPLEMENTATION

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.