File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

정원기

Jeong, Won-Ki
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.endPage 3100 -
dc.citation.number 11 -
dc.citation.startPage 3088 -
dc.citation.title IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS -
dc.citation.volume 27 -
dc.contributor.author Tran Minh Quan -
dc.contributor.author Jeong, Won-Ki -
dc.date.accessioned 2023-12-21T23:08:01Z -
dc.date.available 2023-12-21T23:08:01Z -
dc.date.created 2016-11-18 -
dc.date.issued 2016-11 -
dc.description.abstract Wavelet transform has been widely used in many signal and image processing applications. Due to its wide adoption for time-critical applications, such as streaming and real-time signal processing, many acceleration techniques were developed during the past decade. Recently, the graphics processing unit (GPU) has gained much attention for accelerating computationally-intensive problems and many solutions of GPU-based discrete wavelet transform (DWT) have been introduced, but most of them did not fully leverage the potential of the GPU. In this paper, we present various state-of-the-art GPU optimization strategies in DWT implementation, such as leveraging shared memory, registers, warp shuffling instructions, and thread-and instruction-level parallelism (TLP, ILP), and finally elaborate our hybrid approach to further boost up its performance. In addition, we introduce a novel mixed-band memory layout for Haar DWT, where multi-level transform can be carried out in a single fused kernel launch. As a result, unlike recent GPU DWT methods that focus mainly on maximizing ILP, we show that the optimal GPU DWT performance can be achieved by hybrid parallelism combining both TLP and ILP together in a mixed-band approach. We demonstrate the performance of our proposed method by comparison with other CPU and GPU DWTmethods. -
dc.identifier.bibliographicCitation IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, v.27, no.11, pp.3088 - 3100 -
dc.identifier.doi 10.1109/TPDS.2016.2536028 -
dc.identifier.issn 1045-9219 -
dc.identifier.scopusid 2-s2.0-84994432337 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/20725 -
dc.identifier.url http://ieeexplore.ieee.org/document/7422119/ -
dc.identifier.wosid 000386247000001 -
dc.language 영어 -
dc.publisher IEEE COMPUTER SOC -
dc.title A Fast Discrete Wavelet Transform Using Hybrid Parallelism on GPUs -
dc.type Article -
dc.description.isOpenAccess FALSE -
dc.relation.journalWebOfScienceCategory Computer Science, Theory & Methods; Engineering, Electrical & Electronic -
dc.relation.journalResearchArea Computer Science; Engineering -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass scopus -
dc.subject.keywordAuthor Wavelet transform -
dc.subject.keywordAuthor hybrid parallelism -
dc.subject.keywordAuthor lifting scheme -
dc.subject.keywordAuthor bit rotation -
dc.subject.keywordAuthor GPU computing -
dc.subject.keywordPlus IMPLEMENTATION -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.