File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.contributor.advisor Kwon, Jimin -
dc.contributor.author Kim, Jiwon -
dc.date.accessioned 2026-03-26T22:13:57Z -
dc.date.available 2026-03-26T22:13:57Z -
dc.date.issued 2026-02 -
dc.description.abstract Recent advances in deep neural networks have enabled high-level perception and reasoning across domains such as medical diagnosis and natural language processing, accelerating a shift from cloud- centric computation toward on-device intelligence. However, deploying high-performance deep neural networks on edge platforms remains challenging due to strict constraints on computation, memory capacity, and energy efficiency. This thesis addresses these challenges through complementary algorithmic and hardware design approaches. At the algorithmic level, this thesis presents MedBiSeNet, an efficient medical image segmentation network for real-time edge deployment. To robustly handle ambiguous and low-contrast medical boundaries, MedBiSeNet employs a boundary-enhanced bilateral path and a noise-refining feature fusion module. As a result, the proposed network achieves a Dice score of 0.9617 on polyp segmentation tasks while reducing computational complexity by over 17× compared to prior methods. At the hardware level, this thesis proposes an energy-efficient processor architecture for on-device large language models. Exploiting the characteristics of ternary-weight large language models, the proposed design reduces both linear-layer computation and self-attention memory overhead through ternary weight clustering and packing, orthogonal LSB majority-bit approximation with approximation-in-memory, and a unified processing core supporting heterogeneous workloads. The processor achieves up to 18× higher energy efficiency than prior work, enabling practical inference of billion-parameter large language models on resource-limited edge devices. -
dc.description.degree Master -
dc.description Department of Electrical Engineering -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/90960 -
dc.identifier.uri http://unist.dcollection.net/common/orgView/200000964510 -
dc.language ENG -
dc.publisher Ulsan National Institute of Science and Technology -
dc.rights.embargoReleaseDate 9999-12-31 -
dc.rights.embargoReleaseTerms 9999-12-31 -
dc.subject 플라즈마, 레이저-플라즈마 상호작용, 플라즈마 포토닉스 -
dc.title Software-Hardware Co-Optimization for Energy-Efficient AI Processing -
dc.type Thesis -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.