There are no files associated with this item.
Cited time in
Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.citation.title | IEEE JOURNAL OF SOLID-STATE CIRCUITS | - |
| dc.contributor.author | Kim, Donghyuk | - |
| dc.contributor.author | Kim, Jae-Young | - |
| dc.contributor.author | Cho, Hyunjun | - |
| dc.contributor.author | Yoo, Seungjae | - |
| dc.contributor.author | Lee, Sukjin | - |
| dc.contributor.author | Yune, Sungwoong | - |
| dc.contributor.author | Yang, Sejeong | - |
| dc.contributor.author | Jeong, Hoichang | - |
| dc.contributor.author | Park, Keonhee | - |
| dc.contributor.author | Lee, Ki-Soo | - |
| dc.contributor.author | Lee, Jongchan | - |
| dc.contributor.author | Han, Chanheum | - |
| dc.contributor.author | Koo, Gunmo | - |
| dc.contributor.author | Han, Yuli | - |
| dc.contributor.author | Kim, Jaejin | - |
| dc.contributor.author | Kim, Jaemin | - |
| dc.contributor.author | Lee, Kyuho Jason | - |
| dc.contributor.author | Chae, Joo-Hyung | - |
| dc.contributor.author | Cho, Kunhee | - |
| dc.contributor.author | Kim, Joo-Young | - |
| dc.date.accessioned | 2025-12-02T13:13:05Z | - |
| dc.date.available | 2025-12-02T13:13:05Z | - |
| dc.date.created | 2025-10-20 | - |
| dc.date.issued | 2025-10 | - |
| dc.description.abstract | Transformer models have revolutionized artificial intelligence (AI) applications across various domains, but their increasing complexity poses significant challenges in terms of computational and memory demands. While processing-in-memory (PIM) paradigms have been adopted to address these limitations, existing PIM-based transformer accelerators still face hurdles such as: 1) focusing solely on optimizing attention layers; 2) lack of sparsity exploitation for transformers; and 3) limited PIM macro capacity and low cell density, which degrades on-chip data reuse and increases external memory access (EMA). This article presents DPIM, a novel 2T1C eDRAM-based transformer-in-memory chip that addresses these challenges through three key innovations: 1) a sparsity-aware quantization (SAQ) scheme that significantly increases bit-slice sparsity in both activation and weight data, achieving ratios of 83.3% and 88.4%, respectively, with minimal accuracy loss; 2) a heterogeneous PIM core capable of efficiently handling both sparse and dense matrix multiplications (MMs); and 3) a high-density 2T1C eDRAM cell with a density of 1.38 Mb/mm(2), enabling large-capacity PIM macros. By integrating these features, DPIM achieves improved computational efficiency and reduced EMA with enhanced on-chip data reuse. The DPIM chip, fabricated using 28-nm CMOS technology, achieves a throughput of 3.03-12.12 TOPS and an energy efficiency of 4.84-19.36 TOPS/W, all measured across INT8 and INT4 operations, respectively. It achieves a throughput density of 0.55 TOPS/mm(2) with INT8 operation. With a total macro size of 4608 kb, the chip occupies a die area of 20.25 mm(2) and operates at frequencies from 50 to 285 MHz with a supply voltage of 0.85-1.0 V. The DPIM successfully executes BERT-Large on the general language understanding evaluation (GLUE) dataset. Its macro density is 1413 kb/mm(2), and the resulting density figure-of-merit (FoM) (macro density x throughput density) is 1.6x - 115.8x higher than previous works, representing a significant advancement in hardware design for efficient transformer processing. | - |
| dc.identifier.bibliographicCitation | IEEE JOURNAL OF SOLID-STATE CIRCUITS | - |
| dc.identifier.doi | 10.1109/JSSC.2025.3607826 | - |
| dc.identifier.issn | 0018-9200 | - |
| dc.identifier.scopusid | 2-s2.0-105018036698 | - |
| dc.identifier.uri | https://scholarworks.unist.ac.kr/handle/201301/88769 | - |
| dc.identifier.wosid | 001587334300001 | - |
| dc.language | 영어 | - |
| dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
| dc.title | DPIM: A 2T1C eDRAM Transformer-in-Memory Chip With Sparsity-Aware Quantization and Heterogeneous Dense-Sparse Core | - |
| dc.type | Article | - |
| dc.description.isOpenAccess | FALSE | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.type.docType | Article; Early Access | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordAuthor | heterogeneous processor | - |
| dc.subject.keywordAuthor | processing-in-memory (PIM) | - |
| dc.subject.keywordAuthor | quantization | - |
| dc.subject.keywordAuthor | transformer | - |
| dc.subject.keywordAuthor | 2T1C cell | - |
| dc.subject.keywordPlus | ARCHITECTURE | - |
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Tel : 052-217-1403 / Email : scholarworks@unist.ac.kr
Copyright (c) 2023 by UNIST LIBRARY. All rights reserved.
ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.