File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

백웅기

Baek, Woongki
Intelligent System Software Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Activation Sequence Caching: High-Throughput and Memory-Efficient Generative Inference with a Single GPU

Author(s)
Kim, SowoongSim, EunyeongShin, YoungsamCho, YeonGonBaek, Woongki
Issued Date
2024-10-14
URI
https://scholarworks.unist.ac.kr/handle/201301/84622
Citation
International Conference on Parallel Architectures and Compilation Techniques
Publisher
ACM/IEEE

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.