Related Researcher

임치현

Lim, Chiehyeon
Service Engineering & Knowledge Discovery Lab.

Towards Pareto-Efficient RLHF: Paying Attention to a Few High-Reward Samples

Author(s)
Lee, C; Lim, Chiehyeon
Issued Date
2024-11-16
URI
https://scholarworks.unist.ac.kr/handle/201301/83939
Citation
Empirical Methods in Natural Language Processing
Publisher
Association for Computational Linguistics (ACL)

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.