File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

유재준

Yoo, Jaejun
Lab. of Advanced Imaging Technology
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Motion-based Frame Sampling for Natural Language-based Vehicle Retrieval

Alternative Title
움직임 기반 프레임 샘플링 기법을 이용한 자연어 기반 차량 검색 성능 향상
Author(s)
Kim, DongyoungLee, KyoungohJang, In-suKim, Kwang-JuKim, Pyong-KunYoo, Jaejun
Issued Date
2024-11
DOI
10.5573/ieie.2024.61.11.120
URI
https://scholarworks.unist.ac.kr/handle/201301/85213
Citation
전자공학회논문지, v.61, no.11, pp.120 - 128
Abstract
Retrieving target vehicles through natural language descriptions plays a crucial role in intelligent transportation systems. Traditional methods employ models that leverage the correlation between textual and visual representations, such as CLIP, to perform retrieval tasks. However, since these models only handle image, they have struggled to capture the temporal dynamics of video data. Therefore, recent researchers have attempted to enhance temporal understanding through various data augmentation techniques and video encoders. Despite these efforts, conventional approaches frequently overlook the detailed temporal characteristics of vehicles. To address this limitation, we introduce a Motion-based Video Sampling method to effectively capture the detailed motion information of the target vehicle. Additionally, we build a robust model by implementing a re-ranking algorithm to handle various vehicle attributes. Finally, our proposed model achieves state-of-the-art performance on the public vehicle retrieval dataset.
Publisher
대한전자공학회
ISSN
2287-5026

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.