File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

오현동

Oh, Hyondong
Autonomous Systems Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

EV-infotaxis: Extended Vision-language Model-based Infotaxis for Efficient Gas Source Term Estimation and Search

Alternative Title
EV-infotaxis: 비전 언어 모델 기반 인포텍시스를 활용한 효율적인 유해가스 근원지 추정 및 탐색
Author(s)
Park, HyounghoJang, HongroKim, JiwooKim, SeunghwanOh, Hyondong
Issued Date
2025-06
DOI
10.5302/J.ICROS.2025.25.0049
URI
https://scholarworks.unist.ac.kr/handle/201301/91450
Citation
Journal of Institute of Control, Robotics and Systems, v.31, no.6, pp.643 - 653
Abstract
As hazardous gas leakage incidents increase, rapid and accurate source term estimation (STE) has become essential. Infotaxis is an widely-used information-theoretic search method based on Bayesian estimation to infer the information of a source utilizing a gas dispersion model and a sensor model. However, when the gas dispersion model deviates from the actual dispersion distribution, its estimation and search performance degrade. To address this limitation, this study proposes extended vision-language model-based Infotaxis (EV-Infotaxis), which integrates an extended vision-language model (EVLM) into the search process. The proposed method employs Grounding DINO for object detection incorporating environmental context and utilizes CLIP to compute semantic similarity between images and texts entered by a user, improving the efficiency of VLM-based search. Additionally, the gas sensor information is effectively fused with the image information from the EVLM, enabling more efficient exploration and exploitation. Simulation results verify the effectiveness of the proposed framework, confirming that EV-Infotaxis achieves superior search efficiency compared with Infotaxis. © ICROS 2025.
Publisher
Institute of Control, Robotics and Systems
ISSN
1976-5622
Keyword (Author)
bayesian inferenceinformation theoryparticle filterpath planningsource term estimationvision-language model

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.