Related Researcher: Oh, Hyondong (오현동), Autonomous Systems Lab.

Enhanced Reward Function Design for Source Term Estimation Based on Deep Reinforcement Learning

Author(s): Lee, Junhee; Jang, Hongro; Park, Minkyu; Oh, Hyondong

Issued Date: 2025-05

DOI: 10.1109/ACCESS.2025.3569827

URI: https://scholarworks.unist.ac.kr/handle/201301/87210

Citation: IEEE ACCESS, v.13, pp.87777 - 87792

Abstract: This study investigates the design of reward functions for deep reinforcement learning-based source term estimation (STE). The STE problem, estimating the properties of an unknown hazardous gas leak with a mobile sensor, is challenging because of environmental turbulence and sensor noise. To address this issue, a particle filter is employed to estimate the source term from noisy sensor measurements, and a deep Q-network is used to find the optimal source search policy. In deep reinforcement learning, selecting an appropriate reward function is crucial because it directly affects learning performance. This paper first reviews existing reward functions based on penalty, distance, concentration, and entropy metrics. To overcome the limitations of existing rewards, we combine their strengths and propose new reward functions such as the Gaussian mixture model (GMM) variance-based reward and the GMM information gain-based reward. To validate the robustness of the proposed approach, simulations are conducted in two types of environments, basic and turbulent, obtained by adjusting the noise parameters. The simulation results demonstrate that the proposed reward functions outperform existing ones and are particularly robust in noisy environments.
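
Illustrative sketch (not taken from the paper): the abstract lists entropy among the metrics used by existing reward functions and names a particle filter as the estimator. One common way to combine the two is to reward the agent by how much a new measurement reduces the Shannon entropy of the particle weights. The Python below is a minimal sketch of that idea under stated assumptions: the Gaussian sensor model, the parameter sigma, the example numbers, and every function name are hypothetical and chosen only for illustration. It does not reproduce the paper's GMM-based rewards, whose definitions are not given in this record.

```python
import math


def normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights)
    if total <= 0.0:
        raise ValueError("weights must have a positive sum")
    return [w / total for w in weights]


def entropy(weights):
    """Shannon entropy (in nats) of a normalized weight vector."""
    return -sum(w * math.log(w) for w in weights if w > 0.0)


def gaussian_likelihood(measured, predicted, sigma):
    """Assumed sensor model: Gaussian noise around the predicted reading."""
    z = (measured - predicted) / sigma
    return math.exp(-0.5 * z * z)


def update_weights(weights, predicted, measured, sigma):
    """Bayesian reweighting of the particles after one sensor reading."""
    posterior = [
        w * gaussian_likelihood(measured, p, sigma)
        for w, p in zip(weights, predicted)
    ]
    return normalize(posterior)


def entropy_reward(prior, posterior):
    """Reward = reduction in uncertainty about the source term."""
    return entropy(prior) - entropy(posterior)


# Hypothetical example: four particles (candidate source terms), each
# predicting a different concentration at the sensor's current position.
prior = normalize([1.0, 1.0, 1.0, 1.0])
predicted = [1.0, 2.0, 3.0, 4.0]
posterior = update_weights(prior, predicted, measured=2.0, sigma=0.5)
reward = entropy_reward(prior, posterior)
```

In this example the measurement agrees best with the second particle, so the posterior weights concentrate there and the reward is positive. A measurement that fits every particle equally well would leave the weights unchanged and give a reward of zero.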

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

ISSN: 2169-3536

Keyword (Author): Deep reinforcement learning, Position measurement, Entropy, Robot sensing systems, Noise measurement, Uncertainty, Search problems, Source term estimation, deep reinforcement learning, deep Q-network, reward function, Bayesian inference, particle filter, path planning, Noise, Estimation, Mobile agents

Keyword: INFOTAXIS, SOURCE SEARCH
