File Download

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

FoX: formation-based exploration with formation-awareness for partially observable agents in multi-agent reinforcement learning

Author(s)
Jo, Yonghyeon
Advisor
Han, Seungyul
Issued Date
2024-02
URI
https://scholarworks.unist.ac.kr/handle/201301/82164 http://unist.dcollection.net/common/orgView/200000744692
Abstract
Recently, the surge in interest surrounding deep multi-agent reinforcement learning (MARL) can be attributed to its remarkable success in tackling various cooperative multi-agent tasks. Despite these advancements, the challenge of effective exploration persists in MARL, primarily owing to the inherent partial observability of agents and the exponential growth of the exploration space with an increasing number of agents. To address the formidable scalability issue associated with exploration, we introduce a formation-based equivalence relation on the exploration space. This novel approach aims to streamline the exploration process by narrowing down the search space to states that bear meaningful distinctions in different formations. Building upon this foundational concept, we present the Formation-aware Exploration (FoX) frame- work, a pioneering approach designed to guide partially observable agents toward states within diverse formations. FoX achieves this by fostering an acute awareness of the agents’ current formation based solely on their individual observations. By encouraging agents to navigate and explore the exploration space through the lens of formations, FoX seeks to provide a more nuanced and targeted exploration strategy. In our numerical evaluations, the results unequivocally demonstrate the superior performance of the proposed FoX framework when compared to state-of-the-art MARL algorithms. These assessments were conducted on challenging tasks in both the Google Research Football (GRF) environment and the sparse StarCraft II multi-agent challenge (SMAC), showcasing the efficacy and versatility of FoX across diverse scenarios.
Publisher
Ulsan National Institute of Science and Technology
Degree
Master
Major
Graduate School of Artificial Intelligence

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.