File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

이승준

Lee, Seung Jun
Nuclear Safety Assessment and Plant HMI Evolution Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Deep reinforcement learning for a multi-objective operation in a nuclear power plant

Author(s)
Bae, JunyongKim, Jae MinLee, Seung Jun
Issued Date
2023-09
DOI
10.1016/j.net.2023.06.009
URI
https://scholarworks.unist.ac.kr/handle/201301/65308
Citation
NUCLEAR ENGINEERING AND TECHNOLOGY, v.55, no.9, pp.3277 - 3290
Abstract
Nuclear power plant (NPP) operations with multiple objectives and devices are still performed manually by operators despite the potential for human error. These operations could be automated to reduce the burden on operators; however, classical approaches may not be suitable for these multi-objective tasks. An alternative approach is deep reinforcement learning (DRL), which has been successful in automating various complex tasks and has been applied in automation of certain operations in NPPs. But despite the recent progress, previous studies using DRL for NPP operations have limitations to handle complex multi-objective operations with multiple devices efficiently. This study proposes a novel DRL-based approach that addresses these limitations by employing a continuous action space and straightforward binary rewards supported by the adoption of a soft actor-critic and hindsight experience replay. The feasibility of the proposed approach was evaluated for controlling the pressure and volume of the reactor coolant while heating the coolant during NPP startup. The results show that the proposed approach can train the agent with a proper strategy for effectively achieving multiple objectives through the control of multiple devices. Moreover, hands-on testing results demonstrate that the trained agent is capable of handling untrained objectives, such as cooldown, with substantial success.
Publisher
한국원자력학회
ISSN
1738-5733
Keyword (Author)
AutomationDeep reinforcement learningHindsight experience replayNuclear power plantSoft actor-critic
Keyword
LEVEL

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.