Model-based reinforcement learning and predictive control for two-stage optimal control of fed-batch bioreactor

Related Researcher

오태훈

Abstract: In this study, we propose a two-stage optimal control framework for a fed-batch bioreactor. The high-level controller uses a nominal model to compute the optimal feed trajectory that maximizes final-time productivity and yield. The low-level controller, by contrast, maintains the high-level performance in the presence of model-plant mismatch and real-time disturbances. This two-stage decomposition enables closed-loop operation with less online recomputation. To solve the high-level optimization, we apply differential dynamic programming (DDP), a model-based reinforcement learning method that uses the derivatives of the model. Three types of low-level controllers are proposed: a DDP controller, a model predictive controller (MPC) that tracks the high-level trajectory, and an economic MPC. We first validate that DDP yields results as good as those of the direct method. Second, we compare the three low-level controllers and verify the necessity of the two-stage decomposition through studies on a bioreactor. (c) 2021 Elsevier Ltd. All rights reserved.

Keyword (Author): Fed-batch bioreactor, Dynamic optimization, Reinforcement learning, Model predictive control
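
The two-stage structure described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy Monod-type model with substrate inhibition, all parameter values, the grid search over ramp-shaped feed profiles (a crude stand-in for DDP), and the proportional substrate-tracking correction (a crude stand-in for a tracking MPC). It only shows how a high-level plan computed on a nominal model can be handed to a low-level loop that runs against a mismatched plant.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Params:
    """Hypothetical fed-batch parameters (illustrative values only)."""
    mu_max: float = 0.5   # 1/h, maximum specific growth rate
    k_s: float = 0.2      # g/L, half-saturation constant
    k_i: float = 20.0     # g/L, substrate inhibition constant
    y_xs: float = 0.5     # g biomass per g substrate
    s_in: float = 200.0   # g/L, substrate concentration in the feed
    v_max: float = 2.0    # L, reactor volume limit
    f_max: float = 0.1    # L/h, feed rate limit


DT, N = 0.5, 40           # step size in h, number of steps (20 h batch)
X0 = (0.5, 5.0, 1.0)      # initial biomass g/L, substrate g/L, volume L


def step(state, feed, p):
    """One explicit Euler step; returns the next state and the clipped feed."""
    x, s, v = state
    feed = max(0.0, min(feed, p.f_max, (p.v_max - v) / DT))
    mu = p.mu_max * s / (p.k_s + s + s * s / p.k_i)
    dil = feed / v
    x_new = x + DT * (mu * x - dil * x)
    s_new = s + DT * (-mu * x / p.y_xs + dil * (p.s_in - s))
    return (max(x_new, 0.0), max(s_new, 0.0), v + DT * feed), feed


def simulate(feeds, p):
    """Open-loop rollout; returns N+1 states and the N feeds actually applied."""
    traj, applied = [X0], []
    for f in feeds:
        nxt, f_used = step(traj[-1], f, p)
        traj.append(nxt)
        applied.append(f_used)
    return traj, applied


def final_biomass(traj):
    """Total biomass (g) at the final time."""
    x, _, v = traj[-1]
    return x * v


def plan_high_level(p):
    """Grid search over ramp feeds F_k = f0 + slope * k (stand-in for DDP)."""
    best = None
    for f0, slope in product([0.0, 0.01, 0.02, 0.04],
                             [0.0, 0.001, 0.002, 0.004]):
        traj, applied = simulate([f0 + slope * k for k in range(N)], p)
        score = final_biomass(traj)
        if best is None or score > best[0]:
            best = (score, applied, traj)
    return best


def track_low_level(ref_feeds, ref_traj, plant, gain=0.01):
    """Proportional correction on substrate error (stand-in for tracking MPC)."""
    state, traj, applied = X0, [X0], []
    for k in range(N):
        s_err = ref_traj[k][1] - state[1]
        state, f_used = step(state, ref_feeds[k] + gain * s_err, plant)
        traj.append(state)
        applied.append(f_used)
    return traj, applied


nominal = Params()
plant = Params(mu_max=0.4)  # hypothetical model-plant mismatch
score, ref_feeds, ref_traj = plan_high_level(nominal)
open_loop, _ = simulate(ref_feeds, plant)
closed_loop, cl_feeds = track_low_level(ref_feeds, ref_traj, plant)
print(final_biomass(ref_traj), final_biomass(open_loop),
      final_biomass(closed_loop))
```

The printed values are the final biomass of the nominal plan, of the same plan applied open-loop to the mismatched plant, and of the tracked closed-loop run. Whether tracking helps in this toy setting depends on the assumed parameters and gain, so the sketch makes no claim about the paper's results.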
