Primal-dual differential dynamic programming: A model-based reinforcement learning for constrained dynamic optimization

Author(s)
Kim, Jong Woo; Oh, Tae Hoon; Son, Sang Hwan; Lee, Jong Min
Issued Date
2022-11
DOI
10.1016/j.compchemeng.2022.108004
URI
https://scholarworks.unist.ac.kr/handle/201301/81573
Citation
COMPUTERS & CHEMICAL ENGINEERING, v.167, pp.108004
Abstract
The main objective of this study is to develop primal-dual differential dynamic programming (DDP), a model-based reinforcement learning (RL) framework that can handle constrained dynamic optimization problems. DDP has the advantages of providing a closed-loop policy and having computational complexity that grows linearly with the time horizon. To exploit these advantages, DDP must account for both optimality and feasibility at the disturbed states encountered during closed-loop operation. Previous DDP methods consider feasibility only at the nominal state and can handle only limited types of constraints. In this paper, we propose a primal-dual DDP that incorporates a modified augmented Lagrangian and can handle general nonlinear constraints. We pay special attention to obtaining a feasible policy when the active set changes due to state perturbations, using a path-following predictor-corrector approach. The developed framework was applied to a van der Pol oscillator and a batch crystallization process, thereby validating the key aspects of this study.
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
ISSN
0098-1354
Keyword (Author)
Reinforcement learning; Differential dynamic programming; Constrained optimization; Augmented Lagrangian; Path-following
Keyword
ALGORITHM

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.