TY - JOUR
T1 - Hybrid Iteration ADP Algorithm to Solve Cooperative, Optimal Output Regulation Problem for Continuous-Time, Linear, Multiagent Systems
T2 - Theory and Application in Islanded Modern Microgrids with IBRs
AU - Qasem, Omar
AU - Davari, Masoud
AU - Gao, Weinan
AU - Kirk, Daniel R.
AU - Chai, Tianyou
N1 - Publisher Copyright:
© 1982-2012 IEEE.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - In this article, we propose a novel adaptive dynamic programming (ADP) algorithm, named hybrid iteration (HI), to solve the cooperative, optimal output regulation problem (CO2RP) for continuous-time, linear, multiagent systems. Unlike the traditional ADP algorithms, i.e., policy iteration (PI) and value iteration (VI), HI does not need an initial stabilizing control policy required by PI. At the same time, it maintains a faster convergence rate compared with VI. First, a model-based HI algorithm is proposed to solve the CO2RP. Based on the proposed HI algorithm, a data-driven, adaptive, optimal controller is developed to solve the cooperative, adaptive, and optimal output regulation problem without using any information about the physics of the system. Instead, the states/input information collected along the trajectories of the dynamic system is employed. The proposed data-driven HI is applied to the adaptive, optimal secondary voltage control (also known as voltage restoration control) of an islanded modern microgrid based on the inverter-based resources. Compared with the VI and PI algorithms, comparative simulation results demonstrate that the proposed HI approach is significantly able to save the convergence time of the central processing unit (also known as CPU) deployed, reduce the number of learning iterations, and remove the requirement of the initial stabilizing control policy. Comparative experiments reveal the practicality and superiority of the proposed methodology.
AB - In this article, we propose a novel adaptive dynamic programming (ADP) algorithm, named hybrid iteration (HI), to solve the cooperative, optimal output regulation problem (CO2RP) for continuous-time, linear, multiagent systems. Unlike the traditional ADP algorithms, i.e., policy iteration (PI) and value iteration (VI), HI does not need an initial stabilizing control policy required by PI. At the same time, it maintains a faster convergence rate compared with VI. First, a model-based HI algorithm is proposed to solve the CO2RP. Based on the proposed HI algorithm, a data-driven, adaptive, optimal controller is developed to solve the cooperative, adaptive, and optimal output regulation problem without using any information about the physics of the system. Instead, the states/input information collected along the trajectories of the dynamic system is employed. The proposed data-driven HI is applied to the adaptive, optimal secondary voltage control (also known as voltage restoration control) of an islanded modern microgrid based on the inverter-based resources. Compared with the VI and PI algorithms, comparative simulation results demonstrate that the proposed HI approach is significantly able to save the convergence time of the central processing unit (also known as CPU) deployed, reduce the number of learning iterations, and remove the requirement of the initial stabilizing control policy. Comparative experiments reveal the practicality and superiority of the proposed methodology.
KW - Adaptive dynamic programming (ADP)
KW - continuous-time
KW - cooperative
KW - linear
KW - multiagent systems (MASs)
KW - optimal output regulation
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85149420218&partnerID=8YFLogxK
U2 - 10.1109/TIE.2023.3247734
DO - 10.1109/TIE.2023.3247734
M3 - Article
AN - SCOPUS:85149420218
SN - 0278-0046
VL - 71
SP - 834
EP - 845
JO - IEEE Transactions on Industrial Electronics
JF - IEEE Transactions on Industrial Electronics
IS - 1
ER -