Hybrid Iteration ADP Algorithm to Solve Cooperative, Optimal Output Regulation Problem for Continuous-Time, Linear, Multiagent Systems: Theory and Application in Islanded Modern Microgrids with IBRs

Omar Qasem, Masoud Davari, Weinan Gao, Daniel R. Kirk, Tianyou Chai

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

In this article, we propose a novel adaptive dynamic programming (ADP) algorithm, named hybrid iteration (HI), to solve the cooperative, optimal output regulation problem (CO2RP) for continuous-time, linear, multiagent systems. Unlike the traditional ADP algorithms, i.e., policy iteration (PI) and value iteration (VI), HI does not need an initial stabilizing control policy required by PI. At the same time, it maintains a faster convergence rate compared with VI. First, a model-based HI algorithm is proposed to solve the CO2RP. Based on the proposed HI algorithm, a data-driven, adaptive, optimal controller is developed to solve the cooperative, adaptive, and optimal output regulation problem without using any information about the physics of the system. Instead, the states/input information collected along the trajectories of the dynamic system is employed. The proposed data-driven HI is applied to the adaptive, optimal secondary voltage control (also known as voltage restoration control) of an islanded modern microgrid based on the inverter-based resources. Compared with the VI and PI algorithms, comparative simulation results demonstrate that the proposed HI approach is significantly able to save the convergence time of the central processing unit (also known as CPU) deployed, reduce the number of learning iterations, and remove the requirement of the initial stabilizing control policy. Comparative experiments reveal the practicality and superiority of the proposed methodology.

Original languageEnglish
Pages (from-to)834-845
Number of pages12
JournalIEEE Transactions on Industrial Electronics
Volume71
Issue number1
DOIs
StatePublished - Jan 1 2024

Keywords

  • Adaptive dynamic programming (ADP)
  • continuous-time
  • cooperative
  • linear
  • multiagent systems (MASs)
  • optimal output regulation
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Hybrid Iteration ADP Algorithm to Solve Cooperative, Optimal Output Regulation Problem for Continuous-Time, Linear, Multiagent Systems: Theory and Application in Islanded Modern Microgrids with IBRs'. Together they form a unique fingerprint.

Cite this