Abstract
In this article, we propose a novel adaptive dynamic programming (ADP) algorithm, named hybrid iteration (HI), to solve the cooperative, optimal output regulation problem (CO2RP) for continuous-time, linear, multiagent systems. Unlike the traditional ADP algorithms, i.e., policy iteration (PI) and value iteration (VI), HI does not need an initial stabilizing control policy required by PI. At the same time, it maintains a faster convergence rate compared with VI. First, a model-based HI algorithm is proposed to solve the CO2RP. Based on the proposed HI algorithm, a data-driven, adaptive, optimal controller is developed to solve the cooperative, adaptive, and optimal output regulation problem without using any information about the physics of the system. Instead, the states/input information collected along the trajectories of the dynamic system is employed. The proposed data-driven HI is applied to the adaptive, optimal secondary voltage control (also known as voltage restoration control) of an islanded modern microgrid based on the inverter-based resources. Compared with the VI and PI algorithms, comparative simulation results demonstrate that the proposed HI approach is significantly able to save the convergence time of the central processing unit (also known as CPU) deployed, reduce the number of learning iterations, and remove the requirement of the initial stabilizing control policy. Comparative experiments reveal the practicality and superiority of the proposed methodology.
| Original language | English |
|---|---|
| Article number | 1 |
| Pages (from-to) | 834-845 |
| Number of pages | 12 |
| Journal | IEEE Transactions on Industrial Electronics |
| Volume | 71 |
| Issue number | 1 |
| DOIs | |
| State | Published - Feb 28 2023 |
Scopus Subject Areas
- Control and Systems Engineering
- Electrical and Electronic Engineering
Keywords
- Adaptive dynamic programming (ADP)
- continuous-time
- cooperative
- linear
- multiagent systems (MASs)
- optimal output regulation
- reinforcement learning