Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation

abstract

In this paper, the optimal output tracking control problem of discrete-time nonlinear systems is considered. First, the augmented system is derived and the tracking control problem is converted into a regulation problem with a discounted performance index, which relies on the solution of the Bellman equation. Policy iteration and value iteration are two classical algorithms for solving the Bellman equation. Analysis of the two algorithms shows that policy iteration converges fast but requires an initial admissible control policy, whereas value iteration avoids that requirement but converges slowly. To achieve a tradeoff between policy iteration and value iteration, multistep heuristic dynamic programming (MsHDP) is proposed, which uses a multistep policy evaluation scheme. The convergence of the MsHDP algorithm is proved by demonstrating that it converges to the solution of the Bellman equation. Subsequently, a neural network-based actor-critic structure is developed to implement the MsHDP algorithm. The effectiveness and advantages of the developed MsHDP method are validated through comparative simulation studies.
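The tradeoff described in the abstract can be illustrated on a small tabular problem. The sketch below is not the paper's MsHDP algorithm, which targets nonlinear systems with a neural network actor-critic; it is a finite-state analogue under assumed names (`make_chain`, `multistep_evaluation_iteration`) and an assumed toy chain in which the state index stands in for the tracking-error magnitude. Each iteration improves the policy greedily and then applies `n_steps` evaluation backups under that fixed policy. With `n_steps=1` the update coincides with value iteration, and larger `n_steps` moves it toward policy iteration, while the zero initial value function means no initial admissible policy is needed.

```python
import numpy as np


def make_chain(n_states=5):
    """Hypothetical toy problem: state index = tracking-error magnitude."""
    states = np.arange(n_states)
    actions = np.array([-1, 0, 1])  # decrease, hold, or increase the error
    # Deterministic transitions, clipped to the valid state range.
    nxt = np.clip(states[:, None] + actions[None, :], 0, n_states - 1)
    # Nonnegative stage cost: squared error plus a small control penalty.
    cost = states[:, None] ** 2 + 0.1 * np.abs(actions)[None, :]
    return cost, nxt


def multistep_evaluation_iteration(cost, nxt, gamma, n_steps, n_iters):
    """Greedy improvement followed by n_steps evaluation backups per iteration.

    cost[s, a] is the stage cost and nxt[s, a] the next-state index.
    """
    n_states = cost.shape[0]
    states = np.arange(n_states)
    V = np.zeros(n_states)  # zero initial value; no admissible policy assumed
    policy = np.zeros(n_states, dtype=int)
    for _ in range(n_iters):
        # Policy improvement: act greedily with respect to the current V.
        Q = cost + gamma * V[nxt]
        policy = Q.argmin(axis=1)
        # Multistep policy evaluation: repeated backups under the fixed policy.
        c_pi = cost[states, policy]
        nxt_pi = nxt[states, policy]
        for _ in range(n_steps):
            V = c_pi + gamma * V[nxt_pi]
    return V, policy


cost, nxt = make_chain()
V_vi, _ = multistep_evaluation_iteration(cost, nxt, gamma=0.9, n_steps=1, n_iters=200)
V_ms, policy = multistep_evaluation_iteration(cost, nxt, gamma=0.9, n_steps=5, n_iters=200)
```

On this toy chain both settings reach the same value function; the difference lies in how many outer iterations each needs, which is the tradeoff the abstract refers to.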

authors

Huang, Tingwen

published proceedings

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS

author list (cited authors)

Luo, B., Liu, D., Huang, T., & Liu, J.

citation count

53

complete list of authors

Luo, Biao||Liu, Derong||Huang, Tingwen||Liu, Jiangjiang

publication date

October 2019

publisher

Institute of Electrical and Electronics Engineers (IEEE)

published in

IEEE Transactions on Systems, Man, and Cybernetics: Systems

keywords

Adaptive Dynamic Programming (ADP)
Bellman Equation
Heuristic Dynamic Programming
Neural Networks (NNs)
Output Tracking Control

Digital Object Identifier (DOI)

10.1109/TSMC.2017.2771516

start page

2155

end page

2165

volume

49

issue

10

URL

http://dx.doi.org/10.1109/tsmc.2017.2771516
