Reinforcement learning solution for HJB equation arising in constrained optimal control problem.

abstract

The constrained optimal control problem depends on the solution of a complicated Hamilton-Jacobi-Bellman equation (HJBE). In this paper, a data-based off-policy reinforcement learning (RL) method is proposed that learns the solution of the HJBE and the optimal control policy from real system data. An important feature of off-policy RL is that policy evaluation can be carried out with data generated by behavior policies other than the target policy, which addresses the insufficient exploration problem. The convergence of the off-policy RL method is proved by demonstrating its equivalence to the successive approximation approach. The implementation is based on an actor-critic neural network structure, in which function approximation is performed with linearly independent basis functions. The convergence of the implementation procedure with function approximation is also proved. Finally, the method's effectiveness is verified through computer simulations.
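The off-policy idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a hypothetical scalar linear plant, an unconstrained quadratic cost, and a single quadratic basis function for the critic, whereas the paper treats constrained inputs with neural network approximators. All names and values (`a`, `b`, `q`, `r`, `behavior`, `collect_data`, `off_policy_iteration`) are assumptions made for illustration. The plant parameters are used only to generate data; the learning step sees only the recorded trajectory, which comes from an exploratory behavior policy rather than from the target policy.

```python
import numpy as np

# Hypothetical scalar plant x' = a*x + b*u and cost integrand q*x^2 + r*u^2.
# These values are illustrative assumptions, not taken from the paper.
a, b, q, r = -1.0, 1.0, 1.0, 1.0
h, steps, win = 1e-3, 10_000, 100  # Euler step, number of steps, steps per window


def behavior(t):
    """Exploratory behavior policy: a fixed sum of sinusoids (assumed)."""
    return 0.5 * np.sin(3.0 * t) + 0.5 * np.sin(7.0 * t)


def trap(y):
    """Trapezoidal integral of uniformly sampled values with spacing h."""
    return h * (y.sum() - 0.5 * (y[0] + y[-1]))


def collect_data():
    """Simulate once under the behavior policy and return per-window statistics."""
    t = h * np.arange(steps + 1)
    u = behavior(t)
    x = np.empty(steps + 1)
    x[0] = 1.0
    for n in range(steps):
        x[n + 1] = x[n] + h * (a * x[n] + b * u[n])  # forward Euler step
    dx2, ixx, ixu = [], [], []
    for s in range(0, steps, win):
        e = s + win
        dx2.append(x[e] ** 2 - x[s] ** 2)            # change of the basis x^2
        ixx.append(trap(x[s:e + 1] ** 2))            # integral of x^2
        ixu.append(trap(x[s:e + 1] * u[s:e + 1]))    # integral of x*u
    return np.array(dx2), np.array(ixx), np.array(ixu)


def off_policy_iteration(dx2, ixx, ixu, k=0.0, iters=10):
    """Reuse the same data to evaluate and improve the target policy u = -k*x.

    Each pass solves, in the least-squares sense, for the critic weight p
    (V(x) = p*x^2) and the improved actor gain k_new from
        p*dx2 - 2*r*k_new*(ixu + k*ixx) = -(q + r*k^2)*ixx.
    The plant parameters a and b do not appear in this step.
    """
    p = 0.0
    for _ in range(iters):
        A = np.column_stack([dx2, -2.0 * r * (ixu + k * ixx)])
        rhs = -(q + r * k ** 2) * ixx
        (p, k), *_ = np.linalg.lstsq(A, rhs, rcond=None)
    return p, k


p, k = off_policy_iteration(*collect_data())
print(p, k)
```

For this assumed plant the Riccati equation gives p = sqrt(2) - 1, so both printed values should land close to 0.414, up to discretization error. The initial gain k = 0 is admissible here only because the assumed plant is open-loop stable.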

authors

Huang, Tingwen

published proceedings

Neural Netw

author list (cited authors)

Luo, B., Wu, H., Huang, T., & Liu, D.

citation count

86

complete list of authors

Luo, Biao||Wu, Huai-Ning||Huang, Tingwen||Liu, Derong

publication date

November 2015

publisher

Elsevier Publisher

published in

Neural Networks Journal

keywords

Algorithms
Computer Simulation
Constrained Optimal Control
Data-based
Hamilton–Jacobi–Bellman Equation
Machine Learning
Models, Theoretical
Neural Networks, Computer
Nonlinear Dynamics
Off-policy Reinforcement Learning
Problem Solving
Reinforcement, Psychology
The Method Of Weighted Residuals

PubMed ID

26356598

Digital Object Identifier (DOI)

10.1016/j.neunet.2015.08.007

start page

150

end page

158

volume

71

URL

http://dx.doi.org/10.1016/j.neunet.2015.08.007
