Approximate dynamic programming with state aggregation applied to UAV perimeter patrol

abstract

One encounters the curse of dimensionality in the application of dynamic programming to determine optimal policies for large-scale controlled Markov chains. In this paper, we provide a reward-based aggregation method to construct suboptimal policies for a perimeter surveillance control problem which gives rise to a large scale Markov chain. The novelty of this approach lies in circumventing the need for value iteration over the entire state space. Instead, the state space is partitioned and the value function is approximated by a constant over each partition. We associate a meta-state with each partition, where the transition probabilities between these meta-states are known. The state aggregation approach results in a significant reduction in the computational burden and lends itself to value iteration over the aggregated state-space. We provide bounds to assess the quality of the approximation and give numerical results that support the proposed methodology. Published 2011. This article is a US Government work and is in the public domain in the USA.
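
The idea in the abstract (partition the state space, hold the value function constant over each partition, and run value iteration on the meta-states only) can be illustrated with a minimal Python sketch. This is not the authors' reward-based aggregation scheme, whose details the abstract does not give. It assumes uniform weighting of the states inside each partition, and the function names `aggregate_mdp` and `value_iteration`, the discount factor, and the 4-state toy chain are all hypothetical choices made for illustration.

```python
def aggregate_mdp(P, R, partition, n_meta):
    """Build an aggregated MDP over meta-states.

    P[a][s][t]   : probability of moving from state s to t under action a
    R[s][a]      : one-step reward for action a in state s
    partition[s] : index of the meta-state that contains state s
    Assumption: states inside a partition are weighted uniformly.
    """
    n_actions = len(P)
    n_states = len(partition)
    members = [[s for s in range(n_states) if partition[s] == m]
               for m in range(n_meta)]
    P_agg = [[[0.0] * n_meta for _ in range(n_meta)] for _ in range(n_actions)]
    R_agg = [[0.0] * n_actions for _ in range(n_meta)]
    for m, group in enumerate(members):
        w = 1.0 / len(group)  # uniform weight (an assumption of this sketch)
        for a in range(n_actions):
            for s in group:
                R_agg[m][a] += w * R[s][a]
                for t in range(n_states):
                    P_agg[a][m][partition[t]] += w * P[a][s][t]
    return P_agg, R_agg


def value_iteration(P, R, discount, tol=1e-10, max_iter=10000):
    """Standard discounted value iteration; returns the value vector."""
    n_states = len(R)
    n_actions = len(P)
    V = [0.0] * n_states
    for _ in range(max_iter):
        V_new = [
            max(R[s][a] + discount * sum(P[a][s][t] * V[t]
                                         for t in range(n_states))
                for a in range(n_actions))
            for s in range(n_states)
        ]
        if max(abs(x - y) for x, y in zip(V_new, V)) < tol:
            return V_new
        V = V_new
    return V


# Toy 4-state, 2-action chain (hypothetical data, not from the paper).
partition = [0, 0, 1, 1]
P = [
    # action 0: stay inside the current group
    [[0.5, 0.5, 0.0, 0.0],
     [0.5, 0.5, 0.0, 0.0],
     [0.0, 0.0, 0.5, 0.5],
     [0.0, 0.0, 0.5, 0.5]],
    # action 1: jump to the other group
    [[0.0, 0.0, 0.5, 0.5],
     [0.0, 0.0, 0.5, 0.5],
     [0.5, 0.5, 0.0, 0.0],
     [0.5, 0.5, 0.0, 0.0]],
]
R = [[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [1.0, 0.0]]

# Value iteration runs over 2 meta-states instead of 4 states.
P_agg, R_agg = aggregate_mdp(P, R, partition, n_meta=2)
V_meta = value_iteration(P_agg, R_agg, discount=0.9)

# Lift back: every state inherits the constant value of its partition.
V_approx = [V_meta[partition[s]] for s in range(len(partition))]
```

In this toy chain the states within each partition behave identically, so the lifted values coincide with the exact ones. In general the aggregated solution is only an approximation, which is why the abstract mentions bounds on its quality.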

authors

Darbha, Swaroop

published proceedings

International Journal of Robust and Nonlinear Control

author list (cited authors)

Krishnamoorthy, K., Pachter, M., Darbha, S., & Chandler, P.

citation count

18

complete list of authors

Krishnamoorthy, K; Pachter, M; Darbha, S; Chandler, P

publication date

August 2011

publisher

Wiley

published in

International Journal of Robust and Nonlinear Control

keywords

49 Mathematical Sciences
4901 Applied Mathematics

Digital Object Identifier (DOI)

10.1002/rnc.1686

start page

1396

end page

1409

volume

21

issue

12

URL

http://dx.doi.org/10.1002/rnc.1686
