Carnegie Mellon University

Influence and Variance of a Markov Chain: Application to Adaptive Discretization in Optimal Control

Journal contribution, posted on 1999-01-01, authored by Remi Munos and Andrew Moore
This paper addresses the difficult problem of deciding where to refine the resolution of adaptive discretizations for solving continuous time-and-space deterministic optimal control problems. We introduce two measures, the influence and the variance of a Markov chain. Influence measures the extent to which changes at some state affect the value function at other states. Variance measures the heterogeneity of the future cumulated rewards (whose mean is the value function). We combine these two measures to derive an efficient, nonlocal splitting criterion that takes into account the impact of a state on other states when deciding whether to split it. We illustrate this method on the nonlinear, two-dimensional "Car on the Hill" problem and on the four-dimensional "space-shuttle" and "airplane-meeting" control problems.
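
The following is a minimal illustrative sketch, not the paper's exact construction or notation: for a small finite Markov reward process (a made-up 3-state chain with transition matrix P, per-state reward r, and discount factor gamma), it computes the value function, an influence-style sensitivity of each state's value to the reward at other states, and the variance of the cumulated discounted reward whose mean is the value function.

```python
import numpy as np

# Hypothetical example chain (not from the paper): state 2 is absorbing.
gamma = 0.9
P = np.array([[0.5, 0.5, 0.0],
              [0.0, 0.5, 0.5],
              [0.0, 0.0, 1.0]])
r = np.array([0.0, 1.0, 0.0])
n = len(r)

# Value function: V = r + gamma * P V  =>  V = (I - gamma P)^{-1} r
M = np.linalg.inv(np.eye(n) - gamma * P)
V = M @ r

# Influence in a sensitivity sense: dV(i)/dr(j) = [(I - gamma P)^{-1}]_{ij},
# i.e. the discounted expected number of visits to state j starting from i.
influence = M

# Variance of the cumulated discounted reward, from the recursion
#   sigma2(i) = gamma^2 * [ sum_j P(i,j)(sigma2(j) + V(j)^2) - ((P V)(i))^2 ],
# which is linear in sigma2 and can be solved in closed form.
local_spread = gamma**2 * (P @ (V**2) - (P @ V)**2)
sigma2 = np.linalg.solve(np.eye(n) - gamma**2 * P, local_spread)

print("value     =", V)
print("influence =\n", influence)
print("variance  =", sigma2)
```

States with both high variance and high influence on the rest of the chain are, in the spirit of the paper's criterion, the natural candidates for refinement.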

History

Publisher Statement

"©1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE." "This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder."

Date

1999-01-01
