Posted on 1993-10-01, 00:00 · Authored by H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum
Abstract
We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a robot path planning problem where costs are influenced by sensors that an adversary places in the environment. We formulate the problem as a zero-sum matrix game where rows correspond to deterministic policies for the planning player and columns correspond to cost vectors the adversary can select. For a fixed cost vector, fast algorithms (such as value iteration) are available for solving MDPs. We develop efficient algorithms for matrix games where such best response oracles exist. We show that for our path planning problem these algorithms are at least an order of magnitude faster than direct solution of the linear programming formulation.
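The oracle-based idea in the abstract can be illustrated with a minimal sketch: maintain small sets of pure strategies for each player, solve the restricted game over those sets, and add each player's best response against the opponent's current mixed strategy until neither oracle improves. The code below is illustrative only, not the paper's algorithm as published: the tiny payoff matrix is invented, and fictitious play stands in for an exact solve of the restricted game.

```python
# Sketch of an oracle-based scheme for zero-sum matrix games:
# rows minimize cost, columns maximize. Only best-response oracles
# over the full strategy spaces are ever needed.

def fictitious_play(M, rows, cols, iters=5000):
    """Approximately solve the restricted game M[rows][cols] by
    fictitious play (a stand-in for an exact LP solve)."""
    rc = [0] * len(rows)   # row player's empirical play counts
    cc = [0] * len(cols)   # column player's empirical play counts
    rc[0] = cc[0] = 1
    for _ in range(iters):
        # Each player best-responds to the opponent's empirical mix.
        r = min(range(len(rows)),
                key=lambda i: sum(cc[j] * M[rows[i]][cols[j]]
                                  for j in range(len(cols))))
        c = max(range(len(cols)),
                key=lambda j: sum(rc[i] * M[rows[i]][cols[j]]
                                  for i in range(len(rows))))
        rc[r] += 1
        cc[c] += 1
    x = [v / sum(rc) for v in rc]
    y = [v / sum(cc) for v in cc]
    return x, y

def double_oracle(M):
    """Solve the zero-sum game M by growing strategy supports with
    best-response oracles, instead of solving the full game at once."""
    n_rows, n_cols = len(M), len(M[0])
    R, C = {0}, {0}                     # current strategy supports
    while True:
        rows, cols = sorted(R), sorted(C)
        x, y = fictitious_play(M, rows, cols)
        # Lift restricted mixed strategies to the full strategy space.
        xf = [0.0] * n_rows
        yf = [0.0] * n_cols
        for i, p in zip(rows, x):
            xf[i] = p
        for j, p in zip(cols, y):
            yf[j] = p
        # Best-response oracles over ALL pure strategies.
        br_row = min(range(n_rows),
                     key=lambda i: sum(yf[j] * M[i][j] for j in range(n_cols)))
        br_col = max(range(n_cols),
                     key=lambda j: sum(xf[i] * M[i][j] for i in range(n_rows)))
        if br_row in R and br_col in C:  # neither oracle improves: done
            value = sum(xf[i] * M[i][j] * yf[j]
                        for i in range(n_rows) for j in range(n_cols))
            return xf, yf, value
        R.add(br_row)
        C.add(br_col)

# Toy cost matrix with game value 1.5 (row minimizes, column maximizes);
# v converges to a value close to 1.5.
x, y, v = double_oracle([[0, 2], [3, 1]])
```

In the paper's setting the row oracle would be an MDP planner such as value iteration, so the row player never enumerates its (exponentially many) deterministic policies; here enumeration over a 2x2 matrix stands in for that oracle.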
Publisher Statement
The original publication is available at www.springerlink.com