Posted on 2000-02-01, 00:00. Authored by Geoffrey J. Gordon.
Abstract: "No-regret algorithms are a popular class of online learning rules. Unfortunately, most no-regret algorithms assume that the set Y of allowable hypotheses is small and discrete. We consider instead prediction problems where Y has internal structure: Y might be the set of strategies in a game like poker, the set of paths in a graph, or the set of configurations of a data structure like a rebalancing binary search tree; or Y might be a given convex set (the 'online convex programming' problem) or, more generally, an arbitrary bounded set. We derive a family of no-regret learning rules, called Lagrangian Hedging algorithms, to take advantage of this structure. Our algorithms are a direct generalization of known no-regret learning rules like weighted majority and external-regret matching. In addition to proving regret bounds, we demonstrate one of our algorithms learning to play one-card poker."
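For context, the discrete-hypothesis baseline the abstract contrasts against is the multiplicative-weights family, of which weighted majority (Hedge) is the canonical member. The Python sketch below illustrates that classical update on a small discrete hypothesis set; it is not the paper's Lagrangian Hedging algorithm, and the function name, learning rate eta, and loss encoding are our own illustrative choices.

```python
import numpy as np

def hedge(losses, eta=0.1):
    """Exponential-weights (Hedge) update over a small discrete
    hypothesis set.  losses is a (T, K) array: losses[t, k] is the
    loss of hypothesis k at round t.  Returns the (T, K) sequence of
    weight vectors played.  Illustrative sketch only."""
    T, K = losses.shape
    w = np.ones(K) / K            # start from the uniform distribution
    history = np.empty((T, K))
    for t in range(T):
        history[t] = w
        # Multiplicative-weights step: down-weight each hypothesis
        # in proportion to the loss it just incurred.
        w = w * np.exp(-eta * losses[t])
        w = w / w.sum()           # renormalize to a distribution
    return history

# Usage: the algorithm's cumulative loss tracks the best fixed
# hypothesis in hindsight, so the regret grows sublinearly in T.
rng = np.random.default_rng(0)
losses = rng.uniform(size=(1000, 5))
weights = hedge(losses)
alg_loss = (weights * losses).sum()
best_fixed = losses.sum(axis=0).min()
print(f"regret: {alg_loss - best_fixed:.2f}")
```

Updates of this form assume the weight vector can be stored and renormalized explicitly, which is exactly what breaks down when Y is a structured or continuous set as described in the abstract.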