posted on 1971-01-01, 00:00authored byAvrim Blum, Eyel Even-Dar, Katrina Ligett
There has been substantial work developing simple, efficient no-regret algorithms for a
wide class of repeated decision-making problems including online routing. These are adaptive strategies
an individual can use that give strong guarantees on performance even in adversarially-changing
environments. There has also been substantial work on analyzing properties of Nash equilibria in
routing games. In this paper, we consider the question: if each player in a routing game uses a noregret
strategy, will behavior converge to a Nash equilibrium? In general games the answer to this
question is known to be no in a strong sense, but routing games have substantially more structure.
In this paper we show that in the Wardrop setting of multicommodity flow and infinitesimal
agents, behavior will approach Nash equilibrium (formally, on most days, the cost of the flow will be
close to the cost of the cheapest paths possible given that flow) at a rate that depends polynomially
on the players’ regret bounds and the maximum slope of any latency function. We also show that
price-of-anarchy results may be applied to these approximate equilibria, and also consider the finitesize
(non-infinitesimal) load-balancing model of Azar [2]. Our nonatomic results also apply to a more
general class of games known as congestion games.