Solving Games with Functional Regret Estimation

Waugh, Kevin; Morrill, Dustin; Bagnell, James; Bowling, Michael

oai:ojs.aaai.org:article/9445

Authors: Kevin Waugh, Dustin Morrill, James Bagnell, Michael Bowling
Publication date: 18 February 2015
Publisher: Association for the Advancement of Artificial Intelligence
Abstract

We propose a novel online learning method for minimizing regret in large extensive-form games. The approach learns a function approximator online to estimate the regret for choosing a particular action. A no-regret algorithm uses these estimates in place of the true regrets to define a sequence of policies. We prove the approach sound by providing a bound relating the quality of the function approximation to the regret of the algorithm. A corollary is that the method is guaranteed to converge to a Nash equilibrium in self-play so long as the regrets are ultimately realizable by the function approximator. Our technique can be understood as a principled generalization of existing work on abstraction in large games; in our work, both the abstraction and the equilibrium are learned during self-play. We demonstrate empirically that the method achieves higher-quality strategies than state-of-the-art abstraction techniques given the same resources.
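The loop described in the abstract (estimate regrets with a regressor, then feed the estimates to a no-regret algorithm) can be illustrated on a toy problem. The sketch below is not the authors' implementation: it assumes regret matching as the no-regret algorithm, a small zero-sum matrix game instead of an extensive-form game, and a hypothetical ridge-regression estimator (`LinearRegretEstimator`) over per-action feature vectors. All names, the payoff matrix, and the feature choice are illustrative assumptions.

```python
import numpy as np


def regret_matching_policy(estimated_regrets):
    """Play each action in proportion to its positive estimated regret."""
    positive = np.maximum(estimated_regrets, 0.0)
    total = positive.sum()
    if total > 0:
        return positive / total
    # No action has positive estimated regret: fall back to uniform play.
    return np.full(len(estimated_regrets), 1.0 / len(estimated_regrets))


class LinearRegretEstimator:
    """Hypothetical regressor: ridge least squares from action features
    to the running average of the observed instantaneous regrets."""

    def __init__(self, features, ridge=1e-6):
        self.features = features                      # (n_actions, n_features)
        self.ridge = ridge
        self.targets = np.zeros(features.shape[0])    # average regret per action
        self.weights = np.zeros(features.shape[1])
        self.count = 0

    def update(self, instantaneous_regrets):
        self.count += 1
        self.targets += (instantaneous_regrets - self.targets) / self.count
        gram = self.features.T @ self.features
        gram += self.ridge * np.eye(self.features.shape[1])
        self.weights = np.linalg.solve(gram, self.features.T @ self.targets)

    def predict(self):
        return self.features @ self.weights


def self_play(payoff, row_features, col_features, iterations=5000):
    """Self-play in a zero-sum matrix game; returns the average policies."""
    row_est = LinearRegretEstimator(row_features)
    col_est = LinearRegretEstimator(col_features)
    row_sum = np.zeros(payoff.shape[0])
    col_sum = np.zeros(payoff.shape[1])
    for _ in range(iterations):
        x = regret_matching_policy(row_est.predict())
        y = regret_matching_policy(col_est.predict())
        row_values = payoff @ y          # expected payoff of each row action
        col_values = -(payoff.T @ x)     # column player receives the negation
        row_est.update(row_values - x @ row_values)
        col_est.update(col_values - y @ col_values)
        row_sum += x
        col_sum += y
    return row_sum / iterations, col_sum / iterations


def exploitability(payoff, x, y):
    """Sum of both players' best-response gains; zero at a Nash equilibrium."""
    return float(np.max(payoff @ y) - np.min(x @ payoff))


# Illustrative payoff matrix (an assumption, not taken from the paper).
PAYOFF = np.array([[0.0, -1.0, 2.0],
                   [1.0, 0.0, -1.0],
                   [-2.0, 1.0, 0.0]])
# One-hot features make every regret vector realizable by the regressor.
x_avg, y_avg = self_play(PAYOFF, np.eye(3), np.eye(3))
print(exploitability(PAYOFF, x_avg, y_avg))
```

With one-hot features the regressor can represent any regret vector, so this reduces to ordinary regret matching, which corresponds to the realizable case mentioned in the abstract. Replacing the one-hot features with a coarser feature matrix forces actions to share regret estimates, which is one way to read the abstract's remark that the approach generalizes abstraction.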

Last time updated on 20/02/2021

This paper was published in Association for the Advancement of Artificial Intelligence: AAAI Publications.
