The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality

Abstract

The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have recently been expanded by replacing the ML estimates with their corresponding Bayesian counterparts, which incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA) and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ε-optimal convergence have remained open. This is precisely the intent of this paper: we present a single unifying analysis by which the ε-optimality of both the continuous and discretized schemes is proven. We emphasize that, unlike the ML-based pursuit schemes, the Bayesian schemes have to consider not only the estimates themselves but also the distributional forms of their conjugate posteriors and their higher-order moments, all of which render the proofs particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature; they are both pioneering and novel.
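
For readers unfamiliar with the pursuit family, the sketch below illustrates the kind of update the abstract refers to. It is a minimal, hypothetical rendering of one continuous Bayesian pursuit step, assuming Bernoulli rewards with Beta conjugate posteriors, posterior 95th percentiles as the Bayesian reward estimates, and a linear pursuit move with rate LAMBDA toward the currently best-estimated action. These concrete choices, names, and parameter values are illustrative assumptions, not the authors' exact formulation.

```python
import numpy as np
from scipy.stats import beta

# Illustrative sketch of a continuous Bayesian pursuit update (assumptions, not
# the paper's exact scheme): Bernoulli rewards, Beta(1, 1) priors, posterior
# 95th percentiles as the Bayesian estimates, and a linear pursuit step of
# size LAMBDA toward the action with the highest current estimate.

LAMBDA = 0.05                               # pursuit learning rate (assumed value)
N_ACTIONS = 4

p = np.full(N_ACTIONS, 1.0 / N_ACTIONS)     # action-selection probability vector
alpha = np.ones(N_ACTIONS)                  # Beta posterior parameters: rewards + 1
betaparam = np.ones(N_ACTIONS)              # Beta posterior parameters: penalties + 1

def step(environment_reward):
    """One interaction: select an action, observe a reward, update the automaton."""
    global p
    action = np.random.choice(N_ACTIONS, p=p)
    reward = environment_reward(action)     # 1 on reward, 0 on penalty

    # Bayesian update of the conjugate Beta posterior for the chosen action.
    alpha[action] += reward
    betaparam[action] += 1 - reward

    # Bayesian reward estimates: upper percentiles of each posterior (assumed choice).
    estimates = beta.ppf(0.95, alpha, betaparam)
    best = np.argmax(estimates)

    # Pursuit update: move the probability vector toward the unit vector e_best.
    e_best = np.zeros(N_ACTIONS)
    e_best[best] = 1.0
    p = (1 - LAMBDA) * p + LAMBDA * e_best
    return action, reward

# Example usage: a stationary environment with fixed (assumed) reward probabilities.
true_probs = np.array([0.2, 0.5, 0.8, 0.4])
for _ in range(10_000):
    step(lambda i: np.random.binomial(1, true_probs[i]))
```

Loosely speaking, the discretized variant replaces the continuous pursuit step above with jumps in fixed multiples of a resolution parameter, which is the difference between the two schemes whose ε-optimality the paper analyzes.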

This paper was published in the Agder University Research Archive.
