Batched Bandit Problems

Perchet, Vianney; Rigollet, Philippe; Chassang, Sylvain; Snowberg, Erik

Repository landing page

research

oai:dspace.mit.edu:1721.1/98879

Batched Bandit Problems

Authors: Vianney Perchet
Philippe Rigollet
Sylvain Chassang
Erik Snowberg
Publication date: 24 September 2015
Publisher: 'Institute of Mathematical Statistics'

Abstract

Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. Our results show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optimal policies with low switching cost for stochastic bandits.National Science Foundation (U.S.) (Grant DMS-1317308)National Science Foundation (U.S.) (CAREER-DMS-1053987)Meimaris Famil

Similar works

Full text

Open in the Core reader

Download PDF

DSpace@MIT

oai:dspace.mit.edu:1721.1/9887...

Last time updated on 26/02/2017

This paper was published in DSpace@MIT.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.