Model Learning for Look-Ahead Exploration in Continuous Control

Agarwal, Arpit; Muelling, Katharina; Fragkiadaki, Katerina

Repository landing page

oai:ojs.aaai.org:article/4181

Model Learning for Look-Ahead Exploration in Continuous Control

Authors: Arpit Agarwal
Katharina Muelling
Katerina Fragkiadaki
Publication date: 17 July 2019
Publisher: Association for the Advancement of Artificial Intelligence
Doi

Abstract

We propose an exploration method that incorporates lookahead search over basic learnt skills and their dynamics, and use it for reinforcement learning (RL) of manipulation policies. Our skills are multi-goal policies learned in isolation in simpler environments using existing multigoal RL formulations, analogous to options or macroactions. Coarse skill dynamics, i.e., the state transition caused by a (complete) skill execution, are learnt and are unrolled forward during lookahead search. Policy search benefits from temporal abstraction during exploration, though itself operates over low-level primitive actions, and thus the resulting policies does not suffer from suboptimality and inflexibility caused by coarse skill chaining. We show that the proposed exploration strategy results in effective learning of complex manipulation policies faster than current state-of-the-art RL methods, and converges to better policies than methods that use options or parametrized skills as building blocks of the policy itself, as opposed to guiding exploration. We show that the proposed exploration strategy results in effective learning of complex manipulation policies faster than current state-of-the-art RL methods, and converges to better policies than methods that use options or parameterized skills as building blocks of the policy itself, as opposed to guiding exploration

Similar works

Full text

Association for the Advancement of Artificial Intelligence: AAAI Publications

oai:ojs.aaai.org:article/4181

Last time updated on 30/11/2020

This paper was published in Association for the Advancement of Artificial Intelligence: AAAI Publications.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.