Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

Song, Yuhang; Wang, Jianyi; Lukasiewicz, Thomas; Xu, Zhenghua; Zhang, Shangtong; Wojcicki, Andrzej; Xu, Mai

Repository landing page

oai:ojs.aaai.org:article/6040

Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

Authors: Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Shangtong Zhang
Andrzej Wojcicki
Mai Xu
Publication date: 3 April 2020
Publisher: Association for the Advancement of Artificial Intelligence
Doi

Abstract

Intrinsic rewards were introduced to simulate how human intelligence works; they are usually evaluated by intrinsically-motivated play, i.e., playing games without extrinsic rewards but evaluated with extrinsic rewards. However, none of the existing intrinsic reward approaches can achieve human-level performance under this very challenging setting of intrinsically-motivated play. In this work, we propose a novel megalomania-driven intrinsic reward (called mega-reward), which, to our knowledge, is the first approach that achieves human-level performance in intrinsically-motivated play. Intuitively, mega-reward comes from the observation that infants' intelligence develops when they try to gain more control on entities in an environment; therefore, mega-reward aims to maximize the control capabilities of agents on given entities in a given environment. To formalize mega-reward, a relational transition model is proposed to bridge the gaps between direct and latent control. Experimental studies show that mega-reward (i) can greatly outperform all state-of-the-art intrinsic reward approaches, (ii) generally achieves the same level of performance as Ex-PPO and professional human-level scores, and (iii) has also a superior performance when it is incorporated with extrinsic rewards

Similar works

Full text

Association for the Advancement of Artificial Intelligence: AAAI Publications

oai:ojs.aaai.org:article/6040

Last time updated on 30/11/2020

This paper was published in Association for the Advancement of Artificial Intelligence: AAAI Publications.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.