UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

AID-RL: Active information-directed reinforcement learning for autonomous source seeking and estimation

Li, Z; Chen, WH; Yang, J; Yan, Y; (2023) AID-RL: Active information-directed reinforcement learning for autonomous source seeking and estimation. Neurocomputing , 544 , Article 126281. 10.1016/j.neucom.2023.126281. (In press). Green open access

[thumbnail of 1-s2.0-S0925231223004046-main.pdf]
Preview
Text
1-s2.0-S0925231223004046-main.pdf - Published Version

Download (1MB) | Preview

Abstract

This paper proposes an active information-directed reinforcement learning (AID-RL) framework for autonomous source seeking and estimation problem. Source seeking requires the search agent to move towards the true source, and source estimation demands the agent to maintain and update its knowledge regarding the source properties such as release rate and source position. These two objectives give rise to the newly developed framework, namely, dual control for exploration and exploitation. In this paper, the greedy RL forms an exploitation search strategy that navigates the agent to the source position, while the information-directed search commands the agent to explore most informative positions to reduce belief uncertainty. Extensive results are presented using a high-fidelity dataset for autonomous search, which validates the effectiveness of the proposed AID-RL and highlights the importance of active exploration in improving sampling efficiency and search performance.

Type: Article
Title: AID-RL: Active information-directed reinforcement learning for autonomous source seeking and estimation
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.neucom.2023.126281
Publisher version: https://doi.org/10.1016/j.neucom.2023.126281
Language: English
Additional information: © 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Keywords: Autonomous search, Reinforcement learning, Dual control, Active learning, Exploration, Exploitation
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10170728
Downloads since deposit
45Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item