Evolving inborn knowledge for fast adaptation in dynamic POMDP problems

Eseoghene Ben-Iwhiwhu, Pawel Ladosz, Jeffery Dick, Wen Hua Chen, Praveen Pilly, Andrea Soltoggio

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

3 Citations (Scopus)

Abstract

Rapid online adaptation to changing tasks is an important problem in machine learning and, recently, a focus of meta-reinforcement learning. However, reinforcement learning (RL) algorithms struggle in POMDP environments because the state of the system, essential in a RL framework, is not always visible. Additionally, hand-designed meta-RL architectures may not include suitable computational structures for specific learning problems. The evolution of online learning mechanisms, on the contrary, has the ability to incorporate learning strategies into an agent that can (i) evolve memory when required and (ii) optimize adaptation speed to specific online learning problems. In this paper, we exploit the highly adaptive nature of neuromodulated neural networks to evolve a controller that uses the latent space of an autoencoder in a POMDP. The analysis of the evolved networks reveals the ability of the proposed algorithm to acquire inborn knowledge in a variety of aspects such as the detection of cues that reveal implicit rewards, and the ability to evolve location neurons that help with navigation. The integration of inborn knowledge and online plasticity enabled fast adaptation and better performance in comparison to some non-evolutionary meta-reinforcement learning algorithms. The algorithm proved also to succeed in the 3D gaming environment Malmo Minecraft.

Original languageEnglish
Title of host publicationGECCO 2020 - Proceedings of the 2020 Genetic and Evolutionary Computation Conference
PublisherAssociation for Computing Machinery
Pages280-288
Number of pages9
ISBN (Electronic)9781450371285
DOIs
Publication statusPublished - 25 Jun 2020
Event2020 Genetic and Evolutionary Computation Conference, GECCO 2020 - Cancun, Mexico
Duration: 8 Jul 202012 Jul 2020

Publication series

NameGECCO 2020 - Proceedings of the 2020 Genetic and Evolutionary Computation Conference

Conference

Conference2020 Genetic and Evolutionary Computation Conference, GECCO 2020
Country/TerritoryMexico
CityCancun
Period8/07/2012/07/20

Keywords

  • Adaptive agent
  • Few-shots learning
  • Hebbian learning
  • Lifelong learning
  • Neuroevolution
  • Neuromodulation
  • Self modifying network

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Theoretical Computer Science

Fingerprint

Dive into the research topics of 'Evolving inborn knowledge for fast adaptation in dynamic POMDP problems'. Together they form a unique fingerprint.

Cite this