Estimating Reward Function from Medial Prefrontal Cortex Cortical Activity using Inverse Reinforcement Learning

Jieyuan Tan, Xiang Shen, Xiang Zhang, Zhiwei Song, Yiwen Wang*

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/Report › Conference Paper published in a book › peer-review

Abstract

Reinforcement learning (RL)-based brain-machine interfaces (BMIs) learn the mapping from neural signals to the subject's intention using a reward signal. In the existing RL-based BMI framework, external rewards (water or food) or internal rewards extracted from neural activity are used to update the decoder parameters. For complex tasks, however, an external reward can be difficult to design and may not fully reflect the subject's own internal evaluation. It is therefore important to obtain an internal reward model from neural activity that gives access to the subject's internal evaluation while the subject performs the task through trial and error. In this paper, we propose using an inverse reinforcement learning (IRL) method to estimate the internal reward function from brain activity and to use it to assist the update of the decoder. Specifically, the inverse Q-learning (IQL) algorithm is applied to extract internal reward information from real data recorded from the medial prefrontal cortex (mPFC) while a rat was learning a two-lever-press discrimination task. This internal reward information is validated by checking whether it can guide the training of the RL decoder to complete the movement task. Compared with the RL decoder trained with the external reward, our approach achieves similar decoding performance. This preliminary result validates the effectiveness of using IRL to obtain the internal reward model and reveals the potential of estimating an internal reward model to improve the design of autonomous learning BMIs.
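The validation idea in the abstract (estimate a reward, then check that it can train a decoder as well as the external reward does) can be illustrated with a toy example. The sketch below is not the IQL algorithm or the pipeline used in the paper: it assumes a single-step task, a softmax (Boltzmann) choice policy with unit temperature, and reward estimated from observed choice counts rather than from mPFC activity. The function names and the choice counts are hypothetical.

```python
import math

def estimate_reward(counts, eps=1.0):
    """Estimate a reward table from choice counts (assumed softmax policy).

    counts[s][a] is how often action a was chosen in state s. In a
    single-step task with a unit-temperature softmax policy, the reward
    equals log pi(a|s) up to a per-state constant, so we return the
    smoothed log-probabilities shifted so the best action scores 0.
    """
    r_hat = []
    for row in counts:
        total = sum(row) + eps * len(row)          # add-eps smoothing
        log_pi = [math.log((c + eps) / total) for c in row]
        top = max(log_pi)
        r_hat.append([lp - top for lp in log_pi])
    return r_hat

def train_decoder(reward, n_trials=200, alpha=0.1):
    """Tabular single-step Q-learning driven by the given reward table.

    States and actions are cycled deterministically so that every
    state-action pair is visited equally often.
    """
    n_states, n_actions = len(reward), len(reward[0])
    q = [[0.0] * n_actions for _ in range(n_states)]
    for t in range(n_trials):
        s = t % n_states
        a = (t // n_states) % n_actions
        q[s][a] += alpha * (reward[s][a] - q[s][a])
    return q

def greedy_policy(q):
    """Pick the highest-valued action in each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]

# Hypothetical data: rows are cues (states), columns are levers (actions).
counts = [[45, 5], [8, 42]]
external_reward = [[1.0, 0.0], [0.0, 1.0]]

r_hat = estimate_reward(counts)
policy_internal = greedy_policy(train_decoder(r_hat))
policy_external = greedy_policy(train_decoder(external_reward))
print(policy_internal, policy_external)
```

In this toy setting the decoder trained on the estimated reward selects the same lever for each cue as the one trained on the external reward, which mirrors the kind of comparison the abstract describes, under much simpler assumptions.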

Original language: English
Title of host publication: 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 3346-3349
Number of pages: 4
ISBN (Electronic): 9781728127828
DOIs
Publication status: Published - 2022
Event: 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022 - Glasgow, United Kingdom
Duration: 11 Jul 2022 → 15 Jul 2022

Publication series

Name: Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Volume: 2022-July
ISSN (Print): 1557-170X

Conference

Conference: 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022
Country/Territory: United Kingdom
City: Glasgow
Period: 11/07/22 → 15/07/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Keywords

  • brain-machine interface
  • internal reward
  • inverse reinforcement learning
