Abstract
Reinforcement learning (RL)-based brain-machine interfaces (BMIs) learn the mapping from neural signals to subjects' intention using a reward signal. External rewards (water or food) or internal rewards extracted from neural activity are leveraged to update the parameters of decoders in the existing RL-based BMI framework. However, for complex tasks, the design of external reward could be difficult, which may not fully reflect the subject's own evaluation internally. It is important to obtain an internal reward model from neural activity to access subject's internal evaluation when the subject is performing the task through trial and error. In this paper, we propose to use an inverse reinforcement learning (IRL) method to estimate the internal reward function interpreted from the brain to assist the update of the decoders. Specifically, the inverse Q-learning (IQL) algorithm is applied to extract internal reward information from real data collected from medial prefrontal cortex (mPFC) when a rat was learning a two-lever-press discrimination task. Such an internal reward information is validated by checking whether it can guide the training of the RL decoder to complete movement task. Compared with the RL decoder trained with the external reward, our approach achieves a similar decoding performance. This preliminary result validates the effectiveness of using IRL to obtain the internal reward model. It reveals the potential of estimating internal reward model to improve the design of autonomous learning BMIs.
| Original language | English |
|---|---|
| Title of host publication | 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 3346-3349 |
| Number of pages | 4 |
| ISBN (Electronic) | 9781728127828 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022 - Glasgow, United Kingdom Duration: 11 Jul 2022 → 15 Jul 2022 |
Publication series
| Name | Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS |
|---|---|
| Volume | 2022-July |
| ISSN (Print) | 1557-170X |
Conference
| Conference | 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022 |
|---|---|
| Country/Territory | United Kingdom |
| City | Glasgow |
| Period | 11/07/22 → 15/07/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE.
Keywords
- brain-machine interface
- internal reward
- inverse reinforcement learning
Fingerprint
Dive into the research topics of 'Estimating Reward Function from Medial Prefrontal Cortex Cortical Activity using Inverse Reinforcement Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver