Skip to main navigation Skip to search Skip to main content

Remote State Estimation with Discounted Multi-Armed Bandits for Non-Stationary Channel Selection

  • Jiuzhou ZHANG
  • , Wei HUO
  • , Xiaomeng CHEN
  • , Daniel E. Quevedo
  • , Ling SHI

Research output: Contribution to conferenceConference Paperpeer-review

Abstract

This paper addresses the problem of optimal channel selection for remote state estimation in cyber-physical systems, where a sensor transmits measurements over multiple time-varying wireless channels. We model the packet arrival probability of each channel as a non-stationary Bernoulli process and propose two discounted Multi-Armed Bandit (MAB) algorithms-Discounted Upper Confidence Bound (D-UCB) and Discounted Thompson Sampling (D-TS) to select channels with the highest expected packet arrival rates adaptively. The estimation error covariance is analyzed using Kalman filtering, and the cumulative estimation regret is defined as the excess trace of the estimation error covariance compared to an optimal policy. Theoretical analysis shows the algorithms achieve a regret that grows gradually over time, and numerical simulations validate its effectiveness under non-stationary conditions.
Original languageEnglish
Publication statusPublished - 2025
Event64th IEEE Conference on Decision and Control - Windsor Convention Center, Rio de Janeiro, Brazil
Duration: 10 Dec 202512 Dec 2025
https://cdc2025.ieeecss.org/ (Conference website)

Conference

Conference64th IEEE Conference on Decision and Control
Country/TerritoryBrazil
CityRio de Janeiro
Period10/12/2512/12/25
Internet address

Keywords

  • Linear systems
  • Estimatin
  • Kalman filtering

Fingerprint

Dive into the research topics of 'Remote State Estimation with Discounted Multi-Armed Bandits for Non-Stationary Channel Selection'. Together they form a unique fingerprint.

Cite this