Abstract
This paper addresses the problem of optimal channel selection for remote state estimation in cyber-physical systems, where a sensor transmits measurements over multiple time-varying wireless channels. We model the packet arrival probability of each channel as a non-stationary Bernoulli process and propose two discounted Multi-Armed Bandit (MAB) algorithms-Discounted Upper Confidence Bound (D-UCB) and Discounted Thompson Sampling (D-TS) to select channels with the highest expected packet arrival rates adaptively. The estimation error covariance is analyzed using Kalman filtering, and the cumulative estimation regret is defined as the excess trace of the estimation error covariance compared to an optimal policy. Theoretical analysis shows the algorithms achieve a regret that grows gradually over time, and numerical simulations validate its effectiveness under non-stationary conditions.
| Original language | English |
|---|---|
| Publication status | Published - 2025 |
| Event | 64th IEEE Conference on Decision and Control - Windsor Convention Center, Rio de Janeiro, Brazil Duration: 10 Dec 2025 → 12 Dec 2025 https://cdc2025.ieeecss.org/ (Conference website) |
Conference
| Conference | 64th IEEE Conference on Decision and Control |
|---|---|
| Country/Territory | Brazil |
| City | Rio de Janeiro |
| Period | 10/12/25 → 12/12/25 |
| Internet address |
|
Keywords
- Linear systems
- Estimatin
- Kalman filtering
Fingerprint
Dive into the research topics of 'Remote State Estimation with Discounted Multi-Armed Bandits for Non-Stationary Channel Selection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver