TY - JOUR
T1 - Learning optimal scheduling policy for remote state estimation under uncertain channel condition
AU - Wu, Shuang
AU - Ren, Xiaoqiang
AU - Jia, Qing Shan
AU - Johansson, Karl Henrik
AU - Shi, Ling
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2020/6
Y1 - 2020/6
N2 - We consider optimal sensor scheduling with unknown communication channel statistics. We formulate two types of scheduling problems with the communication rate being a soft or hard constraint, respectively. We first present some structural results on the optimal scheduling policy using dynamic programming and assuming that the channel statistics is known. We prove that the Q-factor is monotonic and submodular, which leads to thresholdlike structures in both problems. Then, we develop a stochastic approximation and parameter learning frameworks to deal with the two scheduling problems with unknown channel statistics. We utilize their structures to design specialized learning algorithms. We, then prove the convergence of these algorithms. Performance improvement compared with the standard Q-learning algorithm is shown through numerical examples, which will also discuss an alternative method based on recursive estimation of the channel quality.
AB - We consider optimal sensor scheduling with unknown communication channel statistics. We formulate two types of scheduling problems with the communication rate being a soft or hard constraint, respectively. We first present some structural results on the optimal scheduling policy using dynamic programming and assuming that the channel statistics is known. We prove that the Q-factor is monotonic and submodular, which leads to thresholdlike structures in both problems. Then, we develop a stochastic approximation and parameter learning frameworks to deal with the two scheduling problems with unknown channel statistics. We utilize their structures to design specialized learning algorithms. We, then prove the convergence of these algorithms. Performance improvement compared with the standard Q-learning algorithm is shown through numerical examples, which will also discuss an alternative method based on recursive estimation of the channel quality.
KW - Learning algorithm
KW - scheduling
KW - state estimation
KW - threshold structure
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000549872800005
UR - https://openalex.org/W3016268025
UR - https://www.scopus.com/pages/publications/85076396692
U2 - 10.1109/TCNS.2019.2959162
DO - 10.1109/TCNS.2019.2959162
M3 - Journal Article
SN - 2325-5870
VL - 7
SP - 579
EP - 591
JO - IEEE Transactions on Control of Network Systems
JF - IEEE Transactions on Control of Network Systems
IS - 2
M1 - 8930919
ER -