TY - JOUR
T1 - Optimal Transmission Scheduling Over Multihop Networks
T2 - Structural Results and Reinforcement Learning
AU - Yang, Lixin
AU - Xu, Yong
AU - Lv, Weijun
AU - Li, Jun Yi
AU - Shi, Ling
N1 - Publisher Copyright:
© 1963-2012 IEEE.
PY - 2024/3/1
Y1 - 2024/3/1
N2 - This article studies the optimal transmission scheduling for remote state estimation over multihop networks. A smart sensor observes a dynamic system, and sends its local state estimate to a remote estimator (RE). To save energy, multihop networks are deployed to relay data packets from the smart sensor to the RE. The smart sensor needs to decide the hop number communicating with the RE by adjusting its transmission power. To minimize the estimation error and the energy consumption, the transmission scheduling is formulated as a modified Markov decision process (MDP) by incorporating historical actions into the state. A sufficient condition is constructed to guarantee that the MDP has an optimal deterministic and stationary policy. The optimal policy's structure is further obtained to reduce the computation complexity. A deep reinforcement learning algorithm, i.e., dueling double Q-network, is introduced to obtain a near-optimal policy. Finally, a simulation example is provided to illustrate the developed results.
AB - This article studies the optimal transmission scheduling for remote state estimation over multihop networks. A smart sensor observes a dynamic system, and sends its local state estimate to a remote estimator (RE). To save energy, multihop networks are deployed to relay data packets from the smart sensor to the RE. The smart sensor needs to decide the hop number communicating with the RE by adjusting its transmission power. To minimize the estimation error and the energy consumption, the transmission scheduling is formulated as a modified Markov decision process (MDP) by incorporating historical actions into the state. A sufficient condition is constructed to guarantee that the MDP has an optimal deterministic and stationary policy. The optimal policy's structure is further obtained to reduce the computation complexity. A deep reinforcement learning algorithm, i.e., dueling double Q-network, is introduced to obtain a near-optimal policy. Finally, a simulation example is provided to illustrate the developed results.
KW - Estimation
KW - Kalman filtering
KW - sensor networks
KW - transmission scheduling
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:001179005900042
UR - https://openalex.org/W4388262692
UR - https://www.scopus.com/pages/publications/85176364382
U2 - 10.1109/TAC.2023.3327622
DO - 10.1109/TAC.2023.3327622
M3 - Journal Article
SN - 0018-9286
VL - 69
SP - 1826
EP - 1833
JO - IEEE Transactions on Automatic Control
JF - IEEE Transactions on Automatic Control
IS - 3
ER -