An Index Policy for Minimizing the Uncertainty-of-Information of Markov Sources

IEEE TRANSACTIONS ON INFORMATION THEORY(2024)

引用 0|浏览19
暂无评分
摘要
This paper focuses on the information freshness of finite-state Markov sources, using the uncertainty of information (UoI) as the performance metric. Measured by Shannon's entropy, UoI can capture not only the transition dynamics of the Markov source but also the different evolutions of information quality caused by the different values of the last observation. We consider an information update system with M finite state Markov sources transmitting information to a remote monitor via m communication channels (1 <= m < M). At each time, only m Markov sources can be selected to transmit their latest information to the remote monitor. Our goal is to explore the optimal scheduling policy to minimize the sum-UoI of the Markov sources. The problem is formulated as a restless multi-armed bandit (RMAB). We relax the RMAB and then decouple the relaxed problem into M single bandit problems. Importantly, analyzing the single bandit problem provides useful properties with which the relaxed problem reduces to maximizing a concave and piecewise linear function, allowing us to develop a gradient method to solve the relaxed problem and obtain its optimal policy. By rounding up the optimal policy for the relaxed problem, we obtain an index policy for the original RMAB problem. Notably, the proposed index policy is universal in the sense that it applies to general RMABs with bounded cost functions. Moreover, we show that our policy is asymptotically optimal as m and M tend to infinity with m/M fixed. In non asymptotic cases, numerical results demonstrate that our index policy is near-optimal and performs as well as the celebrated Whittle index policy in the problems that are Whittle-indexable. Unlike the Whittle index policy, our index policy does not require "indexability"; the indices can be computed regardless of indexability in the Whittle's sense. Thus, our index policy is a promising alternative method for the class of RMABs of concern: it can be used when the Whittle index policy is not viable and it performs as well as the Whittle index policy even when the Whittle index policy is viable.
更多
查看译文
关键词
Markov processes,Indexes,Measurement,Monitoring,Optimal scheduling,Real-time systems,Uncertainty,Uncertainty of information,RMAB,information freshness,AoI,asymptotically optimal policy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要