Age Of Semantics In Cooperative Communications: To Expedite Simulation Towards Real Via Offline Reinforcement Learning
2022 Β· Xianfu Chen, Zhifeng Zhao, Shiwen Mao, et al.
Abstract
The age of information metric fails to correctly describe the intrinsic semantics of a status update. In an intelligent reflecting surface-aided cooperative relay communication system, we propose the age of semantics (AoS) for measuring semantics freshness of the status updates. Specifically, we focus on the status updating from a source node (SN) to the destination, which is formulated as a Markov decision process (MDP). The objective of the SN is to maximize the expected satisfaction of AoS and energy consumption under the maximum transmit power constraint. To seek the optimal control policy, we first derive an online deep actor-critic (DAC) learning scheme under the on-policy temporal difference learning framework. However, implementing the online DAC in practice poses the key challenge in infinitely repeated interactions between the SN and the system, which can be dangerous particularly during the exploration. We then put forward a novel offline DAC scheme, which estimates the opti
Authors
(none)
Tags
Stats
Related papers
- Diffusion Model-based Reinforcement Learning For Version Age Of Information Scheduling: Average And Tail-risk-sensitive Control (2026)0.00
- Beyond Freshness And Semantics: A Coupon-collector Framework For Effective Status Updates (2026)0.00
- Scalable Semantic Non-markovian Simulation Proxy For Reinforcement Learning (2023)0.00
- Efficient Communication Via Self-supervised Information Aggregation For Online And Offline Multi-agent Reinforcement Learning (2023)6.34
- Decomposing Communication Gain And Delay Cost Under Cross-timestep Delays In Cooperative Multi-agent Reinforcement Learning (2026)0.00
- Towards Data-driven Offline Simulations For Online Reinforcement Learning (2022)0.00
- DACOM: Learning Delay-aware Communication For Multi-agent Reinforcement Learning (2022)0.00
- Optimizing The Long-term Average Reward For Continuing Mdps: A Technical Report (2021)0.00