Journal Home > Volume 1 , Issue 3

The video transmission in the Internet-of-Things (IoT) system must guarantee the video quality and reduce the packet loss rate and the delay with limited resources to satisfy the requirement of multimedia services. In this paper, we propose a reinforcement learning based energy-efficient IoT video transmission scheme that protects against interference, in which the base station controls the transmission action of the IoT device including the encoding rate, the modulation and coding scheme, and the transmit power. A reinforcement learning algorithm state-action-reward-state-action is applied to choose the transmission action based on the observed state (the queue length of the buffer, the channel gain, the previous bit error rate, and the previous packet loss rate) without knowledge of the transmission channel model at the transmitter and the receiver. We also propose a deep reinforcement learning based energy-efficient IoT video transmission scheme that uses a deep neural network to approximate Q value to further accelerate the learning process involved in choosing the optimal transmission action and improve the video transmission performance. Moreover, both the performance bounds of the proposed schemes and the computational complexity are theoretically derived. Simulation results show that the proposed schemes can increase the peak signal-to-noise ratio and decrease the packet loss rate, the delay, and the energy consumption relative to the benchmark scheme.


menu
Abstract
Full text
Outline
About this article

Reinforcement learning based energy-efficient internet-of-things video transmission

Show Author's information Yilin XiaoGuohang NiuLiang Xiao*( )Yuzhen DingSicong LiuYexian Fan
Department of Information and Communication Engineering, Xiamen University, Xiamen 361005, China
College of Information and Mechanical and Electrical Engineering, Ningde Normal University, Ningde 352100, China

Abstract

The video transmission in the Internet-of-Things (IoT) system must guarantee the video quality and reduce the packet loss rate and the delay with limited resources to satisfy the requirement of multimedia services. In this paper, we propose a reinforcement learning based energy-efficient IoT video transmission scheme that protects against interference, in which the base station controls the transmission action of the IoT device including the encoding rate, the modulation and coding scheme, and the transmit power. A reinforcement learning algorithm state-action-reward-state-action is applied to choose the transmission action based on the observed state (the queue length of the buffer, the channel gain, the previous bit error rate, and the previous packet loss rate) without knowledge of the transmission channel model at the transmitter and the receiver. We also propose a deep reinforcement learning based energy-efficient IoT video transmission scheme that uses a deep neural network to approximate Q value to further accelerate the learning process involved in choosing the optimal transmission action and improve the video transmission performance. Moreover, both the performance bounds of the proposed schemes and the computational complexity are theoretically derived. Simulation results show that the proposed schemes can increase the peak signal-to-noise ratio and decrease the packet loss rate, the delay, and the energy consumption relative to the benchmark scheme.

Keywords: energy efficiency, reinforcement learning, Internet-of-Things (IoT), video transmission

References(26)

[1]
W. Ji, J. C. Xu, H. X. Qiao, M. D. Zhou, and B. Liang, Visual IoT: Enabling internet of things visualization in smart cities, IEEE Network, vol. 33, no. 2, pp. 102-110, 2019.
[2]
Z. Li, Y. H. Liu, K. G. Shin, J. Liu, and Z. Yan, Interference steering to manage interference in IoT, IEEE Internet of Things Journal, vol. 6, no. 6, pp. 10 458-10 471, 2019.
[3]
Z. Z. Chen and X. Pan, An optimized rate control for low-delay H.265/HEVC, IEEE Trans. Image Process., vol. 28, no. 9, pp. 4541-4552, 2019.
[4]
J. Chakareski, Uplink scheduling of visual sensors: When view popularity matters, IEEE Trans. Commun., vol. 63, no. 2, pp. 510-519, 2015.
[5]
P. Z. Wu, P. C. Cosman, and L. B. Milstein, Resource allocation for multicarrier device-to-device video transmission: Symbol error rate analysis and algorithm design, IEEE Trans. Commun., vol. 65, no. 10, pp. 4446-4462, 2017.
[6]
D. Liu, J. Wu, H. Cui, D. D. Zhang, C. Luo, and F. Wu, Cost-distortion optimization and resource control in pseudo-analog visual communications, IEEE Transactions on Multimedia, vol. 20, no. 11, pp. 3097-3110, 2018.
[7]
M. L. Zhou, X. K. Wei, S. Q. Wang, S. Kwong, C. K. Fong, P. H. W. Wong, and W. Y. F. Yuen, Global rate-distortion optimization-based rate control for HEVC HDR coding, IEEE Trans. Circuits Syst. Video Technol., vol. 30, no. 12, pp. 4648-4662, 2020.
[8]
B. Li, H. Q. Li, L. Li, and J. L. Zhang, λ domain rate control algorithm for high efficiency video coding, IEEE Trans. Image Process., vol. 23, no. 9, pp. 3841-3854, 2014.
[9]
S. Pudlewski and T. Melodia, A tutorial on encoding and wireless transmission of compressively sampled videos, IEEE Commun. Surv. Tut., vol. 15, no. 2, pp. 754-767, 2013.
[10]
L. Xiao, H. L. Zhang, Y. L. Xiao, X. Y. Wan, S. C. Liu, L. C. Wang, and H. V. Poor, Reinforcement learning-based downlink interference control for ultra-dense small cells, IEEE Transactions on Wireless Communications, vol. 19, no. 1, pp. 423-434, 2020.
[11]
R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 2nd ed. Cambridge, MA, USA: MIT Press, 2018.
[12]
M. H. Wang, K. N. Ngan, and H. L. Li, Low-delay rate control for consistent quality using distortion-based lagrange multiplier, IEEE Trans. Image Process., vol. 25, no. 7, pp. 2943-2955, 2016.
[13]
C. L. Li, H. K. Xiong, and D. P. Wu, Delay-rate-distortion optimized rate control for end-to-end video communication over wireless channels, IEEE Trans. Circuits Syst. Video Technol., vol. 25, no. 10, pp. 1665-1681, 2015.
[14]
J. Y. Wu, B. Cheng, and M. Wang, Adaptive source-FEC coding for energy-efficient surveillance video over wireless networks, IEEE Trans. Commun., vol. 66, no. 5, pp. 2153-2168, 2018.
[15]
P. L. Li, H. H. Zhang, B. H. Zhao, and S. Rangarajan, Scalable video multicast with adaptive modulation and coding in broadband wireless data systems, IEEE/ACM Trans. Netw., vol. 20, no. 1, pp. 57-68, 2012.
[16]
J. Yoon, H. H. Zhang, S. Banerjee, and S. Rangarajan, Video multicast with joint resource allocation and adaptive modulation and coding in 4G networks, IEEE/ACM Trans. Netw., vol. 22, no. 5, pp. 1531-1544, 2014.
[17]
C. Gong and X. D. Wang, Adaptive transmission for delay-constrained wireless video, IEEE Transactions on Wireless Communications, vol. 13, no. 1, pp. 49-61, 2014.
[18]
S. Lin, F. Miao, J. B. Zhang, G. Zhou, L. Gu, T. He, J. A. Stankovic, S. Son, and G. J. Pappas, ATPC: Adaptive transmission power control for wireless sensor networks, ACM Trans. Sensor Netw., vol. 12, no. 1, p. 6, 2016.
[19]
N. Lee, X. Q. Lin, J. G. Andrews, and R. W. Heath, Power control for D2D underlaid cellular networks: Modeling, algorithms, and analysis, IEEE Journal on Selected Areas in Communications, vol. 33, no. 1, pp. 1-13, 2015.
[20]
B. Matthiesen, A. Zappone, K. L. Besser, E. A. Jorswieck, and M. Debbah, A globally optimal energy-efficient power control framework and its efficient implementation in wireless interference networks, IEEE Trans. Signal Process., vol. 68, pp. 3887-3902, 2020.
[21]
A. A. Khalek, C. Caramanis, and R. W. Heath, Delay-constrained video transmission: Quality-driven resource allocation and scheduling, IEEE Journal of Selected Topics in Signal Processing, vol. 9, no. 1, pp. 60-75, 2015.
[22]
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, et al., Human-level control through deep reinforcement learning, Nature, vol. 518, no. 7540, pp. 529-533, 2015.
[23]
D. W. Wang, L. Toni, P. C. Cosman, and L. B. Milstein, Resource allocation and performance analysis for multiuser video transmission over doubly selective channels, IEEE Transactions on Wireless Communications, vol. 14, no. 4, pp. 1954-1966, 2015.
[24]
Q. W. Liu, S. L. Zhou, and G. B. Giannakis, Cross-layer combining of adaptive modulation and coding with truncated ARQ over wireless links, IEEE Transactions on Wireless Communications, vol. 3, no. 5, pp. 1746-1755, 2004.
[25]
C. Jin, Z. Allen-Zhu, S. Bubeck, and M. I. Jordan, Is Q-learning provably efficient? in Proc. 32nd Int. Conf. Neural Information Processing Systems, Red Hook, NY, USA, 2018, pp. 4868-4878.
[26]
G. B. Ou and Y. L. Murphey, Multi-class pattern classification using neural networks, Pattern Recognition, vol. 40, no. 1, pp. 4-18, 2007.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 29 September 2020
Revised: 01 December 2020
Accepted: 08 December 2020
Published: 30 December 2020
Issue date: December 2020

Copyright

© All articles included in the journal are copyrighted to the ITU and TUP 2020

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61971366, 61671396, and 61901403), the Youth Innovation Fund of Xiamen (No. 3502Z20206039), and the Natural Science Foundation of Fujian Province of China (No. 2020J01430).

Rights and permissions

© All articles included in the journal are copyrighted to the ITU and TUP. This work is available under the CC BY-NC-ND 3.0 IGO license: https://creativecommons.org/licenses/by-nc-nd/3.0/igo/.

Return