Intelligent and Converged Networks 2020, 1(3): 258-270 https://doi.org/10.23919/ICN.2020.0021

Open Access | Issue | Published: 30 December 2020

Reinforcement learning based energy-efficient internet-of-things video transmission

Show Author's Information Hide Author's Information Yilin Xiao, Guohang Niu, Liang Xiao^{^*}(

), Yuzhen Ding, Sicong Liu, Yexian Fan

Department of Information and Communication Engineering, Xiamen University, Xiamen 361005, China

College of Information and Mechanical and Electrical Engineering, Ningde Normal University, Ningde 352100, China

Keywords:

energy efficiency, reinforcement learning, Internet-of-Things (IoT), video transmission

Cite this article:

Xiao Y, Niu G, Xiao L, et al. Reinforcement learning based energy-efficient internet-of-things video transmission. Intelligent and Converged Networks, 2020, 1(3): 258-270. https://doi.org/10.23919/ICN.2020.0021

Download citation

EndNote(RIS)

BibTeX

667

Views

Downloads

Citations

Crossref

N/A

WoS

Scopus

N/A

CSCD

Abstract Full text About this article

Abstract

The video transmission in the Internet-of-Things (IoT) system must guarantee the video quality and reduce the packet loss rate and the delay with limited resources to satisfy the requirement of multimedia services. In this paper, we propose a reinforcement learning based energy-efficient IoT video transmission scheme that protects against interference, in which the base station controls the transmission action of the IoT device including the encoding rate, the modulation and coding scheme, and the transmit power. A reinforcement learning algorithm state-action-reward-state-action is applied to choose the transmission action based on the observed state (the queue length of the buffer, the channel gain, the previous bit error rate, and the previous packet loss rate) without knowledge of the transmission channel model at the transmitter and the receiver. We also propose a deep reinforcement learning based energy-efficient IoT video transmission scheme that uses a deep neural network to approximate $Q$ value to further accelerate the learning process involved in choosing the optimal transmission action and improve the video transmission performance. Moreover, both the performance bounds of the proposed schemes and the computational complexity are theoretically derived. Simulation results show that the proposed schemes can increase the peak signal-to-noise ratio and decrease the packet loss rate, the delay, and the energy consumption relative to the benchmark scheme.

Full text

Abstract

Full text

Outline

About this article

Reinforcement learning based energy-efficient internet-of-things video transmission

Show Author's information Hide Author's Information Yilin Xiao, Guohang Niu, Liang Xiao^{^*}(

), Yuzhen Ding, Sicong Liu, Yexian Fan

Department of Information and Communication Engineering, Xiamen University, Xiamen 361005, China

College of Information and Mechanical and Electrical Engineering, Ningde Normal University, Ningde 352100, China

Abstract

Keywords: energy efficiency, reinforcement learning, Internet-of-Things (IoT), video transmission

References(26)

[1]

W. Ji, J. C. Xu, H. X. Qiao, M. D. Zhou, and B. Liang, Visual IoT: Enabling internet of things visualization in smart cities, IEEE Network, vol. 33, no. 2, pp. 102-110, 2019.

DOI Google Scholar

[2]

Z. Li, Y. H. Liu, K. G. Shin, J. Liu, and Z. Yan, Interference steering to manage interference in IoT, IEEE Internet of Things Journal, vol. 6, no. 6, pp. 10 458-10 471, 2019.

DOI Google Scholar

[3]

Z. Z. Chen and X. Pan, An optimized rate control for low-delay H.265/HEVC, IEEE Trans. Image Process., vol. 28, no. 9, pp. 4541-4552, 2019.

DOI Google Scholar

[4]

J. Chakareski, Uplink scheduling of visual sensors: When view popularity matters, IEEE Trans. Commun., vol. 63, no. 2, pp. 510-519, 2015.

DOI Google Scholar

[5]

P. Z. Wu, P. C. Cosman, and L. B. Milstein, Resource allocation for multicarrier device-to-device video transmission: Symbol error rate analysis and algorithm design, IEEE Trans. Commun., vol. 65, no. 10, pp. 4446-4462, 2017.

DOI Google Scholar

[6]

D. Liu, J. Wu, H. Cui, D. D. Zhang, C. Luo, and F. Wu, Cost-distortion optimization and resource control in pseudo-analog visual communications, IEEE Transactions on Multimedia, vol. 20, no. 11, pp. 3097-3110, 2018.

DOI Google Scholar

[7]

M. L. Zhou, X. K. Wei, S. Q. Wang, S. Kwong, C. K. Fong, P. H. W. Wong, and W. Y. F. Yuen, Global rate-distortion optimization-based rate control for HEVC HDR coding, IEEE Trans. Circuits Syst. Video Technol., vol. 30, no. 12, pp. 4648-4662, 2020.

DOI Google Scholar

[8]

B. Li, H. Q. Li, L. Li, and J. L. Zhang, λ domain rate control algorithm for high efficiency video coding, IEEE Trans. Image Process., vol. 23, no. 9, pp. 3841-3854, 2014.

DOI Google Scholar

[9]

S. Pudlewski and T. Melodia, A tutorial on encoding and wireless transmission of compressively sampled videos, IEEE Commun. Surv. Tut., vol. 15, no. 2, pp. 754-767, 2013.

DOI Google Scholar

[10]

L. Xiao, H. L. Zhang, Y. L. Xiao, X. Y. Wan, S. C. Liu, L. C. Wang, and H. V. Poor, Reinforcement learning-based downlink interference control for ultra-dense small cells, IEEE Transactions on Wireless Communications, vol. 19, no. 1, pp. 423-434, 2020.

DOI Google Scholar

[11]

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 2nd ed. Cambridge, MA, USA: MIT Press, 2018.

[12]

M. H. Wang, K. N. Ngan, and H. L. Li, Low-delay rate control for consistent quality using distortion-based lagrange multiplier, IEEE Trans. Image Process., vol. 25, no. 7, pp. 2943-2955, 2016.

DOI Google Scholar

[13]

C. L. Li, H. K. Xiong, and D. P. Wu, Delay-rate-distortion optimized rate control for end-to-end video communication over wireless channels, IEEE Trans. Circuits Syst. Video Technol., vol. 25, no. 10, pp. 1665-1681, 2015.

DOI Google Scholar

[14]

J. Y. Wu, B. Cheng, and M. Wang, Adaptive source-FEC coding for energy-efficient surveillance video over wireless networks, IEEE Trans. Commun., vol. 66, no. 5, pp. 2153-2168, 2018.

DOI Google Scholar

[15]

P. L. Li, H. H. Zhang, B. H. Zhao, and S. Rangarajan, Scalable video multicast with adaptive modulation and coding in broadband wireless data systems, IEEE/ACM Trans. Netw., vol. 20, no. 1, pp. 57-68, 2012.

DOI Google Scholar

[16]

J. Yoon, H. H. Zhang, S. Banerjee, and S. Rangarajan, Video multicast with joint resource allocation and adaptive modulation and coding in 4G networks, IEEE/ACM Trans. Netw., vol. 22, no. 5, pp. 1531-1544, 2014.

DOI Google Scholar

[17]

C. Gong and X. D. Wang, Adaptive transmission for delay-constrained wireless video, IEEE Transactions on Wireless Communications, vol. 13, no. 1, pp. 49-61, 2014.

DOI Google Scholar

[18]

S. Lin, F. Miao, J. B. Zhang, G. Zhou, L. Gu, T. He, J. A. Stankovic, S. Son, and G. J. Pappas, ATPC: Adaptive transmission power control for wireless sensor networks, ACM Trans. Sensor Netw., vol. 12, no. 1, p. 6, 2016.

DOI Google Scholar

[19]

N. Lee, X. Q. Lin, J. G. Andrews, and R. W. Heath, Power control for D2D underlaid cellular networks: Modeling, algorithms, and analysis, IEEE Journal on Selected Areas in Communications, vol. 33, no. 1, pp. 1-13, 2015.

DOI Google Scholar

[20]

B. Matthiesen, A. Zappone, K. L. Besser, E. A. Jorswieck, and M. Debbah, A globally optimal energy-efficient power control framework and its efficient implementation in wireless interference networks, IEEE Trans. Signal Process., vol. 68, pp. 3887-3902, 2020.

DOI Google Scholar

[21]

A. A. Khalek, C. Caramanis, and R. W. Heath, Delay-constrained video transmission: Quality-driven resource allocation and scheduling, IEEE Journal of Selected Topics in Signal Processing, vol. 9, no. 1, pp. 60-75, 2015.

DOI Google Scholar

[22]

V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, et al., Human-level control through deep reinforcement learning, Nature, vol. 518, no. 7540, pp. 529-533, 2015.

DOI Google Scholar

[23]

D. W. Wang, L. Toni, P. C. Cosman, and L. B. Milstein, Resource allocation and performance analysis for multiuser video transmission over doubly selective channels, IEEE Transactions on Wireless Communications, vol. 14, no. 4, pp. 1954-1966, 2015.

DOI Google Scholar

[24]

Q. W. Liu, S. L. Zhou, and G. B. Giannakis, Cross-layer combining of adaptive modulation and coding with truncated ARQ over wireless links, IEEE Transactions on Wireless Communications, vol. 3, no. 5, pp. 1746-1755, 2004.

DOI Google Scholar

[25]

C. Jin, Z. Allen-Zhu, S. Bubeck, and M. I. Jordan, Is Q-learning provably efficient? in Proc. 32nd Int. Conf. Neural Information Processing Systems, Red Hook, NY, USA, 2018, pp. 4868-4878.

[26]

G. B. Ou and Y. L. Murphey, Multi-class pattern classification using neural networks, Pattern Recognition, vol. 40, no. 1, pp. 4-18, 2007.

DOI Google Scholar

About this article

Publication history

Acknowledgements

Rights and permissions

Publication history

Received: 29 September 2020

Revised: 01 December 2020

Accepted: 08 December 2020

Published: 30 December 2020

Issue date: December 2020

Copyright

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61971366, 61671396, and 61901403), the Youth Innovation Fund of Xiamen (No. 3502Z20206039), and the Natural Science Foundation of Fujian Province of China (No. 2020J01430).