[3]
B. Serbetci and J. Goseling, On optimal geographical caching in heterogeneous cellular networks, in Proc. IEEE Wireless Communications and Networking Conf. (WCNC), San Francisco, CA, USA, 2017, pp. 1–6.
[4]
B. Blaszczyszyn and A. Giovanidis, Optimal geographic caching in cellular networks, in Proc. IEEE Int. Conf. Communications (ICC), London, UK, 2015, pp. 3358–3363.
[8]
J. Wu, B. Chen, C. Yang, and Q. Li, Caching and bandwidth allocation policy optimization in heterogeneous networks, in Proc. IEEE 28th Annual Int. Symp. on Personal, Indoor, and Mobile Radio Communications (PIMRC), Montreal, Canada, 2017, pp. 1–6.
[12]
Z. Wang, Z. Cao, Y. Cui, and Y. Yang, Joint and competitive caching designs in large-scale multi-tier wireless multicasting networks, in Proc. IEEE Global Communications Conf. (GLOBECOM), Singapore, 2017, pp. 1–7.
[20]
Y. Wei, Z. Zhang, F. R. Yu, and Z. Han, Joint user scheduling and content caching strategy for mobile edge networks using deep reinforcement learning, in Proc. IEEE Int. Conf. Communications Workshops (ICC Workshops), Kansas City, MO, USA, 2018, pp. 1–6.
[21]
D. Li, Y. Han, C. Wang, G. Shi, X. Wang, X. Li, and V. C. M. Leung, Deep reinforcement learning for cooperative edge caching in future mobile networks, in Proc. IEEE Wireless Communications and Networking Conf. (WCNC), Marrakesh, Morocco, 2019, pp. 1–6.
[24]
M. Amidzadeh, H. Al-Tous, O. Tirkkonen, and J. Zhang, Joint cache placement and delivery design using reinforcement learning for cellular networks, in Proc. IEEE 93rd Vehicular Technology Conf. (VTC2021-Spring), Helsinki, Finland, 2021, pp. 1–6.
[25]
T. Ni, B. Eysenbach, and R. Salakhutdinov, Recurrent model-free RL is a strong baseline for many POMDPs, arXiv preprint arXiv:2110.05038, 2021.
[26]
J. G. Andrews, A. K. Gupta, and H. S. Dhillon, A primer on cellular network analysis using stochastic geometry, arXiv preprint arXiv:1604.03183, 2016.
[27]
M. Chiang, Networked Life. Cambridge, UK: Cambridge University Press, 2012.
[28]
L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, Web caching and Zipf-like distributions: Evidence and implications, in Proc. IEEE INFOCOM '99: 18th Annual Joint Conf. IEEE Computer and Communications Societies, New York, NY, USA, 1999, pp. 126–134.
[31]
A. Baisero and C. Amato, Unbiased asymmetric reinforcement learning under partial observability, arXiv preprint arXiv:2105.11674v2, 2022.
[35]
3GPP, Universal Mobile Telecommunications System (UMTS); RF system scenarios (3GPP TR 25.942 version 14.0.0), Tech. Rep. ETSI TR 125 942, 3GPP, 2017.