Journal Home > Volume 27 , Issue 1

Road pricing is an urban traffic management mechanism to reduce traffic congestion. Currently, most of the road pricing systems based on predefined charging tolls fail to consider the dynamics of urban traffic flows and travelers’ demands on the arrival time. In this paper, we propose a method to dynamically adjust online road toll based on traffic conditions and travelers’ demands to resolve the above-mentioned problems. The method, based on deep reinforcement learning, automatically allocates the optimal toll for each road during peak hours and guides vehicles to roads with lower toll charges. Moreover, it further considers travelers’ demands to ensure that more vehicles arrive at their destinations before their estimated arrival time. Our method can increase the traffic volume effectively, as compared to the existing static mechanisms.


menu
Abstract
Full text
Outline
About this article

A Dynamic and Deadline-Oriented Road Pricing Mechanism for Urban Traffic Management

Show Author's information Jiahui JinXiaoxuan ZhuBiwei WuJinghui Zhang( )Yuxiang Wang
School of Computer Science and Engineering, Southeast University, Nanjing 211189, China
Department of Computer Science and Engineering, Hangzhou Dianzi University, Hangzhou 310018, China

Abstract

Road pricing is an urban traffic management mechanism to reduce traffic congestion. Currently, most of the road pricing systems based on predefined charging tolls fail to consider the dynamics of urban traffic flows and travelers’ demands on the arrival time. In this paper, we propose a method to dynamically adjust online road toll based on traffic conditions and travelers’ demands to resolve the above-mentioned problems. The method, based on deep reinforcement learning, automatically allocates the optimal toll for each road during peak hours and guides vehicles to roads with lower toll charges. Moreover, it further considers travelers’ demands to ensure that more vehicles arrive at their destinations before their estimated arrival time. Our method can increase the traffic volume effectively, as compared to the existing static mechanisms.

Keywords: deep reinforcement learning, road pricing, traffic congestion alleviation

References(26)

[1]
S. Liu, W. T. Zhang, X. J. Wu, S. Feng, X. Pei, and D. Y. Yao, A simulation system and speed guidance algorithms for intersection traffic control using connected vehicle technology, Tsinghua Science and Technology, vol. 24, no. 2, pp. 160-170, 2019.
[2]
N. Wang, G. Guo, B. Wang, and C. Wang, Traffic clustering algorithm of urban data brain based on a hybrid-augmented architecture of quantum annealing and brain-inspired cognitive computing, Tsinghua Science and Technology, vol. 25, no. 6, pp. 813-825, 2020.
[3]
J. J. Li, H. H. Jiao, J. Wang, Z. G. Liu, and J. Wu, Online real-time trajectory analysis based on adaptive time interval clustering algorithm, Big Data Mining and Analytics, vol. 3, no. 2, pp. 131-142, 2020.
[4]
J. R. Gan, B. An, H. Z. Wang, X. M. Sun, and Z. Z. Shi, Optimal pricing for improving efficiency of taxi systems, in Proc. 23rd Int. Joint Conf. Artificial Intelligence, Beijing, China, 2013, pp. 2811-2818.
[5]
M. J. Hausknecht and P. Stone, Deep reinforcement learning in parameterized action space, arXiv preprint arXiv:1511.04143, 2016.
[6]
D. Joksimovic, M. C. J. Bliemer, P. H. L. Bovy, and Z. Verwater-Lukszo, Dynamic road pricing for optimizing network performance with heterogeneous users, in Proc. Networking, Sensing and Control, Tucson, AZ, USA, 2005, pp. 407-412.
[7]
H. P. Chen, B. An, G. Sharon, J. P. Hanna, P. Stone, C. Y. Miao, and Y. C. Soh, DyETC: Dynamic electronic toll collection for traffic congestion alleviation, in Proc. 32nd AAAI Conf. Artificial Intelligence, New Orleans, Lousiana, USA, 2018, pp.757-765.
[8]
R. S. Sutton, D. A. McAllester, S. P. Singh, and Y. Mansour, Policy gradient methods for reinforcement learning with function approximation, in Proc. 12th Int. Conf. Neural Information Processing Systems, Cambridge, MA, USA, 1999, pp. 1057-1063.
[9]
D. Joksimovic, M. C. J. Bliemer, and P. H. L. Bovy, Optimal toll design problem in dynamic traffic networks with joint route and departure time choice, Transportation Research Record, vol. 1923, no. 1, pp. 61-72, 2005.
[10]
D. Y. Lin, A. Unnikrishnan, and S. T. Waller, A dual variable approximation based heuristic for dynamic congestion pricing, Networks and Spatial Economics, vol. 11, no. 2, pp. 271-293, 2011.
[11]
B. J. Zhou, M. Bliemer, H. Yang, and J. He, A trial-and-error congestion pricing scheme for networks with elastic demand and link capacity constraints, Transportation Research Part B: Methodological, vol. 72, pp. 77-92, 2015.
[12]
G. Sharon, J. Hanna, T. Rambha, M. Albert, P. Stone, and S. D. Boyles, Delta-tolling: Adaptive tolling for optimizing traffic throughput, in Proc. 9th Int. Workshop on Agents in Traffic and Transportation, New York, NY, USA, 2016.
[13]
G. Sharon, J. P. Hanna, T. Rambha, M. W. Levin, M. Albert, S. D. Boyles, and P. Stone, Real-time adaptive tolling scheme for optimized social welfare in traffic networks, in Proc. 16th Conf. Autonomous Agents and MultiAgent Systems, Richland, SC, USA, 2017, pp. 828-836.
[14]
K. T. Bui, V. A. Huynh, and E. Frazzoli, Dynamic traffic congestion pricing mechanism with user-centric considerations, in Proc. 2012 15th Int. IEEE Conf. Intelligent Transportation Systems, Anchorage, AK, USA, 2012, pp. 147-154.
DOI
[15]
H. Mirzaei, G. Sharon, S. D. Boyles, T. Givargis, and P. Stone, Link-based parameterized micro-tolling scheme for optimal traffic management, in Proc. 17th Int. Conf. Autonomous Agents and MultiAgent Systems, Richland, SC, USA, 2018, pp. 2013-2015.
[16]
F. Soylemezgiller, M. Kuscu, and D. Kilinc, A traffic congestion avoidance algorithm with dynamic road pricing for smart cities, in Proc. 2013 IEEE 24th Annual Int. Symp. Personal, Indoor, and Mobile Radio Communications, London, UK, 2013, pp. 2571-2575.
DOI
[17]
W. Qiu, H. P. Chen, and B. An, Dynamic electronic toll collection via multi-agent deep reinforcement learning with edge-based graph convolutional networks, in Proc. 28th Int. Joint Conf. Artificial Intelligence, Macao, China, 2019, pp. 4568-4574.
DOI
[18]
N. Aung, W. D. Zhang, S. Dhelim, and Y. B. Ai, T-Coin: Dynamic traffic congestion pricing system for the internet of vehicles in smart cities, Information, vol. 11, no. 3, p. 149, 2020.
[19]
H. K. Lo and W. Y. Szeto, A methodology for sustainable traveler information services, Transportation Research Part B: Methodological, vol. 36, no. 2, pp. 113-130, 2002.
[20]
K. L. Hong, C. W. Yip, and K. H. Wan, Modeling transfer and non-linear fare structure in multi-modal network, Transportation Research Part B: Methodological, vol. 37, no. 2, pp. 149-170, 2003.
[21]
H. J. Huang and Z. C. Li, A multiclass, multicriteria logit-based traffic equilibrium assignment model under ATIS, Eur. J. Oper. Res., vol. 176, no. 3, pp. 1464-1477, 2007.
[22]
K. Zhu and T. Zhang, Deep reinforcement learning based mobile robot navigation: A review, Tsinghua Science and Technology, vol. 26, no. 5, pp. 674-691, 2021.
[23]
B. J. Zhou, M. Bliemer, H. Yang, and J. He, A trial-and-error congestion pricing scheme for networks with elastic demand and link capacity constraints, Transportation Research Part B : Methodological, vol. 72, pp. 77-92, 2015.
[24]
O. Anschel, N. Baram, and N. Shimkin, Averaged-DQN: Variance reduction and stabilization for deep reinforcement learning, in Proc. 34th Int. Conf. Machine Learning, Sydney, Australia, 2017, pp. 176-185.
[25]
T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, Continuous control with deep reinforcement learning, in Proc. 4th Int. Conf. Learning Representations, San Juan, Puerto Rico, 2016.
[26]
P. A. Barter, A vehicle quota integrated with road usage pricing: A mechanism to complete the phase-out of high fixed vehicle taxes in Singapore, Transport Policy, vol. 12, no. 6, pp. 525-536, 2005.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 01 December 2020
Accepted: 16 December 2020
Published: 17 August 2021
Issue date: February 2022

Copyright

© The author(s) 2022

Acknowledgements

This work was supported by the National Key R&D Program of China (No. 2018AAA0101200), National Natural Science Foundation of China (Nos. 62072099, 61972085, 62072149, and 61872079), Public Welfare Research Program of Zhejiang (No. LGG19F020017), Jiangsu Provincial Key Laboratory of Network and Information Security (No. BM2003201), Key Laboratory of Computer Network and Information Integration of Ministry of Education of China (No. 93K-9), "Zhishan" Scholars Programs of Southeast University, and partially supported by Collaborative Innovation Center of Novel Software Technology and Industrialization, and the Fundamental Research Funds for the Central Universities. We also thank the Big Data Computing Center of Southeast University for providing the experiment environment and computing facility.

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return