Journal Home > Volume 1 , Issue 2
Purpose

Decision-making is one of the key technologies for self-driving cars. The high dependency of previously existing methods on human driving data or rules makes it difficult to model policies for different driving situations.

Design/methodology/approach

In this research, a probabilistic decision-making method based on the Markov decision process (MDP) is proposed to deduce the optimal maneuver automatically in a two-lane highway scenario without using any human data. The decision-making issues in a traffic environment are formulated as the MDP by defining basic elements including states, actions and basic models. Transition and reward models are defined by using a complete prediction model of the surrounding cars. An optimal policy was deduced using a dynamic programing method and evaluated under a two-dimensional simulation environment.

Findings

Results show that, at the given scenario, the self-driving car maintained safety and efficiency with the proposed policy.

Originality/value

This paper presents a framework used to derive a driving policy for self-driving cars without relying on any human driving data or rules modeled by hand.


menu
Abstract
Full text
Outline
About this article

Markov probabilistic decision making of self-driving cars in highway with random traffic flow: a simulation study

Show Author's information Yang Guan1Shengbo Eben Li2( )Jingliang Duan1Wenjun Wang1Bo Cheng1
Tsinghua University, Beijing, China
Department of Automotive Engineering, Tsinghua University, Beijing, China

Abstract

Purpose

Decision-making is one of the key technologies for self-driving cars. The high dependency of previously existing methods on human driving data or rules makes it difficult to model policies for different driving situations.

Design/methodology/approach

In this research, a probabilistic decision-making method based on the Markov decision process (MDP) is proposed to deduce the optimal maneuver automatically in a two-lane highway scenario without using any human data. The decision-making issues in a traffic environment are formulated as the MDP by defining basic elements including states, actions and basic models. Transition and reward models are defined by using a complete prediction model of the surrounding cars. An optimal policy was deduced using a dynamic programing method and evaluated under a two-dimensional simulation environment.

Findings

Results show that, at the given scenario, the self-driving car maintained safety and efficiency with the proposed policy.

Originality/value

This paper presents a framework used to derive a driving policy for self-driving cars without relying on any human driving data or rules modeled by hand.

Keywords: Dynamic programming, Decision-making, Self-driving cars, Markov decision process

References(19)

Bacha, A., Bauman, C., Faruque, R., Fleming, M., Terwelp, C., Reinholtz, C., Hong, D., Wicks, A., Alberi, T., Anderson, D. and Cacciola, S. (2008), “Odin: team VictorTango’s entry in the DARPA Urban Challenge”, Journal of Field Robotics, Vol. 25 No. 8, pp. 467-492.

Bojarski, M., Del Testa, D., Dworakowski, D., Firner, B., Flepp, B., Goyal, P., Jackel, L.D., Monfort, M., Muller, U., Zhang, J. and Zhang, X. (2016), “End to end learning for self-driving cars”, arXiv Preprint arXiv, Vol. 1604, p. 07316.

Bojarski, M., Yeres, P., Choromanska, A., Choromanski, K., Firner, B., Jackel, L. and Muller, U. (2017), “Explaining how a deep neural network trained with end-to-end learning steers a car”, arXiv Preprint arXiv, Vol. 1704, p. 7911.

Buehler, M., Iagnemma, K., and Singh, S. (2009), The DARPA Urban Challenge: Autonomous Vehicles in City Traffic, Springer, Berlin, Vol. 56.

DOI

Carvalho, A., Lefévre, S., Schildbach, G., Kong, J. and Borrelli, F. (2015), “Automated driving: the role of forecasts and uncertainty a control perspective”, European Journal of Control, Vol. 24, pp. 14-32.

Chen, C., Seff, A., Kornhauser, A., and Xiao, J. (2015), Deep driving: learning affordance for direct perception in autonomous driving. in Proceedings of the IEEE International Conference on Computer Vision, pp. 2722-2730.
DOI

Furda, A. and Vlacic, L. (2011), “Enabling safe autonomous driving in real-world city traffic using multiple criteria decision making”, IEEE Intelligent Transportation Systems Magazine, Vol. 3 No. 1, pp. 4-17.

Janai, J., Gü ney, F., Behl, A. and Geiger, A. (2017), “Computer vision for autonomous vehicles: problems, datasets and state-of-the-art”, arXiv Preprint arXiv, Vol. 1704, p. 05519.

Lefèvre, S., Vasquez, D. and Laugier, C. (2014), “A survey on motion prediction and risk assessment for intelligent vehicles”, Robomech Journal, Vol. 1 No. 1, p. 1.

Li, G., Li, S.E., Cheng, B., and Green, P. (2017a), “Estimation of driving style in naturalistic highway traffic using maneuver transition probabilities”, Transportation Research Part C, Emerging Technologies, Vol. 74, pp. 113-125.
DOI

Li, S.E., Qin, X., Li, K., Wang, J. and Xie, B. (2017b), “Robustness analysis and controller synthesis of homogeneous vehicular platoons with bounded parameter uncertainty”, IEEEASME Transactions on Mechatronics, Vol. 22 No. 2, pp. 1014-1025.

Li, S.E., Zheng, Y., Li, K., Wu, Y., Hedrick, J.K., Gao, F. and Zhang, H. (2017c), “Dynamical modeling and distributed control of connected and automated vehicles: challenges and opportunities”, IEEE Intelligent Transportation Systems Magazine, Vol. 9 No. 3, pp. 46-58.

Liu, C., Li, S.E., Yang, D. and Hedrick, J.K. (2018), “Distributed Bayesian filter using measurement dissemination for multiple unmanned ground vehicles with dynamically changing interaction topologies”, Journal of Dynamic Systems, Measurement, and Control, Vol. 140 No. 3, p. 030903.

Montemerlo, M., Becker, J., Bhat, S., Dahlkamp, H., Dolgov, D., Ettinger, S., Haehnel, D., Hilden, T., Hoffmann, G., Huhnke, B., et al. (2008), “Junior: the Stanford entry in the urban challenge”, Journal of Field Robotics, Vol. 25 No. 9, pp. 569-597.

Paden, B., Čáp, M., Yong, S.Z., Yershov, D. and Frazzoli, E. (2016), “A survey of motion planning and control techniques for self-driving urban vehicles”, IEEE Transactions on Intelligent Vehicles, Vol. 1 No. 1, pp. 33-55.

Patole, S.M., Torlak, M., Wang, D. and Ali, M. (2017), “Automotive radars: a review of signal processing techniques”, IEEE Signal Processing Magazine, Vol. 34 No. 2, pp. 22-35.

Royden, H.L. and Fitzpatrick, P. (1968), Real Analysis, Vol. 2, Macmillan, New York, NY, Vol. 2.

Urmson, C., Anhalt, J., Bagnell, D., Baker, C., Bittner, R., Clark, M., Dolan, J., Duggins, D., Galatali, T., Geyer, C. and Gittleman, M. (2008), “Autonomous driving in urban environments: boss and the urban challenge”, Journal of Field Robotics, Vol. 25 No. 8, pp. 425-466.

Zheng, Y., Li, S.E., Li, K., Borrelli, F. and Hedrick, J.K. (2017), “Distributed model predictive control for heterogeneous vehicle platoons under unidirectional topologies”, IEEE Transactions on Control Systems Technology, Vol. 25 No. 3, pp. 899-910.

Publication history
Copyright
Rights and permissions

Publication history

Received: 31 January 2018
Revised: 06 July 2018
Accepted: 13 August 2018
Published: 18 October 2018
Issue date: December 2018

Copyright

© 2018 Yang Guan, Shengbo Eben Li, Jingliang Duan, Wenjun Wang and Bo Cheng. Published in Journal of Intelligent and Connected Vehicles. Published by Emerald Publishing Limited.

Rights and permissions

This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode

Return