Journal Home > Volume 24 , Issue 1

Traffic forecasting has been an active research field in recent decades, and with the development of deep-learning technologies, researchers are trying to utilize deep learning to achieve tremendous improvements in traffic forecasting, as it has been seen in other research areas, such as speech recognition and image classification. In this study, we summarize recent works in which deep-learning methods were applied for geospatial data-based traffic forecasting problems. Based on the insights from previous works, we further propose a deep-learning framework, which transforms geospatial data to images, and then utilizes the state-of-the-art deep-learning methodologies such as Convolutional Neural Network (CNN) and residual networks. To demonstrate the simplicity and effectiveness of our framework, we present a formulation of the New York taxi pick-up/drop-off forecasting problem, and show that our framework significantly outperforms traditional methods, including Historical Average (HA) and AutoRegressive Integrated Moving Average (ARIMA).


menu
Abstract
Full text
Outline
About this article

Geospatial Data to Images: A Deep-Learning Framework for Traffic Forecasting

Show Author's information Weiwei Jiang( )Lin Zhang
Department of Electronic Engineering, Tsinghua University, Beijing 100084, China.
Department of Electronic Engineering and Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen 518055, China.

Abstract

Traffic forecasting has been an active research field in recent decades, and with the development of deep-learning technologies, researchers are trying to utilize deep learning to achieve tremendous improvements in traffic forecasting, as it has been seen in other research areas, such as speech recognition and image classification. In this study, we summarize recent works in which deep-learning methods were applied for geospatial data-based traffic forecasting problems. Based on the insights from previous works, we further propose a deep-learning framework, which transforms geospatial data to images, and then utilizes the state-of-the-art deep-learning methodologies such as Convolutional Neural Network (CNN) and residual networks. To demonstrate the simplicity and effectiveness of our framework, we present a formulation of the New York taxi pick-up/drop-off forecasting problem, and show that our framework significantly outperforms traditional methods, including Historical Average (HA) and AutoRegressive Integrated Moving Average (ARIMA).

Keywords: deep learning, convolutional neural network, geospatial data, residual network, traffic forecasting

References(65)

[1]
Zheng Y., Wang L., Zhang R., Xie X., and Ma W., GeoLife: Managing and understanding your past life over maps, in Proc. 9th International Conference on Mobile Data Management, Beijing, China, 2008, pp. 211-212.
DOI
[2]
California Department of Transportation Caltrans Performance Measurement System (PeMS) database, http://pems.dot.ca.gov/, 2017.
[3]
NYC Taxi & Limousine Commission TLC Trip Record Data, http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml, 2017.
[4]
Szegedy C., Toshev A., and Erhan D., Deep neural networks for object detection, in Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA, 2013, pp. 2553-2561.
[5]
Graves A. and Schmidhuber J., Offline handwriting recognition with multidimensional recurrent neural networks, in Advances in Neural Information Processing Systems, Hyatt Regency, Canada, 2009, pp. 545-552.
[6]
Hinton G., Deng L., Yu D., Dahl G., Mohamed A., Jaitly N., Senior A., Vanhoucke V., Nguyen P., Sainath T., et al., Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012.
[7]
Krizhevsky A., Sutskever I., and Hinton G., Imagenet classification with deep convolutional neural networks, in Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA, 2012, pp. 1097-1105.
[8]
Sutskever I., Vinyals O., and Le Q., Sequence to sequence learning with neural networks, in Advances in Neural Information Processing Systems, Montreal, Canada, 2014, pp. 3104-3112.
[9]
Baek J. and Sohn K., Deep-learning architectures to forecast bus ridership at the stop and stop-to-stop levels for dense and crowded bus networks, Applied Artificial Intelligence, vol. 30, no. 9, pp. 861-885, 2017.
[10]
Zhang J., Zheng Y., Qi D., Li R., and Yi X., DNN-based prediction model for spatio-temporal data, in Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Francisco, CA, USA, 2016, p. 92.
DOI
[11]
Zhang J., Zheng Y., and Qi D., Deep spatio-temporal residual networks for citywide crowd flows prediction, in Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 2017, pp. 1655-1661.
[12]
Yu R., Li Y., Shahabi C., Demiryurek U., and Liu Y., Deep learning: A generic approach for extreme condition traffic forecasting, in Proceedings of the 2017 SIAM International Conference on Data Mining, Houston, TX, USA, 2017, pp. 777-785.
DOI
[13]
Ouyang X., Zhang C., Zhou P., and Jiang H., Deepspace: An online deep learning framework for mobile big data to understand human mobility patterns, https://arxiv.org/abs/1610.07009, 2016.
[14]
Varshneya D. and Srinivasaraghavan G., Human trajectory prediction using spatially aware deep attention models, https://arxiv.org/abs/1705.09436, 2017.
[15]
Wang D., Cao W., Li J., and Ye J., Deepsd: Supply-demand prediction for online car-hailing services using deep neural networks, in Proceedings of IEEE 33rd International Conference on Data Engineering, San Diego, CA, USA, 2017, pp. 243-254.
DOI
[16]
De Brébisson A., Simon É., Auvolat A., Vincent P., and Bengio Y., Artificial neural networks applied to taxi destination prediction, in Proceedings of the 2015 International Conference on ECML PKDD Discovery Challenge, Aachen, Germany, 2015, pp. 40-51.
[17]
Lv J., Li Q., and Wang X., Modeling trajectory as image: Convolutional neural networks for multi-scale taxi trajectory prediction, https://arxiv.org/pdf/1611.07635, 2016.
[18]
Chen Q., Song X., Yamada H., and Shibasaki R., Learning deep representation from big and heterogeneous data for traffic accident inference, in Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 2016, pp. 338-344.
[19]
Sameen M. and Pradhan B., Severity prediction of traffic accidents with recurrent neural networks, Applied Sciences, vol. 7, no. 6, p. 476, 2017.
[20]
Chen Y., Lv Y., Li Z., and Wang F., Long short-term memory model for traffic congestion prediction with online open data, in Proceedings of the IEEE 19th International Conference on Intelligent Transportation Systems, Rio de Janeiro, Brazil, 2016, pp. 132-137.
[21]
Huang W., Song G., Hong H., and Xie K., Deep architecture for traffic flow prediction: Deep belief networks with multi-task learning, IEEE Transactions on Intelligent Transportation Systems, vol. 15, no. 5, pp. 2191-2201, 2014.
[22]
Lv Y., Duan Y., Kang W., Li Z., and Wang F., Traffic flow prediction with big data: A deep learning approach, IEEE Transactions on Intelligent Transportation Systems, vol. 16, no. 2, pp. 865-873, 2015.
[23]
Tian Y. and Pan L., Predicting short-term traffic flow by long short-term memory recurrent neural network, in Proceedings of the 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity), Chengdu, China, 2015, pp. 153-158.
DOI
[24]
Duan Y., Lv Y., and Wang F., Performance evaluation of the deep learning approach for traffic flow prediction at different times, in Proceedings of the 2016 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI), Beijing, China, 2016, pp. 223-227.
DOI
[25]
Fu R., Zhang Z., and Li L., Using LSTM and GRU neural network methods for traffic flow, prediction, in Proceedings of the 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC), Wuhan, China, 2016, pp. 324-328.
DOI
[26]
Koesdwiady A., Soua R., and Karray F., Improving traffic flow prediction with weather information in connected cars: A deep learning approach, IEEE Transactions on Vehicular Technology, vol. 65, no. 12, pp. 9508-9517, 2016.
[27]
Shao H. and Soong B., Traffic flow prediction with long short-term memory networks (lstms), in Proceedings of the 2016 IEEE Region 10 Conference (TENCON), Singapore, 2016, pp. 2986-2989.
DOI
[28]
Zhao Z., Chen W., Wu X., Chen P. C., and Liu J., LSTM network: A deep learning approach for short-term traffic forecast, IET Intelligent Transport Systems, vol. 11, no. 2, pp. 68-75, 2017.
[29]
Lemieux J. and Ma Y., Vehicle speed prediction using deep learning, in Proceedings of the 2015 IEEE Vehicle Power and Propulsion Conference (VPPC), Montreal, Canada, 2015, pp. 1-5.
DOI
[30]
Ma X., Tao Z., Wang Y., Yu H., and Wang Y., Long short-term memory neural network for traffic speed prediction using remote microwave sensor data, Transportation Research Part C: Emerging Technologies, vol. 54, pp. 187-197, 2015.
[31]
Jia Y., Wu J., and Du Y., Traffic speed prediction using deep learning method, in Proceedings of the IEEE 19th International Conference on Intelligent Transportation Systems, Rio de Janeiro, Brazil, 2016, pp. 1217-1222.
[32]
Wang J., Gu Q., Wu J., Liu G., and Xiong Z., Traffic speed prediction and congestion source exploration: A deep learning method, in Proceedings of the 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, Spain, 2016, pp. 499-508.
DOI
[33]
Ma X., Dai Z., He Z., Ma J., Wang Y., and Wang Y., Learning traffic as images: A deep convolutional neural network for large-scale transportation network speed prediction, Sensors, vol. 17, no. 4, p. 818, 2017.
[34]
Gang X., Kang W., Wang F., Zhu F., Lv Y., Dong X., Riekki J., and Pirttikangas S., Continuous travel time prediction for transit signal priority based on a deep network, in Proceedings of the 2015 IEEE 18th International Conference on Intelligent Transportation Systems (ITSC), Las Palmas de Gran Canaria, Spain, 2015, pp. 523-528.
DOI
[35]
Siripanpornchana C., Panichpapiboon S., and Chaovalit P., Travel-time prediction with deep learning, in Proceedings of the 2016 IEEE Region 10 Conference (TENCON), Singapore, 2016, pp. 1859-1862.
DOI
[36]
LeCun Y., Bottou L., Bengio Y., and Haffner P., Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
[37]
He K., Zhang X., Ren S., and Sun J., Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 770-778.
DOI
[38]
Deng L. and Yu D., Deep learning: Methods and applications, Foundations and Trends in Signal Processing, vol. 7, nos. 3&4, pp. 197-387, 2014.
[39]
Rojas R., Neural Networks: A Systematic Introduction. Springer Science & Business Media, 2013.
[40]
Goodfellow I., Bengio Y., and Courville A., Deep Learning. MIT Press, 2016.
[41]
Rojas-Barahona L. M., Deep learning for sentiment analysis, Language and Linguistics Compass, vol. 10, no. 12, pp. 701-719, 2016.
[42]
Bengio Y., Learning deep architectures for AI, Foundations and Trends in Machine Learning, vol. 2, no. 1, pp. 1-127, 2009.
[43]
Schmidhuber J., Deep learning in neural networks: An overview, Neural Networks, vol. 61, pp. 85-117, 2015.
[44]
Bengio Y., Boulanger-Lewandowski N., and Pascanu R., Advances in optimizing recurrent networks, in Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, 2013, pp. 8624-8628.
DOI
[45]
Hinton G., A practical guide to training restricted boltzmann machines, Momentum, vol. 9, no. 1, p. 926, 2010.
[46]
Weng J. J., Ahuja N., and Huang T. S., Learning recognition and segmentation of 3-d objects from 2-d images, in Proceedings of the IEEE Fourth International Conference on Computer Vision, Berlin, Germany, 1993, pp. 121-128.
[47]
[48]
Goller C. and Kuchler A., Learning task-dependent distributed representations by backpropagation through structure, in Proceedings of the IEEE International Conference on Neural Networks, Washington, DC, USA, 1996, pp. 347-352.
[49]
Griewank A. and Walther A., Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation. SIAM, 2008.
DOI
[50]
Rumelhart D. E., Hinton G. E., and Williams R. J., Learning representations by back-propagating errors, Cognitive Modeling, vol. 5, no. 3, p. 1, 1988.
[51]
Hochreiter S, and Schmidhuber J,, Long short-term memory, Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
[52]
Bengio Y., Simard P., and Frasconi P., Learning long-term dependencies with gradient descent is difficult, IEEE Transactions on Neural Networks, vol. 5, no. 2, pp. 157-166, 1994.
[53]
Graves A., Supervised sequence labelling, in Supervised Sequence Labelling with Recurrent Neural Networks, 2012, pp. 5-13.
DOI
[54]
Chandra S. R. and Al-Deek H., Predictions of freeway traffic speeds and volumes using vector, autoregressive models, Journal of Intelligent Transportation Systems, vol. 13, no. 2, pp. 53-72, 2009.
[55]
Box G. E., Jenkins G. M., Reinsel G. C., and Ljung G. M., Time Series Analysis: Forecasting and Control. John Wiley & Sons, 2015.
[56]
Smith B. L., Williams B. M., and Oswald R. K., Comparison of parametric and nonparametric models for traffic flow forecasting, Transportation Research Part C: Emerging Technologies, vol. 10, no. 4, pp. 303-321, 2002.
[57]
Wu Y., Chen F., Lu C., Smith B., and Chen Y., Traffic flow prediction for urban network using spatio-temporal random effects model, presented at the 91st Annual Meeting of the Transportation Research Board (TRB), Washington, DC, USA, 2012.
[58]
Queen C. M. and Albers C. J., Intervention and causality: Forecasting traffic flows using a dynamic Bayesian network, Journal of the American Statistical Association, vol. 104, no. 486, pp. 669-681, 2009.
[59]
Xia D., Wang B., Li H., Li Y., and Zhang Z., A distributed spatial-temporal weighted model on Mapreduce for short-term traffic flow forecasting, Neurocomputing, vol. 179, pp. 246-263, 2016.
[60]
Chang G., Wang S., and Xiao X., Review of spatio-temporal models for short-term traffic forecasting, in Proceedings of the IEEE International Conference on Intelligent Transportation Engineering (ICITE), Singapore, 2016, pp. 8-12.
DOI
[61]
Haghighat M. B., Aghagolzadeh A., and Seyedarabi H., Multi-focus image fusion for visual sensor networks in DCT domain, Computers & Electrical Engineering, vol. 37, no. 5, pp. 789-797, 2011.
[62]
Theano Development Team, Theano: A Python framework for fast computation of mathematical expressions, https://arxiv.org/abs/1605.02688, 2016.
[63]
Chollet F., Keras, http://keras.io, 2015.
[64]
Ioffe S. and Szegedy C., Batch normalization: Accelerating deep network training by reducing internal covariate shift, https://arxiv.org/abs/1502.03167, 2015.
[65]
Kingma D. and Ba J., Adam: A method for stochastic optimization, https://arxiv.org/abs/1412.6980, 2014.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 05 July 2017
Accepted: 28 September 2017
Published: 08 November 2018
Issue date: February 2019

Copyright

© The author(s) 2019

Acknowledgements

This work was funded by Shenzhen Municipal Development and Reform Commission, Shenzhen Engineering Laboratory for Data Science and Information Technology (No. SDRC [2015]1872).

Rights and permissions

Return