
Enhanced target tracking algorithm for autonomous driving based on visible and infrared image fusion

Quan Yuan, Haixu Shi, Ashton Tan Yu Xuan, Ming Gao, Qing Xu, Jianqiang Wang
State Key Laboratory of Intelligent Green Vehicle and Mobility, School of Vehicle and Mobility, Tsinghua University, Beijing 100084, China

Abstract

In autonomous driving, target tracking is essential to environmental perception. Research on target tracking algorithms can improve the accuracy of an autonomous vehicle’s perception, which is of great significance for ensuring driving safety and promoting the practical deployment of the technology. This study focuses on a fusion tracking algorithm based on visible and infrared images. The proposed approach adopts a feature-level image fusion method and divides the tracking process into two components: image fusion and target tracking. In the image fusion part, an unsupervised network, the Visible and Infrared image Fusion Network (VIF-net), fuses the visible and infrared images. In the target tracking part, the deep-learning-based Siamese Region Proposal Network (SiamRPN) tracks the target on the fused images. The fusion tracking algorithm is trained and evaluated on the visible–infrared image dataset RGBT234. Experimental results demonstrate that the algorithm outperforms the same network trained solely on visible images, showing that fusing visible and infrared images enables the tracker to exploit infrared image features and thereby improves target tracking accuracy.
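To illustrate the two-stage pipeline described above, the following is a minimal, hedged sketch in PyTorch: a small fusion network merges a visible frame and an infrared frame at the feature level, and a Siamese-style tracker correlates a fused template with a fused search region. All module names (FusionNet, SiamBackbone, cross_correlate), layer sizes, and input resolutions are illustrative assumptions, not the authors' VIF-net or SiamRPN implementations; the RPN classification and regression heads of SiamRPN are omitted and replaced by a single correlation response map.

# Illustrative sketch only: a toy feature-level fusion network followed by a
# Siamese cross-correlation step, standing in for the VIF-net + SiamRPN
# pipeline described in the abstract. Shapes and module names are assumptions.
import torch
import torch.nn as nn

class FusionNet(nn.Module):
    """Toy feature-level fusion: encode each modality, concatenate, decode."""
    def __init__(self, channels=16):
        super().__init__()
        self.enc_vis = nn.Sequential(nn.Conv2d(3, channels, 3, padding=1), nn.ReLU())
        self.enc_ir = nn.Sequential(nn.Conv2d(1, channels, 3, padding=1), nn.ReLU())
        self.dec = nn.Conv2d(2 * channels, 3, 3, padding=1)  # fused, RGB-like image

    def forward(self, visible, infrared):
        feats = torch.cat([self.enc_vis(visible), self.enc_ir(infrared)], dim=1)
        return self.dec(feats)

class SiamBackbone(nn.Module):
    """Shared backbone applied to both the template and the search region."""
    def __init__(self, channels=32):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, channels, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(channels, channels, 3, stride=2, padding=1), nn.ReLU())

    def forward(self, x):
        return self.body(x)

def cross_correlate(template_feat, search_feat):
    """Correlate template features with the search region (single pair)."""
    return nn.functional.conv2d(search_feat, template_feat)

# One tracking step on a single frame pair (batch size 1 for clarity).
fusion, backbone = FusionNet(), SiamBackbone()
vis_t, ir_t = torch.rand(1, 3, 127, 127), torch.rand(1, 1, 127, 127)  # template frame
vis_s, ir_s = torch.rand(1, 3, 255, 255), torch.rand(1, 1, 255, 255)  # search frame
template_feat = backbone(fusion(vis_t, ir_t))
search_feat = backbone(fusion(vis_s, ir_s))
response = cross_correlate(template_feat, search_feat)  # peak ~ target location
print(response.shape)

In the full SiamRPN formulation, the correlation response would instead feed region-proposal heads that output per-anchor classification scores and box offsets; the single response map here is only meant to show where the fused images enter the tracker.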

Keywords: deep learning, autonomous driving, image fusion, target tracking, infrared image


Publication history

Received: 08 July 2023
Revised: 20 August 2023
Accepted: 12 September 2023
Published: 30 December 2023
Issue date: December 2023

Copyright

© The author(s) 2023.

Acknowledgements

This research was funded by the National Natural Science Foundation of China (Grant Nos. 52072214 and 52242213). The authors acknowledge Dr. Hui Xiong for his assistance in improving the manuscript.

Rights and permissions

This is an open access article under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
