Journal Home > Volume 25 , Issue 3

Person re-IDentification (re-ID) is an important research topic in the computer vision community, with significance for a range of applications. Pedestrians are well-structured objects that can be partitioned, although detection errors cause slightly misaligned bounding boxes, which lead to mismatches. In this paper, we study the person re-identification performance of using variously designed pedestrian parts instead of the horizontal partitioning routine typically applied in previous hand-crafted part works, and thereby obtain more effective feature descriptors. Specifically, we benchmark the accuracy of individual part matching with discriminatively trained Convolutional Neural Network (CNN) descriptors on the Market-1501 dataset. We also investigate the complementarity among different parts using combination and ablation studies, and provide novel insights into this issue. Compared with the state-of-the-art, our method yields a competitive accuracy rate when the best part combination is used on two large-scale datasets (Market-1501 and CUHK03) and one small-scale dataset (VIPeR).


menu
Abstract
Full text
Outline
About this article

Person Re-Identification with Effectively Designed Parts

Show Author's information Yali ZhaoYali Li( )Shengjin Wang( )
Tsinghua University, Beijing 100084, China.

Abstract

Person re-IDentification (re-ID) is an important research topic in the computer vision community, with significance for a range of applications. Pedestrians are well-structured objects that can be partitioned, although detection errors cause slightly misaligned bounding boxes, which lead to mismatches. In this paper, we study the person re-identification performance of using variously designed pedestrian parts instead of the horizontal partitioning routine typically applied in previous hand-crafted part works, and thereby obtain more effective feature descriptors. Specifically, we benchmark the accuracy of individual part matching with discriminatively trained Convolutional Neural Network (CNN) descriptors on the Market-1501 dataset. We also investigate the complementarity among different parts using combination and ablation studies, and provide novel insights into this issue. Compared with the state-of-the-art, our method yields a competitive accuracy rate when the best part combination is used on two large-scale datasets (Market-1501 and CUHK03) and one small-scale dataset (VIPeR).

Keywords: Convolutional Neural Network (CNN), person re-IDentification (re-ID), part model

References(44)

[1]
T. D’ Orazio and G. Cicirelli, People re-identification and tracking from multiple cameras: A review, in Proc. of 2012 19th IEEE International Conference on Image Processing, Phoenix, AZ, USA, 2012, pp. 1601–1604.
DOI
[2]
B. Wang, G. Wang, K. Luk Chan, and L. Wang, Tracklet association with online target-specific metric learning, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 1234–1241.
DOI
[3]
W. S. Zheng, S. Gong, and T. Xiang, Group association: Assisting re-identification by visual context, in Proc. of European Conference on Computer Vision, London, UK, 2014, pp. 183–201.
DOI
[4]
L. Zheng, Y. Yang, and Q. Tian, SIFT meets CNN: A decade survey of instance retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 5, pp. 1224–1244, 2017.
[5]
X. Jin and X. Tan, Face alignment in-the-wild: A survey, Journal of Computer Vision and Image Understanding, vol. 162, pp. 1–22, 2017.
[6]
W. Li, R. Zhao, T. Xiao, and X. Wang, Deepreid: Deep filter pairing neural network for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 152–159.
DOI
[7]
L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, and Q. Tian, Scalable person re-identification: A benchmark, in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 1116–1124.
DOI
[8]
D. Cheng, Y. Gong, S. Zhou, J. Wang, and N. Zheng, Person re-identification by multi-channel parts-based CNN with improved triplet loss function, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 1335–1344.
DOI
[9]
Y. Yang, J. Yang, J. Yan, S. Liao, D. Yi, and S. Z. Li, Salient color names for person re-identification, in Proc. of European Conference on Computer Vision, Zurich, Switzerland, 2014, pp. 536–551.
DOI
[10]
S. Liao, Y. Hu, X. Zhu, and S. Z. Li, Person re-identification by local maximal occurrence representation and metric learning, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 2197–2206.
DOI
[11]
S. E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, Convolutional pose machines, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4724–4732.
DOI
[12]
E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A deeper, stronger, and faster multi-person pose estimation model, in Proc. of European Conference on Computer Vision, Amsterdam, The Netherlands, 2016, pp. 34–50.
DOI
[13]
A. Newell, K. Yang, and J. Deng, Stacked hourglass networks for human pose estimation, in Proc. of European Conference on Computer Vision, Amsterdam, The Netherlands, 2016, pp. 483–499.
DOI
[14]
M. Farenzena, L. Bazzani, A. Perina, V. Murino, and M. Cristani, Person re-identification by symmetry-driven accumulation of local features, in Proc. of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 2010, pp. 2360–2367.
DOI
[15]
J. Van de Weijer, C. Schmid, J. Verbeek, and D. Larlus, Learning color names for real-world applications, IEEE Transactions on Image Processing, vol. 18, no. 7, pp. 1512–1523, 2009.
[16]
E. Ahmed, M. Jones, and T. K. Marks, An improved deep learning architecture for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 3908–3916.
DOI
[17]
L. Zheng, H. Zhang, S. Sun, M. Chandraker, Y. Yang, and Q. Tian, Person re-identification in the wild, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 2017, pp. 1367–1376.
DOI
[18]
M. Hirzer, P. M. Roth, M. Kostinger, and H. Bischof, Relaxed pairwise learned metric for person re-identification, in Proc. of European Conference on Computer Vision, Heidelberg, Germany, 2012, pp. 780–793.
DOI
[19]
L. Zhang, T. Xiang, and S. Gong, Learning a discriminative null space for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 1239–1248.
DOI
[20]
S. Lazebnik, C. Schmid, and J. Ponce, Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories, in Proc. of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, USA, 2006, pp. 2169–2178.
[21]
R. Zhao, W. Ouyang, and X. Wang, Unsupervised salience learning for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 3586–3593.
DOI
[22]
R. Zhao, W. Ouyang, and X. Wang, Learning mid-level filters for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 144–151.
DOI
[23]
H. Yao, S. Zhang, R. Hong, Y. Zhang, C. Xu, and Q. Tian, Deep representation learning with part loss for person re-identification, IEEE Transactions on Image Processing, vol. 28, no. 6, pp. 2860–2871, 2019.
[24]
X. Liu, H. Zhao, M. Tian, L. Sheng, J. Shao, S. Yi, and X. Wang, Hydraplus-net: Attentive deep features for pedestrian analysis, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 350–359.
DOI
[25]
L. Zhao, X. Li, Y. Zhuang, and J. Wang, Deeply-learned part-aligned representations for person re-identification, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 3219–3228.
DOI
[26]
H. Fang, S. Gupta, F. Iandola, R. K. Srivastava, L. Deng, P. Dollár, and C. L. Zitnick, From captions to visual concepts and back, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 1473–1482.
DOI
[27]
H. Liu, J. Feng, M. Qi, J. Jiang, and S. Yan, End-to-end comparative attention networks for person re-identification, IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 3492–3506, 2017.
[28]
Z. Zheng, L. Zheng, and Y. Yang, Unlabeled samples generated by GAN improve the person re-identification baseline in vitro, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 3754–3762.
DOI
[29]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, in Proc. of Advances in Neural Information Processing Systems, Portland, OR, USA, 2012, pp. 1097–1105.
[30]
K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 770–778.
DOI
[31]
D. Gray, S. Brennan, and H. Tao, Evaluating appearance models for recognition, reacquisition, and tracking, in Proc. IEEE International Workshop on Performance Evaluation for Tracking and Surveillance (PETS), Las Vegas, NV, USA, 2007, pp. 1–7.
[32]
Z. Li, S. Chang, F. Liang, T. S. Huang, L. Cao, and J. R. Smith, Learning locally-adaptive decision functions for person verification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 3610–3617.
DOI
[33]
L. Ma, X. Yang, and D. Tao, Person re-identification over camera networks using multi-task distance metric learning, IEEE Transactions on Image Processing, vol. 23, no. 8, pp. 3656–3670, 2014.
[34]
R. Zhao, W. Ouyang, and X. Wang, Person re-identification by salience matching, in Proceedings of the IEEE International Conference on Computer Vision, Sydney, Australia, 2013, pp. 2528–2535.
DOI
[35]
S. Ding, L. Lin, G. Wang, and H. Chao, Deep feature learning with relative distance comparison for person re-identification, IEEE Transactions on Pattern Recognition, vol. 48, no. 10, pp. 2993–3003, 2015.
[36]
M. Koestinger, M. Hirzer, P. Wohlhart, P. M. Roth, and H. Bischof, Large scale metric learning from equivalence constraints, in Proc. of 2012 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2012, pp. 2288–2295.
DOI
[37]
B. McFee and G. R.Lanckriet, Metric learning to rank, in Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 2010, pp. 775–782.
[38]
S. Wu, Y. C. Chen, X. Li, A. C. Wu, J. J. You, and W. S. Zheng, An enhanced deep feature representation for person re-identification, in Proc. of 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA, 2016, pp. 1–8.
DOI
[39]
F. Xiong, M. Gou, O. Camps, and M. Sznaier, Person re-identification using kernel-based metric learning methods, in Proc. of European Conference on Computer Vision, Zurich, Switzerland, 2014, pp. 1–16.
DOI
[40]
Y. Sun, L. Zheng, W. Deng, and S. Wang, Svdnet for pedestrian retrieval, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 3800–3808.
DOI
[41]
Z. Zheng, L. Zheng, and Y. Yang, Pedestrian alignment network for large-scale person re-identification, IEEE Transactions on Circuits and Systems for Video Technology, vol. 25, no. 5, pp. 2860–2871, 2018.
[42]
E. Ustinova, Y. Ganin, and V. Lempitsky, Multi-region bilinear convolutional neural networks for person re-identification, in Proc. of 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy, 2017, pp. 1–6.
DOI
[43]
Y. Chen, X. Zhu, and S. Gong, Person re-identification by deep learning multi-scale representations, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 2590–2600.
DOI
[44]
Z. Zhong, L. Zheng, G. Kang, S. Li, and Y. Yang, Random erasing data augmentation,arXiv preprint arXiv: 1708. 04896, 2017.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 28 December 2018
Revised: 22 June 2019
Accepted: 17 July 2019
Published: 07 October 2019
Issue date: June 2020

Copyright

© The author(s) 2020

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61771288 and 61701277) and the State Key Development Program of the 13th Five-Year Plan (No. 2017YFC0821601).

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return