Person Re-Identification with Effectively Designed Parts

Yali Zhao; Yali Li; Shengjin Wang

doi:10.26599/TST.2019.9010031

Tsinghua Science and Technology 2020, 25(3): 415-424 https://doi.org/10.26599/TST.2019.9010031

Open Access | Issue | Published: 07 October 2019

Person Re-Identification with Effectively Designed Parts

Show Author's Information Hide Author's Information Yali Zhao, Yali Li(

), Shengjin Wang(

)

Tsinghua University, Beijing 100084, China.

Keywords:

Convolutional Neural Network (CNN), person re-IDentification (re-ID), part model

Cite this article:

Zhao Y, Li Y, Wang S. Person Re-Identification with Effectively Designed Parts. Tsinghua Science and Technology, 2020, 25(3): 415-424. https://doi.org/10.26599/TST.2019.9010031

Download citation

EndNote(RIS)

BibTeX

627

Views

Downloads

Citations

Crossref

N/A

WoS

Scopus

CSCD

Abstract Full text About this article

Abstract

Person re-IDentification (re-ID) is an important research topic in the computer vision community, with significance for a range of applications. Pedestrians are well-structured objects that can be partitioned, although detection errors cause slightly misaligned bounding boxes, which lead to mismatches. In this paper, we study the person re-identification performance of using variously designed pedestrian parts instead of the horizontal partitioning routine typically applied in previous hand-crafted part works, and thereby obtain more effective feature descriptors. Specifically, we benchmark the accuracy of individual part matching with discriminatively trained Convolutional Neural Network (CNN) descriptors on the Market-1501 dataset. We also investigate the complementarity among different parts using combination and ablation studies, and provide novel insights into this issue. Compared with the state-of-the-art, our method yields a competitive accuracy rate when the best part combination is used on two large-scale datasets (Market-1501 and CUHK03) and one small-scale dataset (VIPeR).

Full text

Abstract

Full text

Outline

About this article

Person Re-Identification with Effectively Designed Parts

Show Author's information Hide Author's Information Yali Zhao, Yali Li(

), Shengjin Wang(

)

Tsinghua University, Beijing 100084, China.

Abstract

Keywords: Convolutional Neural Network (CNN), person re-IDentification (re-ID), part model

References(44)

[1]

T. D’ Orazio and G. Cicirelli, People re-identification and tracking from multiple cameras: A review, in Proc. of 2012 19th IEEE International Conference on Image Processing, Phoenix, AZ, USA, 2012, pp. 1601–1604.

DOI

[2]

B. Wang, G. Wang, K. Luk Chan, and L. Wang, Tracklet association with online target-specific metric learning, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 1234–1241.

DOI

[3]

W. S. Zheng, S. Gong, and T. Xiang, Group association: Assisting re-identification by visual context, in Proc. of European Conference on Computer Vision, London, UK, 2014, pp. 183–201.

DOI

[4]

L. Zheng, Y. Yang, and Q. Tian, SIFT meets CNN: A decade survey of instance retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 5, pp. 1224–1244, 2017.

DOI Google Scholar

[5]

X. Jin and X. Tan, Face alignment in-the-wild: A survey, Journal of Computer Vision and Image Understanding, vol. 162, pp. 1–22, 2017.

DOI Google Scholar

[6]

W. Li, R. Zhao, T. Xiao, and X. Wang, Deepreid: Deep filter pairing neural network for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 152–159.

DOI

[7]

L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, and Q. Tian, Scalable person re-identification: A benchmark, in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 1116–1124.

DOI

[8]

D. Cheng, Y. Gong, S. Zhou, J. Wang, and N. Zheng, Person re-identification by multi-channel parts-based CNN with improved triplet loss function, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 1335–1344.

DOI

[9]

Y. Yang, J. Yang, J. Yan, S. Liao, D. Yi, and S. Z. Li, Salient color names for person re-identification, in Proc. of European Conference on Computer Vision, Zurich, Switzerland, 2014, pp. 536–551.

DOI

[10]

S. Liao, Y. Hu, X. Zhu, and S. Z. Li, Person re-identification by local maximal occurrence representation and metric learning, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 2197–2206.

DOI

[11]

S. E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, Convolutional pose machines, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4724–4732.

DOI

[12]

E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A deeper, stronger, and faster multi-person pose estimation model, in Proc. of European Conference on Computer Vision, Amsterdam, The Netherlands, 2016, pp. 34–50.

DOI

[13]

A. Newell, K. Yang, and J. Deng, Stacked hourglass networks for human pose estimation, in Proc. of European Conference on Computer Vision, Amsterdam, The Netherlands, 2016, pp. 483–499.

DOI

[14]

M. Farenzena, L. Bazzani, A. Perina, V. Murino, and M. Cristani, Person re-identification by symmetry-driven accumulation of local features, in Proc. of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 2010, pp. 2360–2367.

DOI

[15]

J. Van de Weijer, C. Schmid, J. Verbeek, and D. Larlus, Learning color names for real-world applications, IEEE Transactions on Image Processing, vol. 18, no. 7, pp. 1512–1523, 2009.

DOI Google Scholar

[16]

E. Ahmed, M. Jones, and T. K. Marks, An improved deep learning architecture for person re-identification， in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 3908–3916.

DOI

[17]

L. Zheng, H. Zhang, S. Sun, M. Chandraker, Y. Yang, and Q. Tian, Person re-identification in the wild, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 2017, pp. 1367–1376.

DOI

[18]

M. Hirzer, P. M. Roth, M. Kostinger, and H. Bischof, Relaxed pairwise learned metric for person re-identification, in Proc. of European Conference on Computer Vision, Heidelberg, Germany, 2012, pp. 780–793.

DOI

[19]

L. Zhang, T. Xiang, and S. Gong, Learning a discriminative null space for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 1239–1248.

DOI

[20]

S. Lazebnik, C. Schmid, and J. Ponce, Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories, in Proc. of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, USA, 2006, pp. 2169–2178.

[21]

R. Zhao, W. Ouyang, and X. Wang, Unsupervised salience learning for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 3586–3593.

DOI

[22]

R. Zhao, W. Ouyang, and X. Wang, Learning mid-level filters for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 144–151.

DOI

[23]

H. Yao, S. Zhang, R. Hong, Y. Zhang, C. Xu, and Q. Tian, Deep representation learning with part loss for person re-identification, IEEE Transactions on Image Processing, vol. 28, no. 6, pp. 2860–2871, 2019.

DOI Google Scholar

[24]

X. Liu, H. Zhao, M. Tian, L. Sheng, J. Shao, S. Yi, and X. Wang, Hydraplus-net: Attentive deep features for pedestrian analysis, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 350–359.

DOI

[25]

L. Zhao, X. Li, Y. Zhuang, and J. Wang, Deeply-learned part-aligned representations for person re-identification, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 3219–3228.

DOI

[26]

H. Fang, S. Gupta, F. Iandola, R. K. Srivastava, L. Deng, P. Dollár, and C. L. Zitnick, From captions to visual concepts and back, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 1473–1482.

DOI

[27]

H. Liu, J. Feng, M. Qi, J. Jiang, and S. Yan, End-to-end comparative attention networks for person re-identification, IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 3492–3506, 2017.

DOI Google Scholar

[28]

Z. Zheng, L. Zheng, and Y. Yang, Unlabeled samples generated by GAN improve the person re-identification baseline in vitro, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 3754–3762.

DOI

[29]

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, in Proc. of Advances in Neural Information Processing Systems, Portland, OR, USA, 2012, pp. 1097–1105.

[30]

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 770–778.

DOI

[31]

D. Gray, S. Brennan, and H. Tao, Evaluating appearance models for recognition, reacquisition, and tracking, in Proc. IEEE International Workshop on Performance Evaluation for Tracking and Surveillance (PETS), Las Vegas, NV, USA, 2007, pp. 1–7.

[32]

Z. Li, S. Chang, F. Liang, T. S. Huang, L. Cao, and J. R. Smith, Learning locally-adaptive decision functions for person verification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 3610–3617.

DOI

[33]

L. Ma, X. Yang, and D. Tao, Person re-identification over camera networks using multi-task distance metric learning, IEEE Transactions on Image Processing, vol. 23, no. 8, pp. 3656–3670, 2014.

DOI Google Scholar

[34]

R. Zhao, W. Ouyang, and X. Wang, Person re-identification by salience matching, in Proceedings of the IEEE International Conference on Computer Vision, Sydney, Australia, 2013, pp. 2528–2535.

DOI

[35]

S. Ding, L. Lin, G. Wang, and H. Chao, Deep feature learning with relative distance comparison for person re-identification, IEEE Transactions on Pattern Recognition, vol. 48, no. 10, pp. 2993–3003, 2015.

DOI Google Scholar

[36]

M. Koestinger, M. Hirzer, P. Wohlhart, P. M. Roth, and H. Bischof, Large scale metric learning from equivalence constraints, in Proc. of 2012 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2012, pp. 2288–2295.

DOI

[37]

B. McFee and G. R.Lanckriet, Metric learning to rank, in Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 2010, pp. 775–782.

[38]

S. Wu, Y. C. Chen, X. Li, A. C. Wu, J. J. You, and W. S. Zheng, An enhanced deep feature representation for person re-identification, in Proc. of 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA, 2016, pp. 1–8.

DOI

[39]

F. Xiong, M. Gou, O. Camps, and M. Sznaier, Person re-identification using kernel-based metric learning methods, in Proc. of European Conference on Computer Vision, Zurich, Switzerland, 2014, pp. 1–16.

DOI

[40]

Y. Sun, L. Zheng, W. Deng, and S. Wang, Svdnet for pedestrian retrieval, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 3800–3808.

DOI

[41]

Z. Zheng, L. Zheng, and Y. Yang, Pedestrian alignment network for large-scale person re-identification, IEEE Transactions on Circuits and Systems for Video Technology, vol. 25, no. 5, pp. 2860–2871, 2018.

Google Scholar

[42]

E. Ustinova, Y. Ganin, and V. Lempitsky, Multi-region bilinear convolutional neural networks for person re-identification, in Proc. of 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy, 2017, pp. 1–6.

DOI

[43]

Y. Chen, X. Zhu, and S. Gong, Person re-identification by deep learning multi-scale representations, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 2590–2600.

DOI

[44]

Z. Zhong, L. Zheng, G. Kang, S. Li, and Y. Yang, Random erasing data augmentation，arXiv preprint arXiv: 1708. 04896, 2017.

Google Scholar

About this article

Publication history

Acknowledgements

Rights and permissions

Publication history

Received: 28 December 2018

Revised: 22 June 2019

Accepted: 17 July 2019

Published: 07 October 2019

Issue date: June 2020

Copyright

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61771288 and 61701277) and the State Key Development Program of the 13th Five-Year Plan (No. 2017YFC0821601).

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).