HSPOG: An Optimized Target Recognition Method Based on Histogram of Spatial Pyramid Oriented Gradients

Shaojun Guo; Feng Liu; Xiaohu Yuan; Chunrong Zou; Li Chen; Tongsheng Shen

doi:10.26599/TST.2020.9010011

Tsinghua Science and Technology 2021, 26(4): 475-483 https://doi.org/10.26599/TST.2020.9010011

Open Access | Issue | Published: 04 January 2021

National Innovation of Defense Technology, Academy of Military Sciences PLA China, Beijing 100071, China.

Department of Automation, Tsinghua University, Beijing 100084, China.

Keywords: Histograms of Oriented Gradients (HOG), Histogram of Spatial Pyramid Oriented Gradients (HSPOG), object recognition, spatial pyramid segmentation

Cite this article:

Guo S, Liu F, Yuan X, et al. HSPOG: An Optimized Target Recognition Method Based on Histogram of Spatial Pyramid Oriented Gradients. Tsinghua Science and Technology, 2021, 26(4): 475-483. https://doi.org/10.26599/TST.2020.9010011

Abstract

Histograms of Oriented Gradients (HOG) can produce good results in image target recognition, but the method requires all input target images to be the same size for classification. To address this shortcoming, this paper performs spatial pyramid segmentation on target images of any size, determines the pixel size of each image block dynamically, and then calculates and normalizes the oriented gradient features of each block region in each image layer. The new feature is called the Histogram of Spatial Pyramid Oriented Gradients (HSPOG). This approach yields stable feature vectors for images of any size and significantly increases the target detection rate in image recognition. Finally, the article verifies the algorithm on VOC2012 image data and compares its performance with that of HOG.
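The idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `hspog_sketch`, the pyramid levels (1, 2, 4), the 9 unsigned orientation bins, and the per-block L2 normalization are all assumptions chosen for illustration, since the abstract does not specify them. The sketch only shows how deriving block boundaries from the image size could give a descriptor whose length does not depend on the input size.

```python
import numpy as np


def hspog_sketch(image, levels=(1, 2, 4), n_bins=9, eps=1e-6):
    """Illustrative pyramid-of-orientation-histograms descriptor.

    `levels`, `n_bins`, and the L2 normalization are assumptions made
    for this sketch; they are not taken from the paper.
    """
    img = np.asarray(image, dtype=np.float64)
    # Finite-difference gradients along rows (gy) and columns (gx).
    gy, gx = np.gradient(img)
    magnitude = np.hypot(gx, gy)
    # Unsigned orientation in degrees, folded into [0, 180).
    orientation = np.rad2deg(np.arctan2(gy, gx)) % 180.0
    # Clamp guards against a rounding result of exactly 180.0.
    bin_index = np.minimum(
        (orientation / (180.0 / n_bins)).astype(int), n_bins - 1
    )

    h, w = img.shape
    features = []
    for n in levels:
        # Block edges come from the image size, so block size is dynamic.
        row_edges = np.linspace(0, h, n + 1).astype(int)
        col_edges = np.linspace(0, w, n + 1).astype(int)
        for i in range(n):
            for j in range(n):
                rows = slice(row_edges[i], row_edges[i + 1])
                cols = slice(col_edges[j], col_edges[j + 1])
                # Magnitude-weighted orientation histogram for this block.
                hist = np.bincount(
                    bin_index[rows, cols].ravel(),
                    weights=magnitude[rows, cols].ravel(),
                    minlength=n_bins,
                )
                features.append(hist / (np.linalg.norm(hist) + eps))
    return np.concatenate(features)
```

With these assumed defaults, the descriptor length is n_bins times the total number of blocks, 9 × (1 + 4 + 16) = 189, whether the input is 37 × 53 or 120 × 80 pixels. A classical HOG with fixed cell sizes would instead produce vectors whose length grows with the image.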

Publication history

Received: 13 March 2020

Accepted: 29 March 2020

Published: 04 January 2021

Issue date: August 2021

Acknowledgements

This work was partly supported by the National Natural Science Foundation of China (No. 51802348).

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).