Journal Home > Volume 26 , Issue 4

The Histograms of Oriented Gradients (HOG) can produce good results in an image target recognition mission, but it requires the same size of the target images for classification of inputs. In response to this shortcoming, this paper performs spatial pyramid segmentation on target images of any size, gets the pixel size of each image block dynamically, and further calculates and normalizes the gradient of the oriented feature of each block region in each image layer. The new feature is called the Histogram of Spatial Pyramid Oriented Gradients (HSPOG). This approach can obtain stable vectors for images of any size, and increase the target detection rate in the image recognition process significantly. Finally, the article verifies the algorithm using VOC2012 image data and compares the effect of HOG.


menu
Abstract
Full text
Outline
About this article

HSPOG: An Optimized Target Recognition Method Based on Histogram of Spatial Pyramid Oriented Gradients

Show Author's information Shaojun GuoFeng LiuXiaohu YuanChunrong ZouLi ChenTongsheng Shen( )
National Innovation of Defense Technology, Academy of Military Sciences PLA China, Beijing 100071, China.
Department of Automation, Tsinghua University, Beijing 100084, China.

Abstract

The Histograms of Oriented Gradients (HOG) can produce good results in an image target recognition mission, but it requires the same size of the target images for classification of inputs. In response to this shortcoming, this paper performs spatial pyramid segmentation on target images of any size, gets the pixel size of each image block dynamically, and further calculates and normalizes the gradient of the oriented feature of each block region in each image layer. The new feature is called the Histogram of Spatial Pyramid Oriented Gradients (HSPOG). This approach can obtain stable vectors for images of any size, and increase the target detection rate in the image recognition process significantly. Finally, the article verifies the algorithm using VOC2012 image data and compares the effect of HOG.

Keywords: Histograms of Oriented Gradients (HOG), Histogram of Spatial Pyramid Oriented Gradients (HSPOG), object recognition, spatial pyramid segmentation

References(17)

[1]
L. M. Surhone, M. T. Tennoe, and S. F. Henssonow, Histogram of oriented gradients, Betascript Publishing, vol. 12, no. 4, pp. 1368-1371, 2010.
[2]
B. Liang and L. Zheng, Diffractive phase elements based on two-dimensional artificial dielectrics, presented at the 22th International Conference on Pattern Recognition, Stockholm, Sweden, 2014.
[3]
Q. Liu, Z. G. Wu, and J. M. Guo, The conversion of histograms of oriented gradient in different vision-angle and rotation-angle, Control Theory & Applications, vol. 27, no. 9, pp. 1269-1272, 2010.
[4]
S. A. Iamsa and P. Horata, Hand written character recognition using histograms of oriented gradient features in deep learning of artificial neural network, presented at the 3th International Conference on IT Convergence and Security, Macao, China, 2013.
[5]
Y. W. Pang, Y. Yuan, X. L. Li, and J. Pan, Efficient HOG human detection, Signal Processing, vol. 91, no. 4. pp. 773-781, 2011.
[6]
Y. E. Lina, Y. L. Chen, and J. L. Lin, Pedestrian fast detection based on histograms of oriented gradient, Computer Engineering, vol. 36, no. 22, pp. 206-207, 2010.
[7]
K. Grauman and T. Darrell, The pyramid match kernel: Discriminative classification with sets of image features, presented at the 10th IEEE Conference on Computer Vision and Pattern Recognition (CVDR), Beijing, China, 2005.
[8]
N. V. Tavari and A. V. Deorankar, Indian sign language recognition based on histograms of oriented gradient, International Journal of Computer Science & Information Technoloy, vol. 5, no. 3, pp. 3657-3660, 2014.
[9]
H. X. Jia and Y. J. Zhang, Fast human detection by boosting histograms of oriented gradients, presented at the 8th International Conference on Image and Graphics, Tianjin, China, 2007.
[10]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, vol. 25, no. 2, pp. 1-8, 2012.
[11]
M. D. Zeiler and R. Fergus, Visualizing and understanding convolutional networks, presented at the 13th European Conference on Computer Vision (ECCV), Zurich, Switzerland, 2014.
[12]
J. Donahue, Y. Jia, and O. Vinyals, DeCAF: A deep convolutional activation feature for generic visual recognition, https://arxiv.org/abs/1310.1531, 2013.
[13]
R. Girshick, J. Donahue, T. Darrel, and J. Malik, Rich feature hierarchies for accurate object detection and semantic segmentation, presented at the 31th IEEE Conference on Computer Vision, Columbia, CA, USA, 2014.
[14]
K. He, X. Zhang, and S. Ren, Spatial pyramid pooling in deep convolutional networks for visual recognition, IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 37, no. 9, pp. 1904-1916, 2015.
[15]
P. C. Hung, Colorimetric calibration in electronic imaging devices using a look-up-table model and interpolations, Journal of Electronic Imaging, vol. 2, no. 1, p. 53, 1993.
[16]
P. Felzenszwalb, D. Mcallester, and D. Ramanan, A discriminatively trained, multiscale, deformable part model, presented at the 25th Conference on Computer Vision and Pattern Recognition (CVPR), Alaska, AK, USA, 2008.
[17]
J. P. Dong and C. Kim, A hybrid bags-of-feature model for sports scene classification, Journal of Signal Processing Systems, vol. 81, no. 2, pp. 249-263, 2014.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 13 March 2020
Accepted: 29 March 2020
Published: 04 January 2021
Issue date: August 2021

Copyright

© The author(s) 2021

Acknowledgements

This work was partly supported by the National Natural Science Foundation of China (No. 51802348).

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return