AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (1,004 KB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Lightweight Human Behavior Recognition Method for Visual Communication AGV Based on CNN-LSTM

Shuhua Zhao1Jianxin Zhu1( )Jiang Lu1Zhibo Ju1Dong Wu1
Anyang Cigarette Factory, China Tobacco Henan Industrial Co., Ltd., Anyang 455000, China
Show Author Information

Abstract

Behavior recognition uses deep learning network model to automatically extract the deep features of data, but traditional machine learning algorithms have some problems such as manual feature extraction and poor generalization ability of models. The S-MobileNet is proposed for human behavior recognition. Firstly, the 3D convolution to extract features is used to build a time series model to learn the long-term dependence of human behavior characteristics on time series. Secondly, Long Short-Term Memory (LSTM) is used as the input of multi-layer recurrent neural network time series model, so as to obtain individual dynamic features, and then individual features are aggregated by attention pooling mechanism to obtain corresponding group behavior features. At last, the recognition of individual behavior and group behavior is completed by relying on the characteristics of individual and group behavior. The experiments show that the network in this paper achieves high recognition accuracy on UCF101 and HMDB51 datasets, and the overall recognition rate of proposed model for 13 kinds of human behaviors is 95.3%.

References

[1]

H. Lu, H. Zhao, T. Wang, J. Lin, L. Geng, and H. Liu, Infrared pedestrian behavior recognition based on lightweight convolutional neural network, (in Chinese), Transducer and Microsystem Technologies, vol. 41, no. 9, pp. 129–131&135, 2022.

[2]

Y. C. Shi, S. Yan, D. Y. Yao, and Y. Zhang, SVM-LSTM-based car-following behavior recognition and information credibility discirmination, (in Chinese), J. Traffic Transp. Eng., vol. 22, no. 3, pp. 115–125, 2022.

[3]

S. Zhou, K. Chen, and Q. Xu, Behavior recognition of phased array radar based on two-level convolutional neural network, (in Chinese), Shipboard Electronic Countermeasure, vol. 45, no. 3, pp. 37–42&47, 2022.

[4]

J. Zhu, S. Wei, J. Tang, Y. Hu, X. Dai, Y. Zi, M. Wang, Y. Xiang, and W. Huang, MXene V2CTx nanosheet/bismuth quantum dot-based heterostructures for enhanced flexible photodetection and nonlinear photonics, ACS Applied Nano Materials, vol. 6, no. 14, pp. 13629–13636, 2023.

[5]
L. P. Hu, Research on the model of students’ classroom behavior recognition based on deep learning, (in Chinese), PhD dissertation, Hangzhou Normal University, Hangzhou, China, 2021.
[6]

M. Huang, R. X. Shang, and H. M. Qian, Composite deep neural network for human activities recognition in video, (in Chinese), Pattern Recognit. Artif. Intell., vol. 35, no. 6, pp. 562–570, 2022.

[7]
K. Qi, Design and implementation of DAC in cow behavior recognition and control circuit based on inertial sensor, (in Chinese), PhD dissertation, Shandong Agricultural University, Taian, China, 2022.
[8]
C. Yang, Recognition of abnormal behavior of logistics warehouse staff based on computer vision, (in Chinese), PhD dissertation, Anhui University of Science and Technology, Chuzhou, Anhui, 2021.
[9]
B. Tian, Research and implementation of subway driver behavior recognition algorithm based on deep learning, (in Chinese), PhD dissertation, Jiangxi University of Science and Technology, Ganzhou, Jiangxi, 2021.
[10]

Y. Zi, J. Zhu, M. Wang, L. Hu, Y. Hu, S. Wageh, O. A. Al-Hartomy, A. Al-Ghamdi, W. Huang, and H. Zhang, CdS@CdSe core/shell quantum dots for highly improved self-powered photodetection performance, Inorg. Chem., vol. 60, no. 24, pp. 18608–18613, 2021.

[11]

Y. Zhong and Z. Fang, Human abnormal behavior recognition model based on recurrent neural network, (in Chinese), Intelligent Computer and Applications, vol. 11, no. 11, pp. 76–78&83, 2021.

[12]

W. Huang, Y. Zhang, Q. You, P. Huang, Y. Wang, Z. N. Huang, Y. Ge, L. Wu, Z. Dong, X. Dai, et al., Enhanced photodetection properties of Tellurium@Selenium roll-to-roll nanotube heterojunctions, Small, vol. 15, no. 23, p. e1900902, 2019.

[13]

B. Zhou and J. F. Li, Behavior recognition based on multi-stream convolutional neural network, (in Chinese), Computer Systems & Applications, vol. 30, no. 8, pp. 118–125, 2021.

[14]
Z. Zhang, Research on multi-perspective behavior recognition method based on discriminant joint deep neural network, (in Chinese), PhD dissertation, Tianjin Normal University, Tianjin, China, 2019.
[15]

H. Wang, Y. S. Deng, Z. W. Chang, J. Wu, and M. J. Chen, Behavior recognition technology of power operating person based on deep learning, (in Chinese), Sichuan Electr. Power Technol., vol. 45, no. 3, pp. 23–28, 2022.

[16]

J. Han, C. Xiao, X. Song, J. Lu, and F. Wang, Identification method of electricity stealing behavior of energy Internet based on GA-BP neural network, (in Chinese), Electric Transmission, vol. 52, no. 14, pp. 38–44, 2022.

[17]
X. Liu, Research and application of human behavior recognition technology based on deep learning, (in Chinese), PhD dissertation, Beijing University of Posts and Telecommunications, Beijing, China, 2019.
[18]

Y. D. Tan, Y. X. Wang, S. Y. Chen, and Y. L. Miao, (2+1)D multi-spatio-temporal information fusion model and its application in behaviour recognition, (in Chinese), Inf. Contr., vol. 48, no. 6, pp. 715–722, 2019.

[19]

D. Xu, Y. Dai, and J. H. Ji, Research on driver behavior recognition method based on convolutional neural network, (in Chinese), China Saf. Sci. J., vol. 29, no. 10, pp. 12–17, 2019.

[20]

Y. Kong and Y. Fu, Human action recognition and prediction: A survey, Int. J. Comput. Vis., vol. 130, no. 5, pp. 1366–1401, 2022.

[21]
B. Yao, X. Jiang, A. Khosla, A. L. Lin, L. Guibas, and F. F. Li, Human action recognition by learning bases of action attributes and parts, in Proc. Int. Conf. Computer Vision, Barcelona, Spain, 2011, pp. 1331–1338.
[22]

H. Rahmani, A. Mian, and M. Shah, Learning a deep model for human action recognition from novel viewpoints, IEEE Trans. Pattern Anal. Mach. Intell., vol. 40, no. 3, pp. 667–681, 2018.

[23]

P. Pareek and A. Thakkar, A survey on video-based human action recognition: Recent updates, datasets, challenges, and applications, Artif. Intell. Rev., vol. 54, no. 3, pp. 2259–2322, 2021.

[24]
Y. Zhou, X. Sun, Z. J. Zha, and W. Zeng, MiCT: Mixed 3D/2D convolutional tube for human action recognition, in Proc. IEEE/CVF Conf. Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 449–458.
[25]
L. Sun, K. Jia, K. Chen, D. Y. Yeung, B. E. Shi, and S. Savarese, Lattice long short-term memory for human action recognition, in Proc. IEEE Int. Conf. Computer Vision (ICCV), Venice, Italy, 2017, pp. 2166–2175.
[26]

A. Akula, A. K. Shah, and R. Ghosh, Deep learning approach for human action recognition in infrared images, Cogn. Syst. Res., vol. 50, pp. 146–154, 2018.

[27]

V. Mazzia, S. Angarano, F. Salvetti, F. Angelini, and M. Chiaberge, Action Transformer: A self-attention model for short-time pose-based human action recognition, Pattern Recognit., vol. 124, p. 108487, 2022.

[28]

M. A. Khan, Y. D. Zhang, S. A. Khan, M. Attique, A. Rehman, and S. Seo, A resource conscious human action recognition framework using 26-layered deep convolutional neural network, Multimed. Tools Appl., vol. 80, no. 28, pp. 35827–35849, 2021.

International Journal of Crowd Science
Pages 133-138

{{item.num}}

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Close
Close
Cite this article:
Zhao S, Zhu J, Lu J, et al. Lightweight Human Behavior Recognition Method for Visual Communication AGV Based on CNN-LSTM. International Journal of Crowd Science, 2025, 9(2): 133-138. https://doi.org/10.26599/IJCS.2024.9100014

134

Views

6

Downloads

1

Crossref

0

Scopus

Altmetrics

Published: 13 May 2025
© The author(s) 2025.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).