Open Access

Lightweight Multiscale Spatio-Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Lab of Cloud Computing and Big Data Processing, School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China

Abstract

Using skeletal information to model and recognize human actions is currently a hot research topic in the field of Human Action Recognition (HAR). Graph Convolutional Networks (GCNs) have gained popularity in this field owing to their ability to process graph-structured data efficiently. However, current models struggle to handle the long-range dependencies that commonly exist between human skeleton joints, which hinders progress in related fields. To address these problems, the Lightweight Multiscale Spatio-Temporal Graph Convolutional Network (LMSTGCN) is proposed. First, a Lightweight Multiscale Spatial Graph Convolutional Network (LMSGCN) is constructed to capture information at multiple hierarchies; multiple inner connections between skeleton joints are captured by dividing the input features into a number of subsets along the channel dimension. Second, dilated convolution is incorporated into the temporal convolution to construct a Lightweight Multiscale Temporal Convolutional Network (LMTCN), which enlarges the receptive field while keeping the convolution kernel size unchanged. Third, a Spatio-Temporal Location Attention (STLAtt) module identifies the most informative joints in the skeleton sequence at specific frames, thereby improving the model's ability to extract features and recognize actions. Finally, a multi-stream data fusion input structure is used to enhance the input data and enrich the feature information. Experiments on three public datasets demonstrate the effectiveness of the proposed network.
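The paper's implementation is not reproduced on this page, so the sketch below is purely illustrative: it demonstrates only the two mechanisms the abstract describes, namely splitting input features into channel subsets for multiscale spatial aggregation and enlarging the temporal receptive field via dilated convolution. The class name, parameter names, the learnable adjacency matrix, and the exact wiring between subsets are all hypothetical placeholders, not the authors' LMSTGCN.

```python
# Minimal sketch, assuming a PyTorch setting; not the authors' LMSTGCN code.
import torch
import torch.nn as nn


class LMSTBlockSketch(nn.Module):
    """Hypothetical block illustrating channel-subset multiscale spatial
    convolution plus dilated temporal convolution on skeleton features."""

    def __init__(self, channels: int, num_joints: int, num_subsets: int = 4,
                 temporal_kernel: int = 5, dilation: int = 2):
        super().__init__()
        assert channels % num_subsets == 0
        self.num_subsets = num_subsets
        sub = channels // num_subsets
        # One small convolution per channel subset; each subset also receives
        # the previous subset's output, yielding a hierarchy of scales.
        self.spatial_convs = nn.ModuleList(
            nn.Conv2d(sub, sub, kernel_size=1) for _ in range(num_subsets - 1)
        )
        # A learnable adjacency matrix stands in for the skeleton graph.
        self.adj = nn.Parameter(torch.eye(num_joints))
        # Dilated temporal convolution: the kernel size stays fixed while the
        # receptive field grows to dilation * (temporal_kernel - 1) + 1 frames.
        pad = dilation * (temporal_kernel - 1) // 2
        self.temporal_conv = nn.Conv2d(
            channels, channels, kernel_size=(temporal_kernel, 1),
            padding=(pad, 0), dilation=(dilation, 1),
        )
        self.relu = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, frames, joints)
        x = torch.einsum("nctv,vw->nctw", x, self.adj)  # graph aggregation
        subsets = torch.chunk(x, self.num_subsets, dim=1)
        out, prev = [subsets[0]], subsets[0]
        for conv, s in zip(self.spatial_convs, subsets[1:]):
            prev = self.relu(conv(s + prev))  # pass features across subsets
            out.append(prev)
        x = torch.cat(out, dim=1)
        return self.relu(self.temporal_conv(x))


# Example: 2 skeleton sequences, 64 channels, 32 frames, 25 joints
# (25 joints matches the NTU RGB+D layout used in the paper's experiments).
y = LMSTBlockSketch(channels=64, num_joints=25)(torch.randn(2, 64, 32, 25))
print(y.shape)  # torch.Size([2, 64, 32, 25])
```

With temporal_kernel = 5 and dilation = 2, the temporal receptive field covers 9 frames while the layer keeps the parameter count of a 5-tap kernel, which is the trade-off the abstract attributes to the LMTCN.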

Big Data Mining and Analytics
Pages 310–325
Cite this article:
Zheng Z, Yuan Q, Zhang H, et al. Lightweight Multiscale Spatio-Temporal Graph Convolutional Network for Skeleton-Based Action Recognition. Big Data Mining and Analytics, 2025, 8(2): 310–325. https://doi.org/10.26599/BDMA.2024.9020095


Received: 03 May 2024
Revised: 20 November 2024
Accepted: 03 December 2024
Published: 28 January 2025
© The author(s) 2025.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
