Original Research | Open Access

LWD-3D: Lightweight Detector Based on Self-Attention for 3D Object Detection

Shuo Yang¹, Huimin Lu¹,² (✉), Tohru Kamiya¹, Yoshihisa Nakatoh¹, Seiichi Serikawa¹
1 School of Engineering, Kyushu Institute of Technology, Fukuoka 804-8550, Japan
2 School of Information Engineering, Yangzhou University, Yangzhou 225127, China

Abstract

Lightweight modules play a key role in 3D object detection for autonomous driving and are a prerequisite for deploying 3D object detectors in practice. Current research still focuses on building complex models and computations that improve detection precision at the expense of running speed. Building a lightweight model that learns global features from point cloud data for 3D object detection therefore remains a significant problem. In this paper, we combine convolutional neural networks with self-attention-based vision transformers to realize lightweight, high-speed computing for 3D object detection. We propose lightweight detection 3D (LWD-3D), a point cloud conversion and lightweight vision transformer for autonomous driving. LWD-3D uses a one-shot regression framework in 2D space to generate 3D object bounding boxes from point cloud data, providing a new feature representation method based on a vision transformer for 3D detection applications. Experimental results on the KITTI 3D dataset show that LWD-3D achieves real-time detection (less than 20 ms per image) and obtains a mean average precision (mAP) 75% higher than that of another real-time 3D detector, with half the number of parameters. Our research extends the application of vision transformers to 3D object detection tasks.
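The abstract describes a pipeline that projects the point cloud into a 2D representation and then applies a lightweight self-attention model for one-shot box regression. As a rough illustration of that idea only (this is not the authors' implementation; the grid bounds, channel counts, and all names below are assumptions), a minimal PyTorch sketch might look like this:

# Hypothetical sketch of the pipeline the abstract describes:
# (1) rasterize a LiDAR point cloud into a bird's-eye-view (BEV) grid,
# (2) extract features with a small CNN,
# (3) apply one global self-attention layer,
# (4) regress dense 3D box parameters in one shot.
# Grid bounds, layer sizes, and names are illustrative, not from LWD-3D.
import torch
import torch.nn as nn

def points_to_bev(points, x_range=(0.0, 70.4), y_range=(-40.0, 40.0), cell=0.4):
    """Rasterize an (N, 3) point cloud into a 1-channel BEV occupancy grid."""
    h = int((y_range[1] - y_range[0]) / cell)   # 200 cells along y
    w = int((x_range[1] - x_range[0]) / cell)   # 176 cells along x
    xs = ((points[:, 0] - x_range[0]) / cell).long().clamp(0, w - 1)
    ys = ((points[:, 1] - y_range[0]) / cell).long().clamp(0, h - 1)
    bev = torch.zeros(1, h, w)
    bev[0, ys, xs] = 1.0   # occupancy only; real encoders also use height/intensity
    return bev

class LightweightBEVDetector(nn.Module):
    """Small CNN backbone + one self-attention block + dense regression head."""
    def __init__(self, channels=64, num_heads=4, box_params=7):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(1, channels, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(channels, channels, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.attn = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        # One-shot head: per-cell (x, y, z, w, l, h, yaw) box parameters.
        self.head = nn.Conv2d(channels, box_params, 1)

    def forward(self, bev):
        f = self.backbone(bev)                          # (B, C, H/4, W/4)
        b, c, h, w = f.shape
        tokens = f.flatten(2).transpose(1, 2)           # (B, H*W, C) tokens
        tokens, _ = self.attn(tokens, tokens, tokens)   # global self-attention
        f = tokens.transpose(1, 2).reshape(b, c, h, w)
        return self.head(f)                             # dense box predictions

if __name__ == "__main__":
    pts = torch.rand(5000, 3) * torch.tensor([70.0, 80.0, 3.0]) \
        - torch.tensor([0.0, 40.0, 1.0])
    out = LightweightBEVDetector()(points_to_bev(pts).unsqueeze(0))
    print(out.shape)   # torch.Size([1, 7, 50, 44])

The quadratic cost of self-attention over the H*W tokens is the main thing such a detector must keep small; applying attention only on a heavily downsampled feature map (stride 4 in this sketch) is one plausible way a design like this stays lightweight.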

CAAI Artificial Intelligence Research
Pages 137-143
Cite this article:
Yang S, Lu H, Kamiya T, et al. LWD-3D: Lightweight Detector Based on Self-Attention for 3D Object Detection. CAAI Artificial Intelligence Research, 2022, 1(2): 137-143. https://doi.org/10.26599/AIR.2022.9150009


Received: 05 December 2022
Revised: 01 January 2023
Accepted: 08 January 2023
Published: 10 March 2023
© The author(s) 2022

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
