
Attention mechanisms in computer vision: A survey

Meng-Hao Guo1, Tian-Xing Xu1, Jiang-Jiang Liu2, Zheng-Ning Liu1, Peng-Tao Jiang2, Tai-Jiang Mu1, Song-Hai Zhang1, Ralph R. Martin3, Ming-Ming Cheng2, Shi-Min Hu1 (corresponding author)
1 BNRist, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
2 TKLNDST, College of Computer Science, Nankai University, Tianjin 300350, China
3 School of Computer Science and Informatics, Cardiff University, Cardiff, UK

Abstract

Humans can naturally and effectively find salient regions in complex scenes. Motivated by this observation, attention mechanisms were introduced into computer vision with the aim of imitating this aspect of the human visual system. Such an attention mechanism can be regarded as a dynamic weight adjustment process based on features of the input image. Attention mechanisms have achieved great success in many visual tasks, including image classification, object detection, semantic segmentation, video understanding, image generation, 3D vision, multi-modal tasks, and self-supervised learning. In this survey, we provide a comprehensive review of various attention mechanisms in computer vision and categorize them according to approach, such as channel attention, spatial attention, temporal attention, and branch attention; a related repository https://github.com/MenghaoGuo/Awesome-Vision-Attentions is dedicated to collecting related work. We also suggest future directions for attention mechanism research.
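
To make "dynamic weight adjustment based on features of the input" concrete, the sketch below shows a minimal channel-attention block in the squeeze-and-excitation style, written in PyTorch. It is an illustrative simplification, not the reference implementation of any surveyed method; the class name ChannelAttention and the reduction ratio of 16 are our own assumptions.

import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    """Minimal SE-style channel attention: each channel is re-scaled by a
    weight computed from the input's own global statistics."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)      # squeeze: global average over H x W
        self.fc = nn.Sequential(                 # excitation: per-channel gates in [0, 1]
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.pool(x).view(b, c)              # (B, C) channel descriptors
        w = self.fc(w).view(b, c, 1, 1)          # (B, C, 1, 1) dynamic weights
        return x * w                             # re-weight the input feature map

# Usage: attach after any convolutional stage; output shape equals input shape.
feat = torch.randn(2, 64, 32, 32)
out = ChannelAttention(64)(feat)

Spatial, temporal, and branch attention follow the same recipe, but the learned weights are applied over spatial positions, video frames, or parallel branches rather than channels.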

Keywords: attention, computer vision, deep learning, transformer, salience


Publication history

Received: 31 December 2021
Accepted: 18 January 2022
Published: 15 March 2022
Issue date: September 2022

Copyright

© The Author(s) 2022.

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant Nos. 61521002 and 62132012). We would like to thank Cheng-Ze Lu, Zhengyang Geng, Shilong Liu, He Wang, Huiying Lu, and Chenxi Huang for their helpful discussions and insightful suggestions.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.
