Classification of Medical Image Notes for Image Labeling by Using MinBERT

Bokai Yang; Yujie Yang; Qi Li; Denan Lin; Ye Li; Jing Zheng; Yunpeng Cai

doi:10.26599/TST.2022.9010012

AI Chat Paper

Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.

Chat more with AI

| Sign up

Browse by Subject

Search for peer-reviewed journals with full access.

Journals A - Z

About Us

Discover the SciOpen Platform and Achieve Your Research Goals with Ease.

About Us

Publish with Us

Support

Search articles, authors, keywords, DOl and etc.

Published Date

Reset Search

{{expandStatus?'Exit ':''}}Advanced Search

Journals A - Z

About Us

Publish with Us

Support

PDF (14.3 MB)

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

AI Chat Paper

Show Outline

Outline

Show full outline

Hide outline

Outline

Show full outline

Hide outline

Open Access

Classification of Medical Image Notes for Image Labeling by Using MinBERT

Bokai Yang^{¹^,³}, Yujie Yang^{¹^,³}, Qi Li^{¹^,³}, Denan Lin^², Ye Li^{¹^,³}, Jing Zheng^²(

), Yunpeng Cai^{¹^,³}(

)

1Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China

2Shenzhen Health Development Research and Data Management Center, Shenzhen 518055, China

3University of Chinese Academy of Sciences, Beijing 100049, China

Show Author Information

Abstract

The lack of labeled image data poses a serious challenge to the application of artificial intelligence (AI) in medical image diagnosis. Medical image notes contain valuable patient information that could be used to label images for machine learning tasks. However, most image note texts are unstructured with heterogeneity and short-paragraph characters, which fail traditional keyword-based techniques. We utilized a deep learning approach to recover missing labels for medical image notes automatically by using a combination of deep word embedding and deep neural network classifiers. Bidirectional encoder representations from transformers trained on medical image notes corpus (MinBERT) were proposed. We applied the proposed techniques to two typical classification tasks: Medical image type identification and clinical diagnosis identification. The two methods significantly outperformed baseline methods and presented high accuracies of 99.56 $%$ and 99.72 $%$ in image type identification and of 94.56 $%$ and 92.45 $%$ in clinical diagnosis identification. Visualization analysis further indicated that word embedding could efficiently capture semantic similarities and regularities across diverse expressions. Results indicated that our proposed framework could accurately recover the missing label information of medical images through the automatic extraction of electronic medical record information. Hence, it could serve as a powerful tool for exploring useful training data in various medical AI applications.

Keywords

MinBERT convolutional neural network electronic medical record medical image labeling word embedding

References

【1】

Crossref Google Scholar

Tsinghua Science and Technology

Volume 28 Issue 4,
August 2023

Pages 613-627

DOI: 10.26599/TST.2022.9010012

	{{item.num}}
{{version.versionName}} Author Response
{{version.versionName}} Review comment

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Cite this Report

. . , , {{reviewData.reportCite.doi}}

Cite this article:

Yang B, Yang Y, Li Q, et al. Classification of Medical Image Notes for Image Labeling by Using MinBERT. Tsinghua Science and Technology, 2023, 28(4): 613-627. https://doi.org/10.26599/TST.2022.9010012

4336

Views

625

Downloads

Crossref

Web of Science

Scopus

CSCD

Google Scholar
Citation

Received: 05 January 2022

Revised: 29 April 2022

Accepted: 18 May 2022

Published: 06 January 2023

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).