GoM-ICD: Automatic ICD Coding with Gap Schemes and Mixture of Experts

Yifan Wu; Weiyan Qiu; Min Zeng; Xi Chen; Min Li; Hongtao Zhu

doi:10.26599/BDMA.2025.9020019

AI Chat Paper

Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.

Chat more with AI

| Sign up

Browse by Subject

Search for peer-reviewed journals with full access.

Journals A - Z

About Us

Discover the SciOpen Platform and Achieve Your Research Goals with Ease.

About Us

Publish with Us

Support

Search articles, authors, keywords, DOl and etc.

Published Date

Reset Search

{{expandStatus?'Exit ':''}}Advanced Search

Journals A - Z

About Us

Publish with Us

Support

PDF (5 MB)

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

AI Chat Paper

Show Outline

Outline

Show full outline

Hide outline

Outline

Show full outline

Hide outline

Open Access

GoM-ICD: Automatic ICD Coding with Gap Schemes and Mixture of Experts

Yifan Wu^{¹^,^Y}, Weiyan Qiu^{¹^,^Y}, Min Zeng^¹, Xi Chen^², Min Li^¹, Hongtao Zhu^²(

)

1School of Computer Science and Engineering, Central South University, Changsha 410083, China

2Second Xiangya Hospital, Central South University, Changsha 410011, China

Yifan Wu and Weiyan Qiu contributed equally to this work.

Show Author Information

Abstract

Assigning standardized International Classification of Disease (ICD) codes to Electronic Medical Records (EMR) is crucial for enhancing the efficiency and accuracy of medical coding processes. However, existing methods face challenges in effectively capturing, integrating, and amalgamating specialized medical knowledge from complex textual data. In this study, we propose GoM-ICD, an advanced automatic ICD coding framework that integrates multiple gap schemes with a Mixture of Experts (MoE) architecture. GoM-ICD is designed to address the extreme multilabel text classification in ICD coding. It segments and reassembles text to facilitate seamless information exchange across different chunks, employing various segmentation methods derived from different gap schemes. Then the model-level MoE consolidates the predictions of these methods to enhance the prediction performance. Specifically, the segmented text is input to a Pretrained Language Model (PLM) to extract textual features. Next, a Bidirectional Long Short-Term Memory network (BiLSTM) is employed to capture long-term contextual information from the textual features. Finally, a text-level MoE, followed by a label-level MoE, enables precise attention matching between text and labels, thereby improving the fidelity of the coding process. The three levels of MoE leverage the collective insights of diverse expert models, effectively processing multi-dimensional text features and unifying model-level insights from various gap schemes. Extensive experimental results demonstrate that GoM-ICD achieves the state-of-the-art performance in automatic ICD coding tasks, reaching micro-F1 of 0.617, 0.620, and 0.613 on datasets MIMIC-III full, MIMIC-III clean, and MIMIC-IV ICD-10, respectively. The source code can be obtained from https://github.com/CSUBioGroup/GoM-ICD.

Keywords

automatic International Classification of Disease (ICD) coding mixture of experts (MoE)multi-label text classification Electronic Medical Record (EMR)

References

【1】

Crossref Google Scholar

Big Data Mining and Analytics

Volume 8 Issue 6,
December 2025

Pages 1211-1224

DOI: 10.26599/BDMA.2025.9020019

	{{item.num}}
{{version.versionName}} Author Response
{{version.versionName}} Review comment

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Cite this Report

. . , , {{reviewData.reportCite.doi}}

Cite this article:

Wu Y, Qiu W, Zeng M, et al. GoM-ICD: Automatic ICD Coding with Gap Schemes and Mixture of Experts. Big Data Mining and Analytics, 2025, 8(6): 1211-1224. https://doi.org/10.26599/BDMA.2025.9020019

1854

Views

285

Downloads

Crossref

Web of Science

Scopus

CSCD

Google Scholar
Citation

Received: 02 October 2024

Revised: 28 December 2024

Accepted: 08 February 2025

Published: 19 September 2025

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).