AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (5.9 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Multimodal Nested Attention Network for Lymph Node Metastasis Prediction of Thyroid Carcinoma

Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin 300072, China, and also with Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou 310000, China
Zhejiang Cancer Hospital, Hangzhou 310022, China
Zhejiang Cancer Hospital, Hangzhou 310022, China, and also with Department of Ultrasound, Women’s Hospital, Zhejiang University School of Medicine, Hangzhou 310000, China
Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou 310000, China
Shaoxing People’s Hospital, Shaoxing 312000, China
Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, China, and also with Institute of Microelectronics, University of Chinese Academy of Sciences, Beijing 100049, China

These authors contributed equally to this work.

Show Author Information

Abstract

Accurate preoperative prediction of cervical Lymph Node Metastasis (LNM) is critical for surgical decision-making in thyroid cancer patients, and the difficulty in it often leads to over-treatment. UltraSound (US) and Computed Tomography (CT) are two primary non-invasive examinations, but neither method alone provides satisfactory diagnostic accuracy. To address this problem, we propose a Multimodal Nested Attention Network (MNANet) to integrate US and CT images. The network is designed to extract specific complementary information from US and CT images, and comprehensively fuse multimodal features at multiple granularities. In our internal cohort, MNANet achieves Areas Under the Curves (AUCs) of 0.88 and 0.86 for central and lateral cervical sites, respectively, representing a significant improvement of 0.06 to 0.10 compared to unimodal models and outperforming state-of-the-art medical multimodal methods across all metrics.The model demonstrates robust cross-institutional generalization and maintains superior performance across other imaging modalities(e.g., Magnetic Resonance Imaging (MRI)). Additionally, our model exhibits a more precise focus on the thyroid nodule, indicating enhanced learning ability. Moreover, we systematically evaluate the applicability across various clinical characteristics, identifying individuals who can benefit most from the multimodal approach.

Electronic Supplementary Material

Download File(s)
BDMA-2024-0475_ESM.pdf (3.3 MB)

References

【1】
【1】
 
 
Big Data Mining and Analytics
Pages 178-197

{{item.num}}

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Close
Close
Cite this article:
Li G, Yao J, Peng C, et al. Multimodal Nested Attention Network for Lymph Node Metastasis Prediction of Thyroid Carcinoma. Big Data Mining and Analytics, 2026, 9(1): 178-197. https://doi.org/10.26599/BDMA.2025.9020046

1009

Views

52

Downloads

0

Crossref

0

Web of Science

0

Scopus

0

CSCD

Received: 23 October 2024
Revised: 21 March 2025
Accepted: 20 April 2025
Published: 10 December 2025
© The author(s) 2026.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).