Fusion Model for Tentative Diagnosis Inference Based on Clinical Narratives

Ying Yu; Junwen Duan; Min Li

doi:10.26599/TST.2022.9010049

Tsinghua Science and Technology 2023, 28(4): 686-695 https://doi.org/10.26599/TST.2022.9010049

Open Access | Issue | Published: 06 January 2023

Fusion Model for Tentative Diagnosis Inference Based on Clinical Narratives

Show Author's Information Hide Author's Information Ying Yu^{¹^,³}, Junwen Duan^²(

), Min Li^²

1School of Computer Science and Engineering, Central South University, Changsha 410083, China

2Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China

3School of Computer Science, University of South China, Hengyang 421001, China

Keywords:

tentative diagnosis, clinical narrative, Bidirectional Long Short-Term Memory (BiLSTM), Term Frequency-Inverse Document Frequency (TF-IDF), fusion strategy

Cite this article:

Yu Y, Duan J, Li M. Fusion Model for Tentative Diagnosis Inference Based on Clinical Narratives. Tsinghua Science and Technology, 2023, 28(4): 686-695. https://doi.org/10.26599/TST.2022.9010049

Download citation

EndNote(RIS)

BibTeX

473

Views

Downloads

Citations

Crossref

WoS

Scopus

CSCD

Abstract Full text About this article

Abstract

In general, physicians make a preliminary diagnosis based on patients’ admission narratives and admission conditions, largely depending on their experiences and professional knowledge. An automatic and accurate tentative diagnosis based on clinical narratives would be of great importance to physicians, particularly in the shortage of medical resources. Despite its great value, little work has been conducted on this diagnosis method. Thus, in this study, we propose a fusion model that integrates the semantic and symptom features contained in the clinical text. The semantic features of the input text are initially captured by an attention-based Bidirectional Long Short-Term Memory (BiLSTM) network. The symptom concepts, recognized from the input text, are then vectorized by using the term frequency-inverse document frequency method based on the relations between symptoms and diseases. Finally, two fusion strategies are utilized to recommend the most potential candidate for the international classification of diseases code. Model training and evaluation are performed on a public clinical dataset. The results show that both fusion strategies achieved a promising performance, in which the best performance obtained a top-3 accuracy of 0.7412.

Full text

Abstract

Full text

Outline

About this article

Fusion Model for Tentative Diagnosis Inference Based on Clinical Narratives

Show Author's information Hide Author's Information Ying Yu^{¹^,³}, Junwen Duan^²(

), Min Li^²

1School of Computer Science and Engineering, Central South University, Changsha 410083, China

2Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China

3School of Computer Science, University of South China, Hengyang 421001, China

Abstract

Keywords: tentative diagnosis, clinical narrative, Bidirectional Long Short-Term Memory (BiLSTM), Term Frequency-Inverse Document Frequency (TF-IDF), fusion strategy

References(38)

[1]

I. Boas, Early and tentative diagnosis of gastrointestinal carcinoma, Am. J. Cancer, vol. 15, no. 3, pp. 1586–1589, 1931.

Google Scholar

[2]

Y. Yu, M. Li, L. Liu, Y. Li, and J. Wang, Clinical big data and deep learning: Applications, challenges, and future outlooks, Big Data Mining and Analytics, vol. 2, no. 4, pp. 288–305, 2019.

DOI Google Scholar

[3]

E. Choi, M. T. Bahadori, J. A. Kulas, A. Schuetz, W. F. Stewart, and J. Sun, RETAIN: An interpretable predictive model for healthcare using reverse time attention mechanism, in Proc. 30^th Int. Conf. on Neural Information Processing Systems, Barcelona, Spain, 2016, pp. 3512–3520.

Google Scholar

[4]

F. Ma, R. Chitta, J. Zhou, Q. You, T. Sun, and J. Gao, Dipole: Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks, in Proc. 23^rd ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, Halifax, Canada, 2017, pp. 1903–1911.

DOI Google Scholar

[5]

F. Ma, J. Gao, Q. Suo, Q. You, J. Zhou, and A. Zhang, Risk prediction on electronic health records with prior medical knowledge, in Proc. 24^th ACM SIGKDD Int. Conf. on Knowledge Discovery & Data, London, UK, 2018, pp. 1910–1919.

DOI Google Scholar

[6]

H. Liang, B. Y. Tsui, H. Ni, C. C. S. Valentim, S. L. Baxter, G. Liu, W. Cai, D. S. Kermany, X. Sun, J. Chen, et al., Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence, Nat. Med., vol. 25, no. 3, pp. 433–438, 2019.

DOI Google Scholar

[7]

A. Graves and J. Schmidhuber, Framewise phoneme classification with bidirectional LSTM and other neural network architectures, Neural Netw., vol. 18, nos. 5&6, pp. 602–610, 2005.

DOI Google Scholar

[8]

A. R. Aronson, Effective mapping of biomedical text to the UMLS metathesaurus: The MetaMap program, in Proc. AMIA 2001, Washington, DC, USA, 2001, p. 17.

Google Scholar

[9]

G. Salton, A. Wong, and C. S. Yang, A vector space model for automatic indexing, Commun. ACM, vol. 18, no. 11, pp. 613–620, 1975.

DOI Google Scholar

[10]

Y. Yu, M. Li, L. Liu, F. X. Wu, and J. Wang, Tentative diagnosis prediction via deep understanding of patient narratives, in Proc. 2019 IEEE Int. Conf. on Bioinformatics and Biomedicine (BIBM), San Diego, CA, USA, 2019, pp. 1000–1003.

DOI Google Scholar

[11]

A. N. Jagannatha and H. Yu, Bidirectional RNN for medical event detection in electronic health records, in Proc. 2016 Conf. North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA, 2016, pp. 473–482.

DOI Google Scholar

[12]

Y. Luo, Recurrent neural networks for classifying relations in clinical notes, J. Biomed. Inform., vol. 72, pp. 85–95, 2017.

DOI Google Scholar

[13]

S. Gao, M. T. Young, J. X. Qiu, H. J. Yoon, J. B. Christian, P. A. Fearn, G. D. Tourassi, and A. Ramanthan, Hierarchical attention networks for information extraction from cancer pathology reports, J. Am. Med. Inform. Assoc., vol. 25, no. 3, pp. 321–330, 2018.

DOI Google Scholar

[14]

L. Gligic, A. Kormilitzin, P. Goldberg, and A. Nevado-Holgado, Named entity recognition in electronic health records using transfer learning bootstrapped neural networks, Neural Netw., vol. 121, pp. 132–139, 2020.

DOI Google Scholar

[15]

S. Gehrmann, F. Dernoncourt, Y. Li, Y. Li, E. T. Carlson, J. T. Wu, J. Welt, J. Foote Jr., E. T. Moseley, D. W. Grant, et al., Comparing rule-based and deep learning models for patient phenotyping, arXiv preprint arXiv: 1703.08705, 2017.

Google Scholar

[16]

H. Shi, P. Xie, Z. Hu, M. Zhang, and E. P. Xing, Towards automated ICD coding using deep learning, arXiv preprint arXiv: 1711.04075, 2017.

Google Scholar

[17]

W. Ning, M. Yu, and R. Zhang, A hierarchical method to automatically encode Chinese diagnoses through semantic similarity estimation, BMC Med. Inform. Decis. Mak., vol. 16, p. 30, 2016.

DOI Google Scholar

[18]

T. Baumel, J. Nassour-Kassis, R. Cohen, M. Elhadad, and N. Elhadad, Multi-label classification of patient notes a case study on ICD code assignment, arXiv preprint arXiv: 1709.09587, 2017.

Google Scholar

[19]

M. Li, Z. Fei, M. Zeng, F. X. Wu, Y. Li, Y. Pan, and J. Wang, Automated ICD-9 coding via a deep learning approach, IEEE/ACM Trans. Comput. Biol. Bioinform., vol. 16, no. 4, pp. 1193–1202, 2019.

DOI Google Scholar

[20]

M. Zeng, M. Li, Z. Fei, Y. Yu, Y. Pan, and J. Wang, Automatic ICD-9 coding via deep transfer learning, Neurocomputing, vol. 324, pp. 43–50, 2019.

DOI Google Scholar

[21]

Y. Wu, M. Zeng, Z. Fei, Y. Yu, F. X. Wu, and M. Li, KAICD: A knowledge attention-based deep learning framework for automatic ICD coding, Neurocomputing, vol. 469, pp. 376–383, 2022.

DOI Google Scholar

[22]

Z. Liu, B. Tang, X. Wang, and Q. Chen, De-identification of clinical notes via recurrent neural network and conditional random field, J. Biomed. Inform., vol. 75, pp. S34–S42, 2017.

DOI Google Scholar

[23]

F. Dernoncourt, J. Y. Lee, O. Uzuner, and P. Szolovits, De-identification of patient notes with recurrent neural networks, J. Am. Med. Inform. Assoc., vol. 24, no. 3, pp. 596–606, 2017.

DOI Google Scholar

[24]

Y. Yu, M. Li, L. Liu, Z. Fei, F. X. Wu, and J. Wang, Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN, J. Biomed. Inform., vol. 91, p. 103114, 2019.

DOI Google Scholar

[25]

G. B. Moody and R. G. Mark, A database to support development and evaluation of intelligent intensive care monitoring, in Proc. of Computers in Cardiology 1996, Indianapolis, IN, USA, 2002, pp. 657–660.

Google Scholar

[26]

M. Saeed, M. Villarroel, A. T. Reisner, G. Clifford, L. W. Lehman, G. Moody, T. Heldt, T. H. Kyaw, B. Moody, and R. G. Mark, Multiparameter intelligent monitoring in intensive care II: A public-access intensive care unit database, Crit. Care Med., vol. 39, no. 5, pp. 952–960, 2011.

DOI Google Scholar

[27]

A. E. W. Johnson, T. J. Pollard, L. Shen, L. W. H. Lehman, M. Feng, M. Ghassemi, B. Moody, P. Szolovits, L. A. Celi, and R. G. Mark, MIMIC-III, a freely accessible critical care database, Sci. Data, vol. 3, p. 160035, 2016.

DOI Google Scholar

[28]

A. R. Aronson and F. M. Lang, An overview of MetaMap: Historical perspective and recent advances, J. Am. Med. Inform. Assoc., vol. 17, no. 3, pp. 229–236, 2010.

DOI Google Scholar

[29]

P. Sondhi, J. Sun, H. Tong, and C. Zhai, SympGraph: A framework for mining clinical notes through symptom relation graphs, in Proc. 18^th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, Beijing, China, 2012, pp. 1167–1175.

DOI Google Scholar

[30]

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural Comput., vol. 9, no. 8, pp. 1735–1780, 1997.

DOI Google Scholar

[31]

A. Graves and N. Jaitly, Towards end-to-end speech recognition with recurrent neural networks, in Proc. 31^st Int. Conf. on Int. Conf. on Machine Learning, Beijing, China, 2014, pp. 1764–1772.

Google Scholar

[32]

J. Chorowski, D. Bahdanau, D. Serdyuk, K. Cho, and Y. Bengio, Attention-based models for speech recognition, in Proc. 28^th Int Conf on Neural Information Processing Systems, Montreal, Canada, 2015, pp. 577–585.

Google Scholar

[33]

Q. You, H. Jin, Z. Wang, C. Fang, and J. Luo, Image captioning with semantic attention, in Proc. 2016 IEEE Conf. on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4651–4659.

DOI Google Scholar

[34]

P. Zhou, W. Shi, J. Tian, Z. Qi, B. Li, H. Hao, and B. Xu, Attention-based bidirectional long short-term memory networks for relation classification, in Proc. 54^th Annu. Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Berlin, Germany, 2016, pp. 207–212.

DOI Google Scholar

[35]

L. Luo, Z. Yang, P. Yang, Y. Zhang, L. Wang, H. Lin, and J. Wang, An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition, Bioinformatics, vol. 34, no. 8, pp. 1381–1388 2017.

DOI Google Scholar

[36]

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł Kaiser, and I. Polosukhin, Attention is all you need, in Proc. 31^st Int. Conf. on Neural Information Processing Systems, Long Beach, CA, USA, 2017, pp. 6000–6010.

Google Scholar

[37]

K. S. Jones, A statistical interpretation of term specificity and its application in retrieval, J. Doc., vol. 28, no. 1, pp. 11–21, 1972.

DOI Google Scholar

[38]

X. Zhou, J. Menche, A. L. Barabási, and A. Sharma, Human symptoms-disease network, Nat. Commun., vol. 5, p. 4212, 2014.

DOI Google Scholar

About this article

Publication history

Acknowledgements

Rights and permissions

Publication history

Received: 08 August 2022

Revised: 04 September 2022

Accepted: 10 October 2022

Published: 06 January 2023

Issue date: August 2023

Copyright

Acknowledgements

We thank the anonymous reviewers for their helpful comments. This work was supported in part by the Science and Technology Major Project of Changsha (No. kh2202004) and the National Natural Science Foundation of China (No. 62006251). We are grateful for resources from the High-Performance Computing Center of Central South University.

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).