Comparison of nomogram and machine‐learning methods for predicting the survival of non‐small cell lung cancer patients

Haike Lei; Xiaosheng Li; Wuren Ma; Na Hong; Chun Liu; Wei Zhou; Hong Zhou; Mengchun Gong; Ying Wang; Guixue Wang; Yongzhong Wu

doi:10.1002/cai2.24

Cancer Innovation 2022, 1(2): 135-145 https://doi.org/10.1002/cai2.24

Original Article |

Open Access | Issue | Published: 30 August 2022

Comparison of nomogram and machine‐learning methods for predicting the survival of non‐small cell lung cancer patients

Show Author's Information Hide Author's Information Haike Lei^¹, Xiaosheng Li^¹, Wuren Ma^², Na Hong^², Chun Liu^², Wei Zhou^¹, Hong Zhou^¹, Mengchun Gong^², Ying Wang^¹, Guixue Wang^³(

), Yongzhong Wu^¹(

)

¹Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, Chongqing University Cancer Hospital, Chongqing, China

²Digital Health China Technologies, Co., Ltd., Beijing, China

³MOE Key Lab for Biorheological Science and Technology, State and Local Joint Engineering Laboratory for Vascular Implants, College of Bioengineering Chongqing University, Chongqing, China

Keywords:

machine learning, nomogram, non‐small cell lung cancer, overall survival, predictive model

Cite this article:

Lei H, Li X, Ma W, et al. Comparison of nomogram and machine‐learning methods for predicting the survival of non‐small cell lung cancer patients. Cancer Innovation, 2022, 1(2): 135-145. https://doi.org/10.1002/cai2.24

Download citation

EndNote(RIS)

BibTeX

787

Views

Downloads

Citations

Crossref

N/A

WoS

Scopus

N/A

CSCD

Abstract Full text About this article

Abstract

Background

Most patients with advanced non‐small cell lung cancer (NSCLC) have a poor prognosis. Predicting overall survival using clinical data would benefit cancer patients by allowing providers to design an optimum treatment plan. We compared the performance of nomograms with machine‐learning models at predicting the overall survival of NSCLC patients. This comparison benefits the development and selection of models during the clinical decision‐making process for NSCLC patients.

Methods

Multiple machine‐learning models were used in a retrospective cohort of 6586 patients. First, we modeled and validated a nomogram to predict the overall survival of NSCLC patients. Subsequently, five machine‐learning models (logistic regression, random forest, XGBoost, decision tree, and light gradient boosting machine) were used to predict survival status. Next, we evaluated the performance of the models. Finally, the machine‐learning model with the highest accuracy was chosen for comparison with the nomogram at predicting survival status by observing a novel performance measure: time‐dependent prediction accuracy.

Results

Among the five machine‐learning models, the accuracy of random forest model outperformed the others. Compared with the nomogram for time‐dependent prediction accuracy with a follow‐up time ranging from 12 to 60 months, the prediction accuracies of both the nomogram and machine‐learning models changed as time varied. The nomogram reached a maximum prediction accuracy of 0.85 in the 60th month, and the random forest algorithm reached a maximum prediction accuracy of 0.74 in the 13th month.

Conclusions

Overall, the nomogram provided more reliable prognostic assessments of NSCLC patients than machine‐learning models over our observation period. Although machine‐learning methods have been widely adopted for predicting clinical prognoses in recent studies, the conventional nomogram was competitive. In real clinical applications, a comprehensive model that combines these two methods may demonstrate superior capabilities.

Full text

Abstract

Full text

Outline

About this article

Comparison of nomogram and machine‐learning methods for predicting the survival of non‐small cell lung cancer patients

Show Author's information Hide Author's Information Haike Lei^¹, Xiaosheng Li^¹, Wuren Ma^², Na Hong^², Chun Liu^², Wei Zhou^¹, Hong Zhou^¹, Mengchun Gong^², Ying Wang^¹, Guixue Wang^³(

), Yongzhong Wu^¹(

)

¹Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, Chongqing University Cancer Hospital, Chongqing, China

²Digital Health China Technologies, Co., Ltd., Beijing, China

³MOE Key Lab for Biorheological Science and Technology, State and Local Joint Engineering Laboratory for Vascular Implants, College of Bioengineering Chongqing University, Chongqing, China

Abstract

Background

Methods

Results

Conclusions

Keywords: machine learning, nomogram, non‐small cell lung cancer, overall survival, predictive model

References(35)

Ambert KH, Cohen AM. A system for classifying disease comorbidity status from medical discharge summaries using automated hotspot and negated concept detection. J Am Med Inform Assoc. 2009;16(4):590–5.

DOI Google Scholar

Xie H, Zhang J‐F, Li Q. Development of a prognostic nomogram for patients with lung adenocarcinoma in the stages I, II, and III based on immune scores. Int J Gen Med. 2021;14:8677–88.

DOI Google Scholar

Ettinger DS, Wood DE, Akerley W, Bazhenova LA, Borghaei H, Camidge DR, et al. NCCN guidelines insights: non‐small cell lung cancer, version 4.2016. J Natl Compr Canc Netw. 2016;14(3):255–64.

DOI Google Scholar

Capanu M, Gönen M. Building a nomogram for survey‐weighted Cox models using R. J Stat Softw. 2015;64:1–17.

DOI Google Scholar

Balachandran VP, Gonen M, Smith JJ, DeMatteo RP. Nomograms in oncology: more than meets the eye. Lancet Oncol. 2015;16(4):e173–80.

DOI Google Scholar

Zhang J, Fan J, Yin R, Geng L, Zhu M, Shen W, et al. A nomogram to predict overall survival of patients with early stage non‐small cell lung cancer. J Thorac Dis. 2019;11(12):5407–16.

DOI Google Scholar

Li G, Tian M, Bing Y, Wang H, Yuan C, Xiu D. Nomograms predict survival outcomes for distant metastatic pancreatic neuroendocrine tumor: a population based STROBE compliant study. Medicine. 2020;99(13):e19593.

DOI Google Scholar

Wang YY, Xiang B‐D, Ma L, Zhong J‐H, Ye J‐Z, Wang K, et al. Development and validation of a nomogram to preoperatively estimate post‐hepatectomy liver dysfunction risk and long‐term survival in patients with hepatocellular carcinoma. Ann Surg. 2021;274(6):e1209–17.

DOI Google Scholar

Benoit L, Balaya V, Guani B, Bresset A, Magaud L, Bonsang‐Kitzis H, et al. Nomogram predicting the likelihood of parametrial involvement in early‐stage cervical cancer: avoiding unjustified radical hysterectomies. J Clin Med. 2020;9(7):2121.

DOI Google Scholar

Yang Z, Bai Y, Liu M, Hu X, Han P. Development and validation of prognostic nomograms to predict overall and cancer‐specific survival for patients with adenocarcinoma of the urinary bladder: a population‐based study. J Invest Surg. 2022;35:30–7.

DOI Google Scholar

Kyei MY, Adusei B, Klufio GO, Mensah JE, Gepi‐Attee S, Asante E. Treatment of localized prostate cancer and use of nomograms among urologists in the West Africa sub‐region. Pan Afr Med J. 2020;36:251.

DOI Google Scholar

Dong D, Tang L, Li ZY, Fang MJ, Gao JB, Shan XH, et al. Development and validation of an individualized nomogram to identify occult peritoneal metastasis in patients with advanced gastric cancer. Ann Oncol. 2019;30(3):431–8.

DOI Google Scholar

Wang L, Dong T, Xin B, Xu C, Guo M, Zhang H, et al. Integrative nomogram of CT imaging, clinical, and hematological features for survival prediction of patients with locally advanced non‐small cell lung cancer. Eur Radiol. 2019;29(6):2958–67.

DOI Google Scholar

Zheng W, Huang Y, Chen H, Wang N, Xiao W, Liang Y, et al. Nomogram application to predict overall and cancer‐specific survival in osteosarcoma. Cancer Manag Res. 2018;10:5439–50.

DOI Google Scholar

Zindler JD, Jochems A, Lagerwaard FJ, Beumer R, Troost E, Eekers D, et al. Individualized early death and long‐term survival prediction after stereotactic radiosurgery for brain metastases of non‐small cell lung cancer: two externally validated nomograms. Radiother Oncol. 2017;123(2):189–94.

DOI Google Scholar

Bartholomai JA, Frieboes HB.Lung cancer survival prediction via machine learning regression, classification, and statistical techniques. Proc IEEE Int Symp Signal Proc Inf Tech.2018;2018:632–7.

DOI Google Scholar

Gupta S, Tran T, Luo W, Phung D, Kennedy RL, Broad A, et al. Machine‐learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry. BMJ Open. 2014;4(3):e004007.

DOI Google Scholar

Parikh RB, Manz C, Chivers C, Regli SH, Braun J, Draugelis ME, et al. Machine learning approaches to predict 6‐month mortality among patients with cancer. JAMA Netw. 2019;2(10):e1915997.

DOI Google Scholar

Ding D, Lang T, Zou D, Tan J, Chen J, Zhou L, et al. Machine learning‐based prediction of survival prognosis in cervical cancer. BMC Bioinform. 2021;22(1):331.

DOI Google Scholar

Lee

, Light

, Alaa

, Thurtle

, van der Schaar

, Gnanapragasam

. Application of a novel machine learning framework for predicting non‐metastatic prostate cancer‐specific mortality in men using the surveillance, epidemiology, and end results (SEER) database. Lancet Digit Health. 2021;3(3):E158–65. 10.1016/s2589-7500(20)30314-9

DOI Google Scholar

Fradkin D, Muchnik L, Schneider D. Machine learning methods in the analysis of lung cancer survival data. DIMACS Technical Report. 2006. 2005‐35.

Chen D, Xing K, Henson D, Sheng L, Schwartz AM, Cheng X. Developing prognostic systems of cancer patients by ensemble clustering. J Biomed Biotechnol. 2009;2009:632786.

DOI Google Scholar

Chen YC, Ke WC, Chiu HW. Risk classification of cancer survival using ANN with gene expression data from multiple laboratories. Comput Biol Med. 2014;48:1–7.

DOI Google Scholar

Dimitoglou

, Adams

, Jim

. Comparison of the C4.5 and a naive Bayes classifier for the prediction of lung cancer survivability. J Comput. 2012;4:1–9. https://arxiv.org/abs/1206.1121v2

DOI Google Scholar

Hosseninia S, Ameli A, Aslani MR, Pourfarzi F, Ghobadi H. Serum levels of Sirtuin‐1 in patients with lung cancer and its association with Karnofsky Performance Status. Acta Bio Medica: Atenei Parmensis. 2021;92(2):2021012.

DOI Google Scholar

Mirsadraee

, Oswal

, Alizadeh

, Caulo

, van Beek

E Jr

. The 7th lung cancer TNM classification and staging system: review of the changes and implications. World J Radiol. 2012;4:128–34. 10.4329/wjr.v4.i4.128

DOI Google Scholar

Chen S‐W, Zhang Q, Guo ZM, Chen WK, Liu WW, Chen YF, et al. Trends in clinical features and survival of oral cavity cancer: fifty years of experience with 3,362 consecutive cases from a single institution. Cancer Manag Res. 2018;10:4523–35.

DOI Google Scholar

Mankowski

, Kinchen

, Wasilewski

, Flyak

, Ray

, Crowe JE

, et al. Synergistic anti‐HCV broadly neutralizing human monoclonal antibodies with independent mechanisms. Proc Natl Acad Sci USA. 2018;115(1):E82–91. 10.1073/pnas.1718441115

DOI Google Scholar

Shahriyari L, Abdel‐Rahman M, Cebulla C. BAP1 expression is prognostic in breast and uveal melanoma but not colon cancer and is highly positively correlated with RBM15B and USP19. PLoS One. 2019;14(2):e0211507.

DOI Google Scholar

Kursa MB, Rudnicki WR. Feature selection with the Boruta package. J Stat Softw. 2010;36(11):1–13.

DOI Google Scholar

Tang

, Zhang

. decision tree combined with Boruta feature selection for medical data classification. Fifth IEEE International Conference on Big Data Analytics (ICBDA).

Xiamen, China

IEEE; 2020.

https://doi.org/10.1109/ICBDA49040.2020.9101199

10.1109/ICBDA49040.2020.9101199

DOI

Alabi RO, Mäkitie AA, Pirinen M, Elmusrati M, Leivo I, Almangush A. Comparison of nomogram with machine learning techniques for prediction of overall survival in patients with tongue cancer. Int J Med Inform. 2021;145:104313.

DOI Google Scholar

Yang J, Tian G, Pan Z, Zhao F, Feng X, Liu Q, et al. Nomograms for predicting the survival rate for cervical cancer patients who undergo radiation therapy: a SEER analysis. Future Oncol. 2019;15(26):3033–45.

DOI Google Scholar

Iwendi C, Bashir AK, Peshkar A, Sujatha R, Chatterjee JM, Pasupuleti S, et al. COVID‐19 patient health prediction using boosted random forest algorithm. Front Public Health. 2020;8:357.

DOI Google Scholar

Shipe ME, Deppen SA, Farjah F, Grogan EL. Developing prediction models for clinical use using logistic regression: an overview. J Thorac Dis. 2019;11(Suppl 4):S574.

DOI Google Scholar

About this article

Publication history

Acknowledgements

Rights and permissions

Publication history

Received: 29 March 2022

Revised: 28 May 2022

Accepted: 29 June 2022

Published: 30 August 2022

Issue date: August 2022

Copyright

Acknowledgements

The authors greatly appreciate all patients who contributed to this study.

Rights and permissions

This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.