Journal Home > Volume 1 , Issue 2
Background

Most patients with advanced non‐small cell lung cancer (NSCLC) have a poor prognosis. Predicting overall survival using clinical data would benefit cancer patients by allowing providers to design an optimum treatment plan. We compared the performance of nomograms with machine‐learning models at predicting the overall survival of NSCLC patients. This comparison benefits the development and selection of models during the clinical decision‐making process for NSCLC patients.

Methods

Multiple machine‐learning models were used in a retrospective cohort of 6586 patients. First, we modeled and validated a nomogram to predict the overall survival of NSCLC patients. Subsequently, five machine‐learning models (logistic regression, random forest, XGBoost, decision tree, and light gradient boosting machine) were used to predict survival status. Next, we evaluated the performance of the models. Finally, the machine‐learning model with the highest accuracy was chosen for comparison with the nomogram at predicting survival status by observing a novel performance measure: time‐dependent prediction accuracy.

Results

Among the five machine‐learning models, the accuracy of random forest model outperformed the others. Compared with the nomogram for time‐dependent prediction accuracy with a follow‐up time ranging from 12 to 60 months, the prediction accuracies of both the nomogram and machine‐learning models changed as time varied. The nomogram reached a maximum prediction accuracy of 0.85 in the 60th month, and the random forest algorithm reached a maximum prediction accuracy of 0.74 in the 13th month.

Conclusions

Overall, the nomogram provided more reliable prognostic assessments of NSCLC patients than machine‐learning models over our observation period. Although machine‐learning methods have been widely adopted for predicting clinical prognoses in recent studies, the conventional nomogram was competitive. In real clinical applications, a comprehensive model that combines these two methods may demonstrate superior capabilities.


menu
Abstract
Full text
Outline
About this article

Comparison of nomogram and machine‐learning methods for predicting the survival of non‐small cell lung cancer patients

Show Author's information Haike Lei1Xiaosheng Li1Wuren Ma2Na Hong2Chun Liu2Wei Zhou1Hong Zhou1Mengchun Gong2Ying Wang1Guixue Wang3( )Yongzhong Wu1( )
Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, Chongqing University Cancer Hospital, Chongqing, China
Digital Health China Technologies, Co., Ltd., Beijing, China
MOE Key Lab for Biorheological Science and Technology, State and Local Joint Engineering Laboratory for Vascular Implants, College of Bioengineering Chongqing University, Chongqing, China

Abstract

Background

Most patients with advanced non‐small cell lung cancer (NSCLC) have a poor prognosis. Predicting overall survival using clinical data would benefit cancer patients by allowing providers to design an optimum treatment plan. We compared the performance of nomograms with machine‐learning models at predicting the overall survival of NSCLC patients. This comparison benefits the development and selection of models during the clinical decision‐making process for NSCLC patients.

Methods

Multiple machine‐learning models were used in a retrospective cohort of 6586 patients. First, we modeled and validated a nomogram to predict the overall survival of NSCLC patients. Subsequently, five machine‐learning models (logistic regression, random forest, XGBoost, decision tree, and light gradient boosting machine) were used to predict survival status. Next, we evaluated the performance of the models. Finally, the machine‐learning model with the highest accuracy was chosen for comparison with the nomogram at predicting survival status by observing a novel performance measure: time‐dependent prediction accuracy.

Results

Among the five machine‐learning models, the accuracy of random forest model outperformed the others. Compared with the nomogram for time‐dependent prediction accuracy with a follow‐up time ranging from 12 to 60 months, the prediction accuracies of both the nomogram and machine‐learning models changed as time varied. The nomogram reached a maximum prediction accuracy of 0.85 in the 60th month, and the random forest algorithm reached a maximum prediction accuracy of 0.74 in the 13th month.

Conclusions

Overall, the nomogram provided more reliable prognostic assessments of NSCLC patients than machine‐learning models over our observation period. Although machine‐learning methods have been widely adopted for predicting clinical prognoses in recent studies, the conventional nomogram was competitive. In real clinical applications, a comprehensive model that combines these two methods may demonstrate superior capabilities.

Keywords: machine learning, nomogram, non‐small cell lung cancer, overall survival, predictive model

References(35)

Ambert KH, Cohen AM. A system for classifying disease comorbidity status from medical discharge summaries using automated hotspot and negated concept detection. J Am Med Inform Assoc. 2009;16(4):590–5.
Xie H, Zhang J‐F, Li Q. Development of a prognostic nomogram for patients with lung adenocarcinoma in the stages I, II, and III based on immune scores. Int J Gen Med. 2021;14:8677–88.
Ettinger DS, Wood DE, Akerley W, Bazhenova LA, Borghaei H, Camidge DR, et al. NCCN guidelines insights: non‐small cell lung cancer, version 4.2016. J Natl Compr Canc Netw. 2016;14(3):255–64.
Capanu M, Gönen M. Building a nomogram for survey‐weighted Cox models using R. J Stat Softw. 2015;64:1–17.
Balachandran VP, Gonen M, Smith JJ, DeMatteo RP. Nomograms in oncology: more than meets the eye. Lancet Oncol. 2015;16(4):e173–80.
Zhang J, Fan J, Yin R, Geng L, Zhu M, Shen W, et al. A nomogram to predict overall survival of patients with early stage non‐small cell lung cancer. J Thorac Dis. 2019;11(12):5407–16.
Li G, Tian M, Bing Y, Wang H, Yuan C, Xiu D. Nomograms predict survival outcomes for distant metastatic pancreatic neuroendocrine tumor: a population based STROBE compliant study. Medicine. 2020;99(13):e19593.
Wang YY, Xiang B‐D, Ma L, Zhong J‐H, Ye J‐Z, Wang K, et al. Development and validation of a nomogram to preoperatively estimate post‐hepatectomy liver dysfunction risk and long‐term survival in patients with hepatocellular carcinoma. Ann Surg. 2021;274(6):e1209–17.
Benoit L, Balaya V, Guani B, Bresset A, Magaud L, Bonsang‐Kitzis H, et al. Nomogram predicting the likelihood of parametrial involvement in early‐stage cervical cancer: avoiding unjustified radical hysterectomies. J Clin Med. 2020;9(7):2121.
Yang Z, Bai Y, Liu M, Hu X, Han P. Development and validation of prognostic nomograms to predict overall and cancer‐specific survival for patients with adenocarcinoma of the urinary bladder: a population‐based study. J Invest Surg. 2022;35:30–7.
Kyei MY, Adusei B, Klufio GO, Mensah JE, Gepi‐Attee S, Asante E. Treatment of localized prostate cancer and use of nomograms among urologists in the West Africa sub‐region. Pan Afr Med J. 2020;36:251.
Dong D, Tang L, Li ZY, Fang MJ, Gao JB, Shan XH, et al. Development and validation of an individualized nomogram to identify occult peritoneal metastasis in patients with advanced gastric cancer. Ann Oncol. 2019;30(3):431–8.
Wang L, Dong T, Xin B, Xu C, Guo M, Zhang H, et al. Integrative nomogram of CT imaging, clinical, and hematological features for survival prediction of patients with locally advanced non‐small cell lung cancer. Eur Radiol. 2019;29(6):2958–67.
Zheng W, Huang Y, Chen H, Wang N, Xiao W, Liang Y, et al. Nomogram application to predict overall and cancer‐specific survival in osteosarcoma. Cancer Manag Res. 2018;10:5439–50.
Zindler JD, Jochems A, Lagerwaard FJ, Beumer R, Troost E, Eekers D, et al. Individualized early death and long‐term survival prediction after stereotactic radiosurgery for brain metastases of non‐small cell lung cancer: two externally validated nomograms. Radiother Oncol. 2017;123(2):189–94.
Bartholomai JA, Frieboes HB.Lung cancer survival prediction via machine learning regression, classification, and statistical techniques. Proc IEEE Int Symp Signal Proc Inf Tech.2018;2018:632–7.
Gupta S, Tran T, Luo W, Phung D, Kennedy RL, Broad A, et al. Machine‐learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry. BMJ Open. 2014;4(3):e004007.
Parikh RB, Manz C, Chivers C, Regli SH, Braun J, Draugelis ME, et al. Machine learning approaches to predict 6‐month mortality among patients with cancer. JAMA Netw. 2019;2(10):e1915997.
Ding D, Lang T, Zou D, Tan J, Chen J, Zhou L, et al. Machine learning‐based prediction of survival prognosis in cervical cancer. BMC Bioinform. 2021;22(1):331.
Lee C, Light A, Alaa A, Thurtle D, van der Schaar M, Gnanapragasam VJ. Application of a novel machine learning framework for predicting non‐metastatic prostate cancer‐specific mortality in men using the surveillance, epidemiology, and end results (SEER) database. Lancet Digit Health. 2021;3(3):E15865. 10.1016/s2589-7500(20)30314-9
Fradkin D, Muchnik L, Schneider D. Machine learning methods in the analysis of lung cancer survival data. DIMACS Technical Report. 2006. 2005‐35.
Chen D, Xing K, Henson D, Sheng L, Schwartz AM, Cheng X. Developing prognostic systems of cancer patients by ensemble clustering. J Biomed Biotechnol. 2009;2009:632786.
Chen YC, Ke WC, Chiu HW. Risk classification of cancer survival using ANN with gene expression data from multiple laboratories. Comput Biol Med. 2014;48:1–7.
Dimitoglou G, Adams JA, Jim CM. Comparison of the C4.5 and a naive Bayes classifier for the prediction of lung cancer survivability. J Comput. 2012;4:19. https://arxiv.org/abs/1206.1121v2
Hosseninia S, Ameli A, Aslani MR, Pourfarzi F, Ghobadi H. Serum levels of Sirtuin‐1 in patients with lung cancer and its association with Karnofsky Performance Status. Acta Bio Medica: Atenei Parmensis. 2021;92(2):2021012.
Mirsadraee S, Oswal D, Alizadeh Y, Caulo A, van Beek E Jr. The 7th lung cancer TNM classification and staging system: review of the changes and implications. World J Radiol. 2012;4:12834. 10.4329/wjr.v4.i4.128
Chen S‐W, Zhang Q, Guo ZM, Chen WK, Liu WW, Chen YF, et al. Trends in clinical features and survival of oral cavity cancer: fifty years of experience with 3,362 consecutive cases from a single institution. Cancer Manag Res. 2018;10:4523–35.
Mankowski MC, Kinchen VJ, Wasilewski LN, Flyak AI, Ray SC, Crowe JE Jr, et al. Synergistic anti‐HCV broadly neutralizing human monoclonal antibodies with independent mechanisms. Proc Natl Acad Sci USA. 2018;115(1):E8291. 10.1073/pnas.1718441115
Shahriyari L, Abdel‐Rahman M, Cebulla C. BAP1 expression is prognostic in breast and uveal melanoma but not colon cancer and is highly positively correlated with RBM15B and USP19. PLoS One. 2019;14(2):e0211507.
Kursa MB, Rudnicki WR. Feature selection with the Boruta package. J Stat Softw. 2010;36(11):1–13.
Tang R, Zhang X. decision tree combined with Boruta feature selection for medical data classification. Fifth IEEE International Conference on Big Data Analytics (ICBDA). Xiamen, China IEEE2020. https://doi.org/10.1109/ICBDA49040.2020.910119910.1109/ICBDA49040.2020.9101199
DOI
Alabi RO, Mäkitie AA, Pirinen M, Elmusrati M, Leivo I, Almangush A. Comparison of nomogram with machine learning techniques for prediction of overall survival in patients with tongue cancer. Int J Med Inform. 2021;145:104313.
Yang J, Tian G, Pan Z, Zhao F, Feng X, Liu Q, et al. Nomograms for predicting the survival rate for cervical cancer patients who undergo radiation therapy: a SEER analysis. Future Oncol. 2019;15(26):3033–45.
Iwendi C, Bashir AK, Peshkar A, Sujatha R, Chatterjee JM, Pasupuleti S, et al. COVID‐19 patient health prediction using boosted random forest algorithm. Front Public Health. 2020;8:357.
Shipe ME, Deppen SA, Farjah F, Grogan EL. Developing prediction models for clinical use using logistic regression: an overview. J Thorac Dis. 2019;11(Suppl 4):S574.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 29 March 2022
Revised: 28 May 2022
Accepted: 29 June 2022
Published: 30 August 2022
Issue date: August 2022

Copyright

© 2022 The Authors.

Acknowledgements

The authors greatly appreciate all patients who contributed to this study.

Rights and permissions

This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

Return