Journal Home > Volume 1 , Issue 1

Cancer informatics has significantly progressed in the big data era. We summarize the application of informatics approaches to the cancer domain from both the informatics perspective (e.g., data management and data science) and the clinical perspective (e.g., cancer screening, risk assessment, diagnosis, treatment, and prognosis). We discuss various informatics methods and tools that are widely applied in cancer research and practices, such as cancer databases, data standards, terminologies, high‐throughput omics data mining, machine‐learning algorithms, artificial intelligence imaging, and intelligent radiation. We also address the informatics challenges within the cancer field that pursue better treatment decisions and patient outcomes, and focus on how informatics can provide opportunities for cancer research and practices. Finally, we conclude that the interdisciplinary nature of cancer informatics and collaborations are major drivers for future research and applications in clinical practices. It is hoped that this review is instrumental for cancer researchers and clinicians with its informatics‐specific insights.


menu
Abstract
Full text
Outline
About this article

Application of informatics in cancer research and clinical practice: Opportunities and challenges

Show Author's information Na Hong1Gang Sun2Xiuran Zuo3Meng Chen4Li Liu5Jiani Wang4Xiaobin Feng6Wenzhao Shi1Mengchun Gong1,7 ( )Pengcheng Ma5 ( )
Department of Medical Sciences, Digital Health China Technologies Co., Ltd., Beijing, China
Xinjiang Cancer Center, Key Laboratory of Oncology of Xinjiang Uyghur Autonomous Region, The Affiliated Cancer Hospital of Xinjiang Medical University, Ürümqi, China
Department of Information, Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
National Cancer Center, National Clinical Research Center for Cancer, Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Big Data Center, Nanfang Hospital, Southern Medical University, Guangzhou, China
Hepato‐Pancreato‐Biliary Center, Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, Beijing, China
Institute of Health Management, Southern Medical University, Guangzhou, China

Abstract

Cancer informatics has significantly progressed in the big data era. We summarize the application of informatics approaches to the cancer domain from both the informatics perspective (e.g., data management and data science) and the clinical perspective (e.g., cancer screening, risk assessment, diagnosis, treatment, and prognosis). We discuss various informatics methods and tools that are widely applied in cancer research and practices, such as cancer databases, data standards, terminologies, high‐throughput omics data mining, machine‐learning algorithms, artificial intelligence imaging, and intelligent radiation. We also address the informatics challenges within the cancer field that pursue better treatment decisions and patient outcomes, and focus on how informatics can provide opportunities for cancer research and practices. Finally, we conclude that the interdisciplinary nature of cancer informatics and collaborations are major drivers for future research and applications in clinical practices. It is hoped that this review is instrumental for cancer researchers and clinicians with its informatics‐specific insights.

Keywords: machine learning, artificial intelligence application, cancer informatics

References(92)

DOI
Warner JL, Patt D. Informatics SEftIYSoC: cancer informatics in 2019: deep learning takes center stage. Yearb Med Inform. 2020;29(1):243–6.
Boffa DJ, Rosen JE, Mallin K, Loomis A, Gay G, Palis B, et al. Using the National Cancer Database for outcomes research: a review. JAMA Oncol. 2017;3(12):1722–8.
McCabe RM. National cancer database: the past, present, and future of the cancer registry and its efforts to improve the quality of cancer care. Semin Radiat Oncol. 2019;29(4):323–5.
DOI
Jouhet V, Defossez G, Ingrand P. Automated selection of relevant information for notification of incident cancer cases within a multisource cancer registry. Methods Inf Med. 2013;52(05):411–21.
Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2022. CA Cancer J Clin. 2022;72(1):7–33.
Islami F, Ward EM, Sung H, Cronin KA, Tangka FKL, Sherman RL, et al. Annual report to the nation on the status of cancer, Part 1: national cancer statistics. J Natl Cancer Inst. 2021;113(12):1648–69.
Chen W, Zheng R, Baade PD, Zhang S, Zeng H, Bray F, et al. Cancer statistics in China, 2015. CA Cancer J Clin. 2016;66(2):115–32.
Wei W, Zeng H, Zheng R, Zhang S, An L, Chen R, et al. Cancer registration in China and its role in cancer prevention and control. Lancet Oncol. 2020;21(7):e342–9.
Blanchard P, Garden AS. Looking beyond the numbers: highlighting the challenges of population‐based studies in cancer research. J Clin Oncol. 2016;34(19):2317–8.
Telenti A, Pierce LC, Biggs WH, Di Iulio J, Wong EH, Fabani MM, et al. Deep sequencing of 10,000 human genomes. Proc Natl Acad Sci. 2016;113(42):11901–6.
Wang Z, Jensen MA, Zenklusen JC. A practical guide to the cancer genome atlas (TCGA). In: Statistical Genomics, Humana Press; 2016. p. 111–41.
DOI
DOI
Consortium ICG. International network of cancer genome projects. Nature. 2010;464(7291):993–8.
Percy C, Holten VV, Muir CS. World Health Organization. International classification of diseases for oncology. World Health Organization; 1990. https://apps.who.int/iris/handle/10665/39441
World Health Organization. International classification of diseases for oncology (ICD‐O). 3rd ed. 1st revision. 2013:1–242.
Fritz A, Percy C, Jack A, Shanmugaratnam K, Sobin L, Parkin D, et al. International classification of diseases for oncology. Geneva: World Health Organization; 2000.
DOI
International Health Terminology Standards Development Organisation: SNOMED‐CT. http://www.ihtsdo.org/snomed-ct/
Nikiema JN, Jouhet V, Mougin F. Integrating cancer diagnosis terminologies based on logical definitions of SNOMED CT concepts. J Biomed Inform. 2017;74:46–58.
Van Berkum MM. SNOMED CT encoded Cancer Protocols. AMIA Annu Symp Proc. 2003. 1039.
Torous VF, Allan RW, Balani J, Baskovich B, Birdsong GG, Dellers E, et al. Exploring the college of American pathologists electronic cancer checklists: what they are and what they can do for you. Arch Pathol Lab Med. 2021;145(4):392–8.
Sioutos N, de Coronado S, Haber MW, Hartel FW, Shaiu WL, Wright LW. NCI Thesaurus: a semantic model integrating cancer‐related clinical and molecular information. J Biomed Inform. 2007;40(1):30–43.
National Cancer Institute. CTEP NCI guidance on CTC terminology applications. Bethesda, MD: National Cancer Institute Bethesda; 2006.
Introductory Guide for Standardised MedDRA Queries (SMQs) Version 21.0.
DOI
Warner JL, Cowan AJ, Hall AC, Yang PC. HemOnc. org: a collaborative online knowledge platform for oncology professionals. J Oncol Pract. 2015;11(3):e336–50.
DOI
Lin FP, Groza T, Kocbek S, Antezana E, Epstein RJ. Cancer care treatment outcome ontology: a novel computable ontology for profiling treatment outcomes in patients with solid tumors. JCO Clin Cancer Inform. 2018;2:1–14.
Boeker M, Franca F, Bronsert P, Schulz S. TNM‐O: ontology support for staging of malignant tumours. J Biomed Semantics. 2016;7(1):64.
Traverso A, Van Soest J, Wee L, Dekker A. The radiation oncology ontology (ROO): publishing linked data in radiation oncology using semantic web and ontology techniques. Med Phys. 2018;45(10):e854–62.
Bibault J‐E, Zapletal E, Rance B, Giraud P, Burgun A. Labeling for Big Data in radiation oncology: the Radiation Oncology Structures ontology. PLoS One. 2018;13(1):e0191263.
Serra LM, Duncan WD, Diehl AD. An ontology for representing hematologic malignancies: the cancer cell ontology. BMC Bioinform. 2019;20(5):231–6.
Min H, Manion FJ, Goralczyk E, Wong Y‐N, Ross E, Beck JR. Integration of prostate cancer clinical data using an ontology. J Biomed Inf. 2009;42(6):1035–45.
Hong N, Chang F, Ou Z, Wang Y, Yang Y, Guo Q, et al. Construction of the cervical cancer common terminology for promoting semantic interoperability and utilization of Chinese clinical data. BMC Med Inform Decis Mak. 2021;21(9):1–11.
Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology. 2016;278:563–77.
Reuzé S, Schernberg A, Orlhac F, Sun R, Chargari C, Dercle L, et al. Radiomics in nuclear medicine applied to radiation therapy: methods, pitfalls, and challenges. Int J Radiat Oncol Biol Phys. 2018;102:1117–42.
Kumar Y, Gupta S, Singla R, Hu YC. Systematic a review of artificial intelligence techniques in cancer prediction and diagnosis. Arch Comput Methods Eng. 2021:1–28.
Hwang EJ, Park S, Jin K‐N, Kim JI, Choi SY, Lee JH, et al. Development and validation of a deep learning‐based automated detection algorithm for major thoracic diseases on chest radiographs. JAMA Network Open. 2019;2(3):e191095.
Rodríguez‐Ruiz A, Krupinski E, Mordang J‐J, Schilling K, Heywang‐Köbrunner SH, Sechopoulos I, et al. Detection of breast cancer with mammography: effect of an artificial intelligence support system. Radiology. 2019;290(2):305–14.
Lewis SJ, Gandomkar Z, Brennan PC. Artificial Intelligence in medical imaging practice: looking to the future. J Med Radiat Sci. 2019;66(4):292–5.
Gore JC. Artificial intelligence in medical imaging. Magn Reson Imaging. 2020;68:A1–4.
Cusumano D, Lenkowicz J, Votta C, Boldrini L, Placidi L, Catucci F, et al. A deep learning approach to generate synthetic CT in low field MR‐guided adaptive radiotherapy for abdominal and pelvic cases. Radiother Oncol. 2020;153:205–12.
Balakrishnan G, Zhao A, Sabuncu MR, Guttag J, Dalca AV. VoxelMorph: a learning framework for deformable medical image registration. IEEE Trans Med Imaging. 2019;38(8):1788‐800. https://doi.org/10.1109/TMI.2019.2897538
Han X, Hoogeman MS, Levendag PC, Hibbard LS, Teguh DN, Voet P, et al. Atlas‐based auto‐segmentation of head and neck CT images. Medical image computing and computer‐assisted intervention: MICCAI International Conference on Medical Image Computing and Computer‐Assisted Intervention. 2008;11:434–41.
Bondiau PY, Malandain G, Chanalet S, Marcy PY, Habrand JL, Fauchon F, et al. Atlas‐based automatic segmentation of MR images: validation study on the brainstem in radiotherapy context. Int J Radiat Oncol Biol Phys. 2005;61:289–98.
Florkow MC, Guerreiro F, Zijlstra F, Seravalli E, Janssens GO, Maduro JH, et al. Deep learning‐enabled MRI‐only photon and proton therapy treatment planning for paediatric abdominal tumours. Radiother Oncol. 2020;153:220–7.
Maspero M, Bentvelzen LG, Savenije MHF, Guerreiro F, Seravalli E, Janssens GO, et al. Philippens MEP Deep learning‐based synthetic CT generation for paediatric brain MR‐only photon and proton radiotherapy. Radiother Oncol. 2020;153:197–204.
Jia M, Wu Y, Yang Y, Wang L, Chuang C, Han B, et al. Deep learning‐enabled EPID‐based 3D dosimetry for dose verification of step‐and‐shoot radiotherapy. Med Phys. 2021;48:6810–9.
Barateau A, De Crevoisier R, Largent A, Mylona E, Perichon N, Castelli J, et al. Comparison of CBCT‐based dose calculation methods in head and neck cancer radiotherapy: from Hounsfield unit to density calibration curve to deep learning. Med Phys. 2020;47:4683–93.
Zhu C, Lin SH, Jiang X, Xiang Y, Belal Z, Jun G, et al. A novel deep learning model using dosimetric and clinical information for grade 4 radiotherapy‐induced lymphopenia prediction. Phys Med Biol. 2020;65:035014.
Hasin Y, Seldin M, Lusis A. Multi‐omics approaches to disease. Genome Biol. 2017;18(1):83.
Samuels Y, Wang Z, Bardelli A, Silliman N, Ptak J, Szabo S, et al. High frequency of mutations of the PIK3CA gene in human cancers. Science. 2004;304(5670):554.
Paez JG, Jänne PA, Lee JC, Tracy S, Greulich H, Gabriel S, et al. EGFR mutations in lung cancer: correlation with clinical response to gefitinib therapy. Science. 2004;304(5676):1497–500.
Stephens P, Hunter C, Bignell G, Edkins S, Davies H, Teague J, et al. Intragenic ERBB2 kinase mutations in tumours. Nature. 2004;431(7008):525–6.
Mardis ER. Genome sequencing and cancer. Curr Opin Genet Dev. 2012;22(3):245–50.
Kamps R, Brandão R, Bosch B, Paulussen A, Xanthoulea S, Blok M, et al. Next‐generation sequencing in oncology: genetic diagnosis, risk prediction and cancer classification. Int J Mol Sci. 2017;18(2):308.
Baylin SB. Aberrant patterns of DNA methylation, chromatin formation and gene expression in cancer. Hum Mol Gen. 2001;10(7):687–92.
Dawson Mark A, Kouzarides T. Cancer epigenetics: from mechanism to therapy. Cell. 2012;150(1):12–27.
Gut P, Verdin E. The nexus of chromatin regulation and intermediary metabolism. Nature. 2013;502(7472):489–98.
Liu L, Li Y, Tollefsbol TO. Gene‐environment interactions and epigenetic basis of human diseases. Curr Issues Mol Biol. 2008;10(1–2):25–36.
Taudt A, Colomé‐Tatché M, Johannes F. Genetic sources of population epigenomic variation. Nat Rev Genet. 2016;17(6):319–32.
Chakraborty S, Hosen MI, Ahmed M, Shekhar HU. Onco‐Multi‐OMICS approach: a new frontier in cancer research. BioMed Res Int. 2018;2018:1–14.
Prat A, Ellis MJ, Perou CM. Practical implications of gene‐expression‐based assays for breast oncologists. Nat Rev Clin Oncol. 2011;9(1):48–57.
Botling J, Edlund K, Lohr M, Hellwig B, Holmberg L, Lambe M, et al. Biomarker discovery in non‐small cell lung cancer: integrating gene expression profiling, meta‐analysis, and tissue microarray validation. Clin Cancer Res. 2013;19(1):194–204.
Song Q, Hawkins GA, Wudel L, Chou P‐C, Forbes E, Pullikuth AK, et al. Dissecting intratumoral myeloid cell plasticity by single cell RNA‐seq. Cancer Med. 2019;8(6):3072–85.
Suvà ML, Tirosh I. Single‐cell RNA sequencing in cancer: lessons learned and emerging challenges. Mol Cell. 2019;75(1):7–12.
Swiatly A, Horala A, Matysiak J, Hajduk J, Nowak‐Markwitz E, Kokot ZJ. Understanding ovarian cancer: iTRAQ‐based proteomics for biomarker discovery. Int J Mol Sci. 2018;19(8):E2240.
Yanovich G, Agmon H, Harel M, Sonnenblick A, Peretz T, Geiger T. Clinical proteomics of breast cancer reveals a novel layer of breast cancer classification. Cancer Res. 2018;78(20):6001–10.
Ali M, Khan SA, Wennerberg K, Aittokallio T. Global proteomics profiling improves drug sensitivity prediction: results from a multi‐omics, pan‐cancer modeling approach. Bioinformatics (Oxford, England). 2018;34(8):1353–62.
Zhang Y, He C, Qiu L, Wang Y, Qin X, Liu Y, et al. Serum unsaturated free fatty acids: a potential biomarker panel for early‐stage detection of colorectal cancer. J Cancer. 2016;7(4):477–83.
Giskeødegård GF, Bertilsson H, Selnæs KM, Wright AJ, Bathen TF, Viset T, et al. Spermine and citrate as metabolic biomarkers for assessing prostate cancer aggressiveness. PLoS One. 2013;8(4):e62375.
Rajagopala SV, Vashee S, Oldfield LM, Suzuki Y, Venter JC, Telenti A, et al. The Human Microbiome and Cancer. Cancer Prevention. Research (Philadelphia, Pa). 2017;10(4):226–34.
DOI
Helmink BA, Khan MAW, Hermann A, Gopalakrishnan V, Wargo JA. The microbiome, cancer, and cancer therapy. Nature Med. 2019;25(3):377–88.
Nicora G, Vitali F, Dagliati A, Geifman N, Bellazzi R. Integrated multi‐omics analyses in oncology: a review of machine learning methods and tools. Front Oncol. 2020;10:1030.
Chaudhary K, Poirion OB, Lu L, Garmire LX. Deep learning–based multi‐omics integration robustly predicts survival in liver cancer. Clin Cancer Res. 2018;24(6):1248–59.
Zhang Q, Lou Y, Yang J, Wang J, Feng J, Zhao Y, et al. Integrated multiomic analysis reveals comprehensive tumour heterogeneity and novel immunophenotypic classification in hepatocellular carcinomas. Gut. 2019;68(11):2019–31.
Seal DB, Das V, Goswami S, De RK. Estimating gene expression from DNA methylation and copy number variation: a deep learning regression model for multi‐omics integration. Genomics. 2020;112(4):2833–41.
Ouyang X, Fan Q, Ling G, Shi Y, Hu F. Identification of diagnostic biomarkers and subtypes of liver hepatocellular carcinoma by multi‐omics data analysis. Genes. 2020;11(9):E1051.
Huang G, Wang C, Fu X. Bidirectional deep neural networks to integrate RNA and DNA data for predicting outcome for patients with hepatocellular carcinoma. Future Oncol. 2021;17(33):4481–95.
Löffler MW, Mohr C, Bichmann L, Freudenmann LK, Walzer M, Schroeder CM, et al. Multi‐omics discovery of exome‐derived neoantigens in hepatocellular carcinoma. Genome Med. 2019;11(1):28.
Shen M, Xu M, Zhong F, Crist MC, Prior AB, Yang K, et al. Study revealing the metabolic effects of estrogen in liver cancer cells HepG2. Cells. 2021;10(2):455.
Locker GY, Hamilton S, Harris J, Jessup JM, Kemeny N, Macdonald JS, et al. ASCO: ASCO 2006 update of recommendations for the use of tumor markers in gastrointestinal cancer. J Clin Oncol. 2006;24(33):5313–27.
Henry NL, Hayes DF. Cancer biomarkers. Mol Oncol. 2012;6(2):140–6.
Nicolini A, Ferrari P, Duffy MJ. Prognostic and predictive biomarkers in breast cancer: past, present and future. Sem Cancer Biol. 2018;52(Pt 1):56–73.
Chin S‐F, Santonja A, Grzelak M, Ahn S, Sammut S‐J, Clifford H, et al. Shallow whole genome sequencing for robust copy number profiling of formalin‐fixed paraffin‐embedded breast cancers. Exp Mol Pathol. 2018;104(3):161–9.
Raman L, Van der Linden M, Van der Eecken K, Vermaelen K, Demedts I, Surmont V, et al. Shallow whole‐genome sequencing of plasma cell‐free DNA accurately differentiates small from non‐small cell lung carcinoma. Genome Med. 2020;12(1):35.
Van Roy N, Van Der Linden M, Menten B, Dheedene A, Vandeputte C, Van Dorpe J, et al. Shallow whole genome sequencing on circulating cell‐free DNA allows reliable noninvasive copy‐number profiling in neuroblastoma patients. Clin Cancer Res. 2017;23(20):6305–14.
Wang Q, Peng W‐X, Wang L, Ye L. Toward multiomics‐based next‐generation diagnostics for precision medicine. Personalized Med. 2019;16(2):157–70.
Menyhárt O, Győrffy B. Multi‐omics approaches in cancer research with applications in tumor subtyping, prognosis, and diagnosis. Comput Struct Biotechnol J. 2021;19:949–60.
Asagi A, Ohta K, Nasu J, Tanada M, Nadano S, Nishimura R, et al. Utility of contrast‐enhanced FDG‐PET/CT in the clinical management of pancreatic cancer: impact on diagnosis, staging, evaluation of treatment response, and detection of recurrence. Pancreas. 2013;42(1):11–9.
Echle A, Rindtorff NT, Brinker TJ, Luedde T, Pearson AT, Kather JN. Deep learning in cancer pathology: a new generation of clinical biomarkers. Br J Cancer. 2021;124(4):686–96.
Kelly CJ, Karthikesalingam A, Suleyman M, Corrado G, King D. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17(1):195.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 13 April 2022
Accepted: 24 April 2022
Published: 15 June 2022
Issue date: June 2022

Copyright

© 2022 The Authors.

Acknowledgements

We thank Chao Liu, Ge Wu, Xiaoyu Wu, Yuanshi Jiao, and Yunchuan Qiao for their literature collection and review support. 

Rights and permissions

This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

Return