1
|
Attique H, Shah S, Jabeen S, Khan FG, Khan A, ELAffendi M. Multiclass Cancer Prediction Based on Copy Number Variation Using Deep Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:4742986. [PMID: 35720914 PMCID: PMC9203194 DOI: 10.1155/2022/4742986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 05/21/2022] [Indexed: 12/02/2022]
Abstract
DNA copy number variation (CNV) is the type of DNA variation which is associated with various human diseases. CNV ranges in size from 1 kilobase to several megabases on a chromosome. Most of the computational research for cancer classification is traditional machine learning based, which relies on handcrafted extraction and selection of features. To the best of our knowledge, the deep learning-based research also uses the step of feature extraction and selection. To understand the difference between multiple human cancers, we developed three end-to-end deep learning models, i.e., DNN (fully connected), CNN (convolution neural network), and RNN (recurrent neural network), to classify six cancer types using the CNV data of 24,174 genes. The strength of an end-to-end deep learning model lies in representation learning (automatic feature extraction). The purpose of proposing more than one model is to find which architecture among them performs better for CNV data. Our best model achieved 92% accuracy with an ROC of 0.99, and we compared the performances of our proposed models with state-of-the-art techniques. Our models have outperformed the state-of-the-art techniques in terms of accuracy, precision, and ROC. In the future, we aim to work on other types of cancers as well.
Collapse
Affiliation(s)
- Haleema Attique
- Department of Computer Science, COMSATS University Islamabad, Abbottabad Campus, Islamabad, Pakistan
| | - Sajid Shah
- Department of Computer Science, COMSATS University Islamabad, Abbottabad Campus, Islamabad, Pakistan
- EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia
| | - Saima Jabeen
- Department of IT and Computer Science, Pak-Austria Facchochschule: Institute of Applied Sciences and Technology, Mang, Haripur, KPK, Pakistan
| | - Fiaz Gul Khan
- Department of Computer Science, COMSATS University Islamabad, Abbottabad Campus, Islamabad, Pakistan
| | - Ahmad Khan
- Department of Computer Science, COMSATS University Islamabad, Abbottabad Campus, Islamabad, Pakistan
| | - Mohammed ELAffendi
- EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia
| |
Collapse
|
2
|
Kutzner H, Jutzi TB, Krahl D, Krieghoff‐Henning EI, Heppt MV, Hekler A, Schmitt M, Maron RCR, Fröhling S, Kalle C, Brinker TJ. Überdiagnose von Melanomen – Ursachen, Konsequenzen und Lösungsansätze. J Dtsch Dermatol Ges 2020; 18:1236-1244. [DOI: 10.1111/ddg.14233_g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 04/13/2020] [Indexed: 11/28/2022]
Affiliation(s)
| | - Tanja B. Jutzi
- Nachwuchsgruppe Digitale Biomarker für die Onkologie (DBO), Nationales Centrum für Tumorerkrankungen (NCT) Deutsches Krebsforschungszentrum (DKFZ) Heidelberg
| | - Dieter Krahl
- Privates Labor für Dermatohistopathologie Mönchhofstraße 52 Heidelberg
| | - Eva I. Krieghoff‐Henning
- Nachwuchsgruppe Digitale Biomarker für die Onkologie (DBO), Nationales Centrum für Tumorerkrankungen (NCT) Deutsches Krebsforschungszentrum (DKFZ) Heidelberg
| | | | - Achim Hekler
- Nachwuchsgruppe Digitale Biomarker für die Onkologie (DBO), Nationales Centrum für Tumorerkrankungen (NCT) Deutsches Krebsforschungszentrum (DKFZ) Heidelberg
| | - Max Schmitt
- Nachwuchsgruppe Digitale Biomarker für die Onkologie (DBO), Nationales Centrum für Tumorerkrankungen (NCT) Deutsches Krebsforschungszentrum (DKFZ) Heidelberg
| | - Roman C. R. Maron
- Nachwuchsgruppe Digitale Biomarker für die Onkologie (DBO), Nationales Centrum für Tumorerkrankungen (NCT) Deutsches Krebsforschungszentrum (DKFZ) Heidelberg
| | - Stefan Fröhling
- Nachwuchsgruppe Digitale Biomarker für die Onkologie (DBO), Nationales Centrum für Tumorerkrankungen (NCT) Deutsches Krebsforschungszentrum (DKFZ) Heidelberg
| | - Christof Kalle
- Berlin Institute of Health (BIH) und Charité – Universitätsmedizin Berlin
| | - Titus J. Brinker
- Nachwuchsgruppe Digitale Biomarker für die Onkologie (DBO), Nationales Centrum für Tumorerkrankungen (NCT) Deutsches Krebsforschungszentrum (DKFZ) Heidelberg
| |
Collapse
|
3
|
Kutzner H, Jutzi TB, Krahl D, Krieghoff-Henning EI, Heppt MV, Hekler A, Schmitt M, Maron RCR, Fröhling S, von Kalle C, Brinker TJ. Overdiagnosis of melanoma - causes, consequences and solutions. J Dtsch Dermatol Ges 2020; 18:1236-1243. [PMID: 32841508 DOI: 10.1111/ddg.14233] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 04/13/2020] [Indexed: 12/14/2022]
Abstract
Malignant melanoma is the skin tumor that causes most deaths in Germany. At an early stage, melanoma is well treatable, so early detection is essential. However, the skin cancer screening program in Germany has been criticized because although melanomas have been diagnosed more frequently since introduction of the program, the mortality from malignant melanoma has not decreased. This indicates that the observed increase in melanoma diagnoses be due to overdiagnosis, i.e. to the detection of lesions that would never have created serious health problems for the patients. One of the reasons is the challenging distinction between some benign and malignant lesions. In addition, there may be lesions that are biologically equivocal, and other lesions that are classified as malignant according to current criteria, but that grow so slowly that they would never have posed a threat to patient's life. So far, these "indolent" melanomas cannot be identified reliably due to a lack of biomarkers. Moreover, the likelihood that an in-situ melanoma will progress to an invasive tumor still cannot be determined with any certainty. When benign lesions are diagnosed as melanoma, the consequences are unnecessary psychological and physical stress for the affected patients and incurred therapy costs. Vice versa, underdiagnoses in the sense of overlooked melanomas can adversely affect patients' prognoses and may necessitate more intense therapies. Novel diagnostic options could reduce the number of over- and underdiagnoses and contribute to more objective diagnoses in borderline cases. One strategy that has yielded promising results in pilot studies is the use of artificial intelligence-based diagnostic tools. However, these applications still await translation into clinical and pathological routine.
Collapse
Affiliation(s)
| | - Tanja B Jutzi
- Digital Biomarkers for Oncology group (DBO), National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Dieter Krahl
- Private Laboratory for Dermatohistopathology, Mönchhofstraße 52, Heidelberg, Germany
| | - Eva I Krieghoff-Henning
- Digital Biomarkers for Oncology group (DBO), National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Markus V Heppt
- Department of Dermatology, University Hospital Erlangen, Erlangen, Germany
| | - Achim Hekler
- Digital Biomarkers for Oncology group (DBO), National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Max Schmitt
- Digital Biomarkers for Oncology group (DBO), National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Roman C R Maron
- Digital Biomarkers for Oncology group (DBO), National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Stefan Fröhling
- Digital Biomarkers for Oncology group (DBO), National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Christof von Kalle
- Berlin Institute of Health (BIH) and Charité-University Medical Center Berlin, Berlin, Germany
| | - Titus J Brinker
- Digital Biomarkers for Oncology group (DBO), National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
4
|
A Shallow Convolutional Learning Network for Classification of Cancers Based on Copy Number Variations. SENSORS 2019; 19:s19194207. [PMID: 31569801 PMCID: PMC6806227 DOI: 10.3390/s19194207] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 09/18/2019] [Accepted: 09/25/2019] [Indexed: 12/29/2022]
Abstract
Genomic copy number variations (CNVs) are among the most important structural variations. They are linked to several diseases and cancer types. Cancer is a leading cause of death worldwide. Several studies were conducted to investigate the causes of cancer and its association with genomic changes to enhance its management and improve the treatment opportunities. Classification of cancer types based on the CNVs falls in this category of research. We reviewed the recent, most successful methods that used machine learning algorithms to solve this problem and obtained a dataset that was tested by some of these methods for evaluation and comparison purposes. We propose three deep learning techniques to classify cancer types based on CNVs: a six-layer convolutional net (CNN6), residual six-layer convolutional net (ResCNN6), and transfer learning of pretrained VGG16 net. The results of the experiments performed on the data of six cancer types demonstrated a high accuracy of 86% for ResCNN6 followed by 85% for CNN6 and 77% for VGG16. The results revealed a lower prediction accuracy for one of the classes (uterine corpus endometrial carcinoma (UCEC)). Repeating the experiments after excluding this class reveals improvements in the accuracies: 91% for CNN6 and 92% for Res CNN6. We observed that UCEC and ovarian serous carcinoma (OV) share a considerable subset of their features, which causes a struggle for learning in the classifiers. We repeated the experiment again by balancing the six classes through oversampling of the training dataset and the result was an enhancement in both overall and UCEC classification accuracies.
Collapse
|