Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Maron RC, Haggenmüller S, von Kalle C, Utikal JS, Meier F, Gellrich FF, Hauschild A, French LE, Schlaak M, Ghoreschi K, Kutzner H, Heppt MV, Haferkamp S, Sondermann W, Schadendorf D, Schilling B, Hekler A, Krieghoff-Henning E, Kather JN, Fröhling S, Lipka DB, Brinker TJ. Robustness of convolutional neural networks in recognition of pigmented skin lesions. Eur J Cancer 2021;145:81-91. [PMID: 33423009 DOI: 10.1016/j.ejca.2020.11.020] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 11/06/2020] [Accepted: 11/15/2020] [Indexed: 12/26/2022]

For:	Maron RC, Haggenmüller S, von Kalle C, Utikal JS, Meier F, Gellrich FF, Hauschild A, French LE, Schlaak M, Ghoreschi K, Kutzner H, Heppt MV, Haferkamp S, Sondermann W, Schadendorf D, Schilling B, Hekler A, Krieghoff-Henning E, Kather JN, Fröhling S, Lipka DB, Brinker TJ. Robustness of convolutional neural networks in recognition of pigmented skin lesions. Eur J Cancer 2021;145:81-91. [PMID: 33423009 DOI: 10.1016/j.ejca.2020.11.020] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 11/06/2020] [Accepted: 11/15/2020] [Indexed: 12/26/2022]

Number

Cited by Other Article(s)

Alipour N, Burke T, Courtney J. Skin Type Diversity in Skin Lesion Datasets: A Review. CURRENT DERMATOLOGY REPORTS 2024;13:198-210. [PMID: 39184010 PMCID: PMC11343783 DOI: 10.1007/s13671-024-00440-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/22/2024] [Indexed: 08/27/2024]

Abstract

Purpose of review

Skin type diversity in image datasets refers to the representation of various skin types. This diversity allows for the verification of comparable performance of a trained model across different skin types. A widespread problem in datasets involving human skin is the lack of verifiable diversity in skin types, making it difficult to evaluate whether the performance of the trained models generalizes across different skin types. For example, the diversity issues in skin lesion datasets, which are used to train deep learning-based models, often result in lower accuracy for darker skin types that are typically under-represented in these datasets. Under-representation in datasets results in lower performance in deep learning models for under-represented skin types.

Recent findings

This issue has been discussed in previous works; however, the reporting of skin types, and inherent diversity, have not been fully assessed. Some works report skin types but do not attempt to assess the representation of each skin type in datasets. Others, focusing on skin lesions, identify the issue but do not measure skin type diversity in the datasets examined.

Summary

Effort is needed to address these shortcomings and move towards facilitating verifiable diversity. Building on previous works in skin lesion datasets, this review explores the general issue of skin type diversity by investigating and evaluating skin lesion datasets specifically. The main contributions of this work are an evaluation of publicly available skin lesion datasets and their metadata to assess the frequency and completeness of reporting of skin type and an investigation into the diversity and representation of each skin type within these datasets.

Supplementary Information

The online version contains material available at 10.1007/s13671-024-00440-0.

Collapse

Attallah O. Skin-CAD: Explainable deep learning classification of skin cancer from dermoscopic images by feature selection of dual high-level CNNs features and transfer learning. Comput Biol Med 2024;178:108798. [PMID: 38925085 DOI: 10.1016/j.compbiomed.2024.108798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 05/30/2024] [Accepted: 06/19/2024] [Indexed: 06/28/2024]

Abstract

Skin cancer (SC) significantly impacts many individuals' health all over the globe. Hence, it is imperative to promptly identify and diagnose such conditions at their earliest stages using dermoscopic imaging. Computer-aided diagnosis (CAD) methods relying on deep learning techniques especially convolutional neural networks (CNN) can effectively address this issue with outstanding outcomes. Nevertheless, such black box methodologies lead to a deficiency in confidence as dermatologists are incapable of comprehending and verifying the predictions that were made by these models. This article presents an advanced an explainable artificial intelligence (XAI) based CAD system named "Skin-CAD" which is utilized for the classification of dermoscopic photographs of SC. The system accurately categorises the photographs into two categories: benign or malignant, and further classifies them into seven subclasses of SC. Skin-CAD employs four CNNs of different topologies and deep layers. It gathers features out of a pair of deep layers of every CNN, particularly the final pooling and fully connected layers, rather than merely depending on attributes from a single deep layer. Skin-CAD applies the principal component analysis (PCA) dimensionality reduction approach to minimise the dimensions of pooling layer features. This also reduces the complexity of the training procedure compared to using deep features from a CNN that has a substantial size. Furthermore, it combines the reduced pooling features with the fully connected features of each CNN. Additionally, Skin-CAD integrates the dual-layer features of the four CNNs instead of entirely depending on the features of a single CNN architecture. In the end, it utilizes a feature selection step to determine the most important deep attributes. This helps to decrease the general size of the feature set and streamline the classification process. Predictions are analysed in more depth using the local interpretable model-agnostic explanations (LIME) approach. This method is used to create visual interpretations that align with an already existing viewpoint and adhere to recommended standards for general clarifications. Two benchmark datasets are employed to validate the efficiency of Skin-CAD which are the Skin Cancer: Malignant vs. Benign and HAM10000 datasets. The maximum accuracy achieved using Skin-CAD is 97.2 % and 96.5 % for the Skin Cancer: Malignant vs. Benign and HAM10000 datasets respectively. The findings of Skin-CAD demonstrate its potential to assist professional dermatologists in detecting and classifying SC precisely and quickly.

Collapse

Ingvar Å, Oloruntoba A, Sashindranath M, Miller R, Soyer HP, Guitera P, Caccetta T, Shumack S, Abbott L, Arnold C, Lawn C, Button-Sloan A, Janda M, Mar V. Minimum labelling requirements for dermatology artificial intelligence-based Software as Medical Device (SaMD): A consensus statement. Australas J Dermatol 2024;65:e21-e29. [PMID: 38419186 DOI: 10.1111/ajd.14222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2023] [Accepted: 01/21/2024] [Indexed: 03/02/2024]

Abstract

BACKGROUND/OBJECTIVES

Artificial intelligence (AI) holds remarkable potential to improve care delivery in dermatology. End users (health professionals and general public) of AI-based Software as Medical Devices (SaMD) require relevant labelling information to ensure that these devices can be used appropriately. Currently, there are no clear minimum labelling requirements for dermatology AI-based SaMDs.

METHODS

Common labelling recommendations for AI-based SaMD identified in a recent literature review were evaluated by an Australian expert panel in digital health and dermatology via a modified Delphi consensus process. A nine-point Likert scale was used to indicate importance of 10 items, and voting was conducted to determine the specific characteristics to include for some items. Consensus was achieved when more than 75% of the experts agreed that inclusion of information was necessary.

RESULTS

There was robust consensus supporting inclusion of all proposed items as minimum labelling requirements; indication for use, intended user, training and test data sets, algorithm design, image processing techniques, clinical validation, performance metrics, limitations, updates and adverse events. Nearly all suggested characteristics of the labelling items received endorsement, except for some characteristics related to performance metrics. Moreover, there was consensus that uniform labelling criteria should apply across all AI categories and risk classes set out by the Therapeutic Goods Administration.

CONCLUSIONS

This study provides critical evidence for setting labelling standards by the Therapeutic Goods Administration to safeguard patients, health professionals, consumers, industry, and regulatory bodies from AI-based dermatology SaMDs that do not currently provide adequate information about how they were developed and tested.

Collapse

Affiliation(s)

Åsa Ingvar Victorian Melanoma Service, Alfred Health, Melbourne, Victoria, Australia School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia Department of Dermatology, Skåne University Hospital, Lund, Sweden Department of Clinical Sciences, Lund University, Lund, Sweden
Ayooluwatomiwa Oloruntoba School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
Maithili Sashindranath School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
Robert Miller Australasian College of Dermatologists, Sydney, Australia
H Peter Soyer Australasian College of Dermatologists, Sydney, Australia Dermatology Research Centre, Frazer Institute, The University of Queensland, Brisbane, Queensland, Australia
Pascale Guitera Australasian College of Dermatologists, Sydney, Australia Faculty of Medicine and Health, The University of Sydney, Sydney, New South Wales, Australia Sydney Melanoma Diagnostic Centre, Royal Prince Alfred Hospital, Camperdown, Victoria, Australia Melanoma Institute Australia, The University of Sydney, Sydney, New South Wales, Australia
Tony Caccetta Australasian College of Dermatologists, Sydney, Australia Perth Dermatology Clinic, Perth, Western Australia, Australia
Stephen Shumack Australasian College of Dermatologists, Sydney, Australia Royal North Shore Hospital of Sydney, Sydney, New South Wales, Australia
Lisa Abbott Australasian College of Dermatologists, Sydney, Australia Faculty of Medicine and Health, The University of Sydney, Sydney, New South Wales, Australia The Skin Hospital, Sydney, New South Wales, Australia
Chris Arnold BioGrid Australia Ltd, Melbourne, Australia Hodgson Associates, Melbourne, Australia Australasian Society of Cosmetic Dermatologists, Melbourne, Australia
Craig Lawn Melanoma Institute Australia, The University of Sydney, Sydney, New South Wales, Australia Centre of Excellence in Melanoma Imaging, Brisbane, Queensland, Australia
Alison Button-Sloan Australian Melanoma Consumer Alliance, Melbourne, Victoria, Australia
Monika Janda Australasian College of Dermatologists, Sydney, Australia Dermatology Research Centre, Frazer Institute, The University of Queensland, Brisbane, Queensland, Australia Centre for Health Services Research, The University of Queensland, Brisbane, Queensland, Australia
Victoria Mar Victorian Melanoma Service, Alfred Health, Melbourne, Victoria, Australia School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia Australasian College of Dermatologists, Sydney, Australia

Collapse

Goessinger EV, Cerminara SE, Mueller AM, Gottfrois P, Huber S, Amaral M, Wenz F, Kostner L, Weiss L, Kunz M, Maul JT, Wespi S, Broman E, Kaufmann S, Patpanathapillai V, Treyer I, Navarini AA, Maul LV. Consistency of convolutional neural networks in dermoscopic melanoma recognition: A prospective real-world study about the pitfalls of augmented intelligence. J Eur Acad Dermatol Venereol 2024;38:945-953. [PMID: 38158385 DOI: 10.1111/jdv.19777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Accepted: 10/23/2023] [Indexed: 01/03/2024]

Abstract

BACKGROUND

Deep-learning convolutional neural networks (CNNs) have outperformed even experienced dermatologists in dermoscopic melanoma detection under controlled conditions. It remains unexplored how real-world dermoscopic image transformations affect CNN robustness.

OBJECTIVES

To investigate the consistency of melanoma risk assessment by two commercially available CNNs to help formulate recommendations for current clinical use.

METHODS

A comparative cohort study was conducted from January to July 2022 at the Department of Dermatology, University Hospital Basel. Five dermoscopic images of 116 different lesions on the torso of 66 patients were captured consecutively by the same operator without deliberate rotation. Classification was performed by two CNNs (CNN-1/CNN-2). Lesions were divided into four subgroups based on their initial risk scoring and clinical dignity assessment. Reliability was assessed by variation and intraclass correlation coefficients. Excisions were performed for melanoma suspicion or two consecutively elevated CNN risk scores, and benign lesions were confirmed by expert consensus (n = 3).

RESULTS

117 repeated image series of 116 melanocytic lesions (2 melanomas, 16 dysplastic naevi, 29 naevi, 1 solar lentigo, 1 suspicious and 67 benign) were classified. CNN-1 demonstrated superior measurement repeatability for clinically benign lesions with an initial malignant risk score (mean variation coefficient (mvc): CNN-1: 49.5(±34.3)%; CNN-2: 71.4(±22.5)%; p = 0.03), while CNN-2 outperformed for clinically benign lesions with benign scoring (mvc: CNN-1: 49.7(±22.7)%; CNN-2: 23.8(±29.3)%; p = 0.002). Both systems exhibited lowest score consistency for lesions with an initial malignant risk score and benign assessment. In this context, averaging three initial risk scores achieved highest sensitivity of dignity assessment (CNN-1: 94%; CNN-2: 89%). Intraclass correlation coefficients indicated 'moderate'-to-'good' reliability for both systems (CNN-1: 0.80, 95% CI:0.71-0.87, p < 0.001; CNN-2: 0.67, 95% CI:0.55-0.77, p < 0.001).

CONCLUSIONS

Potential user-induced image changes can significantly influence CNN classification. For clinical application, we recommend using the average of three initial risk scores. Furthermore, we advocate for CNN robustness optimization by cross-validation with repeated image sets.

TRIAL REGISTRATION

ClinicalTrials.gov (NCT04605822).

Collapse

Hekler A, Maron RC, Haggenmüller S, Schmitt M, Wies C, Utikal JS, Meier F, Hobelsberger S, Gellrich FF, Sergon M, Hauschild A, French LE, Heinzerling L, Schlager JG, Ghoreschi K, Schlaak M, Hilke FJ, Poch G, Korsing S, Berking C, Heppt MV, Erdmann M, Haferkamp S, Drexler K, Schadendorf D, Sondermann W, Goebeler M, Schilling B, Kather JN, Krieghoff-Henning E, Brinker TJ. Using multiple real-world dermoscopic photographs of one lesion improves melanoma classification via deep learning. J Am Acad Dermatol 2024;90:1028-1031. [PMID: 38199280 DOI: 10.1016/j.jaad.2023.11.065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 10/22/2023] [Accepted: 11/27/2023] [Indexed: 01/12/2024]

Affiliation(s)

Achim Hekler Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
Roman C Maron Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
Sarah Haggenmüller Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
Max Schmitt Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
Christoph Wies Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany; Medical Faculty, University Heidelberg, Heidelberg, Germany
Jochen S Utikal Department of Dermatology, Venereology and Allergology, University Medical Center Mannheim, Ruprecht-Karl University of Heidelberg, Mannheim, Germany; Skin Cancer Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany; DKFZ Hector Cancer Institute at the University Medical Center Mannheim, Mannheim, Germany
Friedegund Meier Department of Dermatology, Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Sarah Hobelsberger Department of Dermatology, Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Frank F Gellrich Department of Dermatology, Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Mildred Sergon Department of Dermatology, Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Axel Hauschild Department of Dermatology, University Hospital (UKSH), Kiel, Germany
Lars E French Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany; Dr. Phillip Frost Department of Dermatology and Cutaneous Surgery, University of Miami, Miller School of Medicine, Miami, Florida
Lucie Heinzerling Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany; Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany
Justin G Schlager Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany
Kamran Ghoreschi Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Max Schlaak Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Franz J Hilke Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Gabriela Poch Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Sören Korsing Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Carola Berking Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany
Markus V Heppt Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany
Michael Erdmann Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany
Sebastian Haferkamp Department of Dermatology, University Hospital Regensburg, Regensburg, Germany
Konstantin Drexler Department of Dermatology, University Hospital Regensburg, Regensburg, Germany
Dirk Schadendorf Department of Dermatology, Venereology and Allergology, University Hospital Essen, Essen, Germany
Wiebke Sondermann Department of Dermatology, Venereology and Allergology, University Hospital Essen, Essen, Germany
Matthias Goebeler Department of Dermatology, Venereology and Allergology, University Hospital Würzburg and National Center for Tumor Diseases (NCT) WERA Würzburg, Würzburg, Germany
Bastian Schilling Department of Dermatology, Venereology and Allergology, University Hospital Würzburg and National Center for Tumor Diseases (NCT) WERA Würzburg, Würzburg, Germany
Jakob N Kather Else Kroener Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany
Eva Krieghoff-Henning Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
Titus J Brinker Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany.

Collapse

Yee J, Rosendahl C, Aoude LG. The role of artificial intelligence and convolutional neural networks in the management of melanoma: a clinical, pathological, and radiological perspective. Melanoma Res 2024;34:96-104. [PMID: 38141179 PMCID: PMC10906187 DOI: 10.1097/cmr.0000000000000951] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 11/29/2023] [Indexed: 12/25/2023]

Wei ML, Tada M, So A, Torres R. Artificial intelligence and skin cancer. Front Med (Lausanne) 2024;11:1331895. [PMID: 38566925 PMCID: PMC10985205 DOI: 10.3389/fmed.2024.1331895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 02/26/2024] [Indexed: 04/04/2024] Open

Riaz S, Naeem A, Malik H, Naqvi RA, Loh WK. Federated and Transfer Learning Methods for the Classification of Melanoma and Nonmelanoma Skin Cancers: A Prospective Study. SENSORS (BASEL, SWITZERLAND) 2023;23:8457. [PMID: 37896548 PMCID: PMC10611214 DOI: 10.3390/s23208457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 10/09/2023] [Accepted: 10/12/2023] [Indexed: 10/29/2023]

Dong Z, Tao X, Du H, Wang J, Huang L, He C, Zhao Z, Mao X, Ai Y, Zhang B, Liu M, Xu H, Jiang Z, Sun Y, Li X, Liu Z, Chen J, Song Y, Liu G, Luo C, Li Y, Zeng X, Liu J, Zhu Y, Wu L, Yu H. Exploring the challenge of early gastric cancer diagnostic AI system face in multiple centers and its potential solutions. J Gastroenterol 2023;58:978-989. [PMID: 37515597 DOI: 10.1007/s00535-023-02025-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 07/10/2023] [Indexed: 07/31/2023]

Abstract

BACKGROUND

Artificial intelligence (AI) performed variously among test sets with different diversity due to sample selection bias, which can be stumbling block for AI applications. We previously tested AI named ENDOANGEL, diagnosing early gastric cancer (EGC) on single-center videos in man-machine competition. We aimed to re-test ENDOANGEL on multi-center videos to explore challenges applying AI in multiple centers, then upgrade ENDOANGEL and explore solutions to the challenge.

METHODS

ENDOANGEL was re-tested on multi-center videos retrospectively collected from 12 institutions and compared with performance in previously reported single-center videos. We then upgraded ENDOANGEL to ENDOANGEL-2022 with more training samples and novel algorithms and conducted competition between ENDOANGEL-2022 and endoscopists. ENDOANGEL-2022 was then tested on single-center videos and compared with performance in multi-center videos; the two AI systems were also compared with each other and endoscopists.

RESULTS

Forty-six EGCs and 54 non-cancers were included in multi-center video cohort. On diagnosing EGCs, compared with single-center videos, ENDOANGEL showed stable sensitivity (97.83% vs. 100.00%) while sharply decreased specificity (61.11% vs. 82.54%); ENDOANGEL-2022 showed similar tendency while achieving significantly higher specificity (79.63%, p < 0.01) making fewer mistakes on typical lesions than ENDOANGEL. On detecting gastric neoplasms, both AI showed stable sensitivity while sharply decreased specificity. Nevertheless, both AI outperformed endoscopists in the two competitions.

CONCLUSIONS

Great increase of false positives is a prominent challenge for applying EGC diagnostic AI in multiple centers due to high heterogeneity of negative cases. Optimizing AI by adding samples and using novel algorithms is promising to overcome this challenge.

Collapse

Affiliation(s)

Zehua Dong Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Xiao Tao Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Hongliu Du Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Junxiao Wang Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Li Huang Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Chiyi He Department of Gastroenterology, Yijishan Hospital of Wannan Medical College, Wuhu, 241001, Anhui, People's Republic of China
Zhifeng Zhao Department of Digestive Endoscopy, The Fourth Hospital of China Medical University, Shenyang, 110032, Liaoning Province, People's Republic of China
Xinli Mao Department of Gastroenterology, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, Linhai, Zhejiang, China
Yaowei Ai Department of Gastroenterology, The People's Hospital of China Three Gorges University, The First People's Hospital of Yichang, Yichang, China
Beiping Zhang Department of Gastroenterology, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China
Mei Liu Department of Gastroenterology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
Hong Xu Department of Endoscopy, The First Hospital of Jilin University, Changchun, China
Zhenyu Jiang Department of Gastroenterology, The Second Affiliated Hospital of Baotou Medical College, Baotou, Inner Mongolia, China
Yunwei Sun Department of Gastroenterology, Ruijin Hospital, Shanghai Jiaotong University, Gubei Branch, Shanghai, People's Republic of China
Xiuling Li Department of Gastroenterology, School of Clinical Medicine, Henan Provincial People's Hospital, People's Hospital of Zhengzhou University, Henan University, Zhengzhou, Henan, China
Zhihong Liu Department of Gastroenterology, Jilin City People's Hospital, Jilin, China
Jinzhong Chen Endoscopy Center, School of Medicine, The First Affiliated Hospital of Xiamen University, Xiamen University, Xiamen, China
Ying Song Department of Gastroenterology, Xi'an Gaoxin Hospital, Xi'an, 710032, Shaanxi Province, China
Guowei Liu Yi Xin Clinic, Changzhou, Jiangsu, China
Chaijie Luo Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Yanxia Li Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Xiaoquan Zeng Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Jun Liu Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Yijie Zhu Renmin Hospital of Wuhan University, Wuhan, China Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China
Lianlian Wu Renmin Hospital of Wuhan University, Wuhan, China. Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China. Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China. Department of Gastroenterology, Renmin Hospital of Wuhan University, 99 Zhangzhidong Road, Wuhan, 430060, Hubei Province, China.
Honggang Yu Renmin Hospital of Wuhan University, Wuhan, China. Key Laboratory of Hubei Province for Digestive System Disease, Renmin Hospital of Wuhan University, Wuhan, China. Hubei Provincial Clinical Research Center for Digestive Disease Minimally Invasive Incision, Renmin Hospital of Wuhan University, Wuhan, China. Department of Gastroenterology, Renmin Hospital of Wuhan University, 99 Zhangzhidong Road, Wuhan, 430060, Hubei Province, China.

Collapse

Jiang J, Jiang X, Xu L, Zhang Y, Zheng Y, Kong D. Noise-robustness test for ultrasound breast nodule neural network models as medical devices. Front Oncol 2023;13:1177225. [PMID: 37427110 PMCID: PMC10325648 DOI: 10.3389/fonc.2023.1177225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 06/05/2023] [Indexed: 07/11/2023] Open

Abstract

Background

Deep learning technology has been widely applied to medical image analysis. But due to the limitations of its own imaging principle, ultrasound image has the disadvantages of low resolution and high Speckle Noise density, which not only hinder the diagnosis of patients' conditions but also affect the extraction of ultrasound image features by computer technology.

Objective

In this study, we investigate the robustness of deep convolutional neural network (CNN) for classification, segmentation, and target detection of breast ultrasound image through random Salt & Pepper Noise and Gaussian Noise.

Methods

We trained and validated 9 CNN architectures in 8617 breast ultrasound images, but tested the models with noisy test set. Then, we trained and validated 9 CNN architectures with different levels of noise in these breast ultrasound images, and tested the models with noisy test set. Diseases of each breast ultrasound image in our dataset were annotated and voted by three sonographers based on their malignancy suspiciousness. we use evaluation indexes to evaluate the robustness of the neural network algorithm respectively.

Results

There is a moderate to high impact (The accuracy of the model decreased by about 5%-40%) on model accuracy when Salt and Pepper Noise, Speckle Noise, or Gaussian Noise is introduced to the images respectively. Consequently, DenseNet, UNet++ and Yolov5 were selected as the most robust model based on the selected index. When any two of these three kinds of noise are introduced into the image at the same time, the accuracy of the model will be greatly affected.

Conclusions

Our experimental results reveal new insights: The variation trend of accuracy with the noise level in Each network used for classification tasks and object detection tasks has some unique characteristics. This finding provides us with a method to reveal the black-box architecture of computer-aided diagnosis (CAD) systems. On the other hand, the purpose of this study is to explore the impact of adding noise directly to the image on the performance of neural networks, which is different from the existing articles on robustness in the field of medical image processing. Consequently, it provides a new way to evaluate the robustness of CAD systems in the future.

Collapse

Kränke T, Tripolt-Droschl K, Röd L, Hofmann-Wellenhof R, Koppitz M, Tripolt M. New AI-algorithms on smartphones to detect skin cancer in a clinical setting-A validation study. PLoS One 2023;18:e0280670. [PMID: 36791068 PMCID: PMC9931135 DOI: 10.1371/journal.pone.0280670] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 01/05/2023] [Indexed: 02/16/2023] Open

Abstract

BACKGROUND AND OBJECTIVES

The incidence of skin cancer is rising worldwide and there is medical need to optimize its early detection. This study was conducted to determine the diagnostic and risk-assessment accuracy of two new diagnosis-based neural networks (analyze and detect), which comply with the CE-criteria, in evaluating the malignant potential of various skin lesions on a smartphone. Of note, the intention of our study was to evaluate the performance of these medical products in a clinical setting for the first time.

METHODS

This was a prospective, single-center clinical study at one tertiary referral center in Graz, Austria. Patients, who were either scheduled for preventive skin examination or removal of at least one skin lesion were eligible for participation. Patients were assessed by at least two dermatologists and by the integrated algorithms on different mobile phones. The lesions to be recorded were randomly selected by the dermatologists. The diagnosis of the algorithm was stated as correct if it matched the diagnosis of the two dermatologists or the histology (if available). The histology was the reference standard, however, if both clinicians considered a lesion as being benign no histology was performed and the dermatologists were stated as reference standard.

RESULTS

A total of 238 patients with 1171 lesions (86 female; 36.13%) with an average age of 66.19 (SD = 17.05) was included. Sensitivity and specificity of the detect algorithm were 96.4% (CI 93.94-98.85) and 94.85% (CI 92.46-97.23); for the analyze algorithm a sensitivity of 95.35% (CI 93.45-97.25) and a specificity of 90.32% (CI 88.1-92.54) were achieved.

DISCUSSION

The studied neural networks succeeded analyzing the risk of skin lesions with a high diagnostic accuracy showing that they are sufficient tools in calculating the probability of a skin lesion being malignant. In conjunction with the wide spread use of smartphones this new AI approach opens the opportunity for a higher early detection rate of skin cancer with consecutive lower epidemiological burden of metastatic cancer and reducing health care costs. This neural network moreover facilitates the empowerment of patients, especially in regions with a low density of medical doctors.

REGISTRATION

Approved and registered at the ethics committee of the Medical University of Graz, Austria (Approval number: 30-199 ex 17/18).

Collapse

Wang J, Luo Y, Wang Z, Hounye AH, Cao C, Hou M, Zhang J. A cell phone app for facial acne severity assessment. APPL INTELL 2023;53:7614-7633. [PMID: 35919632 PMCID: PMC9336136 DOI: 10.1007/s10489-022-03774-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/15/2022] [Indexed: 11/28/2022]

Foahom Gouabou AC, Collenne J, Monnier J, Iguernaissi R, Damoiseaux JL, Moudafi A, Merad D. Computer Aided Diagnosis of Melanoma Using Deep Neural Networks and Game Theory: Application on Dermoscopic Images of Skin Lesions. Int J Mol Sci 2022;23:ijms232213838. [PMID: 36430315 PMCID: PMC9696950 DOI: 10.3390/ijms232213838] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 10/31/2022] [Accepted: 11/07/2022] [Indexed: 11/12/2022] Open

Abstract

Early detection of melanoma remains a daily challenge due to the increasing number of cases and the lack of dermatologists. Thus, AI-assisted diagnosis is considered as a possible solution for this issue. Despite the great advances brought by deep learning and especially convolutional neural networks (CNNs), computer-aided diagnosis (CAD) systems are still not used in clinical practice. This may be explained by the dermatologist's fear of being misled by a false negative and the assimilation of CNNs to a "black box", making their decision process difficult to understand by a non-expert. Decision theory, especially game theory, is a potential solution as it focuses on identifying the best decision option that maximizes the decision-maker's expected utility. This study presents a new framework for automated melanoma diagnosis. Pursuing the goal of improving the performance of existing systems, our approach also attempts to bring more transparency in the decision process. The proposed framework includes a multi-class CNN and six binary CNNs assimilated to players. The players' strategies is to first cluster the pigmented lesions (melanoma, nevus, and benign keratosis), using the introduced method of evaluating the confidence of the predictions, into confidence level (confident, medium, uncertain). Then, a subset of players has the strategy to refine the diagnosis for difficult lesions with medium and uncertain prediction. We used EfficientNetB5 as the backbone of our networks and evaluated our approach on the public ISIC dataset consisting of 8917 lesions: melanoma (1113), nevi (6705) and benign keratosis (1099). The proposed framework achieved an area under the receiver operating curve (AUROC) of 0.93 for melanoma, 0.96 for nevus and 0.97 for benign keratosis. Furthermore, our approach outperformed existing methods in this task, improving the balanced accuracy (BACC) of the best compared method from 77% to 86%. These results suggest that our framework provides an effective and explainable decision-making strategy. This approach could help dermatologists in their clinical practice for patients with atypical and difficult-to-diagnose pigmented lesions. We also believe that our system could serve as a didactic tool for less experienced dermatologists.

Collapse

Maron RC, Hekler A, Haggenmüller S, von Kalle C, Utikal JS, Müller V, Gaiser M, Meier F, Hobelsberger S, Gellrich FF, Sergon M, Hauschild A, French LE, Heinzerling L, Schlager JG, Ghoreschi K, Schlaak M, Hilke FJ, Poch G, Korsing S, Berking C, Heppt MV, Erdmann M, Haferkamp S, Schadendorf D, Sondermann W, Goebeler M, Schilling B, Kather JN, Fröhling S, Lipka DB, Krieghoff-Henning E, Brinker TJ. Model soups improve performance of dermoscopic skin cancer classifiers. Eur J Cancer 2022;173:307-316. [PMID: 35973360 DOI: 10.1016/j.ejca.2022.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 07/04/2022] [Indexed: 11/20/2022]

Abstract

BACKGROUND

Image-based cancer classifiers suffer from a variety of problems which negatively affect their performance. For example, variation in image brightness or different cameras can already suffice to diminish performance. Ensemble solutions, where multiple model predictions are combined into one, can improve these problems. However, ensembles are computationally intensive and less transparent to practitioners than single model solutions. Constructing model soups, by averaging the weights of multiple models into a single model, could circumvent these limitations while still improving performance.

OBJECTIVE

To investigate the performance of model soups for a dermoscopic melanoma-nevus skin cancer classification task with respect to (1) generalisation to images from other clinics, (2) robustness against small image changes and (3) calibration such that the confidences correspond closely to the actual predictive uncertainties.

METHODS

We construct model soups by fine-tuning pre-trained models on seven different image resolutions and subsequently averaging their weights. Performance is evaluated on a multi-source dataset including holdout and external components.

RESULTS

We find that model soups improve generalisation and calibration on the external component while maintaining performance on the holdout component. For robustness, we observe performance improvements for pertubated test images, while the performance on corrupted test images remains on par.

CONCLUSIONS

Overall, souping for skin cancer classifiers has a positive effect on generalisation, robustness and calibration. It is easy for practitioners to implement and by combining multiple models into a single model, complexity is reduced. This could be an important factor in achieving clinical applicability, as less complexity generally means more transparency.

Collapse

Affiliation(s)

Roman C Maron Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Achim Hekler Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Sarah Haggenmüller Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Christof von Kalle Department of Clinical-Translational Sciences, Charité University Medicine and Berlin Institute of Health (BIH), Berlin, Germany
Jochen S Utikal Department of Dermatology, Venereology and Allergology, University Medical Center Mannheim, Ruprecht-Karl University of Heidelberg, Mannheim, Germany; Skin Cancer Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany; DKFZ Hector Cancer Institute at the University Medical Center Mannheim, Mannheim, Germany
Verena Müller Department of Dermatology, Venereology and Allergology, University Medical Center Mannheim, Ruprecht-Karl University of Heidelberg, Mannheim, Germany; Skin Cancer Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany; DKFZ Hector Cancer Institute at the University Medical Center Mannheim, Mannheim, Germany
Maria Gaiser Department of Dermatology, Venereology and Allergology, University Medical Center Mannheim, Ruprecht-Karl University of Heidelberg, Mannheim, Germany; Skin Cancer Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany; DKFZ Hector Cancer Institute at the University Medical Center Mannheim, Mannheim, Germany
Friedegund Meier Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Sarah Hobelsberger Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Frank F Gellrich Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Mildred Sergon Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Axel Hauschild Department of Dermatology, University Hospital (UKSH), Kiel, Germany
Lars E French Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany; Dr. Phillip Frost Department of Dermatology and Cutaneous Surgery, University of Miami, Miller School of Medicine, Miami, FL, USA
Lucie Heinzerling Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany
Justin G Schlager Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany
Kamran Ghoreschi Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Max Schlaak Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Franz J Hilke Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Gabriela Poch Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Sören Korsing Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
Carola Berking Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany
Markus V Heppt Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany
Michael Erdmann Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany
Sebastian Haferkamp Department of Dermatology, University Hospital Regensburg, Regensburg, Germany
Dirk Schadendorf Department of Dermatology, Venereology and Allergology, University Hospital Essen, Essen, Germany
Wiebke Sondermann Department of Dermatology, Venereology and Allergology, University Hospital Essen, Essen, Germany
Matthias Goebeler Department of Dermatology, Venereology and Allergology, University Hospital Würzburg, Würzburg, Germany
Bastian Schilling Department of Dermatology, Venereology and Allergology, University Hospital Würzburg, Würzburg, Germany
Jakob N Kather Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany
Stefan Fröhling Department of Translational Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg and German Cancer Research Center (DKFZ), Heidelberg, Germany
Daniel B Lipka Department of Translational Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg and German Cancer Research Center (DKFZ), Heidelberg, Germany
Eva Krieghoff-Henning Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Titus J Brinker Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany.

Collapse

Jin T, Jiang Y, Mao B, Wang X, Lu B, Qian J, Zhou H, Ma T, Zhang Y, Li S, Shi Y, Yao Z. Multi-center verification of the influence of data ratio of training sets on test results of an AI system for detecting early gastric cancer based on the YOLO-v4 algorithm. Front Oncol 2022;12:953090. [PMID: 36052264 PMCID: PMC9425091 DOI: 10.3389/fonc.2022.953090] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 07/27/2022] [Indexed: 11/24/2022] Open

Combalia M, Codella N, Rotemberg V, Carrera C, Dusza S, Gutman D, Helba B, Kittler H, Kurtansky NR, Liopyris K, Marchetti MA, Podlipnik S, Puig S, Rinner C, Tschandl P, Weber J, Halpern A, Malvehy J. Validation of artificial intelligence prediction models for skin cancer diagnosis using dermoscopy images: the 2019 International Skin Imaging Collaboration Grand Challenge. Lancet Digit Health 2022;4:e330-e339. [PMID: 35461690 PMCID: PMC9295694 DOI: 10.1016/s2589-7500(22)00021-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 12/23/2021] [Accepted: 01/26/2022] [Indexed: 01/08/2023]

Abstract

BACKGROUND

Previous studies of artificial intelligence (AI) applied to dermatology have shown AI to have higher diagnostic classification accuracy than expert dermatologists; however, these studies did not adequately assess clinically realistic scenarios, such as how AI systems behave when presented with images of disease categories that are not included in the training dataset or images drawn from statistical distributions with significant shifts from training distributions. We aimed to simulate these real-world scenarios and evaluate the effects of image source institution, diagnoses outside of the training set, and other image artifacts on classification accuracy, with the goal of informing clinicians and regulatory agencies about safety and real-world accuracy.

METHODS

We designed a large dermoscopic image classification challenge to quantify the performance of machine learning algorithms for the task of skin cancer classification from dermoscopic images, and how this performance is affected by shifts in statistical distributions of data, disease categories not represented in training datasets, and imaging or lesion artifacts. Factors that might be beneficial to performance, such as clinical metadata and external training data collected by challenge participants, were also evaluated. 25 331 training images collected from two datasets (in Vienna [HAM10000] and Barcelona [BCN20000]) between Jan 1, 2000, and Dec 31, 2018, across eight skin diseases, were provided to challenge participants to design appropriate algorithms. The trained algorithms were then tested for balanced accuracy against the HAM10000 and BCN20000 test datasets and data from countries not included in the training dataset (Turkey, New Zealand, Sweden, and Argentina). Test datasets contained images of all diagnostic categories available in training plus other diagnoses not included in training data (not trained category). We compared the performance of the algorithms against that of 18 dermatologists in a simulated setting that reflected intended clinical use.

FINDINGS

64 teams submitted 129 state-of-the-art algorithm predictions on a test set of 8238 images. The best performing algorithm achieved 58·8% balanced accuracy on the BCN20000 data, which was designed to better reflect realistic clinical scenarios, compared with 82·0% balanced accuracy on HAM10000, which was used in a previously published benchmark. Shifted statistical distributions and disease categories not included in training data contributed to decreases in accuracy. Image artifacts, including hair, pen markings, ulceration, and imaging source institution, decreased accuracy in a complex manner that varied based on the underlying diagnosis. When comparing algorithms to expert dermatologists (2460 ratings on 1269 images), algorithms performed better than experts in most categories, except for actinic keratoses (similar accuracy on average) and images from categories not included in training data (26% correct for experts vs 6% correct for algorithms, p<0·0001). For the top 25 submitted algorithms, 47·1% of the images from categories not included in training data were misclassified as malignant diagnoses, which would lead to a substantial number of unnecessary biopsies if current state-of-the-art AI technologies were clinically deployed.

INTERPRETATION

We have identified specific deficiencies and safety issues in AI diagnostic systems for skin cancer that should be addressed in future diagnostic evaluation protocols to improve safety and reliability in clinical practice.

FUNDING

Melanoma Research Alliance and La Marató de TV3.

Collapse

Affiliation(s)

Marc Combalia Melanoma Unit, Dermatology Department, Hospital Cĺınic Barcelona, Universitat de Barcelona, CIBER de Enfermedades raras IDIBAPS, Barcelona, Spain
Noel Codella Microsoft, Redmond, WA, USA
Veronica Rotemberg Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
Cristina Carrera Melanoma Unit, Dermatology Department, Hospital Cĺınic Barcelona, Universitat de Barcelona, CIBER de Enfermedades raras IDIBAPS, Barcelona, Spain
Stephen Dusza Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
David Gutman Emory University School of Medicine, Department of Biomedical Informatics, Atlanta, GA, USA
Brian Helba Kitware, Clifton Park, NY, USA
Harald Kittler Department of Dermatology, Medical University of Vienna, Vienna, Austria
Nicholas R Kurtansky Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Konstantinos Liopyris University of Athens Medical School, Department of Dermatology-Venereology, Athens, Greece
Michael A Marchetti Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Sebastian Podlipnik Melanoma Unit, Dermatology Department, Hospital Cĺınic Barcelona, Universitat de Barcelona, CIBER de Enfermedades raras IDIBAPS, Barcelona, Spain
Susana Puig Melanoma Unit, Dermatology Department, Hospital Cĺınic Barcelona, Universitat de Barcelona, CIBER de Enfermedades raras IDIBAPS, Barcelona, Spain
Christoph Rinner Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, Vienna, Austria
Philipp Tschandl Department of Dermatology, Medical University of Vienna, Vienna, Austria
Jochen Weber Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Allan Halpern Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Josep Malvehy Melanoma Unit, Dermatology Department, Hospital Cĺınic Barcelona, Universitat de Barcelona, CIBER de Enfermedades raras IDIBAPS, Barcelona, Spain

Collapse

Hauser K, Kurz A, Haggenmüller S, Maron RC, von Kalle C, Utikal JS, Meier F, Hobelsberger S, Gellrich FF, Sergon M, Hauschild A, French LE, Heinzerling L, Schlager JG, Ghoreschi K, Schlaak M, Hilke FJ, Poch G, Kutzner H, Berking C, Heppt MV, Erdmann M, Haferkamp S, Schadendorf D, Sondermann W, Goebeler M, Schilling B, Kather JN, Fröhling S, Lipka DB, Hekler A, Krieghoff-Henning E, Brinker TJ. Explainable artificial intelligence in skin cancer recognition: A systematic review. Eur J Cancer 2022;167:54-69. [PMID: 35390650 DOI: 10.1016/j.ejca.2022.02.025] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 02/22/2022] [Accepted: 02/24/2022] [Indexed: 01/18/2023]

Affiliation(s)

Katja Hauser Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Alexander Kurz Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Sarah Haggenmüller Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Roman C Maron Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Christof von Kalle Department of Clinical-Translational Sciences, Charité University Medicine and Berlin Institute of Health (BIH), Berlin, Germany
Jochen S Utikal Department of Dermatology, Heidelberg University, Mannheim, Germany; Skin Cancer Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany
Friedegund Meier Skin Cancer Center at the University Cancer Centre and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Sarah Hobelsberger Skin Cancer Center at the University Cancer Centre and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Frank F Gellrich Skin Cancer Center at the University Cancer Centre and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Mildred Sergon Skin Cancer Center at the University Cancer Centre and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Germany
Axel Hauschild Department of Dermatology, University Hospital (UKSH), Kiel, Germany
Lars E French Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany; Dr. Phillip Frost Department of Dermatology and Cutaneous Surgery, University of Miami, Miller School of Medicine, Miami, FL, USA
Lucie Heinzerling Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany
Justin G Schlager Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany
Kamran Ghoreschi Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Berlin, Germany
Max Schlaak Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Berlin, Germany
Franz J Hilke Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Berlin, Germany
Gabriela Poch Department of Dermatology, Venereology and Allergology, Charité - Universitätsmedizin Berlin, Berlin, Germany
Heinz Kutzner Dermatopathology Laboratory, Friedrichshafen, Germany
Carola Berking Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - EMN, Friedrich-Alexander University Erlangen, Nuremberg, Germany
Markus V Heppt Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - EMN, Friedrich-Alexander University Erlangen, Nuremberg, Germany
Michael Erdmann Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen - EMN, Friedrich-Alexander University Erlangen, Nuremberg, Germany
Sebastian Haferkamp Department of Dermatology, University Hospital Regensburg, Regensburg, Germany
Dirk Schadendorf Department of Dermatology, University Hospital Essen, Essen, Germany
Wiebke Sondermann Department of Dermatology, University Hospital Essen, Essen, Germany
Matthias Goebeler Department of Dermatology, University Hospital Würzburg, Würzburg, Germany
Bastian Schilling Department of Dermatology, University Hospital Würzburg, Würzburg, Germany
Jakob N Kather Division of Translational Medical Oncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
Stefan Fröhling National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Daniel B Lipka National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Achim Hekler Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Eva Krieghoff-Henning Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany
Titus J Brinker Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany.

Collapse

Stiff KM, Franklin MJ, Zhou Y, Madabhushi A, Knackstedt TJ. Artificial Intelligence and Melanoma: A Comprehensive Review of Clinical, Dermoscopic, and Histologic Applications. Pigment Cell Melanoma Res 2022;35:203-211. [PMID: 35038383 DOI: 10.1111/pcmr.13027] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 11/24/2021] [Accepted: 01/09/2022] [Indexed: 11/30/2022]

Popescu D, El-Khatib M, El-Khatib H, Ichim L. New Trends in Melanoma Detection Using Neural Networks: A Systematic Review. SENSORS (BASEL, SWITZERLAND) 2022;22:496. [PMID: 35062458 PMCID: PMC8778535 DOI: 10.3390/s22020496] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 12/28/2021] [Accepted: 01/05/2022] [Indexed: 12/29/2022]

Bozsányi S, Varga NN, Farkas K, Bánvölgyi A, Lőrincz K, Lihacova I, Lihachev A, Plorina EV, Bartha Á, Jobbágy A, Kuroli E, Paragh G, Holló P, Medvecz M, Kiss N, Wikonkál NM. Multispectral Imaging Algorithm Predicts Breslow Thickness of Melanoma. J Clin Med 2021;11:jcm11010189. [PMID: 35011930 PMCID: PMC8745435 DOI: 10.3390/jcm11010189] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Revised: 12/21/2021] [Accepted: 12/26/2021] [Indexed: 12/20/2022] Open

Affiliation(s)

Szabolcs Bozsányi Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.) Selye János Doctoral College for Advanced Studies, Clinical Sciences Research Group, 1085 Budapest, Hungary
Noémi Nóra Varga Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
Klára Farkas Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
András Bánvölgyi Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
Kende Lőrincz Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
Ilze Lihacova Biophotonics Laboratory, Institute of Atomic Physics and Spectroscopy, University of Latvia, 1004 Riga, Latvia; (I.L.); (A.L.); (E.V.P.)
Alexey Lihachev Biophotonics Laboratory, Institute of Atomic Physics and Spectroscopy, University of Latvia, 1004 Riga, Latvia; (I.L.); (A.L.); (E.V.P.)
Emilija Vija Plorina Biophotonics Laboratory, Institute of Atomic Physics and Spectroscopy, University of Latvia, 1004 Riga, Latvia; (I.L.); (A.L.); (E.V.P.)
Áron Bartha Department of Bioinformatics, Semmelweis University, 1085 Budapest, Hungary; 2nd Department of Pediatrics, Semmelweis University, 1085 Budapest, Hungary
Antal Jobbágy Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
Enikő Kuroli Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.) 1st Department of Pathology and Experimental Cancer Research, Semmelweis University, 1085 Budapest, Hungary
György Paragh Department of Dermatology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA; Department of Cell Stress Biology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
Péter Holló Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
Márta Medvecz Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
Norbert Kiss Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.)
Norbert M. Wikonkál Department of Dermatology, Venereology and Dermatooncology, Semmelweis University, 1085 Budapest, Hungary; (S.B.); (N.N.V.); (K.F.); (A.B.); (K.L.); (A.J.); (E.K.); (P.H.); (M.M.); (N.K.) Correspondence:

Collapse

Winkler JK, Tschandl P, Toberer F, Sies K, Fink C, Enk A, Kittler H, Haenssle HA. Monitoring patients at risk for melanoma: May convolutional neural networks replace the strategy of sequential digital dermoscopy? Eur J Cancer 2021;160:180-188. [PMID: 34840028 DOI: 10.1016/j.ejca.2021.10.030] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 10/06/2021] [Accepted: 10/25/2021] [Indexed: 01/11/2023]

Skin cancer classification via convolutional neural networks: systematic review of studies involving human experts. Eur J Cancer 2021;156:202-216. [PMID: 34509059 DOI: 10.1016/j.ejca.2021.06.049] [Citation(s) in RCA: 84] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 06/18/2021] [Accepted: 06/28/2021] [Indexed: 12/23/2022]

Abstract

BACKGROUND

Multiple studies have compared the performance of artificial intelligence (AI)-based models for automated skin cancer classification to human experts, thus setting the cornerstone for a successful translation of AI-based tools into clinicopathological practice.

OBJECTIVE

The objective of the study was to systematically analyse the current state of research on reader studies involving melanoma and to assess their potential clinical relevance by evaluating three main aspects: test set characteristics (holdout/out-of-distribution data set, composition), test setting (experimental/clinical, inclusion of metadata) and representativeness of participating clinicians.

METHODS

PubMed, Medline and ScienceDirect were screened for peer-reviewed studies published between 2017 and 2021 and dealing with AI-based skin cancer classification involving melanoma. The search terms skin cancer classification, deep learning, convolutional neural network (CNN), melanoma (detection), digital biomarkers, histopathology and whole slide imaging were combined. Based on the search results, only studies that considered direct comparison of AI results with clinicians and had a diagnostic classification as their main objective were included.

RESULTS

A total of 19 reader studies fulfilled the inclusion criteria. Of these, 11 CNN-based approaches addressed the classification of dermoscopic images; 6 concentrated on the classification of clinical images, whereas 2 dermatopathological studies utilised digitised histopathological whole slide images.

CONCLUSIONS

All 19 included studies demonstrated superior or at least equivalent performance of CNN-based classifiers compared with clinicians. However, almost all studies were conducted in highly artificial settings based exclusively on single images of the suspicious lesions. Moreover, test sets mainly consisted of holdout images and did not represent the full range of patient populations and melanoma subtypes encountered in clinical practice.

Collapse

A benchmark for neural network robustness in skin cancer classification. Eur J Cancer 2021;155:191-199. [PMID: 34388516 DOI: 10.1016/j.ejca.2021.06.047] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Revised: 06/18/2021] [Accepted: 06/29/2021] [Indexed: 02/06/2023]

Abstract

BACKGROUND

One prominent application for deep learning-based classifiers is skin cancer classification on dermoscopic images. However, classifier evaluation is often limited to holdout data which can mask common shortcomings such as susceptibility to confounding factors. To increase clinical applicability, it is necessary to thoroughly evaluate such classifiers on out-of-distribution (OOD) data.

OBJECTIVE

The objective of the study was to establish a dermoscopic skin cancer benchmark in which classifier robustness to OOD data can be measured.

METHODS

Using a proprietary dermoscopic image database and a set of image transformations, we create an OOD robustness benchmark and evaluate the robustness of four different convolutional neural network (CNN) architectures on it.

RESULTS

The benchmark contains three data sets-Skin Archive Munich (SAM), SAM-corrupted (SAM-C) and SAM-perturbed (SAM-P)-and is publicly available for download. To maintain the benchmark's OOD status, ground truth labels are not provided and test results should be sent to us for assessment. The SAM data set contains 319 unmodified and biopsy-verified dermoscopic melanoma (n = 194) and nevus (n = 125) images. SAM-C and SAM-P contain images from SAM which were artificially modified to test a classifier against low-quality inputs and to measure its prediction stability over small image changes, respectively. All four CNNs showed susceptibility to corruptions and perturbations.

CONCLUSIONS

This benchmark provides three data sets which allow for OOD testing of binary skin cancer classifiers. Our classifier performance confirms the shortcomings of CNNs and provides a frame of reference. Altogether, this benchmark should facilitate a more thorough evaluation process and thereby enable the development of more robust skin cancer classifiers.

Collapse

Jubair F, Al-Karadsheh O, Malamos D, Al Mahdi S, Saad Y, Hassona Y. A novel lightweight deep convolutional neural network for early detection of oral cancer. Oral Dis 2021;28:1123-1130. [PMID: 33636041 DOI: 10.1111/odi.13825] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 01/30/2021] [Accepted: 02/06/2021] [Indexed: 12/11/2022]