Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Fink C, Blum A, Buhl T, Mitteldorf C, Hofmann-Wellenhof R, Deinlein T, Stolz W, Trennheuser L, Cussigh C, Deltgen D, Winkler JK, Toberer F, Enk A, Rosenberger A, Haenssle HA. Diagnostic performance of a deep learning convolutional neural network in the differentiation of combined naevi and melanomas. J Eur Acad Dermatol Venereol 2020;34:1355-1361. [PMID: 31856342 DOI: 10.1111/jdv.16165] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Received: 07/15/2019] [Accepted: 11/26/2019] [Indexed: 01/10/2023]

Cited by Other Article(s)

Tognetti L, Miracapillo C, Leonardelli S, Luschi A, Iadanza E, Cevenini G, Rubegni P, Cartocci A. Deep Learning Techniques for the Dermoscopic Differential Diagnosis of Benign/Malignant Melanocytic Skin Lesions: From the Past to the Present. Bioengineering (Basel) 2024;11:758. [PMID: 39199716 PMCID: PMC11351129 DOI: 10.3390/bioengineering11080758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2024] [Revised: 07/15/2024] [Accepted: 07/17/2024] [Indexed: 09/01/2024] Open

Abstract

There has been growing scientific interest in deep learning (DL) techniques applied to skin cancer diagnosis over the last decade. Though encouraging data have been reported globally, several discrepancies have been observed in terms of study methodology, presentation of results and validation in clinical settings. The present review aimed to screen the scientific literature on the application of DL techniques to the dermoscopic differential diagnosis of melanoma and nevi, and to extract those original studies that adequately report on a DL model and compare it with clinicians and/or another DL architecture. The second aim was to examine those studies together according to a standard set of statistical measures, and the third was to provide dermatologists with a comprehensive explanation and definition of the most used artificial intelligence (AI) terms, so that they can better understand the scientific literature on this topic and, in parallel, stay updated on the newest applications in the medical dermatologic field, along with a historical perspective. After screening nearly 2000 records, a subset of 54 was selected. Comparing the 20 studies reporting on convolutional neural network (CNN)/deep convolutional neural network (DCNN) models, we have a scenario of highly performant DL algorithms, especially in terms of low false positive results, with average values of accuracy (83.99%), sensitivity (77.74%), and specificity (80.61%). Looking at the comparison with diagnoses by clinicians (13 studies), the main difference lies in the specificity values, with a +15.63% increase for the CNN/DCNN models (average specificity of 84.87%) compared to humans (average specificity of 64.24%) with a 14.85% gap in average accuracy; the sensitivity values were comparable (79.77% for DL and 79.78% for humans). To obtain higher diagnostic accuracy and feasibility in clinical practice, rather than in experimental retrospective settings, future DL models should be based on a large dataset integrating dermoscopic images with relevant clinical and anamnestic data that is prospectively tested and adequately compared with physicians.

Yazdanparast T, Shamsipour M, Ayatollahi A, Delavar S, Ahmadi M, Samadi A, Firooz A. Comparison of the Diagnostic Accuracy of Teledermoscopy, Face-to-Face Examinations and Artificial Intelligence in the Diagnosis of Melanoma. Indian J Dermatol 2024;69:296-300. [PMID: 39296707 PMCID: PMC11407570 DOI: 10.4103/ijd.ijd_61_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/01/2024] [Accepted: 03/01/2024] [Indexed: 09/21/2024] Open

Abstract

Background

Rapid diagnosis of melanoma is necessary for a good prognosis. The use of teledermatology and artificial intelligence for this purpose is growing, but their diagnostic accuracy has rarely been measured in a clinical setting.

Objective

The purpose of this study was to assess the diagnostic accuracy of the teledermoscopy method using the FotoFinder device as well as the Moleanalyzer Pro artificial intelligence (AI) Assistant and to compare them with the face-to-face clinical examination for the diagnosis of melanoma confirmed with histopathology.

Methods

Thirty melanocytic moles from 29 patients were included in the study. Each mole was assessed by three methods: face-to-face examination, FotoFinder teledermoscopy and the Moleanalyzer Pro software. The results of each method were compared with the gold standard (pathology). The sensitivity and specificity of the three methods were calculated for malignant and borderline versus benign lesions. Inter-method reliability between the gold standard and the other methods was evaluated using per cent agreement and Cohen's kappa coefficient.

Results

Five moles had a histopathological diagnosis of melanoma, and six and 19 moles were diagnosed as borderline and benign, respectively. Sensitivities and specificities were, respectively, as follows: face-to-face (90.9%, 57.9%), FotoFinder teledermoscopy (63.6%, 78.9%), FotoFinder® Moleanalyzer Pro (36.4%, 42.1%). Agreement with biopsy-obtained diagnosis categories of benign, borderline and malignant for face-to-face was 63.33%, FotoFinder teledermoscopy 73.33%, and FotoFinder® Moleanalyzer Pro 40%.

Conclusions

Teledermoscopy had the highest agreement with the reference diagnosis as well as the highest specificity, which reduced biopsy referrals. The FotoFinder® Moleanalyzer Pro had the lowest agreement and therefore cannot replace dermatologist decision-making.
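
The accuracy measures named in the Methods above (sensitivity, specificity and Cohen's kappa) can be illustrated with a minimal Python sketch. The function names and the example counts are assumptions made for illustration only: the counts (10 of 11 malignant or borderline lesions flagged, 11 of 19 benign lesions cleared) were chosen to be consistent with the reported face-to-face percentages and are not taken from the study's data.

```python
from collections import Counter


def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # share of diseased lesions correctly flagged
    specificity = tn / (tn + fp)  # share of benign lesions correctly cleared
    return sensitivity, specificity


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length sequences of category labels."""
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the raters' marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    # Undefined (division by zero) when chance agreement is exactly 1.
    return (observed - expected) / (1 - expected)


# Hypothetical counts: 11 malignant or borderline lesions, 19 benign lesions.
sens, spec = sensitivity_specificity(tp=10, fn=1, tn=11, fp=8)
print(f"sensitivity={sens:.1%} specificity={spec:.1%}")
```

Kappa corrects raw per cent agreement for the agreement expected by chance, which is why a method can show moderate raw agreement with pathology and still have a low kappa.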

Salinas MP, Sepúlveda J, Hidalgo L, Peirano D, Morel M, Uribe P, Rotemberg V, Briones J, Mery D, Navarrete-Dechent C. A systematic review and meta-analysis of artificial intelligence versus clinicians for skin cancer diagnosis. NPJ Digit Med 2024;7:125. [PMID: 38744955 PMCID: PMC11094047 DOI: 10.1038/s41746-024-01103-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/22/2023] [Accepted: 04/04/2024] [Indexed: 05/16/2024] Open

Goessinger EV, Cerminara SE, Mueller AM, Gottfrois P, Huber S, Amaral M, Wenz F, Kostner L, Weiss L, Kunz M, Maul JT, Wespi S, Broman E, Kaufmann S, Patpanathapillai V, Treyer I, Navarini AA, Maul LV. Consistency of convolutional neural networks in dermoscopic melanoma recognition: A prospective real-world study about the pitfalls of augmented intelligence. J Eur Acad Dermatol Venereol 2024;38:945-953. [PMID: 38158385 DOI: 10.1111/jdv.19777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/01/2023] [Accepted: 10/23/2023] [Indexed: 01/03/2024]

Abstract

BACKGROUND

Deep-learning convolutional neural networks (CNNs) have outperformed even experienced dermatologists in dermoscopic melanoma detection under controlled conditions. It remains unexplored how real-world dermoscopic image transformations affect CNN robustness.

OBJECTIVES

To investigate the consistency of melanoma risk assessment by two commercially available CNNs to help formulate recommendations for current clinical use.

METHODS

A comparative cohort study was conducted from January to July 2022 at the Department of Dermatology, University Hospital Basel. Five dermoscopic images of 116 different lesions on the torso of 66 patients were captured consecutively by the same operator without deliberate rotation. Classification was performed by two CNNs (CNN-1/CNN-2). Lesions were divided into four subgroups based on their initial risk scoring and clinical dignity assessment. Reliability was assessed by variation and intraclass correlation coefficients. Excisions were performed for melanoma suspicion or two consecutively elevated CNN risk scores, and benign lesions were confirmed by expert consensus (n = 3).

RESULTS

117 repeated image series of 116 melanocytic lesions (2 melanomas, 16 dysplastic naevi, 29 naevi, 1 solar lentigo, 1 suspicious and 67 benign) were classified. CNN-1 demonstrated superior measurement repeatability for clinically benign lesions with an initial malignant risk score (mean variation coefficient (mvc): CNN-1: 49.5(±34.3)%; CNN-2: 71.4(±22.5)%; p = 0.03), while CNN-2 outperformed for clinically benign lesions with benign scoring (mvc: CNN-1: 49.7(±22.7)%; CNN-2: 23.8(±29.3)%; p = 0.002). Both systems exhibited lowest score consistency for lesions with an initial malignant risk score and benign assessment. In this context, averaging three initial risk scores achieved highest sensitivity of dignity assessment (CNN-1: 94%; CNN-2: 89%). Intraclass correlation coefficients indicated 'moderate'-to-'good' reliability for both systems (CNN-1: 0.80, 95% CI:0.71-0.87, p < 0.001; CNN-2: 0.67, 95% CI:0.55-0.77, p < 0.001).

CONCLUSIONS

Potential user-induced image changes can significantly influence CNN classification. For clinical application, we recommend using the average of three initial risk scores. Furthermore, we advocate for CNN robustness optimization by cross-validation with repeated image sets.
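
The recommendation above to average three initial risk scores, and the variation coefficient used to describe repeatability, can be sketched as follows. This is a minimal illustration: the function names, the use of the sample standard deviation and the example scores are assumptions, not details reported by the study.

```python
from statistics import mean, stdev


def variation_coefficient(scores):
    """Coefficient of variation in per cent: sample SD divided by the mean."""
    return 100 * stdev(scores) / mean(scores)


def averaged_initial_score(scores, k=3):
    """Mean of the first k risk scores in a repeated image series."""
    return mean(scores[:k])


# Hypothetical risk scores from five consecutive images of one lesion.
series = [0.2, 0.4, 0.6, 0.3, 0.5]
print(f"variation coefficient: {variation_coefficient(series):.1f}%")
print(f"average of first three scores: {averaged_initial_score(series):.2f}")
```

Averaging damps the image-to-image fluctuation of a single score, so a lesion whose individual scores straddle a decision threshold is classified from a more stable value.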

TRIAL REGISTRATION

ClinicalTrials.gov (NCT04605822).

Miller I, Rosic N, Stapelberg M, Hudson J, Coxon P, Furness J, Walsh J, Climstein M. Performance of Commercial Dermatoscopic Systems That Incorporate Artificial Intelligence for the Identification of Melanoma in General Practice: A Systematic Review. Cancers (Basel) 2024;16:1443. [PMID: 38611119 PMCID: PMC11011068 DOI: 10.3390/cancers16071443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/29/2024] [Revised: 04/03/2024] [Accepted: 04/04/2024] [Indexed: 04/14/2024] Open

Yee J, Rosendahl C, Aoude LG. The role of artificial intelligence and convolutional neural networks in the management of melanoma: a clinical, pathological, and radiological perspective. Melanoma Res 2024;34:96-104. [PMID: 38141179 PMCID: PMC10906187 DOI: 10.1097/cmr.0000000000000951] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 08/16/2023] [Accepted: 11/29/2023] [Indexed: 12/25/2023]

Joly-Chevrier M, Nguyen AXL, Liang L, Lesko-Krleza M, Lefrançois P. The State of Artificial Intelligence in Skin Cancer Publications. J Cutan Med Surg 2024;28:146-152. [PMID: 38323537 PMCID: PMC11015717 DOI: 10.1177/12034754241229361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/08/2024]

Helenason J, Ekström C, Falk M, Papachristou P. Exploring the feasibility of an artificial intelligence based clinical decision support system for cutaneous melanoma detection in primary care - a mixed method study. Scand J Prim Health Care 2024;42:51-60. [PMID: 37982736 PMCID: PMC10851794 DOI: 10.1080/02813432.2023.2283190] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 09/28/2022] [Accepted: 11/08/2023] [Indexed: 11/21/2023] Open

Goessinger EV, Niederfeilner JC, Cerminara S, Maul JT, Kostner L, Kunz M, Huber S, Koral E, Habermacher L, Sabato G, Tadic A, Zimmermann C, Navarini A, Maul LV. Patient and dermatologists' perspectives on augmented intelligence for melanoma screening: A prospective study. J Eur Acad Dermatol Venereol 2024. [PMID: 38411348 DOI: 10.1111/jdv.19905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/05/2023] [Accepted: 01/22/2024] [Indexed: 02/28/2024]

Abstract

BACKGROUND

Artificial intelligence (AI) shows promising potential to enhance human decision-making as synergistic augmented intelligence (AuI), but requires critical evaluation for skin cancer screening in a real-world setting.

OBJECTIVES

To investigate the perspectives of patients and dermatologists after skin cancer screening by human, artificial and augmented intelligence.

METHODS

A prospective comparative cohort study conducted at the University Hospital Basel included 205 patients (at high-risk of developing melanoma, with resected or advanced disease) and 8 dermatologists. Patients underwent skin cancer screening by a dermatologist with subsequent 2D and 3D total-body photography (TBP). Any suspicious and all melanocytic skin lesions ≥3 mm were imaged with digital dermoscopes and classified by corresponding convolutional neural networks (CNNs). Excisions were performed based on dermatologist's melanoma suspicion, study-defined elevated CNN risk-scores and/or melanoma suspicion by AuI. Subsequently, all patients and dermatologists were surveyed about their experience using questionnaires, including quantification of patient's safety sense following different examinations (subjective safety score (SSS): 0-10).

RESULTS

Most patients believed AI could improve diagnostic performance (95.5%, n = 192/201). In total, 83.4% preferred AuI-based skin cancer screening compared to examination by AI or dermatologist alone (3D-TBP: 61.3%; 2D-TBP: 22.1%, n = 199). Regarding SSS, AuI induced a significantly higher feeling of safety than AI (mean-SSS (mSSS): 9.5 vs. 7.7, p < 0.0001) or dermatologist screening alone (mSSS: 9.5 vs. 9.1, p = 0.001). Most dermatologists expressed high trust in AI examination results (3D-TBP: 90.2%; 2D-TBP: 96.1%, n = 205). In 68.3% of the examinations, dermatologists felt that diagnostic accuracy improved through additional AI-assessment (n = 140/205). Especially beginners (<2 years' dermoscopic experience; 61.8%, n = 94/152) felt AI facilitated their clinical work compared to experts (>5 years' dermoscopic experience; 20.9%, n = 9/43). Contrarily, in divergent risk assessments, only 1.5% of dermatologists trusted a benign CNN-classification more than personal malignancy suspicion (n = 3/205).

CONCLUSIONS

While patients already prefer AuI with 3D-TBP for melanoma recognition, dermatologists continue to rely largely on their own decision-making despite high confidence in AI-results.

TRIAL REGISTRATION

ClinicalTrials.gov (NCT04605822).

Crawford ME, Kamali K, Dorey RA, MacIntyre OC, Cleminson K, MacGillivary ML, Green PJ, Langley RG, Purdy KS, DeCoste RC, Gruchy JR, Pasternak S, Oakley A, Hull PR. Using Artificial Intelligence as a Melanoma Screening Tool in Self-Referred Patients. J Cutan Med Surg 2024;28:37-43. [PMID: 38156628 PMCID: PMC10908200 DOI: 10.1177/12034754231216967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/03/2024]

Zhu AQ, Wang Q, Shi YL, Ren WW, Cao X, Ren TT, Wang J, Zhang YQ, Sun YK, Chen XW, Lai YX, Ni N, Chen YC, Hu JL, Mou LC, Zhao YJ, Liu YQ, Sun LP, Zhu XX, Xu HX, Guo LH. A deep learning fusion network trained with clinical and high-frequency ultrasound images in the multi-classification of skin diseases in comparison with dermatologists: a prospective and multicenter study. EClinicalMedicine 2024;67:102391. [PMID: 38274117 PMCID: PMC10808933 DOI: 10.1016/j.eclinm.2023.102391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2023] [Revised: 12/07/2023] [Accepted: 12/07/2023] [Indexed: 01/27/2024] Open

Abstract

Background

Clinical appearance and high-frequency ultrasound (HFUS) are indispensable for diagnosing skin diseases by providing internal and external information. However, their complex combination brings challenges for primary care physicians and dermatologists. Thus, we developed a deep multimodal fusion network (DMFN) model combining analysis of clinical close-up and HFUS images for binary and multiclass classification in skin diseases.

Methods

Between Jan 10, 2017, and Dec 31, 2020, the DMFN model was trained and validated using 1269 close-ups and 11,852 HFUS images from 1351 skin lesions. The monomodal convolutional neural network (CNN) model was trained and validated with the same close-up images for comparison. Subsequently, we did a prospective and multicenter study in China. Both CNN models were tested prospectively on 422 cases from 4 hospitals and compared with the results from human raters (general practitioners, general dermatologists, and dermatologists specialized in HFUS). The performance of binary classification (benign vs. malignant) and multiclass classification (the specific diagnoses of 17 types of skin diseases) measured by the area under the receiver operating characteristic curve (AUC) were evaluated. This study is registered with www.chictr.org.cn (ChiCTR2300074765).

Findings

The performance of the DMFN model (AUC, 0.876) was superior to that of the monomodal CNN model (AUC, 0.697) in the binary classification (P = 0.0063), which was also better than that of the general practitioner (AUC, 0.651, P = 0.0025) and general dermatologists (AUC, 0.838; P = 0.0038). By integrating close-up and HFUS images, the DMFN model attained an almost identical performance in comparison to dermatologists (AUC, 0.876 vs. AUC, 0.891; P = 0.0080). For the multiclass classification, the DMFN model (AUC, 0.707) exhibited superior prediction performance compared with general dermatologists (AUC, 0.514; P = 0.0043) and dermatologists specialized in HFUS (AUC, 0.640; P = 0.0083), respectively. Compared to dermatologists specialized in HFUS, the DMFN model showed better or comparable performance in diagnosing 9 of the 17 skin diseases.

Interpretation

The DMFN model combining analysis of clinical close-up and HFUS images exhibited satisfactory performance in the binary and multiclass classification compared with the dermatologists. It may be a valuable tool for general dermatologists and primary care providers.
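
The area under the receiver operating characteristic curve (AUC) used as the performance measure above can be read as the probability that a randomly chosen malignant case receives a higher score than a randomly chosen benign case. A minimal sketch of that pairwise (Mann-Whitney) formulation follows; the function name and the example scores are assumptions for illustration and do not reproduce the study's evaluation code.

```python
def pairwise_auc(malignant_scores, benign_scores):
    """AUC as the fraction of (malignant, benign) pairs ranked correctly.

    Ties count as half a correct ranking.
    """
    wins = 0.0
    for m in malignant_scores:
        for b in benign_scores:
            if m > b:
                wins += 1.0
            elif m == b:
                wins += 0.5
    return wins / (len(malignant_scores) * len(benign_scores))


# Hypothetical model scores (higher means more suspicious).
malignant = [0.9, 0.8, 0.4]
benign = [0.5, 0.3]
print(f"AUC = {pairwise_auc(malignant, benign):.3f}")
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which gives a scale for reading the values reported in the Findings.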

Funding

This work was supported in part by the National Natural Science Foundation of China and the Clinical research project of Shanghai Skin Disease Hospital.

Affiliation(s)

An-Qi Zhu Department of Medical Ultrasound, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China; Department of Medical Ultrasound, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, China; Department of Ultrasound, Zhongshan Hospital, Institute of Ultrasound in Medicine and Engineering, Fudan University, Shanghai, China
Qiao Wang Department of Medical Ultrasound, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China; Department of Medical Ultrasound, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, China; Shanghai Engineering Research Center of Ultrasound Diagnosis and Treatment, Shanghai, China
Yi-Lei Shi MedAI Technology (Wuxi) Co., Ltd., Wuxi, China
Wei-Wei Ren Department of Medical Ultrasound, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China; Department of Medical Ultrasound, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, China; Shanghai Engineering Research Center of Ultrasound Diagnosis and Treatment, Shanghai, China
Xu Cao MedAI Technology (Wuxi) Co., Ltd., Wuxi, China
Tian-Tian Ren Department of Medical Ultrasound, Ma'anshan People's Hospital, Ma'anshan, China
Jing Wang Department of Ultrasound, Jiading District Central Hospital Affiliated Shanghai University of Medicine & Health Sciences, Shanghai, China
Ya-Qin Zhang Department of Ultrasound, Zhongshan Hospital, Institute of Ultrasound in Medicine and Engineering, Fudan University, Shanghai, China
Yi-Kang Sun Department of Ultrasound, Zhongshan Hospital, Institute of Ultrasound in Medicine and Engineering, Fudan University, Shanghai, China
Xue-Wen Chen Department of Dermatological Surgery, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China
Yong-Xian Lai Department of Dermatological Surgery, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China
Na Ni Department of Dermatological Surgery, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China
Yu-Chong Chen Department of Dermatological Surgery, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China
Jing-Liang Hu MedAI Technology (Wuxi) Co., Ltd., Wuxi, China
Li-Chao Mou MedAI Technology (Wuxi) Co., Ltd., Wuxi, China
Yu-Jing Zhao Department of Medical Ultrasound, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China
Ye-Qiang Liu Department of Pathology, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China
Li-Ping Sun Department of Medical Ultrasound, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, China; Shanghai Engineering Research Center of Ultrasound Diagnosis and Treatment, Shanghai, China
Xiao-Xiang Zhu Chair of Data Science in Earth Observation, Technical University of Munich, Munich, Germany
Hui-Xiong Xu Department of Ultrasound, Zhongshan Hospital, Institute of Ultrasound in Medicine and Engineering, Fudan University, Shanghai, China
Le-Hang Guo Department of Medical Ultrasound, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China; Department of Medical Ultrasound, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, China; Shanghai Engineering Research Center of Ultrasound Diagnosis and Treatment, Shanghai, China
China Alliance of Multi-Center Clinical Study for Ultrasound (Ultra-Chance) Department of Medical Ultrasound, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China; Department of Medical Ultrasound, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, China; Shanghai Engineering Research Center of Ultrasound Diagnosis and Treatment, Shanghai, China; MedAI Technology (Wuxi) Co., Ltd., Wuxi, China; Department of Medical Ultrasound, Ma'anshan People's Hospital, Ma'anshan, China; Department of Ultrasound, Jiading District Central Hospital Affiliated Shanghai University of Medicine & Health Sciences, Shanghai, China; Department of Ultrasound, Zhongshan Hospital, Institute of Ultrasound in Medicine and Engineering, Fudan University, Shanghai, China; Department of Dermatological Surgery, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China; Department of Pathology, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China; Chair of Data Science in Earth Observation, Technical University of Munich, Munich, Germany

Thomas L, Hyde C, Mullarkey D, Greenhalgh J, Kalsi D, Ko J. Real-world post-deployment performance of a novel machine learning-based digital health technology for skin lesion assessment and suggestions for post-market surveillance. Front Med (Lausanne) 2023;10:1264846. [PMID: 38020164 PMCID: PMC10645139 DOI: 10.3389/fmed.2023.1264846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2023] [Accepted: 10/10/2023] [Indexed: 12/01/2023] Open

Luo N, Zhong X, Su L, Cheng Z, Ma W, Hao P. Artificial intelligence-assisted dermatology diagnosis: From unimodal to multimodal. Comput Biol Med 2023;165:107413. [PMID: 37703714 DOI: 10.1016/j.compbiomed.2023.107413] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 07/12/2023] [Revised: 08/02/2023] [Accepted: 08/28/2023] [Indexed: 09/15/2023]

Xue P, Si M, Qin D, Wei B, Seery S, Ye Z, Chen M, Wang S, Song C, Zhang B, Ding M, Zhang W, Bai A, Yan H, Dang L, Zhao Y, Rezhake R, Zhang S, Qiao Y, Qu Y, Jiang Y. Unassisted Clinicians Versus Deep Learning-Assisted Clinicians in Image-Based Cancer Diagnostics: Systematic Review With Meta-analysis. J Med Internet Res 2023;25:e43832. [PMID: 36862499 PMCID: PMC10020907 DOI: 10.2196/43832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/26/2022] [Revised: 01/19/2023] [Accepted: 02/13/2023] [Indexed: 02/16/2023] Open

Abstract

BACKGROUND

A number of publications have demonstrated that deep learning (DL) algorithms matched or outperformed clinicians in image-based cancer diagnostics, but these algorithms are frequently considered as opponents rather than partners. Despite the clinicians-in-the-loop DL approach having great potential, no study has systematically quantified the diagnostic accuracy of clinicians with and without the assistance of DL in image-based cancer identification.

OBJECTIVE

We systematically quantified the diagnostic accuracy of clinicians with and without the assistance of DL in image-based cancer identification.

METHODS

PubMed, Embase, IEEEXplore, and the Cochrane Library were searched for studies published between January 1, 2012, and December 7, 2021. Any type of study design was permitted that focused on comparing unassisted clinicians and DL-assisted clinicians in cancer identification using medical imaging. Studies using medical waveform-data graphics material and those investigating image segmentation rather than classification were excluded. Studies providing binary diagnostic accuracy data and contingency tables were included for further meta-analysis. Two subgroups were defined and analyzed, including cancer type and imaging modality.

RESULTS

In total, 9796 studies were identified, of which 48 were deemed eligible for systematic review. Twenty-five of these studies made comparisons between unassisted clinicians and DL-assisted clinicians and provided sufficient data for statistical synthesis. We found a pooled sensitivity of 83% (95% CI 80%-86%) for unassisted clinicians and 88% (95% CI 86%-90%) for DL-assisted clinicians. Pooled specificity was 86% (95% CI 83%-88%) for unassisted clinicians and 88% (95% CI 85%-90%) for DL-assisted clinicians. The pooled sensitivity and specificity values for DL-assisted clinicians were higher than for unassisted clinicians, at ratios of 1.07 (95% CI 1.05-1.09) and 1.03 (95% CI 1.02-1.05), respectively. Similar diagnostic performance by DL-assisted clinicians was also observed across the predefined subgroups.

CONCLUSIONS

The diagnostic performance of DL-assisted clinicians appears better than unassisted clinicians in image-based cancer identification. However, caution should be exercised, because the evidence provided in the reviewed studies does not cover all the minutiae involved in real-world clinical practice. Combining qualitative insights from clinical practice with data-science approaches may improve DL-assisted practice, although further research is required.

TRIAL REGISTRATION

PROSPERO CRD42021281372; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=281372.

Affiliation(s)

Peng Xue Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Mingyu Si Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Dongxu Qin Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Bingrui Wei Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Samuel Seery Faculty of Health and Medicine, Division of Health Research, Lancaster University, Lancaster, United Kingdom
Zichen Ye Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Mingyang Chen Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Sumeng Wang Department of Cancer Epidemiology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Cheng Song Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Bo Zhang Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Ming Ding Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Wenling Zhang Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Anying Bai Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Huijiao Yan Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Le Dang Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Yuqian Zhao Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science & Technology of China, Sichuan, China
Remila Rezhake Affiliated Cancer Hospital, The 3rd Affiliated Teaching Hospital of Xinjiang Medical University, Xinjiang, China
Shaokai Zhang Henan Cancer Hospital, Affiliated Cancer Hospital of Zhengzhou University, Henan, China
Youlin Qiao Center for Global Health, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Yimin Qu Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Yu Jiang Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

Li Z, Koban KC, Schenck TL, Giunta RE, Li Q, Sun Y. Artificial Intelligence in Dermatology Image Analysis: Current Developments and Future Trends. J Clin Med 2022;11:jcm11226826. [PMID: 36431301 PMCID: PMC9693628 DOI: 10.3390/jcm11226826] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 10/24/2022] [Accepted: 10/28/2022] [Indexed: 11/22/2022] Open

Distinguish the Value of the Benign Nevus and Melanomas Using Machine Learning: A Meta-Analysis and Systematic Review. Mediators Inflamm 2022;2022:1734327. [PMID: 36274972 PMCID: PMC9586788 DOI: 10.1155/2022/1734327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 09/22/2022] [Accepted: 10/01/2022] [Indexed: 11/26/2022] Open

Rasheed A, Umar AI, Shirazi SH, Khan Z, Nawaz S, Shahzad M. Automatic eczema classification in clinical images based on hybrid deep neural network. Comput Biol Med 2022;147:105807. [PMID: 35809409 DOI: 10.1016/j.compbiomed.2022.105807] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 05/09/2022] [Accepted: 05/13/2022] [Indexed: 11/24/2022]

Yu Z, Nguyen J, Nguyen TD, Kelly J, Mclean C, Bonnington P, Zhang L, Mar V, Ge Z. Early Melanoma Diagnosis With Sequential Dermoscopic Images. IEEE TRANSACTIONS ON MEDICAL IMAGING 2022;41:633-646. [PMID: 34648437 DOI: 10.1109/tmi.2021.3120091] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Smartphone-Based Visual Inspection with Acetic Acid: An Innovative Tool to Improve Cervical Cancer Screening in Low-Resource Setting. Healthcare (Basel) 2022;10:healthcare10020391. [PMID: 35207002 PMCID: PMC8871553 DOI: 10.3390/healthcare10020391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 02/06/2022] [Accepted: 02/11/2022] [Indexed: 11/17/2022] Open

Sies K, Winkler JK, Fink C, Bardehle F, Toberer F, Buhl T, Enk A, Blum A, Stolz W, Rosenberger A, Haenssle HA. Does sex matter? Analysis of sex-related differences in the diagnostic performance of a market-approved convolutional neural network for skin cancer detection. Eur J Cancer 2022;164:88-94. [PMID: 35182926 DOI: 10.1016/j.ejca.2021.12.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 12/17/2021] [Accepted: 12/29/2021] [Indexed: 11/03/2022]

Abstract

BACKGROUND

Advances in biomedical artificial intelligence may introduce or perpetuate sex and gender discrimination. Convolutional neural networks (CNN) have demonstrated dermatologist-level performance in image classification tasks but have not been assessed for sex and gender biases that may affect training data and diagnostic performance. In this study, we investigated sex-related imbalances in training data and diagnostic performance of a market-approved CNN for skin cancer classification (Moleanalyzer Pro®, Fotofinder Systems GmbH, Bad Birnbach, Germany).

METHODS

We screened open-access dermoscopic image repositories widely used for CNN training for distribution of sex. Moreover, the sex-related diagnostic performance of the market-approved CNN was tested in 1549 dermoscopic images stratified by sex (female n = 773; male n = 776).

RESULTS

Most open-access repositories showed a marked under-representation of images originating from female (40%) versus male (60%) patients. Despite these imbalances and well-known sex-related differences in skin anatomy or skin-directed behaviour, the tested CNN achieved a comparable sensitivity of 87.0% [80.9%-91.3%] versus 87.1% [81.1%-91.4%], specificity of 98.7% [97.4%-99.3%] versus 96.9% [95.2%-98.0%] and ROC-AUC of 0.984 [0.975-0.993] versus 0.979 [0.969-0.988] in dermoscopic images of female versus male origin, respectively. In the sample at hand, sex-related differences in ROC-AUCs were not statistically significant in the per-image analysis nor in an additional per-individual analysis (p ≥ 0.59).

CONCLUSION

Design and training of artificial intelligence algorithms for medical applications should generally acknowledge sex and gender dimensions. Despite sex-related imbalances in open-access training data, the diagnostic performance of the tested CNN showed no sex-related bias in the classification of skin lesions.

Winkler JK, Tschandl P, Toberer F, Sies K, Fink C, Enk A, Kittler H, Haenssle HA. Monitoring patients at risk for melanoma: May convolutional neural networks replace the strategy of sequential digital dermoscopy? Eur J Cancer 2021;160:180-188. [PMID: 34840028 DOI: 10.1016/j.ejca.2021.10.030] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 10/06/2021] [Accepted: 10/25/2021] [Indexed: 01/11/2023]

Weber P, Sinz C, Rinner C, Kittler H, Tschandl P. Perilesional sun damage as a diagnostic clue for pigmented actinic keratosis and Bowen's disease. J Eur Acad Dermatol Venereol 2021;35:2022-2026. [PMID: 34146354 PMCID: PMC8518404 DOI: 10.1111/jdv.17464] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 06/10/2021] [Indexed: 11/29/2022]

Sies K, Winkler JK, Fink C, Bardehle F, Toberer F, Kommoss FKF, Buhl T, Enk A, Rosenberger A, Haenssle HA. Auswirkungen des „dunklen Rand‐Artefakts“ in dermatoskopischen Bildern auf die diagnostische Leistungsfähigkeit eines deep learning neuronalen Netzwerkes mit Marktzulassung. J Dtsch Dermatol Ges 2021;19:842-851. [PMID: 34139087 DOI: 10.1111/ddg.14384_g] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 11/27/2020] [Indexed: 02/06/2023]

Abstract

BACKGROUND AND OBJECTIVES

Artificial intelligence systems (deep learning convolutional neural networks, CNN) now achieve results comparable to those of dermatologists in the classification of skin lesions. However, the limitations of such systems must be known before widespread clinical use. We therefore investigated the influence of the dark corner artefact (DCA) in dermoscopic images on the diagnostic performance of a market-approved CNN for the classification of skin lesions.

PATIENTS AND METHODS

A dataset of 233 images of skin lesions (60 malignant and 173 benign) without DCA (control) was digitally modified to show small, medium or large DCA. All 932 images were then analysed for malignancy scores by a market-approved CNN (Moleanalyzer-Pro®, FotoFinder Systems). Scores ranged from 0 to 1; a score > 0.5 was classified as malignant.

RESULTS

In the control series without DCA, the CNN achieved a sensitivity of 90.0% (79.9%-95.3%), a specificity of 96.5% (92.6%-98.4%) and an area under the curve (AUC) of the receiver operating characteristic (ROC) of 0.961 (0.932-0.989). Diagnostic performance was comparable in the datasets with small or medium DCA. In the image sets with large DCA, however, malignancy scores were significantly higher. This led to a significantly reduced specificity (87.9% [82.2%-91.9%], P < 0.001) and a non-significantly increased sensitivity (96.7% [88.6%-99.1%]). The ROC-AUC remained unchanged at 0.962 (0.935-0.989).

CONCLUSIONS

Classification by the CNN was not impaired in dermoscopic images with small or medium DCA, but the system showed weaknesses with large DCA. Physicians who submit such images for CNN classification should be aware of these limitations of the technology.
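The scoring rule in the methods above (malignancy scores from 0 to 1, with a score > 0.5 classified as malignant) can be illustrated with a minimal Python sketch. The scores and labels below are invented toy values, not study data, and the function names are hypothetical; the sketch only shows how a fixed cut-off turns scores into binary calls and how sensitivity and specificity follow from them.

```python
def classify(scores, threshold=0.5):
    """Return True (malignant) for every score strictly above the threshold."""
    return [score > threshold for score in scores]


def sensitivity_specificity(predicted, truth):
    """Compute (sensitivity, specificity) from paired boolean lists."""
    tp = sum(p and t for p, t in zip(predicted, truth))
    fn = sum((not p) and t for p, t in zip(predicted, truth))
    tn = sum((not p) and (not t) for p, t in zip(predicted, truth))
    fp = sum(p and (not t) for p, t in zip(predicted, truth))
    return tp / (tp + fn), tn / (tn + fp)


# Toy example (hypothetical values): three malignant and five benign lesions.
scores = [0.92, 0.61, 0.48, 0.07, 0.55, 0.12, 0.03, 0.30]
truth = [True, True, True, False, False, False, False, False]

predicted = classify(scores)
sens, spec = sensitivity_specificity(predicted, truth)
print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
```

In this toy set, one benign lesion scoring just above the cut-off lowers specificity without affecting sensitivity, which is the kind of shift the abstract reports for images with large DCA.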

Sies K, Winkler JK, Fink C, Bardehle F, Toberer F, Kommoss FKF, Buhl T, Enk A, Rosenberger A, Haenssle HA. Dark corner artefact and diagnostic performance of a market-approved neural network for skin cancer classification. J Dtsch Dermatol Ges 2021;19:842-850. [PMID: 33973372 DOI: 10.1111/ddg.14384] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 11/27/2020] [Indexed: 01/10/2023]

Felmingham CM, Adler NR, Ge Z, Morton RL, Janda M, Mar VJ. The Importance of Incorporating Human Factors in the Design and Implementation of Artificial Intelligence for Skin Cancer Diagnosis in the Real World. Am J Clin Dermatol 2021;22:233-242. [PMID: 33354741 DOI: 10.1007/s40257-020-00574-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Winkler JK, Sies K, Fink C, Toberer F, Enk A, Abassi MS, Fuchs T, Haenssle HA. Association between different scale bars in dermoscopic images and diagnostic performance of a market-approved deep learning convolutional neural network for melanoma recognition. Eur J Cancer 2021;145:146-154. [PMID: 33465706 DOI: 10.1016/j.ejca.2020.12.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Revised: 12/01/2020] [Accepted: 12/03/2020] [Indexed: 12/22/2022]

Abstract

BACKGROUND

Studies systematically unravelling possible causes for false diagnoses of deep learning convolutional neural networks (CNNs) are scarce, yet needed before broader application.

OBJECTIVES

The objective of the study was to investigate whether scale bars in dermoscopic images are associated with the diagnostic accuracy of a market-approved CNN.

METHODS

This cross-sectional analysis applied a CNN trained with more than 150,000 images (Moleanalyzer-pro®, FotoFinder Systems Inc., Bad Birnbach, Germany) to investigate seven dermoscopic image sets depicting the same 130 melanocytic lesions (107 nevi, 23 melanomas) without or with digitally superimposed scale bars of different manufacturers. Sensitivity, specificity and area under the curve (AUC) of receiver operating characteristics (ROC) for the CNN's binary classification of images with or without superimposed scale bars were assessed.

RESULTS

Six dermoscopic image sets with different scale bars and one control set without scale bars (overall 910 images) were submitted to CNN analysis. In images without scale bars, the CNN attained a sensitivity [95% confidence interval] of 87.0% [67.9%-95.5%] and a specificity of 87.9% [80.3%-92.8%]. ROC AUC was 0.953 [0.914-0.992]. Scale bars were not associated with significant changes in sensitivity (range 87%-95.7%, all p ≥ 1.0). However, four scale bars induced a decrease of the CNN's specificity (range 0%-43.9%, all p < 0.001). Moreover, ROC AUC was significantly reduced by two scale bars (range 0.520-0.848, both p ≤ 0.042).
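The abstract does not say how the 95% confidence intervals were calculated. As a hedged illustration, the Python sketch below applies the Wilson score interval, one common method for binomial proportions, to counts back-calculated from the figures above: 20 of 23 melanomas detected (87.0%) and 94 of 107 nevi correctly classified (87.9%). Both the counts and the choice of the Wilson method are assumptions. The resulting bounds come out close to the reported ones, which is consistent with this method but does not confirm it.

```python
from math import sqrt


def wilson_interval(successes, total, z=1.959964):
    """Two-sided Wilson score interval for a binomial proportion.

    z defaults to the standard normal quantile for a 95% interval.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z**2 / total
    centre = (p + z**2 / (2 * total)) / denom
    half = z * sqrt(p * (1 - p) / total + z**2 / (4 * total**2)) / denom
    return centre - half, centre + half


# Assumed counts, back-calculated from the percentages in the abstract.
sens_lo, sens_hi = wilson_interval(20, 23)    # sensitivity, 23 melanomas
spec_lo, spec_hi = wilson_interval(94, 107)   # specificity, 107 nevi

print(f"sensitivity 95% CI: {sens_lo:.1%} to {sens_hi:.1%}")
print(f"specificity 95% CI: {spec_lo:.1%} to {spec_hi:.1%}")
```

The sensitivity interval is much wider than the specificity interval because it rests on only 23 melanomas, which is worth keeping in mind when comparing sensitivities across image sets.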

CONCLUSIONS

Superimposed scale bars in dermoscopic images may impair the CNN's diagnostic accuracy, mostly by increasing the rate of the false-positive diagnoses. We recommend avoiding scale bars in images intended for CNN analysis unless specific measures counteracting effects are implemented.

CLINICAL TRIAL NUMBER

This study was registered at the German Clinical Trial Register (DRKS-Study-ID: DRKS00013570; URL: https://www.drks.de/drks_web/).

Haenssle HA, Winkler JK, Fink C, Toberer F, Enk A, Stolz W, Deinlein T, Hofmann-Wellenhof R, Kittler H, Tschandl P, Rosendahl C, Lallas A, Blum A, Abassi MS, Thomas L, Tromme I, Rosenberger A. Skin lesions of face and scalp - Classification by a market-approved convolutional neural network in comparison with 64 dermatologists. Eur J Cancer 2020;144:192-199. [PMID: 33370644 DOI: 10.1016/j.ejca.2020.11.034] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 11/02/2020] [Accepted: 11/22/2020] [Indexed: 02/06/2023]

Abstract

BACKGROUND

The clinical differentiation of face and scalp lesions (FSLs) is challenging even for trained dermatologists. Studies comparing the diagnostic performance of a convolutional neural network (CNN) with dermatologists in FSL are lacking.

METHODS

A market-approved CNN (Moleanalyzer-Pro, FotoFinder Systems) was used for binary classifications of 100 dermoscopic images of FSL. The same lesions were used in a two-level reader study including 64 dermatologists (level I: dermoscopy only; level II: dermoscopy, clinical close-up images, textual information). Primary endpoints were the CNN's sensitivity and specificity in comparison with the dermatologists' management decisions in level II. Generalizability of the CNN results was tested by using four additional external data sets.

RESULTS

The CNN's sensitivity, specificity and ROC AUC were 96.2% [87.0%-98.9%], 68.8% [54.7%-80.1%] and 0.929 [0.880-0.978], respectively. In level II, the dermatologists' management decisions showed a mean sensitivity of 84.2% [82.2%-86.2%] and specificity of 69.4% [66.0%-72.8%]. When fixing the CNN's specificity at the dermatologists' mean specificity (69.4%), the CNN's sensitivity (96.2% [87.0%-98.9%]) was significantly higher than that of dermatologists (84.2% [82.2%-86.2%]; p < 0.001). Dermatologists of all training levels were outperformed by the CNN (all p < 0.001). In confirmation, the CNN's accuracy (83.0%) was significantly higher than dermatologists' accuracies in level II management decisions (all p < 0.001). The CNN's performance was largely confirmed in three additional external data sets but particularly showed a reduced specificity in one Australian data set including FSL on severely sun-damaged skin.

CONCLUSIONS

When applied as an assistant system, the CNN's higher sensitivity at an equivalent specificity may result in an improved early detection of face and scalp skin cancers.

Stojkovic-Filipovic J, Tiodorovic D, Lallas A, Akay BN, Longo C, Rosendahl C, Dobrosavljevic D, Nazzaro G, Argenziano G, Zalaudek I, Tromme I, Tschandl P, Puig S, Lanssens S, Kittler H. Dermatoscopy of combined blue nevi: a multicentre study of the International Dermoscopy Society. J Eur Acad Dermatol Venereol 2020;35:900-905. [PMID: 33274487 DOI: 10.1111/jdv.17059] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Accepted: 11/13/2020] [Indexed: 01/31/2023]

Abstract

BACKGROUND

Combined blue nevi (CBN) may mimic melanoma and are relatively often biopsied for diagnostic reasons.

OBJECTIVE

To better characterize CBN and to compare it with melanoma.

METHODS

We collected clinical and dermatoscopic images of 111 histologically confirmed CBN and contrasted their dermatoscopic characteristics with 132 partly blue coloured melanomas. Furthermore, we compared the accuracy of human experts using pattern analysis with a computer algorithm based on deep learning.

RESULTS

Combined blue nevi are usually flat or slightly elevated and, in comparison with melanoma, more frequent on the head and neck. Dermatoscopically, they are typified by a blue structureless part in combination with either brown clods (n = 52, 46.8%), lines (n = 28, 25.2%) or skin-coloured or brown structureless areas (n = 31, 27.9%). In contrast with melanoma, the blue part of CBN is more often well defined (18.9% vs. 4.5%, P < 0.001) and more often located in the centre (22.5% vs. 5.3%, P < 0.001). Melanomas are more often chaotic (OR: 28.7, 95% CI: 14.8-55.7, P < 0.001) and more often have at least one melanoma clue (OR: 10.8, 95% CI: 5.2-22.2, P < 0.001), in particular white lines (OR: 37.1, 95% CI: 13.4-102.9, P < 0.001). Using simplified pattern analysis (chaos and clues), two raters reached sensitivities of 93.9% (95% CI: 88.4-97.3%) and 92.4% (95% CI: 86.5-96.3%) at corresponding specificities of 59.5% (95% CI: 49.7-68.7%) and 65.8% (95% CI: 56.2-74.5%). The human accuracy with pattern analysis was on par with a state-of-the-art computer algorithm based on deep learning that achieved an area under the curve of 0.92 (95% CI: 0.87-0.96) and a specificity of 85.3% (95% CI: 76.5-91.7%) at a given sensitivity of 83.6% (95% CI: 72.5-91.5%).

CONCLUSION

CBN usually lack melanoma clues, in particular white lines. The accuracy of pattern analysis for combined nevi is acceptable, and histopathologic confirmation may not be necessary in exemplary cases.

Ilan Y. Second-Generation Digital Health Platforms: Placing the Patient at the Center and Focusing on Clinical Outcomes. Front Digit Health 2020;2:569178. [PMID: 34713042 PMCID: PMC8521820 DOI: 10.3389/fdgth.2020.569178] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 10/02/2020] [Indexed: 12/13/2022] Open

Tognetti L, Bonechi S, Andreini P, Bianchini M, Scarselli F, Cevenini G, Moscarella E, Farnetani F, Longo C, Lallas A, Carrera C, Puig S, Tiodorovic D, Perrot JL, Pellacani G, Argenziano G, Cinotti E, Cataldo G, Balistreri A, Mecocci A, Gori M, Rubegni P, Cartocci A. A new deep learning approach integrated with clinical data for the dermoscopic differentiation of early melanomas from atypical nevi. J Dermatol Sci 2020;101:115-122. [PMID: 33358096 DOI: 10.1016/j.jdermsci.2020.11.009] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 11/23/2020] [Accepted: 11/30/2020] [Indexed: 12/13/2022]

Abstract

BACKGROUND

Timely recognition of malignant melanoma (MM) is challenging for dermatologists worldwide and represents the main determinant for mortality. Dermoscopic examination is influenced by dermatologists' experience and fails to achieve adequate accuracy and reproducibility in discriminating atypical nevi (AN) from early melanomas (EM).

OBJECTIVE

We aimed to develop a Deep Convolutional Neural Network (DCNN) model able to support dermatologists in the classification and management of atypical melanocytic skin lesions (aMSL).

METHODS

A training set (630 images), a validation set (135) and a testing set (214) were derived from the idScore dataset of 979 challenging aMSL cases in which the dermoscopic image is integrated with clinical data (age, sex, body site and diameter) and associated with histological data. A DCNN_aMSL architecture was designed and then trained on both dermoscopic images of aMSL and the clinical/anamnestic data, resulting in the integrated "iDCNN_aMSL" model. Responses of 111 dermatologists with different experience levels on both aMSL classification (intuitive diagnosis) and management decisions (no/long follow-up; short follow-up; excision/preventive excision) were compared with the DCNNs models.

RESULTS

In the lesion classification study, the iDCNN_aMSL achieved the best accuracy, reaching an AUC = 90.3 %, SE = 86.5 % and SP = 73.6 %, compared to DCNN_aMSL (SE = 89.2 %, SP = 65.7 %) and intuitive diagnosis of dermatologists (SE = 77.0 %; SP = 61.4 %).

CONCLUSIONS

The iDCNN_aMSL proved to be the best support tool for management decisions reducing the ratio of inappropriate excision. The proposed iDCNN_aMSL model can represent a valid support for dermatologists in discriminating AN from EM with high accuracy and for medical decision making by reducing their rates of inappropriate excisions.

Affiliation(s)

Linda Tognetti Dermatology Unit, Department of Medical, Surgical and Neurosciences, University of Siena, Italy
Simone Bonechi Department of Information Engineering and Mathematics, University of Siena, Siena, Italy; Department of Economy Engineering Society and Business, Tuscia University, Viterbo, Italy
Paolo Andreini Department of Information Engineering and Mathematics, University of Siena, Siena, Italy
Monica Bianchini Department of Information Engineering and Mathematics, University of Siena, Siena, Italy
Franco Scarselli Department of Information Engineering and Mathematics, University of Siena, Siena, Italy
Gabriele Cevenini Bioengineering Unit, Department of Medical Biotechnology, University of Siena, Italy
Elvira Moscarella Dermatology Unit, University of Campania Luigi Vanvitelli, Naples, Italy
Francesca Farnetani Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy
Caterina Longo Centro Oncologico ad Alta Tecnologia Diagnostica, Azienda Unità Sanitaria Locale, IRCCS di Reggio Emilia, Reggio Emilia, Italy
Aimilios Lallas First Department of Dermatology, Aristotle University, Thessaloniki, Greece
Cristina Carrera Melanoma Unit, Department of Dermatology, University of Barcelona, Barcelona, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III, University of Barcelona, Barcelona, Spain
Susana Puig Melanoma Unit, Department of Dermatology, University of Barcelona, Barcelona, Spain
Danica Tiodorovic Dermatology Clinic, Medical Faculty, Nis University, Nis, Serbia
Jean Luc Perrot Dermatology Unit, University Hospital of St-Etienne, Saint Etienne, France
Giovanni Pellacani Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy
Giuseppe Argenziano Dermatology Unit, University of Campania Luigi Vanvitelli, Naples, Italy
Elisa Cinotti Dermatology Unit, Department of Medical, Surgical and Neurosciences, University of Siena, Italy
Gennaro Cataldo Bioengineering Unit, Department of Medical Biotechnology, University of Siena, Italy
Alberto Balistreri Bioengineering Unit, Department of Medical Biotechnology, University of Siena, Italy
Alessandro Mecocci Department of Information Engineering and Mathematics, University of Siena, Siena, Italy
Marco Gori Department of Information Engineering and Mathematics, University of Siena, Siena, Italy
Pietro Rubegni Dermatology Unit, Department of Medical, Surgical and Neurosciences, University of Siena, Italy
Alessandra Cartocci Dermatology Unit, Department of Medical, Surgical and Neurosciences, University of Siena, Italy; Bioengineering Unit, Department of Medical Biotechnology, University of Siena, Italy

Blum A, Bosch S, Haenssle HA, Fink C, Hofmann-Wellenhof R, Zalaudek I, Kittler H, Tschandl P. [Artificial intelligence and smartphone program applications (Apps) : Relevance for dermatological practice]. Hautarzt 2020;71:691-698. [PMID: 32720165 DOI: 10.1007/s00105-020-04658-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Heppt M, Berking C. The value of convolutional neural networks in the diagnosis of melanoma simulators. J Eur Acad Dermatol Venereol 2020;34:1134-1135. [DOI: 10.1111/jdv.16577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 04/29/2020] [Indexed: 11/29/2022]