1
|
Carnabatu CJ, Fetzer DT, Tessnow A, Holt S, Sant VR. Avoidable biopsies? Validating artificial intelligence-based decision support software in indeterminate thyroid nodules. Surgery 2025; 177:108829. [PMID: 39396888 DOI: 10.1016/j.surg.2024.07.074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 06/23/2024] [Accepted: 07/16/2024] [Indexed: 10/15/2024]
Abstract
BACKGROUND Multiple artificial intelligence (AI) systems have been approved to risk-stratify thyroid nodules through sonographic characterization. We sought to validate the ability of one such AI system, Koios DS (Koios Medical, Chicago, IL), to aid in improving risk stratification of indeterminate thyroid nodules. METHODS A retrospective single-institution dataset was compiled of 28 cytologically indeterminate thyroid nodules having undergone molecular testing and surgical resection, with surgical pathology categorized as malignant or benign. Nodules were retrospectively evaluated with Koios DS. After nodule selection, automated and AI-adapter-derived Thyroid Imaging Reporting and Data System (TI-RADS) levels were recorded, and agreement with radiologist-derived levels was assessed using Cohen's κ statistic. The performance of malignancy classification was compared between the radiologist and AI-adapter. Biopsy thresholds were re-evaluated using the AI-adapter. RESULTS In this cohort, 7 (25%) nodules were malignant on surgical pathology. The median nodule size was 2.4 cm (interquartile range: 1.8-2.9 cm). Median radiologist and automated TI-RADS levels were both 4, with κ 0.25 ("fair agreement"). Malignancy classification by the radiologist provided sensitivity 100%, specificity 33.3%, positive predictive value (PPV) 33.3%, and negative predictive value (NPV) 100%, compared with the AI-adapter's performance with sensitivity 85.7%, specificity 76.2%, PPV 54.5%, and NPV 94.1%. Using the AI-adapter, 14 of 28 biopsies would have been deferred, 13 of which were surgically benign. CONCLUSION Koios automated and radiologist-derived TI-RADS levels were in consistent agreement for indeterminate thyroid nodules. Malignancy reclassification with the AI-adapter improved PPV at minimal cost to NPV. Risk stratification with the addition of the AI-adapter may allow for more accurate patient counseling and the avoidance of biopsies in select cases that would otherwise be cytologically indeterminate.
Collapse
Affiliation(s)
- Christopher J Carnabatu
- Division of Endocrine Surgery, UT Southwestern Medical Center, Dallas, TX. https://twitter.com/CarnabatuMD
| | - David T Fetzer
- Department of Radiology, UT Southwestern Medical Center, Dallas, TX. https://twitter.com/DTFetzer
| | - Alexander Tessnow
- Division of Endocrinology and Metabolism, UT Southwestern Medical Center, Dallas, TX. https://twitter.com/AlexTessnow
| | - Shelby Holt
- Division of Endocrine Surgery, UT Southwestern Medical Center, Dallas, TX
| | - Vivek R Sant
- Division of Endocrine Surgery, UT Southwestern Medical Center, Dallas, TX.
| |
Collapse
|
2
|
McMahon GT. The Risks and Challenges of Artificial Intelligence in Endocrinology. J Clin Endocrinol Metab 2024; 109:e1468-e1471. [PMID: 38471009 DOI: 10.1210/clinem/dgae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Indexed: 03/14/2024]
Abstract
Artificial intelligence (AI) holds the promise of addressing many of the numerous challenges healthcare faces, which include a growing burden of illness, an increase in chronic health conditions and disabilities due to aging and epidemiological changes, higher demand for health services, overworked and burned-out clinicians, greater societal expectations, and rising health expenditures. While technological advancements in processing power, memory, storage, and the abundance of data have empowered computers to handle increasingly complex tasks with remarkable success, AI introduces a variety of meaningful risks and challenges. Among these are issues related to accuracy and reliability, bias and equity, errors and accountability, transparency, misuse, and privacy of data. As AI systems continue to rapidly integrate into healthcare settings, it is crucial to recognize the inherent risks they bring. These risks demand careful consideration to ensure the responsible and safe deployment of AI in healthcare.
Collapse
Affiliation(s)
- Graham T McMahon
- Accreditation Council for Continuing Medical Education, Chicago, IL 60611, USA
- Department of Medical Education and Division of Endocrinology, Metabolism and Molecular Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| |
Collapse
|
3
|
Yao S, Dai F, Sun P, Zhang W, Qian B, Lu H. Enhancing the fairness of AI prediction models by Quasi-Pareto improvement among heterogeneous thyroid nodule population. Nat Commun 2024; 15:1958. [PMID: 38438371 PMCID: PMC10912763 DOI: 10.1038/s41467-024-44906-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 01/09/2024] [Indexed: 03/06/2024] Open
Abstract
Artificial Intelligence (AI) models for medical diagnosis often face challenges of generalizability and fairness. We highlighted the algorithmic unfairness in a large thyroid ultrasound dataset with significant diagnostic performance disparities across subgroups linked causally to sample size imbalances. To address this, we introduced the Quasi-Pareto Improvement (QPI) approach and a deep learning implementation (QP-Net) combining multi-task learning and domain adaptation to improve model performance among disadvantaged subgroups without compromising overall population performance. On the thyroid ultrasound dataset, our method significantly mitigated the area under curve (AUC) disparity for three less-prevalent subgroups by 0.213, 0.112, and 0.173 while maintaining the AUC for dominant subgroups; we also further confirmed the generalizability of our approach on two public datasets: the ISIC2019 skin disease dataset and the CheXpert chest radiograph dataset. Here we show the QPI approach to be widely applicable in promoting AI for equitable healthcare outcomes.
Collapse
Affiliation(s)
- Siqiong Yao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
- SJTU-Yale Joint Center of Biostatistics and Data Science, National Center for Translational Medicine, MoE Key Lab of Artificial Intelligence, AI Institute Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Fang Dai
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Peng Sun
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Weituo Zhang
- Hongqiao International Institute of Medicine, Shanghai Tong Ren Hospital and School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, PR China.
| | - Biyun Qian
- Hongqiao International Institute of Medicine, Shanghai Tong Ren Hospital and School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, PR China.
| | - Hui Lu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China.
- SJTU-Yale Joint Center of Biostatistics and Data Science, National Center for Translational Medicine, MoE Key Lab of Artificial Intelligence, AI Institute Shanghai Jiao Tong University, Shanghai, 200240, PR China.
- Shanghai Engineering Research Center for Big Data in Pediatric Precision Medicine, NHC Key Laboratory of Medical Embryogenesis and Developmental Molecular Biology & Shanghai Key Laboratory of Embryo and Reproduction Engineering, Shanghai, 200020, PR China.
| |
Collapse
|
4
|
Duan H, Yan W. Visual fatigue a comprehensive review of mechanisms of occurrence, animal model design and nutritional intervention strategies. Crit Rev Food Sci Nutr 2023:1-25. [PMID: 38153314 DOI: 10.1080/10408398.2023.2298789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2023]
Abstract
When the eyes work intensively, it is easy to have eye discomfort such as blurred vision, soreness, dryness, and tearing, that is, visual fatigue. Visual fatigue not only affects work and study efficiency, but long-term visual fatigue can also easily affect physical and mental health. In recent years, with the popularization of electronic products, although it has brought convenience to the office and study, it has also caused more frequent visual fatigue among people who use electronic devices. Moreover, studies have reported that the number of people with visual fatigue is showing a trend of increasing year by year. The range of people involved is also extensive, especially students, people who have been engaged in computer work and fine instruments (such as microscopes) for a long time, and older adults with aging eye function. More and more studies have proposed that supplementation with the proper nutrients can effectively relieve visual fatigue and promote eye health. This review discusses the physiological mechanisms of visual fatigue and the design ideas of animal experiments from the perspective of modern nutritional science. Functional food ingredients with the ability to alleviate visual fatigue are discussed in detail.
Collapse
Affiliation(s)
- Hao Duan
- College of Biochemical Engineering, Beijing Key Laboratory of Bioactive Substances and Functional Food, Beijing Union University, Beijing, China
| | - Wenjie Yan
- College of Biochemical Engineering, Beijing Key Laboratory of Bioactive Substances and Functional Food, Beijing Union University, Beijing, China
| |
Collapse
|
5
|
Dell’Era V, Perotti A, Starnini M, Campagnoli M, Rosa MS, Saino I, Aluffi Valletti P, Garzaro M. Machine Learning Model as a Useful Tool for Prediction of Thyroid Nodules Histology, Aggressiveness and Treatment-Related Complications. J Pers Med 2023; 13:1615. [PMID: 38003930 PMCID: PMC10672369 DOI: 10.3390/jpm13111615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 11/14/2023] [Accepted: 11/15/2023] [Indexed: 11/26/2023] Open
Abstract
Thyroid nodules are very common, 5-15% of which are malignant. Despite the low mortality rate of well-differentiated thyroid cancer, some variants may behave aggressively, making nodule differentiation mandatory. Ultrasound and fine-needle aspiration biopsy are simple, safe, cost-effective and accurate diagnostic tools, but have some potential limits. Recently, machine learning (ML) approaches have been successfully applied to healthcare datasets to predict the outcomes of surgical procedures. The aim of this work is the application of ML to predict tumor histology (HIS), aggressiveness and post-surgical complications in thyroid patients. This retrospective study was conducted at the ENT Division of Eastern Piedmont University, Novara (Italy), and reported data about 1218 patients who underwent surgery between January 2006 and December 2018. For each patient, general information, HIS and outcomes are reported. For each prediction task, we trained ML models on pre-surgery features alone as well as on both pre- and post-surgery data. The ML pipeline included data cleaning, oversampling to deal with unbalanced datasets and exploration of hyper-parameter space for random forest models, testing their stability and ranking feature importance. The main results are (i) the construction of a rich, hand-curated, open dataset including pre- and post-surgery features (ii) the development of accurate yet explainable ML models. Results highlight pre-screening as the most important feature to predict HIS and aggressiveness, and that, in our population, having an out-of-range (Low) fT3 dosage at pre-operative examination is strongly associated with a higher aggressiveness of the disease. Our work shows how ML models can find patterns in thyroid patient data and could support clinicians to refine diagnostic tools and improve their accuracy.
Collapse
Affiliation(s)
- Valeria Dell’Era
- ENT Division, Novara Maggiore Hospital, 28100 Novara, Italy; (M.S.R.); (I.S.)
| | | | - Michele Starnini
- CENTAI Institute, 10138 Turin, Italy; (A.P.)
- Departament de Fisica, Universitat Politecnica de Catalunya, Campus Nord, 08034 Barcelona, Spain
| | - Massimo Campagnoli
- Department of Otorhinolaryngology, Ss. Trinità Hospital, 28021 Borgomanero, Italy;
| | - Maria Silvia Rosa
- ENT Division, Novara Maggiore Hospital, 28100 Novara, Italy; (M.S.R.); (I.S.)
| | - Irene Saino
- ENT Division, Novara Maggiore Hospital, 28100 Novara, Italy; (M.S.R.); (I.S.)
| | - Paolo Aluffi Valletti
- ENT Division, Health Science Department, School of Medicine, Universitá del Piemonte Orientale, 28100 Novara, Italy; (P.A.V.); (M.G.)
| | - Massimiliano Garzaro
- ENT Division, Health Science Department, School of Medicine, Universitá del Piemonte Orientale, 28100 Novara, Italy; (P.A.V.); (M.G.)
| |
Collapse
|
6
|
Chalamalasetti SD, Tamrakar S, Doshi P, Vora NN, Karrothu V, Pathe AR. Gender Equality Trends of First Authors in Publications of Artificial Intelligence and Thyroid. Cureus 2023; 15:e45820. [PMID: 37876392 PMCID: PMC10593141 DOI: 10.7759/cureus.45820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/23/2023] [Indexed: 10/26/2023] Open
Abstract
Thyroid diseases are diverse, ranging from benign conditions to potentially life-threatening disorders. Recently, the application of artificial intelligence (AI) in evaluating thyroid disease has significantly enhanced medical research, diagnosis prediction, and algorithm development. Coupled with this advancement is the rising focus on the importance of gender equality in scientific publications. This study delves into the gender trends of first authors in papers related to "Artificial Intelligence and Thyroid" sourced from PubMed from 2003 to 2022, scrutinizing these trends based on both country and year. A bibliometric analysis was conducted on PubMed to retrieve relevant articles over this 19-year time span. Following this, the names and affiliated countries of the first authors were determined. The Namsor app, a tool for classifying personal names by gender, origin, or ethnicity, was then used to segregate the data based on gender. Statistical analyses were performed using the R software- ARIMA model and Fisher's exact test was applied to examine the correlations between gender and country of origin. From the 254 analyzed articles, 43.5% of the first authors were female, while 56.69% were male. The year 2022 saw the most significant number of female first-author publications. Intriguingly, the European Journal of Radiology was prominent due to its favorable gender ratio. Moreover, the association between gender and country was significant, with China being a standout. Limitations included focusing only on PubMed journals and using a third party for gender identification. Nevertheless, the study reveals a move toward gender parity in AI and thyroid research over the past 18 years, emphasizing the importance of sustained efforts for academic inclusivity.
Collapse
Affiliation(s)
| | - Silbin Tamrakar
- Internal Medicine, Enam Medical College and Hospital, Dhaka, BGD
| | - Preyansh Doshi
- Internal Medicine, Gujarat Cancer Society (GCS) Medical College, Hospital and Research Centre, Ahmedabad, IND
| | - Neera N Vora
- Internal Medicine, Gujarat Cancer Society (GCS) Medical College, Hospital and Research Centre, Ahmedabad, IND
| | - Vishnu Karrothu
- Internal Medicine, All India Institute of Medical Sciences, Bhopal, Bhopal, IND
| | | |
Collapse
|
7
|
Ha EJ, Lee JH, Lee DH, Na DG, Kim JH. Development of a machine learning-based fine-grained risk stratification system for thyroid nodules using predefined clinicoradiological features. Eur Radiol 2023; 33:3211-3221. [PMID: 36600122 DOI: 10.1007/s00330-022-09376-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 09/07/2022] [Accepted: 12/11/2022] [Indexed: 01/05/2023]
Abstract
OBJECTIVE We constructed and validated a machine learning-based malignancy risk estimation model using predefined clinicoradiological features, and evaluated its clinical utility for the management of thyroid nodules. METHODS In total, 5708 benign (n = 4597) and malignant (n = 1111) thyroid nodules were collected from 5081 consecutive patients treated in 26 institutions. Seventeen experienced radiologists evaluated nodule characteristics on ultrasonographic images. Eight predictive models were used to stratify the thyroid nodules according to malignancy risk; model performance was assessed via nested 10-fold cross-validation. The best-performing algorithm was externally validated using data for 454 thyroid nodules from a tertiary hospital, then compared to the Thyroid Imaging Reporting and Data System (TIRADS)-based interpretations of radiologists (American College of Radiology, European and Korean TIRADS, and AACE/ACE/AME guidelines). RESULTS The area under the receiver operating characteristic (AUROC) curves of the algorithms ranged from 0.773 to 0.862. The sensitivities, specificities, positive predictive values, and negative predictive values of the best-performing models were 74.1-76.6%, 80.9-83.4%, 49.2-51.9%, and 93.0-93.5%, respectively. For the external validation set, the ElasticNet values were 83.2%, 89.2%, 81.8%, and 90.1%, respectively. The corresponding TIRADS values were 66.5-85.0%, 61.3-80.8%, 45.9-72.1%, and 81.5-90.3%, respectively. The new model exhibited a significantly higher AUROC and specificity than did the TIRADS risk stratification, although its sensitivity was similar. CONCLUSION We developed a reliable machine learning-based predictive model that demonstrated enhanced specificity when stratifying thyroid nodules according to malignancy risk. This system will contribute to improved personalized management of thyroid nodules. KEY POINTS • The area under the receiver operating characteristic (AUROC) curve, sensitivity, and specificity of our model were 0.914, 83.2%, and 89.2%, respectively (derived using the validation dataset). • Compared to the TIRADS values, the AUROC and specificity are significantly higher, while the sensitivity is similar. • An interactive version of our AI algorithm is at http://tirads.cdss.co.kr .
Collapse
Affiliation(s)
- Eun Ju Ha
- Department of Radiology, Ajou University School of Medicine, Wonchon-Dong, Yeongtong-Gu, Suwon, 16499, South Korea
| | - Jeong Hoon Lee
- Department of Radiology, Ajou University School of Medicine, Wonchon-Dong, Yeongtong-Gu, Suwon, 16499, South Korea
| | - Da Hyun Lee
- Department of Radiology, Ajou University School of Medicine, Wonchon-Dong, Yeongtong-Gu, Suwon, 16499, South Korea
| | - Dong Gyu Na
- Department of Radiology, GangNeung Asan Hospital, University of Ulsan College of Medicine, Gangneung-si, Gangwon-do, 25440, South Korea
| | - Ji-Hoon Kim
- Department of Radiology, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, South Korea.
| |
Collapse
|
8
|
Nittas V, Daniore P, Landers C, Gille F, Amann J, Hubbs S, Puhan MA, Vayena E, Blasimme A. Beyond high hopes: A scoping review of the 2019-2021 scientific discourse on machine learning in medical imaging. PLOS DIGITAL HEALTH 2023; 2:e0000189. [PMID: 36812620 PMCID: PMC9931290 DOI: 10.1371/journal.pdig.0000189] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 01/02/2023] [Indexed: 02/04/2023]
Abstract
Machine learning has become a key driver of the digital health revolution. That comes with a fair share of high hopes and hype. We conducted a scoping review on machine learning in medical imaging, providing a comprehensive outlook of the field's potential, limitations, and future directions. Most reported strengths and promises included: improved (a) analytic power, (b) efficiency (c) decision making, and (d) equity. Most reported challenges included: (a) structural barriers and imaging heterogeneity, (b) scarcity of well-annotated, representative and interconnected imaging datasets (c) validity and performance limitations, including bias and equity issues, and (d) the still missing clinical integration. The boundaries between strengths and challenges, with cross-cutting ethical and regulatory implications, remain blurred. The literature emphasizes explainability and trustworthiness, with a largely missing discussion about the specific technical and regulatory challenges surrounding these concepts. Future trends are expected to shift towards multi-source models, combining imaging with an array of other data, in a more open access, and explainable manner.
Collapse
Affiliation(s)
- Vasileios Nittas
- Health Ethics and Policy Lab, Department of Health Sciences and Technology, Swiss Federal Institute of Technology (ETH Zurich), Zurich, Switzerland
- Epidemiology, Biostatistics and Prevention Institute, Faculty of Medicine, Faculty of Science, University of Zurich, Zurich, Switzerland
| | - Paola Daniore
- Institute for Implementation Science in Health Care, Faculty of Medicine, University of Zurich, Switzerland
- Digital Society Initiative, University of Zurich, Switzerland
| | - Constantin Landers
- Health Ethics and Policy Lab, Department of Health Sciences and Technology, Swiss Federal Institute of Technology (ETH Zurich), Zurich, Switzerland
| | - Felix Gille
- Institute for Implementation Science in Health Care, Faculty of Medicine, University of Zurich, Switzerland
- Digital Society Initiative, University of Zurich, Switzerland
| | - Julia Amann
- Health Ethics and Policy Lab, Department of Health Sciences and Technology, Swiss Federal Institute of Technology (ETH Zurich), Zurich, Switzerland
| | - Shannon Hubbs
- Health Ethics and Policy Lab, Department of Health Sciences and Technology, Swiss Federal Institute of Technology (ETH Zurich), Zurich, Switzerland
| | - Milo Alan Puhan
- Epidemiology, Biostatistics and Prevention Institute, Faculty of Medicine, Faculty of Science, University of Zurich, Zurich, Switzerland
| | - Effy Vayena
- Health Ethics and Policy Lab, Department of Health Sciences and Technology, Swiss Federal Institute of Technology (ETH Zurich), Zurich, Switzerland
| | - Alessandro Blasimme
- Health Ethics and Policy Lab, Department of Health Sciences and Technology, Swiss Federal Institute of Technology (ETH Zurich), Zurich, Switzerland
| |
Collapse
|
9
|
Gao X, Ran X, Ding W. The progress of radiomics in thyroid nodules. Front Oncol 2023; 13:1109319. [PMID: 36959790 PMCID: PMC10029726 DOI: 10.3389/fonc.2023.1109319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Accepted: 02/03/2023] [Indexed: 03/09/2023] Open
Abstract
Due to the development of Artificial Intelligence (AI), Machine Learning (ML), and the improvement of medical imaging equipment, radiomics has become a popular research in recent years. Radiomics can obtain various quantitative features from medical images, highlighting the invisible image traits and significantly enhancing the ability of medical imaging identification and prediction. The literature indicates that radiomics has a high potential in identifying and predicting thyroid nodules. So in this article, we explain the development, definition, and workflow of radiomics. And then, we summarize the applications of various imaging techniques in identifying benign and malignant thyroid nodules, predicting invasiveness and metastasis of thyroid lymph nodes, forecasting the prognosis of thyroid malignancies, and some new advances in molecular level and deep learning. The shortcomings of this technique are also summarized, and future development prospects are provided.
Collapse
Affiliation(s)
| | - Xuan Ran
- *Correspondence: Wei Ding, ; Xuan Ran,
| | - Wei Ding
- *Correspondence: Wei Ding, ; Xuan Ran,
| |
Collapse
|
10
|
Sorrenti S, Dolcetti V, Radzina M, Bellini MI, Frezza F, Munir K, Grani G, Durante C, D’Andrea V, David E, Calò PG, Lori E, Cantisani V. Artificial Intelligence for Thyroid Nodule Characterization: Where Are We Standing? Cancers (Basel) 2022; 14:cancers14143357. [PMID: 35884418 PMCID: PMC9315681 DOI: 10.3390/cancers14143357] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 06/24/2022] [Accepted: 07/08/2022] [Indexed: 12/12/2022] Open
Abstract
Simple Summary In the present review, an up-to-date summary of the state of the art of artificial intelligence (AI) implementation for thyroid nodule characterization and cancer is provided. The opinion on the real effectiveness of AI systems remains controversial. Taking into consideration the largest and most scientifically valid studies, it is possible to state that AI provides results that are comparable or inferior to expert ultrasound specialists and radiologists. Promising data approve AI as a support tool and simultaneously highlight the need for a radiologist supervisory framework for AI provided results. Therefore, current solutions might be more suitable for educational purposes. Abstract Machine learning (ML) is an interdisciplinary sector in the subset of artificial intelligence (AI) that creates systems to set up logical connections using algorithms, and thus offers predictions for complex data analysis. In the present review, an up-to-date summary of the current state of the art regarding ML and AI implementation for thyroid nodule ultrasound characterization and cancer is provided, highlighting controversies over AI application as well as possible benefits of ML, such as, for example, training purposes. There is evidence that AI increases diagnostic accuracy and significantly limits inter-observer variability by using standardized mathematical algorithms. It could also be of aid in practice settings with limited sub-specialty expertise, offering a second opinion by means of radiomics and computer-assisted diagnosis. The introduction of AI represents a revolutionary event in thyroid nodule evaluation, but key issues for further implementation include integration with radiologist expertise, impact on workflow and efficiency, and performance monitoring.
Collapse
Affiliation(s)
- Salvatore Sorrenti
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
| | - Vincenzo Dolcetti
- Department of Radiological, Anatomo-Pathological Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (V.D.); (V.C.)
| | - Maija Radzina
- Radiology Research Laboratory, Riga Stradins University, LV-1007 Riga, Latvia;
- Medical Faculty, University of Latvia, Diagnostic Radiology Institute, Paula Stradina Clinical University Hospital, LV-1007 Riga, Latvia
| | - Maria Irene Bellini
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
- Correspondence:
| | - Fabrizio Frezza
- Department of Information Engineering, Electronics and Telecommunications, “Sapienza” University of Rome, 00184 Rome, Italy; (F.F.); (K.M.)
- Consorzio Nazionale Interuniversitario per le Telecomunicazioni (CNIT), Viale G.P. Usberti 181/A Sede Scientifica di Ingegneria-Palazzina 3, 43124 Parma, Italy
| | - Khushboo Munir
- Department of Information Engineering, Electronics and Telecommunications, “Sapienza” University of Rome, 00184 Rome, Italy; (F.F.); (K.M.)
| | - Giorgio Grani
- Department of Translational and Precision Medicine, “Sapienza” University of Rome, 00161 Rome, Italy; (G.G.); (C.D.); (E.D.)
| | - Cosimo Durante
- Department of Translational and Precision Medicine, “Sapienza” University of Rome, 00161 Rome, Italy; (G.G.); (C.D.); (E.D.)
| | - Vito D’Andrea
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
| | - Emanuele David
- Department of Translational and Precision Medicine, “Sapienza” University of Rome, 00161 Rome, Italy; (G.G.); (C.D.); (E.D.)
| | - Pietro Giorgio Calò
- Department of Surgical Sciences, “Policlinico Universitario Duilio Casula”, University of Cagliari, 09042 Monserrato, Italy;
| | - Eleonora Lori
- Department of Surgical Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (S.S.); (V.D.); (E.L.)
| | - Vito Cantisani
- Department of Radiological, Anatomo-Pathological Sciences, “Sapienza” University of Rome, 00161 Rome, Italy; (V.D.); (V.C.)
| |
Collapse
|
11
|
Gruson D, Dabla P, Stankovic S, Homsak E, Gouget B, Bernardini S, Macq B. Artificial intelligence and thyroid disease management: considerations for thyroid function tests. Biochem Med (Zagreb) 2022; 32:020601. [PMID: 35799984 PMCID: PMC9195598 DOI: 10.11613/bm.2022.020601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2022] [Accepted: 04/05/2022] [Indexed: 12/07/2022] Open
Abstract
Artificial intelligence (AI) is transforming healthcare and offers new tools in clinical research, personalized medicine, and medical diagnostics. Thyroid function tests represent an important asset for physicians in the diagnosis and monitoring of pathologies. Artificial intelligence tools can clearly assist physicians and specialists in laboratory medicine to optimize test prescription, tests interpretation, decision making, process optimization, and assay design. Our article is reviewing several of these aspects. As thyroid AI models rely on large data sets, which often requires distributed learning from multi-center contributions, this article also briefly discusses this issue.
Collapse
Affiliation(s)
- Damien Gruson
- Department of Clinical Biochemistry, Cliniques Universitaires Saint-Luc, Université catholique de Louvain, Ottignies-Louvain-la-Neuve, Belgium
| | - Pradeep Dabla
- Department of Biochemistry, Pant Institute of Postgraduate Medical Education & Research, Delhi, India
| | - Sanja Stankovic
- Center for Medical Biochemistry, University Clinical Center of Serbia, Beograd, Serbia
| | - Evgenija Homsak
- Department for Laboratory Diagnostics, University Clinical Center Maribor, Maribor, Slovenia
| | - Bernard Gouget
- Healthcare Division Committee, Comité Français d’accréditation, Paris, France
| | - Sergio Bernardini
- Department of Experimental Medicine, University of Tor Vergata, Rome, Italy
| | - Benoit Macq
- Institute of Information and Communication Technologies, UCLouvain, Ottignies-Louvain-la-Neuve, Belgium
| |
Collapse
|
12
|
Abstract
Clinical evidence supports the association of ultrasound features with benign or malignant thyroid nodules and serves as the basis for sonographic stratification of thyroid nodules, according to an estimated thyroid cancer risk. Contemporary guidelines recommend management strategies according to thyroid cancer risk, thyroid nodule size, and the clinical scenario. Yet, reproducible and accurate thyroid nodule risk stratification requires expertise, time, and understanding of the weight different ultrasound features have on thyroid cancer risk. The application of artificial intelligence to overcome these limitations is promising and has the potential to improve the care of patients with thyroid nodules.
Collapse
Affiliation(s)
- Nydia Burgos
- Division of Endocrinology, Diabetes and Metabolism, Department of Medicine, University of Puerto Rico, Medical Sciences Campus, Paseo Dr. Jose Celso Barbosa, San Juan 00921, Puerto Rico
| | - Naykky Singh Ospina
- Division of Endocrinology, Diabetes and Metabolism, Department of Medicine, University of Florida, 1600 SW Archer Road, Gainesville, FL 32610, USA
| | - Jennifer A Sipos
- Division of Endocrinology, Diabetes and Metabolism, Department of Medicine, The Ohio State University Wexner Medical Center, 1581 Dodd Drive, Columbus, OH 43210, USA.
| |
Collapse
|
13
|
Swan KZ, Thomas J, Nielsen VE, Jespersen ML, Bonnema SJ. External validation of AIBx, an artificial intelligence model for risk stratification, in thyroid nodules. Eur Thyroid J 2022; 11:e210129. [PMID: 35113036 PMCID: PMC8963165 DOI: 10.1530/etj-21-0129] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 02/03/2022] [Indexed: 11/08/2022] Open
Abstract
BACKGROUND Artificial intelligence algorithms could be used to risk-stratify thyroid nodules and may reduce the subjectivity of ultrasonography. One such algorithm is AIBx which has shown good performance. However, external validation is crucial prior to clinical implementation. MATERIALS AND METHODS Patients harboring thyroid nodules 1-4 cm in size, undergoing thyroid surgery from 2014 to 2016 in a single institution, were included. A histological diagnosis was obtained in all cases. Medullary thyroid cancer, metastasis from other cancers, thyroid lymphomas, and purely cystic nodules were excluded. Retrospectively, transverse ultrasound images of the nodules were analyzed by AIBx, and the results were compared with histopathology and Thyroid Imaging Reporting and Data System (TIRADS), calculated by experienced physicians. RESULTS Out of 329 patients, 257 nodules from 209 individuals met the eligibility criteria. Fifty-one nodules (20%) were malignant. AIBx had a negative predictive value (NPV) of 89.2%. Sensitivity, specificity, and positive predictive values (PPV) were 78.4, 44.2, and 25.8%, respectively. Considering both TIRADS 4 and TIRADS 5 nodules as malignant lesions resulted in an NPV of 93.0%, while PPV and specificity were only 22.4 and 19.4%, respectively. By combining AIBx with TIRADS, no malignant nodules were overlooked. CONCLUSION When applied to ultrasound images obtained in a different setting than used for training, AIBx had comparable NPVs to TIRADS. AIBx performed even better when combined with TIRADS, thus reducing false negative assessments. These data support the concept of AIBx for thyroid nodules, and this tool may help less experienced operators by reducing the subjectivity inherent to thyroid ultrasound interpretation.
Collapse
Affiliation(s)
- Kristine Z Swan
- Department of ORL, Head- and Neck Surgery, Aarhus University Hospital, Aarhus, Denmark
- Correspondence should be addressed to K Z Swan:
| | - Johnson Thomas
- Department of Endocrinology, Mercy Hospital, Springfield, Missouri, USA
| | - Viveque E Nielsen
- Department of ORL, Head- and Neck Surgery, Odense University Hospital, Odense, Denmark
| | | | - Steen J Bonnema
- Department of Endocrinology, Odense University Hospital, Odense, Denmark
| |
Collapse
|
14
|
Chambara N, Liu SYW, Lo X, Ying M. Comparative Analysis of Computer-Aided Diagnosis and Computer-Assisted Subjective Assessment in Thyroid Ultrasound. Life (Basel) 2021; 11:life11111148. [PMID: 34833024 PMCID: PMC8621517 DOI: 10.3390/life11111148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Revised: 10/21/2021] [Accepted: 10/25/2021] [Indexed: 12/17/2022] Open
Abstract
The value of computer-aided diagnosis (CAD) and computer-assisted techniques equipped with different TIRADS remains ambiguous. Parallel diagnosis performances of computer-assisted subjective assessments and CAD were compared based on AACE, ATA, EU, and KSThR TIRADS. CAD software computed the diagnosis of 162 thyroid nodule sonograms. Two raters (R1 and R2) independently rated the sonographic features of the nodules using an online risk calculator while blinded to pathology results. Diagnostic efficiency measures were calculated based on the final pathology results. R1 had higher diagnostic performance outcomes than CAD with similarities between KSThR (SEN: 90.3% vs. 83.9%, p = 0.57; SPEC: 46% vs. 51%, p = 0.21; AUROC: 0.76 vs. 0.67, p = 0.02), and EU (SEN: 85.5% vs. 79%, p = 0.82; SPEC: 62% vs. 55%, p = 0.27; AUROC: 0.74 vs. 0.67, p = 0.06). Similarly, R2 had higher AUROC and specificity but lower sensitivity than CAD (KSThR-AUROC: 0.74 vs. 0.67, p = 0.13; SPEC: 61% vs. 46%, p = 0.02 and SEN: 75.8% vs. 83.9%, p = 0.31, and EU-AUROC: 0.69 vs. 0.67, p = 0.57, SPEC: 64% vs. 55%, p = 0.19, and SEN: 71% vs. 79%, p = 0.51, respectively). CAD had higher sensitivity but lower specificity than both R1 and R2 with AACE for 114 specified nodules (SEN: 92.5% vs. 88.7%, p = 0.50; 92.5% vs. 79.3%, p = 0.02, and SPEC: 26.2% vs. 54.1%, p = 0.001; 26.2% vs. 62.3%, p < 0.001, respectively). All diagnostic performance outcomes were comparable for ATA with 96 specified nodules. Computer-assisted subjective interpretation using KSThR is more ideal for ruling out papillary thyroid carcinomas than CAD. Future larger multi-center and multi-rater prospective studies with a diverse representation of thyroid cancers are necessary to validate these findings.
Collapse
Affiliation(s)
- Nonhlanhla Chambara
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, China;
| | - Shirley Yuk Wah Liu
- Department of Surgery, The Chinese University of Hong Kong, Prince of Wales Hospital, Shatin, New Territories, Hong Kong, China;
| | - Xina Lo
- Department of Surgery, North District Hospital, Sheung Shui, New Territories, Hong Kong, China;
| | - Michael Ying
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, China;
- Correspondence: ; Tel.: +852-3400-8566
| |
Collapse
|
15
|
An integrated AI model to improve diagnostic accuracy of ultrasound and output known risk features in suspicious thyroid nodules. Eur Radiol 2021; 32:2120-2129. [PMID: 34657970 DOI: 10.1007/s00330-021-08298-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 08/16/2021] [Accepted: 08/23/2021] [Indexed: 12/12/2022]
Abstract
OBJECTIVES From the viewpoint of ultrasound (US) physicians, an ideal thyroid US computer-assisted diagnostic (CAD) system for thyroid cancer should perform well in suspicious thyroid nodules with atypical risk features and be able to output explainable results. This study aims to develop an explainable US CAD model for suspicious thyroid nodules. METHODS A total of 2992 solid or almost-solid thyroid nodules were analyzed retrospectively. All nodules had pathological results (1070 malignancies and 1992 benignities) confirmed by ultrasound-guided fine-needle aspiration cytology and histopathology after thyroidectomy. A deep learning model (ResNet50) and a multiple risk features learning ensemble model (XGBoost) were used to train the US images of 2794 thyroid nodules. Then, an integrated AI model was generated by combining both models. The diagnostic accuracies of the three AI models (ResNet50, XGBoost, and the integrated model) were predicted in a testing set including 198 thyroid nodules and compared to the diagnostic efficacy of five ultrasonographers. RESULTS The accuracy of the integrated model was 76.77%, while the mean accuracy of the ultrasonographers was 68.38%. Of the risk features, microcalcifications showed the highest contribution to the diagnosis of malignant nodules. CONCLUSIONS The integrated AI model in our study can improve the diagnostic accuracy of suspicious thyroid nodules and output the known risk features simultaneously, thus aiding in training young ultrasonographers by linking the explainable results to their clinical experience and advancing the acceptance of AI diagnosis for thyroid cancer in clinical practice. KEY POINTS • We developed an artificial intelligence (AI) diagnosis model based on both deep learning and multiple risk feature ensemble learning methods. • The AI diagnosis model showed higher diagnostic accuracy for suspicious thyroid nodules than ultrasonographers. • The AI diagnosis model showed partial explainability by outputting the known risk features, thus aiding young ultrasonic doctors in increasing the diagnostic level for thyroid cancer.
Collapse
|