1
|
Cichosz SL. Enhancing Transparency and Reporting Standards in Diabetes Prediction Modeling: The Significance of the TRIPOD+AI 2024 Statement. J Diabetes Sci Technol 2024; 18:989-990. [PMID: 38801203 PMCID: PMC11307208 DOI: 10.1177/19322968241255106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Affiliation(s)
- Simon Lebech Cichosz
- Department of Health Science and Technology, Aalborg University, Aalborg, Denmark
| |
Collapse
|
2
|
Xu R, Wang Z. Generative artificial intelligence in healthcare from the perspective of digital media: Applications, opportunities and challenges. Heliyon 2024; 10:e32364. [PMID: 38975200 PMCID: PMC11225727 DOI: 10.1016/j.heliyon.2024.e32364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Accepted: 06/03/2024] [Indexed: 07/09/2024] Open
Abstract
Introduction The emergence and application of generative artificial intelligence/large language models (hereafter GenAI LLMs) have the potential for significant impact on the healthcare industry. However, there is currently a lack of systematic research on GenAI LLMs in healthcare based on reliable data. This article aims to conduct an exploratory study of the application of GenAI LLMs (i.e., ChatGPT) in healthcare from the perspective of digital media (i.e., online news), including the application scenarios, potential opportunities, and challenges. Methods This research used thematic qualitative text analysis in five steps: firstly, developing main topical categories based on relevant articles; secondly, encoding the search keywords using these categories; thirdly, conducting searches for news articles via Google ; fourthly, encoding the sub-categories using the elaborate category system; and finally, conducting category-based analysis and presenting the results. Natural language processing techniques, including the TermRaider and AntConc tool, were applied in the aforementioned steps to assist in text qualitative analysis. Additionally, this study built a framework, using for analyzing the above three topics, from the perspective of five different stakeholders, including healthcare demanders and providers. Results This study summarizes 26 applications (e.g., provide medical advice, provide diagnosis and triage recommendations, provide mental health support, etc.), 21 opportunities (e.g., make healthcare more accessible, reduce healthcare costs, improve patients care, etc.), and 17 challenges (e.g., generate inaccurate/misleading/wrong answers, raise privacy concerns, lack of transparency, etc.), and analyzes the reasons for the formation of these key items and the links between the three research topics. Conclusions The application of GenAI LLMs in healthcare is primarily focused on transforming the way healthcare demanders access medical services (i.e., making it more intelligent, refined, and humane) and optimizing the processes through which healthcare providers offer medical services (i.e., simplifying, ensuring timeliness, and reducing errors). As the application becomes more widespread and deepens, GenAI LLMs is expected to have a revolutionary impact on traditional healthcare service models, but it also inevitably raises ethical and security concerns. Furthermore, GenAI LLMs applied in healthcare is still in the initial stage, which can be accelerated from a specific healthcare field (e.g., mental health) or a specific mechanism (e.g., GenAI LLMs' economic benefits allocation mechanism applied to healthcare) with empirical or clinical research.
Collapse
Affiliation(s)
- Rui Xu
- School of Economics, Guangdong University of Technology, Guangzhou, China
| | - Zhong Wang
- School of Economics, Guangdong University of Technology, Guangzhou, China
- Key Laboratory of Digital Economy and Data Governance, Guangdong University of Technology, Guangzhou, China
| |
Collapse
|
3
|
Neo JRE, Ser JS, Tay SS. Use of large language model-based chatbots in managing the rehabilitation concerns and education needs of outpatient stroke survivors and caregivers. Front Digit Health 2024; 6:1395501. [PMID: 38784703 PMCID: PMC11111889 DOI: 10.3389/fdgth.2024.1395501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/19/2024] [Indexed: 05/25/2024] Open
Abstract
Background The utility of large language model-based (LLM) artificial intelligence (AI) chatbots in many aspects of healthcare is becoming apparent though their ability to address patient concerns remains unknown. We sought to evaluate the performance of two well-known, freely-accessible chatbots, ChatGPT and Google Bard, in responding to common questions about stroke rehabilitation posed by patients and their caregivers. Methods We collected questions from outpatients and their caregivers through a survey, categorised them by theme, and created representative questions to be posed to both chatbots. We then evaluated the chatbots' responses based on accuracy, safety, relevance, and readability. Interrater agreement was also tracked. Results Although both chatbots achieved similar overall scores, Google Bard performed slightly better in relevance and safety. Both provided readable responses with some general accuracy, but struggled with hallucinated responses, were often not specific, and lacked awareness of the possibility for emotional situations with the potential to turn dangerous. Additionally, interrater agreement was low, highlighting the variability in physician acceptance of their responses. Conclusions AI chatbots show potential in patient-facing support roles, but issues remain regarding safety, accuracy, and relevance. Future chatbots should address these problems to ensure that they can reliably and independently manage the concerns and questions of stroke patients and their caregivers.
Collapse
Affiliation(s)
- Jin Rui Edmund Neo
- Department of Rehabilitation Medicine, Changi General Hospital, Singapore, Singapore
| | - Joon Sin Ser
- Rehabilitation Medicine, SingHealth Residency, Singapore, Singapore
| | - San San Tay
- Department of Rehabilitation Medicine, Changi General Hospital, Singapore, Singapore
| |
Collapse
|
4
|
Mert S, Stoerzer P, Brauer J, Fuchs B, Haas-Lützenberger EM, Demmer W, Giunta RE, Nuernberger T. Diagnostic power of ChatGPT 4 in distal radius fracture detection through wrist radiographs. Arch Orthop Trauma Surg 2024; 144:2461-2467. [PMID: 38578309 PMCID: PMC11093861 DOI: 10.1007/s00402-024-05298-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/09/2024] [Accepted: 03/27/2024] [Indexed: 04/06/2024]
Abstract
Distal radius fractures rank among the most prevalent fractures in humans, necessitating accurate radiological imaging and interpretation for optimal diagnosis and treatment. In addition to human radiologists, artificial intelligence systems are increasingly employed for radiological assessments. Since 2023, ChatGPT 4 has offered image analysis capabilities, which can also be used for the analysis of wrist radiographs. This study evaluates the diagnostic power of ChatGPT 4 in identifying distal radius fractures, comparing it with a board-certified radiologist, a hand surgery resident, a medical student, and the well-established AI Gleamer BoneView™. Results demonstrate ChatGPT 4's good diagnostic accuracy (sensitivity 0.88, specificity 0.98, diagnostic power (AUC) 0.93), surpassing the medical student (sensitivity 0.98, specificity 0.72, diagnostic power (AUC) 0.85; p = 0.04) significantly. Nevertheless, the diagnostic power of ChatGPT 4 lags behind the hand surgery resident (sensitivity 0.99, specificity 0.98, diagnostic power (AUC) 0.985; p = 0.014) and Gleamer BoneView™(sensitivity 1.00, specificity 0.98, diagnostic power (AUC) 0.99; p = 0.006). This study highlights the utility and potential applications of artificial intelligence in modern medicine, emphasizing ChatGPT 4 as a valuable tool for enhancing diagnostic capabilities in the field of medical imaging.
Collapse
Affiliation(s)
- Sinan Mert
- Division of Hand, Plastic and Aesthetic Surgery, LMU University Hospital, LMU Munich, 80336, München, Germany.
| | - Patrick Stoerzer
- Division of Hand, Plastic and Aesthetic Surgery, LMU University Hospital, LMU Munich, 80336, München, Germany
| | - Johannes Brauer
- Division of Hand, Plastic and Aesthetic Surgery, LMU University Hospital, LMU Munich, 80336, München, Germany
| | - Benedikt Fuchs
- Division of Hand, Plastic and Aesthetic Surgery, LMU University Hospital, LMU Munich, 80336, München, Germany
| | | | - Wolfram Demmer
- Division of Hand, Plastic and Aesthetic Surgery, LMU University Hospital, LMU Munich, 80336, München, Germany
| | - Riccardo E Giunta
- Division of Hand, Plastic and Aesthetic Surgery, LMU University Hospital, LMU Munich, 80336, München, Germany
| | - Tim Nuernberger
- Division of Hand, Plastic and Aesthetic Surgery, LMU University Hospital, LMU Munich, 80336, München, Germany
| |
Collapse
|
5
|
Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B, Ghassemi M, Liu X, Reitsma JB, van Smeden M, Boulesteix AL, Camaradou JC, Celi LA, Denaxas S, Denniston AK, Glocker B, Golub RM, Harvey H, Heinze G, Hoffman MM, Kengne AP, Lam E, Lee N, Loder EW, Maier-Hein L, Mateen BA, McCradden MD, Oakden-Rayner L, Ordish J, Parnell R, Rose S, Singh K, Wynants L, Logullo P. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ 2024; 385:e078378. [PMID: 38626948 PMCID: PMC11019967 DOI: 10.1136/bmj-2023-078378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/17/2024] [Indexed: 04/19/2024]
Affiliation(s)
- Gary S Collins
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Karel G M Moons
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Paula Dhiman
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Andrew L Beam
- Department of Epidemiology, Harvard T H Chan School of Public Health, Boston, MA, USA
| | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Science, Leiden University Medical Centre, Leiden, Netherlands
| | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Xiaoxuan Liu
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Johannes B Reitsma
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Maarten van Smeden
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Anne-Laure Boulesteix
- Institute for Medical Information Processing, Biometry and Epidemiology, Faculty of Medicine, Ludwig-Maximilians-University of Munich and Munich Centre of Machine Learning, Germany
| | - Jennifer Catherine Camaradou
- Patient representative, Health Data Research UK patient and public involvement and engagement group
- Patient representative, University of East Anglia, Faculty of Health Sciences, Norwich Research Park, Norwich, UK
| | - Leo Anthony Celi
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biostatistics, Harvard T H Chan School of Public Health, Boston, MA, USA
| | - Spiros Denaxas
- Institute of Health Informatics, University College London, London, UK
- British Heart Foundation Data Science Centre, London, UK
| | - Alastair K Denniston
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Ben Glocker
- Department of Computing, Imperial College London, London, UK
| | - Robert M Golub
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | | | - Georg Heinze
- Section for Clinical Biometrics, Centre for Medical Data Science, Medical University of Vienna, Vienna, Austria
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | | | - Emily Lam
- Patient representative, Health Data Research UK patient and public involvement and engagement group
| | - Naomi Lee
- National Institute for Health and Care Excellence, London, UK
| | - Elizabeth W Loder
- The BMJ, London, UK
- Department of Neurology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Lena Maier-Hein
- Department of Intelligent Medical Systems, German Cancer Research Centre, Heidelberg, Germany
| | - Bilal A Mateen
- Institute of Health Informatics, University College London, London, UK
- Wellcome Trust, London, UK
- Alan Turing Institute, London, UK
| | - Melissa D McCradden
- Department of Bioethics, Hospital for Sick Children Toronto, ON, Canada
- Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
| | - Johan Ordish
- Medicines and Healthcare products Regulatory Agency, London, UK
| | - Richard Parnell
- Patient representative, Health Data Research UK patient and public involvement and engagement group
| | - Sherri Rose
- Department of Health Policy and Center for Health Policy, Stanford University, Stanford, CA, USA
| | - Karandeep Singh
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Laure Wynants
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Patricia Logullo
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| |
Collapse
|
6
|
Kolbinger FR, Veldhuizen GP, Zhu J, Truhn D, Kather JN. Reporting guidelines in medical artificial intelligence: a systematic review and meta-analysis. COMMUNICATIONS MEDICINE 2024; 4:71. [PMID: 38605106 PMCID: PMC11009315 DOI: 10.1038/s43856-024-00492-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 03/27/2024] [Indexed: 04/13/2024] Open
Abstract
BACKGROUND The field of Artificial Intelligence (AI) holds transformative potential in medicine. However, the lack of universal reporting guidelines poses challenges in ensuring the validity and reproducibility of published research studies in this field. METHODS Based on a systematic review of academic publications and reporting standards demanded by both international consortia and regulatory stakeholders as well as leading journals in the fields of medicine and medical informatics, 26 reporting guidelines published between 2009 and 2023 were included in this analysis. Guidelines were stratified by breadth (general or specific to medical fields), underlying consensus quality, and target research phase (preclinical, translational, clinical) and subsequently analyzed regarding the overlap and variations in guideline items. RESULTS AI reporting guidelines for medical research vary with respect to the quality of the underlying consensus process, breadth, and target research phase. Some guideline items such as reporting of study design and model performance recur across guidelines, whereas other items are specific to particular fields and research stages. CONCLUSIONS Our analysis highlights the importance of reporting guidelines in clinical AI research and underscores the need for common standards that address the identified variations and gaps in current guidelines. Overall, this comprehensive overview could help researchers and public stakeholders reinforce quality standards for increased reliability, reproducibility, clinical validity, and public trust in AI research in healthcare. This could facilitate the safe, effective, and ethical translation of AI methods into clinical applications that will ultimately improve patient outcomes.
Collapse
Grants
- UM1 TR004402 NCATS NIH HHS
- JNK is supported by the German Federal Ministry of Health (DEEP LIVER, ZMVI1-2520DAT111) and the Max-Eder-Programme of the German Cancer Aid (grant #70113864), the German Federal Ministry of Education and Research (PEARL, 01KD2104C; CAMINO, 01EO2101; SWAG, 01KD2215A; TRANSFORM LIVER, 031L0312A), the German Academic Exchange Service (SECAI, 57616814), the German Federal Joint Committee (Transplant.KI, 01VSF21048) the European Union (ODELIA, 101057091; GENIAL, 101096312) and the National Institute for Health and Care Research (NIHR, NIHR213331) Leeds Biomedical Research Centre.
Collapse
Affiliation(s)
- Fiona R Kolbinger
- Else Kroener Fresenius Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany
- Department of Visceral, Thoracic and Vascular Surgery, University Hospital and Faculty of Medicine Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN, USA
- Regenstrief Center for Healthcare Engineering, Purdue University, West Lafayette, IN, USA
- Department of Biostatistics and Health Data Science, Richard M. Fairbanks School of Public Health, Indiana University, Indianapolis, IN, USA
- Indiana University Simon Comprehensive Cancer Center, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Gregory P Veldhuizen
- Else Kroener Fresenius Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany
| | - Jiefu Zhu
- Else Kroener Fresenius Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany
| | - Daniel Truhn
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany
| | - Jakob Nikolas Kather
- Else Kroener Fresenius Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany.
- Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany.
- Department of Medicine I, University Hospital Dresden, Dresden, Germany.
- Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany.
| |
Collapse
|
7
|
Pruski M. What does it mean for a clinical AI to be just: conflicts between local fairness and being fit-for-purpose? JOURNAL OF MEDICAL ETHICS 2024:jme-2023-109675. [PMID: 38423759 DOI: 10.1136/jme-2023-109675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Accepted: 02/15/2024] [Indexed: 03/02/2024]
Abstract
There have been repeated calls to ensure that clinical artificial intelligence (AI) is not discriminatory, that is, it provides its intended benefit to all members of society irrespective of the status of any protected characteristics of individuals in whose healthcare the AI might participate. There have also been repeated calls to ensure that any clinical AI is tailored to the local population in which it is being used to ensure that it is fit-for-purpose. Yet, there might be a clash between these two calls since tailoring an AI to a local population might reduce its effectiveness when the AI is used in the care of individuals who have characteristics which are not represented in the local population. Here, I explore the bioethical concept of local fairness as applied to clinical AI. I first introduce the discussion concerning fairness and inequalities in healthcare and how this problem has continued in attempts to develop AI-enhanced healthcare. I then discuss various technical aspects which might affect the implementation of local fairness. Next, I introduce some rule of law considerations into the discussion to contextualise the issue better by drawing key parallels. I then discuss some potential technical solutions which have been proposed to address the issue of local fairness. Finally, I outline which solutions I consider most likely to contribute to a fit-for-purpose and fair AI.
Collapse
Affiliation(s)
- Michal Pruski
- Department of Medical Physics and Clinical Engineering, Cardiff and Vale UHB, Cardiff, UK
- School of Health Sciences, The University of Manchester, Manchester, UK
| |
Collapse
|
8
|
Cai Y, Cai YQ, Tang LY, Wang YH, Gong M, Jing TC, Li HJ, Li-Ling J, Hu W, Yin Z, Gong DX, Zhang GW. Artificial intelligence in the risk prediction models of cardiovascular disease and development of an independent validation screening tool: a systematic review. BMC Med 2024; 22:56. [PMID: 38317226 PMCID: PMC10845808 DOI: 10.1186/s12916-024-03273-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Accepted: 01/23/2024] [Indexed: 02/07/2024] Open
Abstract
BACKGROUND A comprehensive overview of artificial intelligence (AI) for cardiovascular disease (CVD) prediction and a screening tool of AI models (AI-Ms) for independent external validation are lacking. This systematic review aims to identify, describe, and appraise AI-Ms of CVD prediction in the general and special populations and develop a new independent validation score (IVS) for AI-Ms replicability evaluation. METHODS PubMed, Web of Science, Embase, and IEEE library were searched up to July 2021. Data extraction and analysis were performed for the populations, distribution, predictors, algorithms, etc. The risk of bias was evaluated with the prediction risk of bias assessment tool (PROBAST). Subsequently, we designed IVS for model replicability evaluation with five steps in five items, including transparency of algorithms, performance of models, feasibility of reproduction, risk of reproduction, and clinical implication, respectively. The review is registered in PROSPERO (No. CRD42021271789). RESULTS In 20,887 screened references, 79 articles (82.5% in 2017-2021) were included, which contained 114 datasets (67 in Europe and North America, but 0 in Africa). We identified 486 AI-Ms, of which the majority were in development (n = 380), but none of them had undergone independent external validation. A total of 66 idiographic algorithms were found; however, 36.4% were used only once and only 39.4% over three times. A large number of different predictors (range 5-52,000, median 21) and large-span sample size (range 80-3,660,000, median 4466) were observed. All models were at high risk of bias according to PROBAST, primarily due to the incorrect use of statistical methods. IVS analysis confirmed only 10 models as "recommended"; however, 281 and 187 were "not recommended" and "warning," respectively. CONCLUSION AI has led the digital revolution in the field of CVD prediction, but is still in the early stage of development as the defects of research design, report, and evaluation systems. The IVS we developed may contribute to independent external validation and the development of this field.
Collapse
Affiliation(s)
- Yue Cai
- China Medical University, Shenyang, 110122, China
| | - Yu-Qing Cai
- China Medical University, Shenyang, 110122, China
| | - Li-Ying Tang
- China Medical University, Shenyang, 110122, China
| | - Yi-Han Wang
- China Medical University, Shenyang, 110122, China
| | - Mengchun Gong
- Digital Health China Co. Ltd, Beijing, 100089, China
| | - Tian-Ci Jing
- Smart Hospital Management Department, the First Hospital of China Medical University, Shenyang, 110001, China
| | - Hui-Jun Li
- Shenyang Medical & Film Science and Technology Co. Ltd., Shenyang, 110001, China
- Enduring Medicine Smart Innovation Research Institute, Shenyang, 110001, China
| | - Jesse Li-Ling
- Institute of Genetic Medicine, School of Life Science, State Key Laboratory of Biotherapy, Sichuan University, Chengdu, 610065, China
| | - Wei Hu
- Bayi Orthopedic Hospital, Chengdu, 610017, China
| | - Zhihua Yin
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, China.
| | - Da-Xin Gong
- Smart Hospital Management Department, the First Hospital of China Medical University, Shenyang, 110001, China.
- The Internet Hospital Branch of the Chinese Research Hospital Association, Beijing, 100006, China.
| | - Guang-Wei Zhang
- Smart Hospital Management Department, the First Hospital of China Medical University, Shenyang, 110001, China.
- The Internet Hospital Branch of the Chinese Research Hospital Association, Beijing, 100006, China.
| |
Collapse
|
9
|
Braun BJ, Histing T, Menger MM, Herath SC, Mueller-Franzes GA, Grimm B, Marmor MT, Truhn D. Wearable activity data can predict functional recovery after musculoskeletal injury: Feasibility of a machine learning approach. Injury 2024; 55:111254. [PMID: 38070329 DOI: 10.1016/j.injury.2023.111254] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 10/23/2023] [Accepted: 11/26/2023] [Indexed: 01/29/2024]
Abstract
Delayed functional recovery after injury is associated with significant personal and socioeconomic burden. Identification of patients at risk for a prolonged recovery after a musculoskeletal injury is thus of high relevance. The aim of the current study was to show the feasibility of using a machine learning assisted model to predict functional recovery based on the pre- and immediate post injury patient activity as measured with wearable systems in trauma patients. Patients with a pre-existing wearable (smartphone and/or body-worn sensor), data availability of at least 7 days prior to their injury, and any musculoskeletal injury of the upper or lower extremity were included in this study. Patient age, sex, injured extremity, time off work and step count as activity data were recorded continuously both pre- and post-injury. Descriptive statistics were performed and a logistic regression machine learning model was used to predict the patient's functional recovery status after 6 weeks based on their pre- and post-injury activity characteristics. Overall 38 patients (7 upper extremity, 24 lower extremity, 5 pelvis, 2 combined) were included in this proof-of-concept study. The average follow-up with available wearable data was 85.4 days. Based on the activity data, a predictive model was constructed to determine the likelihood of having a recovery of at least 50 % of the pre-injury activity state by post injury week 6. Based on the individual activity by week 3 a predictive accuracy of over 80 % was achieved on an independent test set (F1=0,82; AUC=0,86; ACC=8,83). The employed model is feasible to assess the principal risk for a slower recovery based on readily available personal wearable activity data. The model has the potential to identify patients requiring additional aftercare attention early during the treatment course, thus optimizing return to the pre-injury status through focused interventions. Additional patient data is needed to adapt the model to more specifically focus on different fracture entities and patient groups.
Collapse
Affiliation(s)
- Benedikt J Braun
- University Hospital Tuebingen, Eberhard-Karls-University Tuebingen, BG Unfallklinik, Schnarrenbergstr. 95, Tuebingen 72076, Federal Republic of Germany.
| | - Tina Histing
- University Hospital Tuebingen, Eberhard-Karls-University Tuebingen, BG Unfallklinik, Schnarrenbergstr. 95, Tuebingen 72076, Federal Republic of Germany
| | - Maximilian M Menger
- University Hospital Tuebingen, Eberhard-Karls-University Tuebingen, BG Unfallklinik, Schnarrenbergstr. 95, Tuebingen 72076, Federal Republic of Germany
| | - Steven C Herath
- University Hospital Tuebingen, Eberhard-Karls-University Tuebingen, BG Unfallklinik, Schnarrenbergstr. 95, Tuebingen 72076, Federal Republic of Germany
| | - Gustav A Mueller-Franzes
- Departments of Diagnostic and Interventional Radiology, RWTH Aachen University Aachen, Aachen, Federal Republic of Germany
| | - Bernd Grimm
- Orthopaedic Trauma Institute (OTI), University of California, San Francisco General Hospital, San Franci-sco, CA, USA
| | - Meir T Marmor
- Department of Precision Health, Human Motion, Orthopaedics, Sports Medicine and Digital Methods Group, Lux-embourg Institute of Health, Strassen 1445, Luxembourg
| | - Daniel Truhn
- Departments of Diagnostic and Interventional Radiology, RWTH Aachen University Aachen, Aachen, Federal Republic of Germany
| |
Collapse
|
10
|
Zhong J, Xing Y, Lu J, Zhang G, Mao S, Chen H, Yin Q, Cen Q, Jiang R, Hu Y, Ding D, Ge X, Zhang H, Yao W. The endorsement of general and artificial intelligence reporting guidelines in radiological journals: a meta-research study. BMC Med Res Methodol 2023; 23:292. [PMID: 38093215 PMCID: PMC10717715 DOI: 10.1186/s12874-023-02117-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 12/01/2023] [Indexed: 12/17/2023] Open
Abstract
BACKGROUND Complete reporting is essential for clinical research. However, the endorsement of reporting guidelines in radiological journals is still unclear. Further, as a field extensively utilizing artificial intelligence (AI), the adoption of both general and AI reporting guidelines would be necessary for enhancing quality and transparency of radiological research. This study aims to investigate the endorsement of general reporting guidelines and those for AI applications in medical imaging in radiological journals, and explore associated journal characteristic variables. METHODS This meta-research study screened journals from the Radiology, Nuclear Medicine & Medical Imaging category, Science Citation Index Expanded of the 2022 Journal Citation Reports, and excluded journals not publishing original research, in non-English languages, and instructions for authors unavailable. The endorsement of fifteen general reporting guidelines and ten AI reporting guidelines was rated using a five-level tool: "active strong", "active weak", "passive moderate", "passive weak", and "none". The association between endorsement and journal characteristic variables was evaluated by logistic regression analysis. RESULTS We included 117 journals. The top-five endorsed reporting guidelines were CONSORT (Consolidated Standards of Reporting Trials, 58.1%, 68/117), PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses, 54.7%, 64/117), STROBE (STrengthening the Reporting of Observational Studies in Epidemiology, 51.3%, 60/117), STARD (Standards for Reporting of Diagnostic Accuracy, 50.4%, 59/117), and ARRIVE (Animal Research Reporting of In Vivo Experiments, 35.9%, 42/117). The most implemented AI reporting guideline was CLAIM (Checklist for Artificial Intelligence in Medical Imaging, 1.7%, 2/117), while other nine AI reporting guidelines were not mentioned. The Journal Impact Factor quartile and publisher were associated with endorsement of reporting guidelines in radiological journals. CONCLUSIONS The general reporting guideline endorsement was suboptimal in radiological journals. The implementation of reporting guidelines for AI applications in medical imaging was extremely low. Their adoption should be strengthened to facilitate quality and transparency of radiological study reporting.
Collapse
Affiliation(s)
- Jingyu Zhong
- Department of Imaging, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, China
| | - Yue Xing
- Department of Imaging, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, China
| | - Junjie Lu
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Guangcheng Zhang
- Department of Orthopedics, Shanghai Sixth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200233, China
| | - Shiqi Mao
- Department of Medical Oncology, Shanghai Pulmonary Hospital, Tongji University School of Medicine, Shanghai, 200433, China
| | - Haoda Chen
- Department of General Surgery, Pancreatic Disease Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Qian Yin
- Department of Pathology, Shanghai Sixth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200233, China
| | - Qingqing Cen
- Department of Dermatology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200011, China
| | - Run Jiang
- Department of Pharmacovigilance, Shanghai Hansoh BioMedical Co., Ltd., Shanghai, 201203, China
| | - Yangfan Hu
- Department of Imaging, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, China
| | - Defang Ding
- Department of Imaging, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, China
| | - Xiang Ge
- Department of Imaging, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, China
| | - Huan Zhang
- Department of Radiology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| | - Weiwu Yao
- Department of Imaging, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200336, China.
| |
Collapse
|
11
|
Serdarogullari M, Liperis G, Sharma K, Ammar OF, Uraji J, Cimadomo D, Alteri A, Popovic M, Fraire-Zamora JJ. Unpacking the artificial intelligence toolbox for embryo ploidy prediction. Hum Reprod 2023; 38:2538-2542. [PMID: 37877410 DOI: 10.1093/humrep/dead223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2023] Open
Affiliation(s)
- Munevver Serdarogullari
- Department of Histology and Embryology, Faculty of Medicine, Cyprus International University, Northern Cyprus, Turkey
| | - George Liperis
- Westmead Fertility Centre, Institute of Reproductive Medicine, University of Sydney, Westmead, NSW, Australia
| | - Kashish Sharma
- HealthPlus Fertility Center, HealthPlus Network of Specialty Centers, Abu Dhabi, United Arab Emirates
| | - Omar F Ammar
- Biomaterials Cluster, Bernal Institute, University of Limerick, Limerick, Ireland
- School of Engineering, Faculty of Science and Engineering, University of Limerick, Limerick, Ireland
| | - Julia Uraji
- IVF Laboratory, TFP Düsseldorf GmbH, Düsseldorf, Germany
| | - Danilo Cimadomo
- IVIRMA Global Research Alliance, Genera, Clinica Valle Giulia, Rome, Italy
| | | | | | | |
Collapse
|
12
|
Zantvoort K, Scharfenberger J, Boß L, Lehr D, Funk B. Finding the Best Match - a Case Study on the (Text-)Feature and Model Choice in Digital Mental Health Interventions. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2023; 7:447-479. [PMID: 37927375 PMCID: PMC10620349 DOI: 10.1007/s41666-023-00148-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 08/29/2023] [Indexed: 11/07/2023]
Abstract
With the need for psychological help long exceeding the supply, finding ways of scaling, and better allocating mental health support is a necessity. This paper contributes by investigating how to best predict intervention dropout and failure to allow for a need-based adaptation of treatment. We systematically compare the predictive power of different text representation methods (metadata, TF-IDF, sentiment and topic analysis, and word embeddings) in combination with supplementary numerical inputs (socio-demographic, evaluation, and closed-question data). Additionally, we address the research gap of which ML model types - ranging from linear to sophisticated deep learning models - are best suited for different features and outcome variables. To this end, we analyze nearly 16.000 open-text answers from 849 German-speaking users in a Digital Mental Health Intervention (DMHI) for stress. Our research proves that - contrary to previous findings - there is great promise in using neural network approaches on DMHI text data. We propose a task-specific LSTM-based model architecture to tackle the challenge of long input sequences and thereby demonstrate the potential of word embeddings (AUC scores of up to 0.7) for predictions in DMHIs. Despite the relatively small data set, sequential deep learning models, on average, outperform simpler features such as metadata and bag-of-words approaches when predicting dropout. The conclusion is that user-generated text of the first two sessions carries predictive power regarding patients' dropout and intervention failure risk. Furthermore, the match between the sophistication of features and models needs to be closely considered to optimize results, and additional non-text features increase prediction results. Supplementary Information The online version contains supplementary material available at 10.1007/s41666-023-00148-z.
Collapse
Affiliation(s)
- Kirsten Zantvoort
- Institute of Information Systems, Leuphana University, Lüneburg, Germany
| | | | - Leif Boß
- Institute of Psychology, Leuphana University, Lüneburg, Germany
| | - Dirk Lehr
- Institute of Psychology, Leuphana University, Lüneburg, Germany
| | - Burkhardt Funk
- Institute of Information Systems, Leuphana University, Lüneburg, Germany
| |
Collapse
|
13
|
Michelsen C, Jørgensen CC, Heltberg M, Jensen MH, Lucchetti A, Petersen PB, Petersen T, Kehlet H, Madsen F, Hansen TB, Gromov K, Jakobsen T, Varnum C, Overgaard S, Rathsach M, Hansen L. Machine-learning vs. logistic regression for preoperative prediction of medical morbidity after fast-track hip and knee arthroplasty-a comparative study. BMC Anesthesiol 2023; 23:391. [PMID: 38030979 PMCID: PMC10685559 DOI: 10.1186/s12871-023-02354-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 11/21/2023] [Indexed: 12/01/2023] Open
Abstract
BACKGROUND Machine-learning models may improve prediction of length of stay (LOS) and morbidity after surgery. However, few studies include fast-track programs, and most rely on administrative coding with limited follow-up and information on perioperative care. This study investigates potential benefits of a machine-learning model for prediction of postoperative morbidity in fast-track total hip (THA) and knee arthroplasty (TKA). METHODS Cohort study in consecutive unselected primary THA/TKA between 2014-2017 from seven Danish centers with established fast-track protocols. Preoperative comorbidity and prescribed medication were recorded prospectively and information on length of stay and readmissions was obtained through the Danish National Patient Registry and medical records. We used a machine-learning model (Boosted Decision Trees) based on boosted decision trees with 33 preoperative variables for predicting "medical" morbidity leading to LOS > 4 days or 90-days readmissions and compared to a logistical regression model based on the same variables. We also evaluated two parsimonious models, using the ten most important variables in the full machine-learning and logistic regression models. Data collected between 2014-2016 (n:18,013) was used for model training and data from 2017 (n:3913) was used for testing. Model performances were analyzed using precision, area under receiver operating (AUROC) and precision recall curves (AUPRC), as well as the Mathews Correlation Coefficient. Variable importance was analyzed using Shapley Additive Explanations values. RESULTS Using a threshold of 20% "risk-patients" (n:782), precision, AUROC and AUPRC were 13.6%, 76.3% and 15.5% vs. 12.4%, 74.7% and 15.6% for the machine-learning and logistic regression model, respectively. The parsimonious machine-learning model performed better than the full logistic regression model. Of the top ten variables, eight were shared between the machine-learning and logistic regression models, but with a considerable age-related variation in importance of specific types of medication. CONCLUSION A machine-learning model using preoperative characteristics and prescriptions slightly improved identification of patients in high-risk of "medical" complications after fast-track THA and TKA compared to a logistic regression model. Such algorithms could help find a manageable population of patients who may benefit most from intensified perioperative care.
Collapse
Affiliation(s)
- Christian Michelsen
- The Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100, Copenhagen, Denmark
| | - Christoffer C Jørgensen
- Department of Anesthesia and Intensive Care, Hospital of Northern Zealand, Dyrehavevej 29 3400, Hillerød, Denmark.
- The Centre for Fast-Track Hip and Knee Replacement, 7621, Rigshospitalet, Blegdamsvej 9, 2100, Copenhagen, Denmark.
| | - Mathias Heltberg
- The Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100, Copenhagen, Denmark
| | - Mogens H Jensen
- The Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100, Copenhagen, Denmark
| | - Alessandra Lucchetti
- The Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100, Copenhagen, Denmark
| | - Pelle B Petersen
- Department of Anesthesia and Intensive Care, Hospital of Northern Zealand, Dyrehavevej 29 3400, Hillerød, Denmark
- The Centre for Fast-Track Hip and Knee Replacement, 7621, Rigshospitalet, Blegdamsvej 9, 2100, Copenhagen, Denmark
| | - Troels Petersen
- The Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100, Copenhagen, Denmark
| | - Henrik Kehlet
- The Centre for Fast-Track Hip and Knee Replacement, 7621, Rigshospitalet, Blegdamsvej 9, 2100, Copenhagen, Denmark
- Section of Surgical Pathophysiology, 7621, Rigshospitalet, Blegdamsvej 9, 2100, Copenhagen, Denmark
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Baygül Eden A, Bakir Kayi A, Erdem MG, Demirci M. COVID-19 studies involving machine learning methods: A bibliometric study. Medicine (Baltimore) 2023; 102:e35564. [PMID: 37904407 PMCID: PMC10615482 DOI: 10.1097/md.0000000000035564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 09/18/2023] [Indexed: 11/01/2023] Open
Abstract
BACKGROUND Machine learning (ML) and artificial intelligence (AI) techniques are gaining popularity as effective tools for coronavirus disease of 2019 (COVID-19) research. These strategies can be used in diagnosis, prognosis, therapy, and public health management. Bibliometric analysis quantifies the quality and impact of scholarly publications. ML in COVID-19 research is the focus of this bibliometric analysis. METHODS A comprehensive literature study found ML-based COVID-19 research. Web of Science (WoS) was used for the study. The searches included "machine learning," "artificial intelligence," and COVID-19. To find all relevant studies, 2 reviewers searched independently. The network visualization was analyzed using VOSviewer 1.6.19. RESULTS In the WoS Core, the average citation count was 13.6 ± 41.3. The main research areas were computer science, engineering, and science and technology. According to document count, Tao Huang wrote 14 studies, Fadi Al-Turjman wrote 11, and Imran Ashraf wrote 11. The US, China, and India produced the most studies and citations. The most prolific research institutions were Harvard Medical School, Huazhong University of Science and Technology, and King Abdulaziz University. In contrast, Nankai University, Oxford, and Imperial College London were the most mentioned organizations, reflecting their significant research contributions. First, "Covid-19" appeared 1983 times, followed by "machine learning" and "deep learning." The US Department of Health and Human Services funded this topic most heavily. Huang Tao, Feng Kaiyan, and Ashraf Imran pioneered bibliographic coupling. CONCLUSION This study provides useful insights for academics and clinicians studying COVID-19 using ML. Through bibliometric data analysis, scholars can learn about highly recognized and productive authors and countries, as well as the publications with the most citations and keywords. New data and methodologies from the pandemic are expected to advance ML and AI modeling. It is crucial to recognize that these studies will pioneer this subject.
Collapse
Affiliation(s)
- Arzu Baygül Eden
- Koç University, School of Medicine, Department of Biostatistics, Istanbul, Turkey
| | - Alev Bakir Kayi
- Istanbul University, Institute of Child Health, Department of Social Pediatrics, Istanbul, Türkiye
| | - Mustafa Genco Erdem
- Department of Internal Medicine, Faculty of Medicine, Beykent University, Istanbul, Turkey
| | - Mehmet Demirci
- Department of Medical Microbiology, Faculty of Medicine, Kirklareli University, Kirklareli, Turkey
| |
Collapse
|
15
|
Klement W, El Emam K. Consolidated Reporting Guidelines for Prognostic and Diagnostic Machine Learning Modeling Studies: Development and Validation. J Med Internet Res 2023; 25:e48763. [PMID: 37651179 PMCID: PMC10502599 DOI: 10.2196/48763] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 07/11/2023] [Accepted: 07/31/2023] [Indexed: 09/01/2023] Open
Abstract
BACKGROUND The reporting of machine learning (ML) prognostic and diagnostic modeling studies is often inadequate, making it difficult to understand and replicate such studies. To address this issue, multiple consensus and expert reporting guidelines for ML studies have been published. However, these guidelines cover different parts of the analytics lifecycle, and individually, none of them provide a complete set of reporting requirements. OBJECTIVE We aimed to consolidate the ML reporting guidelines and checklists in the literature to provide reporting items for prognostic and diagnostic ML in in-silico and shadow mode studies. METHODS We conducted a literature search that identified 192 unique peer-reviewed English articles that provide guidance and checklists for reporting ML studies. The articles were screened by their title and abstract against a set of 9 inclusion and exclusion criteria. Articles that were filtered through had their quality evaluated by 2 raters using a 9-point checklist constructed from guideline development good practices. The average κ was 0.71 across all quality criteria. The resulting 17 high-quality source papers were defined as having a quality score equal to or higher than the median. The reporting items in these 17 articles were consolidated and screened against a set of 6 inclusion and exclusion criteria. The resulting reporting items were sent to an external group of 11 ML experts for review and updated accordingly. The updated checklist was used to assess the reporting in 6 recent modeling papers in JMIR AI. Feedback from the external review and initial validation efforts was used to improve the reporting items. RESULTS In total, 37 reporting items were identified and grouped into 5 categories based on the stage of the ML project: defining the study details, defining and collecting the data, modeling methodology, model evaluation, and explainability. None of the 17 source articles covered all the reporting items. The study details and data description reporting items were the most common in the source literature, with explainability and methodology guidance (ie, data preparation and model training) having the least coverage. For instance, a median of 75% of the data description reporting items appeared in each of the 17 high-quality source guidelines, but only a median of 33% of the data explainability reporting items appeared. The highest-quality source articles tended to have more items on reporting study details. Other categories of reporting items were not related to the source article quality. We converted the reporting items into a checklist to support more complete reporting. CONCLUSIONS Our findings supported the need for a set of consolidated reporting items, given that existing high-quality guidelines and checklists do not individually provide complete coverage. The consolidated set of reporting items is expected to improve the quality and reproducibility of ML modeling studies.
Collapse
Affiliation(s)
- William Klement
- University of Ottawa, Ottawa, ON, Canada
- CHEO Research Institute, Ottawa, ON, Canada
| | - Khaled El Emam
- University of Ottawa, Ottawa, ON, Canada
- CHEO Research Institute, Ottawa, ON, Canada
| |
Collapse
|
16
|
Severin A, Strinzel M, Egger M, Barros T, Sokolov A, Mouatt JV, Müller S. Relationship between journal impact factor and the thoroughness and helpfulness of peer reviews. PLoS Biol 2023; 21:e3002238. [PMID: 37643173 PMCID: PMC10464996 DOI: 10.1371/journal.pbio.3002238] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 07/06/2023] [Indexed: 08/31/2023] Open
Abstract
The Journal Impact Factor is often used as a proxy measure for journal quality, but the empirical evidence is scarce. In particular, it is unclear how peer review characteristics for a journal relate to its impact factor. We analysed 10,000 peer review reports submitted to 1,644 biomedical journals with impact factors ranging from 0.21 to 74.7. Two researchers hand-coded sentences using categories of content related to the thoroughness of the review (Materials and Methods, Presentation and Reporting, Results and Discussion, Importance and Relevance) and helpfulness (Suggestion and Solution, Examples, Praise, Criticism). We fine-tuned and validated transformer machine learning language models to classify sentences. We then examined the association between the number and percentage of sentences addressing different content categories and 10 groups defined by the Journal Impact Factor. The median length of reviews increased with higher impact factor, from 185 words (group 1) to 387 words (group 10). The percentage of sentences addressing Materials and Methods was greater in the highest Journal Impact Factor journals than in the lowest Journal Impact Factor group. The results for Presentation and Reporting went in the opposite direction, with the highest Journal Impact Factor journals giving less emphasis to such content. For helpfulness, reviews for higher impact factor journals devoted relatively less attention to Suggestion and Solution than lower impact factor journals. In conclusion, peer review in journals with higher impact factors tends to be more thorough, particularly in addressing study methods while giving relatively less emphasis to presentation or suggesting solutions. Differences were modest and variability high, indicating that the Journal Impact Factor is a bad predictor of the quality of peer review of an individual manuscript.
Collapse
Affiliation(s)
- Anna Severin
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Graduate School for Health Sciences, University of Bern, Bern, Switzerland
| | | | - Matthias Egger
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Swiss National Science Foundation, Bern, Switzerland
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | | | | | | | - Stefan Müller
- School of Politics and International Relations, University College Dublin, Dublin, Ireland
| |
Collapse
|
17
|
Kaya Bicer E, Fangerau H, Sur H. Artifical intelligence use in orthopedics: an ethical point of view. EFORT Open Rev 2023; 8:592-596. [PMID: 37526254 PMCID: PMC10441251 DOI: 10.1530/eor-23-0083] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 08/02/2023] Open
Abstract
Artificial intelligence (AI) is increasingly being utilized in orthopedics practice. Ethical concerns have arisen alongside marked improvements and widespread utilization of AI. Patient privacy, consent, data protection, cybersecurity, data safety and monitoring, bias, and accountability are some of the ethical concerns.
Collapse
Affiliation(s)
- Elcil Kaya Bicer
- Department of Orthopedics and Traumatology, Ege University Faculty of Medicine, Izmir, Turkey
| | - Heiner Fangerau
- Department of the History, Philosophy and Ethics of Medicine, Heinrich-Heine-Universität Düsseldorf, Germany
| | - Hakki Sur
- Department of Orthopedics and Traumatology, Ege University Faculty of Medicine, Izmir, Turkey
| |
Collapse
|
18
|
Lisacek-Kiosoglous AB, Powling AS, Fontalis A, Gabr A, Mazomenos E, Haddad FS. Artificial intelligence in orthopaedic surgery. Bone Joint Res 2023; 12:447-454. [PMID: 37423607 DOI: 10.1302/2046-3758.127.bjr-2023-0111.r1] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 07/11/2023] Open
Abstract
The use of artificial intelligence (AI) is rapidly growing across many domains, of which the medical field is no exception. AI is an umbrella term defining the practical application of algorithms to generate useful output, without the need of human cognition. Owing to the expanding volume of patient information collected, known as 'big data', AI is showing promise as a useful tool in healthcare research and across all aspects of patient care pathways. Practical applications in orthopaedic surgery include: diagnostics, such as fracture recognition and tumour detection; predictive models of clinical and patient-reported outcome measures, such as calculating mortality rates and length of hospital stay; and real-time rehabilitation monitoring and surgical training. However, clinicians should remain cognizant of AI's limitations, as the development of robust reporting and validation frameworks is of paramount importance to prevent avoidable errors and biases. The aim of this review article is to provide a comprehensive understanding of AI and its subfields, as well as to delineate its existing clinical applications in trauma and orthopaedic surgery. Furthermore, this narrative review expands upon the limitations of AI and future direction.
Collapse
Affiliation(s)
- Anthony B Lisacek-Kiosoglous
- Department of Trauma and Orthopaedic Surgery, University College London Hospitals NHS Foundation Trust, London, UK
| | - Amber S Powling
- Department of Trauma and Orthopaedic Surgery, University College London Hospitals NHS Foundation Trust, London, UK
- Barts and The London School of Medicine and Dentistry, School of Medicine London, London, UK
| | - Andreas Fontalis
- Department of Trauma and Orthopaedic Surgery, University College London Hospitals NHS Foundation Trust, London, UK
- Division of Surgery and Interventional Science, University College London, London, UK
- Wellcome / EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
| | - Ayman Gabr
- Department of Trauma and Orthopaedic Surgery, University College London Hospitals NHS Foundation Trust, London, UK
| | - Evangelos Mazomenos
- Wellcome / EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
| | - Fares S Haddad
- Department of Trauma and Orthopaedic Surgery, University College London Hospitals NHS Foundation Trust, London, UK
- Division of Surgery and Interventional Science, University College London, London, UK
| |
Collapse
|
19
|
Palacios Gomez M. Human intelligence for authors, reviewers and editors using artificial intelligence. Colomb Med (Cali) 2023; 54:e1005867. [PMID: 38076466 PMCID: PMC10702474 DOI: 10.25100/cm.v54i3.5867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2023] Open
Abstract
We call artificial intelligence any machine that processes information with some purpose, complying with the logical rules of Turing's computation described more than 70 years ago. These machines work with instructions called algorithms, a finite and well-defined sequence of information processing implemented by automata (computers) or any digital technology to optimize a process. (2) This means that the purpose of artificial intelligence is optimization.
Optimization is the ability to do or solve something in the most efficient way possible and, in the best case, using the least amount of resources. The intended optimization is programmed and preset by humans; therefore, these technologies are tools humans create for human purposes.
The optimization capability of artificial intelligence is staggering. It is estimated that using artificial intelligence will facilitate the achievement of 134 of the 169 goals agreed in the 2030 Agenda for Sustainable Development. However, in this evaluation, it was projected that it could negatively affect the progress of 59 goals of the same agreement, being social, economic, educational, legal and gender inequality, the phenomenon most affected by artificial intelligence.
This projection shows us that it is necessary to counterbalance the development and implementation of processes mediated by artificial intelligence, to maintain reflection and question the influence of these technological tools, and, above all, to be based on human intelligence. A definition of human intelligence in the data science and artificial intelligence environment would be a collection of contextual tacit knowledge about human values, responsibility, empathy, intuition, or care for another living being that algorithms cannot describe or execute.
Collapse
Affiliation(s)
- Mauricio Palacios Gomez
- Editor en jefe de la Revista Colombia Médica, Facultad de salud, Universidad del Valle, Cali, Colombia
| |
Collapse
|
20
|
Fraser AG, Biasin E, Bijnens B, Bruining N, Caiani EG, Cobbaert K, Davies RH, Gilbert SH, Hovestadt L, Kamenjasevic E, Kwade Z, McGauran G, O'Connor G, Vasey B, Rademakers FE. Artificial intelligence in medical device software and high-risk medical devices - a review of definitions, expert recommendations and regulatory initiatives. Expert Rev Med Devices 2023; 20:467-491. [PMID: 37157833 DOI: 10.1080/17434440.2023.2184685] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
INTRODUCTION Artificial intelligence (AI) encompasses a wide range of algorithms with risks when used to support decisions about diagnosis or treatment, so professional and regulatory bodies are recommending how they should be managed. AREAS COVERED AI systems may qualify as standalone medical device software (MDSW) or be embedded within a medical device. Within the European Union (EU) AI software must undergo a conformity assessment procedure to be approved as a medical device. The draft EU Regulation on AI proposes rules that will apply across industry sectors, while for devices the Medical Device Regulation also applies. In the CORE-MD project (Coordinating Research and Evidence for Medical Devices), we have surveyed definitions and summarize initiatives made by professional consensus groups, regulators, and standardization bodies. EXPERT OPINION The level of clinical evidence required should be determined according to each application and to legal and methodological factors that contribute to risk, including accountability, transparency, and interpretability. EU guidance for MDSW based on international recommendations does not yet describe the clinical evidence needed for medical AI software. Regulators, notified bodies, manufacturers, clinicians and patients would all benefit from common standards for the clinical evaluation of high-risk AI applications and transparency of their evidence and performance.
Collapse
Affiliation(s)
- Alan G Fraser
- University Hospital of Wales, School of Medicine, Cardiff University, Heath Park, Cardiff, U.K
- KU Leuven, Leuven, Belgium
| | | | - Bart Bijnens
- Engineering Sciences, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
| | - Nico Bruining
- Department of Clinical and Experimental Information processing (Digital Cardiology), Erasmus Medical Center, Thoraxcenter, Rotterdam, the Netherlands
| | - Enrico G Caiani
- Department of Electronics, Information and Biomedical Engineering, Politecnico di Milano, Milan, Italy
| | | | - Rhodri H Davies
- Institute of Cardiovascular Science, University College London, London, U.K
| | - Stephen H Gilbert
- Technische Universität Dresden, Else Kröner Fresenius Center for Digital Health, Dresden, Germany
| | | | | | | | | | | | - Baptiste Vasey
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
| | | |
Collapse
|
21
|
Ezugwu AE, Oyelade ON, Ikotun AM, Agushaka JO, Ho YS. Machine Learning Research Trends in Africa: A 30 Years Overview with Bibliometric Analysis Review. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING : STATE OF THE ART REVIEWS 2023; 30:1-31. [PMID: 37359741 PMCID: PMC10148585 DOI: 10.1007/s11831-023-09930-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Accepted: 04/19/2023] [Indexed: 06/28/2023]
Abstract
The machine learning (ML) paradigm has gained much popularity today. Its algorithmic models are employed in every field, such as natural language processing, pattern recognition, object detection, image recognition, earth observation and many other research areas. In fact, machine learning technologies and their inevitable impact suffice in many technological transformation agendas currently being propagated by many nations, for which the already yielded benefits are outstanding. From a regional perspective, several studies have shown that machine learning technology can help address some of Africa's most pervasive problems, such as poverty alleviation, improving education, delivering quality healthcare services, and addressing sustainability challenges like food security and climate change. In this state-of-the-art paper, a critical bibliometric analysis study is conducted, coupled with an extensive literature survey on recent developments and associated applications in machine learning research with a perspective on Africa. The presented bibliometric analysis study consists of 2761 machine learning-related documents, of which 89% were articles with at least 482 citations published in 903 journals during the past three decades. Furthermore, the collated documents were retrieved from the Science Citation Index EXPANDED, comprising research publications from 54 African countries between 1993 and 2021. The bibliometric study shows the visualization of the current landscape and future trends in machine learning research and its application to facilitate future collaborative research and knowledge exchange among authors from different research institutions scattered across the African continent.
Collapse
Affiliation(s)
- Absalom E. Ezugwu
- Unit for Data Science and Computing, North-West University, 11 Hoffman Street, Potchefstroom, 2520 South Africa
| | - Olaide N. Oyelade
- Department of Computer Science, Faculty of Physical Sciences, Ahmadu Bello University, Zaria, Nigeria
| | - Abiodun M. Ikotun
- Unit for Data Science and Computing, North-West University, 11 Hoffman Street, Potchefstroom, 2520 South Africa
| | - Jeffery O. Agushaka
- Unit for Data Science and Computing, North-West University, 11 Hoffman Street, Potchefstroom, 2520 South Africa
| | - Yuh-Shan Ho
- Trend Research Centre, Asia University, No. 500, Lioufeng RoadWufeng, Taichung, 41354 Taiwan
| |
Collapse
|
22
|
Anttila TT, Karjalainen TV, Mäkelä TO, Waris EM, Lindfors NC, Leminen MM, Ryhänen JO. Detecting Distal Radius Fractures Using a Segmentation-Based Deep Learning Model. J Digit Imaging 2023; 36:679-687. [PMID: 36542269 PMCID: PMC10039188 DOI: 10.1007/s10278-022-00741-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 11/08/2022] [Accepted: 11/09/2022] [Indexed: 12/24/2022] Open
Abstract
Deep learning algorithms can be used to classify medical images. In distal radius fracture treatment, fracture detection and radiographic assessment of fracture displacement are critical steps. The aim of this study was to use pixel-level annotations of fractures to develop a deep learning model for precise distal radius fracture detection. We randomly divided 3785 consecutive emergency wrist radiograph examinations from six hospitals to a training set (3399 examinations) and test set (386 examinations). The training set was used to develop the deep learning model and the test set to assess its validity. The consensus of three hand surgeons was used as the gold standard for the test set. The area under the ROC curve was 0.97 (CI 0.95-0.98) and 0.95 (CI 0.92-0.98) for examinations without a cast. Fractures were identified with higher accuracy in the postero-anterior radiographs than in the lateral radiographs. Our deep learning model performed well in our multi-hospital and multi-radiograph system manufacturer settings. Thus, segmentation-based deep learning models may provide additional benefit. Further research is needed with algorithm comparison and external validation.
Collapse
Affiliation(s)
- Turkka T Anttila
- Musculoskeletal and Plastic Surgery, Department of Hand Surgery, University of Helsinki and Helsinki University Hospital, Topeliuksenkatu 5B, Helsinki, 00260, Finland.
| | - Teemu V Karjalainen
- Department of Orthopedics, Traumatology and Hand Surgery, Central Finland Hospital, Jyvaskyla, Finland
| | - Teemu O Mäkelä
- Medical Imaging Center, Radiology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
- Department of Physics, University of Helsinki, Helsinki, Finland
| | - Eero M Waris
- Musculoskeletal and Plastic Surgery, Department of Hand Surgery, University of Helsinki and Helsinki University Hospital, Topeliuksenkatu 5B, Helsinki, 00260, Finland
| | - Nina C Lindfors
- Musculoskeletal and Plastic Surgery, Department of Hand Surgery, University of Helsinki and Helsinki University Hospital, Topeliuksenkatu 5B, Helsinki, 00260, Finland
| | - Miika M Leminen
- Analytics and AI Development Services, IT Department, Helsinki University Hospital, Helsinki, Finland
- Department of Otorhinolaryngology and Phoniatrics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Jorma O Ryhänen
- Musculoskeletal and Plastic Surgery, Department of Hand Surgery, University of Helsinki and Helsinki University Hospital, Topeliuksenkatu 5B, Helsinki, 00260, Finland
| |
Collapse
|
23
|
Entezari B, Koucheki R, Abbas A, Toor J, Wolfstadt JI, Ravi B, Whyne C, Lex JR. Improving Resource Utilization for Arthroplasty Care by Leveraging Machine Learning and Optimization: A Systematic Review. Arthroplast Today 2023; 20:101116. [PMID: 36938350 PMCID: PMC10014272 DOI: 10.1016/j.artd.2023.101116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 01/28/2023] [Indexed: 03/21/2023] Open
Abstract
Background There is a growing demand for total joint arthroplasty (TJA) surgery. The applications of machine learning (ML), mathematical optimization, and computer simulation have the potential to improve efficiency of TJA care delivery through outcome prediction and surgical scheduling optimization, easing the burden on health-care systems. The purpose of this study was to evaluate strategies using advances in analytics and computational modeling that may improve planning and the overall efficiency of TJA care. Methods A systematic review including MEDLINE, Embase, and IEEE Xplore databases was completed from inception to October 3, 2022, for identification of studies generating ML models for TJA length of stay, duration of surgery, and hospital readmission prediction. A scoping review of optimization strategies in elective surgical scheduling was also conducted. Results Twenty studies were included for evaluating ML predictions and 17 in the scoping review of scheduling optimization. Among studies generating linear or logistic control models alongside ML models, only 1 found a control model to outperform its ML counterpart. Furthermore, neural networks performed superior to or at the same level as conventional ML models in all but 1 study. Implementation of mathematical and simulation strategies improved the optimization efficiency when compared to traditional scheduling methods at the operational level. Conclusions High-performing predictive ML-based models have been developed for TJA, as have mathematical strategies for elective surgical scheduling optimization. By leveraging artificial intelligence for outcome prediction and surgical optimization, there exist greater opportunities for improved resource utilization and cost-savings in TJA than when using traditional modeling and scheduling methods.
Collapse
Affiliation(s)
- Bahar Entezari
- Granovsky Gluskin Division of Orthopaedics, Mount Sinai Hospital, Toronto, Ontario, Canada
- Queen’s University School of Medicine, Kingston, Ontario, Canada
- Corresponding author. Mount Sinai Hospital, 15 Arch Street, Kingston, Ontario, Canada K7L 3N6. Tel.: +1 647 866 8729.
| | - Robert Koucheki
- Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
| | - Aazad Abbas
- Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Orthopaedic Biomechanics Lab, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Jay Toor
- Division of Orthopaedic Surgery, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Jesse I. Wolfstadt
- Granovsky Gluskin Division of Orthopaedics, Mount Sinai Hospital, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Bheeshma Ravi
- Division of Orthopaedic Surgery, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Holland Bone and Joint Program, Sunnybrook Health Science Centre, Toronto, Ontario, Canada
| | - Cari Whyne
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
- Orthopaedic Biomechanics Lab, Sunnybrook Research Institute, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Holland Bone and Joint Program, Sunnybrook Health Science Centre, Toronto, Ontario, Canada
| | - Johnathan R. Lex
- Orthopaedic Biomechanics Lab, Sunnybrook Research Institute, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
24
|
Banaye Yazdipour A, Masoorian H, Ahmadi M, Mohammadzadeh N, Ayyoubzadeh SM. Predicting the toxicity of nanoparticles using artificial intelligence tools: a systematic review. Nanotoxicology 2023; 17:62-77. [PMID: 36883698 DOI: 10.1080/17435390.2023.2186279] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]
Abstract
Nanoparticles have been used extensively in different scientific fields. Due to the possible destructive effects of nanoparticles on the environment or the biological systems, their toxicity evaluation is a crucial phase for studying nanomaterial safety. In the meantime, experimental approaches for toxicity assessment of various nanoparticles are expensive and time-consuming. Thus, an alternative technique, such as artificial intelligence (AI), could be valuable for predicting nanoparticle toxicity. Therefore, in this review, the AI tools were investigated for the toxicity assessment of nanomaterials. To this end, a systematic search was performed on PubMed, Web of Science, and Scopus databases. Articles were included or excluded based on pre-defined inclusion and exclusion criteria, and duplicate studies were excluded. Finally, twenty-six studies were included. The majority of the studies were conducted on metal oxide and metallic nanoparticles. In addition, Random Forest (RF) and Support Vector Machine (SVM) had the most frequency in the included studies. Most of the models demonstrated acceptable performance. Overall, AI could provide a robust, fast, and low-cost tool for the evaluation of nanoparticle toxicity.
Collapse
Affiliation(s)
- Alireza Banaye Yazdipour
- Department of Health Information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran.,Students' Scientific Research Center (SSRC), Tehran University of Medical Sciences, Tehran, Iran
| | - Hoorie Masoorian
- Department of Health Information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Mahnaz Ahmadi
- Department of Pharmaceutics and Pharmaceutical Nanotechnology, School of Pharmacy, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Niloofar Mohammadzadeh
- Department of Health Information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Seyed Mohammad Ayyoubzadeh
- Department of Health Information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
25
|
Heredia-Negron F, Alamo-Rodriguez N, Oyola-Velazquez L, Nieves B, Carrasquillo K, Hochheiser H, Fristensky B, Daluz-Santana I, Fernandez-Repollet E, Roche-Lima A. Evaluation of AIML + HDR-A Course to Enhance Data Science Workforce Capacity for Hispanic Biomedical Researchers. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:2726. [PMID: 36768092 PMCID: PMC9914971 DOI: 10.3390/ijerph20032726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 01/25/2023] [Accepted: 01/29/2023] [Indexed: 06/18/2023]
Abstract
Artificial intelligence (AI) and machine learning (ML) facilitate the creation of revolutionary medical techniques. Unfortunately, biases in current AI and ML approaches are perpetuating minority health inequity. One of the strategies to solve this problem is training a diverse workforce. For this reason, we created the course "Artificial Intelligence and Machine Learning applied to Health Disparities Research (AIML + HDR)" which applied general Data Science (DS) approaches to health disparities research with an emphasis on Hispanic populations. Some technical topics covered included the Jupyter Notebook Framework, coding with R and Python to manipulate data, and ML libraries to create predictive models. Some health disparities topics covered included Electronic Health Records, Social Determinants of Health, and Bias in Data. As a result, the course was taught to 34 selected Hispanic participants and evaluated by a survey on a Likert scale (0-4). The surveys showed high satisfaction (more than 80% of participants agreed) regarding the course organization, activities, and covered topics. The students strongly agreed that the activities were relevant to the course and promoted their learning (3.71 ± 0.21). The students strongly agreed that the course was helpful for their professional development (3.76 ± 0.18). The open question was quantitatively analyzed and showed that seventy-five percent of the comments received from the participants confirmed their great satisfaction.
Collapse
Affiliation(s)
- Frances Heredia-Negron
- RCMI-CCRHD Program, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| | - Natalie Alamo-Rodriguez
- RCMI-CCRHD Program, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| | - Lenamari Oyola-Velazquez
- Department of Public Health, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| | - Brenda Nieves
- RCMI-CCRHD Program, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| | - Kelvin Carrasquillo
- RCMI-CCRHD Program, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| | - Harry Hochheiser
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Brian Fristensky
- Department of Plant Science, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
| | - Istoni Daluz-Santana
- Department of Biostatistics and Epidemiology, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| | - Emma Fernandez-Repollet
- RCMI-CCRHD Program, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| | - Abiel Roche-Lima
- RCMI-CCRHD Program, Medical Sciences Campus, University of Puerto Rico, San Juan 00934, Puerto Rico
| |
Collapse
|
26
|
Cerdá-Alberich L, Solana J, Mallol P, Ribas G, García-Junco M, Alberich-Bayarri A, Marti-Bonmati L. MAIC-10 brief quality checklist for publications using artificial intelligence and medical images. Insights Imaging 2023; 14:11. [PMID: 36645542 PMCID: PMC9842808 DOI: 10.1186/s13244-022-01355-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 12/20/2022] [Indexed: 01/17/2023] Open
Abstract
The use of artificial intelligence (AI) with medical images to solve clinical problems is becoming increasingly common, and the development of new AI solutions is leading to more studies and publications using this computational technology. As a novel research area, the use of common standards to aid AI developers and reviewers as quality control criteria will improve the peer review process. Although some guidelines do exist, their heterogeneity and extension advocate that more explicit and simple schemes should be applied on the publication practice. Based on a review of existing AI guidelines, a proposal which collects, unifies, and simplifies the most relevant criteria was developed. The MAIC-10 (Must AI Criteria-10) checklist with 10 items was implemented as a guide to design studies and evaluate publications related to AI in the field of medical imaging. Articles published in Insights into Imaging in 2021 were selected to calculate their corresponding MAIC-10 quality score. The mean score was found to be 5.6 ± 1.6, with critical items present in most articles, such as "Clinical need", "Data annotation", "Robustness", and "Transparency" present in more than 80% of papers, while improvements in other areas were identified. MAIC-10 was also observed to achieve the highest intra-observer reproducibility when compared to other existing checklists, with an overall reduction in terms of checklist length and complexity. In summary, MAIC-10 represents a short and simple quality assessment tool which is objective, robust and widely applicable to AI studies in medical imaging.
Collapse
Affiliation(s)
- Leonor Cerdá-Alberich
- grid.84393.350000 0001 0360 9602Clinical Medical Imaging Area and Biomedical Imaging Research Group (GIBI230-PREBI), Hospital Universitario y Politécnico La Fe – Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| | - Jimena Solana
- grid.84393.350000 0001 0360 9602Clinical Medical Imaging Area and Biomedical Imaging Research Group (GIBI230-PREBI), Hospital Universitario y Politécnico La Fe – Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| | - Pedro Mallol
- grid.84393.350000 0001 0360 9602Clinical Medical Imaging Area and Biomedical Imaging Research Group (GIBI230-PREBI), Hospital Universitario y Politécnico La Fe – Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| | - Gloria Ribas
- grid.84393.350000 0001 0360 9602Clinical Medical Imaging Area and Biomedical Imaging Research Group (GIBI230-PREBI), Hospital Universitario y Politécnico La Fe – Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| | - Miguel García-Junco
- grid.84393.350000 0001 0360 9602Clinical Medical Imaging Area and Biomedical Imaging Research Group (GIBI230-PREBI), Hospital Universitario y Politécnico La Fe – Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| | - Angel Alberich-Bayarri
- grid.84393.350000 0001 0360 9602Clinical Medical Imaging Area and Biomedical Imaging Research Group (GIBI230-PREBI), Hospital Universitario y Politécnico La Fe – Instituto de Investigación Sanitaria La Fe, Valencia, Spain ,Quantitative Imaging Biomarkers in Medicine, Quibim SL, Valencia, Spain
| | - Luis Marti-Bonmati
- grid.84393.350000 0001 0360 9602Clinical Medical Imaging Area and Biomedical Imaging Research Group (GIBI230-PREBI), Hospital Universitario y Politécnico La Fe – Instituto de Investigación Sanitaria La Fe, Valencia, Spain
| |
Collapse
|
27
|
Prijs J, Liao Z, To MS, Verjans J, Jutte PC, Stirler V, Olczak J, Gordon M, Guss D, DiGiovanni CW, Jaarsma RL, IJpma FFA, Doornberg JN. Development and external validation of automated detection, classification, and localization of ankle fractures: inside the black box of a convolutional neural network (CNN). Eur J Trauma Emerg Surg 2022; 49:1057-1069. [PMID: 36374292 PMCID: PMC10175446 DOI: 10.1007/s00068-022-02136-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 10/10/2022] [Indexed: 11/16/2022]
Abstract
Abstract
Purpose
Convolutional neural networks (CNNs) are increasingly being developed for automated fracture detection in orthopaedic trauma surgery. Studies to date, however, are limited to providing classification based on the entire image—and only produce heatmaps for approximate fracture localization instead of delineating exact fracture morphology. Therefore, we aimed to answer (1) what is the performance of a CNN that detects, classifies, localizes, and segments an ankle fracture, and (2) would this be externally valid?
Methods
The training set included 326 isolated fibula fractures and 423 non-fracture radiographs. The Detectron2 implementation of the Mask R-CNN was trained with labelled and annotated radiographs. The internal validation (or ‘test set’) and external validation sets consisted of 300 and 334 radiographs, respectively. Consensus agreement between three experienced fellowship-trained trauma surgeons was defined as the ground truth label. Diagnostic accuracy and area under the receiver operator characteristic curve (AUC) were used to assess classification performance. The Intersection over Union (IoU) was used to quantify accuracy of the segmentation predictions by the CNN, where a value of 0.5 is generally considered an adequate segmentation.
Results
The final CNN was able to classify fibula fractures according to four classes (Danis-Weber A, B, C and No Fracture) with AUC values ranging from 0.93 to 0.99. Diagnostic accuracy was 89% on the test set with average sensitivity of 89% and specificity of 96%. External validity was 89–90% accurate on a set of radiographs from a different hospital. Accuracies/AUCs observed were 100/0.99 for the ‘No Fracture’ class, 92/0.99 for ‘Weber B’, 88/0.93 for ‘Weber C’, and 76/0.97 for ‘Weber A’. For the fracture bounding box prediction by the CNN, a mean IoU of 0.65 (SD ± 0.16) was observed. The fracture segmentation predictions by the CNN resulted in a mean IoU of 0.47 (SD ± 0.17).
Conclusions
This study presents a look into the ‘black box’ of CNNs and represents the first automated delineation (segmentation) of fracture lines on (ankle) radiographs. The AUC values presented in this paper indicate good discriminatory capability of the CNN and substantiate further study of CNNs in detecting and classifying ankle fractures.
Level of evidence
II, Diagnostic imaging study.
Collapse
Affiliation(s)
- Jasper Prijs
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, The Netherlands.
- Department of Surgery, Groningen University Medical Centre, Groningen, The Netherlands.
- Department of Orthopaedic & Trauma Surgery, Flinders Medical Centre, Flinders University, Adelaide, Australia.
| | - Zhibin Liao
- Australian Institute for Machine Learning, Adelaide, Australia
| | - Minh-Son To
- College of Medicine and Public Health, Flinders University, Adelaide, Australia
- Department of Neurosurgery, Flinders Medical Center, Adelaide, Australia
| | - Johan Verjans
- Australian Institute for Machine Learning, Adelaide, Australia
| | - Paul C Jutte
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, The Netherlands
| | - Vincent Stirler
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, The Netherlands
| | - Jakub Olczak
- Institute of Clinical Sciences, Danderyd University Hospital, Karolinska Institute, Solna, Sweden
| | - Max Gordon
- Institute of Clinical Sciences, Danderyd University Hospital, Karolinska Institute, Solna, Sweden
| | - Daniel Guss
- Massachusetts General Hospital, Boston, USA
- Harvard Medical School, Boston, USA
| | | | - Ruurd L Jaarsma
- Department of Orthopaedic & Trauma Surgery, Flinders Medical Centre, Flinders University, Adelaide, Australia
| | - Frank F A IJpma
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, The Netherlands
| | - Job N Doornberg
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, The Netherlands
- Department of Orthopaedic & Trauma Surgery, Flinders Medical Centre, Flinders University, Adelaide, Australia
- College of Medicine and Public Health, Flinders University, Adelaide, Australia
| |
Collapse
|
28
|
Nelson AE, Arbeeva L. Narrative Review of Machine Learning in Rheumatic and Musculoskeletal Diseases for Clinicians and Researchers: Biases, Goals, and Future Directions. J Rheumatol 2022; 49:1191-1200. [PMID: 35840150 PMCID: PMC9633365 DOI: 10.3899/jrheum.220326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/21/2022] [Indexed: 11/22/2022]
Abstract
There has been rapid growth in the use of artificial intelligence (AI) analytics in medicine in recent years, including in rheumatic and musculoskeletal diseases (RMDs). Such methods represent a challenge to clinicians, patients, and researchers, given the "black box" nature of most algorithms, the unfamiliarity of the terms, and the lack of awareness of potential issues around these analyses. Therefore, this review aims to introduce this subject area in a way that is relevant and meaningful to clinicians and researchers. We hope to provide some insights into relevant strengths and limitations, reporting guidelines, as well as recent examples of such analyses in key areas, with a focus on lessons learned and future directions in diagnosis, phenotyping, prognosis, and precision medicine in RMDs.
Collapse
Affiliation(s)
- Amanda E Nelson
- A.E. Nelson, MD, MSCR, Department of Medicine, Division of Rheumatology, Allergy, and Immunology, University of North Carolina at Chapel Hill;
| | - Liubov Arbeeva
- L. Arbeeva, MS, Thurston Arthritis Research Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| |
Collapse
|
29
|
Alsoof D, McDonald CL, Kuris EO, Daniels AH. Machine Learning for the Orthopaedic Surgeon: Uses and Limitations. J Bone Joint Surg Am 2022; 104:1586-1594. [PMID: 35383655 DOI: 10.2106/jbjs.21.01305] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
➤ Machine learning is a subset of artificial intelligence in which computer algorithms are trained to make classifications and predictions based on patterns in data. The utilization of these techniques is rapidly expanding in the field of orthopaedic research. ➤ There are several domains in which machine learning has application to orthopaedics, including radiographic diagnosis, gait analysis, implant identification, and patient outcome prediction. ➤ Several limitations prevent the widespread use of machine learning in the daily clinical environment. However, future work can overcome these issues and enable machine learning tools to be a useful adjunct for orthopaedic surgeons in their clinical decision-making.
Collapse
Affiliation(s)
- Daniel Alsoof
- Department of Orthopedic Surgery, Warren Alpert Medical School of Brown University, Providence, Rhode Island
| | | | | | | |
Collapse
|
30
|
Reumann MK, Braun BJ, Menger MM, Springer F, Jazewitsch J, Schwarz T, Nüssler A, Histing T, Rollmann MFR. [Artificial intelligence and novel approaches for treatment of non-union in bone : From established standard methods in medicine up to novel fields of research]. UNFALLCHIRURGIE (HEIDELBERG, GERMANY) 2022; 125:611-618. [PMID: 35810261 DOI: 10.1007/s00113-022-01202-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 06/02/2022] [Indexed: 06/15/2023]
Abstract
Methods of artificial intelligence (AI) have found applications in many fields of medicine within the last few years. Some disciplines already use these methods regularly within their clinical routine. However, the fields of application are wide and there are still many opportunities to apply these new AI concepts. This review article gives an insight into the history of AI and defines the special terms and fields, such as machine learning (ML), neural networks and deep learning. The classical steps in developing AI models are demonstrated here, as well as the iteration of data rectification and preparation, the training of a model and subsequent validation before transfer into a clinical setting are explained. Currently, musculoskeletal disciplines implement methods of ML and also neural networks, e.g. for identification of fractures or for classifications. Also, predictive models based on risk factor analysis for prevention of complications are being initiated. As non-union in bone is a rare but very complex disease with dramatic socioeconomic impact for the healthcare system, many open questions arise which could be better understood by using methods of AI in the future. New fields of research applying AI models range from predictive models and cost analysis to personalized treatment strategies.
Collapse
Affiliation(s)
- Marie K Reumann
- Klinik für Unfall- und Wiederherstellungschirurgie an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Schnarrenbergstr. 95, 72076, Tübingen, Deutschland.
- Siegfried Weller Institut für Unfallmedizinische Forschung an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Tübingen, Deutschland.
| | - Benedikt J Braun
- Klinik für Unfall- und Wiederherstellungschirurgie an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Schnarrenbergstr. 95, 72076, Tübingen, Deutschland
| | - Maximilian M Menger
- Klinik für Unfall- und Wiederherstellungschirurgie an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Schnarrenbergstr. 95, 72076, Tübingen, Deutschland
| | - Fabian Springer
- Klinik für Diagnostische und Interventionelle Radiologie, Eberhard Karls Universität Tübingen, Tübingen, Deutschland
| | - Johann Jazewitsch
- Siegfried Weller Institut für Unfallmedizinische Forschung an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Tübingen, Deutschland
| | - Tobias Schwarz
- Siegfried Weller Institut für Unfallmedizinische Forschung an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Tübingen, Deutschland
| | - Andreas Nüssler
- Siegfried Weller Institut für Unfallmedizinische Forschung an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Tübingen, Deutschland
| | - Tina Histing
- Klinik für Unfall- und Wiederherstellungschirurgie an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Schnarrenbergstr. 95, 72076, Tübingen, Deutschland
| | - Mika F R Rollmann
- Klinik für Unfall- und Wiederherstellungschirurgie an der Eberhard Karls Universität Tübingen, BG Klinik Tübingen, Schnarrenbergstr. 95, 72076, Tübingen, Deutschland
| |
Collapse
|
31
|
Velasquez R, Barja-Ore J, Salazar-Salvatierra E, Gutiérrez-Ilave M, Mauricio-Vilchez C, Mendoza R, Mayta-Tovalino F. Characteristics, Impact, and Visibility of Scientific Publications on Artificial Intelligence in Dentistry: A Scientometric Analysis. J Contemp Dent Pract 2022; 23:761-767. [PMID: 37283008 DOI: 10.5005/jp-journals-10024-3386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
AIM To analyze the bibliometric characteristics, impact, and visibility of scientific publications on artificial intelligence (AI) in dentistry in Scopus. MATERIALS AND METHODS Descriptive and cross-sectional bibliometric study, based on the systematic search of information in Scopus between 2017 and July 10, 2022. The search strategy was elaborated with Medical Subject Headings (MeSH) and Boolean operators. The analysis of bibliometric indicators was performed with Elsevier's SciVal program. RESULTS From 2017 to 2022, the number of publications in indexed scientific journals increased, especially in the Q1 (56.1%) and Q2 (30.6%) quartile. Among the journals with the highest production, the majority was from the United States and the United Kingdom, and the Journal of Dental Research has the highest impact (14.9 citations per publication) and the most publications (31). In addition, the Charité - Universitätsmedizin Berlin (FWCI: 8.24) and Krois Joachim (FWCI: 10.09) from Germany were the institution and author with the highest expected performance relative to the world average, respectively. The United States is the country with the highest number of published papers. CLINICAL SIGNIFICANCE There is an increasing tendency to increase the scientific production on artificial intelligence in the field of dentistry, with a preference for publication in prestigious scientific journals of high impact. Most of the productive authors and institutions were from Japan. There is a need to promote and consolidate strategies to develop collaborative research both nationally and internationally.
Collapse
Affiliation(s)
- Ricardo Velasquez
- Postgraduate Department, Faculty of Dentistry, Universidad Nacional Federico Villarreal, Lima, Peru
| | - John Barja-Ore
- Research Direction, Universidad Privada del Norte, Lima, Peru
| | | | - Margot Gutiérrez-Ilave
- Academic Department of Preventive and Social Stomatology, Faculty of Dentistry, Universidad Nacional Mayor de San Marcos, Lima, Peru
| | - Cesar Mauricio-Vilchez
- Postgraduate Department, Faculty of Dentistry, Universidad Nacional Federico Villarreal, Lima, Peru
| | - Roman Mendoza
- Postgraduate Department, Faculty of Dentistry, Universidad Nacional Federico Villarreal, Lima, Peru
| | - Frank Mayta-Tovalino
- Vicerrectorado de Investigación, Universidad San Ignacio de Loyola, Av. la Fontana, La Molina, Lima, Peru, Phone: +013171000, e-mail:
| |
Collapse
|
32
|
Prijs J, Liao Z, Ashkani-Esfahani S, Olczak J, Gordon M, Jayakumar P, Jutte PC, Jaarsma RL, IJpma FFA, Doornberg JN. Artificial intelligence and computer vision in orthopaedic trauma : the why, what, and how. Bone Joint J 2022; 104-B:911-914. [PMID: 35909378 DOI: 10.1302/0301-620x.104b8.bjj-2022-0119.r1] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Artificial intelligence (AI) is, in essence, the concept of 'computer thinking', encompassing methods that train computers to perform and learn from executing certain tasks, called machine learning, and methods to build intricate computer models that both learn and adapt, called complex neural networks. Computer vision is a function of AI by which machine learning and complex neural networks can be applied to enable computers to capture, analyze, and interpret information from clinical images and visual inputs. This annotation summarizes key considerations and future perspectives concerning computer vision, questioning the need for this technology (the 'why'), the current applications (the 'what'), and the approach to unlocking its full potential (the 'how'). Cite this article: Bone Joint J 2022;104-B(8):911-914.
Collapse
Affiliation(s)
- Jasper Prijs
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, the Netherlands.,Department of Surgery, Groningen University Medical Centre, Groningen, the Netherlands.,Department of Orthopaedic & Trauma Surgery, Flinders University, Flinders Medical Centre, Adelaide, Australia
| | - Zhibin Liao
- Australian Institute for Machine Learning, Adelaide, Australia
| | | | - Jakub Olczak
- Institute of Clinical Sciences, Danderyd University Hospital, Karolinska Institute, Stockholm, Sweden
| | - Max Gordon
- Institute of Clinical Sciences, Danderyd University Hospital, Karolinska Institute, Stockholm, Sweden
| | - Prakash Jayakumar
- The University of Texas at Austin, Dell Medical School, Austin, Texas, USA
| | - Paul C Jutte
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, the Netherlands
| | - Ruurd L Jaarsma
- Department of Orthopaedic & Trauma Surgery, Flinders University, Flinders Medical Centre, Adelaide, Australia
| | - Frank F A IJpma
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, the Netherlands
| | - Job N Doornberg
- Department of Orthopaedic Surgery, Groningen University Medical Centre, Groningen, the Netherlands.,Department of Orthopaedic & Trauma Surgery, Flinders University, Flinders Medical Centre, Adelaide, Australia
| | | |
Collapse
|
33
|
Abstract
The deployment of machine learning for tasks relevant to complementing standard of care and advancing tools for precision health has gained much attention in the clinical community, thus meriting further investigations into its broader use. In an introduction to predictive modelling using machine learning, we conducted a review of the recent literature that explains standard taxonomies, terminology and central concepts to a broad clinical readership. Articles aimed at readers with little or no prior experience of commonly used methods or typical workflows were summarised and key references are highlighted. Continual interdisciplinary developments in data science, biostatistics and epidemiology also motivated us to further discuss emerging topics in predictive and data-driven (hypothesis-less) analytics with machine learning. Through two methodological deep dives using examples from precision psychiatry and outcome prediction after lymphoma, we highlight how the use of, for example, natural language processing can outperform established clinical risk scores and aid dynamic prediction and adaptive care strategies. Such realistic and detailed examples allow for critical analysis of the importance of new technological advances in artificial intelligence for clinical decision-making. New clinical decision support systems can assist in prevention and care by leveraging precision medicine.
Collapse
Affiliation(s)
- Sandra Eloranta
- Division of Clinical Epidemiology, Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden
| | - Magnus Boman
- Division of Software and Computer Systems, School of Electrical Engineering and Computer Science, KTH, Stockholm, Sweden.,Department of Learning, Informatics, Management, and Ethics, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
34
|
Stafford IS, Gosink MM, Mossotto E, Ennis S, Hauben M. A Systematic Review of Artificial Intelligence and Machine Learning Applications to Inflammatory Bowel Disease, with Practical Guidelines for Interpretation. Inflamm Bowel Dis 2022; 28:1573-1583. [PMID: 35699597 PMCID: PMC9527612 DOI: 10.1093/ibd/izac115] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Indexed: 12/15/2022]
Abstract
BACKGROUND Inflammatory bowel disease (IBD) is a gastrointestinal chronic disease with an unpredictable disease course. Computational methods such as machine learning (ML) have the potential to stratify IBD patients for the provision of individualized care. The use of ML methods for IBD was surveyed, with an additional focus on how the field has changed over time. METHODS On May 6, 2021, a systematic review was conducted through a search of MEDLINE and Embase databases, with the search structure ("machine learning" OR "artificial intelligence") AND ("Crohn* Disease" OR "Ulcerative Colitis" OR "Inflammatory Bowel Disease"). Exclusion criteria included studies not written in English, no human patient data, publication before 2001, studies that were not peer reviewed, nonautoimmune disease comorbidity research, and record types that were not primary research. RESULTS Seventy-eight (of 409) records met the inclusion criteria. Random forest methods were most prevalent, and there was an increase in neural networks, mainly applied to imaging data sets. The main applications of ML to clinical tasks were diagnosis (18 of 78), disease course (22 of 78), and disease severity (16 of 78). The median sample size was 263. Clinical and microbiome-related data sets were most popular. Five percent of studies used an external data set after training and testing for additional model validation. DISCUSSION Availability of longitudinal and deep phenotyping data could lead to better modeling. Machine learning pipelines that consider imbalanced data and that feature selection only on training data will generate more generalizable models. Machine learning models are increasingly being applied to more complex clinical tasks for specific phenotypes, indicating progress towards personalized medicine for IBD.
Collapse
Affiliation(s)
| | | | - Enrico Mossotto
- Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
| | - Sarah Ennis
- Address correspondence to: Sarah Ennis, Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK ()
| | | |
Collapse
|
35
|
Crossnohere NL, Elsaid M, Paskett J, Bose-Brill S, Bridges JFP. Guidelines for artificial intelligence in medicine: A literature review and content analysis of frameworks (Preprint). J Med Internet Res 2022; 24:e36823. [PMID: 36006692 PMCID: PMC9459836 DOI: 10.2196/36823] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 06/02/2022] [Accepted: 07/14/2022] [Indexed: 12/15/2022] Open
Abstract
Background Artificial intelligence (AI) is rapidly expanding in medicine despite a lack of consensus on its application and evaluation. Objective We sought to identify current frameworks guiding the application and evaluation of AI for predictive analytics in medicine and to describe the content of these frameworks. We also assessed what stages along the AI translational spectrum (ie, AI development, reporting, evaluation, implementation, and surveillance) the content of each framework has been discussed. Methods We performed a literature review of frameworks regarding the oversight of AI in medicine. The search included key topics such as “artificial intelligence,” “machine learning,” “guidance as topic,” and “translational science,” and spanned the time period 2014-2022. Documents were included if they provided generalizable guidance regarding the use or evaluation of AI in medicine. Included frameworks are summarized descriptively and were subjected to content analysis. A novel evaluation matrix was developed and applied to appraise the frameworks’ coverage of content areas across translational stages. Results Fourteen frameworks are featured in the review, including six frameworks that provide descriptive guidance and eight that provide reporting checklists for medical applications of AI. Content analysis revealed five considerations related to the oversight of AI in medicine across frameworks: transparency, reproducibility, ethics, effectiveness, and engagement. All frameworks include discussions regarding transparency, reproducibility, ethics, and effectiveness, while only half of the frameworks discuss engagement. The evaluation matrix revealed that frameworks were most likely to report AI considerations for the translational stage of development and were least likely to report considerations for the translational stage of surveillance. Conclusions Existing frameworks for the application and evaluation of AI in medicine notably offer less input on the role of engagement in oversight and regarding the translational stage of surveillance. Identifying and optimizing strategies for engagement are essential to ensure that AI can meaningfully benefit patients and other end users.
Collapse
Affiliation(s)
- Norah L Crossnohere
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
- Division of General Internal Medicine, Department of Internal Medicine, The Ohio State University College of Medicine, Columbus, OH, United States
| | - Mohamed Elsaid
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| | - Jonathan Paskett
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| | - Seuli Bose-Brill
- Division of General Internal Medicine, Department of Internal Medicine, The Ohio State University College of Medicine, Columbus, OH, United States
| | - John F P Bridges
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| |
Collapse
|
36
|
JOHANNESDOTTIR KB, KEHLET H, PETERSEN PB, AASVANG EK, SØRENSEN HBD, JØRGENSEN CC. Machine learning classifiers do not improve prediction of hospitalization > 2 days after fast-track hip and knee arthroplasty compared with a classical statistical risk model. Acta Orthop 2022; 93:117-123. [PMID: 34984485 PMCID: PMC8815306 DOI: 10.2340/17453674.2021.843] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Indexed: 01/31/2023] Open
Abstract
Background and purpose: Prediction of postoperative outcomes and length of hospital stay (LOS) of patients is vital for allocation of healthcare resources. We investigated the performance of prediction models based on machinelearning algorithms compared with a previous risk stratification model using traditional multiple logistic regression, for predicting the risk of a LOS of > 2 days after fast-track total hip and knee replacement. Patients and methods: 3 different machine learning classifiers were trained on data from the Lundbeck Centre for Fast-track Hip and Knee Replacement Database (LCDB) collected from 9,512 patients between 2016 and 2017. The chosen classifiers were a random forest classifier (RF), a support vector machine classifier with a polynomial kernel (SVM), and a multinomial Naïve-Bayes classifier (NB). Results: Comparing performance measures of the classifiers with the traditional model revealed that all the models had a similar performance in terms of F1 score, accuracy, sensitivity, specificity, area under the receiver operating curve (AUC), and area under the precision-recall curve (AUPRC). A feature importance analysis of the RF classifier found hospital, age, use of walking aid, living alone, and joint operated on to be the most relevant input features. None of the classifiers reached a clinically relevant performance with the input data from the LCDB. Interpretation: Despite the promising prospects of machine-learning practices for disease and risk prediction, none of the machine learning models tested outperformed the traditional multiple regression model in predicting which patients in this cohort had a LOS > 2 days.
Collapse
Affiliation(s)
- Katrin B JOHANNESDOTTIR
- Biomedical Signal Processing & AI research group, Digital Health Section, DTU Health Tech, Technical University of Denmark, Lyngby
| | - Henrik KEHLET
- Section of Surgical Pathophysiology 7621, Rigshospitalet, Copenhagen
| | - Pelle B PETERSEN
- Section of Surgical Pathophysiology 7621, Rigshospitalet, Copenhagen
| | - Eske K AASVANG
- Section of Surgical Pathophysiology 7621, Rigshospitalet, Copenhagen,Department of Anesthesiology, Center for Cancer and Organ Diseases, Copenhagen, Denmark
| | - Helge B D SØRENSEN
- Biomedical Signal Processing & AI research group, Digital Health Section, DTU Health Tech, Technical University of Denmark, Lyngby
| | - Christoffer C JØRGENSEN
- Section of Surgical Pathophysiology 7621, Rigshospitalet, Copenhagen,The Centre for Fast-track Hip and Knee Replacement Collaborative Group: Frank MADSEN, Dept. of Orthopedics, Aarhus University Hospital, Aarhus, DK; Torben Bæk HANSEN, Dept. of Orthopedics, Regional Hospital Holstebro, Holstebro, DK; Thomas JAKOBSEN, Aalborg University Hospital Northern Orthopaedic Division, Aalborg, DK; Lars Tambour HANSEN, Dept. of Orthopedics, Sydvestjysk Hospital Esbjerg/Grindsted, Grindsted, DK; Claus VARNUM, Dept. of Orthopedics, Lillebælt Hospital Vejle, DK; Mikkel Rathsach ANDERSEN, Dept. of Orthopedics, Gentofte University Hospital, Copenhagen, DK; Niels Harry KRARUP, Dept. of Orthopedics, Viborg Hospital, Viborg, DK; and Henrik PALM, Dept. of Orthopaedic Surgery, Copenhagen University Hospital Bispebjerg, Copenhagen, DK
| |
Collapse
|
37
|
Oliveira E Carmo L, van den Merkhof A, Olczak J, Gordon M, Jutte PC, Jaarsma RL, IJpma FFA, Doornberg JN, Prijs J. An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics : are these externally validated and ready for clinical application? Bone Jt Open 2021; 2:879-885. [PMID: 34669518 PMCID: PMC8558452 DOI: 10.1302/2633-1462.210.bjo-2021-0133] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Aims The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid?; which methods are applied for external validation (EV)?; and, what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs? Methods The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS). Results Out of 1,349 studies, 36 reported development of a CNN for fracture detection and/or classification. Of these, only four (11%) reported a form of EV. One study used temporal EV, one conducted both temporal and geographical EV, and two used geographical EV. When comparing the CNN’s performance on the IV set versus the EV set, the following were found: AUCs of 0.967 (IV) versus 0.975 (EV), 0.976 (IV) versus 0.985 to 0.992 (EV), 0.93 to 0.96 (IV) versus 0.80 to 0.89 (EV), and F1-scores of 0.856 to 0.863 (IV) versus 0.757 to 0.840 (EV). Conclusion The number of externally validated CNNs in orthopaedic trauma for fracture recognition is still scarce. This greatly limits the potential for transfer of these CNNs from the developing institute to another hospital to achieve similar diagnostic performance. We recommend the use of geographical EV and statements such as the Consolidated Standards of Reporting Trials–Artificial Intelligence (CONSORT-AI), the Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence (SPIRIT-AI) and the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis–Machine Learning (TRIPOD-ML) to critically appraise performance of CNNs and improve methodological rigor, quality of future models, and facilitate eventual implementation in clinical practice. Cite this article: Bone Jt Open 2021;2(10):879–885.
Collapse
Affiliation(s)
- Luisa Oliveira E Carmo
- Department of Orthopaedic Surgery, University Medical Centre, University of Groningen, Groningen, Groningen, Netherlands
| | - Anke van den Merkhof
- Department of Orthopaedic Surgery, Flinders Medical Centre, Bedford Park, Adelaide, South Australia, Australia.,Flinders University, Bedford Park, Adelaide, South Australia, Australia
| | - Jakub Olczak
- Institute of Clinical Sciences, Danderyd University Hospital, Karolinska Institute, Stockholm, Sweden
| | - Max Gordon
- Institute of Clinical Sciences, Danderyd University Hospital, Karolinska Institute, Stockholm, Sweden
| | - Paul C Jutte
- Department of Orthopaedic Surgery, University Medical Centre, University of Groningen, Groningen, Groningen, Netherlands
| | - Ruurd L Jaarsma
- Department of Orthopaedic Surgery, Flinders Medical Centre, Bedford Park, Adelaide, South Australia, Australia.,Flinders University, Bedford Park, Adelaide, South Australia, Australia
| | - Frank F A IJpma
- Department of Trauma Surgery, University Medical Centre Groningen, University of Groningen, Groningen, Groningen, Netherlands
| | - Job N Doornberg
- Department of Orthopaedic Surgery, University Medical Centre, University of Groningen, Groningen, Groningen, Netherlands.,Department of Orthopaedic Surgery, Flinders Medical Centre, Bedford Park, Adelaide, South Australia, Australia.,Flinders University, Bedford Park, Adelaide, South Australia, Australia.,Department of Trauma Surgery, University Medical Centre Groningen, University of Groningen, Groningen, Groningen, Netherlands
| | - Jasper Prijs
- Department of Orthopaedic Surgery, University Medical Centre, University of Groningen, Groningen, Groningen, Netherlands.,Department of Orthopaedic Surgery, Flinders Medical Centre, Bedford Park, Adelaide, South Australia, Australia.,Flinders University, Bedford Park, Adelaide, South Australia, Australia.,Department of Trauma Surgery, University Medical Centre Groningen, University of Groningen, Groningen, Groningen, Netherlands
| | -
- Machine Learning Consortium
| |
Collapse
|