1
|
Moukheiber D, Restrepo D, Cajas SA, Montoya MPA, Celi LA, Kuo KT, López DM, Moukheiber L, Moukheiber M, Moukheiber S, Osorio-Valencia JS, Purkayastha S, Paddo AR, Wu C, Kuo PC. A multimodal framework for extraction and fusion of satellite images and public health data. Sci Data 2024; 11:634. [PMID: 38879585 PMCID: PMC11180113 DOI: 10.1038/s41597-024-03366-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 05/10/2024] [Indexed: 06/19/2024] Open
Abstract
In low- and middle-income countries, the substantial costs associated with traditional data collection pose an obstacle to facilitating decision-making in the field of public health. Satellite imagery offers a potential solution, but the image extraction and analysis can be costly and requires specialized expertise. We introduce SatelliteBench, a scalable framework for satellite image extraction and vector embeddings generation. We also propose a novel multimodal fusion pipeline that utilizes a series of satellite imagery and metadata. The framework was evaluated generating a dataset with a collection of 12,636 images and embeddings accompanied by comprehensive metadata, from 81 municipalities in Colombia between 2016 and 2018. The dataset was then evaluated in 3 tasks: including dengue case prediction, poverty assessment, and access to education. The performance showcases the versatility and practicality of SatelliteBench, offering a reproducible, accessible and open tool to enhance decision-making in public health.
Collapse
Affiliation(s)
- Dana Moukheiber
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - David Restrepo
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.
- Departamento de Telemática, Universidad del Cauca, Popayán, Cauca, Colombia.
| | - Sebastián Andrés Cajas
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Boston, Massachusetts, USA
- School of Computer Science, University College Dublin, Dublin, Ireland
| | | | - Leo Anthony Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, Massachusetts, USA
- Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA
| | - Kuan-Ting Kuo
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan
| | - Diego M López
- Departamento de Telemática, Universidad del Cauca, Popayán, Cauca, Colombia
| | - Lama Moukheiber
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Mira Moukheiber
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Sulaiman Moukheiber
- Department of Computer Science, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | | | - Saptarshi Purkayastha
- Department of BioHealth Informatics, Indiana University Luddy School of Informatics, Computing, and Engineering, Indianapolis, Indiana, USA
| | - Atika Rahman Paddo
- Department of BioHealth Informatics, Indiana University Luddy School of Informatics, Computing, and Engineering, Indianapolis, Indiana, USA
| | - Chenwei Wu
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, Michigan, USA
| | - Po-Chih Kuo
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan.
| |
Collapse
|
2
|
Guo XJ, Huang LY, Gong ST, Li M, Wang W, Chen J, Zhang YD, Lu X, Chen X, Luo L, Yang Y, Luo X, Qi SH. Peroxynitrite-Triggered Carbon Monoxide Donor Improves Ischemic Stroke Outcome by Inhibiting Neuronal Apoptosis and Ferroptosis. Mol Neurobiol 2024:10.1007/s12035-024-04238-w. [PMID: 38767837 DOI: 10.1007/s12035-024-04238-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 04/29/2024] [Indexed: 05/22/2024]
Abstract
Cerebral ischemia-reperfusion injury produces excessive reactive oxygen and nitrogen species, including superoxide, nitric oxide, and peroxynitrite (ONOO-). We recently developed a new ONOO--triggered metal-free carbon monoxide donor (PCOD585), exhibiting a notable neuroprotective outcome on the rat middle cerebral artery occlusion model and rendering an exciting intervention opportunity toward ischemia-induced brain injuries. However, its therapeutic mechanism still needs to be addressed. In the pharmacological study, we found PCOD585 inhibited neuronal Bcl2/Bax/caspase-3 apoptosis pathway in the peri-infarcted area of stroke by scavenging ONOO-. ONOO- scavenging further led to decreased Acyl-CoA synthetase long-chain family member 4 and increased glutathione peroxidase 4, to minimize lipoperoxidation. Additionally, the carbon monoxide release upon the ONOO- reaction with PCOD585 further inhibited the neuronal Iron-dependent ferroptosis associated with ischemia-reperfusion. Such a synergistic neuroprotective mechanism of PCOD585 yields as potent a neuroprotective effect as Edaravone. Additionally, PCOD585 penetrates the blood-brain barrier and reduces the degradation of zonula occludens-1 by inhibiting matrix metalloproteinase-9, thereby protecting the integrity of the blood-brain barrier. Our study provides a new perspective for developing multi-functional compounds to treat ischemic stroke.
Collapse
Affiliation(s)
- Xin-Jian Guo
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Lin-Yan Huang
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Shi-Tong Gong
- Xuzhou Central Hospital, Affiliated Xuzhou Clinical College of Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Ming Li
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Wan Wang
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Jie Chen
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Yi-De Zhang
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Xicun Lu
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of Chemical Biology, School of Pharmacy, East China University of Science and Technology, Meilong Road 130, Shanghai, 200237, China
| | - Xiaohua Chen
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of Chemical Biology, School of Pharmacy, East China University of Science and Technology, Meilong Road 130, Shanghai, 200237, China
| | - Lan Luo
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Youjun Yang
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of Chemical Biology, School of Pharmacy, East China University of Science and Technology, Meilong Road 130, Shanghai, 200237, China
| | - Xiao Luo
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of Chemical Biology, School of Pharmacy, East China University of Science and Technology, Meilong Road 130, Shanghai, 200237, China.
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Dongchuan Road 500, Shanghai, 200241, China.
| | - Su-Hua Qi
- School of Medical Technology, Xuzhou Key Laboratory of Laboratory Diagnostics, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| |
Collapse
|
3
|
Anger M, Wendelborn C, Schickhardt C. German funders' data sharing policies-A qualitative interview study. PLoS One 2024; 19:e0296956. [PMID: 38330079 PMCID: PMC10852319 DOI: 10.1371/journal.pone.0296956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 12/21/2023] [Indexed: 02/10/2024] Open
Abstract
BACKGROUND Data sharing is commonly seen as beneficial for science but is not yet common practice. Research funding agencies are known to play a key role in promoting data sharing, but German funders' data sharing policies appear to lag behind in international comparison. This study aims to answer the question of how German data sharing experts inside and outside funding agencies perceive and evaluate German funders' data sharing policies and overall efforts to promote data sharing. METHODS This study is based on sixteen guided expert interviews with representatives of German funders and German research data experts from stakeholder organisations, who shared their perceptions of German' funders efforts to promote data sharing. By applying the method of qualitative content analysis to our interview data, we categorise and describe noteworthy aspects of the German data sharing policy landscape and illustrate our findings with interview passages. RESULTS We present our findings in five sections to distinguish our interviewees' perceptions on a) the status quo of German funders' data sharing policies, b) the role of funders in promoting data sharing, c) current and potential measures by funders to promote data sharing, d) general barriers to those measures, and e) the implementation of more binding data sharing requirements. DISCUSSION AND CONCLUSION Although funders are perceived to be important promoters and facilitators of data sharing throughout our interviews, only few German funding agencies have data sharing policies in place. Several interviewees stated that funders could do more, for example by providing incentives for data sharing or by introducing more concrete policies. Our interviews suggest the academic freedom of grantees is widely perceived as an obstacle for German funders in introducing mandatory data sharing requirements. However, some interviewees stated that stricter data sharing requirements could be justified if data sharing is a part of good scientific practice.
Collapse
Affiliation(s)
- Michael Anger
- Section for Translational Medical Ethics, Clinical Cooperation Unit Applied Tumor Immunity, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Christian Wendelborn
- Section for Translational Medical Ethics, Clinical Cooperation Unit Applied Tumor Immunity, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Christoph Schickhardt
- Section for Translational Medical Ethics, Clinical Cooperation Unit Applied Tumor Immunity, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
4
|
Ciobanu-Caraus O, Aicher A, Kernbach JM, Regli L, Serra C, Staartjes VE. A critical moment in machine learning in medicine: on reproducible and interpretable learning. Acta Neurochir (Wien) 2024; 166:14. [PMID: 38227273 DOI: 10.1007/s00701-024-05892-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 12/14/2023] [Indexed: 01/17/2024]
Abstract
Over the past two decades, advances in computational power and data availability combined with increased accessibility to pre-trained models have led to an exponential rise in machine learning (ML) publications. While ML may have the potential to transform healthcare, this sharp increase in ML research output without focus on methodological rigor and standard reporting guidelines has fueled a reproducibility crisis. In addition, the rapidly growing complexity of these models compromises their interpretability, which currently impedes their successful and widespread clinical adoption. In medicine, where failure of such models may have severe implications for patients' health, the high requirements for accuracy, robustness, and interpretability confront ML researchers with a unique set of challenges. In this review, we discuss the semantics of reproducibility and interpretability, as well as related issues and challenges, and outline possible solutions to counteracting the "black box". To foster reproducibility, standard reporting guidelines need to be further developed and data or code sharing encouraged. Editors and reviewers may equally play a critical role by establishing high methodological standards and thus preventing the dissemination of low-quality ML publications. To foster interpretable learning, the use of simpler models more suitable for medical data can inform the clinician how results are generated based on input data. Model-agnostic explanation tools, sensitivity analysis, and hidden layer representations constitute further promising approaches to increase interpretability. Balancing model performance and interpretability are important to ensure clinical applicability. We have now reached a critical moment for ML in medicine, where addressing these issues and implementing appropriate solutions will be vital for the future evolution of the field.
Collapse
Affiliation(s)
- Olga Ciobanu-Caraus
- Machine Intelligence in Clinical Neuroscience & Microsurgical Neuroanatomy (MICN) Laboratory, Department of Neurosurgery, Clinical Neuroscience Center, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Anatol Aicher
- Machine Intelligence in Clinical Neuroscience & Microsurgical Neuroanatomy (MICN) Laboratory, Department of Neurosurgery, Clinical Neuroscience Center, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Julius M Kernbach
- Department of Neuroradiology, University Hospital Heidelberg, Heidelberg, Germany
| | - Luca Regli
- Machine Intelligence in Clinical Neuroscience & Microsurgical Neuroanatomy (MICN) Laboratory, Department of Neurosurgery, Clinical Neuroscience Center, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Carlo Serra
- Machine Intelligence in Clinical Neuroscience & Microsurgical Neuroanatomy (MICN) Laboratory, Department of Neurosurgery, Clinical Neuroscience Center, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Victor E Staartjes
- Machine Intelligence in Clinical Neuroscience & Microsurgical Neuroanatomy (MICN) Laboratory, Department of Neurosurgery, Clinical Neuroscience Center, University Hospital Zurich, University of Zurich, Zurich, Switzerland.
| |
Collapse
|
5
|
Li LT, Haley LC, Boyd AK, Bernstam EV. Technical/Algorithm, Stakeholder, and Society (TASS) barriers to the application of artificial intelligence in medicine: A systematic review. J Biomed Inform 2023; 147:104531. [PMID: 37884177 DOI: 10.1016/j.jbi.2023.104531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 09/14/2023] [Accepted: 10/22/2023] [Indexed: 10/28/2023]
Abstract
INTRODUCTION The use of artificial intelligence (AI), particularly machine learning and predictive analytics, has shown great promise in health care. Despite its strong potential, there has been limited use in health care settings. In this systematic review, we aim to determine the main barriers to successful implementation of AI in healthcare and discuss potential ways to overcome these challenges. METHODS We conducted a literature search in PubMed (1/1/2001-1/1/2023). The search was restricted to publications in the English language, and human study subjects. We excluded articles that did not discuss AI, machine learning, predictive analytics, and barriers to the use of these techniques in health care. Using grounded theory methodology, we abstracted concepts to identify major barriers to AI use in medicine. RESULTS We identified a total of 2,382 articles. After reviewing the 306 included papers, we developed 19 major themes, which we categorized into three levels: the Technical/Algorithm, Stakeholder, and Social levels (TASS). These themes included: Lack of Explainability, Need for Validation Protocols, Need for Standards for Interoperability, Need for Reporting Guidelines, Need for Standardization of Performance Metrics, Lack of Plan for Updating Algorithm, Job Loss, Skills Loss, Workflow Challenges, Loss of Patient Autonomy and Consent, Disturbing the Patient-Clinician Relationship, Lack of Trust in AI, Logistical Challenges, Lack of strategic plan, Lack of Cost-effectiveness Analysis and Proof of Efficacy, Privacy, Liability, Bias and Social Justice, and Education. CONCLUSION We identified 19 major barriers to the use of AI in healthcare and categorized them into three levels: the Technical/Algorithm, Stakeholder, and Social levels (TASS). Future studies should expand on barriers in pediatric care and focus on developing clearly defined protocols to overcome these barriers.
Collapse
Affiliation(s)
- Linda T Li
- Department of Surgery, Division of Pediatric Surgery, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY 10029, United States; McWilliams School of Biomedical Informatics at UT Health Houston, 7000 Fannin St, Suite 600, Houston, TX 77030, United States.
| | - Lauren C Haley
- McGovern Medical School at the University of Texas Health Science Center at Houston, 6431 Fannin St, Houston, TX 77030, United States.
| | - Alexandra K Boyd
- McGovern Medical School at the University of Texas Health Science Center at Houston, 6431 Fannin St, Houston, TX 77030, United States.
| | - Elmer V Bernstam
- McWilliams School of Biomedical Informatics at UT Health Houston, 7000 Fannin St, Suite 600, Houston, TX 77030, United States; McGovern Medical School at the University of Texas Health Science Center at Houston, 6431 Fannin St, Houston, TX 77030, United States.
| |
Collapse
|
6
|
Restrepo D, Quion J, Vásquez-Venegas C, Villanueva C, Anthony Celi L, Nakayama LF. A scoping review of the landscape of health-related open datasets in Latin America. PLOS DIGITAL HEALTH 2023; 2:e0000368. [PMID: 37878549 PMCID: PMC10599518 DOI: 10.1371/journal.pdig.0000368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 09/16/2023] [Indexed: 10/27/2023]
Abstract
Artificial intelligence (AI) algorithms have the potential to revolutionize healthcare, but their successful translation into clinical practice has been limited. One crucial factor is the data used to train these algorithms, which must be representative of the population. However, most healthcare databases are derived from high-income countries, leading to non-representative models and potentially exacerbating health inequities. This review focuses on the landscape of health-related open datasets in Latin America, aiming to identify existing datasets, examine data-sharing frameworks, techniques, platforms, and formats, and identify best practices in Latin America. The review found 61 datasets from 23 countries, with the DATASUS dataset from Brazil contributing to the majority of articles. The analysis revealed a dearth of datasets created by the authors themselves, indicating a reliance on existing open datasets. The findings underscore the importance of promoting open data in Latin America. We provide recommendations for enhancing data sharing in the region.
Collapse
Affiliation(s)
- David Restrepo
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Telematics Department, University of Cauca, Popayán, Cauca, Colombia
| | - Justin Quion
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Constanza Vásquez-Venegas
- Scientific Image Analysis Lab, Integrative Biology Program, Biomedical Sciences Institute (ICBM), Faculty of Medicine, Universidad de Chile, Santiago, Chile
| | - Cleva Villanueva
- Instituto Politécnico Nacional, Escuela Superior de Medicina, Ciudad de Mexico, Mexico
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Luis Filipe Nakayama
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Department of Ophthalmology, São Paulo Federal University, São Paulo, São Paulo, Brazil
| |
Collapse
|
7
|
Olaleye SA, Mogaji E, Agbo FJ, Ukpabi D, Gyamerah A. The composition of data economy: a bibliometric approach and TCCM framework of conceptual, intellectual and social structure. INFORMATION DISCOVERY AND DELIVERY 2022. [DOI: 10.1108/idd-02-2022-0014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
The data economy mainly relies on the surveillance capitalism business model, enabling companies to monetize their data. The surveillance allows for transforming private human experiences into behavioral data that can be harnessed in the marketing sphere. This study aims to focus on investigating the domain of data economy with the methodological lens of quantitative bibliometric analysis of published literature.
Design/methodology/approach
The bibliometric analysis seeks to unravel trends and timelines for the emergence of the data economy, its conceptualization, scientific progression and thematic synergy that could predict the future of the field. A total of 591 data between 2008 and June 2021 were used in the analysis with the Biblioshiny app on the web interfaced and VOSviewer version 1.6.16 to analyze data from Web of Science and Scopus.
Findings
This study combined findable, accessible, interoperable and reusable (FAIR) data and data economy and contributed to the literature on big data, information discovery and delivery by shedding light on the conceptual, intellectual and social structure of data economy and demonstrating data relevance as a key strategic asset for companies and academia now and in the future.
Research limitations/implications
Findings from this study provide a steppingstone for researchers who may engage in further empirical and longitudinal studies by employing, for example, a quantitative and systematic review approach. In addition, future research could expand the scope of this study beyond FAIR data and data economy to examine aspects such as theories and show a plausible explanation of several phenomena in the emerging field.
Practical implications
The researchers can use the results of this study as a steppingstone for further empirical and longitudinal studies.
Originality/value
This study confirmed the relevance of data to society and revealed some gaps to be undertaken for the future.
Collapse
|
8
|
Belbasis L, Panagiotou OA. Reproducibility of prediction models in health services research. BMC Res Notes 2022; 15:204. [PMID: 35690767 PMCID: PMC9188254 DOI: 10.1186/s13104-022-06082-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 05/18/2022] [Indexed: 12/23/2022] Open
Abstract
The field of health services research studies the health care system by examining outcomes relevant to patients and clinicians but also health economists and policy makers. Such outcomes often include health care spending, and utilization of care services. Building accurate prediction models using reproducible research practices for health services research is important for evidence-based decision making. Several systematic reviews have summarized prediction models for outcomes relevant to health services research, but these systematic reviews do not present a thorough assessment of reproducibility and research quality of the prediction modelling studies. In the present commentary, we discuss how recent advances in prediction modelling in other medical fields can be applied to health services research. We also describe the current status of prediction modelling in health services research, and we summarize available methodological guidance for the development, update, external validation and systematic appraisal of prediction models.
Collapse
Affiliation(s)
- Lazaros Belbasis
- Meta-Research Innovation Center Berlin, QUEST Center, Berlin Institute of Health, Charité - Universitätsmedizin Berlin, Berlin, Germany.
| | - Orestis A Panagiotou
- Center for Evidence Synthesis in Health, School of Public Health, Brown University, Providence, RI, USA.,Department of Health Services, Policy and Practice, School of Public Health, Brown University, Providence, RI, USA.,Department of Epidemiology, School of Public Health, Brown University, Providence, RI, USA
| |
Collapse
|
9
|
Nuñez-Garcia JC, Sánchez-Puente A, Sampedro-Gómez J, Vicente-Palacios V, Jiménez-Navarro M, Oterino-Manzanas A, Jiménez-Candil J, Dorado-Diaz PI, Sánchez PL. Outcome Analysis in Elective Electrical Cardioversion of Atrial Fibrillation Patients: Development and Validation of a Machine Learning Prognostic Model. J Clin Med 2022; 11:jcm11092636. [PMID: 35566761 PMCID: PMC9101912 DOI: 10.3390/jcm11092636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 05/03/2022] [Accepted: 05/04/2022] [Indexed: 11/16/2022] Open
Abstract
Background: The integrated approach to electrical cardioversion (EC) in atrial fibrillation (AF) is complex; candidates can resolve spontaneously while waiting for EC, and post-cardioversion recurrence is high. Thus, it is especially interesting to avoid the programming of EC in patients who would restore sinus rhythm (SR) spontaneously or present early recurrence. We have analyzed the whole elective EC of the AF process using machine-learning (ML) in order to enable a more realistic and detailed simulation of the patient flow for decision making purposes. Methods: The dataset consisted of electronic health records (EHRs) from 429 consecutive AF patients referred for EC. For analysis of the patient outcome, we considered five pathways according to restoring and maintaining SR: (i) spontaneous SR restoration, (ii) pharmacologic-cardioversion, (iii) direct-current cardioversion, (iv) 6-month AF recurrence, and (v) 6-month rhythm control. We applied ML classifiers for predicting outcomes at each pathway and compared them with the CHA2DS2-VASc and HATCH scores. Results: With the exception of pathway (iii), all ML models achieved improvements in comparison with CHA2DS2-VASc or HATCH scores (p < 0.01). Compared to the most competitive score, the area under the ROC curve (AUC-ROC) was: 0.80 vs. 0.66 for predicting (i); 0.71 vs. 0.55 for (ii); 0.64 vs. 0.52 for (iv); and 0.66 vs. 0.51 for (v). For a threshold considered optimal, the empirical net reclassification index was: +7.8%, +47.2%, +28.2%, and +34.3% in favor of our ML models for predicting outcomes for pathways (i), (ii), (iv), and (v), respectively. As an example tool of generalizability of ML models, we deployed our algorithms in an open-source calculator, where the model would personalize predictions. Conclusions: An ML model improves the accuracy of restoring and maintaining SR predictions over current discriminators. The proposed approach enables a detailed simulation of the patient flow through personalized predictions.
Collapse
Affiliation(s)
- Jean C. Nuñez-Garcia
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
| | - Antonio Sánchez-Puente
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
- CIBERCV (Centro de Investigacion Biomedica en Red Enfermedades Cardiovasculares), Instituto de Salud Carlos III, C/Monforte de Lemos 3-5, Pabellón 11, Planta 0, 28029 Madrid, Spain
- Correspondence: (A.S.-P.); (P.L.S.); Tel.: +34-92-329-1100 (ext. 55738) (P.L.S.)
| | - Jesús Sampedro-Gómez
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
- CIBERCV (Centro de Investigacion Biomedica en Red Enfermedades Cardiovasculares), Instituto de Salud Carlos III, C/Monforte de Lemos 3-5, Pabellón 11, Planta 0, 28029 Madrid, Spain
| | - Victor Vicente-Palacios
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
- Philips Healthcare, 28050 Madrid, Spain
| | - Manuel Jiménez-Navarro
- Department of Cardiology, Hospital Virgen de la Victoria—IBIMA, 29010 Malaga, Spain;
- Facultad de Medicina, Universidad de Málaga, 29071 Malaga, Spain
| | - Armando Oterino-Manzanas
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
| | - Javier Jiménez-Candil
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
- CIBERCV (Centro de Investigacion Biomedica en Red Enfermedades Cardiovasculares), Instituto de Salud Carlos III, C/Monforte de Lemos 3-5, Pabellón 11, Planta 0, 28029 Madrid, Spain
- Departamento de Medicina, Universidad de Salamanca, 37007 Salamanca, Spain
| | - P. Ignacio Dorado-Diaz
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
- CIBERCV (Centro de Investigacion Biomedica en Red Enfermedades Cardiovasculares), Instituto de Salud Carlos III, C/Monforte de Lemos 3-5, Pabellón 11, Planta 0, 28029 Madrid, Spain
| | - Pedro L. Sánchez
- Department of Cardiology, Hospital Universitario de Salamanca—IBSAL, 37007 Salamanca, Spain; (J.C.N.-G.); (J.S.-G.); (V.V.-P.); (A.O.-M.); (J.J.-C.); (P.I.D.-D.)
- CIBERCV (Centro de Investigacion Biomedica en Red Enfermedades Cardiovasculares), Instituto de Salud Carlos III, C/Monforte de Lemos 3-5, Pabellón 11, Planta 0, 28029 Madrid, Spain
- Departamento de Medicina, Universidad de Salamanca, 37007 Salamanca, Spain
- Correspondence: (A.S.-P.); (P.L.S.); Tel.: +34-92-329-1100 (ext. 55738) (P.L.S.)
| |
Collapse
|
10
|
Grimes DR, Heathers J. The new normal? Redaction bias in biomedical science. ROYAL SOCIETY OPEN SCIENCE 2021; 8:211308. [PMID: 34966555 PMCID: PMC8633797 DOI: 10.1098/rsos.211308] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 11/01/2021] [Indexed: 06/14/2023]
Abstract
A concerning amount of biomedical research is not reproducible. Unreliable results impede empirical progress in medical science, ultimately putting patients at risk. Many proximal causes of this irreproducibility have been identified, a major one being inappropriate statistical methods and analytical choices by investigators. Within this, we formally quantify the impact of inappropriate redaction beyond a threshold value in biomedical science. This is effectively truncation of a dataset by removing extreme data points, and we elucidate its potential to accidentally or deliberately engineer a spurious result in significance testing. We demonstrate that the removal of a surprisingly small number of data points can be used to dramatically alter a result. It is unknown how often redaction bias occurs in the broader literature, but given the risk of distortion to the literature involved, we suggest that it must be studiously avoided, and mitigated with approaches to counteract any potential malign effects to the research quality of medical science.
Collapse
Affiliation(s)
- David Robert Grimes
- School of Physical Sciences, Dublin City University, Glasnevin, Dublin 9, Ireland
- Department of Oncology, University of Oxford, Oxford, Oxfordshire OX3 7DQ, UK
| | | |
Collapse
|
11
|
Yagis E, Atnafu SW, García Seco de Herrera A, Marzi C, Scheda R, Giannelli M, Tessa C, Citi L, Diciotti S. Effect of data leakage in brain MRI classification using 2D convolutional neural networks. Sci Rep 2021; 11:22544. [PMID: 34799630 PMCID: PMC8604922 DOI: 10.1038/s41598-021-01681-w] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Accepted: 10/28/2021] [Indexed: 11/10/2022] Open
Abstract
In recent years, 2D convolutional neural networks (CNNs) have been extensively used to diagnose neurological diseases from magnetic resonance imaging (MRI) data due to their potential to discern subtle and intricate patterns. Despite the high performances reported in numerous studies, developing CNN models with good generalization abilities is still a challenging task due to possible data leakage introduced during cross-validation (CV). In this study, we quantitatively assessed the effect of a data leakage caused by 3D MRI data splitting based on a 2D slice-level using three 2D CNN models to classify patients with Alzheimer's disease (AD) and Parkinson's disease (PD). Our experiments showed that slice-level CV erroneously boosted the average slice level accuracy on the test set by 30% on Open Access Series of Imaging Studies (OASIS), 29% on Alzheimer's Disease Neuroimaging Initiative (ADNI), 48% on Parkinson's Progression Markers Initiative (PPMI) and 55% on a local de-novo PD Versilia dataset. Further tests on a randomly labeled OASIS-derived dataset produced about 96% of (erroneous) accuracy (slice-level split) and 50% accuracy (subject-level split), as expected from a randomized experiment. Overall, the extent of the effect of an erroneous slice-based CV is severe, especially for small datasets.
Collapse
Affiliation(s)
- Ekin Yagis
- School of Computer Science and Electronic Engineering, University of Essex, Colchester, UK
| | - Selamawet Workalemahu Atnafu
- Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi", University of Bologna, Via dell'Università 50, 47521, Cesena, Italy
| | | | - Chiara Marzi
- Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi", University of Bologna, Via dell'Università 50, 47521, Cesena, Italy
| | - Riccardo Scheda
- Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi", University of Bologna, Via dell'Università 50, 47521, Cesena, Italy
| | - Marco Giannelli
- Unit of Medical Physics, Pisa University Hospital "Azienda Ospedaliero-Universitaria Pisana", Pisa, Italy
| | - Carlo Tessa
- Division of Radiology, Versilia Hospital, Azienda USL Toscana Nord Ovest, Lido di Camaiore, LU, Italy
| | - Luca Citi
- School of Computer Science and Electronic Engineering, University of Essex, Colchester, UK
| | - Stefano Diciotti
- Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi", University of Bologna, Via dell'Università 50, 47521, Cesena, Italy.
| |
Collapse
|
12
|
|
13
|
AlRyalat SA, Al-Ryalat N, Ryalat S. Machine learning in glaucoma: a bibliometric analysis comparing computer science and medical fields’ research. EXPERT REVIEW OF OPHTHALMOLOGY 2021. [DOI: 10.1080/17469899.2021.1964956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
| | - Nosaiba Al-Ryalat
- Department Of Radiology And Nuclear Science, The University of Jordan, Amman, Jordan
| | - Soukaina Ryalat
- Department Of Maxillofacial Surgery, The University of Jordan, Amman, Jordan
| |
Collapse
|
14
|
In-code citation practices in open research software libraries. J Informetr 2021. [DOI: 10.1016/j.joi.2021.101139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
15
|
An open-source data set of anti-VEGF therapy in diabetic macular oedema patients over 4 years and their visual acuity outcomes. Eye (Lond) 2020; 35:1354-1364. [PMID: 32591734 DOI: 10.1038/s41433-020-1048-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 06/10/2020] [Accepted: 06/16/2020] [Indexed: 01/18/2023] Open
Abstract
OBJECTIVES The objective of this paper is to evaluate visual acuity (VA) outcomes of intravitreal anti-vascular endothelial growth factor (VEGF) in diabetic macular oedema (DMO). METHODS In this retrospective cohort study, electronic medical records for all patients undergoing intravitreal injections in a tertiary referral centre between March 2013 and October 2018 were analysed. Treatment response in terms of VA outcomes was reported for all eyes over a 4-year observation period. RESULTS Our cohort includes 2614 DMO eyes of 1964 patients over 48 months. Cox proportional-hazards modelling identified injection number (hazard ratio (HR) = 1.18), male gender (HR = 1.13) and baseline VA (HR = 1.09) as independent predictors to reach a favourable visual outcome of more than 70 Early Treatment Diabetic Retinopathy Study letters. Half of our cohort reached 70 letters 1.9 months after starting anti-VEGF therapy. Of those that reached 70 letters, 50% fell below 70 letters by 14.7 months. CONCLUSION To date, this is the largest single centre cohort study and over the longest observation period reporting on real-life outcomes of anti-VEGF in DMO. We have made an anonymised version of our data set available on an open-source data repository as a resource for clinical researchers globally.
Collapse
|
16
|
Nantasenamat C. Best Practices for Constructing Reproducible QSAR Models. METHODS IN PHARMACOLOGY AND TOXICOLOGY 2020. [DOI: 10.1007/978-1-0716-0150-1_3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
|
17
|
Abstract
Emergence of health-related smartphone applications and their wide dissemination in public as well as healthcare practitioners has undergone criticism under the scope of public health. Still, despite methodological issues curbing the initial enthusiasm, availability, safety and, in certain cases, documented efficacy of these measures has secured regulatory approval. Bearing in mind these pitfalls, we describe the necessary steps towards implementation of deep learning techniques in the specific clinical context of women’s health and infertility in particular.
Collapse
|
18
|
Borycki E. Quality and Safety in eHealth: The Need to Build the Evidence Base. J Med Internet Res 2019; 21:e16689. [PMID: 31855183 PMCID: PMC6940858 DOI: 10.2196/16689] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 12/01/2019] [Accepted: 12/09/2019] [Indexed: 12/26/2022] Open
Abstract
Research in the area of health technology safety has demonstrated that technology may both improve patient safety and introduce new types of technology-induced errors. Thus, there is a need to publish safety science literature to develop an evidence-based research base, on which we can continually develop new, safe technologies and improve patient safety. The aim of this viewpoint is to argue for the need to advance evidence-based research in health informatics, so that new technologies can be designed, developed, and implemented for their safety prior to their use in health care. This viewpoint offers a historical perspective on the development of health informatics and safety literature in the area of health technology. I argue for the need to conduct safety studies of technologies used by health professionals and consumers to develop an evidence base in this area. Ongoing research is necessary to improve the quality and safety of health technologies. Over the past several decades, we have seen health informatics emerge as a discipline, with growing research in the field examining the design, development, and implementation of different health technologies and new challenges such as those associated with the quality and safety of technology use. Future research will need to focus on how we can continually extend safety science in this area. There is a need to integrate evidence-based research into the design, development, and implementation of health technologies to improve their safety and reduce technology-induced errors.
Collapse
Affiliation(s)
- Elizabeth Borycki
- School of Health Information Science, University of Victoria, Victoria, BC, Canada
| |
Collapse
|
19
|
|
20
|
Toh TS, Dondelinger F, Wang D. Looking beyond the hype: Applied AI and machine learning in translational medicine. EBioMedicine 2019; 47:607-615. [PMID: 31466916 PMCID: PMC6796516 DOI: 10.1016/j.ebiom.2019.08.027] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Revised: 07/30/2019] [Accepted: 08/13/2019] [Indexed: 12/22/2022] Open
Abstract
Big data problems are becoming more prevalent for laboratory scientists who look to make clinical impact. A large part of this is due to increased computing power, in parallel with new technologies for high quality data generation. Both new and old techniques of artificial intelligence (AI) and machine learning (ML) can now help increase the success of translational studies in three areas: drug discovery, imaging, and genomic medicine. However, ML technologies do not come without their limitations and shortcomings. Current technical limitations and other limitations including governance, reproducibility, and interpretation will be discussed in this article. Overcoming these limitations will enable ML methods to be more powerful for discovery and reduce ambiguity within translational medicine, allowing data-informed decision-making to deliver the next generation of diagnostics and therapeutics to patients quicker, at lowered costs, and at scale.
Collapse
Affiliation(s)
- Tzen S Toh
- The Medical School, University of Sheffield, Sheffield, UK
| | - Frank Dondelinger
- Lancaster Medical School, Furness College, Lancaster University, Bailrigg, Lancaster, UK
| | - Dennis Wang
- NIHR Sheffield Biomedical Research Centre, University of Sheffield, Sheffield, UK; Department of Computer Science, University of Sheffield, Sheffield, UK.
| |
Collapse
|
21
|
The reproducibility crisis in the age of digital medicine. NPJ Digit Med 2019; 2:2. [PMID: 31304352 PMCID: PMC6550262 DOI: 10.1038/s41746-019-0079-z] [Citation(s) in RCA: 71] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 01/11/2019] [Indexed: 11/08/2022] Open
|