51
|
Xie H, Liu Y, Lei H, Song T, Yue G, Du Y, Wang T, Zhang G, Lei B. Adversarial learning-based multi-level dense-transmission knowledge distillation for AP-ROP detection. Med Image Anal 2023; 84:102725. [PMID: 36527770 DOI: 10.1016/j.media.2022.102725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Revised: 10/31/2022] [Accepted: 12/02/2022] [Indexed: 12/13/2022]
Abstract
The Aggressive Posterior Retinopathy of Prematurity (AP-ROP) is the major cause of blindness for premature infants. The automatic diagnosis method has become an important tool for detecting AP-ROP. However, most existing automatic diagnosis methods were with heavy complexity, which hinders the development of the detecting devices. Hence, a small network (student network) with a high imitation ability is exactly needed, which can mimic a large network (teacher network) with promising diagnostic performance. Also, if the student network is too small due to the increasing gap between teacher and student networks, the diagnostic performance will drop. To tackle the above issues, we propose a novel adversarial learning-based multi-level dense knowledge distillation method for detecting AP-ROP. Specifically, the pre-trained teacher network is utilized to train multiple intermediate-size networks (i.e., teacher-assistant networks) and one student network by dense transmission mode, where the knowledge from all upper-level networks is transmitted to the current lower-level network. To ensure that two adjacent networks can distill the abundant knowledge, the adversarial learning module is leveraged to enforce the lower-level network to generate the features that are similar to those of the upper-level network. Extensive experiments demonstrate that our proposed method can realize the effective knowledge distillation from the teacher to student networks. We achieve a promising knowledge distillation performance for our private dataset and a public dataset, which can provide a new insight for devising lightweight detecting systems of fundus diseases for practical use.
Collapse
Affiliation(s)
- Hai Xie
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
| | - Yaling Liu
- Shenzhen Eye Hospital, Jinan University, Shenzhen Eye Institute, Shenzhen, China
| | - Haijun Lei
- Guangdong Province Key Laboratory of Popular High-performance Computers, School of Computer and Software Engineering, Shenzhen University, Shenzhen, China
| | - Tiancheng Song
- Shenzhen Silan Zhichuang Technology Co., Ltd., Shenzhen, China
| | - Guanghui Yue
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
| | - Yueshanyi Du
- Shenzhen Eye Hospital, Jinan University, Shenzhen Eye Institute, Shenzhen, China
| | - Tianfu Wang
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
| | - Guoming Zhang
- Shenzhen Eye Hospital, Jinan University, Shenzhen Eye Institute, Shenzhen, China.
| | - Baiying Lei
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China.
| |
Collapse
|
52
|
Hadjiiski L, Cha K, Chan HP, Drukker K, Morra L, Näppi JJ, Sahiner B, Yoshida H, Chen Q, Deserno TM, Greenspan H, Huisman H, Huo Z, Mazurchuk R, Petrick N, Regge D, Samala R, Summers RM, Suzuki K, Tourassi G, Vergara D, Armato SG. AAPM task group report 273: Recommendations on best practices for AI and machine learning for computer-aided diagnosis in medical imaging. Med Phys 2023; 50:e1-e24. [PMID: 36565447 DOI: 10.1002/mp.16188] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 11/13/2022] [Accepted: 11/22/2022] [Indexed: 12/25/2022] Open
Abstract
Rapid advances in artificial intelligence (AI) and machine learning, and specifically in deep learning (DL) techniques, have enabled broad application of these methods in health care. The promise of the DL approach has spurred further interest in computer-aided diagnosis (CAD) development and applications using both "traditional" machine learning methods and newer DL-based methods. We use the term CAD-AI to refer to this expanded clinical decision support environment that uses traditional and DL-based AI methods. Numerous studies have been published to date on the development of machine learning tools for computer-aided, or AI-assisted, clinical tasks. However, most of these machine learning models are not ready for clinical deployment. It is of paramount importance to ensure that a clinical decision support tool undergoes proper training and rigorous validation of its generalizability and robustness before adoption for patient care in the clinic. To address these important issues, the American Association of Physicists in Medicine (AAPM) Computer-Aided Image Analysis Subcommittee (CADSC) is charged, in part, to develop recommendations on practices and standards for the development and performance assessment of computer-aided decision support systems. The committee has previously published two opinion papers on the evaluation of CAD systems and issues associated with user training and quality assurance of these systems in the clinic. With machine learning techniques continuing to evolve and CAD applications expanding to new stages of the patient care process, the current task group report considers the broader issues common to the development of most, if not all, CAD-AI applications and their translation from the bench to the clinic. The goal is to bring attention to the proper training and validation of machine learning algorithms that may improve their generalizability and reliability and accelerate the adoption of CAD-AI systems for clinical decision support.
Collapse
Affiliation(s)
- Lubomir Hadjiiski
- Department of Radiology, University of Michigan, Ann Arbor, Michigan, USA
| | - Kenny Cha
- U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Heang-Ping Chan
- Department of Radiology, University of Michigan, Ann Arbor, Michigan, USA
| | - Karen Drukker
- Department of Radiology, University of Chicago, Chicago, Illinois, USA
| | - Lia Morra
- Department of Control and Computer Engineering, Politecnico di Torino, Torino, Italy
| | - Janne J Näppi
- 3D Imaging Research, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Berkman Sahiner
- U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Hiroyuki Yoshida
- 3D Imaging Research, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Quan Chen
- Department of Radiation Medicine, University of Kentucky, Lexington, Kentucky, USA
| | - Thomas M Deserno
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Braunschweig, Germany
| | - Hayit Greenspan
- Department of Biomedical Engineering, Faculty of Engineering, Tel Aviv, Israel & Department of Radiology, Ichan School of Medicine, Tel Aviv University, Mt Sinai, New York, New York, USA
| | - Henkjan Huisman
- Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Zhimin Huo
- Tencent America, Palo Alto, California, USA
| | - Richard Mazurchuk
- Division of Cancer Prevention, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | | | - Daniele Regge
- Radiology Unit, Candiolo Cancer Institute, FPO-IRCCS, Candiolo, Italy.,Department of Surgical Sciences, University of Turin, Turin, Italy
| | - Ravi Samala
- U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Ronald M Summers
- Radiology and Imaging Sciences, National Institutes of Health Clinical Center, Bethesda, Maryland, USA
| | - Kenji Suzuki
- Institute of Innovative Research, Tokyo Institute of Technology, Tokyo, Japan
| | | | - Daniel Vergara
- Department of Radiology, Yale New Haven Hospital, New Haven, Connecticut, USA
| | - Samuel G Armato
- Department of Radiology, University of Chicago, Chicago, Illinois, USA
| |
Collapse
|
53
|
A Comprehensive Survey on Federated Learning Techniques for Healthcare Informatics. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2023; 2023:8393990. [PMID: 36909974 PMCID: PMC9995203 DOI: 10.1155/2023/8393990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 04/18/2022] [Accepted: 05/18/2022] [Indexed: 03/06/2023]
Abstract
Healthcare is predominantly regarded as a crucial consideration in promoting the general physical and mental health and well-being of people around the world. The amount of data generated by healthcare systems is enormous, making it challenging to manage. Many machine learning (ML) approaches were implemented to develop dependable and robust solutions to handle the data. ML cannot fully utilize data due to privacy concerns. This primarily happens in the case of medical data. Due to a lack of precise clinical data, the application of ML for the same is challenging and may not yield desired results. Federated learning (FL), which is a recent development in ML where the computation is offloaded to the source of data, appears to be a promising solution to this problem. In this study, we present a detailed survey of applications of FL for healthcare informatics. We initiate a discussion on the need for FL in the healthcare domain, followed by a review of recent review papers. We focus on the fundamentals of FL and the major motivations behind FL for healthcare applications. We then present the applications of FL along with recent state of the art in several verticals of healthcare. Then, lessons learned, open issues, and challenges that are yet to be solved are also highlighted. This is followed by future directions that give directions to the prospective researchers willing to do their research in this domain.
Collapse
|
54
|
Souza R, Mouches P, Wilms M, Tuladhar A, Langner S, Forkert ND. An analysis of the effects of limited training data in distributed learning scenarios for brain age prediction. J Am Med Inform Assoc 2022; 30:112-119. [PMID: 36287916 PMCID: PMC9748540 DOI: 10.1093/jamia/ocac204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 10/08/2022] [Accepted: 10/12/2022] [Indexed: 12/15/2022] Open
Abstract
OBJECTIVE Distributed learning avoids problems associated with central data collection by training models locally at each site. This can be achieved by federated learning (FL) aggregating multiple models that were trained in parallel or training a single model visiting sites sequentially, the traveling model (TM). While both approaches have been applied to medical imaging tasks, their performance in limited local data scenarios remains unknown. In this study, we specifically analyze FL and TM performances when very small sample sizes are available per site. MATERIALS AND METHODS 2025 T1-weighted magnetic resonance imaging scans were used to investigate the effect of sample sizes on FL and TM for brain age prediction. We evaluated models across 18 scenarios varying the number of samples per site (1, 2, 5, 10, and 20) and the number of training rounds (20, 40, and 200). RESULTS Our results demonstrate that the TM outperforms FL, for every sample size examined. In the extreme case when each site provided only one sample, FL achieved a mean absolute error (MAE) of 18.9 ± 0.13 years, while the TM achieved a MAE of 6.21 ± 0.50 years, comparable to central learning (MAE = 5.99 years). DISCUSSION Although FL is more commonly used, our study demonstrates that TM is the best implementation for small sample sizes. CONCLUSION The TM offers new opportunities to apply machine learning models in rare diseases and pediatric research but also allows even small hospitals to contribute small datasets.
Collapse
Affiliation(s)
- Raissa Souza
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Alberta, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, Alberta, Canada
| | - Pauline Mouches
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Alberta, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, Alberta, Canada
| | - Matthias Wilms
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Alberta, Canada
- Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, Alberta, Canada
| | - Anup Tuladhar
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Alberta, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, Alberta, Canada
| | - Sönke Langner
- Institute for Diagnostic Radiology and Neuroradiology, Rostock University Medical Center, Rostock, Germany
| | - Nils D Forkert
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Alberta, Canada
- Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, Alberta, Canada
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| |
Collapse
|
55
|
Kumar R, Kumar J, Khan AA, Zakria, Ali H, Bernard CM, Khan RU, Zeng S. Blockchain and homomorphic encryption based privacy-preserving model aggregation for medical images. Comput Med Imaging Graph 2022; 102:102139. [PMID: 36395604 PMCID: PMC9632244 DOI: 10.1016/j.compmedimag.2022.102139] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 08/04/2022] [Accepted: 10/20/2022] [Indexed: 11/06/2022]
Abstract
Medical healthcare centers are envisioned as a promising paradigm to handle the massive volume of data for COVID-19 patients using artificial intelligence (AI). Traditionally, AI techniques require centralized data collection and training models within a single organization. This practice can be considered a weakness as it leads to several privacy and security concerns related to raw data communication. To overcome this weakness and secure raw data communication, we propose a blockchain-based federated learning framework that provides a solution for collaborative data training. The proposed framework enables the coordination of multiple hospitals to train and share encrypted federated models while preserving data privacy. Blockchain ledger technology provides decentralization of federated learning models without relying on a central server. Moreover, the proposed homomorphic encryption scheme encrypts and decrypts the gradients of the model to preserve privacy. More precisely, the proposed framework: (i) train the local model by a novel capsule network for segmentation and classification of COVID-19 images, (ii) furthermore, we use the homomorphic encryption scheme to secure the local model that encrypts and decrypts the gradients, (iii) finally, the model is shared over a decentralized platform through the proposed blockchain-based federated learning algorithm. The integration of blockchain and federated learning leads to a new paradigm for medical image data sharing over the decentralized network. To validate our proposed model, we conducted comprehensive experiments and the results demonstrate the superior performance of the proposed scheme.
Collapse
Affiliation(s)
- Rajesh Kumar
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, China
| | - Jay Kumar
- Institute for Big Data Analytics – Dalhousie University, 6299 South St, Halifax NS B3H4R2, Canada
| | - Abdullah Aman Khan
- Sichuan Artificial Intelligence Research Institute, Yibin 644000, China,School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Zakria
- School of Software Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Hub Ali
- Institute of Automation,Chinese Academy of Sciences, Beijing 100190, China
| | - Cobbinah M. Bernard
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Riaz Ullah Khan
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, China
| | - Shaoning Zeng
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, China,Corresponding author
| |
Collapse
|
56
|
Zhang A, Xing L, Zou J, Wu JC. Shifting machine learning for healthcare from development to deployment and from models to data. Nat Biomed Eng 2022; 6:1330-1345. [PMID: 35788685 DOI: 10.1038/s41551-022-00898-y] [Citation(s) in RCA: 58] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Accepted: 05/03/2022] [Indexed: 01/14/2023]
Abstract
In the past decade, the application of machine learning (ML) to healthcare has helped drive the automation of physician tasks as well as enhancements in clinical capabilities and access to care. This progress has emphasized that, from model development to model deployment, data play central roles. In this Review, we provide a data-centric view of the innovations and challenges that are defining ML for healthcare. We discuss deep generative models and federated learning as strategies to augment datasets for improved model performance, as well as the use of the more recent transformer models for handling larger datasets and enhancing the modelling of clinical text. We also discuss data-focused problems in the deployment of ML, emphasizing the need to efficiently deliver data to ML models for timely clinical predictions and to account for natural data shifts that can deteriorate model performance.
Collapse
Affiliation(s)
- Angela Zhang
- Stanford Cardiovascular Institute, School of Medicine, Stanford University, Stanford, CA, USA. .,Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA. .,Greenstone Biosciences, Palo Alto, CA, USA. .,Department of Computer Science, Stanford University, Stanford, CA, USA.
| | - Lei Xing
- Department of Radiation Oncology, School of Medicine, Stanford University, Stanford, CA, USA
| | - James Zou
- Department of Computer Science, Stanford University, Stanford, CA, USA.,Department of Biomedical Informatics, School of Medicine, Stanford University, Stanford, CA, USA
| | - Joseph C Wu
- Stanford Cardiovascular Institute, School of Medicine, Stanford University, Stanford, CA, USA. .,Greenstone Biosciences, Palo Alto, CA, USA. .,Departments of Medicine, Division of Cardiovascular Medicine Stanford University, Stanford, CA, USA. .,Department of Radiology, School of Medicine, Stanford University, Stanford, CA, USA.
| |
Collapse
|
57
|
Nguyen TX, Ran AR, Hu X, Yang D, Jiang M, Dou Q, Cheung CY. Federated Learning in Ocular Imaging: Current Progress and Future Direction. Diagnostics (Basel) 2022; 12:2835. [PMID: 36428895 PMCID: PMC9689273 DOI: 10.3390/diagnostics12112835] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 11/11/2022] [Accepted: 11/14/2022] [Indexed: 11/18/2022] Open
Abstract
Advances in artificial intelligence deep learning (DL) have made tremendous impacts on the field of ocular imaging over the last few years. Specifically, DL has been utilised to detect and classify various ocular diseases on retinal photographs, optical coherence tomography (OCT) images, and OCT-angiography images. In order to achieve good robustness and generalisability of model performance, DL training strategies traditionally require extensive and diverse training datasets from various sites to be transferred and pooled into a "centralised location". However, such a data transferring process could raise practical concerns related to data security and patient privacy. Federated learning (FL) is a distributed collaborative learning paradigm which enables the coordination of multiple collaborators without the need for sharing confidential data. This distributed training approach has great potential to ensure data privacy among different institutions and reduce the potential risk of data leakage from data pooling or centralisation. This review article aims to introduce the concept of FL, provide current evidence of FL in ocular imaging, and discuss potential challenges as well as future applications.
Collapse
Affiliation(s)
- Truong X. Nguyen
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - An Ran Ran
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Xiaoyan Hu
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Dawei Yang
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Meirui Jiang
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Qi Dou
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Carol Y. Cheung
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
58
|
Shi C, Xin X, Zhang J. A novel multigranularity feature-selection method based on neighborhood mutual information and its application in autistic patient identification. Biomed Signal Process Control 2022. [DOI: 10.1016/j.bspc.2022.103887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
59
|
Cobbinah BM, Sorg C, Yang Q, Ternblom A, Zheng C, Han W, Che L, Shao J. Reducing variations in multi-center Alzheimer's disease classification with convolutional adversarial autoencoder. Med Image Anal 2022; 82:102585. [PMID: 36057187 DOI: 10.1016/j.media.2022.102585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Revised: 07/22/2022] [Accepted: 08/15/2022] [Indexed: 11/29/2022]
Abstract
Based on brain magnetic resonance imaging (MRI), multiple variations ranging from MRI scanners to center-specific parameter settings, imaging protocols, and brain region-of-interest (ROI) definitions pose a big challenge for multi-center Alzheimer's disease characterization and classification. Existing approaches to reduce such variations require intricate multi-step, often manual preprocessing pipelines, including skull stripping, segmentation, registration, cortical reconstruction, and ROI outlining. Such procedures are time-consuming, and more importantly, tend to be user biased. Contrasting costly and biased preprocessing pipelines, the question arises whether we can design a deep learning model to automatically reduce these variations from multiple centers for Alzheimer's disease classification? In this study, we used T1 and T2-weighted structural MRI from Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset based on three groups with 375 subjects, respectively: patients with Alzheimer's disease (AD) dementia, with mild cognitive impairment (MCI), and healthy controls (HC); to test our approach, we defined AD classification as classifying an individual's structural image to one of the three group labels. We first introduced a convolutional adversarial autoencoder (CAAE) to reduce the variations existing in multi-center raw MRI scans by automatically registering them into a common aligned space. Afterward, a convolutional residual soft attention network (CRAT) was further proposed for AD classification. Canonical classification procedures demonstrated that our model achieved classification accuracies of 91.8%, 90.05%, and 88.10% for the 2-way classification tasks using the RAW aligned MRI scans, including AD vs. HC, AD vs. MCI, and MCI vs. HC, respectively. Thus, our automated approach achieves comparable or even better classification performance by comparing it with many baselines with dedicated conventional preprocessing pipelines. Furthermore, the uncovered brain hotpots, i.e., hippocampus, amygdala, and temporal pole, are consistent with previous studies.
Collapse
Affiliation(s)
- Bernard M Cobbinah
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Christian Sorg
- Department of Neuroradiology, TUM-NIC Neuroimaging Center of Technical University Munich, Germany
| | - Qinli Yang
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Arvid Ternblom
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Changgang Zheng
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Wei Han
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Liwei Che
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China
| | - Junming Shao
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731 Chengdu, China; Center for Information in BioMedicine, University of Electronic Science and Technology of China, 611731 Chengdu, China; Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, China.
| |
Collapse
|
60
|
Bercea CI, Wiestler B, Rueckert D, Albarqouni S. Federated disentangled representation learning for unsupervised brain anomaly detection. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-022-00515-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
61
|
Zhang L, Vashisht H, Totev A, Trinh N, Ward T. A comparison of distributed machine learning methods for the support of “many labs” collaborations in computational modeling of decision making. Front Psychol 2022; 13:943198. [PMID: 36092038 PMCID: PMC9453750 DOI: 10.3389/fpsyg.2022.943198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 07/25/2022] [Indexed: 12/03/2022] Open
Abstract
Deep learning models are powerful tools for representing the complex learning processes and decision-making strategies used by humans. Such neural network models make fewer assumptions about the underlying mechanisms thus providing experimental flexibility in terms of applicability. However, this comes at the cost of involving a larger number of parameters requiring significantly more data for effective learning. This presents practical challenges given that most cognitive experiments involve relatively small numbers of subjects. Laboratory collaborations are a natural way to increase overall dataset size. However, data sharing barriers between laboratories as necessitated by data protection regulations encourage the search for alternative methods to enable collaborative data science. Distributed learning, especially federated learning (FL), which supports the preservation of data privacy, is a promising method for addressing this issue. To verify the reliability and feasibility of applying FL to train neural networks models used in the characterization of decision making, we conducted experiments on a real-world, many-labs data pool including experiment data-sets from ten independent studies. The performance of single models trained on single laboratory data-sets was poor. This unsurprising finding supports the need for laboratory collaboration to train more reliable models. To that end we evaluated four collaborative approaches. The first approach represents conventional centralized learning (CL-based) and is the optimal approach but requires complete sharing of data which we wish to avoid. The results however establish a benchmark for the other three approaches, federated learning (FL-based), incremental learning (IL-based), and cyclic incremental learning (CIL-based). We evaluate these approaches in terms of prediction accuracy and capacity to characterize human decision-making strategies. The FL-based model achieves performance most comparable to that of the CL-based model. This indicates that FL has value in scaling data science methods to data collected in computational modeling contexts when data sharing is not convenient, practical or permissible.
Collapse
Affiliation(s)
- Lili Zhang
- School of Computing, Dublin City University, Dublin, Ireland
- Insight Science Foundation Ireland Research Centre for Data Analytics, Dublin, Ireland
- *Correspondence: Lili Zhang
| | | | - Andrey Totev
- School of Computing, Dublin City University, Dublin, Ireland
| | - Nam Trinh
- School of Computing, Dublin City University, Dublin, Ireland
| | - Tomas Ward
- School of Computing, Dublin City University, Dublin, Ireland
- Insight Science Foundation Ireland Research Centre for Data Analytics, Dublin, Ireland
| |
Collapse
|
62
|
Ziegler J, Pfitzner B, Schulz H, Saalbach A, Arnrich B. Defending against Reconstruction Attacks through Differentially Private Federated Learning for Classification of Heterogeneous Chest X-ray Data. SENSORS (BASEL, SWITZERLAND) 2022; 22:5195. [PMID: 35890875 PMCID: PMC9320045 DOI: 10.3390/s22145195] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 06/30/2022] [Accepted: 07/05/2022] [Indexed: 06/15/2023]
Abstract
Privacy regulations and the physical distribution of heterogeneous data are often primary concerns for the development of deep learning models in a medical context. This paper evaluates the feasibility of differentially private federated learning for chest X-ray classification as a defense against data privacy attacks. To the best of our knowledge, we are the first to directly compare the impact of differentially private training on two different neural network architectures, DenseNet121 and ResNet50. Extending the federated learning environments previously analyzed in terms of privacy, we simulated a heterogeneous and imbalanced federated setting by distributing images from the public CheXpert and Mendeley chest X-ray datasets unevenly among 36 clients. Both non-private baseline models achieved an area under the receiver operating characteristic curve (AUC) of 0.94 on the binary classification task of detecting the presence of a medical finding. We demonstrate that both model architectures are vulnerable to privacy violation by applying image reconstruction attacks to local model updates from individual clients. The attack was particularly successful during later training stages. To mitigate the risk of a privacy breach, we integrated Rényi differential privacy with a Gaussian noise mechanism into local model training. We evaluate model performance and attack vulnerability for privacy budgets ε∈{1,3,6,10}. The DenseNet121 achieved the best utility-privacy trade-off with an AUC of 0.94 for ε=6. Model performance deteriorated slightly for individual clients compared to the non-private baseline. The ResNet50 only reached an AUC of 0.76 in the same privacy setting. Its performance was inferior to that of the DenseNet121 for all considered privacy constraints, suggesting that the DenseNet121 architecture is more robust to differentially private training.
Collapse
Affiliation(s)
- Joceline Ziegler
- Digital Engineering Faculty, University of Potsdam, 14482 Potsdam, Germany; (B.P.); (B.A.)
- Hasso Plattner Institute for Digital Engineering gGmbH, 14482 Potsdam, Germany
| | - Bjarne Pfitzner
- Digital Engineering Faculty, University of Potsdam, 14482 Potsdam, Germany; (B.P.); (B.A.)
- Hasso Plattner Institute for Digital Engineering gGmbH, 14482 Potsdam, Germany
| | | | - Axel Saalbach
- Philips Research, 22335 Hamburg, Germany; (H.S.); (A.S.)
| | - Bert Arnrich
- Digital Engineering Faculty, University of Potsdam, 14482 Potsdam, Germany; (B.P.); (B.A.)
- Hasso Plattner Institute for Digital Engineering gGmbH, 14482 Potsdam, Germany
| |
Collapse
|
63
|
Jiang W, Liu S, Zhang H, Sun X, Wang SH, Zhao J, Yan J. CNNG: A Convolutional Neural Networks With Gated Recurrent Units for Autism Spectrum Disorder Classification. Front Aging Neurosci 2022; 14:948704. [PMID: 35865746 PMCID: PMC9294312 DOI: 10.3389/fnagi.2022.948704] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 06/16/2022] [Indexed: 12/12/2022] Open
Abstract
As a neurodevelopmental disorder, autism spectrum disorder (ASD) severely affects the living conditions of patients and their families. Early diagnosis of ASD can enable the disease to be effectively intervened in the early stage of development. In this paper, we present an ASD classification network defined as CNNG by combining of convolutional neural network (CNN) and gate recurrent unit (GRU). First, CNNG extracts the 3D spatial features of functional magnetic resonance imaging (fMRI) data by using the convolutional layer of the 3D CNN. Second, CNNG extracts the temporal features by using the GRU and finally classifies them by using the Sigmoid function. The performance of CNNG was validated on the international public data—autism brain imaging data exchange (ABIDE) dataset. According to the experiments, CNNG can be highly effective in extracting the spatio-temporal features of fMRI and achieving a classification accuracy of 72.46%.
Collapse
Affiliation(s)
- Wenjing Jiang
- College of Electronic and Information Engineering, Hebei University, Baoding, China
- Machine Vision Technological Innovation Center of Hebei, Baoding, China
| | - Shuaiqi Liu
- College of Electronic and Information Engineering, Hebei University, Baoding, China
- Machine Vision Technological Innovation Center of Hebei, Baoding, China
| | - Hong Zhang
- College of Electronic and Information Engineering, Hebei University, Baoding, China
- Machine Vision Technological Innovation Center of Hebei, Baoding, China
| | - Xiuming Sun
- School of Mathematics and Information Science, Zhangjiakou University, Zhangjiakou, China
- *Correspondence: Xiuming Sun,
| | - Shui-Hua Wang
- School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, China
| | - Jie Zhao
- College of Electronic and Information Engineering, Hebei University, Baoding, China
- Machine Vision Technological Innovation Center of Hebei, Baoding, China
| | - Jingwen Yan
- School of Engineering, Shantou University, Shantou, China
| |
Collapse
|
64
|
Lin QH, Niu YW, Sui J, Zhao WD, Zhuo C, Calhoun VD. SSPNet: An interpretable 3D-CNN for classification of schizophrenia using phase maps of resting-state complex-valued fMRI data. Med Image Anal 2022; 79:102430. [PMID: 35397470 DOI: 10.1016/j.media.2022.102430] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 03/16/2022] [Accepted: 03/18/2022] [Indexed: 01/05/2023]
Abstract
Convolutional neural networks (CNNs) have shown promising results in classifying individuals with mental disorders such as schizophrenia using resting-state fMRI data. However, complex-valued fMRI data is rarely used since additional phase data introduces high-level noise though it is potentially useful information for the context of classification. As such, we propose to use spatial source phase (SSP) maps derived from complex-valued fMRI data as the CNN input. The SSP maps are not only less noisy, but also more sensitive to spatial activation changes caused by mental disorders than magnitude maps. We build a 3D-CNN framework with two convolutional layers (named SSPNet) to fully explore the 3D structure and voxel-level relationships from the SSP maps. Two interpretability modules, consisting of saliency map generation and gradient-weighted class activation mapping (Grad-CAM), are incorporated into the well-trained SSPNet to provide additional information helpful for understanding the output. Experimental results from classifying schizophrenia patients (SZs) and healthy controls (HCs) show that the proposed SSPNet significantly improved accuracy and AUC compared to CNN using magnitude maps extracted from either magnitude-only (by 23.4 and 23.6% for DMN) or complex-valued fMRI data (by 10.6 and 5.8% for DMN). SSPNet captured more prominent HC-SZ differences in saliency maps, and Grad-CAM localized all contributing brain regions with opposite strengths for HCs and SZs within SSP maps. These results indicate the potential of SSPNet as a sensitive tool that may be useful for the development of brain-based biomarkers of mental disorders.
Collapse
Affiliation(s)
- Qiu-Hua Lin
- School of Information and Communication Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China.
| | - Yan-Wei Niu
- School of Information and Communication Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China
| | - Jing Sui
- State Key Laboratory of Brain Cognition and Learning, Beijing Normal University, Beijing, 100875, China
| | - Wen-Da Zhao
- School of Information and Communication Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China
| | - Chuanjun Zhuo
- Department of Psychiatry, The Fourth Center Hospital of Tianjin, Tianjin Medical University Affiliated Fourth Center Hospital, Tianjin 300140, China
| | - Vince D Calhoun
- Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS) Georgia State University, Georgia Institute of Technology, Emory University, Atlanta, USA
| |
Collapse
|
65
|
Eva G, Liese G, Stephanie B, Petr H, Leslie M, Roel V, Martine V, Sergi B, Mette H, Sarah J, Laura RM, Arnout S, Morris A S, Jan T, Xenia T, Nina V, Koert VE, Sylvie R, Greet S. Position paper on management of personal data in environment and health research in Europe. ENVIRONMENT INTERNATIONAL 2022; 165:107334. [PMID: 35696847 DOI: 10.1016/j.envint.2022.107334] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 05/30/2022] [Accepted: 06/01/2022] [Indexed: 06/15/2023]
Abstract
Management of datasets that include health information and other sensitive personal information of European study participants has to be compliant with the General Data Protection Regulation (GDPR, Regulation (EU) 2016/679). Within scientific research, the widely subscribed'FAIR' data principles should apply, meaning that research data should be findable, accessible, interoperable and re-usable. Balancing the aim of open science driven FAIR data management with GDPR compliant personal data protection safeguards is now a common challenge for many research projects dealing with (sensitive) personal data. In December 2020 a workshop was held with representatives of several large EU research consortia and of the European Commission to reflect on how to apply the FAIR data principles for environment and health research (E&H). Several recent data intensive EU funded E&H research projects face this challenge and work intensively towards developing solutions to access, exchange, store, handle, share, process and use such sensitive personal data, with the aim to support European and transnational collaborations. As a result, several recommendations, opportunities and current limitations were formulated. New technical developments such as federated data management and analysis systems, machine learning together with advanced search software, harmonized ontologies and data quality standards should in principle facilitate the FAIRification of data. To address ethical, legal, political and financial obstacles to the wider re-use of data for research purposes, both specific expertise and underpinning infrastructure are needed. There is a need for the E&H research data to find their place in the European Open Science Cloud. Communities using health and population data, environmental data and other publicly available data have to interconnect and synergize. To maximize the use and re-use of environment and health data, a dedicated supporting European infrastructure effort, such as the EIRENE research infrastructure within the ESFRI roadmap 2021, is needed that would interact with existing infrastructures.
Collapse
Affiliation(s)
- Govarts Eva
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium.
| | - Gilles Liese
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Bopp Stephanie
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| | | | - Matalonga Leslie
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Vermeulen Roel
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Vrijheid Martine
- ISGlobal, Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Beltran Sergi
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), Barcelona, Spain
| | - Hartlev Mette
- Faculty of Law, University of Copenhagen, Copenhagen, Denmark
| | | | | | - Standaert Arnout
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Swertz Morris A
- Department of Genetics & Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Theunis Jan
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Trier Xenia
- European Environment Agency (EEA), Copenhagen, Denmark
| | - Vogel Nina
- German Environment Agency (UBA), Berlin, Germany
| | | | - Remy Sylvie
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Schoeters Greet
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium; Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
| |
Collapse
|
66
|
Liu X, Zheng Y, Yuan X, Yi X. Deep learning-based medical diagnostic services: A secure, lightweight, and accurate realization1. JOURNAL OF COMPUTER SECURITY 2022. [DOI: 10.3233/jcs-210165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In this paper, we propose CryptMed, a system framework that enables medical service providers to offer secure, lightweight, and accurate medical diagnostic service to their customers via an execution of neural network inference in the ciphertext domain. CryptMed ensures the privacy of both parties with cryptographic guarantees. Our technical contributions include: 1) presenting a secret sharing based inference protocol that can well cope with the commonly-used linear and non-linear NN layers; 2) devising optimized secure comparison function that can efficiently support comparison-based activation functions in NN architectures; 3) constructing a suite of secure smooth functions built on precise approximation approaches for accurate medical diagnoses. We evaluate CryptMed on 6 neural network architectures across a wide range of non-linear activation functions over two benchmark and four real-world medical datasets. We comprehensively compare our system with prior art in terms of end-to-end service workload and prediction accuracy. Our empirical results demonstrate that CryptMed achieves up to respectively 413 ×, 19 ×, and 43 × bandwidth savings for MNIST, CIFAR-10, and medical applications compared with prior art. For the smooth activation based inference, the best choice of our proposed approximations preserve the precision of original functions, with less than 1.2% accuracy loss and could enhance the precision due to the newly introduced activation function family.
Collapse
Affiliation(s)
- Xiaoning Liu
- School of Computing Technologies, RMIT University, Melbourne, VIC 3001, Australia
| | - Yifeng Zheng
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, China
| | - Xingliang Yuan
- Faculty of Information Technology, Monash University, Clayton, VIC 3800, Australia
| | - Xun Yi
- School of Computing Technologies, RMIT University, Melbourne, VIC 3001, Australia
| |
Collapse
|
67
|
Wang Y, Fu Y, Luo X. Identification of Pathogenetic Brain Regions via Neuroimaging Data for Diagnosis of Autism Spectrum Disorders. Front Neurosci 2022; 16:900330. [PMID: 35655751 PMCID: PMC9152096 DOI: 10.3389/fnins.2022.900330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Accepted: 04/11/2022] [Indexed: 11/13/2022] Open
Abstract
Autism spectrum disorder (ASD) is a kind of neurodevelopmental disorder that often occurs in children and has a hidden onset. Patients usually have lagged development of communication ability and social behavior and thus suffer an unhealthy physical and mental state. Evidence has indicated that diseases related to ASD have commonalities in brain imaging characteristics. This study aims to study the pathogenesis of ASD based on brain imaging data to locate the ASD-related brain regions. Specifically, we collected the functional magnetic resonance image data of 479 patients with ASD and 478 normal subjects matched in age and gender and used a machine-learning framework named random support vector machine cluster to extract distinctive brain regions from the preprocessed data. According to the experimental results, compared with other existing approaches, the method used in this study can more accurately distinguish patients from normal individuals based on brain imaging data. At the same time, this study found that the development of ASD was highly correlated with certain brain regions, e.g., lingual gyrus, superior frontal gyrus, medial gyrus, insular lobe, and olfactory cortex. This study explores the effectiveness of a novel machine-learning approach in the study of ASD brain imaging and provides a reference brain area for the medical research and clinical treatment of ASD.
Collapse
Affiliation(s)
- Yu Wang
- Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha, China
- College of Information Science and Engineering, Hunan Normal University, Changsha, China
- Hunan Xiangjiang Artificial Intelligence Academy, Changsha, China
| | - Yu Fu
- Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha, China
- College of Information Science and Engineering, Hunan Normal University, Changsha, China
- Hunan Xiangjiang Artificial Intelligence Academy, Changsha, China
- *Correspondence: Yu Fu
| | - Xun Luo
- Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha, China
- College of Information Science and Engineering, Hunan Normal University, Changsha, China
- Hunan Xiangjiang Artificial Intelligence Academy, Changsha, China
| |
Collapse
|
68
|
Crowson MG, Moukheiber D, Arévalo AR, Lam BD, Mantena S, Rana A, Goss D, Bates DW, Celi LA. A systematic review of federated learning applications for biomedical data. PLOS DIGITAL HEALTH 2022; 1:e0000033. [PMID: 36812504 PMCID: PMC9931322 DOI: 10.1371/journal.pdig.0000033] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 03/30/2022] [Indexed: 11/18/2022]
Abstract
OBJECTIVES Federated learning (FL) allows multiple institutions to collaboratively develop a machine learning algorithm without sharing their data. Organizations instead share model parameters only, allowing them to benefit from a model built with a larger dataset while maintaining the privacy of their own data. We conducted a systematic review to evaluate the current state of FL in healthcare and discuss the limitations and promise of this technology. METHODS We conducted a literature search using PRISMA guidelines. At least two reviewers assessed each study for eligibility and extracted a predetermined set of data. The quality of each study was determined using the TRIPOD guideline and PROBAST tool. RESULTS 13 studies were included in the full systematic review. Most were in the field of oncology (6 of 13; 46.1%), followed by radiology (5 of 13; 38.5%). The majority evaluated imaging results, performed a binary classification prediction task via offline learning (n = 12; 92.3%), and used a centralized topology, aggregation server workflow (n = 10; 76.9%). Most studies were compliant with the major reporting requirements of the TRIPOD guidelines. In all, 6 of 13 (46.2%) of studies were judged at high risk of bias using the PROBAST tool and only 5 studies used publicly available data. CONCLUSION Federated learning is a growing field in machine learning with many promising uses in healthcare. Few studies have been published to date. Our evaluation found that investigators can do more to address the risk of bias and increase transparency by adding steps for data homogeneity or sharing required metadata and code.
Collapse
Affiliation(s)
- Matthew G. Crowson
- Department of Otolaryngology-Head & Neck Surgery, Massachusetts Eye & Ear, Boston, Massachusetts, United States of America
- Department of Otolaryngology-Head & Neck Surgery, Harvard Medical School, Massachusetts, United States of America
| | - Dana Moukheiber
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, United States of America
| | - Aldo Robles Arévalo
- IDMEC, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
- Data & Analytics, NTT DATA Portugal, Lisbon, Portugal
| | - Barbara D. Lam
- Department of Hematology & Oncology, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Sreekar Mantena
- Harvard College, Boston, Massachusetts, United States of America
| | - Aakanksha Rana
- Massachusetts Institute of Technology, Boston, Massachusetts, United States of America
| | - Deborah Goss
- Department of Otolaryngology-Head & Neck Surgery, Massachusetts Eye & Ear, Boston, Massachusetts, United States of America
| | - David W. Bates
- Division of General Internal Medicine and Primary Care, Brigham and Women’s Hospital, Boston, MA, United States of America
- Department of Health Policy and Management, Harvard T. H. Chan School of Public Health, Boston, MA, United States of America
| | - Leo Anthony Celi
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| |
Collapse
|
69
|
Darzidehkalani E, Ghasemi-Rad M, van Ooijen PMA. Federated Learning in Medical Imaging: Part II: Methods, Challenges, and Considerations. J Am Coll Radiol 2022; 19:975-982. [PMID: 35483437 DOI: 10.1016/j.jacr.2022.03.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 03/29/2022] [Indexed: 11/28/2022]
Abstract
Federated learning is a machine learning method that allows decentralized training of deep neural networks among multiple clients while preserving the privacy of each client's data. Federated learning is instrumental in medical imaging due to the privacy considerations of medical data. Setting up federated networks in hospitals comes with unique challenges, primarily because medical imaging data and federated learning algorithms each have their own set of distinct characteristics. This article introduces federated learning algorithms in medical imaging and discusses technical challenges and considerations of real-world implementation of them.
Collapse
Affiliation(s)
- Erfan Darzidehkalani
- Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands; Machine Learning Lab, Data Science Center in Health, University Medical Center Groningen, University of Groningen, the Netherlands.
| | - Mohammad Ghasemi-Rad
- Department of Interventional Radiology, Baylor College of Medicine, Houston, Texas
| | - P M A van Ooijen
- Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands; Machine Learning Lab, Data Science Center in Health, University Medical Center Groningen, University of Groningen, the Netherlands
| |
Collapse
|
70
|
Stripelis D, Thompson PM, Ambite JL. Semi-Synchronous Federated Learning for Energy-Efficient Training and Accelerated Convergence in Cross-Silo Settings. ACM T INTEL SYST TEC 2022. [DOI: 10.1145/3524885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022]
Abstract
There are situations where data relevant to machine learning problems are distributed across multiple locations that cannot share the data due to regulatory, competitiveness, or privacy reasons. Machine learning approaches that require data to be copied to a single location are hampered by the challenges of data sharing. Federated Learning (FL) is a promising approach to learn a joint model over all the available data across silos. In many cases, the sites participating in a federation have different data distributions and computational capabilities. In these heterogeneous environments existing approaches exhibit poor performance: synchronous FL protocols are communication efficient, but have slow learning convergence and high energy cost; conversely, asynchronous FL protocols have faster convergence with lower energy cost, but higher communication. In this work, we introduce a novel energy-efficient
Semi-Synchronous Federated Learning
protocol that mixes local models periodically with minimal idle time and fast convergence. We show through extensive experiments over established benchmark datasets in the computer-vision domain as well as in real-world biomedical settings that our approach significantly outperforms previous work in
data and computationally heterogeneous environments
.
Collapse
Affiliation(s)
| | - Paul M. Thompson
- Imaging Genetics Center, Stevens Neuroimaging and Informatics Institute, University of Southern California, USA
| | - José Luis Ambite
- Information Sciences Institute, University of Southern California, USA
| |
Collapse
|
71
|
Panda R, Kalmady SV, Greiner R. Multi-Source Domain Adaptation Techniques for Mitigating Batch Effects: A Comparative Study. Front Neuroinform 2022; 16:805117. [PMID: 35528213 PMCID: PMC9067602 DOI: 10.3389/fninf.2022.805117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 03/15/2022] [Indexed: 11/13/2022] Open
Abstract
The past decade has seen an increasing number of applications of deep learning (DL) techniques to biomedical fields, especially in neuroimaging-based analysis. Such DL-based methods are generally data-intensive and require a large number of training instances, which might be infeasible to acquire from a single acquisition site, especially for data, such as fMRI scans, due to the time and costs that they demand. We can attempt to address this issue by combining fMRI data from various sites, thereby creating a bigger heterogeneous dataset. Unfortunately, the inherent differences in the combined data, known as batch effects, often hamper learning a model. To mitigate this issue, techniques such as multi-source domain adaptation [Multi-source Domain Adversarial Networks (MSDA)] aim at learning an effective classification function that uses (learned) domain-invariant latent features. This article analyzes and compares the performance of various popular MSDA methods [MDAN, Domain AggRegation Networks (DARN), Multi-Domain Matching Networks (MDMN), and Moment Matching for MSDA (M3SDA)] at predicting different labels (illness, age, and sex) of images from two public rs-fMRI datasets: ABIDE 1and ADHD-200. It also evaluates the impact of various conditions such as class imbalance, the number of sites along with a comparison of the degree of adaptation of each of the methods, thereby presenting the effectiveness of MSDA models in neuroimaging-based applications.
Collapse
Affiliation(s)
- Rohan Panda
- Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Sunil Vasu Kalmady
- Canadian VIGOUR Centre, University of Alberta, Edmonton, AB, Canada
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
| | - Russell Greiner
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
- Alberta Machine Intelligence Institute, Edmonton, AB, Canada
- Department of Psychiatry, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
72
|
Decentralized Distributed Multi-institutional PET Image Segmentation Using a Federated Deep Learning Framework. Clin Nucl Med 2022; 47:606-617. [PMID: 35442222 DOI: 10.1097/rlu.0000000000004194] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
PURPOSE The generalizability and trustworthiness of deep learning (DL)-based algorithms depend on the size and heterogeneity of training datasets. However, because of patient privacy concerns and ethical and legal issues, sharing medical images between different centers is restricted. Our objective is to build a federated DL-based framework for PET image segmentation utilizing a multicentric dataset and to compare its performance with the centralized DL approach. METHODS PET images from 405 head and neck cancer patients from 9 different centers formed the basis of this study. All tumors were segmented manually. PET images converted to SUV maps were resampled to isotropic voxels (3 × 3 × 3 mm3) and then normalized. PET image subvolumes (12 × 12 × 12 cm3) consisting of whole tumors and background were analyzed. Data from each center were divided into train/validation (80% of patients) and test sets (20% of patients). The modified R2U-Net was used as core DL model. A parallel federated DL model was developed and compared with the centralized approach where the data sets are pooled to one server. Segmentation metrics, including Dice similarity and Jaccard coefficients, percent relative errors (RE%) of SUVpeak, SUVmean, SUVmedian, SUVmax, metabolic tumor volume, and total lesion glycolysis were computed and compared with manual delineations. RESULTS The performance of the centralized versus federated DL methods was nearly identical for segmentation metrics: Dice (0.84 ± 0.06 vs 0.84 ± 0.05) and Jaccard (0.73 ± 0.08 vs 0.73 ± 0.07). For quantitative PET parameters, we obtained comparable RE% for SUVmean (6.43% ± 4.72% vs 6.61% ± 5.42%), metabolic tumor volume (12.2% ± 16.2% vs 12.1% ± 15.89%), and total lesion glycolysis (6.93% ± 9.6% vs 7.07% ± 9.85%) and negligible RE% for SUVmax and SUVpeak. No significant differences in performance (P > 0.05) between the 2 frameworks (centralized vs federated) were observed. CONCLUSION The developed federated DL model achieved comparable quantitative performance with respect to the centralized DL model. Federated DL models could provide robust and generalizable segmentation, while addressing patient privacy and legal and ethical issues in clinical data sharing.
Collapse
|
73
|
Chi NA, Washington P, Kline A, Husic A, Hou C, He C, Dunlap K, Wall DP. Classifying Autism From Crowdsourced Semistructured Speech Recordings: Machine Learning Model Comparison Study. JMIR Pediatr Parent 2022; 5:e35406. [PMID: 35436234 PMCID: PMC9052034 DOI: 10.2196/35406] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 01/18/2022] [Accepted: 01/25/2022] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Autism spectrum disorder (ASD) is a neurodevelopmental disorder that results in altered behavior, social development, and communication patterns. In recent years, autism prevalence has tripled, with 1 in 44 children now affected. Given that traditional diagnosis is a lengthy, labor-intensive process that requires the work of trained physicians, significant attention has been given to developing systems that automatically detect autism. We work toward this goal by analyzing audio data, as prosody abnormalities are a signal of autism, with affected children displaying speech idiosyncrasies such as echolalia, monotonous intonation, atypical pitch, and irregular linguistic stress patterns. OBJECTIVE We aimed to test the ability for machine learning approaches to aid in detection of autism in self-recorded speech audio captured from children with ASD and neurotypical (NT) children in their home environments. METHODS We considered three methods to detect autism in child speech: (1) random forests trained on extracted audio features (including Mel-frequency cepstral coefficients); (2) convolutional neural networks trained on spectrograms; and (3) fine-tuned wav2vec 2.0-a state-of-the-art transformer-based speech recognition model. We trained our classifiers on our novel data set of cellphone-recorded child speech audio curated from the Guess What? mobile game, an app designed to crowdsource videos of children with ASD and NT children in a natural home environment. RESULTS The random forest classifier achieved 70% accuracy, the fine-tuned wav2vec 2.0 model achieved 77% accuracy, and the convolutional neural network achieved 79% accuracy when classifying children's audio as either ASD or NT. We used 5-fold cross-validation to evaluate model performance. CONCLUSIONS Our models were able to predict autism status when trained on a varied selection of home audio clips with inconsistent recording qualities, which may be more representative of real-world conditions. The results demonstrate that machine learning methods offer promise in detecting autism automatically from speech without specialized equipment.
Collapse
Affiliation(s)
- Nathan A Chi
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Peter Washington
- Department of Bioengineering, Stanford University, Stanford, CA, United States
| | - Aaron Kline
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Arman Husic
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Cathy Hou
- Department of Computer Science, Stanford University, Stanford, CA, United States
| | - Chloe He
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Kaitlyn Dunlap
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Dennis P Wall
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, United States
| |
Collapse
|
74
|
Federated learning for multi-center imaging diagnostics: a simulation study in cardiovascular disease. Sci Rep 2022; 12:3551. [PMID: 35241683 PMCID: PMC8894335 DOI: 10.1038/s41598-022-07186-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 02/02/2022] [Indexed: 12/25/2022] Open
Abstract
Deep learning models can enable accurate and efficient disease diagnosis, but have thus far been hampered by the data scarcity present in the medical world. Automated diagnosis studies have been constrained by underpowered single-center datasets, and although some results have shown promise, their generalizability to other institutions remains questionable as the data heterogeneity between institutions is not taken into account. By allowing models to be trained in a distributed manner that preserves patients’ privacy, federated learning promises to alleviate these issues, by enabling diligent multi-center studies. We present the first simulated federated learning study on the modality of cardiovascular magnetic resonance and use four centers derived from subsets of the M&M and ACDC datasets, focusing on the diagnosis of hypertrophic cardiomyopathy. We adapt a 3D-CNN network pretrained on action recognition and explore two different ways of incorporating shape prior information to the model, and four different data augmentation set-ups, systematically analyzing their impact on the different collaborative learning choices. We show that despite the small size of data (180 subjects derived from four centers), the privacy preserving federated learning achieves promising results that are competitive with traditional centralized learning. We further find that federatively trained models exhibit increased robustness and are more sensitive to domain shift effects.
Collapse
|
75
|
Shi CL, Xin XW, Zhang JC. Domain adaptation based on rough adjoint inconsistency and optimal transport for identifying autistic patients. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 215:106615. [PMID: 35016084 DOI: 10.1016/j.cmpb.2021.106615] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 12/09/2021] [Accepted: 12/30/2021] [Indexed: 06/14/2023]
Abstract
BACKGROUND AND OBJECTIVE Computer aided diagnosis technology has been widely used to diagnose autism spectrum disorder (ASD) from neural images. The performance of the model usually depends largely on a sufficient number of training samples that reflect the real sample distribution. Due to the lack of labelled neural images data, multisite data are often pooled together to expand the sample size. However, the heterogeneity among sites will inevitably lead to a decline in the generalization of models. To solve this problem, we propose a multisource unsupervised domain adaptation method using rough adjoint inconsistency and optimal transport. METHODS First, we define the concept of rough adjoint inconsistency and propose a double quantization method based on rough adjoint inconsistency and Dempster-Shafer (D-S) evidence theory to estimate the weight coefficient of each source domain to accurately describe the importance of each source domain to the target domain. Second, using optimal transport theory, we weaken the data distribution differences between domains and solve the problem of class imbalance by adjusting the sampling weights among classes. RESULTS The ASD recognition accuracy of the proposed method is improved on all eight tasks, which are 70.67%, 64.86%, 62.50%, 70.80%, 73.08%, 71.19%, 75.41% and 75.76%, respectively. Our proposed model achieves superior performance compared to traditional machine learning methods and other recently proposed deep learning model. CONCLUSIONS Our method demonstrates that the fusion of rough adjoint inconsistency and optimal transport can be a powerful tool for identifying ASD and quantifying the correlations between domains.
Collapse
Affiliation(s)
- Chun-Lei Shi
- School of Artificial Intelligence, Beijing Normal University, Beijing, 100875, China
| | - Xian-Wei Xin
- School of Artificial Intelligence, Beijing Normal University, Beijing, 100875, China
| | - Jia-Cai Zhang
- School of Artificial Intelligence, Beijing Normal University, Beijing, 100875, China; Engineering Research Center of Intelligent Technology and Educational Application, Ministry of Education, Beijing, 100875, China.
| |
Collapse
|
76
|
Islam H, Mosa A. A Federated Mining Approach on Predicting Diabetes-Related Complications: Demonstration Using Real-World Clinical Data. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2022; 2021:556-564. [PMID: 35308968 PMCID: PMC8861723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Chronic diabetes can lead to microvascular complications, including diabetic eye disease, diabetic kidney disease, and diabetic neuropathy. However, the long-term complications often remain undetected at the early stages of diagnosis. Developing a machine learning model to identify the patients at high risk of developing diabetes-related complications can help design better treatment interventions. Building robust machine learning models require large datasets which further requires sharing data among different healthcare systems, hence, involving privacy and confidentiality concerns. The main objective of this study is to design a decentralized privacy-protected federated learning architecture that can deliver comparable performance to centralized learning. We demonstrate the potential of adopting federated learning to address the challenges such as class-imbalance in using real-world clinical data. In all our experiments, federated learning showed comparable performance to the gold-standard of centralized learning, and applying class balancing techniques improved performance across all cohorts.
Collapse
Affiliation(s)
- Humayera Islam
- Institute for Data Science and Informatics
- Center for Biomedical Informatics; University of Missouri School of Medicine, Columbia, Missouri
| | - Abu Mosa
- Institute for Data Science and Informatics
- Department of Health Management and Informatics
- Department of Electrical Engineering and Computer Science
- Center for Biomedical Informatics; University of Missouri School of Medicine, Columbia, Missouri
| |
Collapse
|
77
|
Abstract
The federated learning technique (FL) supports the collaborative training of machine learning and deep learning models for edge network optimization. Although a complex edge network with heterogeneous devices having different constraints can affect its performance, this leads to a problem in this area. Therefore, some research can be seen to design new frameworks and approaches to improve federated learning processes. The purpose of this study is to provide an overview of the FL technique and its applicability in different domains. The key focus of the paper is to produce a systematic literature review of recent research studies that clearly describes the adoption of FL in edge networks. The search procedure was performed from April 2020 to May 2021 with a total initial number of papers being 7546 published in the duration of 2016 to 2020. The systematic literature synthesizes and compares the algorithms, models, and frameworks of federated learning. Additionally, we have presented the scope of FL applications in different industries and domains. It has been revealed after careful investigation of studies that 25% of the studies used FL in IoT and edge-based applications and 30% of studies implement the FL concept in the health industry, 10% for NLP, 10% for autonomous vehicles, 10% for mobile services, 10% for recommender systems, and 5% for FinTech. A taxonomy is also proposed on implementing FL for edge networks in different domains. Moreover, another novelty of this paper is that datasets used for the implementation of FL are discussed in detail to provide the researchers an overview of the distributed datasets, which can be used for employing FL techniques. Lastly, this study discusses the current challenges of implementing the FL technique. We have found that the areas of medical AI, IoT, edge systems, and the autonomous industry can adapt the FL in many of its sub-domains; however, the challenges these domains can encounter are statistical heterogeneity, system heterogeneity, data imbalance, resource allocation, and privacy.
Collapse
|
78
|
Antunes RS, da Costa CA, Küderle A, Yari IA, Eskofier B. Federated Learning for Healthcare: Systematic Review and Architecture Proposal. ACM T INTEL SYST TEC 2022. [DOI: 10.1145/3501813] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The use of machine learning (ML) with electronic health records (EHR) is growing in popularity as a means to extract knowledge that can improve the decision-making process in healthcare. Such methods require training of high-quality learning models based on diverse and comprehensive datasets, which are hard to obtain due to the sensitive nature of medical data from patients. In this context, federated learning (FL) is a methodology that enables the distributed training of machine learning models with remotely hosted datasets without the need to accumulate data and, therefore, compromise it. FL is a promising solution to improve ML-based systems, better aligning them to regulatory requirements, improving trustworthiness and data sovereignty. However, many open questions must be addressed before the use of FL becomes widespread. This article aims at presenting a systematic literature review on current research about FL in the context of EHR data for healthcare applications. Our analysis highlights the main research topics, proposed solutions, case studies, and respective ML methods. Furthermore, the article discusses a general architecture for FL applied to healthcare data based on the main insights obtained from the literature review. The collected literature corpus indicates that there is extensive research on the privacy and confidentiality aspects of training data and model sharing, which is expected given the sensitive nature of medical data. Studies also explore improvements to the aggregation mechanisms required to generate the learning model from distributed contributions and case studies with different types of medical data.
Collapse
Affiliation(s)
| | | | | | | | - Björn Eskofier
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| |
Collapse
|
79
|
Adnan M, Kalra S, Cresswell JC, Taylor GW, Tizhoosh HR. Federated learning and differential privacy for medical image analysis. Sci Rep 2022; 12:1953. [PMID: 35121774 PMCID: PMC8816913 DOI: 10.1038/s41598-022-05539-7] [Citation(s) in RCA: 46] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 01/13/2022] [Indexed: 01/01/2023] Open
Abstract
The artificial intelligence revolution has been spurred forward by the availability of large-scale datasets. In contrast, the paucity of large-scale medical datasets hinders the application of machine learning in healthcare. The lack of publicly available multi-centric and diverse datasets mainly stems from confidentiality and privacy concerns around sharing medical data. To demonstrate a feasible path forward in medical image imaging, we conduct a case study of applying a differentially private federated learning framework for analysis of histopathology images, the largest and perhaps most complex medical images. We study the effects of IID and non-IID distributions along with the number of healthcare providers, i.e., hospitals and clinics, and the individual dataset sizes, using The Cancer Genome Atlas (TCGA) dataset, a public repository, to simulate a distributed environment. We empirically compare the performance of private, distributed training to conventional training and demonstrate that distributed training can achieve similar performance with strong privacy guarantees. We also study the effect of different source domains for histopathology images by evaluating the performance using external validation. Our work indicates that differentially private federated learning is a viable and reliable framework for the collaborative development of machine learning models in medical image analysis.
Collapse
Affiliation(s)
- Mohammed Adnan
- Kimia Lab, University of Waterloo, Waterloo, Canada.,Vector Institute, MaRS Discovery District, Toronto, Canada.,University of Guelph, Guelph, ON, Canada
| | - Shivam Kalra
- Kimia Lab, University of Waterloo, Waterloo, Canada.,Vector Institute, MaRS Discovery District, Toronto, Canada
| | | | - Graham W Taylor
- Vector Institute, MaRS Discovery District, Toronto, Canada.,University of Guelph, Guelph, ON, Canada
| | - Hamid R Tizhoosh
- Kimia Lab, University of Waterloo, Waterloo, Canada. .,Vector Institute, MaRS Discovery District, Toronto, Canada. .,Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
80
|
Lu MY, Chen RJ, Kong D, Lipkova J, Singh R, Williamson DFK, Chen TY, Mahmood F. Federated learning for computational pathology on gigapixel whole slide images. Med Image Anal 2022; 76:102298. [PMID: 34911013 PMCID: PMC9340569 DOI: 10.1016/j.media.2021.102298] [Citation(s) in RCA: 54] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 10/07/2021] [Accepted: 11/02/2021] [Indexed: 02/07/2023]
Abstract
Deep Learning-based computational pathology algorithms have demonstrated profound ability to excel in a wide array of tasks that range from characterization of well known morphological phenotypes to predicting non human-identifiable features from histology such as molecular alterations. However, the development of robust, adaptable and accurate deep learning-based models often rely on the collection and time-costly curation large high-quality annotated training data that should ideally come from diverse sources and patient populations to cater for the heterogeneity that exists in such datasets. Multi-centric and collaborative integration of medical data across multiple institutions can naturally help overcome this challenge and boost the model performance but is limited by privacy concerns among other difficulties that may arise in the complex data sharing process as models scale towards using hundreds of thousands of gigapixel whole slide images. In this paper, we introduce privacy-preserving federated learning for gigapixel whole slide images in computational pathology using weakly-supervised attention multiple instance learning and differential privacy. We evaluated our approach on two different diagnostic problems using thousands of histology whole slide images with only slide-level labels. Additionally, we present a weakly-supervised learning framework for survival prediction and patient stratification from whole slide images and demonstrate its effectiveness in a federated setting. Our results show that using federated learning, we can effectively develop accurate weakly-supervised deep learning models from distributed data silos without direct data sharing and its associated complexities, while also preserving differential privacy using randomized noise generation. We also make available an easy-to-use federated learning for computational pathology software package: http://github.com/mahmoodlab/HistoFL.
Collapse
Affiliation(s)
- Ming Y Lu
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Richard J Chen
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Dehan Kong
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States
| | - Jana Lipkova
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Rajendra Singh
- Department of Pathology, Northwell Health, NY, United States
| | - Drew F K Williamson
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Tiffany Y Chen
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, United States
| | - Faisal Mahmood
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, United States; Data Science Department, Dana-Farber/Harvard Cancer Center, Boston, MA, United States; Harvard Data Science Initiative, Harvard University, Cambridge, MA, United States.
| |
Collapse
|
81
|
Islam H, Alaboud K, Paul T, Rana MKZ, Mosa A. A Privacy-Preserved Transfer Learning Concept to Predict Diabetic Kidney Disease at Out-of-Network Siloed Sites Using an In-Network Federated Model on Real-World Data. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2022; 2022:264-273. [PMID: 35854714 PMCID: PMC9285167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]
Abstract
Successful implementation of data-driven artificial intelligence (AI) applications requires access to large datasets. Healthcare institutions can establish coordinated data-sharing networks to address the complexity of large clinical data accessibility for scientific advancements. However, persisting challenges from controlled access, safe data transferring, license restrictions from regulatory and legal concerns discourage data sharing among the in-network hospitals. In contrast, out-of-network healthcare institutions are deprived of access to any big EHR database; hence, limiting their research scope. The main objective of this study is to design a privacy-preserved transfer learning architecture that can utilize the knowledge from a federated model developed from in-network hospital-site EHR data for predicting diabetic kidney cases at out-of-network siloed hospital sites. In all our experiments, transfer learning showed improved performance compared to models trained with out-of-network site datasets. Thus, we demonstrate the proof-of-concept of transferring knowledge from established networks to aid data-driven AI discoveries at siloed sites.
Collapse
Affiliation(s)
- Humayera Islam
- Institute for Data Science and Informatics
- NextGen Biomedical Informatics Center; University of Missouri School of Medicine, Columbia, Missouri
| | - Khuder Alaboud
- Institute for Data Science and Informatics
- NextGen Biomedical Informatics Center; University of Missouri School of Medicine, Columbia, Missouri
| | - Tanmoy Paul
- Department of Electrical Engineering and Computer Science
- NextGen Biomedical Informatics Center; University of Missouri School of Medicine, Columbia, Missouri
| | - Md Kamruz Zaman Rana
- Institute for Data Science and Informatics
- NextGen Biomedical Informatics Center; University of Missouri School of Medicine, Columbia, Missouri
| | - Abu Mosa
- Corresponding Author.
- Institute for Data Science and Informatics
- Department of Health Management and Informatics
- Department of Electrical Engineering and Computer Science
- NextGen Biomedical Informatics Center; University of Missouri School of Medicine, Columbia, Missouri
| |
Collapse
|
82
|
A review of methods for classification and recognition of ASD using fMRI data. J Neurosci Methods 2021; 368:109456. [PMID: 34954253 DOI: 10.1016/j.jneumeth.2021.109456] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 12/17/2021] [Accepted: 12/18/2021] [Indexed: 02/06/2023]
Abstract
Autism spectrum disorder (ASD) is a severe neuropsychiatric brain disorder that affects people's social communication and daily routine. Considering the phenomenon of abnormal brain function in the early stage of ASD, functional magnetic resonance imaging (fMRI), an excellent technique that measures brain activity, provides effective data to study ASD. Therefore, based on fMRI data of ASD cases, this paper reviews the progress of machine learning methods and deep learning methods in ASD classification and recognition in the last three years and summarizes the different research results of fMRI data extracted from the Autism Brain Imaging Data Exchange (ABIDE). From the classification performance of classification and recognition of ASD by the two methods, comparing the important classification indicators such as accuracy, sensitivity and specificity, the current challenges and future development trends are reported, which can provide an essential reference for the early diagnosis of ASD cases.
Collapse
|
83
|
Abstract
The coronavirus of 2019 (COVID‐19) was declared a global pandemic by World Health Organization in March 2020. Effective testing is crucial to slow the spread of the pandemic. Artificial intelligence and machine learning techniques can help COVID‐19 detection using various clinical symptom data. While deep learning (DL) approach requiring centralized data is susceptible to a high risk of data privacy breaches, federated learning (FL) approach resting on decentralized data can preserve data privacy, a critical factor in the health domain. This paper reviews recent advances in applying DL and FL techniques for COVID‐19 detection with a focus on the latter. A model FL implementation use case in health systems with a COVID‐19 detection using chest X‐ray image data sets is studied. We have also reviewed applications of previously published FL experiments for COVID‐19 research to demonstrate the applicability of FL in tackling health research issues. Last, several challenges in FL implementation in the healthcare domain are discussed in terms of potential future work.
Collapse
Affiliation(s)
- Sadaf Naz
- Department of Computer Science and Information Technology, School of Engineering and Mathematical Sciences La Trobe University Bundoora Victoria Australia
| | - Khoa T. Phan
- Department of Computer Science and Information Technology, School of Engineering and Mathematical Sciences La Trobe University Bundoora Victoria Australia
| | - Yi‐Ping Phoebe Chen
- Department of Computer Science and Information Technology, School of Engineering and Mathematical Sciences La Trobe University Bundoora Victoria Australia
| |
Collapse
|
84
|
Federated Learning for Microvasculature Segmentation and Diabetic Retinopathy Classification of OCT Data. OPHTHALMOLOGY SCIENCE 2021; 1:100069. [PMID: 36246944 PMCID: PMC9559956 DOI: 10.1016/j.xops.2021.100069] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 09/01/2021] [Accepted: 09/28/2021] [Indexed: 12/26/2022]
Abstract
Purpose To evaluate the performance of a federated learning framework for deep neural network-based retinal microvasculature segmentation and referable diabetic retinopathy (RDR) classification using OCT and OCT angiography (OCTA). Design Retrospective analysis of clinical OCT and OCTA scans of control participants and patients with diabetes. Participants The 153 OCTA en face images used for microvasculature segmentation were acquired from 4 OCT instruments with fields of view ranging from 2 × 2-mm to 6 × 6-mm. The 700 eyes used for RDR classification consisted of OCTA en face images and structural OCT projections acquired from 2 commercial OCT systems. Methods OCT angiography images used for microvasculature segmentation were delineated manually and verified by retina experts. Diabetic retinopathy (DR) severity was evaluated by retinal specialists and was condensed into 2 classes: non-RDR and RDR. The federated learning configuration was demonstrated via simulation using 4 clients for microvasculature segmentation and was compared with other collaborative training methods. Subsequently, federated learning was applied over multiple institutions for RDR classification and was compared with models trained and tested on data from the same institution (internal models) and different institutions (external models). Main Outcome Measures For microvasculature segmentation, we measured the accuracy and Dice similarity coefficient (DSC). For severity classification, we measured accuracy, area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve, balanced accuracy, F1 score, sensitivity, and specificity. Results For both applications, federated learning achieved similar performance as internal models. Specifically, for microvasculature segmentation, the federated learning model achieved similar performance (mean DSC across all test sets, 0.793) as models trained on a fully centralized dataset (mean DSC, 0.807). For RDR classification, federated learning achieved a mean AUROC of 0.954 and 0.960; the internal models attained a mean AUROC of 0.956 and 0.973. Similar results are reflected in the other calculated evaluation metrics. Conclusions Federated learning showed similar results to traditional deep learning in both applications of segmentation and classification, while maintaining data privacy. Evaluation metrics highlight the potential of collaborative learning for increasing domain diversity and the generalizability of models used for the classification of OCT data.
Collapse
|
85
|
A Systematic Review of Federated Learning in the Healthcare Area: From the Perspective of Data Properties and Applications. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app112311191] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Recent advances in deep learning have shown many successful stories in smart healthcare applications with data-driven insight into improving clinical institutions’ quality of care. Excellent deep learning models are heavily data-driven. The more data trained, the more robust and more generalizable the performance of the deep learning model. However, pooling the medical data into centralized storage to train a robust deep learning model faces privacy, ownership, and strict regulation challenges. Federated learning resolves the previous challenges with a shared global deep learning model using a central aggregator server. At the same time, patient data remain with the local party, maintaining data anonymity and security. In this study, first, we provide a comprehensive, up-to-date review of research employing federated learning in healthcare applications. Second, we evaluate a set of recent challenges from a data-centric perspective in federated learning, such as data partitioning characteristics, data distributions, data protection mechanisms, and benchmark datasets. Finally, we point out several potential challenges and future research directions in healthcare applications.
Collapse
|
86
|
Huang YL, Yang HC, Lee CC. Federated Learning via Conditional Mutual Learning for Alzheimer's Disease Classification on T1w MRI. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:2427-2432. [PMID: 34891771 DOI: 10.1109/embc46164.2021.9630382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Data-driven deep learning has been considered a promising method for building powerful models for medical data, which often requires a large amount of diverse data to be sufficiently effective. However, the expensive cost of collecting and the privacy constraints lead to the fact that existing medical datasets are small-scale and distributed. Federated learning via model distillation is a data-private collaborative learning where the model can leverage all available data without direct sharing. The data knowledge is shared by distillation through the multi-site average prediction scores on the public dataset. However, the average consensus is suboptimal to individual client due to data domain shift in MRI data caused by acquisition protocols, recruitment criteria, etc. In this work, we propose a federated conditional mutual learning (FedCM) to improve the performance by considering the clients' local performance and the similarity between clients. This work is the first federated learning on multi-dataset Alzheimer's disease classification by 3DCNN using T1w MRI. Our method achieves the best recognition rates comparing with FedMD and other frameworks. Further visualization and relevance ranking on the region of interests (ROI) in human brains implies that the left hemisphere may have greater relevance than the right hemisphere does. Several potential regions are listed for future investigation.
Collapse
|
87
|
Desmet C, Cook DJ. Recent Developments in Privacy-Preserving Mining of Clinical Data. ACM/IMS TRANSACTIONS ON DATA SCIENCE 2021; 2:28. [PMID: 35018368 PMCID: PMC8746818 DOI: 10.1145/3447774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 01/01/2021] [Indexed: 06/14/2023]
Abstract
With the dramatic increases in both the capability to collect personal data and the capability to analyze large amounts of data, increasingly sophisticated and personal insights are being drawn. These insights are valuable for clinical applications but also open up possibilities for identification and abuse of personal information. In this paper, we survey recent research on classical methods of privacy-preserving data mining. Looking at dominant techniques and recent innovations to them, we examine the applicability of these methods to the privacy-preserving analysis of clinical data. We also discuss promising directions for future research in this area.
Collapse
|
88
|
Yang C, Wang P, Tan J, Liu Q, Li X. Autism spectrum disorder diagnosis using graph attention network based on spatial-constrained sparse functional brain networks. Comput Biol Med 2021; 139:104963. [PMID: 34700253 DOI: 10.1016/j.compbiomed.2021.104963] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 10/13/2021] [Accepted: 10/17/2021] [Indexed: 11/16/2022]
Abstract
The accurate diagnosis of autism spectrum disorder (ASD), a common mental disease in children, has always been an important task in clinical practice. In recent years, the use of graph neural network (GNN) based on functional brain network (FBN) has shown powerful performance for disease diagnosis. The challenge to construct "ideal" FBN from resting-state fMRI data remained. Moreover, it remains unclear whether and to what extent the non-Euclidean structure of different FBNs affect the performance of GNN-based disease classification. In this paper, we proposed a new method named Pearson's correlation-based Spatial Constraints Representation (PSCR) to estimate the FBN structures that were transformed to brain graphs and then fed into a graph attention network (GAT) to diagnose ASD. Extensive experiments on comparing different FBN construction methods and classification frameworks were conducted on the ABIDE I dataset (n = 871). The results demonstrated the superiority of our PSCR method and the influence of different FBNs on the GNN-based classification results. The proposed PSCR and GAT framework achieved promising classification results for ASD (accuracy: 72.40%), which significantly outperformed competing methods. This will help facilitate patient-control separation, and provide a promising solution for future disease diagnosis based on the FBN and GNN framework.
Collapse
Affiliation(s)
- Chunde Yang
- School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, China; School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Panyu Wang
- School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Jia Tan
- School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Qingshui Liu
- School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Xinwei Li
- School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, China.
| |
Collapse
|
89
|
Dayan I, Roth HR, Zhong A, Harouni A, Gentili A, Abidin AZ, Liu A, Costa AB, Wood BJ, Tsai CS, Wang CH, Hsu CN, Lee CK, Ruan P, Xu D, Wu D, Huang E, Kitamura FC, Lacey G, de Antônio Corradi GC, Nino G, Shin HH, Obinata H, Ren H, Crane JC, Tetreault J, Guan J, Garrett JW, Kaggie JD, Park JG, Dreyer K, Juluru K, Kersten K, Rockenbach MABC, Linguraru MG, Haider MA, AbdelMaseeh M, Rieke N, Damasceno PF, E Silva PMC, Wang P, Xu S, Kawano S, Sriswasdi S, Park SY, Grist TM, Buch V, Jantarabenjakul W, Wang W, Tak WY, Li X, Lin X, Kwon YJ, Quraini A, Feng A, Priest AN, Turkbey B, Glicksberg B, Bizzo B, Kim BS, Tor-Díez C, Lee CC, Hsu CJ, Lin C, Lai CL, Hess CP, Compas C, Bhatia D, Oermann EK, Leibovitz E, Sasaki H, Mori H, Yang I, Sohn JH, Murthy KNK, Fu LC, de Mendonça MRF, Fralick M, Kang MK, Adil M, Gangai N, Vateekul P, Elnajjar P, Hickman S, Majumdar S, McLeod SL, Reed S, Gräf S, Harmon S, Kodama T, Puthanakit T, Mazzulli T, de Lavor VL, Rakvongthai Y, Lee YR, Wen Y, Gilbert FJ, Flores MG, Li Q. Federated learning for predicting clinical outcomes in patients with COVID-19. Nat Med 2021; 27:1735-1743. [PMID: 34526699 PMCID: PMC9157510 DOI: 10.1038/s41591-021-01506-3] [Citation(s) in RCA: 166] [Impact Index Per Article: 55.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 08/13/2021] [Indexed: 02/08/2023]
Abstract
Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC) >0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site's data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare.
Collapse
Affiliation(s)
- Ittai Dayan
- MGH Radiology and Harvard Medical School, Boston, MA, USA
| | | | - Aoxiao Zhong
- Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- School of Engineering and Applied Sciences, Harvard University, Boston, MA, USA
| | | | | | | | | | | | - Bradford J Wood
- Radiology & Imaging Sciences/Clinical Center, National Institutes of Health, Bethesda, MD, USA
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Chien-Sung Tsai
- Division of Cardiovascular Surgery, Department of Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Chih-Hung Wang
- Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
- Graduate Institute of Medical Sciences, National Defense Medical Center, Taipei, Taiwan
| | - Chun-Nan Hsu
- Center for Research in Biological Systems, University of California, San Diego, CA, USA
| | - C K Lee
- NVIDIA, Santa Clara, CA, USA
| | | | | | - Dufan Wu
- Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | | | | | | | | | - Gustavo Nino
- Division of Pediatric Pulmonary and Sleep Medicine, Children's National Hospital, Washington, DC, USA
| | - Hao-Hsin Shin
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | | | - Hui Ren
- Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Jason C Crane
- Center for Intelligent Imaging, Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, USA
| | | | | | - John W Garrett
- Departments of Radiology and Medical Physics, The University of Wisconsin-Madison School of Medicine and Public Health, Madison, WI, USA
| | - Joshua D Kaggie
- Department of Radiology, NIHR Cambridge Biomedical Resource Centre, University of Cambridge, Cambridge, UK
| | - Jung Gil Park
- Department of Internal Medicine, Yeungnam University College of Medicine, Daegu, South Korea
| | - Keith Dreyer
- MGH Radiology and Harvard Medical School, Boston, MA, USA
- Center for Clinical Data Science, Massachusetts General Brigham, Boston, MA, USA
| | - Krishna Juluru
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | | | | | - Marius George Linguraru
- Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC, USA
- Departments of Radiology and Pediatrics, The George Washington University School of Medicine and Health Sciences, Washington, DC, USA
| | - Masoom A Haider
- Joint Dept. of Medical Imaging, Sinai Health System, University of Toronto, Toronto, Ontario, Canada
- Lunenfeld-Tanenbaum Research Institute, Toronto, Ontario, Canada
| | | | | | - Pablo F Damasceno
- Center for Intelligent Imaging, Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, USA
| | | | - Pochuan Wang
- MeDA Lab Institute of Applied Mathematical Sciences, National Taiwan University, Taipei, Taiwan
- Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
| | - Sheng Xu
- Radiology & Imaging Sciences/Clinical Center, National Institutes of Health, Bethesda, MD, USA
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Sira Sriswasdi
- Research Affairs, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Center for Artificial Intelligence in Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Soo Young Park
- Department of Internal Medicine, School of Medicine, Kyungpook National University, Daegu, South Korea
| | - Thomas M Grist
- Departments of Radiology, Medical Physics, and Biomedical Engineering, The University of Wisconsin-Madison School of Medicine and Public Health, Madison, WI, USA
| | - Varun Buch
- Center for Clinical Data Science, Massachusetts General Brigham, Boston, MA, USA
| | - Watsamon Jantarabenjakul
- Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Thai Red Cross Emerging Infectious Diseases Clinical Center, King Chulalongkorn Memorial Hospital, Bangkok, Thailand
| | - Weichung Wang
- MeDA Lab Institute of Applied Mathematical Sciences, National Taiwan University, Taipei, Taiwan
- Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
| | - Won Young Tak
- Department of Internal Medicine, School of Medicine, Kyungpook National University, Daegu, South Korea
| | - Xiang Li
- Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Xihong Lin
- Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| | - Young Joon Kwon
- Department of Neurosurgery, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | | | - Andrew N Priest
- Department of Radiology, NIHR Cambridge Biomedical Resource Centre, Cambridge University Hospital, Cambridge, UK
| | - Baris Turkbey
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Department of Radiology and Imaging Sciences, National Institutes of Health, Bethesda, MD, USA
| | - Benjamin Glicksberg
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Bernardo Bizzo
- Center for Clinical Data Science, Massachusetts General Brigham, Boston, MA, USA
| | - Byung Seok Kim
- Department of Internal Medicine, Catholic University of Daegu School of Medicine, Daegu, South Korea
| | - Carlos Tor-Díez
- Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC, USA
| | - Chia-Cheng Lee
- Planning and Management Office, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Chia-Jung Hsu
- Planning and Management Office, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Chin Lin
- School of Medicine, National Defense Medical Center, Taipei, Taiwan
- School of Public Health, National Defense Medical Center, Taipei, Taiwan
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan
| | - Chiu-Ling Lai
- Medical Review and Pharmaceutical Benefits Division, National Health Insurance Administration, Taipei, Taiwan
| | - Christopher P Hess
- Center for Intelligent Imaging, Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, USA
| | | | | | - Eric K Oermann
- Department of Neurosurgery, NYU Grossman School of Medicine, New York, NY, USA
| | - Evan Leibovitz
- Center for Clinical Data Science, Massachusetts General Brigham, Boston, MA, USA
| | | | - Hitoshi Mori
- Self-Defense Forces Central Hospital, Tokyo, Japan
| | | | - Jae Ho Sohn
- Center for Intelligent Imaging, Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, USA
| | | | - Li-Chen Fu
- MOST/NTU All Vista Healthcare Center, Center for Artificial Intelligence and Advanced Robotics, National Taiwan University, Taipei, Taiwan
| | | | - Mike Fralick
- Division of General Internal Medicine and Geriatrics (Fralick), Sinai Health System, Toronto, Ontario, Canada
| | - Min Kyu Kang
- Department of Internal Medicine, Yeungnam University College of Medicine, Daegu, South Korea
| | | | - Natalie Gangai
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Peerapon Vateekul
- Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, Thailand
| | | | - Sarah Hickman
- Department of Radiology, NIHR Cambridge Biomedical Resource Centre, University of Cambridge, Cambridge, UK
| | - Sharmila Majumdar
- Center for Intelligent Imaging, Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, USA
| | - Shelley L McLeod
- Schwartz/Reisman Emergency Medicine Institute, Sinai Health, Toronto, Ontario, Canada
- Department of Family and Community Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Sheridan Reed
- Radiology & Imaging Sciences/Clinical Center, National Institutes of Health, Bethesda, MD, USA
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Stefan Gräf
- Department of Medicine and NIHR BioResource for Translational Research, NIHR Cambridge Biomedical Research Centre, University of Cambridge, Cambridge, UK
| | - Stephanie Harmon
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Clinical Research Directorate, Frederick National Laboratory for Cancer, National Cancer Institute, Frederick, MD, USA
| | | | - Thanyawee Puthanakit
- Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Thai Red Cross Emerging Infectious Diseases Clinical Center, King Chulalongkorn Memorial Hospital, Bangkok, Thailand
| | - Tony Mazzulli
- Department of Microbiology, Sinai Health/University Health Network, Toronto, Ontario, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada
- Public Health Ontario Laboratories, Toronto, Ontario, Canada
| | | | - Yothin Rakvongthai
- Chulalongkorn University Biomedical Imaging Group and Division of Nuclear Medicine, Department of Radiology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Yu Rim Lee
- Department of Internal Medicine, School of Medicine, Kyungpook National University, Daegu, South Korea
| | | | - Fiona J Gilbert
- Department of Radiology, NIHR Cambridge Biomedical Resource Centre, University of Cambridge, Cambridge, UK
| | | | - Quanzheng Li
- Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
90
|
Ficek J, Wang W, Chen H, Dagne G, Daley E. Differential privacy in health research: A scoping review. J Am Med Inform Assoc 2021; 28:2269-2276. [PMID: 34333623 DOI: 10.1093/jamia/ocab135] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 06/11/2021] [Accepted: 06/16/2021] [Indexed: 11/14/2022] Open
Abstract
OBJECTIVE Differential privacy is a relatively new method for data privacy that has seen growing use due its strong protections that rely on added noise. This study assesses the extent of its awareness, development, and usage in health research. MATERIALS AND METHODS A scoping review was conducted by searching for ["differential privacy" AND "health"] in major health science databases, with additional articles obtained via expert consultation. Relevant articles were classified according to subject area and focus. RESULTS A total of 54 articles met the inclusion criteria. Nine articles provided descriptive overviews, 31 focused on algorithm development, 9 presented novel data sharing systems, and 8 discussed appraisals of the privacy-utility tradeoff. The most common areas of health research where differential privacy has been discussed are genomics, neuroimaging studies, and health surveillance with personal devices. Algorithms were most commonly developed for the purposes of data release and predictive modeling. Studies on privacy-utility appraisals have considered economic cost-benefit analysis, low-utility situations, personal attitudes toward sharing health data, and mathematical interpretations of privacy risk. DISCUSSION Differential privacy remains at an early stage of development for applications in health research, and accounts of real-world implementations are scant. There are few algorithms for explanatory modeling and statistical inference, particularly with correlated data. Furthermore, diminished accuracy in small datasets is problematic. Some encouraging work has been done on decision making with regard to epsilon. The dissemination of future case studies can inform successful appraisals of privacy and utility. CONCLUSIONS More development, case studies, and evaluations are needed before differential privacy can see widespread use in health research.
Collapse
Affiliation(s)
- Joseph Ficek
- College of Public Health, University of South Florida, Tampa, Florida, USA
| | - Wei Wang
- Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Henian Chen
- College of Public Health, University of South Florida, Tampa, Florida, USA
| | - Getachew Dagne
- College of Public Health, University of South Florida, Tampa, Florida, USA
| | - Ellen Daley
- College of Public Health, University of South Florida, Tampa, Florida, USA
| |
Collapse
|
91
|
Oestreich M, Chen D, Schultze JL, Fritz M, Becker M. Privacy considerations for sharing genomics data. EXCLI JOURNAL 2021; 20:1243-1260. [PMID: 34345236 PMCID: PMC8326502 DOI: 10.17179/excli2021-4002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Accepted: 07/07/2021] [Indexed: 01/23/2023]
Abstract
An increasing amount of attention has been geared towards understanding the privacy risks that arise from sharing genomic data of human origin. Most of these efforts have focused on issues in the context of genomic sequence data, but the popularity of techniques for collecting other types of genome-related data has prompted researchers to investigate privacy concerns in a broader genomic context. In this review, we give an overview of different types of genome-associated data, their individual ways of revealing sensitive information, the motivation to share them as well as established and upcoming methods to minimize information leakage. We further discuss the concise threats that are being posed, who is at risk, and how the risk level compares to potential benefits, all while addressing the topic in the context of modern technology, methodology, and information sharing culture. Additionally, we will discuss the current legal situation regarding the sharing of genomic data in a selection of countries, evaluating the scope of their applicability as well as their limitations. We will finalize this review by evaluating the development that is required in the scientific field in the near future in order to improve and develop privacy-preserving data sharing techniques for the genomic context.
Collapse
Affiliation(s)
- Marie Oestreich
- Systems Medicine, Deutsches Zentrum für Neurodegenerative Erkrankungen (DZNE), Venusberg-Campus 1/99, 53127 Bonn, Germany
| | - Dingfan Chen
- CISPA Helmholtz Center for Information Security, Saarbrücken, Germany, Stuhlsatzenhaus 5, 66123 Saarbrücken, Germany
| | - Joachim L. Schultze
- Systems Medicine, Deutsches Zentrum für Neurodegenerative Erkrankungen (DZNE), Venusberg-Campus 1/99, 53127 Bonn, Germany
- Genomics and Immunoregulation, Life & Medical Sciences (LIMES) Institute, University of Bonn, Bonn, Germany, Carl-Troll-Straße 31, 53115 Bonn, Germany
- PRECISE Platform for Single Cell Genomics and Epigenomics at Deutsches Zentrum für Neurodegenerative Erkrankungen (DZNE) and the University of Bonn, Germany, Venusberg-Campus 1/99, 53127 Bonn, Germany
| | - Mario Fritz
- CISPA Helmholtz Center for Information Security, Saarbrücken, Germany, Stuhlsatzenhaus 5, 66123 Saarbrücken, Germany
| | - Matthias Becker
- Systems Medicine, Deutsches Zentrum für Neurodegenerative Erkrankungen (DZNE), Venusberg-Campus 1/99, 53127 Bonn, Germany
| |
Collapse
|
92
|
Guo P, Wang P, Zhou J, Jiang S, Patel VM. Multi-institutional Collaborations for Improving Deep Learning-based Magnetic Resonance Image Reconstruction Using Federated Learning. PROCEEDINGS. IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION 2021; 2021:2423-2432. [PMID: 35444379 DOI: 10.1109/cvpr46437.2021.00245] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Fast and accurate reconstruction of magnetic resonance (MR) images from under-sampled data is important in many clinical applications. In recent years, deep learning-based methods have been shown to produce superior performance on MR image reconstruction. However, these methods require large amounts of data which is difficult to collect and share due to the high cost of acquisition and medical data privacy regulations. In order to overcome this challenge, we propose a federated learning (FL) based solution in which we take advantage of the MR data available at different institutions while preserving patients' privacy. However, the generalizability of models trained with the FL setting can still be suboptimal due to domain shift, which results from the data collected at multiple institutions with different sensors, disease types, and acquisition protocols, etc. With the motivation of circumventing this challenge, we propose a cross-site modeling for MR image reconstruction in which the learned intermediate latent features among different source sites are aligned with the distribution of the latent features at the target site. Extensive experiments are conducted to provide various insights about FL for MR image reconstruction. Experimental results demonstrate that the proposed framework is a promising direction to utilize multi-institutional data without compromising patients' privacy for achieving improved MR image reconstruction. Our code is available at https://github.com/guopengf/FL-MRCM.
Collapse
|
93
|
Kaissis G, Ziller A, Passerat-Palmbach J, Ryffel T, Usynin D, Trask A, Lima I, Mancuso J, Jungmann F, Steinborn MM, Saleh A, Makowski M, Rueckert D, Braren R. End-to-end privacy preserving deep learning on multi-institutional medical imaging. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00337-8] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
94
|
Zhou SK, Greenspan H, Davatzikos C, Duncan JS, van Ginneken B, Madabhushi A, Prince JL, Rueckert D, Summers RM. A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises. PROCEEDINGS OF THE IEEE. INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS 2021; 109:820-838. [PMID: 37786449 PMCID: PMC10544772 DOI: 10.1109/jproc.2021.3054390] [Citation(s) in RCA: 176] [Impact Index Per Article: 58.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/04/2023]
Abstract
Since its renaissance, deep learning has been widely used in various medical imaging tasks and has achieved remarkable success in many medical imaging applications, thereby propelling us into the so-called artificial intelligence (AI) era. It is known that the success of AI is mostly attributed to the availability of big data with annotations for a single task and the advances in high performance computing. However, medical imaging presents unique challenges that confront deep learning approaches. In this survey paper, we first present traits of medical imaging, highlight both clinical needs and technical challenges in medical imaging, and describe how emerging trends in deep learning are addressing these issues. We cover the topics of network architecture, sparse and noisy labels, federating learning, interpretability, uncertainty quantification, etc. Then, we present several case studies that are commonly found in clinical practice, including digital pathology and chest, brain, cardiovascular, and abdominal imaging. Rather than presenting an exhaustive literature survey, we instead describe some prominent research highlights related to these case study applications. We conclude with a discussion and presentation of promising future directions.
Collapse
Affiliation(s)
- S Kevin Zhou
- School of Biomedical Engineering, University of Science and Technology of China and Institute of Computing Technology, Chinese Academy of Sciences
| | - Hayit Greenspan
- Biomedical Engineering Department, Tel-Aviv University, Israel
| | - Christos Davatzikos
- Radiology Department and Electrical and Systems Engineering Department, University of Pennsylvania, USA
| | - James S Duncan
- Departments of Biomedical Engineering and Radiology & Biomedical Imaging, Yale University
| | | | - Anant Madabhushi
- Department of Biomedical Engineering, Case Western Reserve University and Louis Stokes Cleveland Veterans Administration Medical Center, USA
| | - Jerry L Prince
- Electrical and Computer Engineering Department, Johns Hopkins University, USA
| | - Daniel Rueckert
- Klinikum rechts der Isar, TU Munich, Germany and Department of Computing, Imperial College, UK
| | | |
Collapse
|
95
|
Beauvais MJS, Knoppers BM, Illes J. A marathon, not a sprint - neuroimaging, Open Science and ethics. Neuroimage 2021; 236:118041. [PMID: 33848622 DOI: 10.1016/j.neuroimage.2021.118041] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 03/10/2021] [Accepted: 03/31/2021] [Indexed: 01/10/2023] Open
Abstract
Open Science is calling for a radical re-thinking of existing scientific practices. Within the neuroimaging community, Open Science practices are taking the form of open data repositories and open lab notebooks. The broad sharing of data that accompanies Open Science, however, raises some difficult ethical and legal issues. With neuroethics as a focusing lens, we explore eight central concerns posed by open data with regard to human brain imaging studies: respect for individuals and communities, concern for marginalized communities, consent, privacy protections, participatory research designs, contextual integrity, fusions of clinical and research goals, and incidental findings. Each consideration assists in bringing nuance to the potential benefits for open data sharing against associated challenges. We combine current understandings with forward-looking solutions to key issues. We conclude by underscoring the need for new policy tools to enhance the potential for responsible open data.
Collapse
Affiliation(s)
| | | | - Judy Illes
- Neuroethics Canada, Division of Neurology, Department of Medicine, University of British Columbia, Canada.
| |
Collapse
|
96
|
A Deep Learning Approach to Predict Autism Spectrum Disorder Using Multisite Resting-State fMRI. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11083636] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Autism spectrum disorder (ASD) is a complex and degenerative neuro-developmental disorder. Most of the existing methods utilize functional magnetic resonance imaging (fMRI) to detect ASD with a very limited dataset which provides high accuracy but results in poor generalization. To overcome this limitation and to enhance the performance of the automated autism diagnosis model, in this paper, we propose an ASD detection model using functional connectivity features of resting-state fMRI data. Our proposed model utilizes two commonly used brain atlases, Craddock 200 (CC200) and Automated Anatomical Labelling (AAL), and two rarely used atlases Bootstrap Analysis of Stable Clusters (BASC) and Power. A deep neural network (DNN) classifier is used to perform the classification task. Simulation results indicate that the proposed model outperforms state-of-the-art methods in terms of accuracy. The mean accuracy of the proposed model was 88%, whereas the mean accuracy of the state-of-the-art methods ranged from 67% to 85%. The sensitivity, F1-score, and area under receiver operating characteristic curve (AUC) score of the proposed model were 90%, 87%, and 96%, respectively. Comparative analysis on various scoring strategies show the superiority of BASC atlas over other aforementioned atlases in classifying ASD and control.
Collapse
|
97
|
Kirienko M, Sollini M, Ninatti G, Loiacono D, Giacomello E, Gozzi N, Amigoni F, Mainardi L, Lanzi PL, Chiti A. Distributed learning: a reliable privacy-preserving strategy to change multicenter collaborations using AI. Eur J Nucl Med Mol Imaging 2021; 48:3791-3804. [PMID: 33847779 PMCID: PMC8041944 DOI: 10.1007/s00259-021-05339-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 03/24/2021] [Indexed: 12/12/2022]
Abstract
Purpose The present scoping review aims to assess the non-inferiority of distributed learning over centrally and locally trained machine learning (ML) models in medical applications. Methods We performed a literature search using the term “distributed learning” OR “federated learning” in the PubMed/MEDLINE and EMBASE databases. No start date limit was used, and the search was extended until July 21, 2020. We excluded articles outside the field of interest; guidelines or expert opinion, review articles and meta-analyses, editorials, letters or commentaries, and conference abstracts; articles not in the English language; and studies not using medical data. Selected studies were classified and analysed according to their aim(s). Results We included 26 papers aimed at predicting one or more outcomes: namely risk, diagnosis, prognosis, and treatment side effect/adverse drug reaction. Distributed learning was compared to centralized or localized training in 21/26 and 14/26 selected papers, respectively. Regardless of the aim, the type of input, the method, and the classifier, distributed learning performed close to centralized training, but two experiments focused on diagnosis. In all but 2 cases, distributed learning outperformed locally trained models. Conclusion Distributed learning resulted in a reliable strategy for model development; indeed, it performed equally to models trained on centralized datasets. Sensitive data can get preserved since they are not shared for model development. Distributed learning constitutes a promising solution for ML-based research and practice since large, diverse datasets are crucial for success.
Collapse
Affiliation(s)
- Margarita Kirienko
- Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.,Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | - Martina Sollini
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy. .,IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy.
| | - Gaia Ninatti
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | | | | | - Noemi Gozzi
- IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | | | | | | | - Arturo Chiti
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy.,IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| |
Collapse
|
98
|
Valverde JM, Imani V, Abdollahzadeh A, De Feo R, Prakash M, Ciszek R, Tohka J. Transfer Learning in Magnetic Resonance Brain Imaging: A Systematic Review. J Imaging 2021; 7:66. [PMID: 34460516 PMCID: PMC8321322 DOI: 10.3390/jimaging7040066] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Revised: 03/26/2021] [Accepted: 03/29/2021] [Indexed: 11/25/2022] Open
Abstract
(1) Background: Transfer learning refers to machine learning techniques that focus on acquiring knowledge from related tasks to improve generalization in the tasks of interest. In magnetic resonance imaging (MRI), transfer learning is important for developing strategies that address the variation in MR images from different imaging protocols or scanners. Additionally, transfer learning is beneficial for reutilizing machine learning models that were trained to solve different (but related) tasks to the task of interest. The aim of this review is to identify research directions, gaps in knowledge, applications, and widely used strategies among the transfer learning approaches applied in MR brain imaging; (2) Methods: We performed a systematic literature search for articles that applied transfer learning to MR brain imaging tasks. We screened 433 studies for their relevance, and we categorized and extracted relevant information, including task type, application, availability of labels, and machine learning methods. Furthermore, we closely examined brain MRI-specific transfer learning approaches and other methods that tackled issues relevant to medical imaging, including privacy, unseen target domains, and unlabeled data; (3) Results: We found 129 articles that applied transfer learning to MR brain imaging tasks. The most frequent applications were dementia-related classification tasks and brain tumor segmentation. The majority of articles utilized transfer learning techniques based on convolutional neural networks (CNNs). Only a few approaches utilized clearly brain MRI-specific methodology, and considered privacy issues, unseen target domains, or unlabeled data. We proposed a new categorization to group specific, widely-used approaches such as pretraining and fine-tuning CNNs; (4) Discussion: There is increasing interest in transfer learning for brain MRI. Well-known public datasets have clearly contributed to the popularity of Alzheimer's diagnostics/prognostics and tumor segmentation as applications. Likewise, the availability of pretrained CNNs has promoted their utilization. Finally, the majority of the surveyed studies did not examine in detail the interpretation of their strategies after applying transfer learning, and did not compare their approach with other transfer learning approaches.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Jussi Tohka
- A.I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, 70150 Kuopio, Finland; (J.M.V.); (V.I.); (A.A.); (R.D.F.); (M.P.); (R.C.)
| |
Collapse
|
99
|
Olsen A, Babikian T, Bigler ED, Caeyenberghs K, Conde V, Dams-O'Connor K, Dobryakova E, Genova H, Grafman J, Håberg AK, Heggland I, Hellstrøm T, Hodges CB, Irimia A, Jha RM, Johnson PK, Koliatsos VE, Levin H, Li LM, Lindsey HM, Livny A, Løvstad M, Medaglia J, Menon DK, Mondello S, Monti MM, Newcombe VFJ, Petroni A, Ponsford J, Sharp D, Spitz G, Westlye LT, Thompson PM, Dennis EL, Tate DF, Wilde EA, Hillary FG. Toward a global and reproducible science for brain imaging in neurotrauma: the ENIGMA adult moderate/severe traumatic brain injury working group. Brain Imaging Behav 2021; 15:526-554. [PMID: 32797398 PMCID: PMC8032647 DOI: 10.1007/s11682-020-00313-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The global burden of mortality and morbidity caused by traumatic brain injury (TBI) is significant, and the heterogeneity of TBI patients and the relatively small sample sizes of most current neuroimaging studies is a major challenge for scientific advances and clinical translation. The ENIGMA (Enhancing NeuroImaging Genetics through Meta-Analysis) Adult moderate/severe TBI (AMS-TBI) working group aims to be a driving force for new discoveries in AMS-TBI by providing researchers world-wide with an effective framework and platform for large-scale cross-border collaboration and data sharing. Based on the principles of transparency, rigor, reproducibility and collaboration, we will facilitate the development and dissemination of multiscale and big data analysis pipelines for harmonized analyses in AMS-TBI using structural and functional neuroimaging in combination with non-imaging biomarkers, genetics, as well as clinical and behavioral measures. Ultimately, we will offer investigators an unprecedented opportunity to test important hypotheses about recovery and morbidity in AMS-TBI by taking advantage of our robust methods for large-scale neuroimaging data analysis. In this consensus statement we outline the working group's short-term, intermediate, and long-term goals.
Collapse
Affiliation(s)
- Alexander Olsen
- Department of Psychology, Norwegian University of Science and Technology, 7491, Trondheim, Norway.
- Department of Physical Medicine and Rehabilitation, St. Olavs Hospital, Trondheim University Hospital, Trondheim, Norway.
| | - Talin Babikian
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute for Neuroscience and Human Behavior, UCLA, Los Angeles, CA, USA
- UCLA Steve Tisch BrainSPORT Program, Los Angeles, CA, USA
| | - Erin D Bigler
- Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA
- Department of Psychology and Neuroscience Center, Brigham Young University, Provo, UT, USA
| | - Karen Caeyenberghs
- Cognitive Neuroscience Unit, School of Psychology, Deakin University, Burwood, Australia
| | - Virginia Conde
- Department of Psychology, Norwegian University of Science and Technology, 7491, Trondheim, Norway
| | - Kristen Dams-O'Connor
- Department of Rehabilitation Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ekaterina Dobryakova
- Center for Traumatic Brain Injury, Kessler Foundation, East Hanover, NJ, USA
- Rutgers New Jersey Medical School, Newark, NJ, USA
| | - Helen Genova
- Center for Traumatic Brain Injury, Kessler Foundation, East Hanover, NJ, USA
| | - Jordan Grafman
- Cognitive Neuroscience Laboratory, Shirley Ryan AbilityLab, Chicago, IL, USA
- Department of Physical Medicine & Rehabilitation, Neurology, Department of Psychiatry & Department of Psychology, Cognitive Neurology and Alzheimer's, Center, Feinberg School of Medicine, Weinberg, Chicago, IL, USA
| | - Asta K Håberg
- Department of Neuromedicine and Movement Science, Norwegian University of Science and Technology, Trondheim, Norway
- Department of Radiology and Nuclear Medicine, St. Olavs Hopsital, Trondheim University Hospital, Trondheim, Norway
| | - Ingrid Heggland
- Section for Collections and Digital Services, NTNU University Library, Norwegian University of Science and Technology, Trondheim, Norway
| | - Torgeir Hellstrøm
- Department of Physical Medicine and Rehabilitation, Oslo University Hospital, Oslo, Norway
| | - Cooper B Hodges
- Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA
- Department of Psychology, Brigham Young University, Provo, UT, USA
- George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT, USA
| | - Andrei Irimia
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA, USA
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Ruchira M Jha
- Departments of Critical Care Medicine, Neurology, Neurological Surgery, University of Pittsburgh, Pittsburgh, PA, USA
- Safar Center for Resuscitation Research, Pittsburgh, PA, USA
- Clinical and Translational Science Institute, Pittsburgh, PA, USA
| | - Paula K Johnson
- Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA
- Neuroscience Center, Brigham Young University, Provo, UT, USA
| | - Vassilis E Koliatsos
- Departments of Pathology(Neuropathology), Neurology, and Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Neuropsychiatry Program, Sheppard and Enoch Pratt Hospital, Baltimore, MD, USA
| | - Harvey Levin
- H. Ben Taub Department of Physical Medicine and Rehabilitation, Baylor College of Medicine, Houston, TX, USA
- Michael E. DeBakey Veterans Affairs Medical Center, Houston, TX, USA
| | - Lucia M Li
- C3NL, Imperial College London, London, UK
- UK DRI Centre for Health Care and Technology, Imperial College London, London, UK
| | - Hannah M Lindsey
- Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA
- Department of Psychology, Brigham Young University, Provo, UT, USA
- George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT, USA
| | - Abigail Livny
- Department of Diagnostic Imaging, Sheba Medical Center, Tel-Hashomer, Ramat Gan, Israel
- Joseph Sagol Neuroscience Center, Sheba Medical Center, Tel-Hashomer, Ramat Gan, Israel
| | - Marianne Løvstad
- Sunnaas Rehabilitation Hospital, Nesodden, Norway
- Department of Psychology, University of Oslo, Oslo, Norway
| | - John Medaglia
- Department of Psychology, Drexel University, Philadelphia, PA, USA
- Department of Neurology, Drexel University, Philadelphia, PA, USA
| | - David K Menon
- Division of Anaesthesia, University of Cambridge, Cambridge, UK
| | - Stefania Mondello
- Department of Biomedical and Dental Sciences and Morphofunctional Imaging, University of Messina, Messina, Italy
| | - Martin M Monti
- Department of Psychology, University of California Los Angeles, Los Angeles, CA, USA
- Department of Neurosurgery, Brain Injury Research Center (BIRC), UCLA, Los Angeles, CA, USA
| | | | - Agustin Petroni
- Department of Psychology, Norwegian University of Science and Technology, 7491, Trondheim, Norway
- Department of Computer Science, Faculty of Exact & Natural Sciences, University of Buenos Aires, Buenos Aires, Argentina
- National Scientific & Technical Research Council, Institute of Research in Computer Science, Buenos Aires, Argentina
| | - Jennie Ponsford
- Turner Institute for Brain and Mental Health, School of Psychological Sciences, Monash University, Melbourne, Australia
- Monash Epworth Rehabilitation Research Centre, Epworth Healthcare, Melbourne, Australia
| | - David Sharp
- Department of Brain Sciences, Imperial College London, London, UK
- Care Research & Technology Centre, UK Dementia Research Institute, London, UK
| | - Gershon Spitz
- Turner Institute for Brain and Mental Health, School of Psychological Sciences, Monash University, Melbourne, Australia
| | - Lars T Westlye
- Department of Psychology, University of Oslo, Oslo, Norway
- NORMENT, Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
| | - Paul M Thompson
- Imaging Genetics Center, Stevens Neuroimaging & Informatics Institute, Keck School of Medicine of USC, Marina del Rey, CA, USA
- Departments of Neurology, Pediatrics, Psychiatry, Radiology, Engineering, and Ophthalmology, USC, Los Angeles, CA, USA
| | - Emily L Dennis
- Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA
- Imaging Genetics Center, Stevens Neuroimaging & Informatics Institute, Keck School of Medicine of USC, Marina del Rey, CA, USA
| | - David F Tate
- Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA
- George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT, USA
| | - Elisabeth A Wilde
- Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA
- George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT, USA
- H. Ben Taub Department of Physical Medicine and Rehabilitation, Baylor College of Medicine, Houston, TX, USA
| | - Frank G Hillary
- Department of Neurology, Hershey Medical Center, State College, PA, USA.
| |
Collapse
|
100
|
Graña M, Silva M. Impact of Machine Learning Pipeline Choices in Autism Prediction From Functional Connectivity Data. Int J Neural Syst 2021; 31:2150009. [PMID: 33472548 DOI: 10.1142/s012906572150009x] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Autism Spectrum Disorder (ASD) is a largely prevalent neurodevelopmental condition with a big social and economical impact affecting the entire life of families. There is an intense search for biomarkers that can be assessed as early as possible in order to initiate treatment and preparation of the family to deal with the challenges imposed by the condition. Brain imaging biomarkers have special interest. Specifically, functional connectivity data extracted from resting state functional magnetic resonance imaging (rs-fMRI) should allow to detect brain connectivity alterations. Machine learning pipelines encompass the estimation of the functional connectivity matrix from brain parcellations, feature extraction, and building classification models for ASD prediction. The works reported in the literature are very heterogeneous from the computational and methodological point of view. In this paper, we carry out a comprehensive computational exploration of the impact of the choices involved while building these machine learning pipelines. Specifically, we consider six brain parcellation definitions, five methods for functional connectivity matrix construction, six feature extraction/selection approaches, and nine classifier building algorithms. We report the prediction performance sensitivity to each of these choices, as well as the best results that are comparable with the state of the art.
Collapse
Affiliation(s)
- Manuel Graña
- Computational Intelligence Group, University of the Basque Country (UPV/EHU), San Sebastian, Spain
| | - Moises Silva
- Universidad Mayor de San Andres, La Paz, Bolivia
| |
Collapse
|