1. Du X, Ding X, Xi M, Lv Y, Qiu S, Liu Q. A Data Augmentation Method for Motor Imagery EEG Signals Based on DCGAN-GP Network. Brain Sci 2024; 14:375. PMID: 38672024; PMCID: PMC11048538; DOI: 10.3390/brainsci14040375.
Abstract
Motor imagery electroencephalography (EEG) signals have garnered attention in brain-computer interface (BCI) research due to their potential in promoting motor rehabilitation and control. However, the limited availability of labeled data poses challenges for training robust classifiers. In this study, we propose a novel data augmentation method utilizing an improved Deep Convolutional Generative Adversarial Network with Gradient Penalty (DCGAN-GP) to address this issue. We transformed raw EEG signals into two-dimensional time-frequency maps and employed a DCGAN-GP network to generate synthetic time-frequency representations resembling real data. Validation experiments were conducted on the BCI IV 2b dataset, comparing the performance of classifiers trained with augmented and unaugmented data. Results demonstrated that classifiers trained with synthetic data exhibited enhanced robustness across multiple subjects and achieved higher classification accuracy. Our findings highlight the effectiveness of utilizing DCGAN-GP-generated synthetic EEG data to improve classifier performance in distinguishing different motor imagery tasks. Thus, the proposed data augmentation method based on a DCGAN-GP offers a promising avenue for enhancing BCI system performance, overcoming data scarcity challenges, and bolstering classifier robustness, thereby providing substantial support for the broader adoption of BCI technology in real-world applications.
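Editor's note: the "GP" in DCGAN-GP denotes a gradient penalty on the discriminator. The abstract does not give the formulation, so the sketch below shows the standard WGAN-GP penalty it presumably adapts, written in PyTorch; the names `critic`, `real`, `fake`, and `lambda_gp` are illustrative assumptions, not code from the paper.

```python
import torch

def gradient_penalty(critic, real, fake, lambda_gp=10.0):
    """Standard WGAN-GP term: penalize the critic when the norm of its
    gradient, evaluated at random interpolations between real and
    generated samples, deviates from 1. A generic sketch only."""
    batch = real.size(0)
    eps = torch.rand(batch, 1, 1, 1, device=real.device)  # per-sample mix
    interp = (eps * real + (1.0 - eps) * fake).requires_grad_(True)
    scores = critic(interp)
    grads, = torch.autograd.grad(scores.sum(), interp, create_graph=True)
    grad_norm = grads.reshape(batch, -1).norm(2, dim=1)
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()
```

The 4-D shape of `eps` assumes image-like inputs (here, the 2-D time-frequency maps described in the abstract, batched as NCHW tensors).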
Affiliation(s)
- Xiaohui Ding
- Communication and Network Laboratory, Dalian University, Dalian 116622, China; (X.D.); (M.X.); (Y.L.); (S.Q.); (Q.L.)
2. Wang X, Mi Y, Zhang X. 3D human pose data augmentation using Generative Adversarial Networks for robotic-assisted movement quality assessment. Front Neurorobot 2024; 18:1371385. PMID: 38644903; PMCID: PMC11032046; DOI: 10.3389/fnbot.2024.1371385.
Abstract
In the realm of human motion recognition systems, the augmentation of 3D human pose data plays a pivotal role in enriching and enhancing the quality of original datasets through the generation of synthetic data. This augmentation is vital for addressing the current research gaps in diversity and complexity, particularly when dealing with rare or complex human movements. Our study introduces a groundbreaking approach employing Generative Adversarial Networks (GANs), coupled with Support Vector Machine (SVM) and DenseNet, further enhanced by robot-assisted technology to improve the precision and efficiency of data collection. The GANs in our model are responsible for generating highly realistic and diverse 3D human motion data, while SVM aids in the effective classification of this data. DenseNet is utilized for the extraction of key features, facilitating a comprehensive and integrated approach that significantly elevates both the data augmentation process and the model's ability to process and analyze complex human movements. The experimental outcomes underscore our model's exceptional performance in motion quality assessment, showcasing a substantial improvement over traditional methods in terms of classification accuracy and data processing efficiency. These results validate the effectiveness of our integrated network model, setting a solid foundation for future advancements in the field. Our research not only introduces innovative methodologies for 3D human pose data enhancement but also provides substantial technical support for practical applications across various domains, including sports science, rehabilitation medicine, and virtual reality. By combining advanced algorithmic strategies with robotic technologies, our work addresses key challenges in data augmentation and motion quality assessment, paving the way for new research and development opportunities in these critical areas.
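Editor's note: the abstract's division of labor (DenseNet for feature extraction, SVM for classification) can be sketched with off-the-shelf components. The hypothetical version below uses a frozen DenseNet-121 and an RBF SVM; the DenseNet variant, the rendering of pose data to images, and all data are placeholders, not details from the paper.

```python
import numpy as np
import torch
from torchvision.models import densenet121
from sklearn.svm import SVC

# Frozen DenseNet as a feature extractor; replacing the classifier head
# with an identity exposes the 1024-d penultimate features.
backbone = densenet121(weights=None)
backbone.classifier = torch.nn.Identity()
backbone.eval()

def extract_features(images):            # images: (N, 3, 224, 224)
    with torch.no_grad():
        return backbone(images).numpy()

# Stand-in batch: images rendered from real plus GAN-augmented 3D poses.
images = torch.randn(8, 3, 224, 224)
labels = np.array([0, 1] * 4)            # stand-in movement-quality classes

svm = SVC(kernel="rbf").fit(extract_features(images), labels)
print(svm.predict(extract_features(images[:2])))
```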
Affiliation(s)
- Xuefeng Wang
- College of Sports, Woosuk University, Jeonju, Republic of Korea
- Yang Mi
- College of Sports and Health, Linyi University, Linyi, China
- Xiang Zhang
- Department of Information Engineering, Linyi Technician Institute, Linyi, China
3. Jiang Q, Sun H, Deng W, Chen L, Li Q, Xie J, Pan X, Cheng Y, Chen X, Wang Y, Li Y, Wang X, Liu S, Xiao Y. Super Resolution of Pulmonary Nodules Target Reconstruction Using a Two-Channel GAN Models. Acad Radiol 2024:S1076-6332(24)00086-2. PMID: 38458886; DOI: 10.1016/j.acra.2024.02.016.
Abstract
RATIONALE AND OBJECTIVES To develop a Dual generative-adversarial-network (GAN) Cascaded Network (DGCN) for generating super-resolution computed tomography (SRCT) images from normal-resolution CT (NRCT) images and evaluate the performance of DGCN in multi-center datasets. MATERIALS AND METHODS This retrospective study included 278 patients with chest CT from two hospitals between January 2020 and June 2023, and each patient had all three examinations: NRCT (512×512 matrix CT images with a resolution of 0.70 mm × 0.70 mm × 1.0 mm), high-resolution CT (HRCT, 1024×1024 matrix CT images with a resolution of 0.35 mm × 0.35 mm × 1.0 mm), and ultra-high-resolution CT (UHRCT, 1024×1024 matrix CT images with a resolution of 0.17 mm × 0.17 mm × 0.5 mm). Initially, a deep chest CT super-resolution residual network (DCRN) was built to generate HRCT from NRCT. Subsequently, we employed the DCRN as a pre-trained model for the training of DGCN to further enhance resolution along all three axes, ultimately yielding SRCT. PSNR, SSIM, FID, subjective evaluation scores, and objective evaluation parameters related to pulmonary nodule segmentation in the testing set were recorded and analyzed. RESULTS DCRN obtained a PSNR of 52.16, SSIM of 0.9941, FID of 137.713, and an average diameter difference of 0.0981 mm. DGCN obtained a PSNR of 46.50, SSIM of 0.9990, FID of 166.421, and an average diameter difference of 0.0981 mm on 39 testing cases. There were no significant differences between the SRCT and UHRCT images in subjective evaluation. CONCLUSION Our model exhibited a significant enhancement in generating HRCT and SRCT images and outperformed established methods regarding image quality and clinical segmentation accuracy across both internal and external testing datasets.
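Editor's note: the two image-fidelity metrics reported for DCRN and DGCN (PSNR, SSIM) can be reproduced with scikit-image as in the sketch below; the arrays stand in for a matched reference/generated CT slice pair and are not the study's data.

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

# Stand-ins for a reference UHRCT slice and a generated SRCT slice.
rng = np.random.default_rng(0)
reference = rng.random((256, 256)).astype(np.float32)
noise = 0.01 * rng.standard_normal((256, 256), dtype=np.float32)
generated = np.clip(reference + noise, 0.0, 1.0)

psnr = peak_signal_noise_ratio(reference, generated, data_range=1.0)
ssim = structural_similarity(reference, generated, data_range=1.0)
print(f"PSNR = {psnr:.2f} dB, SSIM = {ssim:.4f}")
```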
Affiliation(s)
- Qinling Jiang
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Hongbiao Sun
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Wei Deng
- Shanghai United Imaging Intelligence Co. Ltd., Shanghai 200232, China
- Lei Chen
- Shanghai United Imaging Intelligence Co. Ltd., Shanghai 200232, China
- Qingchu Li
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Jicai Xie
- Department of Radiology, The Second People's Hospital of Yuhuan, 317699, China
- Xianpan Pan
- Shanghai United Imaging Intelligence Co. Ltd., Shanghai 200232, China
- Yuxin Cheng
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Xin Chen
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Yunmeng Wang
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Yanran Li
- University of Queensland, Brisbane 4072, Australia
- Xiang Wang
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Shiyuan Liu
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Yi Xiao
- Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
4. Vagni M, Tran HE, Romano A, Chiloiro G, Boldrini L, Zormpas-Petridis K, Kawula M, Landry G, Kurz C, Corradini S, Belka C, Indovina L, Gambacorta MA, Placidi L, Cusumano D. Auto-segmentation of pelvic organs at risk on 0.35T MRI using 2D and 3D Generative Adversarial Network models. Phys Med 2024; 119:103297. PMID: 38310680; DOI: 10.1016/j.ejmp.2024.103297.
Abstract
PURPOSE Manual recontouring of targets and Organs At Risk (OARs) is a time-consuming and operator-dependent task. We explored the potential of Generative Adversarial Networks (GAN) to auto-segment the rectum, bladder and femoral heads on 0.35T MRIs to accelerate the online MRI-guided-Radiotherapy (MRIgRT) workflow. METHODS 3D planning MRIs from 60 prostate cancer patients treated with a 0.35T MR-Linac were collected. A 3D GAN architecture and its equivalent 2D version were trained, validated and tested on 40, 10 and 10 patients respectively. The volumetric Dice Similarity Coefficient (DSC) and 95th percentile Hausdorff Distance (HD95th) were computed against expert-drawn ground-truth delineations. The networks were also validated on an independent external dataset of 16 patients. RESULTS In the internal test set, the 3D and 2D GANs showed DSC/HD95th of 0.83/9.72 mm and 0.81/10.65 mm for the rectum, 0.92/5.91 mm and 0.85/15.72 mm for the bladder, and 0.94/3.62 mm and 0.90/9.49 mm for the femoral heads. In the external test set, the performance was 0.74/31.13 mm and 0.72/25.07 mm for the rectum, 0.92/9.46 mm and 0.88/11.28 mm for the bladder, and 0.89/7.00 mm and 0.88/10.06 mm for the femoral heads. The 3D and 2D GANs required on average 1.44 s and 6.59 s, respectively, to generate the OARs' volumetric segmentation for a single patient. CONCLUSIONS The proposed 3D GAN auto-segments pelvic OARs on 0.35T MRI with high accuracy in both the internal and the external test sets, outperforming its 2D equivalent in both segmentation robustness and volume generation time.
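Editor's note: the two reported segmentation metrics are compact to implement. The sketch below computes Dice on whole boolean masks and a simplified HD95 that pools distances from all mask voxels rather than extracted surfaces; that simplification is a common approximation and an assumption here, not the paper's exact procedure.

```python
import numpy as np
from scipy.ndimage import distance_transform_edt

def dice(a, b):
    """Volumetric Dice Similarity Coefficient for boolean masks."""
    return 2.0 * np.logical_and(a, b).sum() / (a.sum() + b.sum())

def hd95(a, b, spacing=(1.0, 1.0, 1.0)):
    """Approximate 95th-percentile symmetric Hausdorff distance (mm):
    distance transforms give each voxel's distance to the other mask,
    and the pooled distances are summarized at the 95th percentile."""
    dist_to_a = distance_transform_edt(~a, sampling=spacing)
    dist_to_b = distance_transform_edt(~b, sampling=spacing)
    return np.percentile(np.hstack([dist_to_a[b], dist_to_b[a]]), 95)
```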
Affiliation(s)
- Marica Vagni
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy
- Huong Elena Tran
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy
- Angela Romano
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy
- Giuditta Chiloiro
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy
- Luca Boldrini
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy
- Maria Kawula
- Department of Radiation Oncology, University Hospital, LMU Munich, Munich, Germany
- Guillaume Landry
- Department of Radiation Oncology, University Hospital, LMU Munich, Munich, Germany
- Christopher Kurz
- Department of Radiation Oncology, University Hospital, LMU Munich, Munich, Germany
- Stefanie Corradini
- Department of Radiation Oncology, University Hospital, LMU Munich, Munich, Germany
- Claus Belka
- Department of Radiation Oncology, University Hospital, LMU Munich, Munich, Germany; German Cancer Consortium (DKTK), Department of Radiation Oncology, Munich, Germany
- Luca Indovina
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy
- Lorenzo Placidi
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy
- Davide Cusumano
- Fondazione Policlinico Universitario "Agostino Gemelli" IRCCS, Rome, Italy; Mater Olbia Hospital, Olbia, SS, Italy
5. Sohn J, Shin H, Lee J, Kim HC. Validation of Electrocardiogram Based Photoplethysmogram Generated Using U-Net Based Generative Adversarial Networks. J Healthc Inform Res 2024; 8:140-157. PMID: 38273980; PMCID: PMC10805750; DOI: 10.1007/s41666-023-00156-z.
Abstract
The photoplethysmogram (PPG) plays an important role in alerting to atrial fibrillation (AF). While the importance of PPG is emphasized, there is an insufficient amount of openly available atrial fibrillation PPG data. We propose a U-net-based generative adversarial network (GAN) which synthesizes PPG from paired electrocardiogram (ECG) signals. To measure the performance of the proposed GAN, we compared the generated PPG to reference PPG in terms of morphology similarity and also examined its influence on AF detection classifier performance. First, morphology was compared using two different metrics against the reference signal: percent root mean square difference (PRD) and Pearson correlation coefficient. The mean PRD and Pearson correlation coefficient were 27% and 0.94, respectively. Heart rate variability (HRV) of the reference AF ECG and the generated PPG were compared as well. The p-value of the paired t-test was 0.248, indicating that no significant difference was observed between the two HRV values. Second, to validate the generated AF PPG dataset, four different datasets were prepared combining the generated PPG and real AF PPG. Each dataset was used to optimize a classification model while maintaining the same architecture. A test dataset was prepared to test the performance of each optimized model. Subsequently, these datasets were used to test whether the generated data benefit the training of an AF classifier. Comparing the performance metrics of each optimized model, the training dataset consisting of generated and real AF PPG showed a test accuracy of 0.962, which was close to that of the dataset consisting only of real AF PPG data at 0.961. Furthermore, both models yielded the same F1 score of 0.969. Lastly, using only the generated AF PPG dataset resulted in a test accuracy of 0.945, indicating that the GAN was capable of generating valuable AF PPG. Therefore, it can be concluded that the generated AF PPG can be used to augment insufficient data. To summarize, this study proposes a GAN-based method to generate atrial fibrillation PPG that can be used for training atrial fibrillation PPG classification models.
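Editor's note: the two morphology metrics used here are easy to reproduce. The sketch below implements one common definition of PRD alongside SciPy's Pearson correlation, on synthetic stand-in waveforms rather than real PPG.

```python
import numpy as np
from scipy.stats import pearsonr

def prd(reference, generated):
    """Percent root-mean-square difference (one common definition)."""
    return 100.0 * np.sqrt(np.sum((reference - generated) ** 2)
                           / np.sum(reference ** 2))

t = np.linspace(0.0, 10.0, 2000)
reference = np.sin(2 * np.pi * 1.2 * t)   # ~72 bpm pulse-wave proxy
generated = reference + 0.1 * np.random.default_rng(0).standard_normal(t.size)

print(f"PRD = {prd(reference, generated):.1f}%")
print(f"Pearson r = {pearsonr(reference, generated)[0]:.3f}")
```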
Affiliation(s)
- Jangjay Sohn
- Institute of Medical & Biological Engineering, Medical Research Center, Seoul National University College of Medicine, Seoul, Korea
- Department of Electronic Engineering, Hanyang University, Seoul, Korea
- Heean Shin
- Samsung SDS R&D Center, Seoul, Republic of Korea
- Joonnyong Lee
- Mellowing Factory Co., Ltd., 131 Sapeyong-daero 57-gil, Seocho-gu, Seoul, 06535 Republic of Korea
- Hee Chan Kim
- Interdisciplinary Program in Bioengineering, Seoul National University, Seoul, Korea
- Department of Biomedical Engineering, Seoul National University College of Medicine, 103, Daehak-ro, Jongno-gu, Seoul, 03080 Republic of Korea
6. Xu Z, Tang J, Qi C, Yao D, Liu C, Zhan Y, Lukasiewicz T. Cross-domain attention-guided generative data augmentation for medical image analysis with limited data. Comput Biol Med 2024; 168:107744. PMID: 38006826; DOI: 10.1016/j.compbiomed.2023.107744.
Abstract
Data augmentation is widely applied to medical image analysis tasks in limited datasets with imbalanced classes and insufficient annotations. However, traditional augmentation techniques cannot supply extra information, making the performance of diagnosis unsatisfactory. GAN-based generative methods have thus been proposed to obtain additional useful information to realize more effective data augmentation; but existing generative data augmentation techniques mainly encounter two problems: (i) current generative data augmentation lacks the capability to use cross-domain differential information to extend limited datasets; (ii) existing generative methods cannot provide effective supervised information in medical image segmentation tasks. To solve these problems, we propose an attention-guided cross-domain tumor image generation model (CDA-GAN) with an information enhancement strategy. The CDA-GAN can generate diverse samples to expand the scale of datasets, improving the performance of medical image diagnosis and treatment tasks. In particular, we incorporate channel attention into a CycleGAN-based cross-domain generation network that captures inter-domain information and generates positive or negative samples of brain tumors. In addition, we propose a semi-supervised spatial attention strategy to guide spatial information of features at the pixel level in tumor generation. Furthermore, we add spectral normalization to the discriminator to prevent mode collapse and stabilize the training procedure. Finally, to resolve an inapplicability problem in the segmentation task, we further propose an application strategy of using this data augmentation model to achieve more accurate medical image segmentation with limited data. Experimental studies on two public brain tumor datasets (BraTS and TCIA) show that the proposed CDA-GAN model greatly outperforms the state-of-the-art generative data augmentation in both practical medical image classification tasks and segmentation tasks; e.g., CDA-GAN is 0.50%, 1.72%, 2.05%, and 0.21% better than the best SOTA baseline in terms of ACC, AUC, Recall, and F1, respectively, in the classification task of BraTS, while its improvements w.r.t. the best SOTA baseline in terms of Dice, Sens, HD95, and mIOU in the segmentation task of TCIA are 2.50%, 0.90%, 14.96%, and 4.18%, respectively.
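Editor's note: two of this abstract's building blocks, CycleGAN's cycle-consistency objective and channel attention, can be sketched generically in PyTorch as below. The squeeze-and-excitation form of the attention is an assumption, since the abstract does not specify the exact design; `G_ab`/`G_ba` stand for any pair of image-to-image generators.

```python
import torch
import torch.nn.functional as F

def cycle_consistency_loss(G_ab, G_ba, real_a, real_b, lam=10.0):
    """CycleGAN-style objective: A->B->A and B->A->B translations
    should reconstruct their inputs (L1 reconstruction error)."""
    rec_a = G_ba(G_ab(real_a))
    rec_b = G_ab(G_ba(real_b))
    return lam * (F.l1_loss(rec_a, real_a) + F.l1_loss(rec_b, real_b))

class ChannelAttention(torch.nn.Module):
    """Squeeze-and-excitation style gate: re-weight feature channels
    using globally pooled statistics (one plausible form of the
    channel attention the abstract mentions)."""
    def __init__(self, channels, reduction=8):
        super().__init__()
        self.fc = torch.nn.Sequential(
            torch.nn.Linear(channels, channels // reduction),
            torch.nn.ReLU(inplace=True),
            torch.nn.Linear(channels // reduction, channels),
            torch.nn.Sigmoid())

    def forward(self, x):                      # x: (N, C, H, W)
        weights = self.fc(x.mean(dim=(2, 3)))  # squeeze -> (N, C)
        return x * weights[:, :, None, None]   # excite: scale channels
```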
Affiliation(s)
- Zhenghua Xu
- State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences and Biomedical Engineering, Hebei University of Technology, Tianjin, China
- Jiaqi Tang
- State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences and Biomedical Engineering, Hebei University of Technology, Tianjin, China
- Chang Qi
- State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences and Biomedical Engineering, Hebei University of Technology, Tianjin, China; Institute of Logic and Computation, Vienna University of Technology, Vienna, Austria
- Dan Yao
- State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences and Biomedical Engineering, Hebei University of Technology, Tianjin, China
- Caihua Liu
- College of Computer Science and Technology, Civil Aviation University of China, Tianjin, China
- Yuefu Zhan
- Department of Radiology, Hainan Women and Children's Medical Center, Haikou, China
- Thomas Lukasiewicz
- Institute of Logic and Computation, Vienna University of Technology, Vienna, Austria; Department of Computer Science, University of Oxford, Oxford, United Kingdom
7. Belue MJ, Harmon SA, Masoudi S, Barrett T, Law YM, Purysko AS, Panebianco V, Yilmaz EC, Lin Y, Jadda PK, Raavi S, Wood BJ, Pinto PA, Choyke PL, Turkbey B. Quality of T2-weighted MRI re-acquisition versus deep learning GAN image reconstruction: A multi-reader study. Eur J Radiol 2024; 170:111259. PMID: 38128256; PMCID: PMC10842312; DOI: 10.1016/j.ejrad.2023.111259.
Abstract
PURPOSE To evaluate CycleGAN's ability to enhance T2-weighted image (T2WI) quality. METHOD A CycleGAN algorithm was used to enhance T2WI quality. 96 patients (192 scans) were identified who underwent repeat axial T2WI because of poor quality on the first attempt (RAD1), with improved quality on re-acquisition (RAD2). The CycleGAN pipeline produced deep learning (DL) classifier scores (0-1) for quality quantification and generated enhanced versions, QI1 and QI2, from RAD1 and RAD2, respectively. A subset (n = 20 patients) was selected for a blinded, multi-reader study, in which four radiologists rated T2WI quality on a scale of 1-4. The multi-reader study presented readers with 60 image pairs (RAD1 vs RAD2, RAD1 vs QI1, and RAD2 vs QI2), allowing them to select sequence preferences and quantify the quality changes. RESULTS The DL classifier correctly discerned 71.9% of quality classes, identifying 90.6% (96/106) of poor-quality and 48.8% (42/86) of diagnostic original sequences (RAD1, RAD2). CycleGAN images (QI1, QI2) demonstrated quantitative improvements, with consistently higher DL classifier scores than original scans (p < 0.001). In the multi-reader analysis, however, CycleGAN demonstrated no qualitative improvements: in most patients, QI2 showed diminished overall quality and more motion than RAD2, while noise levels remained similar (8/20). No readers preferred QI2 to RAD2 for diagnosis. CONCLUSION Despite quantitative enhancements with CycleGAN, there was no qualitative boost in T2WI diagnostic quality, noise, or motion. Expert radiologists did not favor CycleGAN images over standard scans, highlighting the divide between quantitative and qualitative metrics.
Affiliation(s)
- Mason J Belue
- Molecular Imaging Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Stephanie A Harmon
- Molecular Imaging Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Tristan Barrett
- Department of Radiology, University of Cambridge, Cambridge, England
- Yan Mee Law
- Department of Radiology, Singapore General Hospital, Singapore
- Andrei S Purysko
- Section of Abdominal Imaging, Imaging Institute, Cleveland Clinic, Cleveland, OH, USA
- Enis C Yilmaz
- Molecular Imaging Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Yue Lin
- Molecular Imaging Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Pavan Kumar Jadda
- Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
- Sitarama Raavi
- Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
- Bradford J Wood
- Center for Interventional Oncology, National Cancer Institute, NIH, Bethesda, MD, USA; Department of Radiology, Clinical Center, National Institutes of Health, Bethesda, Maryland, USA
- Peter A Pinto
- Urologic Oncology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Peter L Choyke
- Molecular Imaging Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Baris Turkbey
- Molecular Imaging Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
8. Thangaraj PM, Shankar SV, Oikonomou EK, Khera R. RCT-Twin-GAN Generates Digital Twins of Randomized Control Trials Adapted to Real-world Patients to Enhance their Inference and Application. medRxiv 2023:2023.12.06.23299464 [preprint]. PMID: 38106089; PMCID: PMC10723568; DOI: 10.1101/2023.12.06.23299464.
Abstract
Background Randomized clinical trials (RCTs) are designed to produce evidence in selected populations. Assessing their effects in the real world is essential to changing medical practice; however, key populations are historically underrepresented in RCTs. We define an approach to simulate RCT-based effects in real-world settings using RCT digital twins reflecting the covariate patterns in an electronic health record (EHR). Methods We developed a Generative Adversarial Network (GAN) model, RCT-Twin-GAN, which generates a digital twin of an RCT (RCT-Twin) conditioned on covariate distributions from an EHR cohort. We improved upon a traditional tabular conditional GAN, CTGAN, with a loss function adapted for data distributions and by conditioning on multiple discrete and continuous covariates simultaneously. We assessed the similarity between a Heart Failure with preserved Ejection Fraction (HFpEF) RCT (TOPCAT), a Yale HFpEF EHR cohort, and RCT-Twin. We also evaluated cardiovascular event-free survival stratified by spironolactone (treatment) use. Results By applying RCT-Twin-GAN to 3445 TOPCAT participants and conditioning on 3445 Yale EHR HFpEF patients, we generated RCT-Twin datasets of 1141 to 3445 patients, depending on covariate conditioning and model parameters. RCT-Twin randomly allocated spironolactone (S)/placebo (P) arms like an RCT, was similar to the RCT by a multi-dimensional distance metric, and balanced covariates (median absolute standardized mean difference (MASMD) 0.017, IQR 0.0034-0.030). The 5 EHR-conditioned covariates in RCT-Twin matched the EHR more closely than the RCT did (MASMD 0.008 vs 0.63, IQR 0.005-0.018 vs 0.59-1.11). RCT-Twin reproduced the overall effect size seen in TOPCAT (5-year cardiovascular composite outcome odds ratio (95% confidence interval) of 0.89 (0.75-1.06) in the RCT vs 0.85 (0.69-1.04) in RCT-Twin). Conclusions RCT-Twin-GAN simulates RCT-derived effects in real-world patients by translating these effects to the covariate distributions of EHR patients. This key methodological advance may enable the direct translation of RCT-derived effects into real-world patient populations and may support causal inference in real-world settings.
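Editor's note: covariate balance in the twin is summarized by the median absolute standardized mean difference. A minimal version of that check for one continuous covariate is sketched below, with synthetic arms standing in for the generated spironolactone/placebo data.

```python
import numpy as np

def standardized_mean_difference(x_a, x_b):
    """Absolute standardized mean difference for one continuous
    covariate across two arms; the paper summarizes balance with
    the median ASMD (MASMD) taken over all covariates."""
    pooled_sd = np.sqrt((np.var(x_a, ddof=1) + np.var(x_b, ddof=1)) / 2)
    return abs(np.mean(x_a) - np.mean(x_b)) / pooled_sd

# Stand-in covariate (e.g., age) in the two randomized twin arms.
rng = np.random.default_rng(0)
arm_s = rng.normal(70, 9, size=500)   # spironolactone arm
arm_p = rng.normal(70, 9, size=500)   # placebo arm
print(f"ASMD = {standardized_mean_difference(arm_s, arm_p):.3f}")
```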
Affiliation(s)
- Phyllis M Thangaraj
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
- Sumukh Vasisht Shankar
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
- Evangelos K Oikonomou
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
- Rohan Khera
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
- Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Center for Outcomes Research and Evaluation (CORE), Yale New Haven Hospital, New Haven, CT, USA
9. Salas J, Saha A, Ravela S. Learning inter-annual flood loss risk models from historical flood insurance claims. J Environ Manage 2023; 347:118862. PMID: 37806269; DOI: 10.1016/j.jenvman.2023.118862.
Abstract
Flooding is a natural hazard that causes substantial loss of lives and livelihoods worldwide. Developing predictive models for flood-induced financial losses is crucial for applications such as insurance underwriting. This research uses the National Flood Insurance Program (NFIP) dataset between 2000 and 2020 to evaluate the predictive skill of past data in predicting near-future flood loss risk. Our approach applies neural networks (Conditional Generative Adversarial Networks), decision trees (Extreme Gradient Boosting), and kernel-based regressors (Gaussian Processes) to estimate pointwise losses. It aggregates them over intervals using a bias-corrected Burr-Pareto distribution to predict risk. The regression models help identify the most informative predictors and highlight crucial factors influencing flood-related financial losses. Applying our approach to quantify the county-level coastal flood loss risk in eight US Southern states yields an R² of 0.807, substantially outperforming related work using stage-damage curves. More detailed testing on 11 counties with significant claims in the NFIP dataset reveals that Extreme Gradient Boosting yields the most favorable results, and bias correction significantly improves the similarity between the predicted and reference claim amount distributions. Our experiments also show that, with climate change already under way, the difference in near-future flood-loss risk predictions between shifting and expanding historical training-data windows is insignificant.
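Editor's note: a pointwise loss regressor of the kind compared here can be set up in a few lines. The sketch below uses the xgboost package's scikit-learn interface on synthetic filler data, with log-transformed targets as one plausible handling of heavy-tailed claim amounts; the features, target transform, and hyperparameters are assumptions, not the paper's stated pipeline.

```python
import numpy as np
from xgboost import XGBRegressor

# Synthetic stand-in: rows could be county-years, columns predictors
# such as rainfall, elevation, and policy coverage; the target is a
# heavy-tailed claim amount. None of this is the NFIP data.
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 6))
y = np.exp(X @ rng.normal(size=6) + 0.3 * rng.normal(size=500))

model = XGBRegressor(n_estimators=300, max_depth=4, learning_rate=0.05)
model.fit(X, np.log(y))                 # regress on log claim amounts
pred = np.exp(model.predict(X))         # back-transform to dollars
print(pred[:5])
```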
Affiliation(s)
- Joaquin Salas
- Earth Signals and Systems Group, Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139-4307, United States of America; CICATA Querétaro. Instituto Politécnico Nacional, Cerro Blanco 141, Colinas del Cimatario, Querétaro, Querétaro, 76090, Mexico.
- Anamitra Saha
- Earth Signals and Systems Group, Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139-4307, United States of America
- Sai Ravela
- Earth Signals and Systems Group, Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139-4307, United States of America
10. Shi D, Zhang W, He S, Chen Y, Song F, Liu S, Wang R, Zheng Y, He M. Translation of Color Fundus Photography into Fluorescein Angiography Using Deep Learning for Enhanced Diabetic Retinopathy Screening. Ophthalmol Sci 2023; 3:100401. PMID: 38025160; PMCID: PMC10630672; DOI: 10.1016/j.xops.2023.100401.
Abstract
Purpose To develop and validate a deep learning model that can transform color fundus (CF) photography into corresponding venous and late-phase fundus fluorescein angiography (FFA) images. Design Cross-sectional study. Participants We included 51 370 CF-venous FFA pairs and 14 644 CF-late FFA pairs from 4438 patients for model development. External testing involved 50 eyes with CF-FFA pairs and 2 public datasets for diabetic retinopathy (DR) classification, with 86 952 CF from EyePACs and 1744 CF from MESSIDOR2. Methods We trained a deep learning model to transform CF into corresponding venous and late-phase FFA images. The quality of the translated FFA images was evaluated quantitatively on the internal test set and subjectively on 100 eyes with CF-FFA paired images (50 from external), based on the realism of the global image, anatomical landmarks (macula, optic disc, and vessels), and lesions. Moreover, we validated the clinical utility of the translated FFA for classifying 5-class DR and diabetic macular edema (DME) in the EyePACs and MESSIDOR2 datasets. Main Outcome Measures Image generation was quantitatively assessed by structural similarity measures (SSIM), and subjectively by 2 clinical experts on a 5-point scale (1 denotes real FFA); intragrader agreement was assessed by kappa. The DR classification accuracy was assessed by the area under the receiver operating characteristic curve. Results The SSIM of the translated FFA images was > 0.6, and the subjective quality scores ranged from 1.37 to 2.60. Both experts reported similar quality scores with substantial agreement (all kappas > 0.8). Adding the generated FFA on top of CF improved DR classification in the EyePACs and MESSIDOR2 datasets, with the area under the receiver operating characteristic curve increasing from 0.912 to 0.939 on the EyePACs dataset and from 0.952 to 0.972 on the MESSIDOR2 dataset. The DME area under the receiver operating characteristic curve also increased from 0.927 to 0.974 in the MESSIDOR2 dataset. Conclusions Our CF-to-FFA framework produced realistic FFA images. Moreover, adding the translated FFA images on top of CF improved the accuracy of DR screening. These results suggest that CF-to-FFA translation could be used as a surrogate method when FFA examination is not feasible and as a simple add-on to improve DR screening. Financial Disclosures Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
Affiliation(s)
- Danli Shi
- School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong
- Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong
- Weiyi Zhang
- School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong
- Shuang He
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Guangdong Provincial Clinical Research Center for Ocular Diseases, Sun Yat-sen University, Guangzhou, China
- Yanxian Chen
- School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong
- Fan Song
- School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong
- Shunming Liu
- Department of Ophthalmology, Guangdong Academy of Medical Sciences, Guangdong Provincial People's Hospital, Guangzhou, China
- Ruobing Wang
- Department of Ophthalmology, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Yingfeng Zheng
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Guangdong Provincial Clinical Research Center for Ocular Diseases, Sun Yat-sen University, Guangzhou, China
- Mingguang He
- School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong
- Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong
- Department of Ophthalmology, Guangdong Academy of Medical Sciences, Guangdong Provincial People's Hospital, Guangzhou, China
11. Madokoro H, Sato K, Nix S, Chiyonobu S, Nagayoshi T, Sato K. OutcropHyBNet: Hybrid Backbone Networks with Data Augmentation for Accurate Stratum Semantic Segmentation of Monocular Outcrop Images in Carbon Capture and Storage Applications. Sensors (Basel) 2023; 23:8809. PMID: 37960509; PMCID: PMC10650223; DOI: 10.3390/s23218809.
Abstract
The rapid advance of climate change and global warming has widespread impacts on society, including ecosystems, water security, food production, health, and infrastructure. Of the global emission reductions required, approximately 74% is expected to come from cutting carbon dioxide (CO2) emissions in energy supply and demand. Carbon Capture and Storage (CCS) has attained global recognition as a preeminent approach for the mitigation of atmospheric carbon dioxide levels, primarily by means of capturing and storing CO2 emissions originating from fossil fuel systems. Currently, geological models for storage location determination in CCS rely on limited sampling data from borehole surveys, which poses accuracy challenges. To tackle this challenge, our research project focuses on analyzing exposed rock formations, known as outcrops, with the goal of identifying the most effective backbone networks for classifying various strata types in outcrop images. We leverage deep learning-based outcrop semantic segmentation techniques using hybrid backbone networks, named OutcropHyBNet, to achieve accurate and efficient lithological classification, while considering texture features and without compromising computational efficiency. We conducted accuracy comparisons using publicly available benchmark datasets, as well as an original dataset expanded through random sampling of 13 outcrop images obtained using a stationary camera installed on the ground. Additionally, we evaluated the efficacy of data augmentation through image synthesis using Only Adversarial Supervision for Semantic Image Synthesis (OASIS). Evaluation experiments on two public benchmark datasets revealed insights into the classification characteristics of different classes. The results demonstrate the superiority of Convolutional Neural Networks (CNNs), specifically DeepLabv3, and Vision Transformers (ViTs), particularly SegFormer, under specific conditions. These findings contribute to advancing accurate lithological classification in geological studies using deep learning methodologies. In the evaluation experiments conducted on ground-level images obtained using a stationary camera and aerial images captured using a drone, we successfully demonstrated the superior performance of SegFormer across all categories.
Affiliation(s)
- Hirokazu Madokoro
- Faculty of Software and Information Science, Iwate Prefectural University, Takizawa 020-0693, Japan
- Kodai Sato
- Faculty of Systems Science and Technology, Akita Prefectural University, Yurihonjo 015-0055, Japan
- Stephanie Nix
- Faculty of Software and Information Science, Iwate Prefectural University, Takizawa 020-0693, Japan
- Shun Chiyonobu
- Graduate School of International Resource Sciences, Akita University, Akita 010-8502, Japan
- Takeshi Nagayoshi
- Faculty of Bioresource Sciences, Akita Prefectural University, Akita 010-0195, Japan
- Kazuhito Sato
- Faculty of Systems Science and Technology, Akita Prefectural University, Yurihonjo 015-0055, Japan
12. Lim B, Seth I, Kah S, Sofiadellis F, Ross RJ, Rozen WM, Cuomo R. Using Generative Artificial Intelligence Tools in Cosmetic Surgery: A Study on Rhinoplasty, Facelifts, and Blepharoplasty Procedures. J Clin Med 2023; 12:6524. PMID: 37892665; PMCID: PMC10607912; DOI: 10.3390/jcm12206524.
Abstract
Artificial intelligence (AI), notably Generative Adversarial Networks (GANs), has the potential to transform medical and patient education. Leveraging GANs in medical fields, especially cosmetic surgery, provides a plethora of benefits, including upholding patient confidentiality, ensuring broad exposure to diverse patient scenarios, and democratizing medical education. This study investigated the capacity of the AI models DALL-E 2, Midjourney, and Blue Willow to generate realistic images pertinent to cosmetic surgery. We combined the generative powers of ChatGPT-4 and Google's BARD with these GANs to produce images of various noses, faces, and eyelids. Four board-certified plastic surgeons evaluated the generated images, eliminating the need for real patient photographs. Notably, the generated images predominantly showcased female faces with lighter skin tones, lacking representation of males, older women, and those with a body mass index above 20. The integration of AI in cosmetic surgery offers enhanced patient education and training but demands careful and ethical incorporation to ensure comprehensive representation and uphold medical standards.
Affiliation(s)
- Bryan Lim
- Department of Plastic and Reconstructive Surgery, Peninsula Health, Frankston, VIC 3199, Australia
- Central Clinical School, Faculty of Medicine, Monash University, Melbourne, VIC 3004, Australia
- Ishith Seth
- Department of Plastic and Reconstructive Surgery, Peninsula Health, Frankston, VIC 3199, Australia
- Central Clinical School, Faculty of Medicine, Monash University, Melbourne, VIC 3004, Australia
- Skyler Kah
- Department of Plastic and Reconstructive Surgery, Peninsula Health, Frankston, VIC 3199, Australia
- Foti Sofiadellis
- Department of Plastic and Reconstructive Surgery, Peninsula Health, Frankston, VIC 3199, Australia
- Richard J. Ross
- Department of Plastic and Reconstructive Surgery, Peninsula Health, Frankston, VIC 3199, Australia
- Warren M. Rozen
- Department of Plastic and Reconstructive Surgery, Peninsula Health, Frankston, VIC 3199, Australia
- Central Clinical School, Faculty of Medicine, Monash University, Melbourne, VIC 3004, Australia
- Roberto Cuomo
- Plastic Surgery Unit, Department of Medicine, Surgery and Neuroscience, University of Siena, 53100 Siena, Italy
13. Guerreiro J, Tomás P, Garcia N, Aidos H. Super-resolution of magnetic resonance images using Generative Adversarial Networks. Comput Med Imaging Graph 2023; 108:102280. PMID: 37597380; DOI: 10.1016/j.compmedimag.2023.102280.
Abstract
Magnetic Resonance Imaging (MRI) typically comes at the cost of small spatial coverage, high expenses and long scan times. Accelerating MRI acquisition by taking fewer measurements offers the potential to relax these inherent trade-offs. Recent breakthroughs in the field of Machine Learning have shown that high-resolution (HR) images can be recovered from low-resolution (LR) signals via super-resolution (SR). In particular, a novel class of neural networks named Generative Adversarial Networks (GANs) has introduced an alternative way of conceiving models capable of generating data. GANs can learn to infer details based on some prior information, subsequently recovering missing data. Accordingly, they hold great potential for MRI reconstruction and acceleration tasks. This paper conducts a review of GAN-based SR methods, exhibiting the ability of GANs to upscale MRIs by a scale factor of ×4 while maintaining trustworthy, high-frequency details. Although quantitative results suggest SRResCycGAN outperforms other popular deep learning methods in recovering ×4 downgraded images, qualitative results show that Beby-GAN achieves the best perceptual quality. Together, these findings indicate that GAN-based methods have the capacity to reduce medical costs and patient distress, and even to enable new MRI applications where imaging is currently too slow or expensive.
Affiliation(s)
- João Guerreiro
- INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal.
- Pedro Tomás
- INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal
- Nuno Garcia
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
- Helena Aidos
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
14. Huang F, Deng Y. TCGAN: Convolutional Generative Adversarial Network for time series classification and clustering. Neural Netw 2023; 165:868-883. PMID: 37433231; DOI: 10.1016/j.neunet.2023.06.033.
Abstract
Recent works have demonstrated the superiority of supervised Convolutional Neural Networks (CNNs) in learning hierarchical representations from time series data for successful classification. These methods require sufficiently large labeled datasets for stable learning; however, acquiring high-quality labeled time series data can be costly and potentially infeasible. Generative Adversarial Networks (GANs) have achieved great success in enhancing unsupervised and semi-supervised learning. Nonetheless, to the best of our knowledge, it remains unclear how effectively GANs can serve as a general-purpose solution to learn representations for time series recognition, i.e., classification and clustering. The above considerations inspire us to introduce a Time-series Convolutional GAN (TCGAN). TCGAN learns by playing an adversarial game between two one-dimensional CNNs (i.e., a generator and a discriminator) in the absence of label information. Parts of the trained TCGAN are then reused to construct a representation encoder to empower linear recognition methods. We conducted comprehensive experiments on synthetic and real-world datasets. The results demonstrate that TCGAN is faster and more accurate than existing time-series GANs. The learned representations enable simple classification and clustering methods to achieve superior and stable performance. Furthermore, TCGAN retains high efficacy in scenarios with few labeled and imbalanced-label data. Our work provides a promising path to effectively utilize abundant unlabeled time series data.
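Editor's note: the reuse idea, train adversarially and then recycle part of the discriminator as a representation encoder for simple linear recognizers, can be sketched as below in PyTorch. The 1-D CNN layer sizes and all data are illustrative, not the paper's architecture.

```python
import torch
import torch.nn as nn
from sklearn.linear_model import LogisticRegression

# A toy 1-D CNN discriminator; after GAN training, its real/fake head
# is dropped and the rest serves as a frozen feature encoder.
discriminator = nn.Sequential(
    nn.Conv1d(1, 32, kernel_size=5, stride=2, padding=2), nn.LeakyReLU(0.2),
    nn.Conv1d(32, 64, kernel_size=5, stride=2, padding=2), nn.LeakyReLU(0.2),
    nn.AdaptiveAvgPool1d(1), nn.Flatten(),
    nn.Linear(64, 1),                       # real/fake output head
)
encoder = nn.Sequential(*list(discriminator.children())[:-1])  # drop head
encoder.eval()

x = torch.randn(16, 1, 128)                 # 16 stand-in series, length 128
with torch.no_grad():
    z = encoder(x).numpy()                  # (16, 64) representations

y = [0] * 8 + [1] * 8                       # stand-in class labels
clf = LogisticRegression().fit(z, y)        # simple linear recognizer
print(clf.predict(z[:4]))
```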
Affiliation(s)
- Fanling Huang
- School of Software, Tsinghua University, Beijing, China.
- Yangdong Deng
- School of Software, Tsinghua University, Beijing, China.
15. Pérez E, Ventura S. Progressive growing of Generative Adversarial Networks for improving data augmentation and skin cancer diagnosis. Artif Intell Med 2023; 141:102556. PMID: 37295899; DOI: 10.1016/j.artmed.2023.102556.
Abstract
Early melanoma diagnosis is the most important factor in the treatment of skin cancer and can effectively reduce mortality rates. Recently, Generative Adversarial Networks have been used to augment data, prevent overfitting and improve the diagnostic capacity of models. However, their application remains a challenging task due to the high levels of inter- and intra-class variance seen in skin images, limited amounts of data, and model instability. We present a more robust Progressive Growing of Generative Adversarial Networks based on residual learning, a technique known to ease the training of deep networks. The stability of the training process was increased by feeding each block additional inputs from preceding blocks. The architecture is able to produce plausible, photorealistic synthetic 512 × 512 skin images, even with small dermoscopic and non-dermoscopic skin image datasets as problem domains. In this manner, we tackle the lack-of-data and imbalance problems. Additionally, the proposed approach leverages a skin lesion boundary segmentation algorithm and transfer learning to enhance the diagnosis of melanoma. Inception score and Matthews Correlation Coefficient were used to measure the performance of the models. The architecture was evaluated qualitatively and quantitatively through an extensive experimental study on sixteen datasets, illustrating its effectiveness in the diagnosis of melanoma. Finally, it significantly outperformed four state-of-the-art data augmentation techniques applied in five convolutional neural network models. The results indicated that a larger number of trainable parameters does not necessarily yield better performance in melanoma diagnosis.
Affiliation(s)
- Eduardo Pérez
- Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI). University of Córdoba, Córdoba, Spain; Maimónides Biomedical Research Institute of Córdoba (IMIBIC). University of Córdoba, Córdoba, Spain
- Sebastián Ventura
- Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI). University of Córdoba, Córdoba, Spain; Maimónides Biomedical Research Institute of Córdoba (IMIBIC). University of Córdoba, Córdoba, Spain.
16. Zhang M, Lin H, Takagi S, Cao Y, Shahabi C, Xiong L. CSGAN: Modality-Aware Trajectory Generation via Clustering-based Sequence GAN. IEEE Int Conf Mob Data Manag 2023; 2023:148-157. PMID: 37965426; PMCID: PMC10644148; DOI: 10.1109/mdm58254.2023.00032.
Abstract
Human mobility data is useful for various applications in urban planning, transportation, and public health, but collecting and sharing real-world trajectories can be challenging due to privacy and data quality issues. To address these problems, recent research focuses on generating synthetic trajectories, mainly using generative adversarial networks (GANs) trained on real-world trajectories. In this paper, we hypothesize that by explicitly capturing the modality of transportation (e.g., walking, biking, driving), we can generate not only more diverse and representative trajectories for different modalities but also more realistic trajectories that preserve geographical-density, trajectory-level, and transition-level properties by capturing both cross-modality and modality-specific patterns. Towards this end, we propose a Clustering-based Sequence Generative Adversarial Network (CSGAN) that simultaneously clusters the trajectories based on their modalities and learns the essential properties of real-world trajectories to generate realistic and representative synthetic trajectories. To measure the effectiveness of generated trajectories, in addition to typical density- and trajectory-level statistics, we define several new metrics for a comprehensive evaluation, including modality distribution and transition probabilities both globally and within each modality. Our extensive experiments with real-world datasets show the superiority of our model in various metrics over state-of-the-art models.
17. Penhaskashi J, Sekimoto O, Chiappelli F. Permafrost viremia and immune tweening. Bioinformation 2023; 19:685-691. PMID: 37885785; PMCID: PMC10598357; DOI: 10.6026/97320630019685.
Abstract
The immune system, an exquisitely regulated physiological system, utilizes a wide spectrum of soluble factors and multiple cell populations and subpopulations at diverse states of maturation to monitor and protect the organism against foreign organisms. Immune surveillance is ensured by distinguishing self-antigens from self-associated with non-self (e.g., viral) peptides presented by major histocompatibility complexes (MHC). Pathology is often identified as unregulated inflammatory responses (e.g., cytokine storm), or recognizing self as a non-self entity (i.e., auto-immunity). Artificial intelligence (AI), and in particular specific machine learning (ML) paradigms (e.g., Deep Learning [DL]) proffer powerful algorithms to better understand and more accurately predict immune responses, immune regulation and homeostasis, and immune reactivity to challenges (i.e., immune allostasis) by their intrinsic ability to interpret immune parameters, pathways and events by analyzing large amounts of complex data and drawing predictive inferences (i.e., immune tweening). We propose here that DL models play an increasingly significant role in better defining and characterizing immunological surveillance to ancient and novel virus species released by thawing permafrost.
Affiliation(s)
- Jaden Penhaskashi
- Division of West Valley Dental Implant Center, Encino, CA 91316, USA
- Francesco Chiappelli
- Dental Group of Sherman Oaks, CA 91403, USA
- Center for the Health Sciences, UCLA, Los Angeles, CA, USA
18. Sheikh ZA, Singh Y, Singh PK, Gonçalves PJS. Defending the Defender: Adversarial Learning Based Defending Strategy for Learning Based Security Methods in Cyber-Physical Systems (CPS). Sensors (Basel) 2023; 23:5459. PMID: 37420626; DOI: 10.3390/s23125459.
Abstract
Cyber-Physical Systems (CPS) are prone to many security exploitations due to the greater attack surface introduced by their cyber component, which by nature is remotely accessible and not isolated. Attacks, meanwhile, grow in complexity, aiming for more powerful exploits and evasion of detection. Security infringements thus call the real-world applicability of CPS into question. Researchers have been developing new and robust techniques to enhance the security of these systems. Many techniques and security aspects are being considered to build robust security systems; these include attack prevention, attack detection, and attack mitigation as security development techniques, with confidentiality, integrity, and availability among the important security aspects. In this paper, we propose machine learning-based intelligent attack detection strategies, which have evolved as a result of the failure of traditional signature-based techniques to detect zero-day attacks and attacks of a complex nature. Many researchers have evaluated the feasibility of learning models in the security domain and pointed out their capability to detect known as well as unknown attacks (zero-day attacks). However, these learning models are also vulnerable to adversarial attacks such as poisoning attacks, evasion attacks, and exploration attacks. To provide a security mechanism that is both robust and intelligent, we propose an adversarial learning-based defense strategy to ensure CPS security and confer resilience against adversarial attacks. We evaluate the proposed strategy through the implementation of Random Forest (RF), Artificial Neural Network (ANN), and Long Short-Term Memory (LSTM) models on the ToN_IoT network dataset and an adversarial dataset generated through a Generative Adversarial Network (GAN) model.
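Editor's note: at its core, the proposed defense is adversarial training: augment the detector's training set with adversarial samples so it remains accurate under attack. The sketch below uses random-noise perturbations as a crude stand-in for the paper's GAN-generated adversarial dataset, and a Random Forest as one of the evaluated learners; the toy labeling rule and all parameters are assumptions.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(42)
X_clean = rng.normal(size=(1000, 20))                 # stand-in features
y_clean = (X_clean[:, 0] + X_clean[:, 1] > 0).astype(int)  # toy label rule
X_adv = X_clean + rng.normal(scale=0.3, size=X_clean.shape)  # perturbed
y_adv = y_clean                      # perturbation assumed label-preserving

# Adversarial learning, simplest form: train on clean + adversarial data.
X_train = np.vstack([X_clean, X_adv])
y_train = np.concatenate([y_clean, y_adv])
rf = RandomForestClassifier(n_estimators=200).fit(X_train, y_train)

print("accuracy on adversarial samples:",
      accuracy_score(y_adv, rf.predict(X_adv)))
```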
Affiliation(s)
- Zakir Ahmad Sheikh
- Department of Computer Science and Information Technology, Central University of Jammu, Rahya Suchani, Bagla, Jammu 181143, India
- Yashwant Singh
- Department of Computer Science and Information Technology, Central University of Jammu, Rahya Suchani, Bagla, Jammu 181143, India
- Pradeep Kumar Singh
- STME, Narsee Monjee Institute of Management Studies (NMIMS) Deemed to be University, Maharashtra 400056, India
19
|
Veturi YA, Woof W, Lazebnik T, Moghul I, Woodward-Court P, Wagner SK, Cabral de Guimarães TA, Daich Varela M, Liefers B, Patel PJ, Beck S, Webster AR, Mahroo O, Keane PA, Michaelides M, Balaskas K, Pontikos N. SynthEye: Investigating the Impact of Synthetic Data on Artificial Intelligence-assisted Gene Diagnosis of Inherited Retinal Disease. Ophthalmol Sci 2023; 3:100258. [PMID: 36685715 PMCID: PMC9852957 DOI: 10.1016/j.xops.2022.100258] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 11/08/2022] [Accepted: 11/09/2022] [Indexed: 11/23/2022]
Abstract
Purpose Rare disease diagnosis is challenging for medical image-based artificial intelligence due to a natural class imbalance in datasets, leading to biased prediction models. Inherited retinal diseases (IRDs) are a research domain that particularly faces this issue. This study investigates the applicability of synthetic data in improving artificial intelligence-enabled diagnosis of IRDs using generative adversarial networks (GANs). Design Diagnostic study of gene-labeled fundus autofluorescence (FAF) IRD images using deep learning. Participants Moorfields Eye Hospital (MEH) dataset of 15 692 FAF images obtained from 1800 patients with a confirmed genetic diagnosis of 1 of 36 IRD genes. Methods A StyleGAN2 model is trained on the IRD dataset to generate 512 × 512 resolution images. Convolutional neural networks are trained for classification using different synthetically augmented datasets, including real IRD images plus 1800 and 3600 synthetic images, and a fully rebalanced dataset. We also perform an experiment with only synthetic data. All models are compared against a baseline convolutional neural network trained only on real data. Main Outcome Measures We evaluated synthetic data quality using a Visual Turing Test conducted with 4 ophthalmologists from MEH. Synthetic and real images were compared using feature space visualization, similarity analysis to detect memorized images, and the Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) score for no-reference-based quality evaluation. Convolutional neural network diagnostic performance was determined on a held-out test set using the area under the receiver operating characteristic curve (AUROC) and Cohen's kappa (κ). Results An average true recognition rate of 63% and fake recognition rate of 47% were obtained from the Visual Turing Test. Thus, a considerable proportion of the synthetic images were classified as real by clinical experts. Similarity analysis showed that the synthetic images were not copies of the real images, indicating that the GAN did not memorize its training images and was able to generalize. However, BRISQUE score analysis indicated that synthetic images were of significantly lower quality overall than real images (P < 0.05). Comparing the rebalanced model (RB) with the baseline (R), no significant change in the average AUROC and κ was found (R-AUROC = 0.86 [0.85-0.88], RB-AUROC = 0.88 [0.86-0.89], R-κ = 0.51 [0.49-0.53], and RB-κ = 0.52 [0.50-0.54]). The synthetic data trained model (S) achieved performance similar to the baseline (S-AUROC = 0.86 [0.85-0.87], S-κ = 0.48 [0.46-0.50]). Conclusions Synthetic generation of realistic IRD FAF images is feasible. Synthetic data augmentation does not deliver improvements in classification performance. However, synthetic data alone deliver performance similar to real data, and hence may be useful as a proxy for real data. Financial Disclosure(s): Proprietary or commercial disclosure may be found after the references.
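As a hedged illustration of the two evaluation metrics named above, the sketch below computes a macro-averaged one-vs-rest AUROC and Cohen's κ with scikit-learn; the 36-class labels and scores are random stand-ins, not the study's data.

```python
import numpy as np
from sklearn.metrics import roc_auc_score, cohen_kappa_score

rng = np.random.default_rng(0)
n_classes = 36                                # 36 IRD genes in the MEH dataset
y_true = rng.integers(0, n_classes, size=500)
scores = rng.random((500, n_classes))
scores /= scores.sum(axis=1, keepdims=True)   # pseudo-probabilities per class

# One-vs-rest multiclass AUROC and chance-corrected agreement (kappa)
auroc = roc_auc_score(y_true, scores, multi_class="ovr", average="macro")
kappa = cohen_kappa_score(y_true, scores.argmax(axis=1))
print(f"AUROC={auroc:.3f}, kappa={kappa:.3f}")
```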
Collapse
Key Words
- AUROC, area under the receiver operating characteristic curve
- BRISQUE, Blind/Referenceless Image Spatial Quality Evaluator
- Class imbalance
- Clinical Decision-Support Model
- DL, deep learning
- Deep Learning
- FAF, fundus autofluorescence
- FRR, Fake Recognition Rate
- GAN, generative adversarial network
- Generative Adversarial Networks
- IRD, inherited retinal disease
- Inherited Retinal Diseases
- MEH, Moorfields Eye Hospital
- R, baseline model
- RB, rebalanced model
- S, synthetic data trained model
- Synthetic data
- TRR, True Recognition Rate
- UMAP, Uniform Manifold Approximation and Projection
Collapse
Affiliation(s)
- Yoga Advaith Veturi
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - William Woof
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Teddy Lazebnik
- University College London Cancer Institute, University College London, London, UK
| | | | - Peter Woodward-Court
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Siegfried K. Wagner
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | | | - Malena Daich Varela
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | | | | | - Stephan Beck
- University College London Cancer Institute, University College London, London, UK
| | - Andrew R. Webster
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Omar Mahroo
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Pearse A. Keane
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Michel Michaelides
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Konstantinos Balaskas
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Nikolas Pontikos
- University College London Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| |
Collapse
|
20
|
Kim HS, Ha EG, Lee A, Choi YJ, Jeon KJ, Han SS, Lee C. Refinement of image quality in panoramic radiography using a generative adversarial network. Dentomaxillofac Radiol 2023:20230007. [PMID: 37129509 DOI: 10.1259/dmfr.20230007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023] Open
Abstract
OBJECTIVE We aimed to develop and assess the clinical usefulness of a generative adversarial network (GAN) model for improving image quality in panoramic radiography. METHODS Panoramic radiographs obtained at Yonsei University Dental Hospital were randomly selected for study inclusion (n = 100). Datasets with degraded image quality (n = 400) were prepared using four different processing methods: blur, noise, blur with noise, and blur in the anterior teeth region. The images were distributed to the training and test datasets in a ratio of 9:1 for each group. The Pix2Pix GAN model was trained using pairs of the original and degraded image datasets for 100 epochs. The peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) were obtained for the test dataset, and two oral and maxillofacial radiologists rated the quality of clinical images. RESULTS Among the degraded images, the GAN model enabled the greatest improvement in those with blur in the region of the anterior teeth but was least effective in improving images exhibiting blur with noise (PSNR, 36.27 > 32.74; SSIM, 0.90 > 0.82). While the mean clinical image quality score of the original radiographs was 44.6 out of 46.0, the highest and lowest predicted scores were observed in the blur (45.2) and noise (36.0) groups. CONCLUSION The GAN model developed in this study has the potential to improve panoramic radiographs with degraded image quality, both quantitatively and qualitatively. As the model performs better in refining blurred images, further research is required to identify the most effective methods for handling noisy images.
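For reference, the sketch below shows how the two reported quality metrics (PSNR and SSIM) can be computed with scikit-image; the image pair is a random stand-in, not a panoramic radiograph.

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

rng = np.random.default_rng(0)
original = rng.random((256, 512)).astype(np.float32)   # stand-in "original"
degraded = np.clip(original + 0.05 * rng.standard_normal(original.shape),
                   0, 1).astype(np.float32)            # stand-in "restored"

# Higher PSNR (dB) and SSIM (0..1) mean the restoration is closer to the original
psnr = peak_signal_noise_ratio(original, degraded, data_range=1.0)
ssim = structural_similarity(original, degraded, data_range=1.0)
print(f"PSNR={psnr:.2f} dB, SSIM={ssim:.3f}")
```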
Collapse
Affiliation(s)
- Hak-Sun Kim
- Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, South Korea
| | - Eun-Gyu Ha
- Department of Electrical and Electronic Engineering, Yonsei University College of Engineering, Seoul, South Korea
| | - Ari Lee
- Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, South Korea
| | - Yoon Joo Choi
- Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, South Korea
| | - Kug Jin Jeon
- Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, South Korea
| | - Sang-Sun Han
- Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, South Korea
| | - Chena Lee
- Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, South Korea
- Institute for Innovation in Digital Healthcare, Yonsei University, Seoul, South Korea
| |
Collapse
|
21
|
Zhu W, Qiu P, Farazi M, Nandakumar K, Dumitrascu OM, Wang Y. OPTIMAL TRANSPORT GUIDED UNSUPERVISED LEARNING FOR ENHANCING LOW-QUALITY RETINAL IMAGES. Proc IEEE Int Symp Biomed Imaging 2023; 2023:10.1109/isbi53787.2023.10230719. [PMID: 37736573 PMCID: PMC10513403 DOI: 10.1109/isbi53787.2023.10230719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/23/2023]
Abstract
Real-world non-mydriatic retinal fundus photography is prone to artifacts, imperfections, and low quality when certain ocular or systemic co-morbidities exist. Artifacts may result in inaccuracy or ambiguity in clinical diagnoses. In this paper, we propose a simple but effective end-to-end framework for enhancing poor-quality retinal fundus images. Leveraging optimal transport theory, we propose an unpaired image-to-image translation scheme for transporting low-quality images to their high-quality counterparts. We theoretically prove that a Generative Adversarial Network (GAN) model with one generator and one discriminator is sufficient for this task. Furthermore, to mitigate the inconsistency of information between the low-quality images and their enhancements, an information consistency mechanism is proposed to maximally maintain structural consistency (optic discs, blood vessels, lesions) between the source and enhanced domains. Extensive experiments were conducted on the EyeQ dataset to demonstrate the superiority of our proposed method perceptually and quantitatively.
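A hedged sketch of the general pattern of pairing an adversarial objective with a structure-preserving term; the plain L1 penalty and the weight `lam` below are illustrative assumptions, not the paper's exact information-consistency mechanism.

```python
import torch
import torch.nn.functional as F

def generator_loss(d_fake_logits, enhanced, low_quality, lam=10.0):
    """Adversarial term plus a structure-preservation term (illustrative)."""
    # Non-saturating adversarial loss: try to fool the discriminator
    adv = F.binary_cross_entropy_with_logits(
        d_fake_logits, torch.ones_like(d_fake_logits))
    # Consistency term: keep the enhancement close to the input so
    # vessels, discs, and lesions are not hallucinated away
    consistency = F.l1_loss(enhanced, low_quality)
    return adv + lam * consistency
```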
Collapse
Affiliation(s)
- Wenhui Zhu
- School of Computing and Augmented Intelligence, Arizona State University, AZ 85281, USA
| | - Peijie Qiu
- McKelvey School of Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Mohammad Farazi
- School of Computing and Augmented Intelligence, Arizona State University, AZ 85281, USA
| | | | | | - Yalin Wang
- School of Computing and Augmented Intelligence, Arizona State University, AZ 85281, USA
| |
Collapse
|
22
|
Huang Z, Wang J, Lu X, Mohd Zain A, Yu G. scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network. Brief Bioinform 2023; 24:7024714. [PMID: 36733262 DOI: 10.1093/bib/bbad040] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/21/2022] [Accepted: 01/18/2023] [Indexed: 02/04/2023] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) data typically contain a large number of missing values, which often results in the loss of critical gene signaling information and seriously limits downstream analysis. Deep learning-based imputation methods can often handle scRNA-seq data better than shallow ones, but most of them do not consider the inherent relations between genes, even though the expression of a gene is often regulated by other genes. Therefore, it is essential to impute scRNA-seq data by taking gene-to-gene relations into account. We propose a novel model (named scGGAN) to impute scRNA-seq data that learns the gene-to-gene relations with Graph Convolutional Networks (GCN) and the global scRNA-seq data distribution with Generative Adversarial Networks (GAN). scGGAN first leverages single-cell and bulk genomics data to explore inherent relations between genes and builds a more compact gene relation network to jointly capture the homogeneous and heterogeneous information. Then, it constructs a GCN-based GAN model that integrates the scRNA-seq data, gene sequence data, and gene relation network for generating scRNA-seq data, and trains the model through adversarial learning. Finally, it utilizes the data generated by the trained GCN-based GAN model to impute the scRNA-seq data. Experiments on simulated and real scRNA-seq datasets show that scGGAN can effectively identify dropout events, recover biologically meaningful expressions, determine subcellular states and types, and improve differential expression analysis and temporal dynamics analysis. Ablation experiments confirm that both the gene relation network and the gene sequence data help the imputation of scRNA-seq data.
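To make the graph component concrete, here is a minimal sketch of one standard GCN propagation step over a gene-relation graph, following H' = ReLU(D^{-1/2}(A+I)D^{-1/2} H W); the graph, feature sizes, and weights are random stand-ins, not scGGAN's architecture.

```python
import torch

def gcn_layer(adj, feats, weight):
    """One GCN step with self-loops and symmetric degree normalization."""
    a_hat = adj + torch.eye(adj.size(0))                 # add self-loops
    d_inv_sqrt = a_hat.sum(1).pow(-0.5)                  # D^{-1/2}
    a_norm = d_inv_sqrt[:, None] * a_hat * d_inv_sqrt[None, :]
    return torch.relu(a_norm @ feats @ weight)

n_genes, in_dim, out_dim = 100, 16, 8
adj = (torch.rand(n_genes, n_genes) > 0.9).float()      # stand-in relations
adj = ((adj + adj.T) > 0).float()                        # make it symmetric
feats = torch.randn(n_genes, in_dim)                     # stand-in expression
weight = torch.randn(in_dim, out_dim)
print(gcn_layer(adj, feats, weight).shape)               # torch.Size([100, 8])
```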
Collapse
Affiliation(s)
- Zimo Huang
- MEng student at School of Software, Shandong University, China
| | - Jun Wang
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, China
| | - Xudong Lu
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, China
| | | | - Guoxian Yu
- School of Software, Shandong University, China
| |
Collapse
|
23
|
Qin J, Gao F, Wang Z, Wong DC, Zhao Z, Relton SD, Fang H. A novel temporal generative adversarial network for electrocardiography anomaly detection. Artif Intell Med 2023; 136:102489. [PMID: 36710067 DOI: 10.1016/j.artmed.2023.102489] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 11/28/2022] [Accepted: 01/09/2023] [Indexed: 01/15/2023]
Abstract
Cardiac abnormality detection from Electrocardiogram (ECG) signals is a common task for cardiologists. To facilitate efficient and objective detection, automated ECG classification methods based on deep learning have been developed in recent years. Despite their impressive performance, these methods perform poorly when presented with cardiac abnormalities that are not well represented, or absent, in the training data. To this end, we propose a novel one-class-classification-based ECG anomaly detection generative adversarial network (GAN). Specifically, we embedded a Bi-directional Long Short-Term Memory (Bi-LSTM) layer into a GAN architecture and used a mini-batch discrimination training strategy in the discriminator to synthesize ECG signals. Our method generates samples that match the data distribution of normal signals from the healthy group so that a generalised anomaly detector can be built reliably. The experimental results demonstrate that our method outperforms several state-of-the-art semi-supervised-learning-based ECG anomaly detection algorithms and robustly detects the unknown anomaly class in the MIT-BIH arrhythmia database. Experiments show that our method achieves an accuracy of 95.5% and an AUC of 95.9%, outperforming the most competitive baseline by 0.7% and 1.7%, respectively. Our method may prove to be a helpful diagnostic tool for cardiologists identifying arrhythmias.
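A hedged sketch of the kind of Bi-LSTM discriminator described above, scoring 1-D ECG windows as real or fake; the layer sizes and the 360 Hz one-second window are hypothetical, and mini-batch discrimination is omitted for brevity.

```python
import torch
import torch.nn as nn

class BiLSTMDiscriminator(nn.Module):
    def __init__(self, hidden=64):
        super().__init__()
        # Bidirectional LSTM reads the ECG window forward and backward
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden,
                            batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, 1)   # real-vs-fake logit

    def forward(self, x):                      # x: (batch, time, 1)
        out, _ = self.lstm(x)
        return self.head(out[:, -1])           # score from the last time step

d = BiLSTMDiscriminator()
ecg = torch.randn(8, 360, 1)                   # 8 stand-in windows at 360 Hz
print(d(ecg).shape)                            # torch.Size([8, 1])
```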
Collapse
Affiliation(s)
- Jing Qin
- College of Software Engineering, Dalian University, Dalian, China.
| | - Fujie Gao
- College of Information Engineering, Dalian University, Dalian, China.
| | - Zumin Wang
- College of Information Engineering, Dalian University, Dalian, China.
| | - David C Wong
- Department of Computer Science and Centre for Health Informatics, University of Manchester, Manchester, UK.
| | - Zhibin Zhao
- School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an, China.
| | - Samuel D Relton
- Leeds Institute of Health Sciences, University of Leeds, Leeds, UK.
| | - Hui Fang
- Department of Computer Science, Loughborough University, Loughborough, UK.
| |
Collapse
|
24
|
Kim H, Kim C, Kim H, Cho S, Hwang E. Panoptic blind image inpainting. ISA Trans 2023; 132:208-221. [PMID: 36372606 DOI: 10.1016/j.isatra.2022.10.030] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 10/24/2022] [Accepted: 10/24/2022] [Indexed: 06/16/2023]
Abstract
In autonomous driving, scene understanding is a critical task for recognizing the driving environment and dangerous situations. Here, a variety of factors, including foreign objects on the lens, cloudy weather, and light blur, often reduce the accuracy of scene recognition. In this paper, we propose a new blind image inpainting model that accurately reconstructs images in a real environment where there is no ground truth for restoration. To this end, we first introduce a panoptic map to represent content information in detail and design an encoder-decoder structure to predict the panoptic map and the corrupted region mask. Then, we construct an image inpainting model that utilizes the information of the predicted map. Lastly, we present a mask refinement process to improve the accuracy of map prediction. To evaluate the effectiveness of the proposed model, we compared the restoration results of various inpainting methods on the Cityscapes and COCO datasets. Experimental results show that the proposed model outperforms other blind image inpainting models in terms of L1/L2 losses, PSNR, and SSIM, and achieves performance similar to other image inpainting techniques that utilize additional information.
Collapse
Affiliation(s)
- Hyungjoon Kim
- School of Computer Science, Semyung University, Jecheon, Republic of Korea.
| | - ChungIl Kim
- Korea Electronics Technology Institute, Seongnam, Republic of Korea.
| | - Hyeonwoo Kim
- School of Electrical Engineering, Korea University, Seoul, Republic of Korea.
| | - Seongkuk Cho
- School of Electrical Engineering, Korea University, Seoul, Republic of Korea.
| | - Eenjun Hwang
- School of Electrical Engineering, Korea University, Seoul, Republic of Korea.
| |
Collapse
|
25
|
Han T, Wu J, Luo W, Wang H, Jin Z, Qu L. Review of Generative Adversarial Networks in mono- and cross-modal biomedical image registration. Front Neuroinform 2022; 16:933230. [PMID: 36483313 PMCID: PMC9724825 DOI: 10.3389/fninf.2022.933230] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 10/13/2022] [Indexed: 09/19/2023] Open
Abstract
Biomedical image registration refers to aligning corresponding anatomical structures among different images, which is critical to many tasks, such as brain atlas building, tumor growth monitoring, and image fusion-based medical diagnosis. However, high-throughput biomedical image registration remains challenging due to inherent variations in the intensity, texture, and anatomy resulting from different imaging modalities, different sample preparation methods, or different developmental stages of the imaged subject. Recently, Generative Adversarial Networks (GAN) have attracted increasing interest in both mono- and cross-modal biomedical image registrations due to their special ability to eliminate the modal variance and their adversarial training strategy. This paper provides a comprehensive survey of the GAN-based mono- and cross-modal biomedical image registration methods. According to the different implementation strategies, we organize the GAN-based mono- and cross-modal biomedical image registration methods into four categories: modality translation, symmetric learning, adversarial strategies, and joint training. The key concepts, the main contributions, and the advantages and disadvantages of the different strategies are summarized and discussed. Finally, we analyze the statistics of all the cited works from different points of view and reveal future trends for GAN-based biomedical image registration studies.
Collapse
Affiliation(s)
- Tingting Han
- Ministry of Education Key Laboratory of Intelligent Computing and Signal Processing, Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, China
| | - Jun Wu
- Ministry of Education Key Laboratory of Intelligent Computing and Signal Processing, Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, China
| | - Wenting Luo
- Ministry of Education Key Laboratory of Intelligent Computing and Signal Processing, Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, China
| | - Huiming Wang
- Ministry of Education Key Laboratory of Intelligent Computing and Signal Processing, Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, China
| | - Zhe Jin
- School of Artificial Intelligence, Anhui University, Hefei, China
| | - Lei Qu
- Ministry of Education Key Laboratory of Intelligent Computing and Signal Processing, Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, China
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China
- SEU-ALLEN Joint Center, Institute for Brain and Intelligence, Southeast University, Nanjing, China
| |
Collapse
|
26
|
Pattanaik A, Balabantaray RC. Enhancement of license plate recognition performance using Xception with Mish activation function. Multimed Tools Appl 2022; 82:16793-16815. [PMID: 36258895 PMCID: PMC9560886 DOI: 10.1007/s11042-022-13922-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 04/12/2022] [Accepted: 09/12/2022] [Indexed: 06/16/2023]
Abstract
The current breakthroughs in the highway research sector have resulted in greater awareness of and focus on the construction of an effective Intelligent Transportation System (ITS). One of the most actively researched areas is Vehicle License Plate Recognition (VLPR), concerned with determining the characters contained in a vehicle's License Plate (LP). Many existing methods have been used to deal with different environmental complexity factors but offer limited support for motion deblurring. The aim of our research is to provide an effective and robust solution for recognizing the characters in license plates under complex environmental conditions. Our proposed approach is capable of handling not only motion-blurred LPs but also characters in various types of low-resolution and blurred license plates, illegible vehicle plates, license plates in different weather, light, and traffic conditions, as well as on high-speed vehicles. Our research provides a series of approaches to execute the different steps of the character recognition process. The proposed approach introduces a Generative Adversarial Network (GAN) with a Discrete Cosine Transform (DCT) discriminator (DCTGAN), a joint image super-resolution and deblurring approach that uses the discrete cosine transform, with low computational complexity, to remove various types of blur and complexities from license plates. License plates are detected using the Improved Bernsen Algorithm (IBA) with Connected Component Analysis (CCA). Finally, with the aid of the proposed Xception model with transfer learning, the characters in LPs are recognized. No segmentation technique is used to split the characters. Four benchmark datasets, namely the Stanford Cars, FZU Cars, HumAIn 2019 Challenge, and Application-Oriented License Plate (AOLP) datasets, as well as our own collected dataset, were used for validation of the proposed algorithm. Our dataset includes images of vehicles captured in different lighting and weather conditions such as sunny, rainy, cloudy, blurred, low illumination, foggy, and night. The suggested strategy outperforms current best practices both quantitatively and qualitatively.
Collapse
Affiliation(s)
- Anmol Pattanaik
- International Institute of Information Technology Bhubaneswar, Odisha, India
| | | |
Collapse
|
27
|
Abstract
A continuing outbreak of the pneumonia-related disease caused by the novel coronavirus has been recorded worldwide and has become a global health problem. This research aims to generate a constructive training data set for a neural network to detect COVID-19 from X-ray images. The generation of medical images is a challenging issue in the field of deep learning. Medical image datasets are frequently unbalanced; using such datasets to train a deep neural network model to correctly classify medical conditions typically leads to over-fitting on majority class samples. Data augmentation is commonly used on training data to expand the dataset, but it may not be beneficial in medical domains with limited data. This paper proposes a data generation model using a Deep Convolutional Generative Adversarial Network (DCGAN), which generates fake instances with properties comparable to the original data. The model achieved a Fréchet Inception Distance (FID) of 23.78, indicating that the generated images are close to the original data. Deep transfer learning-based models VGG-16, InceptionV3, and MobileNet were chosen as the backbone for COVID-19 detection. The present study aims to increase the dataset using the DCGAN data augmentation technique to improve classifier performance.
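As a hedged reference for the quality metric quoted above, the sketch below computes the standard FID between two sets of Inception activations (here random stand-ins): the Fréchet distance between Gaussians fitted to each set.

```python
import numpy as np
from scipy.linalg import sqrtm

def fid(act_real, act_fake):
    """FID = ||mu1 - mu2||^2 + Tr(S1 + S2 - 2 * sqrt(S1 @ S2))."""
    mu1, mu2 = act_real.mean(0), act_fake.mean(0)
    s1 = np.cov(act_real, rowvar=False)
    s2 = np.cov(act_fake, rowvar=False)
    covmean = sqrtm(s1 @ s2)
    if np.iscomplexobj(covmean):       # drop tiny imaginary parts from sqrtm
        covmean = covmean.real
    return float(((mu1 - mu2) ** 2).sum() + np.trace(s1 + s2 - 2 * covmean))

rng = np.random.default_rng(0)
# Stand-ins for Inception feature vectors of real and generated X-rays
print(fid(rng.standard_normal((200, 64)), rng.standard_normal((200, 64))))
```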
Collapse
Affiliation(s)
| | - Ravi Subban
- Dept of Computer Science, School of Engineering and Technology, Pondicherry University, India
| | - Nelson Kennedy Babu C
- Computer Science and Engineering, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, India
| |
Collapse
|
28
|
Dunphy K, Fekri MN, Grolinger K, Sadhu A. Data Augmentation for Deep-Learning-Based Multiclass Structural Damage Detection Using Limited Information. Sensors (Basel) 2022; 22:6193. [PMID: 36015955 PMCID: PMC9412832 DOI: 10.3390/s22166193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 08/05/2022] [Accepted: 08/16/2022] [Indexed: 06/15/2023]
Abstract
The deterioration of infrastructure health has become more prevalent on a global scale during the 21st century. Aging infrastructure, as well as structures damaged by natural disasters, has prompted the research community to improve state-of-the-art methodologies for conducting Structural Health Monitoring (SHM). The necessity for efficient SHM arises from the hazards damaged infrastructure imposes, often resulting in structural collapse, economic loss, and human fatalities. Furthermore, day-to-day operations in affected areas are limited until an inspection is performed to assess the level of damage experienced by the structure and determine the required rehabilitation. However, human-based inspections are often labor-intensive, inefficient, subjective, and restricted to accessible site locations, which ultimately limits our ability to collect large amounts of data from inspection sites. Though Deep-Learning (DL) methods have been heavily explored in the past decade to rectify the limitations of traditional methods and automate structural inspection, data scarcity remains prevalent within the field of SHM. The absence of sufficiently large, balanced, and generalized databases to train DL-based models often results in inaccurate and biased damage predictions. Recently, Generative Adversarial Networks (GANs) have received attention from the SHM community as a data augmentation tool by which a training dataset can be expanded to improve damage classification. However, no existing studies within the SHM field investigate the performance of DL-based multiclass damage identification using synthetic data generated from GANs. Therefore, this paper investigates the performance of a convolutional neural network architecture using synthetic images generated from a GAN for multiclass damage detection of concrete surfaces. Through this study, it was determined that the average classification performance of the proposed CNN on hybrid datasets decreased by 10.6% and 7.4% for the validation and testing datasets when compared to the same model trained entirely on real samples. Moreover, each model's performance decreased on average by 1.6% when comparing a singular model trained with real samples and the same model trained with both real and synthetic samples for a given training configuration. The correlation between classification accuracy and the amount and diversity of synthetic data used for data augmentation is quantified, and the effect of using limited data to train existing GAN architectures is investigated. It was observed that the diversity of the samples decreases and correlation increases with the increase in the number of synthetic samples.
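A hedged sketch of one way to assemble the hybrid real-plus-synthetic training sets studied above, with a controllable synthetic share; the arrays, the helper name, and the 30% default are illustrative assumptions, not the paper's protocol.

```python
import numpy as np

def make_hybrid(real_x, real_y, synth_x, synth_y, synth_frac=0.3, seed=0):
    """Return a shuffled training set whose synthetic share is synth_frac."""
    rng = np.random.default_rng(seed)
    # Solve n_synth / (n_real + n_synth) = synth_frac for n_synth
    n_synth = int(len(real_x) * synth_frac / (1.0 - synth_frac))
    idx = rng.choice(len(synth_x), size=min(n_synth, len(synth_x)),
                     replace=False)
    x = np.concatenate([real_x, synth_x[idx]])
    y = np.concatenate([real_y, synth_y[idx]])
    order = rng.permutation(len(x))            # shuffle real and synthetic
    return x[order], y[order]

rng = np.random.default_rng(1)                 # stand-in feature arrays
real_x, real_y = rng.random((700, 8)), rng.integers(0, 4, 700)
synth_x, synth_y = rng.random((300, 8)), rng.integers(0, 4, 300)
x, y = make_hybrid(real_x, real_y, synth_x, synth_y)
print(x.shape, y.shape)                        # (1000, 8) (1000,)
```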
Collapse
Affiliation(s)
- Kyle Dunphy
- Department of Civil and Environmental Engineering, Western University, London, ON N6A 3K7, Canada
| | - Mohammad Navid Fekri
- Department of Electrical and Computer Engineering, Western University, London, ON N6A 3K7, Canada
| | - Katarina Grolinger
- Department of Electrical and Computer Engineering, Western University, London, ON N6A 3K7, Canada
| | - Ayan Sadhu
- Department of Civil and Environmental Engineering, Western University, London, ON N6A 3K7, Canada
| |
Collapse
|
29
|
Xiong YT, Zeng W, Xu L, Guo JX, Liu C, Chen JT, Du XY, Tang W. Virtual reconstruction of midfacial bone defect based on generative adversarial network. Head Face Med 2022; 18:19. [PMID: 35761334 PMCID: PMC9235085 DOI: 10.1186/s13005-022-00325-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 05/19/2022] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND The study aims to evaluate the accuracy of a generative adversarial network (GAN) for reconstructing bony midfacial defects. METHODS According to anatomy, the bony midface was divided into five subunit structural regions, and artificial defects were manually created on the corresponding CT images. The GAN was trained to restore the artificial defects to their previous normal shape and then tested. Clinical defects were reconstructed by the trained GAN, with midspan defects used for qualitative evaluation and unilateral defects used for quantitative evaluation. The cosine similarity and the mean error were used to evaluate the accuracy of reconstruction. The Mann-Whitney U test was used to detect whether reconstruction errors were consistent between artificial and unilateral clinical defects. RESULTS This study included 518 normal CT datasets, with 415 in the training set and 103 in the testing set, and 17 real patient datasets, with 2 midspan defects and 15 unilateral defects. Reconstruction of midspan clinical defects, as assessed by experts, was acceptable. The cosine similarity in the reconstruction of artificial defects and unilateral clinical defects was 0.97 ± 0.01 and 0.96 ± 0.01, respectively (P = 0.695). The mean error in the reconstruction of artificial defects and unilateral clinical defects was 0.59 ± 0.31 mm and 0.48 ± 0.08 mm, respectively (P = 0.09). CONCLUSION GAN-based virtual reconstruction technology reached high accuracy on the testing set, and statistical tests suggest that it can achieve similar results on real patient data. This study preliminarily addresses the problem of bony midfacial defects without a reference.
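For illustration, the sketch below computes the two accuracy measures named above on flattened arrays; note the study's mean error is measured in millimetres, whereas this stand-in version is a simple voxelwise variant on random volumes.

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine of the angle between two flattened volumes."""
    a, b = a.ravel().astype(float), b.ravel().astype(float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def mean_abs_error(a, b):
    """Simple voxelwise mean error between reconstruction and truth."""
    return float(np.mean(np.abs(a.astype(float) - b.astype(float))))

rng = np.random.default_rng(0)
truth = rng.random((64, 64, 64))                      # stand-in CT volume
recon = truth + 0.01 * rng.standard_normal(truth.shape)
print(cosine_similarity(truth, recon), mean_abs_error(truth, recon))
```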
Collapse
Affiliation(s)
- Yu-Tao Xiong
- State Key Laboratory of Oral Diseases and National Clinical Research Centre for Oral Diseases and Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, No.14, 3rd section of Ren Min Nan Road, Chengdu, 610041, China
| | - Wei Zeng
- State Key Laboratory of Oral Diseases and National Clinical Research Centre for Oral Diseases and Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, No.14, 3rd section of Ren Min Nan Road, Chengdu, 610041, China
| | - Lei Xu
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Ji-Xiang Guo
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Chang Liu
- State Key Laboratory of Oral Diseases and National Clinical Research Centre for Oral Diseases and Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, No.14, 3rd section of Ren Min Nan Road, Chengdu, 610041, China
| | - Jun-Tian Chen
- State Key Laboratory of Oral Diseases and National Clinical Research Centre for Oral Diseases and Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, No.14, 3rd section of Ren Min Nan Road, Chengdu, 610041, China
| | - Xin-Ya Du
- Department of Stomatology, the People's Hospital of Longhua, Shenzhen, 518109, China
| | - Wei Tang
- State Key Laboratory of Oral Diseases and National Clinical Research Centre for Oral Diseases and Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, No.14, 3rd section of Ren Min Nan Road, Chengdu, 610041, China.
| |
Collapse
|
30
|
Zhang Y, Wa S, Zhang L, Lv C. Automatic Plant Disease Detection Based on Tranvolution Detection Network With GAN Modules Using Leaf Images. Front Plant Sci 2022; 13:875693. [PMID: 35693164 PMCID: PMC9178295 DOI: 10.3389/fpls.2022.875693] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 04/27/2022] [Indexed: 05/31/2023]
Abstract
The detection of plant disease is of vital importance in practical agricultural production. It monitors plant growth and health and helps ensure that agricultural planting and harvesting proceed successfully. In recent decades, the maturation of computer vision technology has provided more possibilities for implementing plant disease detection. Nonetheless, detecting plant diseases is typically hindered by factors such as variations in illumination and weather when capturing images and the number of leaves or organs containing diseases in one image. Meanwhile, traditional deep learning-based algorithms suffer from multiple deficiencies in this area of research: (1) training models necessitates a significant investment in hardware and a large amount of data; (2) due to their slow inference speed, models are hard to adapt to practical production; (3) models are unable to generalize well enough. Given these impediments, this study proposes a Tranvolution detection network with GAN modules for plant disease detection. First, a generative model was added ahead of the backbone, and GAN models were added to the attention extraction module to construct the GAN modules. Then, the Transformer was modified and incorporated with the CNN, yielding the proposed Tranvolution architecture. Finally, we validated the performance of different combinations of generative models. Experimental outcomes demonstrated that the proposed method achieved 51.7% precision, 48.1% recall, and 50.3% mAP. Furthermore, the SAGAN model was the best in the attention extraction module, while WGAN performed best for image augmentation. Additionally, we deployed the proposed model on the Hbird E203 and devised an intelligent agricultural robot to put the model into practical agricultural use.
Collapse
Affiliation(s)
- Yan Zhang
- College of Information and Electrical Engineering, China Agricultural University, Beijing, China
| | - Shiyun Wa
- College of Information and Electrical Engineering, China Agricultural University, Beijing, China
| | - Longxiang Zhang
- College of Science, China Agricultural University, Beijing, China
| | - Chunli Lv
- College of Information and Electrical Engineering, China Agricultural University, Beijing, China
| |
Collapse
|
31
|
Kaabachi B, Despraz J, Meurers T, Prasser F, Raisaro JL. Generation and Evaluation of Synthetic Data in a University Hospital Setting. Stud Health Technol Inform 2022; 294:141-142. [PMID: 35612040 DOI: 10.3233/shti220420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In this study, we propose a unified evaluation framework for systematically assessing the utility-privacy trade-off of synthetic data generation (SDG) models. These SDG models are adapted to deal with longitudinal or tabular data stemming from electronic health records (EHR) containing both discrete and numeric features. Our evaluation framework considers different data sharing scenarios and attacker models.
Collapse
Affiliation(s)
- Bayrem Kaabachi
- Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland
- Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland
| | - Jérémie Despraz
- Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland
| | - Thierry Meurers
- Berlin Institute of Health @ Charité - Universitätsmedizin Berlin, Germany
| | - Fabian Prasser
- Berlin Institute of Health @ Charité - Universitätsmedizin Berlin, Germany
| | | |
Collapse
|
32
|
Kossen T, Hirzel MA, Madai VI, Boenisch F, Hennemuth A, Hildebrand K, Pokutta S, Sharma K, Hilbert A, Sobesky J, Galinovic I, Khalil AA, Fiebach JB, Frey D. Toward Sharing Brain Images: Differentially Private TOF-MRA Images With Segmentation Labels Using Generative Adversarial Networks. Front Artif Intell 2022; 5:813842. [PMID: 35586223 PMCID: PMC9108458 DOI: 10.3389/frai.2022.813842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 03/31/2022] [Indexed: 12/03/2022] Open
Abstract
Sharing labeled data is crucial to acquire large datasets for various Deep Learning applications. In medical imaging, this is often not feasible due to privacy regulations. Whereas anonymization would be a solution, standard techniques have been shown to be partially reversible. Here, synthetic data using a Generative Adversarial Network (GAN) with differential privacy guarantees could be a solution to ensure the patient's privacy while maintaining the predictive properties of the data. In this study, we implemented a Wasserstein GAN (WGAN) with and without differential privacy guarantees to generate privacy-preserving labeled Time-of-Flight Magnetic Resonance Angiography (TOF-MRA) image patches for brain vessel segmentation. The synthesized image-label pairs were used to train a U-net which was evaluated in terms of the segmentation performance on real patient images from two different datasets. Additionally, the Fréchet Inception Distance (FID) was calculated between the generated images and the real images to assess their similarity. During the evaluation using the U-Net and the FID, we explored the effect of different levels of privacy which was represented by the parameter ϵ. With stricter privacy guarantees, the segmentation performance and the similarity to the real patient images in terms of FID decreased. Our best segmentation model, trained on synthetic and private data, achieved a Dice Similarity Coefficient (DSC) of 0.75 for ϵ = 7.4 compared to 0.84 for ϵ = ∞ in a brain vessel segmentation paradigm (DSC of 0.69 and 0.88 on the second test set, respectively). We identified a threshold of ϵ <5 for which the performance (DSC <0.61) became unstable and not usable. Our synthesized labeled TOF-MRA images with strict privacy guarantees retained predictive properties necessary for segmenting the brain vessels. Although further research is warranted regarding generalizability to other imaging modalities and performance improvement, our results mark an encouraging first step for privacy-preserving data sharing in medical imaging.
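As a hedged aside, the sketch below computes the Dice Similarity Coefficient (DSC) used above to score the vessel segmentations; the binary masks are random stand-ins, not TOF-MRA labels.

```python
import numpy as np

def dice(pred, target, eps=1e-7):
    """DSC = 2 * |pred AND target| / (|pred| + |target|), in [0, 1]."""
    pred, target = pred.astype(bool), target.astype(bool)
    inter = np.logical_and(pred, target).sum()
    return (2.0 * inter + eps) / (pred.sum() + target.sum() + eps)

rng = np.random.default_rng(0)
pred = rng.random((128, 128)) > 0.5      # stand-in predicted vessel mask
target = rng.random((128, 128)) > 0.5    # stand-in ground-truth mask
print(f"DSC = {dice(pred, target):.3f}")
```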
Collapse
Affiliation(s)
- Tabea Kossen
- CLAIM-Charité Lab for AI in Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
- Department of Computer Engineering and Microelectronics, Computer Vision & Remote Sensing, Technical University Berlin, Berlin, Germany
| | - Manuel A. Hirzel
- CLAIM-Charité Lab for AI in Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Vince I. Madai
- CLAIM-Charité Lab for AI in Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charité-Universitätsmedizin Berlin, Berlin, Germany
- Faculty of Computing, Engineering and the Built Environment, School of Computing and Digital Technology, Birmingham City University, Birmingham, United Kingdom
| | | | - Anja Hennemuth
- Department of Computer Engineering and Microelectronics, Computer Vision & Remote Sensing, Technical University Berlin, Berlin, Germany
- Institute for Imaging Science and Computational Modelling in Cardiovascular Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
- Fraunhofer MEVIS, Bremen, Germany
| | - Kristian Hildebrand
- Department VI Computer Science and Media, Berlin University of Applied Sciences and Technology, Berlin, Germany
| | - Sebastian Pokutta
- Department for AI in Society, Science, and Technology, Zuse Institute Berlin, Berlin, Germany
- Institute of Mathematics, Technical University Berlin, Berlin, Germany
| | - Kartikey Sharma
- Department for AI in Society, Science, and Technology, Zuse Institute Berlin, Berlin, Germany
| | - Adam Hilbert
- CLAIM-Charité Lab for AI in Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Jan Sobesky
- Johanna-Etienne-Hospital, Neuss, Germany
- Centre for Stroke Research Berlin, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Ivana Galinovic
- Centre for Stroke Research Berlin, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Ahmed A. Khalil
- Centre for Stroke Research Berlin, Charité Universitätsmedizin Berlin, Berlin, Germany
- Department of Neurology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
- Mind, Brain, Body Institute, Berlin School of Mind and Brain, Humboldt-Universität Berlin, Berlin, Germany
| | - Jochen B. Fiebach
- Centre for Stroke Research Berlin, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Dietmar Frey
- CLAIM-Charité Lab for AI in Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
| |
Collapse
|
33
|
Apostolopoulos ID, Papathanasiou ND, Apostolopoulos DJ, Panayiotakis GS. Applications of Generative Adversarial Networks (GANs) in Positron Emission Tomography (PET) imaging: A review. Eur J Nucl Med Mol Imaging 2022. [PMID: 35451611 DOI: 10.1007/s00259-022-05805-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 04/12/2022] [Indexed: 11/04/2022]
Abstract
PURPOSE This paper reviews recent applications of Generative Adversarial Networks (GANs) in Positron Emission Tomography (PET) imaging. Recent advances in Deep Learning (DL) and GANs catalysed the research of their applications in medical imaging modalities. As a result, several unique GAN topologies have emerged and been assessed in an experimental environment over the last two years. METHODS The present work extensively describes GAN architectures and their applications in PET imaging. The identification of relevant publications was performed via approved publication indexing websites and repositories. Web of Science, Scopus, and Google Scholar were the major sources of information. RESULTS The research identified a hundred articles that address PET imaging applications such as attenuation correction, de-noising, scatter correction, removal of artefacts, image fusion, high-dose image estimation, super-resolution, segmentation, and cross-modality synthesis. These applications are presented and accompanied by the corresponding research works. CONCLUSION GANs are rapidly employed in PET imaging tasks. However, specific limitations must be eliminated to reach their full potential and gain the medical community's trust in everyday clinical practice.
Collapse
|
34
|
Kierdorf J, Weber I, Kicherer A, Zabawa L, Drees L, Roscher R. Behind the Leaves: Estimation of Occluded Grapevine Berries With Conditional Generative Adversarial Networks. Front Artif Intell 2022; 5:830026. [PMID: 35402903 PMCID: PMC8990779 DOI: 10.3389/frai.2022.830026] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 02/28/2022] [Indexed: 11/30/2022] Open
Abstract
The need for accurate yield estimates for viticulture is becoming more important due to increasing competition in the wine market worldwide. One of the most promising methods to estimate the harvest is berry counting, as it can be approached non-destructively, and its process can be automated. In this article, we present a method that addresses the challenge of occluded berries with leaves to obtain a more accurate estimate of the number of berries that will enable a better estimate of the harvest. We use generative adversarial networks, a deep learning-based approach that generates a highly probable scenario behind the leaves exploiting learned patterns from images with non-occluded berries. Our experiments show that the estimate of the number of berries after applying our method is closer to the manually counted reference. In contrast to applying a factor to the berry count, our approach better adapts to local conditions by directly involving the appearance of the visible berries. Furthermore, we show that our approach can identify which areas in the image should be changed by adding new berries without explicitly requiring information about hidden areas.
Collapse
Affiliation(s)
- Jana Kierdorf
- Remote Sensing Group, Institute of Geodesy and Geoinformation, University of Bonn, Bonn, Germany
| | - Immanuel Weber
- Application Center for Machine Learning and Sensor Technology, University of Applied Sciences Koblenz, Koblenz, Germany
| | - Anna Kicherer
- Julius Kühn-Institut (JKI), Federal Research Centre for Cultivated Plants, Institute for Grapevine Breeding Geilweilerhof, Siebeldingen, Germany
| | - Laura Zabawa
- Geodesy Group, Institute of Geodesy and Geoinformation, University of Bonn, Bonn, Germany
| | - Lukas Drees
- Remote Sensing Group, Institute of Geodesy and Geoinformation, University of Bonn, Bonn, Germany
| | - Ribana Roscher
- Remote Sensing Group, Institute of Geodesy and Geoinformation, University of Bonn, Bonn, Germany
| |
Collapse
|
35
|
Ganjdanesh A, Zhang J, Chew EY, Ding Y, Huang H, Chen W. LONGL-Net: temporal correlation structure guided deep learning model to predict longitudinal age-related macular degeneration severity. PNAS Nexus 2022; 1:pgab003. [PMID: 35360552 PMCID: PMC8962776 DOI: 10.1093/pnasnexus/pgab003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 11/15/2021] [Indexed: 01/28/2023]
Abstract
Age-related macular degeneration (AMD) is the principal cause of blindness in developed countries, and the number of people affected is projected to reach 288 million by 2040. Therefore, automated grading and prediction methods can be highly beneficial for recognizing subjects susceptible to late AMD and enabling clinicians to start preventive actions for them. Clinically, AMD severity is quantified from Color Fundus Photographs (CFP) of the retina, and many machine-learning-based methods have been proposed for grading AMD severity. However, few models have been developed to predict the longitudinal progression status, i.e. predicting future late-AMD risk based on the current CFP, which is more clinically interesting. In this paper, we propose a new deep-learning-based classification model (LONGL-Net) that can simultaneously grade the current CFP and predict the longitudinal outcome, i.e. whether the subject will have late AMD at a future time-point. We design a new temporal-correlation-structure-guided Generative Adversarial Network model that learns the interrelations of temporal changes in CFPs at consecutive time-points and provides interpretability for the classifier's decisions by forecasting AMD symptoms in future CFPs. We used about 30,000 CFP images from 4,628 participants in the Age-Related Eye Disease Study. Our classifier showed an average AUC of 0.905 (95% CI: 0.886-0.922) and accuracy of 0.762 (95% CI: 0.733-0.792) on the 3-class problem of simultaneously grading the current time-point's AMD condition and predicting subjects' late-AMD progression at the future time-point. We further validated our model on the UK Biobank dataset, where it showed an average accuracy of 0.905 and sensitivity of 0.797 in grading 300 CFP images.
Collapse
Affiliation(s)
- Alireza Ganjdanesh
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Jipeng Zhang
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Emily Y Chew
- Division of Epidemiology and Clinical Applications, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Ying Ding
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Heng Huang
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Wei Chen
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15213, USA
- Division of Pulmonary Medicine, Department of Pediatrics, UPMC Children's Hospital of Pittsburgh, University of Pittsburgh, Pittsburgh, PA 15219, USA
| |
Collapse
|
36
|
Argilaga A, Zhuang D. Predicting the Non-Deterministic Response of a Micro-Scale Mechanical Model Using Generative Adversarial Networks. Materials (Basel) 2022; 15:965. [PMID: 35160911 PMCID: PMC8838419 DOI: 10.3390/ma15030965] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 01/19/2022] [Accepted: 01/23/2022] [Indexed: 01/27/2023]
Abstract
Recent improvements in micro-scale material descriptions allow increasingly refined multiscale models to be built in geomechanics. This often comes at the expense of computational cost, which can eventually become prohibitive. Among other characteristics, the non-determinism of a micro-scale response makes its replacement by a surrogate particularly challenging. Machine Learning (ML) is a promising technique for substituting physics-based models; nevertheless, existing ML algorithms for the prediction of material response do not integrate non-determinism in the learning process. Is it possible to use the numerical output of the latest micro-scale descriptions to train an ML algorithm that will then provide a response at a much lower computational cost? A series of ML algorithms with different levels of depth and supervision are trained using a data-driven approach. Gaussian Process Regression (GPR), Self-Organizing Maps (SOM), and Generative Adversarial Networks (GANs) are tested, and the latter is retained because of its superior results. A modified GAN with lower network depth showed good performance in the generation of failure probability maps, with good reproduction of the non-deterministic micro-scale response. The trained generator can be incorporated into existing multiscale models, allowing the costly micro-scale computations to be, at least partially, bypassed.
Collapse
Affiliation(s)
- Albert Argilaga
- MOE Key Laboratory of Soft Soils and Geoenvironmental Engineering, Zhejiang University, Hangzhou 310058, China;
| | - Duanyang Zhuang
- MOE Key Laboratory of Soft Soils and Geoenvironmental Engineering, Zhejiang University, Hangzhou 310058, China;
- Center for Hypergravity Experiment and Interdisciplinary Research, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
37
|
Abstract
In the past few years, de novo molecular design has increasingly used generative models from the emergent field of Deep Learning, proposing novel compounds that are likely to possess desired properties or activities. De novo molecular design finds applications in different fields ranging from drug discovery and materials sciences to biotechnology. A panoply of deep generative models, including architectures such as Recurrent Neural Networks, Autoencoders, and Generative Adversarial Networks, can be trained on existing data sets and provide for the generation of novel compounds. Typically, the new compounds follow the same underlying statistical distributions of properties exhibited in the training data set. Additionally, different optimization strategies, including transfer learning, Bayesian optimization, reinforcement learning, and conditional generation, can direct the generation process toward desired aims regarding biological activities, synthesis processes, or chemical features. Given the recent emergence of these technologies and their relevance, this work presents a systematic and critical review of deep generative models and related optimization methods for targeted compound design, and their applications.
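A hedged sketch of the simplest of the architectures named above: a character-level recurrent network that can be trained on SMILES strings and sampled for new ones. The toy vocabulary, layer sizes, and start/end tokens are illustrative assumptions; the untrained model below emits random characters.

```python
import torch
import torch.nn as nn

VOCAB = list("^$CNO()=c1")              # ^ start, $ end; toy SMILES alphabet
stoi = {ch: i for i, ch in enumerate(VOCAB)}

class SmilesRNN(nn.Module):
    def __init__(self, hidden=64):
        super().__init__()
        self.emb = nn.Embedding(len(VOCAB), 16)
        self.rnn = nn.GRU(16, hidden, batch_first=True)
        self.out = nn.Linear(hidden, len(VOCAB))

    def forward(self, x, h=None):       # x: (batch, time) token ids
        z, h = self.rnn(self.emb(x), h)
        return self.out(z), h           # per-step next-character logits

@torch.no_grad()
def sample(model, max_len=40):
    """Autoregressively sample one string, character by character."""
    x, h, out = torch.tensor([[stoi["^"]]]), None, []
    for _ in range(max_len):
        logits, h = model(x, h)
        x = torch.multinomial(logits[:, -1].softmax(-1), 1)
        ch = VOCAB[x.item()]
        if ch == "$":                   # stop at the end token
            break
        out.append(ch)
    return "".join(out)

print(sample(SmilesRNN()))              # untrained: random characters
```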
Collapse
Affiliation(s)
- Tiago Sousa
- Centre of Biological Engineering, Campus Gualtar, University of Minho, 4710-057 Braga, Portugal
| | - João Correia
- Centre of Biological Engineering, Campus Gualtar, University of Minho, 4710-057 Braga, Portugal
| | - Vítor Pereira
- Centre of Biological Engineering, Campus Gualtar, University of Minho, 4710-057 Braga, Portugal
| | - Miguel Rocha
- Centre of Biological Engineering, Campus Gualtar, University of Minho, 4710-057 Braga, Portugal
| |
Collapse
|
38
|
Hou X, Zhang X, Liang H, Shen L, Lai Z, Wan J. GuidedStyle: Attribute knowledge guided style manipulation for semantic face editing. Neural Netw 2022; 145:209-20. [PMID: 34768091 DOI: 10.1016/j.neunet.2021.10.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 09/29/2021] [Accepted: 10/21/2021] [Indexed: 11/21/2022]
Abstract
Although significant progress has been made in synthesizing high-quality and visually realistic face images with unconditional Generative Adversarial Networks (GANs), there is still a lack of control over the generation process needed to achieve semantic face editing. In this paper, we propose a novel learning framework, called GuidedStyle, to achieve semantic face editing on a pretrained StyleGAN by guiding the image generation process with a knowledge network. Furthermore, we allow an attention mechanism in the StyleGAN generator to adaptively select a single layer for style manipulation. As a result, our method is able to perform disentangled and controllable edits along various attributes, including smiling, eyeglasses, gender, mustache, hair color and attractiveness. Both qualitative and quantitative results demonstrate the superiority of our method over competing methods for semantic face editing. Moreover, we show that our model can also be applied to different types of real and artistic face editing, demonstrating strong generalization ability.
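A hedged sketch of the general mechanism behind this kind of editing (not the GuidedStyle code itself): move a StyleGAN latent code along an attribute direction at a single chosen layer, leaving the other layers untouched. The latent code and direction below are random stand-ins.

```python
# Sketch of single-layer style manipulation in a StyleGAN-like latent space.
# The latent code and attribute direction are placeholders, not learned quantities.
import torch

num_layers, latent_dim = 18, 512
w = torch.randn(num_layers, latent_dim)           # stand-in for a real W+ latent code
smile_direction = torch.randn(latent_dim)         # stand-in for a knowledge-guided direction
smile_direction /= smile_direction.norm()

def edit(w, direction, layer, strength):
    """Apply the edit to one layer only, mimicking per-layer style manipulation."""
    w_edit = w.clone()
    w_edit[layer] += strength * direction
    return w_edit

w_smiling = edit(w, smile_direction, layer=4, strength=3.0)
# w_smiling would then be fed to the pretrained StyleGAN synthesis network.
```

Restricting the edit to one adaptively selected layer is what keeps the change disentangled from the other attributes.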
Collapse
|
39
|
Hu L, Zhou DW, Zha YF, Li L, He H, Xu WH, Qian L, Zhang YK, Fu CX, Hu H, Zhao JG. Synthesizing High- b-Value Diffusion-weighted Imaging of the Prostate Using Generative Adversarial Networks. Radiol Artif Intell 2021; 3:e200237. [PMID: 34617025 DOI: 10.1148/ryai.2021200237] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 04/11/2021] [Accepted: 05/18/2021] [Indexed: 11/11/2022]
Abstract
Purpose: To develop and evaluate a diffusion-weighted imaging (DWI) deep learning framework based on the generative adversarial network (GAN) to generate synthetic high-b-value (b = 1500 sec/mm2) DWI (SYNb1500) sets from acquired standard-b-value (b = 800 sec/mm2) DWI (ACQb800) and acquired standard-b-value (b = 1000 sec/mm2) DWI (ACQb1000) sets. Materials and Methods: This retrospective multicenter study included 395 patients who underwent prostate multiparametric MRI. The cohort was split into internal training (96 patients) and external testing (299 patients) datasets. To create SYNb1500 sets from ACQb800 and ACQb1000 sets, a GAN-based deep learning model (M0) was developed by using the internal dataset. M0 was trained and compared with a conventional model based on the cycle GAN (Mcyc), and was further optimized by using denoising and edge-enhancement techniques, yielding an optimized version of M0 (Opt-M0). Synthetic sets were then generated by the M0 and the Opt-M0 (SYNb1500 and Opt-SYNb1500, respectively) from the ACQb800 and ACQb1000 sets of the external testing dataset. For comparison, traditional calculated (b = 1500 sec/mm2) DWI (CALb1500) sets were also obtained. Reader ratings for image quality and prostate cancer detection were performed on the acquired high-b-value (b = 1500 sec/mm2) DWI (ACQb1500), CALb1500, SYNb1500, and Opt-SYNb1500 sets. Wilcoxon signed rank tests were used to compare the readers' scores. A multiple-reader multiple-case receiver operating characteristic curve was used to compare the diagnostic utility of each DWI set. Results: When compared with the Mcyc, the M0 yielded a lower mean squared difference and higher mean scores for the peak signal-to-noise ratio, structural similarity, and feature similarity (P < .001 for all). Opt-SYNb1500 resulted in significantly better image quality (P ≤ .001 for all) and a higher mean area under the curve than ACQb1500 and CALb1500 (P ≤ .042 for all). Conclusion: A deep learning framework based on a GAN is a promising method to synthesize realistic high-b-value DWI sets with good image quality and accuracy in prostate cancer detection. Keywords: Prostate Cancer, Abdomen/GI, Diffusion-weighted Imaging, Deep Learning Framework, High b Value, Generative Adversarial Networks. © RSNA, 2021. Supplemental material is available for this article.
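A minimal sketch of the paired-translation setup such a framework rests on (not the authors' M0): a generator takes the two standard-b-value images as input channels and is trained with an adversarial term plus an L1 term against the acquired high-b-value target. The toy networks, the L1 weight of 100, and the tensor shapes are all assumptions.

```python
# One generator training step of a pix2pix-style paired GAN for b-value synthesis.
import torch
import torch.nn as nn

G = nn.Sequential(nn.Conv2d(2, 32, 3, padding=1), nn.ReLU(),
                  nn.Conv2d(32, 1, 3, padding=1))           # toy generator
D = nn.Sequential(nn.Conv2d(3, 32, 3, stride=2), nn.ReLU(),
                  nn.Conv2d(32, 1, 3))                      # toy conditional discriminator
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
l1, bce = nn.L1Loss(), nn.BCEWithLogitsLoss()

acq_b800  = torch.rand(4, 1, 64, 64)    # placeholder batches standing in for real DWI sets
acq_b1000 = torch.rand(4, 1, 64, 64)
acq_b1500 = torch.rand(4, 1, 64, 64)

inputs = torch.cat([acq_b800, acq_b1000], dim=1)
syn_b1500 = G(inputs)
pred_fake = D(torch.cat([inputs, syn_b1500], dim=1))
# The generator wants D to call its output real; L1 anchors it to the acquired target.
loss_g = bce(pred_fake, torch.ones_like(pred_fake)) + 100.0 * l1(syn_b1500, acq_b1500)
opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```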
Collapse
Affiliation(s)
- Lei Hu
- Department of Diagnostic and Interventional Radiology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, 600 Yi Shan Road, Shanghai 200233, China
| | - Da-Wei Zhou
- State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi'an, China
| | - Yun-Fei Zha
- Department of Radiology, Renmin Hospital, Wuhan University, Wuhan, China
| | - Liang Li
- Department of Radiology, Renmin Hospital, Wuhan University, Wuhan, China
| | - Huan He
- Department of Radiology, Renmin Hospital, Wuhan University, Wuhan, China
| | - Wen-Hao Xu
- Department of Diagnostic and Interventional Radiology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, 600 Yi Shan Road, Shanghai 200233, China
| | - Li Qian
- Department of Radiology, Renmin Hospital, Wuhan University, Wuhan, China
| | - Yi-Kun Zhang
- Department of Radiology, Renmin Hospital, Wuhan University, Wuhan, China
| | - Cai-Xia Fu
- MR Application Development, Siemens Shenzhen MR, Shenzhen, China
| | - Hui Hu
- Department of Radiology, The Affiliated Renmin Hospital of Jiangsu University, Zhenjiang, China
| | - Jun-Gong Zhao
- Department of Diagnostic and Interventional Radiology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, 600 Yi Shan Road, Shanghai 200233, China
| |
Collapse
|
40
|
Liu X, Xing F, Fakhri GE, Woo J. A UNIFIED CONDITIONAL DISENTANGLEMENT FRAMEWORK FOR MULTIMODAL BRAIN MR IMAGE TRANSLATION. Proc IEEE Int Symp Biomed Imaging 2021; 2021. [PMID: 34567419 DOI: 10.1109/isbi48211.2021.9433897] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Multimodal MRI provides complementary and clinically relevant information for probing tissue condition and characterizing various diseases. However, it is often difficult to acquire sufficient modalities from the same subject due to limitations in study plans, while quantitative analysis is still demanded. In this work, we propose a unified conditional disentanglement framework to synthesize any arbitrary modality from an input modality. Our framework hinges on a cycle-constrained conditional adversarial training approach, whereby it can extract a modality-invariant anatomical feature with a modality-agnostic encoder and generate a target modality with a conditioned decoder. We validate our framework on four MRI modalities, including T1-weighted, T1 contrast-enhanced, T2-weighted, and FLAIR MRI, from the BraTS'18 database, showing superior synthesis quality over the comparison methods. In addition, we report results from experiments on a tumor segmentation task carried out with synthesized data.
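A sketch under stated assumptions (this is not the paper's code; the toy layers and modality code are illustrative) of the encoder/decoder split described above: a single encoder maps any input modality to a shared anatomical feature, and the decoder is conditioned on a one-hot code naming the target modality.

```python
# Modality-agnostic encoder plus a decoder conditioned on the target modality.
import torch
import torch.nn as nn

MODALITIES = ["T1", "T1ce", "T2", "FLAIR"]

class Encoder(nn.Module):            # any input modality -> shared anatomical feature
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
                                 nn.Conv2d(16, 16, 3, padding=1))
    def forward(self, x):
        return self.net(x)

class ConditionedDecoder(nn.Module):  # shared feature + target code -> target modality
    def __init__(self, n_mod=len(MODALITIES)):
        super().__init__()
        self.net = nn.Sequential(nn.Conv2d(16 + n_mod, 16, 3, padding=1), nn.ReLU(),
                                 nn.Conv2d(16, 1, 3, padding=1))
    def forward(self, feat, code):
        # Broadcast the one-hot modality code as extra constant feature maps.
        maps = code[:, :, None, None].expand(-1, -1, feat.size(2), feat.size(3))
        return self.net(torch.cat([feat, maps], dim=1))

enc, dec = Encoder(), ConditionedDecoder()
t1 = torch.rand(2, 1, 64, 64)
code_flair = torch.eye(len(MODALITIES))[[3, 3]]      # request FLAIR as the output modality
flair_hat = dec(enc(t1), code_flair)                 # T1 -> synthetic FLAIR
```

The cycle constraint in the paper additionally maps the synthesized image back to the input modality so the shared feature stays anatomy-only.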
Collapse
Affiliation(s)
- Xiaofeng Liu
- Dept. of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Fangxu Xing
- Dept. of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Georges El Fakhri
- Dept. of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Jonghye Woo
- Dept. of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
41
|
Giudice O, Guarnera L, Battiato S. Fighting Deepfakes by Detecting GAN DCT Anomalies. J Imaging 2021; 7:128. [PMID: 34460764 DOI: 10.3390/jimaging7080128] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 07/23/2021] [Accepted: 07/26/2021] [Indexed: 11/16/2022] Open
Abstract
To properly counter the Deepfake phenomenon, new Deepfake detection algorithms need to be designed: the misuse of this formidable A.I. technology has serious consequences for the private life of every person involved. The state of the art abounds with solutions that use deep neural networks to detect fake multimedia content, but unfortunately these algorithms appear to be neither generalizable nor explainable. However, traces left by Generative Adversarial Network (GAN) engines during the creation of Deepfakes can be detected by analyzing ad hoc frequencies. For this reason, in this paper we propose a new pipeline able to detect the so-called GAN Specific Frequencies (GSF), which represent a unique fingerprint of the different generative architectures. Anomalous frequencies were detected by employing the Discrete Cosine Transform (DCT), and the β statistics inferred from the AC coefficient distributions proved to be the key to recognizing GAN-generated data. Robustness tests were also carried out to demonstrate the effectiveness of the technique under different attacks on images, such as JPEG compression, mirroring, rotation, scaling, and the addition of randomly sized rectangles. Experiments demonstrate that the method is innovative, exceeds the state of the art and also gives many insights in terms of explainability.
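A simplified sketch of the underlying idea (not the authors' exact β estimator): compute 8x8 block DCTs over the image and summarize each AC frequency's distribution across blocks; GAN upsampling tends to leave anomalous energy at specific frequencies, so this 64-dimensional summary can serve as a fingerprint for a downstream classifier. The per-frequency standard deviation used here is a deliberate simplification of the paper's statistics.

```python
# Block-DCT frequency fingerprint, a simplified stand-in for the GSF/β analysis.
import numpy as np
from scipy.fft import dctn

def block_dct_stats(img, block=8):
    h, w = (s - s % block for s in img.shape)      # crop to a multiple of the block size
    coeffs = []
    for i in range(0, h, block):
        for j in range(0, w, block):
            coeffs.append(dctn(img[i:i+block, j:j+block], norm="ortho"))
    coeffs = np.stack(coeffs)                      # (n_blocks, 8, 8)
    stats = coeffs.std(axis=0)                     # spread of each frequency across blocks
    stats[0, 0] = 0.0                              # drop the DC term; keep AC statistics only
    return stats.ravel()                           # 64-dim frequency fingerprint

gray = np.random.rand(256, 256)                    # placeholder for a grayscale face image
fingerprint = block_dct_stats(gray)                # feed to any downstream classifier
```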
Collapse
|
42
|
Klasen M, Ahrens D, Eberle J, Steinhage V. Image-based Automated Species Identification: Can Virtual Data Augmentation Overcome Problems of Insufficient Sampling? Syst Biol 2021; 71:320-333. [PMID: 34143222 DOI: 10.1093/sysbio/syab048] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 06/10/2021] [Accepted: 06/16/2021] [Indexed: 11/13/2022] Open
Abstract
Automated species identification and delimitation is challenging, particularly in rare and thus often scarcely sampled species, which do not allow sufficient discrimination of infraspecific versus interspecific variation. Typical problems arising from either low or exaggerated interspecific morphological differentiation are best met by automated methods of machine learning that learn efficient and effective species identification from training samples. However, limited infraspecific sampling remains a key challenge in machine learning as well. In this study, we assessed whether a data augmentation approach may help to overcome the problem of scarce training data in automated visual species identification. The stepwise augmentation of data comprised image rotation as well as visual and virtual augmentation. The visual data augmentation applies classic approaches of data augmentation and the generation of artificial images using a Generative Adversarial Network (GAN) approach. Descriptive feature vectors are derived from bottleneck features of a VGG-16 convolutional neural network (CNN) and are then stepwise reduced in dimensionality using Global Average Pooling and PCA to prevent overfitting. Finally, the data augmentation employs synthetic additional sampling in feature space by an oversampling algorithm in vector space (SMOTE). Applied to four different image datasets, which include scarab beetle genitalia (Pleophylla, Schizonycha) as well as wing patterns of bees (Osmia) and cattleheart butterflies (Parides), our augmentation approach outperformed, in terms of identification accuracy, both a deep learning baseline trained on non-augmented data and a traditional 2D morphometric approach (Procrustes analysis of scarab beetle genitalia).
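The feature-space stage of such a pipeline is easy to sketch; the following uses random placeholders for the CNN bottleneck features and standard library implementations of PCA and SMOTE (this is a generic illustration, not the study's exact configuration).

```python
# Dimensionality reduction followed by SMOTE oversampling in feature space.
import numpy as np
from sklearn.decomposition import PCA
from imblearn.over_sampling import SMOTE   # assumes the imbalanced-learn package is installed

rng = np.random.default_rng(0)
X = rng.normal(size=(60, 512))                        # stand-in for VGG-16 bottleneck features
y = np.array([0] * 50 + [1] * 10)                     # a rare species with few samples

X_red = PCA(n_components=20).fit_transform(X)         # reduce dimensionality to limit overfitting
X_aug, y_aug = SMOTE(k_neighbors=5, random_state=0).fit_resample(X_red, y)
print(X_aug.shape, np.bincount(y_aug))                # minority class synthetically balanced
```

SMOTE interpolates between nearest neighbors of the minority class in the reduced space, which is what "synthetic additional sampling in feature space" amounts to.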
Collapse
Affiliation(s)
- Morris Klasen
- Department of Computer Science IV, University of Bonn, Endenicher Allee 19A, 53115 Bonn, Germany
| | - Dirk Ahrens
- Zoologisches Forschungsmuseum Alexander Koenig, Adenauerallee 160, 53113 Bonn, Germany
| | - Jonas Eberle
- Zoologisches Forschungsmuseum Alexander Koenig, Adenauerallee 160, 53113 Bonn, Germany; Paris-Lodron-Universität, Zoologische Evolutionsbiologie, Hellbrunner Straße 34, 5020 Salzburg, Austria
| | - Volker Steinhage
- Department of Computer Science IV, University of Bonn, Endenicher Allee 19A, 53115 Bonn, Germany
| |
Collapse
|
43
|
Marzullo A, Moccia S, Catellani M, Calimeri F, Momi ED. Towards realistic laparoscopic image generation using image-domain translation. Comput Methods Programs Biomed 2021; 200:105834. [PMID: 33229016 DOI: 10.1016/j.cmpb.2020.105834] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Accepted: 11/05/2020] [Indexed: 06/11/2023]
Abstract
Background and Objectives: Over the last decade, Deep Learning (DL) has revolutionized data analysis in many areas, including medical imaging. However, there is a bottleneck in the advancement of DL in the surgical field, namely a shortage of large-scale data, which in turn may be attributed to the lack of a structured and standardized methodology for storing and analyzing surgical images in clinical centres. Furthermore, accurate manual annotations are expensive and time-consuming. The synthesis of artificial images can be of great help here; in this context, in recent years, the use of Generative Adversarial Networks (GANs) has achieved promising results in producing photo-realistic images. Methods: In this study, a method for Minimally Invasive Surgery (MIS) image synthesis is proposed. To this aim, the generative adversarial network pix2pix is trained to generate paired annotated MIS images by transforming rough segmentations of surgical instruments and tissues into realistic images. An additional regularization term was added to the original optimization problem in order to enhance the realism of surgical tools with respect to the background. Results: Quantitative and qualitative (i.e., human-based) evaluations of the generated images were carried out in order to assess the effectiveness of the method. Conclusions: Experimental results show that the proposed method is able to translate MIS segmentations into realistic MIS images, which can in turn be used to augment existing data sets and help overcome the lack of useful images; this allows physicians and algorithms to take advantage of new annotated instances for their training.
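A hedged sketch of one way such a regularization term could look (the weighting scheme below is an illustrative assumption, not the paper's exact formulation): weight the reconstruction loss more heavily inside the instrument mask so tools render more faithfully than the background.

```python
# Instrument-weighted L1 term added on top of the usual pix2pix objective.
import torch

def weighted_l1(fake, real, tool_mask, tool_weight=5.0):
    # tool_mask is 1 on surgical-instrument pixels, 0 elsewhere; weight is assumed.
    weights = 1.0 + (tool_weight - 1.0) * tool_mask
    return (weights * (fake - real).abs()).mean()

fake = torch.rand(2, 3, 64, 64)                       # generator output (placeholder)
real = torch.rand(2, 3, 64, 64)                       # real laparoscopic frame (placeholder)
mask = (torch.rand(2, 1, 64, 64) > 0.8).float()       # rough instrument segmentation
loss = weighted_l1(fake, real, mask)   # added to the adversarial + L1 pix2pix losses
```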
Collapse
Affiliation(s)
- Aldo Marzullo
- Department of Mathematics and Computer Science, University of Calabria, Rende, Italy.
| | - Sara Moccia
- Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy; Department of Advanced Robotics, Istituto Italiano di Tecnologia, Genoa, Italy
| | - Michele Catellani
- Department of Urology, European Institute of Oncology, IRCCS, Milan, Italy
| | - Francesco Calimeri
- Department of Mathematics and Computer Science, University of Calabria, Rende, Italy
| | - Elena De Momi
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
| |
Collapse
|
44
|
Kan CNE, Gilat-Schmidt T, Ye DH. Enhancing Reproductive Organ Segmentation in Pediatric CT via Adversarial Learning. Proc SPIE Int Soc Opt Eng 2021; 11596:1159612. [PMID: 33994628 PMCID: PMC8122493 DOI: 10.1117/12.2582127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Accurately segmenting organs in abdominal computed tomography (CT) scans is crucial for clinical applications such as pre-operative planning and dose estimation. With the recent advent of deep learning algorithms, many robust frameworks have been proposed for organ segmentation in abdominal CT images. However, many of these frameworks require large amounts of training data in order to achieve high segmentation accuracy. Pediatric abdominal CT images containing reproductive organs are particularly hard to obtain since these organs are extremely sensitive to ionizing radiation. Hence, it is extremely challenging to train automatic segmentation algorithms on organs such as the uterus and the prostate. To address these issues, we propose a novel segmentation network with a built-in auxiliary classifier generative adversarial network (ACGAN) that conditionally generates additional features during training. The proposed CFG-SegNet (conditional feature generation segmentation network) is trained on a single loss function which combines adversarial loss, reconstruction loss, auxiliary classifier loss and segmentation loss. 2.5D segmentation experiments are performed on a custom data set containing 24 female CT volumes containing the uterus and 40 male CT volumes containing the prostate. CFG-SegNet achieves an average segmentation accuracy of 0.929 DSC (Dice Similarity Coefficient) on the prostate and 0.724 DSC on the uterus with 4-fold cross validation. The results show that our network is high-performing and has the potential to precisely segment difficult organs with few available training images.
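The single combined objective described above is straightforward to compose; the sketch below uses placeholder tensors, and the equal weighting of the four terms is an assumption rather than the published configuration.

```python
# Combined adversarial + reconstruction + auxiliary-classifier + segmentation loss.
import torch
import torch.nn.functional as F

def dice_loss(pred, target, eps=1e-6):
    inter = (pred * target).sum()
    return 1 - (2 * inter + eps) / (pred.sum() + target.sum() + eps)

seg_pred   = torch.rand(2, 1, 64, 64)           # sigmoid output of the segmenter (placeholder)
seg_target = (torch.rand(2, 1, 64, 64) > 0.5).float()
d_fake     = torch.randn(2, 1)                  # discriminator logits on generated features
cls_logits = torch.randn(2, 2)                  # auxiliary classifier logits (e.g., organ type)
cls_target = torch.tensor([0, 1])
recon, real = torch.rand(2, 1, 64, 64), torch.rand(2, 1, 64, 64)

loss = (F.binary_cross_entropy_with_logits(d_fake, torch.ones_like(d_fake))  # adversarial
        + F.l1_loss(recon, real)                                             # reconstruction
        + F.cross_entropy(cls_logits, cls_target)                            # auxiliary classifier
        + dice_loss(seg_pred, seg_target))                                   # segmentation
```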
Collapse
Affiliation(s)
- Chi Nok Enoch Kan
- Department of Electrical and Computer Engineering, Marquette University, Milwaukee, USA
| | - Taly Gilat-Schmidt
- Department of Electrical and Computer Engineering, Marquette University, Milwaukee, USA
| | - Dong Hye Ye
- Department of Electrical and Computer Engineering, Marquette University, Milwaukee, USA
| |
Collapse
|
45
|
Kahembwe E, Ramamoorthy S. Lower dimensional kernels for video discriminators. Neural Netw 2020; 132:506-20. [PMID: 33039788 DOI: 10.1016/j.neunet.2020.09.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 09/13/2020] [Accepted: 09/14/2020] [Indexed: 11/22/2022]
Abstract
This work presents an analysis of the discriminators used in Generative Adversarial Networks (GANs) for video. We show that unconstrained video discriminator architectures induce a loss surface with high curvature, which makes optimization difficult. We also show that this curvature becomes more extreme as the maximal kernel dimension of video discriminators increases. With these observations in hand, we propose a methodology for the design of a family of efficient Lower-Dimensional Video Discriminators for GANs (LDVD-GANs). The proposed methodology improves the performance and efficiency of the video GAN models it is applied to and demonstrates good performance on complex and diverse datasets such as UCF-101. In particular, we show that LDVDs can double the performance of Temporal-GANs and provide state-of-the-art performance on a single GPU.
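An illustrative sketch of the kernel-dimension idea (not the paper's exact architecture): replace a full 3D kernel with lower-dimensional kernels, here a 2D spatial kernel followed by a 1D temporal one, so no single kernel spans all three spatio-temporal axes at once.

```python
# Full 3D kernel versus a factorization into lower-dimensional kernels.
import torch
import torch.nn as nn

full_3d = nn.Conv3d(3, 64, kernel_size=(3, 3, 3), padding=1)           # 3D kernel
low_dim = nn.Sequential(
    nn.Conv3d(3, 64, kernel_size=(1, 3, 3), padding=(0, 1, 1)),        # 2D spatial kernel
    nn.Conv3d(64, 64, kernel_size=(3, 1, 1), padding=(1, 0, 0)),       # 1D temporal kernel
)

video = torch.rand(1, 3, 16, 64, 64)       # (batch, channels, frames, height, width)
assert full_3d(video).shape == low_dim(video).shape    # same output shape, lower-dimensional kernels
```

Per the analysis above, the lower maximal kernel dimension is what smooths the discriminator's loss surface and eases optimization.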
Collapse
|
46
|
Nishimura Y, Nakamura Y, Ishiguro H. Human interaction behavior modeling using Generative Adversarial Networks. Neural Netw 2020; 132:521-31. [PMID: 33039789 DOI: 10.1016/j.neunet.2020.09.019] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 09/01/2020] [Accepted: 09/25/2020] [Indexed: 11/22/2022]
Abstract
Recently, considerable research has focused on personal assistant robots, and robots capable of rich human-like communication are expected. Among humans, non-verbal elements contribute to effective and dynamic communication. However, people use a wide range of diverse gestures, and a robot capable of expressing various human gestures has not been realized. In this study, we address human behavior modeling during interaction using a deep generative model. In the proposed method, to consider interaction motion, three factors, i.e., interaction intensity, time evolution, and time resolution, are embedded in the network structure. Subjective evaluation results suggest that the proposed method can generate high-quality human motions.
Collapse
|
47
|
Wen J, Shi Y, Zhou X, Xue Y. Crop Disease Classification on Inadequate Low-Resolution Target Images. Sensors (Basel) 2020; 20:E4601. [PMID: 32824352 DOI: 10.3390/s20164601] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 08/13/2020] [Accepted: 08/13/2020] [Indexed: 12/01/2022]
Abstract
Currently, various agricultural image classification tasks are carried out on high-resolution images. However, in some cases enough high-resolution images cannot be obtained for classification, which significantly affects classification performance. In this paper, we design a crop disease classification network based on Enhanced Super-Resolution Generative Adversarial Networks (ESRGAN) for the case where only an insufficient number of low-resolution target images are available. First, ESRGAN is used to recover super-resolution crop images from low-resolution images. Transfer learning is applied in model training to compensate for the lack of training samples. Then, we test the performance of the generated super-resolution images on the crop disease classification task. Extensive experiments show that using the fine-tuned ESRGAN model can recover realistic crop information and improve the accuracy of crop disease classification, compared with four other image super-resolution methods.
Collapse
|
48
|
Zhao M, Liu X, Liu H, Wong KKL. Super-resolution of cardiac magnetic resonance images using Laplacian Pyramid based on Generative Adversarial Networks. Comput Med Imaging Graph 2020; 80:101698. [PMID: 31935666 DOI: 10.1016/j.compmedimag.2020.101698] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 12/28/2019] [Accepted: 01/02/2020] [Indexed: 11/24/2022]
Abstract
BACKGROUND AND OBJECTIVE: Cardiac magnetic resonance imaging (MRI) can assist in both functional and structural analysis of the heart, but due to hardware and physical limitations, high-resolution MRI scanning is time-consuming and the peak signal-to-noise ratio (PSNR) is low. Existing super-resolution methods attempt to resolve this issue, but shortcomings remain, such as hallucinated details after super-resolution and low precision after reconstruction. To address these problems, we propose the Laplacian Pyramid Generative Adversarial Network (LSRGAN) in order to generate visually better cardiac images and thereby aid physician diagnosis and treatment. METHODS AND RESULTS: To address the problem of low image resolution, we used the Laplacian Pyramid to analyze the high-frequency detail features of super-resolution (SR) reconstructions of images at different pixel sizes. To eliminate gradient disappearance, we implemented a least squares loss function for the discriminator, and we introduce the residual-dense block (RDB) as the basic network building unit to generate higher-quality images. The experimental results show that the LSRGAN can effectively avoid illusory details after super-resolution and has the best reconstruction quality. Compared with state-of-the-art methods, our proposed algorithm generates higher-quality super-resolution images with higher peak signal-to-noise ratio and structural similarity (SSIM) scores. CONCLUSION: We implemented a novel LSRGAN network model, which reduces insufficient resolution and hallucinated details in MRI super-resolution. Our research presents a superior super-resolution method for medical experts to diagnose and treat myocardial ischemia and myocardial infarction.
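A minimal sketch of the two named ingredients, with illustrative layer sizes rather than the paper's: a residual-dense block (dense connections plus local residual learning) and a least-squares discriminator loss.

```python
# Residual-dense block and least-squares GAN discriminator loss (illustrative sizes).
import torch
import torch.nn as nn

class ResidualDenseBlock(nn.Module):
    def __init__(self, ch=32, growth=16, n_layers=3):
        super().__init__()
        self.convs = nn.ModuleList(
            nn.Conv2d(ch + i * growth, growth, 3, padding=1) for i in range(n_layers))
        self.fuse = nn.Conv2d(ch + n_layers * growth, ch, 1)   # local feature fusion

    def forward(self, x):
        feats = [x]
        for conv in self.convs:
            feats.append(torch.relu(conv(torch.cat(feats, dim=1))))  # dense connections
        return x + self.fuse(torch.cat(feats, dim=1))                # local residual learning

def lsgan_d_loss(d_real, d_fake):
    # Least-squares loss penalizes distance to the 1/0 targets, easing vanishing gradients.
    return 0.5 * ((d_real - 1) ** 2).mean() + 0.5 * (d_fake ** 2).mean()

x = torch.rand(1, 32, 64, 64)
print(ResidualDenseBlock()(x).shape)   # shape-preserving building block for the generator
```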
Collapse
Affiliation(s)
- Ming Zhao
- School of Computer Science and Engineering, Central South University, Changsha, 410000, China
| | - Xinhong Liu
- School of Computer Science and Engineering, Central South University, Changsha, 410000, China
| | - Hui Liu
- Computer Science Department, Missouri State University, Springfield, 62701, United States
| | - Kelvin K L Wong
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
| |
Collapse
|
49
|
Ali T, Jan S, Alkhodre A, Nauman M, Amin M, Siddiqui MS. DeepMoney: counterfeit money detection using generative adversarial networks. PeerJ Comput Sci 2019; 5:e216. [PMID: 33816869 PMCID: PMC7924467 DOI: 10.7717/peerj-cs.216] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Accepted: 07/16/2019] [Indexed: 06/12/2023]
Abstract
Conventional paper currency and modern electronic currency are two important modes of transaction. In several parts of the world, the conventional methodology has clear precedence over its electronic counterpart. However, the identification of forged currency notes is becoming an increasingly crucial problem because of the new and improved tactics employed by counterfeiters. In this paper, a machine-assisted system, dubbed DeepMoney, is proposed to discriminate fake notes from genuine ones. For this purpose, state-of-the-art machine learning models called Generative Adversarial Networks (GANs) are employed. GANs use unsupervised learning to train a model that can then be used to perform supervised predictions. This flexibility provides the best of both worlds by allowing unlabelled data to be trained on whilst still making concrete predictions. The technique was applied to Pakistani banknotes, and state-of-the-art image processing and feature recognition techniques were used to design the overall approach for validating input. Augmented samples of images were used in the experiments, which show that a high-precision machine can be developed to recognize genuine paper money. An accuracy of 80% has been achieved. The code is available as open source to allow others to reproduce and build upon the efforts already made.
Collapse
Affiliation(s)
- Toqeer Ali
- Faculty of Computer and Information Systems, Islamic University of Madinah, Madinah, Saudi Arabia
| | - Salman Jan
- Malaysian Institute of Information Technology, University Kuala Lumpur, Kuala Lumpur, Malaysia
| | - Ahmad Alkhodre
- Faculty of Computer and Information Systems, Islamic University of Madinah, Madinah, Saudi Arabia
| | | | | | - Muhammad Shoaib Siddiqui
- Faculty of Computer and Information Systems, Islamic University of Madinah, Madinah, Saudi Arabia
| |
Collapse
|
50
|
Zhang S, Wang T, Peng Y, Dong J. A hierarchically trained generative network for robust facial symmetrization. Technol Health Care 2019; 27:217-227. [PMID: 31045541 PMCID: PMC6598010 DOI: 10.3233/thc-199021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Face symmetrization has extensive applications in both medical and academic fields, such as facial disorder diagnosis. The human face possesses an important characteristic known as symmetry. However, in many scenarios perfect symmetry does not exist in human faces, which has given rise to a large number of studies around this topic, for example facial palsy evaluation and facial beauty evaluation based on facial symmetry analysis, among many others. Currently, there is still very limited research dedicated to automatic facial symmetrization. Most existing studies only used their own implementations of facial symmetrization to assist their interdisciplinary academic research. Limitations can thus be noticed in their methods, such as the requirement for manual intervention. Furthermore, most existing methods rely on facial landmark detection algorithms for automatic facial symmetrization. Though the accuracy of these landmark detection algorithms is promising, uncontrolled conditions in facial images can still negatively impact the performance of symmetrical face production. To this end, this paper presents a joint-loss enhanced deep generative network model for automatic facial symmetrization, achieved by a full facial image analysis. The joint loss consists of a pair of adversarial losses and an identity loss. The adversarial losses try to make the generated symmetrical face as realistic as possible, while the identity loss helps to constrain the output to have the same identity as the person in the original input as much as possible. Rather than an end-to-end learning strategy, the proposed model is constructed by a multi-stage training process, which avoids the demand for a large set of symmetrical faces as training data. Experiments are conducted with comparisons against several existing methods based on some of the most popular facial landmark detection algorithms, and competitive results of the proposed method are demonstrated.
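A hedged sketch of how such a joint loss can be composed (not the paper's implementation): a generator-side adversarial term plus an identity term that compares face embeddings of the input and the symmetrized output. The embedding network below is a random stand-in for a real face-recognition model, and the weighting is an assumption.

```python
# Adversarial + identity joint loss with a placeholder identity embedder.
import torch
import torch.nn as nn
import torch.nn.functional as F

embedder = nn.Sequential(nn.Conv2d(3, 16, 3, stride=2), nn.ReLU(),
                         nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                         nn.Linear(16, 128))        # placeholder identity embedder

def joint_loss(d_fake_logits, sym_face, input_face, id_weight=1.0):
    adv = F.binary_cross_entropy_with_logits(d_fake_logits, torch.ones_like(d_fake_logits))
    e_in = F.normalize(embedder(input_face), dim=1)
    e_out = F.normalize(embedder(sym_face), dim=1)
    identity = (1 - (e_in * e_out).sum(dim=1)).mean()   # cosine distance between identities
    return adv + id_weight * identity

fake_logits = torch.randn(2, 1)
sym, orig = torch.rand(2, 3, 64, 64), torch.rand(2, 3, 64, 64)
print(joint_loss(fake_logits, sym, orig))
```

The identity term is what keeps the symmetrized face recognizably the same person while the adversarial term pushes it toward realism.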
Collapse
Affiliation(s)
- Shu Zhang
- Ocean University of China, Qingdao, Shandong, China
| | - Ting Wang
- Shandong University of Science and Technology, Qingdao, Shandong, China
| | - Yanjun Peng
- Shandong University of Science and Technology, Qingdao, Shandong, China
| | - Junyu Dong
- Ocean University of China, Qingdao, Shandong, China
| |
Collapse
|