1
|
Wang L, Meng M, Chen S, Bian Z, Zeng D, Meng D, Ma J. Semi-supervised iterative adaptive network for low-dose CT sinogram recovery. Phys Med Biol 2024; 69:085013. [PMID: 38422540 DOI: 10.1088/1361-6560/ad2ee7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 02/29/2024] [Indexed: 03/02/2024]
Abstract
Background.Concern has been expressed regarding the risk of carcinogenesis from medical computed tomography (CT) radiation. Lowering radiation in CT without appropriate modifications often leads to severe noise-induced artifacts in the images. The utilization of deep learning (DL) techniques has achieved promising reconstruction performance in low-dose CT (LDCT) imaging. However, most DL-based algorithms require the pre-collection of a large set of image pairs (low-dose/standard-dose) and the training of networks in an end-to-end supervised manner. Meanwhile, securing such a large volume of paired, well-registered training data in clinical practice is challenging. Moreover, these algorithms often overlook the potential to utilize the abundant information in a large collection of LDCT-only images/sinograms.Methods.In this paper, we introduce a semi-supervised iterative adaptive network (SIA-Net) for LDCT imaging, utilizing both labeled and unlabeled sinograms in a cohesive network framework, integrating supervised and unsupervised learning processes. Specifically, the supervised process captures critical features (i.e. noise distribution and tissue characteristics) latent in the paired sinograms, while the unsupervised process effectively learns these features in the unlabeled low-dose sinograms, employing a conventional weighted least-squares model with a regularization term. Furthermore, the SIA-Net method is designed to adaptively transfer the learned feature distribution from the supervised to the unsupervised process, thereby obtaining a high-fidelity sinogram through iterative adaptive learning. Finally, high-quality CT images can be reconstructed from the refined sinogram using the filtered back-projection algorithm.Results.Experimental results on two clinical datasets indicate that the proposed SIA-Net method achieves competitive performance in terms of noise reduction and structure preservation in LDCT imaging, when compared to traditional supervised learning methods.
Collapse
Affiliation(s)
- Lei Wang
- School of Future Technology, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, People's Republic of China
| | - Mingqiang Meng
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, People's Republic of China
- Pazhou Lab (Huangpu), Guangdong, People's Republic of China
| | - Shixuan Chen
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, People's Republic of China
| | - Zhaoying Bian
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, People's Republic of China
- Pazhou Lab (Huangpu), Guangdong, People's Republic of China
| | - Dong Zeng
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, People's Republic of China
- Pazhou Lab (Huangpu), Guangdong, People's Republic of China
- Department of Radiology, Zhujiang Hospital, Southern Medical University, Guangdong, People's Republic of China
| | - Deyu Meng
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
| | - Jianhua Ma
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, People's Republic of China
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, People's Republic of China
| |
Collapse
|
2
|
Gao J, Bonzel CL, Hong C, Varghese P, Zakir K, Gronsbell J. Semi-supervised ROC analysis for reliable and streamlined evaluation of phenotyping algorithms. J Am Med Inform Assoc 2024; 31:640-650. [PMID: 38128118 PMCID: PMC10873838 DOI: 10.1093/jamia/ocad226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 09/22/2023] [Accepted: 11/20/2023] [Indexed: 12/23/2023] Open
Abstract
OBJECTIVE High-throughput phenotyping will accelerate the use of electronic health records (EHRs) for translational research. A critical roadblock is the extensive medical supervision required for phenotyping algorithm (PA) estimation and evaluation. To address this challenge, numerous weakly-supervised learning methods have been proposed. However, there is a paucity of methods for reliably evaluating the predictive performance of PAs when a very small proportion of the data is labeled. To fill this gap, we introduce a semi-supervised approach (ssROC) for estimation of the receiver operating characteristic (ROC) parameters of PAs (eg, sensitivity, specificity). MATERIALS AND METHODS ssROC uses a small labeled dataset to nonparametrically impute missing labels. The imputations are then used for ROC parameter estimation to yield more precise estimates of PA performance relative to classical supervised ROC analysis (supROC) using only labeled data. We evaluated ssROC with synthetic, semi-synthetic, and EHR data from Mass General Brigham (MGB). RESULTS ssROC produced ROC parameter estimates with minimal bias and significantly lower variance than supROC in the simulated and semi-synthetic data. For the 5 PAs from MGB, the estimates from ssROC are 30% to 60% less variable than supROC on average. DISCUSSION ssROC enables precise evaluation of PA performance without demanding large volumes of labeled data. ssROC is also easily implementable in open-source R software. CONCLUSION When used in conjunction with weakly-supervised PAs, ssROC facilitates the reliable and streamlined phenotyping necessary for EHR-based research.
Collapse
Affiliation(s)
- Jianhui Gao
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
| | - Clara-Lea Bonzel
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
| | - Chuan Hong
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, United States
| | - Paul Varghese
- Health Informatics, Verily Life Sciences, Cambridge, MA, United States
| | - Karim Zakir
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
| | - Jessica Gronsbell
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
- Department of Family and Community Medicine, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
3
|
Li H, Yu Z, Du F, Song L, Gao Y, Shi F. sscNOVA: a semi-supervised convolutional neural network for predicting functional regulatory variants in autoimmune diseases. Front Immunol 2024; 15:1323072. [PMID: 38380333 PMCID: PMC10876991 DOI: 10.3389/fimmu.2024.1323072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Accepted: 01/15/2024] [Indexed: 02/22/2024] Open
Abstract
Genome-wide association studies (GWAS) have identified thousands of variants in the human genome with autoimmune diseases. However, identifying functional regulatory variants associated with autoimmune diseases remains challenging, largely because of insufficient experimental validation data. We adopt the concept of semi-supervised learning by combining labeled and unlabeled data to develop a deep learning-based algorithm framework, sscNOVA, to predict functional regulatory variants in autoimmune diseases and analyze the functional characteristics of these regulatory variants. Compared to traditional supervised learning methods, our approach leverages more variants' data to explore the relationship between functional regulatory variants and autoimmune diseases. Based on the experimentally curated testing dataset and evaluation metrics, we find that sscNOVA outperforms other state-of-the-art methods. Furthermore, we illustrate that sscNOVA can help to improve the prioritization of functional regulatory variants from lead single-nucleotide polymorphisms and the proxy variants in autoimmune GWAS data.
Collapse
Affiliation(s)
- Haibo Li
- School of Information Engineering, Ningxia University, Yinchuan, China
| | - Zhenhua Yu
- School of Information Engineering, Ningxia University, Yinchuan, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Yinchuan, Ningxia University, Yinchuan, China
| | - Fang Du
- School of Information Engineering, Ningxia University, Yinchuan, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Yinchuan, Ningxia University, Yinchuan, China
| | - Lijuan Song
- School of Information Engineering, Ningxia University, Yinchuan, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Yinchuan, Ningxia University, Yinchuan, China
| | - Yang Gao
- School of Medical Technology, North Minzu University, Yinchuan, China
| | - Fangyuan Shi
- School of Information Engineering, Ningxia University, Yinchuan, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Yinchuan, Ningxia University, Yinchuan, China
| |
Collapse
|
4
|
Ismond KP, Cruz C, Limon-Miro AT, Low G, Prado CM, Spence JC, Raman M, McNeely ML, Tandon P. An open label feasibility study of a nutrition and exercise app-based solution in cirrhosis. Can Liver J 2024; 7:5-15. [PMID: 38505789 PMCID: PMC10946184 DOI: 10.3138/canlivj-2023-0011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 06/10/2023] [Indexed: 03/21/2024]
Abstract
Background Nutrition and exercise are the mainstay of therapy for the prevention and treatment of frailty in cirrhosis. This pilot study assessed feasibility of the online delivery of an app-based semi-supervised nutrition and exercise intervention in this population. Methods The 11-week pilot recruited adults with cirrhosis who owned internet-connected devices. Patients were encouraged to participate in exercise sessions 3× per week including a combination of online group exercise (weekly) and home-based follow-along exercise (biweekly). They also participated in group nutrition classes (five sessions) and one-to-one exercise and nutrition check-ins delivered through the app. Primary outcome measures pertained to program feasibility: recruitment, retention, adherence, and satisfaction. Exploratory measures included physical performance (liver frailty index [LFI], 6-minute walk test [6MWT]), health behaviour domains, and quality of life. Results Twenty three patients completed baseline measures. Of these, 18 (72%) completed end of study measures (mean MELD-Na, 9.2; female, 44.4%). Over 70% of participants fulfilled 75% or more of the feasibility criteria. Satisfaction with the program was high (mean, 89%). Exercise program modifications were required for 17 patients to accommodate health events or abilities. Exploratory evaluation showed improvement in the LFI and the 6MWT by -0.58-units (95% CI: -0.91 to -0.25) and 46.0 m (95% CI: 22.7-69.3) respectively without changes in quality of life or health behaviour domains. Conclusions Outcomes demonstrate feasibility of the app-based delivery of programming with promising exploratory impact on efficacy for physical performance. Findings can guide the design of a large-scale app-based randomized controlled trials in cirrhosis.
Collapse
Affiliation(s)
- Kathleen P Ismond
- Division of Gastroenterology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada
| | - Christofer Cruz
- Division of Gastroenterology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada
| | - Ana Teresa Limon-Miro
- Division of Gastroenterology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada
| | - Gavin Low
- Division of Gastroenterology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada
| | - Carla M Prado
- Department of Agricultural, Food and Nutritional Science, Faculty of Agricultural, Life and Environmental Sciences, University of Alberta, Edmonton, Alberta, Canada
| | - John C Spence
- Faculty of Kinesiology, Sport, and Recreation, University of Alberta, Edmonton, Alberta, Canada
| | - Maitreyi Raman
- Division of Gastroenterology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Margaret L McNeely
- Department of Physical Therapy/ Department of Oncology, Faculty of Rehabilitation Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Puneeta Tandon
- Division of Gastroenterology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
5
|
Baker CM, Gong Y. A Semi-supervised Pipeline for Accurate Neuron Segmentation with Fewer Ground Truth Labels. eNeuro 2024; 11:ENEURO.0352-23.2024. [PMID: 38242690 PMCID: PMC10880440 DOI: 10.1523/eneuro.0352-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 12/21/2023] [Accepted: 01/04/2024] [Indexed: 01/21/2024] Open
Abstract
Recent advancements in two-photon calcium imaging have enabled scientists to record the activity of thousands of neurons with cellular resolution. This scope of data collection is crucial to understanding the next generation of neuroscience questions, but analyzing these large recordings requires automated methods for neuron segmentation. Supervised methods for neuron segmentation achieve state of-the-art accuracy and speed but currently require large amounts of manually generated ground truth training labels. We reduced the required number of training labels by designing a semi-supervised pipeline. Our pipeline used neural network ensembling to generate pseudolabels to train a single shallow U-Net. We tested our method on three publicly available datasets and compared our performance to three widely used segmentation methods. Our method outperformed other methods when trained on a small number of ground truth labels and could achieve state-of-the-art accuracy after training on approximately a quarter of the number of ground truth labels as supervised methods. When trained on many ground truth labels, our pipeline attained higher accuracy than that of state-of-the-art methods. Overall, our work will help researchers accurately process large neural recordings while minimizing the time and effort needed to generate manual labels.
Collapse
Affiliation(s)
- Casey M Baker
- Departments of Biomedical Engineering, Duke University, Durham, North Carolina 27701
| | - Yiyang Gong
- Departments of Biomedical Engineering, Duke University, Durham, North Carolina 27701
- Neurobiology, Duke University, Durham, North Carolina 27701
| |
Collapse
|
6
|
Jiang SB, Sun YW, Xu S, Zhang HX, Wu ZF. Semi-supervised segmentation of metal-artifact contaminated industrial CT images using improved CycleGAN. J Xray Sci Technol 2024; 32:271-283. [PMID: 38217629 DOI: 10.3233/xst-230233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2024]
Abstract
Accurate segmentation of industrial CT images is of great significance in industrial fields such as quality inspection and defect analysis. However, reconstruction of industrial CT images often suffers from typical metal artifacts caused by factors like beam hardening, scattering, statistical noise, and partial volume effects. Traditional segmentation methods are difficult to achieve precise segmentation of CT images mainly due to the presence of these metal artifacts. Furthermore, acquiring paired CT image data required by fully supervised networks proves to be extremely challenging. To address these issues, this paper introduces an improved CycleGAN approach for achieving semi-supervised segmentation of industrial CT images. This method not only eliminates the need for removing metal artifacts and noise, but also enables the direct conversion of metal artifact-contaminated images into segmented images without the requirement of paired data. The average values of quantitative assessment of image segmentation performance can reach 0.96645 for Dice Similarity Coefficient(Dice) and 0.93718 for Intersection over Union(IoU). In comparison to traditional segmentation methods, it presents significant improvements in both quantitative metrics and visual quality, provides valuable insights for further research.
Collapse
Affiliation(s)
- Shi Bo Jiang
- Institute of Nuclear and New Energy Technology, Tsinghua University, BeiJing, China
- Tsinghua University-Beijing Key Laboratory of Nuclear Detection Technology
| | - Yue Wen Sun
- Institute of Nuclear and New Energy Technology, Tsinghua University, BeiJing, China
- Tsinghua University-Beijing Key Laboratory of Nuclear Detection Technology
| | - Shuo Xu
- Institute of Nuclear and New Energy Technology, Tsinghua University, BeiJing, China
- Tsinghua University-Beijing Key Laboratory of Nuclear Detection Technology
| | - Hua Xia Zhang
- Institute of Nuclear and New Energy Technology, Tsinghua University, BeiJing, China
- Tsinghua University-Beijing Key Laboratory of Nuclear Detection Technology
| | - Zhi Fang Wu
- Institute of Nuclear and New Energy Technology, Tsinghua University, BeiJing, China
- Tsinghua University-Beijing Key Laboratory of Nuclear Detection Technology
| |
Collapse
|
7
|
Comajoan Cara M, Mas Montserrat D, Ioannidis AG. PopGenAdapt: Semi-Supervised Domain Adaptation for Genotype-to-Phenotype Prediction in Underrepresented Populations. Pac Symp Biocomput 2024; 29:327-340. [PMID: 38160290 PMCID: PMC10906137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]
Abstract
The lack of diversity in genomic datasets, currently skewed towards individuals of European ancestry, presents a challenge in developing inclusive biomedical models. The scarcity of such data is particularly evident in labeled datasets that include genomic data linked to electronic health records. To address this gap, this paper presents PopGenAdapt, a genotype-to-phenotype prediction model which adopts semi-supervised domain adaptation (SSDA) techniques originally proposed for computer vision. PopGenAdapt is designed to leverage the substantial labeled data available from individuals of European ancestry, as well as the limited labeled and the larger amount of unlabeled data from currently underrepresented populations. The method is evaluated in underrepresented populations from Nigeria, Sri Lanka, and Hawaii for the prediction of several disease outcomes. The results suggest a significant improvement in the performance of genotype-to-phenotype models for these populations over state-of-the-art supervised learning methods, setting SSDA as a promising strategy for creating more inclusive machine learning models in biomedical research.Our code is available at https://github.com/AI-sandbox/PopGenAdapt.
Collapse
Affiliation(s)
- Marçal Comajoan Cara
- Dept. of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA2Dept. of Signal Theory & Communications, Universitat Politècnica de Catalunya, Barcelona, Spain
| | | | | |
Collapse
|
8
|
Pagano L, Thibault G, Bousselham W, Riesterer JL, Song X, Gray JW. Efficient semi-supervised semantic segmentation of electron microscopy cancer images with sparse annotations. Front Bioinform 2023; 3:1308707. [PMID: 38162122 PMCID: PMC10757843 DOI: 10.3389/fbinf.2023.1308707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 11/29/2023] [Indexed: 01/03/2024] Open
Abstract
Electron microscopy (EM) enables imaging at a resolution of nanometers and can shed light on how cancer evolves to develop resistance to therapy. Acquiring these images has become a routine task.However, analyzing them is now a bottleneck, as manual structure identification is very time-consuming and can take up to several months for a single sample. Deep learning approaches offer a suitable solution to speed up the analysis. In this work, we present a study of several state-of-the-art deep learning models for the task of segmenting nuclei and nucleoli in volumes from tumor biopsies. We compared previous results obtained with the ResUNet architecture to the more recent UNet++, FracTALResNet, SenFormer, and CEECNet models. In addition, we explored the utilization of unlabeled images through semi-supervised learning with Cross Pseudo Supervision. We have trained and evaluated all of the models on sparse manual labels from three fully annotated in-house datasets that we have made available on demand, demonstrating improvements in terms of 3D Dice score. From the analysis of these results, we drew conclusions on the relative gains of using more complex models, and semi-supervised learning as well as the next steps for the mitigation of the manual segmentation bottleneck.
Collapse
Affiliation(s)
- Lucas Pagano
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, United States
- Knight Cancer Institute, Oregon Health and Science University, Portland, OR, United States
| | - Guillaume Thibault
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, United States
| | - Walid Bousselham
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, United States
| | - Jessica L. Riesterer
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, United States
- Knight Cancer Institute, Oregon Health and Science University, Portland, OR, United States
| | - Xubo Song
- Knight Cancer Institute, Oregon Health and Science University, Portland, OR, United States
- Department of Medical Informatics and Clinical Epidemiology at Oregon Health and Science University, Portland, OR, United States
| | - Joe W. Gray
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, United States
| |
Collapse
|
9
|
Pagano L, Thibault G, Bousselham W, Riesterer JL, Song X, Gray JW. Efficient semi-supervised semantic segmentation of electron microscopy cancer images with sparse annotations. bioRxiv 2023:2023.10.30.563998. [PMID: 37961180 PMCID: PMC10635003 DOI: 10.1101/2023.10.30.563998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Electron microscopy (EM) enables imaging at nanometer resolution and can shed light on how cancer evolves to develop resistance to therapy. Acquiring these images has become a routine task; however, analyzing them is now the bottleneck, as manual structure identification is very time-consuming and can take up to several months for a single sample. Deep learning approaches offer a suitable solution to speed up the analysis. In this work, we present a study of several state-of-the-art deep learning models for the task of segmenting nuclei and nucleoli in volumes from tumor biopsies. We compared previous results obtained with the ResUNet architecture to the more recent UNet++, FracTALResNet, SenFormer, and CEECNet models. In addition, we explored the utilization of unlabeled images through semi-supervised learning with Cross Pseudo Supervision. We have trained and evaluated all of the models on sparse manual labels from three fully annotated in-house datasets that we have made available on demand, demonstrating improvements in terms of 3D Dice score. From the analysis of these results, we drew conclusions on the relative gains of using more complex models, semi-supervised learning as well as next steps for the mitigation of the manual segmentation bottleneck.
Collapse
Affiliation(s)
- Lucas Pagano
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, USA
- Knight Cancer Institute, Oregon Health and Science University, Portland, OR, USA
| | - Guillaume Thibault
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, USA
| | - Walid Bousselham
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, USA
| | - Jessica L. Riesterer
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, USA
- Knight Cancer Institute, Oregon Health and Science University, Portland, OR, USA
| | - Xubo Song
- Department of Medical Informatics and Clinical Epidemiology at Oregon Health and Science University, Portland, OR USA
- Knight Cancer Institute, Oregon Health and Science University, Portland, OR, USA
| | - Joe W. Gray
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, USA
| |
Collapse
|
10
|
Lin S, Li J, Huang D, Cheng Z, Xiang L, Ye D, Weng H. Early Detection of Rice Blast Using a Semi-Supervised Contrastive Unpaired Translation Iterative Network Based on UAV Images. Plants (Basel) 2023; 12:3675. [PMID: 37960032 PMCID: PMC10647743 DOI: 10.3390/plants12213675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/16/2023] [Accepted: 10/23/2023] [Indexed: 11/15/2023]
Abstract
Rice blast has caused major production losses in rice, and thus the early detection of rice blast plays a crucial role in global food security. In this study, a semi-supervised contrastive unpaired translation iterative network is specifically designed based on unmanned aerial vehicle (UAV) images for rice blast detection. It incorporates multiple critic contrastive unpaired translation networks to generate fake images with different disease levels through an iterative process of data augmentation. These generated fake images, along with real images, are then used to establish a detection network called RiceBlastYolo. Notably, the RiceBlastYolo model integrates an improved fpn and a general soft labeling approach. The results show that the detection precision of RiceBlastYolo is 99.51% under intersection over union (IOU0.5) conditions and the average precision is 98.75% under IOU0.5-0.9 conditions. The precision and recall rates are respectively 98.23% and 99.99%, which are higher than those of common detection models (YOLO, YOLACT, YOLACT++, Mask R-CNN, and Faster R-CNN). Additionally, external data also verified the ability of the model. The findings demonstrate that our proposed model can accurately identify rice blast under field-scale conditions.
Collapse
Affiliation(s)
- Shaodan Lin
- College of Mechanical and Electrical Engineering, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (S.L.); (D.H.); (Z.C.)
- College of Mechanical and Intelligent Manufacturing, Fujian Chuanzheng Communications College, Fuzhou 350007, China
| | - Jiayi Li
- College of Mechanical and Electrical Engineering, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (S.L.); (D.H.); (Z.C.)
- Fujian Key Laboratory of Agricultural Information Sensing Technology, Fuzhou 350002, China
| | - Deyao Huang
- College of Mechanical and Electrical Engineering, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (S.L.); (D.H.); (Z.C.)
- Fujian Key Laboratory of Agricultural Information Sensing Technology, Fuzhou 350002, China
| | - Zuxin Cheng
- College of Mechanical and Electrical Engineering, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (S.L.); (D.H.); (Z.C.)
- College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Lirong Xiang
- Department of Biological and Agricultural Engineering, North Carolina State University, Raleigh, NC 27606, USA;
| | - Dapeng Ye
- College of Mechanical and Electrical Engineering, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (S.L.); (D.H.); (Z.C.)
- Fujian Key Laboratory of Agricultural Information Sensing Technology, Fuzhou 350002, China
- Agricultural Artificial Intelligence Research Center, College of Future Technology, Fujian Agriculture and Forestry University, Fuzhou 350007, China
| | - Haiyong Weng
- College of Mechanical and Electrical Engineering, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (S.L.); (D.H.); (Z.C.)
- Fujian Key Laboratory of Agricultural Information Sensing Technology, Fuzhou 350002, China
- Agricultural Artificial Intelligence Research Center, College of Future Technology, Fujian Agriculture and Forestry University, Fuzhou 350007, China
| |
Collapse
|
11
|
Cara MC, Montserrat DM, Ioannidis AG. PopGenAdapt: Semi-Supervised Domain Adaptation for Genotype-to-Phenotype Prediction in Underrepresented Populations. bioRxiv 2023:2023.10.10.561715. [PMID: 37873492 PMCID: PMC10592760 DOI: 10.1101/2023.10.10.561715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The lack of diversity in genomic datasets, currently skewed towards individuals of European ancestry, presents a challenge in developing inclusive biomedical models. The scarcity of such data is particularly evident in labeled datasets that include genomic data linked to electronic health records. To address this gap, this paper presents PopGenAdapt, a genotype-to-phenotype prediction model which adopts semi-supervised domain adaptation (SSDA) techniques originally proposed for computer vision. PopGenAdapt is designed to leverage the substantial labeled data available from individuals of European ancestry, as well as the limited labeled and the larger amount of unlabeled data from currently underrepresented populations. The method is evaluated in underrepresented populations from Nigeria, Sri Lanka, and Hawaii for the prediction of several disease outcomes. The results suggest a significant improvement in the performance of genotype-to-phenotype models for these populations over state-of-the-art supervised learning methods, setting SSDA as a promising strategy for creating more inclusive machine learning models in biomedical research.
Collapse
Affiliation(s)
- Marçal Comajoan Cara
- Dept. of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
- Dept. of Signal Theory & Communications, Universitat Politècnica de Catalunya, Barcelona, Spain
| | - Daniel Mas Montserrat
- Dept. of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - Alexander G Ioannidis
- Dept. of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
- Dept. of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
- Institute for Computational & Mathematical Engineering, Stanford University, Stanford, CA, USA
| |
Collapse
|
12
|
Lou A, Tawfik K, Yao X, Liu Z, Noble J. Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools Segmentation. IEEE Trans Med Imaging 2023; 42:2832-2841. [PMID: 37037256 PMCID: PMC10597739 DOI: 10.1109/tmi.2023.3266137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
A common problem with segmentation of medical images using neural networks is the difficulty to obtain a significant number of pixel-level annotated data for training. To address this issue, we proposed a semi-supervised segmentation network based on contrastive learning. In contrast to the previous state-of-the-art, we introduce Min-Max Similarity (MMS), a contrastive learning form of dual-view training by employing classifiers and projectors to build all-negative, and positive and negative feature pairs, respectively, to formulate the learning as solving a MMS problem. The all-negative pairs are used to supervise the networks learning from different views and to capture general features, and the consistency of unlabeled predictions is measured by pixel-wise contrastive loss between positive and negative pairs. To quantitatively and qualitatively evaluate our proposed method, we test it on four public endoscopy surgical tool segmentation datasets and one cochlear implant surgery dataset, which we manually annotated. Results indicate that our proposed method consistently outperforms state-of-the-art semi-supervised and fully supervised segmentation algorithms. And our semi-supervised segmentation algorithm can successfully recognize unknown surgical tools and provide good predictions. Also, our MMS approach could achieve inference speeds of about 40 frames per second (fps) and is suitable to deal with the real-time video segmentation.
Collapse
|
13
|
Alsharqi M, Lapidaire W, Iturria-Medina Y, Xiong Z, Williamson W, Mohamed A, Tan CMJ, Kitt J, Burchert H, Fletcher A, Whitworth P, Lewandowski AJ, Leeson P. A machine learning-based score for precise echocardiographic assessment of cardiac remodelling in hypertensive young adults. Eur Heart J Imaging Methods Pract 2023; 1:qyad029. [PMID: 37818310 PMCID: PMC10562347 DOI: 10.1093/ehjimp/qyad029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/15/2023] [Indexed: 10/12/2023]
Abstract
Aims Accurate staging of hypertension-related cardiac changes, before the development of significant left ventricular hypertrophy, could help guide early prevention advice. We evaluated whether a novel semi-supervised machine learning approach could generate a clinically meaningful summary score of cardiac remodelling in hypertension. Methods and results A contrastive trajectories inference approach was applied to data collected from three UK studies of young adults. Low-dimensional variance was identified in 66 echocardiography variables from participants with hypertension (systolic ≥160 mmHg) relative to a normotensive group (systolic < 120 mmHg) using a contrasted principal component analysis. A minimum spanning tree was constructed to derive a normalized score for each individual reflecting extent of cardiac remodelling between zero (health) and one (disease). Model stability and clinical interpretability were evaluated as well as modifiability in response to a 16-week exercise intervention. A total of 411 young adults (29 ± 6 years) were included in the analysis, and, after contrastive dimensionality reduction, 21 variables characterized >80% of data variance. Repeated scores for an individual in cross-validation were stable (root mean squared deviation = 0.1 ± 0.002) with good differentiation of normotensive and hypertensive individuals (area under the receiver operating characteristics 0.98). The derived score followed expected hypertension-related patterns in individual cardiac parameters at baseline and reduced after exercise, proportional to intervention compliance (P = 0.04) and improvement in ventilatory threshold (P = 0.01). Conclusion A quantitative score that summarizes hypertension-related cardiac remodelling in young adults can be generated from a computational model. This score might allow more personalized early prevention advice, but further evaluation of clinical applicability is required.
Collapse
Affiliation(s)
- Maryam Alsharqi
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
- Department of Cardiac Technology, College of Applied Medical Sciences, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
| | - Winok Lapidaire
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Yasser Iturria-Medina
- Neurology and Neurosurgery Department, Montreal Neurological Institute, Montreal, Canada
| | - Zhaohan Xiong
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Wilby Williamson
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Afifah Mohamed
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
- Department of Diagnostic Imaging and Applied Health Sciences, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | - Cheryl M J Tan
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Jamie Kitt
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Holger Burchert
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Andrew Fletcher
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Polly Whitworth
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Adam J Lewandowski
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| | - Paul Leeson
- Oxford Cardiovascular Clinical Research Facility, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX39DU, UK
| |
Collapse
|
14
|
Yang Z, Chen F, Xu B, Ma B, Qu Z, Zhou X. Metric Learning-Guided Semi-Supervised Path-Interaction Fault Diagnosis Method for Extremely Limited Labeled Samples under Variable Working Conditions. Sensors (Basel) 2023; 23:6951. [PMID: 37571734 PMCID: PMC10422390 DOI: 10.3390/s23156951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 08/02/2023] [Accepted: 08/03/2023] [Indexed: 08/13/2023]
Abstract
The lack of labeled data and variable working conditions brings challenges to the application of intelligent fault diagnosis. Given this, extracting labeled information and learning distribution-invariant representation provides a feasible and promising way. Enlightened by metric learning and semi-supervised architecture, a triplet-guided path-interaction ladder network (Tri-CLAN) is proposed based on the aspects of algorithm structure and feature space. An encoder-decoder structure with path interaction is built to utilize the unlabeled data with fewer parameters, and the network structure is simplified by CNN and an element additive combination activation function. Metric learning is introduced to the feature space of the established algorithm structure, which enables the mining of hard samples from extremely limited labeled data and the learning of working condition-independent representations. The generalization and applicability of Tri-CLAN are proved by experiments, and the contribution of the algorithm structure and the metric learning in the feature space are discussed.
Collapse
Affiliation(s)
- Zheng Yang
- School of Mechanical and Aerospace Engineering, Jilin University, Changchun 130025, China;
| | - Fei Chen
- Sino-German College of Intelligent Manufacturing, Shenzhen Technology University, Shenzhen 518118, China; (B.X.); (B.M.); (Z.Q.); (X.Z.)
| | - Binbin Xu
- Sino-German College of Intelligent Manufacturing, Shenzhen Technology University, Shenzhen 518118, China; (B.X.); (B.M.); (Z.Q.); (X.Z.)
| | - Boquan Ma
- Sino-German College of Intelligent Manufacturing, Shenzhen Technology University, Shenzhen 518118, China; (B.X.); (B.M.); (Z.Q.); (X.Z.)
| | - Zege Qu
- Sino-German College of Intelligent Manufacturing, Shenzhen Technology University, Shenzhen 518118, China; (B.X.); (B.M.); (Z.Q.); (X.Z.)
| | - Xin Zhou
- Sino-German College of Intelligent Manufacturing, Shenzhen Technology University, Shenzhen 518118, China; (B.X.); (B.M.); (Z.Q.); (X.Z.)
| |
Collapse
|
15
|
Orescanin M, Smith LN, Sahu S, Goyal P, Chhetri SR. Editorial: Deep learning with limited labeled data for vision, audio, and text. Front Artif Intell 2023; 6:1213419. [PMID: 37384145 PMCID: PMC10295149 DOI: 10.3389/frai.2023.1213419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 05/02/2023] [Indexed: 06/30/2023] Open
Affiliation(s)
- Marko Orescanin
- Computer Science Department, Naval Postgraduate School, Monterey, CA, United States
| | - Leslie N. Smith
- Naval Center for Applied Research in Artificial Intelligence (NCARAI), U.S. Naval Research Laboratory, Washington, DC, United States
| | | | | | | |
Collapse
|
16
|
Fu H, Yu H, Wang X, Lu X, Zhu C. A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features. Brain Sci 2023; 13:brainsci13050725. [PMID: 37239197 DOI: 10.3390/brainsci13050725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 04/23/2023] [Accepted: 04/24/2023] [Indexed: 05/28/2023] Open
Abstract
Human lying is influenced by cognitive neural mechanisms in the brain, and conducting research on lie detection in speech can help to reveal the cognitive mechanisms of the human brain. Inappropriate deception detection features can easily lead to dimension disaster and make the generalization ability of the widely used semi-supervised speech deception detection model worse. Because of this, this paper proposes a semi-supervised speech deception detection algorithm combining acoustic statistical features and time-frequency two-dimensional features. Firstly, a hybrid semi-supervised neural network based on a semi-supervised autoencoder network (AE) and a mean-teacher network is established. Secondly, the static artificial statistical features are input into the semi-supervised AE to extract more robust advanced features, and the three-dimensional (3D) mel-spectrum features are input into the mean-teacher network to obtain features rich in time-frequency two-dimensional information. Finally, a consistency regularization method is introduced after feature fusion, effectively reducing the occurrence of over-fitting and improving the generalization ability of the model. This paper carries out experiments on the self-built corpus for deception detection. The experimental results show that the highest recognition accuracy of the algorithm proposed in this paper is 68.62% which is 1.2% higher than the baseline system and effectively improves the detection accuracy.
Collapse
Affiliation(s)
- Hongliang Fu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Hang Yu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
| | - Xuemei Wang
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Xiangying Lu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
| | - Chunhua Zhu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| |
Collapse
|
17
|
Cui M, Li K, Li Y, Kamuhanda D, Tessone CJ. Semi-Supervised Semantic Segmentation of Remote Sensing Images Based on Dual Cross-Entropy Consistency. Entropy (Basel) 2023; 25:e25040681. [PMID: 37190469 PMCID: PMC10138059 DOI: 10.3390/e25040681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 03/29/2023] [Accepted: 03/30/2023] [Indexed: 05/17/2023]
Abstract
Semantic segmentation is a growing topic in high-resolution remote sensing image processing. The information in remote sensing images is complex, and the effectiveness of most remote sensing image semantic segmentation methods depends on the number of labels; however, labeling images requires significant time and labor costs. To solve these problems, we propose a semi-supervised semantic segmentation method based on dual cross-entropy consistency and a teacher-student structure. First, we add a channel attention mechanism to the encoding network of the teacher model to reduce the predictive entropy of the pseudo label. Secondly, the two student networks share a common coding network to ensure consistent input information entropy, and a sharpening function is used to reduce the information entropy of unsupervised predictions for both student networks. Finally, we complete the alternate training of the models via two entropy-consistent tasks: (1) semi-supervising student prediction results via pseudo-labels generated from the teacher model, (2) cross-supervision between student models. Experimental results on publicly available datasets indicate that the suggested model can fully understand the hidden information in unlabeled images and reduce the information entropy in prediction, as well as reduce the number of required labeled images with guaranteed accuracy. This allows the new method to outperform the related semi-supervised semantic segmentation algorithm at half the proportion of labeled images.
Collapse
Affiliation(s)
- Mengtian Cui
- College of Computer Science and Engineering, Southwest Minzu University, Chengdu 610041, China
| | - Kai Li
- College of Computer Science and Engineering, Southwest Minzu University, Chengdu 610041, China
| | - Yulan Li
- College of Computer Science and Engineering, Southwest Minzu University, Chengdu 610041, China
| | - Dany Kamuhanda
- Department of Science Mathematics and Physical Education, College of Education, University of Rwanda, Kigali P.O. Box 3900, Rwanda
| | - Claudio J Tessone
- Department of Informatics, University of Zurich, Andreasstrasse 15, CH-8050 Zurich, Switzerland
| |
Collapse
|
18
|
Karasmanoglou A, Antonakakis M, Zervakis M. ECG-Based Semi-Supervised Anomaly Detection for Early Detection and Monitoring of Epileptic Seizures. Int J Environ Res Public Health 2023; 20:5000. [PMID: 36981911 PMCID: PMC10049350 DOI: 10.3390/ijerph20065000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 02/17/2023] [Accepted: 03/08/2023] [Indexed: 06/18/2023]
Abstract
Epilepsy is one of the most common brain diseases, characterized by frequent recurrent seizures or "ictal" states. A patient experiences uncontrollable muscular contractions, inducing loss of mobility and balance, which may result in injury or even death during these ictal states. Extensive investigation is vital to establish a systematic approach for predicting and informing patients about oncoming seizures ahead of time. Most methodologies developed are focused on the detection of abnormalities using mostly electroencephalogram (EEG) recordings. In this regard, research has indicated that certain pre-ictal alterations in the Autonomic Nervous System (ANS) can be detected in patient electrocardiogram (ECG) signals. The latter could potentially provide the basis for a robust seizure prediction approach. The recently proposed ECG-based seizure warning systems utilize machine learning models to classify a patient's condition. Such approaches require the incorporation of large, diverse, and thoroughly annotated ECG datasets, limiting their application potential. In this work, we investigate anomaly detection models in a patient-specific context with low supervision requirements. Specifically, we consider One-Class SVM (OCSVM), Minimum Covariance Determinant (MCD) Estimator, and Local Outlier Factor (LOF) models to quantify the novelty or abnormality of pre-ictal short-term (2-3 min) Heart Rate Variability (HRV) features of patients, trained on a reference interval considered to contain stable heart rate as the only form of supervision. Our models are evaluated against labels that were either hand-picked or automatically generated (weak labels) by a two-phase clustering procedure for samples of the "Post-Ictal Heart Rate Oscillations in Partial Epilepsy" (PIHROPE) dataset recorded by the Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, achieving detection in 9 out of 10 cases, with average AUCs of over 93% across all models and warning times ranging from 6 to 30 min prior to seizure. The proposed anomaly detection and monitoring approach can potentially pave the way for early detection and warning of seizure incidents based on body sensor inputs.
Collapse
|
19
|
Huang Y, Rong Z, Zhang L, Xu Z, Ji J, He J, Liu W, Hou Y, Li K. HiRAND: A novel GCN semi-supervised deep learning-based framework for classification and feature selection in drug research and development. Front Oncol 2023; 13:1047556. [PMID: 36776339 PMCID: PMC9909422 DOI: 10.3389/fonc.2023.1047556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 01/03/2023] [Indexed: 01/28/2023] Open
Abstract
The prediction of response to drugs before initiating therapy based on transcriptome data is a major challenge. However, identifying effective drug response label data costs time and resources. Methods available often predict poorly and fail to identify robust biomarkers due to the curse of dimensionality: high dimensionality and low sample size. Therefore, this necessitates the development of predictive models to effectively predict the response to drugs using limited labeled data while being interpretable. In this study, we report a novel Hierarchical Graph Random Neural Networks (HiRAND) framework to predict the drug response using transcriptome data of few labeled data and additional unlabeled data. HiRAND completes the information integration of the gene graph and sample graph by graph convolutional network (GCN). The innovation of our model is leveraging data augmentation strategy to solve the dilemma of limited labeled data and using consistency regularization to optimize the prediction consistency of unlabeled data across different data augmentations. The results showed that HiRAND achieved better performance than competitive methods in various prediction scenarios, including both simulation data and multiple drug response data. We found that the prediction ability of HiRAND in the drug vorinostat showed the best results across all 62 drugs. In addition, HiRAND was interpreted to identify the key genes most important to vorinostat response, highlighting critical roles for ribosomal protein-related genes in the response to histone deacetylase inhibition. Our HiRAND could be utilized as an efficient framework for improving the drug response prediction performance using few labeled data.
Collapse
Affiliation(s)
- Yue Huang
- Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China
| | - Zhiwei Rong
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Liuchao Zhang
- Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China
| | - Zhenyi Xu
- Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China
| | - Jianxin Ji
- Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China
| | - Jia He
- Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China
| | - Weisha Liu
- Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China
| | - Yan Hou
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China,*Correspondence: Kang Li, ; Yan Hou,
| | - Kang Li
- Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China,*Correspondence: Kang Li, ; Yan Hou,
| |
Collapse
|
20
|
Thapa K, Seo Y, Yang SH, Kim K. Semi-Supervised Adversarial Auto-Encoder to Expedite Human Activity Recognition. Sensors (Basel) 2023; 23:683. [PMID: 36679478 PMCID: PMC9863227 DOI: 10.3390/s23020683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 12/26/2022] [Accepted: 12/29/2022] [Indexed: 06/17/2023]
Abstract
The study of human activity recognition concentrates on classifying human activities and the inference of human behavior using modern sensing technology. However, the issue of domain adaptation for inertial sensing-based human activity recognition (HAR) is still burdensome. The existing requirement of labeled training data for adapting such classifiers to every new person, device, or on-body location is a significant barrier to the widespread adoption of HAR-based applications, making this a challenge of high practical importance. We propose the semi-supervised HAR method to improve reconstruction and generation. It executes proper adaptation with unlabeled data without changes to a pre-trained HAR classifier. Our approach decouples VAE with adversarial learning to ensure robust classifier operation, without newly labeled training data, under changes to the individual activity and the on-body sensor position. Our proposed framework shows the empirical results using the publicly available benchmark dataset compared to state-of-art baselines, achieving competitive improvement for handling new and unlabeled activity. The result demonstrates SAA has achieved a 5% improvement in classification score compared to the existing HAR platform.
Collapse
Affiliation(s)
- Keshav Thapa
- Department of Rehabilitation Medical Engineering, Daegu Haany University, Gyeongsan-si 38610, Republic of Korea
| | - Yousung Seo
- Department of Geriatric Rehabilitation, Daegu Haany University, Gyeongsan-si 38610, Republic of Korea
| | - Sung-Hyun Yang
- Department of Electronic Engineering, Kwangwoon University, Seoul 01897, Republic of Korea
| | - Kyong Kim
- Department of Rehabilitation Medical Engineering, Daegu Haany University, Gyeongsan-si 38610, Republic of Korea
| |
Collapse
|
21
|
Shi Z, Wang N, Kong F, Cao H, Cao Q. A semi-supervised learning method of latent features based on convolutional neural networks for CT metal artifact reduction. Med Phys 2022; 49:3845-3859. [PMID: 35322430 DOI: 10.1002/mp.15633] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 02/15/2022] [Accepted: 03/15/2022] [Indexed: 11/09/2022] Open
Abstract
PURPOSE X-ray computed tomography (CT) has become a convenient and efficient clinical medical technique. However, in the presence of metal implants, CT images may be corrupted by metal artifacts. The metal artifact reduction (MAR) methods based on deep learning are mostly supervised methods trained with labeled synthetic-artifact CT images. However, this causes the neural network to be biased toward learning specific synthetic-artifact patterns and leads to a poor generalization for unlabeled real-artifact CT images. In this study, a semi-supervised learning method of latent features based on convolutional neural networks (SLF-CNN) is developed to remove metal artifacts while ensuring a good generalization ability for real-artifact CT images. METHODS The proposed semi-supervised method extracts CT image features in alternate iterations of a synthetic-artifact learning stage and a real-artifact learning stage. In the synthetic-artifact learning stage, SLF-CNN is fed with paired synthetic-artifact CT images and is constrained using mean-squared-error (MSE) loss and Perceptual loss in a supervised learning fashion. In the real-artifact learning stage, the network weight is updated by minimizing the error between the pseudo-ground truths and the predicted latent features. The feature level pseudo-ground truths are obtained by modeling latent features using the Gaussian process. The overall framework of SLF-CNN adopts an encoder-decoder structure. The encoder is composed of artifact information collection groups to map the input artifact-affected synthetic-artifact CT images and real-artifact CT images into latent features. The decoder is composed of stacked ResNeXt blocks and is responsible for decoding latent features with high-level semantic information to reconstruct artifact-free CT images. The performance of the proposed method is evaluated through contrast experiments and ablation experiments. RESULTS The contrast experimental results indicate that the artifact-free CT images obtained by SLF-CNN have good metrics values, which are close to or better than those of typical supervised MAR methods. The metal artifacts in artifact-affected CT images are eliminated and the tissue structure details are preserved using SLF-CNN. In the ablation experiment shows that adding real-artifact CT images greatly improves the generalization ability of the network. CONCLUSIONS The proposed semi-supervised learning method of latent features for MAR effectively suppresses metal artifacts and improves the generalization ability of the network. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Zaifeng Shi
- School of Microelectronics, Tianjin University, Tianjin, 300072, China.,Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, Tianjin, 300072, China
| | - Ning Wang
- School of Microelectronics, Tianjin University, Tianjin, 300072, China
| | - Fanning Kong
- School of Microelectronics, Tianjin University, Tianjin, 300072, China
| | - Huaisheng Cao
- School of Microelectronics, Tianjin University, Tianjin, 300072, China
| | - Qingjie Cao
- School of Mathematical Sciences, Tianjin Normal University, Tianjin, 300387, China
| |
Collapse
|
22
|
He R, Xu S, Liu Y, Li Q, Liu Y, Zhao N, Yuan Y, Zhang H. Three-Dimensional Liver Image Segmentation Using Generative Adversarial Networks Based on Feature Restoration. Front Med (Lausanne) 2022; 8:794969. [PMID: 35071275 PMCID: PMC8777029 DOI: 10.3389/fmed.2021.794969] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 11/22/2021] [Indexed: 11/13/2022] Open
Abstract
Medical imaging provides a powerful tool for medical diagnosis. In the process of computer-aided diagnosis and treatment of liver cancer based on medical imaging, accurate segmentation of liver region from abdominal CT images is an important step. However, due to defects of liver tissue and limitations of CT imaging procession, the gray level of liver region in CT image is heterogeneous, and the boundary between the liver and those of adjacent tissues and organs is blurred, which makes the liver segmentation an extremely difficult task. In this study, aiming at solving the problem of low segmentation accuracy of the original 3D U-Net network, an improved network based on the three-dimensional (3D) U-Net, is proposed. Moreover, in order to solve the problem of insufficient training data caused by the difficulty of acquiring labeled 3D data, an improved 3D U-Net network is embedded into the framework of generative adversarial networks (GAN), which establishes a semi-supervised 3D liver segmentation optimization algorithm. Finally, considering the problem of poor quality of 3D abdominal fake images generated by utilizing random noise as input, deep convolutional neural networks (DCNN) based on feature restoration method is designed to generate more realistic fake images. By testing the proposed algorithm on the LiTS-2017 and KiTS19 dataset, experimental results show that the proposed semi-supervised 3D liver segmentation method can greatly improve the segmentation performance of liver, with a Dice score of 0.9424 outperforming other methods.
Collapse
Affiliation(s)
- Runnan He
- Peng Cheng Laboratory, Shenzhen, China
| | - Shiqi Xu
- School of Computer Science and Technology, Harbin Institute of Technology (HIT), Harbin, China
| | - Yashu Liu
- School of Computer Science and Technology, Harbin Institute of Technology (HIT), Harbin, China
| | - Qince Li
- Peng Cheng Laboratory, Shenzhen, China
- School of Computer Science and Technology, Harbin Institute of Technology (HIT), Harbin, China
| | - Yang Liu
- School of Computer Science and Technology, Harbin Institute of Technology (HIT), Harbin, China
| | - Na Zhao
- School of Instrument Science and Engineering, Southeast University, Nanjing, China
| | - Yongfeng Yuan
- School of Computer Science and Technology, Harbin Institute of Technology (HIT), Harbin, China
| | - Henggui Zhang
- Peng Cheng Laboratory, Shenzhen, China
- School of Physics and Astronomy, The University of Manchester, Manchester, United Kingdom
- Key Laboratory of Medical Electrophysiology of Ministry of Education and Medical Electrophysiological Key Laboratory of Sichuan Province, Institute of Cardiovascular Research, Southwest Medical University, Luzhou, China
| |
Collapse
|
23
|
Zinga MM, Abdel-Shafy E, Melak T, Vignoli A, Piazza S, Zerbini LF, Tenori L, Cacciatore S. KODAMA exploratory analysis in metabolic phenotyping. Front Mol Biosci 2022; 9:1070394. [PMID: 36733493 PMCID: PMC9887019 DOI: 10.3389/fmolb.2022.1070394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 12/28/2022] [Indexed: 01/18/2023] Open
Abstract
KODAMA is a valuable tool in metabolomics research to perform exploratory analysis. The advanced analytical technologies commonly used for metabolic phenotyping, mass spectrometry, and nuclear magnetic resonance spectroscopy push out a bunch of high-dimensional data. These complex datasets necessitate tailored statistical analysis able to highlight potentially interesting patterns from a noisy background. Hence, the visualization of metabolomics data for exploratory analysis revolves around dimensionality reduction. KODAMA excels at revealing local structures in high-dimensional data, such as metabolomics data. KODAMA has a high capacity to detect different underlying relationships in experimental datasets and correlate extracted features with accompanying metadata. Here, we describe the main application of KODAMA exploratory analysis in metabolomics research.
Collapse
Affiliation(s)
- Maria Mgella Zinga
- Bioinformatics Unit, International Centre for Genetic Engineering and Biotechnology, Cape Town, South Africa
- Department of Medical Parasitology and Entomology, Catholic University of Health and Allied Sciences, Mwanza, Tanzania
| | - Ebtesam Abdel-Shafy
- Bioinformatics Unit, International Centre for Genetic Engineering and Biotechnology, Cape Town, South Africa
- National Research Centre, Cairo, Egypt
| | - Tadele Melak
- Computation Biology, International Centre for Genetic Engineering and Biotechnology, Trieste, Italy
- Department of clinical chemistry, University of Gondar, Gondar, Ethiopia
| | - Alessia Vignoli
- Magnetic Resonance Center (CERM) and Department of Chemistry “Ugo Schiff”, University of Florence, Sesto Fiorentino, Italy
- Consorzio Interuniversitario Risonanze Magnetiche Metallo Proteine (CIRMMP), Sesto Fiorentino, Italy
| | - Silvano Piazza
- Computation Biology, International Centre for Genetic Engineering and Biotechnology, Trieste, Italy
| | - Luiz Fernando Zerbini
- Cancer Genomics, International Centre for Genetic Engineering and Biotechnology, Cape Town, South Africa
| | - Leonardo Tenori
- Magnetic Resonance Center (CERM) and Department of Chemistry “Ugo Schiff”, University of Florence, Sesto Fiorentino, Italy
- Consorzio Interuniversitario Risonanze Magnetiche Metallo Proteine (CIRMMP), Sesto Fiorentino, Italy
| | - Stefano Cacciatore
- Bioinformatics Unit, International Centre for Genetic Engineering and Biotechnology, Cape Town, South Africa
- Institute of Reproductive and Developmental Biology, Imperial College London, London, United Kingdom
- *Correspondence: Stefano Cacciatore,
| |
Collapse
|
24
|
Schmarje L, Brünger J, Santarossa M, Schröder SM, Kiko R, Koch R. Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy. Sensors (Basel) 2021; 21:6661. [PMID: 34640981 PMCID: PMC8512301 DOI: 10.3390/s21196661] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 10/01/2021] [Accepted: 10/02/2021] [Indexed: 11/17/2022]
Abstract
Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes. For underwater classification, and uncurated real-world datasets in general, clean class boundaries can often not be given due to a limited information content in the images and transitional stages of the depicted objects. This leads to different experts having different opinions and thus producing fuzzy labels which could also be considered ambiguous or divergent. We propose a novel framework for handling semi-supervised classifications of such fuzzy labels. It is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show the benefit of overclustering for fuzzy labels. We show that our framework is superior to previous state-of-the-art semi-supervised methods when applied to real-world plankton data with fuzzy labels. Moreover, we acquire 5 to 10% more consistent predictions of substructures.
Collapse
Affiliation(s)
- Lars Schmarje
- Multimedia Information Processing Group, Kiel University, 24118 Kiel, Germany; (J.B.); (M.S.); (S.-M.S.); (R.K.)
| | - Johannes Brünger
- Multimedia Information Processing Group, Kiel University, 24118 Kiel, Germany; (J.B.); (M.S.); (S.-M.S.); (R.K.)
| | - Monty Santarossa
- Multimedia Information Processing Group, Kiel University, 24118 Kiel, Germany; (J.B.); (M.S.); (S.-M.S.); (R.K.)
| | - Simon-Martin Schröder
- Multimedia Information Processing Group, Kiel University, 24118 Kiel, Germany; (J.B.); (M.S.); (S.-M.S.); (R.K.)
| | - Rainer Kiko
- Laboratoire d’Océanographie de Villefranche, Sorbonne Université, 06230 Villefranche-sur-Mer, France;
| | - Reinhard Koch
- Multimedia Information Processing Group, Kiel University, 24118 Kiel, Germany; (J.B.); (M.S.); (S.-M.S.); (R.K.)
| |
Collapse
|
25
|
Xue H, Wu Z, Long X, Ullah A, Chen S, Mat WK, Sun P, Gao MZ, Wang JQ, Wang HJ, Li X, Sun WJ, Qiao MQ. Copy number variation profile-based genomic typing of premenstrual dysphoric disorder in Chinese. J Genet Genomics 2021; 48:1070-1080. [PMID: 34530168 DOI: 10.1016/j.jgg.2021.08.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 08/13/2021] [Accepted: 08/13/2021] [Indexed: 11/16/2022]
Abstract
Premenstrual dysphoric disorder (PMDD) affects nearly 5% women of reproductive age. Symptomatic heterogeneity, together with largely unknown genetics, have greatly hindered its effective treatment. In the present study, analysis of genomic sequencing-based copy-number-variations (CNVs) called from 100-kb white blood cell DNA sequence windows by means of semi-supervised clustering led to the segregation of patient genomes into the D and V groups, which correlated with the depression and invasion clinical types respectively with 89.0% consistency. Application of diagnostic CNV features selected using the correlation-based machine-learning method enabled the classification of the CNVs obtained into the D group, V group, total-patient group and control group with an average accuracy of 83.0%. The power of the diagnostic CNV features was 0.98 on average, suggesting that these CNV features could be employed for the molecular diagnosis of the major clinical types of PMDD. This demonstrated concordnce between the CNV profiles and clinical types of PMDD supported the validity of symptom-based diagnosis of PMDD for differentiating between its two major clinical types, as well as the predominanly genetic nature of PMDD with a host of overlaps between multiple susceptibility genes/pathways and the diagnostic CNV features as indicators of involvement in PMDD etiology.
Collapse
Affiliation(s)
- Hong Xue
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China; Division of Life Science and Applied Genomics Center, Hong Kong University of Science and Technology, Hong Kong, China; School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, Jiangsu, 210009, China; HKUST Shenzhen Research Institute, 9 Yuexing First Road, Nanshan, Shenzhen, China; Guangzhou HKUST Fok Ying Tung Research Institute, Science and Technology Building, Nansha Information Technology Park, Nansha, Guangzhou, 511458, China.
| | - Zhenggang Wu
- Division of Life Science and Applied Genomics Center, Hong Kong University of Science and Technology, Hong Kong, China; School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, Jiangsu, 210009, China; HKUST Shenzhen Research Institute, 9 Yuexing First Road, Nanshan, Shenzhen, China; Guangzhou HKUST Fok Ying Tung Research Institute, Science and Technology Building, Nansha Information Technology Park, Nansha, Guangzhou, 511458, China
| | - Xi Long
- Division of Life Science and Applied Genomics Center, Hong Kong University of Science and Technology, Hong Kong, China; School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, Jiangsu, 210009, China; HKUST Shenzhen Research Institute, 9 Yuexing First Road, Nanshan, Shenzhen, China; Guangzhou HKUST Fok Ying Tung Research Institute, Science and Technology Building, Nansha Information Technology Park, Nansha, Guangzhou, 511458, China
| | - Ata Ullah
- Division of Life Science and Applied Genomics Center, Hong Kong University of Science and Technology, Hong Kong, China
| | - Si Chen
- Division of Life Science and Applied Genomics Center, Hong Kong University of Science and Technology, Hong Kong, China
| | - Wai-Kin Mat
- Division of Life Science and Applied Genomics Center, Hong Kong University of Science and Technology, Hong Kong, China
| | - Peng Sun
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China
| | - Ming-Zhou Gao
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China
| | - Jie-Qiong Wang
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China
| | - Hai-Jun Wang
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China
| | - Xia Li
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China
| | - Wen-Jun Sun
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China
| | - Ming-Qi Qiao
- Shandong University of Traditional Chinese Medicine, Jinan, Shandong, 250355, China.
| |
Collapse
|
26
|
Golhar M, Bobrow TL, Khoshknab MP, Jit S, Ngamruengphong S, Durr NJ. Improving Colonoscopy Lesion Classification Using Semi-Supervised Deep Learning. IEEE Access 2021; 9:631-640. [PMID: 33747680 PMCID: PMC7978231 DOI: 10.1109/access.2020.3047544] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
While data-driven approaches excel at many image analysis tasks, the performance of these approaches is often limited by a shortage of annotated data available for training. Recent work in semi-supervised learning has shown that meaningful representations of images can be obtained from training with large quantities of unlabeled data, and that these representations can improve the performance of supervised tasks. Here, we demonstrate that an unsupervised jigsaw learning task, in combination with supervised training, results in up to a 9.8% improvement in correctly classifying lesions in colonoscopy images when compared to a fully-supervised baseline. We additionally benchmark improvements in domain adaptation and out-of-distribution detection, and demonstrate that semi-supervised learning outperforms supervised learning in both cases. In colonoscopy applications, these metrics are important given the skill required for endoscopic assessment of lesions, the wide variety of endoscopy systems in use, and the homogeneity that is typical of labeled datasets.
Collapse
Affiliation(s)
- Mayank Golhar
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Taylor L Bobrow
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | | | - Simran Jit
- Division of Gastroenterology and Hepatology, Johns Hopkins Hospital, Baltimore, MD 21287, USA
| | - Saowanee Ngamruengphong
- Division of Gastroenterology and Hepatology, Johns Hopkins Hospital, Baltimore, MD 21287, USA
| | - Nicholas J Durr
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
27
|
Abstract
The computational prediction of interactions between drugs and targets is a standing challenge in drug discovery. State-of-the-art methods for drug-target interaction prediction are primarily based on supervised machine learning with known label information. However, in biomedicine, obtaining labeled training data is an expensive and a laborious process. This paper proposes a semi-supervised generative adversarial networks (GANs)-based method to predict binding affinity. Our method comprises two parts, two GANs for feature extraction and a regression network for prediction. The semi-supervised mechanism allows our model to learn proteins drugs features of both labeled and unlabeled data. We evaluate the performance of our method using multiple public datasets. Experimental results demonstrate that our method achieves competitive performance while utilizing freely available unlabeled data. Our results suggest that utilizing such unlabeled data can considerably help improve performance in various biomedical relation extraction processes, for example, Drug-Target interaction and protein-protein interaction, particularly when only limited labeled data are available in such tasks. To our best knowledge, this is the first semi-supervised GANs-based method to predict binding affinity.
Collapse
Affiliation(s)
- Lingling Zhao
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Junjie Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Long Pang
- Institute of Space Environment and Material Science, Harbin Institute of Technology, Harbin, China
| | - Yang Liu
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Jun Zhang
- Department of Rehabilitation, Heilongjiang Province Land Reclamation Headquarters General Hospital, Harbin, China
| |
Collapse
|
28
|
Qian X, Li E, Zhang J, Zhao SN, Wu QE, Zhang H, Wang W, Wu Y. Hardness Recognition of Robotic Forearm Based on Semi-supervised Generative Adversarial Networks. Front Neurorobot 2019; 13:73. [PMID: 31551748 PMCID: PMC6743412 DOI: 10.3389/fnbot.2019.00073] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 08/23/2019] [Indexed: 11/13/2022] Open
Abstract
The hardness recognition is of great significance to tactile sensing and robotic control. The hardness recognition methods based on deep learning have demonstrated a good performance, however, a huge amount of manually labeled samples which require lots of time and labor costs are necessary for the training of deep neural networks. In order to alleviate this problem, a semi-supervised generative adversarial network (GAN) which requires less manually labeled samples is proposed in this paper. First of all, a large number of unlabeled samples are made use of through the unsupervised training of GAN, which is used to provide a good initial state to the following model. Afterwards, the manually labeled samples corresponding to each hardness level are individually used to train the GAN, of which the architecture and initial parameter values are inherited from the unsupervised GAN, and augmented by the generator of trained GAN. Finally, the hardness recognition network (HRN), of which the main architecture and initial parameter values are inherited from the discriminator of unsupervised GAN, is pretrained by a large number of augmented labeled samples and fine-tuned by manually labeled samples. The hardness recognition result can be obtained online by importing the tactile data captured by the robotic forearm into the trained HRN. The experimental results demonstrate that the proposed method can significantly save the manual labeling work while providing an excellent recognition precision for hardness recognition.
Collapse
Affiliation(s)
- Xiaoliang Qian
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| | - Erkai Li
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| | - Jianwei Zhang
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| | - Su-Na Zhao
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| | - Qing-E Wu
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| | - Huanlong Zhang
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| | - Wei Wang
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| | - Yuanyuan Wu
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
| |
Collapse
|
29
|
Zhao D, Liu F, Meng H. Bearing Fault Diagnosis Based on the Switchable Normalization SSGAN with 1-D Representation of Vibration Signals as Input. Sensors (Basel) 2019; 19:E2000. [PMID: 31035634 DOI: 10.3390/s19092000] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Revised: 04/24/2019] [Accepted: 04/25/2019] [Indexed: 11/17/2022]
Abstract
The bearing is a component of the support shaft that guides the rotational movement of the shaft, widely used in the mechanical industry and also called a mechanical joint. In bearing fault diagnosis, the accuracy much depends on the feature extraction, which always needs a lot of training samples and classification in the commonly used methods. Neural networks are good at latent feature extraction and fault classification, however, they have problems with instability and over-fitting, and more labeled samples must be trained. Switchable normalization and semi-supervised learning are introduced to solve the above obstacles in this paper, which proposes a novel bearing fault diagnosis method based on switchable normalization semi-supervised generative adversarial networks (SN-SSGAN) with 1-dimensional representation of vibration signals as input. Experimental results showed that the proposed method has a desirable 99.93% classification accuracy in the case of less labeled data from the public data set of West Reserve University, which is better than the state-of-the-art methods.
Collapse
|
30
|
Graim K, Friedl V, Houlahan KE, Stuart JM. PLATYPUS: A Multiple-View Learning Predictive Framework for Cancer Drug Sensitivity Prediction. Pac Symp Biocomput 2019; 24:136-147. [PMID: 30864317 PMCID: PMC6417802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Cancer is a complex collection of diseases that are to some degree unique to each patient. Precision oncology aims to identify the best drug treatment regime using molecular data on tumor samples. While omics-level data is becoming more widely available for tumor specimens, the datasets upon which computational learning methods can be trained vary in coverage from sample to sample and from data type to data type. Methods that can 'connect the dots' to leverage more of the information provided by these studies could offer major advantages for maximizing predictive potential. We introduce a multi-view machinelearning strategy called PLATYPUS that builds 'views' from multiple data sources that are all used as features for predicting patient outcomes. We show that a learning strategy that finds agreement across the views on unlabeled data increases the performance of the learning methods over any single view. We illustrate the power of the approach by deriving signatures for drug sensitivity in a large cancer cell line database. Code and additional information are available from the PLATYPUS website https://sysbiowiki.soe.ucsc.edu/platypus.
Collapse
Affiliation(s)
| | - Verena Friedl
- Dept. of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA
| | | | - Joshua M. Stuart
- Dept. of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA
| |
Collapse
|
31
|
Li M, Li O, Liu G, Zhang C. Generative Adversarial Networks-Based Semi-Supervised Automatic Modulation Recognition for Cognitive Radio Networks. Sensors (Basel) 2018; 18:E3913. [PMID: 30428617 DOI: 10.3390/s18113913] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 11/07/2018] [Accepted: 11/09/2018] [Indexed: 11/30/2022]
Abstract
With the recently explosive growth of deep learning, automatic modulation recognition has undergone rapid development. Most of the newly proposed methods are dependent on large numbers of labeled samples. We are committed to using fewer labeled samples to perform automatic modulation recognition in the cognitive radio domain. Here, a semi-supervised learning method based on adversarial training is proposed which is called signal classifier generative adversarial network. Most of the prior methods based on this technology involve computer vision applications. However, we improve the existing network structure of a generative adversarial network by adding the encoder network and a signal spatial transform module, allowing our framework to address radio signal processing tasks more efficiently. These two technical improvements effectively avoid nonconvergence and mode collapse problems caused by the complexity of the radio signals. The results of simulations show that compared with well-known deep learning methods, our method improves the classification accuracy on a synthetic radio frequency dataset by 0.1% to 12%. In addition, we verify the advantages of our method in a semi-supervised scenario and obtain a significant increase in accuracy compared with traditional semi-supervised learning methods.
Collapse
|
32
|
Abstract
BACKGROUND The BioNLP Gene Regulation Task has attracted a diverse collection of submissions showcasing state-of-the-art systems. However, a principal challenge remains in obtaining a significant amount of recall. We argue that this is an important quality for Information Extraction tasks in this field. We propose a semi-supervised framework, leveraging a large corpus of unannotated data available to us. In this framework, the annotated data is used to find plausible candidates for positive data points, which are included in the machine learning process. As this is a method principally designed for gaining recall, we further explore additional methods to improve precision on top of this. These are: weighted regularisation in the SVM framework, and filtering out unlabelled examples based on a probabilistic rule-finding method. The latter method also allows us to add candidates for negatives from unlabelled data, a method not viable in the unfiltered approach. RESULTS We replicate one of the original participant systems, and modify it to incorporate our methods. This allows us to test the extent of our proposed methods by applying them to the GRN task data. We find a considerable improvement in recall compared to the baseline system. We also investigate the evaluation metrics and find several mechanisms explaining a bias towards precision. Furthermore, these findings uncover an intricate precision-recall interaction, depriving recall of its habitual immediacy seen in traditional machine learning set-ups. CONCLUSION Our contributions are twofold.
Collapse
Affiliation(s)
- Thomas Provoost
- Computer Science Department, KU Leuven, Celestijnenlaan 200A, 3001 Heverlee, Belgium
| | - Marie-Francine Moens
- Computer Science Department, KU Leuven, Celestijnenlaan 200A, 3001 Heverlee, Belgium
| |
Collapse
|