1
|
Yang L, Zhao Z, Zhao H. UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2025; 47:3031-3048. [PMID: 40031040 DOI: 10.1109/tpami.2025.3528453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Semi-supervised semantic segmentation (SSS) aims at learning rich visual knowledge from cheap unlabeled images to enhance semantic segmentation capability. Among recent works, UniMatch (Yang et al. 2023) improves its precedents tremendously by amplifying the practice of weak-to-strong consistency regularization. Subsequent works typically follow similar pipelines and propose various delicate designs. Despite the achieved progress, strangely, even in this flourishing era of numerous powerful vision models, almost all SSS works are still sticking to 1) using outdated ResNet encoders with small-scale ImageNet-1 K pre-training, and 2) evaluation on simple Pascal and Cityscapes datasets. In this work, we argue that, it is necessary to switch the baseline of SSS from ResNet-based encoders to more capable ViT-based encoders (e.g., DINOv2) that are pre-trained on massive data. A simple update on the encoder (even using 2× fewer parameters) can bring more significant improvement than careful method designs. Built on this competitive baseline, we present our upgraded and simplified UniMatch V2, inheriting the core spirit of weak-to-strong consistency from V1, but requiring less training cost and providing consistently better results. Additionally, witnessing the gradually saturated performance on Pascal and Cityscapes, we appeal that we should focus on more challenging benchmarks with complex taxonomy, such as ADE20K and COCO datasets.
Collapse
|
2
|
Ma F, Yuan Y, Xie Y, Ren H, Liu I, He Y, Ren F, Yu FR, Ni S. Generative technology for human emotion recognition: A scoping review. INFORMATION FUSION 2025; 115:102753. [DOI: 10.1016/j.inffus.2024.102753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2025]
|
3
|
Beau M, Herzfeld DJ, Naveros F, Hemelt ME, D'Agostino F, Oostland M, Sánchez-López A, Chung YY, Maibach M, Kyranakis S, Stabb HN, Martínez Lopera MG, Lajko A, Zedler M, Ohmae S, Hall NJ, Clark BA, Cohen D, Lisberger SG, Kostadinov D, Hull C, Häusser M, Medina JF. A deep learning strategy to identify cell types across species from high-density extracellular recordings. Cell 2025:S0092-8674(25)00110-2. [PMID: 40023155 DOI: 10.1016/j.cell.2025.01.041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 11/20/2024] [Accepted: 01/28/2025] [Indexed: 03/04/2025]
Abstract
High-density probes allow electrophysiological recordings from many neurons simultaneously across entire brain circuits but fail to reveal cell type. Here, we develop a strategy to identify cell types from extracellular recordings in awake animals and reveal the computational roles of neurons with distinct functional, molecular, and anatomical properties. We combine optogenetics and pharmacology using the cerebellum as a testbed to generate a curated ground-truth library of electrophysiological properties for Purkinje cells, molecular layer interneurons, Golgi cells, and mossy fibers. We train a semi-supervised deep learning classifier that predicts cell types with greater than 95% accuracy based on the waveform, discharge statistics, and layer of the recorded neuron. The classifier's predictions agree with expert classification on recordings using different probes, in different laboratories, from functionally distinct cerebellar regions, and across species. Our classifier extends the power of modern dynamical systems analyses by revealing the unique contributions of simultaneously recorded cell types during behavior.
Collapse
Affiliation(s)
- Maxime Beau
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | - David J Herzfeld
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA
| | - Francisco Naveros
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA; Department of Computer Engineering, Automation and Robotics, Research Centre for Information and Communication Technologies, University of Granada, Granada, Spain
| | - Marie E Hemelt
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA
| | - Federico D'Agostino
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | - Marlies Oostland
- Wolfson Institute for Biomedical Research, University College London, London, UK; Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, the Netherlands
| | | | - Young Yoon Chung
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | - Michael Maibach
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | - Stephen Kyranakis
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
| | - Hannah N Stabb
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | | | - Agoston Lajko
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | - Marie Zedler
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | - Shogo Ohmae
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
| | - Nathan J Hall
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA
| | - Beverley A Clark
- Wolfson Institute for Biomedical Research, University College London, London, UK
| | - Dana Cohen
- The Leslie and Susan Gonda Multidisciplinary Brain Research Center, Bar-Ilan University, Ramat-Gan, Israel
| | - Stephen G Lisberger
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA
| | - Dimitar Kostadinov
- Wolfson Institute for Biomedical Research, University College London, London, UK; Centre for Developmental Neurobiology, King's College London, London, UK
| | - Court Hull
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA
| | - Michael Häusser
- Wolfson Institute for Biomedical Research, University College London, London, UK; School of Biomedical Sciences, The University of Hong Kong, Hong Kong, China
| | - Javier F Medina
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
4
|
Ambreen S, Umar M, Noor A, Jain H, Ali R. Advanced AI and ML frameworks for transforming drug discovery and optimization: With innovative insights in polypharmacology, drug repurposing, combination therapy and nanomedicine. Eur J Med Chem 2025; 284:117164. [PMID: 39721292 DOI: 10.1016/j.ejmech.2024.117164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 11/24/2024] [Accepted: 11/27/2024] [Indexed: 12/28/2024]
Abstract
Artificial Intelligence (AI) and Machine Learning (ML) are transforming drug discovery by overcoming traditional challenges like high costs, time-consuming, and frequent failures. AI-driven approaches streamline key phases, including target identification, lead optimization, de novo drug design, and drug repurposing. Frameworks such as deep neural networks (DNNs), convolutional neural networks (CNNs), and deep reinforcement learning (DRL) models have shown promise in identifying drug targets, optimizing delivery systems, and accelerating drug repurposing. Generative adversarial networks (GANs) and variational autoencoders (VAEs) aid de novo drug design by creating novel drug-like compounds with desired properties. Case studies, such as DDR1 kinase inhibitors designed using generative models and CDK20 inhibitors developed via structure-based methods, highlight AI's ability to produce highly specific therapeutics. Models like SNF-CVAE and DeepDR further advance drug repurposing by uncovering new therapeutic applications for existing drugs. Advanced ML algorithms enhance precision in predicting drug efficacy, toxicity, and ADME-Tox properties, reducing development costs and improving drug-target interactions. AI also supports polypharmacology by optimizing multi-target drug interactions and enhances combination therapy through predictions of drug synergies and antagonisms. In nanomedicine, AI models like CURATE.AI and the Hartung algorithm optimize personalized treatments by predicting toxicological risks and real-time dosing adjustments with high accuracy. Despite its potential, challenges like data quality, model interpretability, and ethical concerns must be addressed. High-quality datasets, transparent models, and unbiased algorithms are essential for reliable AI applications. As AI continues to evolve, it is poised to revolutionize drug discovery and personalized medicine, advancing therapeutic development and patient care.
Collapse
Affiliation(s)
- Subiya Ambreen
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Mohammad Umar
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Aaisha Noor
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Himangini Jain
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Ruhi Ali
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India.
| |
Collapse
|
5
|
Nilsson A, Meimetis N, Lauffenburger DA. Towards an interpretable deep learning model of cancer. NPJ Precis Oncol 2025; 9:46. [PMID: 39948231 PMCID: PMC11825879 DOI: 10.1038/s41698-025-00822-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Accepted: 01/27/2025] [Indexed: 02/16/2025] Open
Abstract
Cancer is a manifestation of dysfunctional cell states. It emerges from an interplay of intrinsic and extrinsic factors that disrupt cellular dynamics, including genetic and epigenetic alterations, as well as the tumor microenvironment. This complexity can make it challenging to infer molecular causes for treating the disease. This may be addressed by system-wide computer models of cells, as they allow rapid generation and testing of hypotheses that would be too slow or impossible to perform in the laboratory and clinic. However, so far, such models have been impeded by both experimental and computational limitations. In this perspective, we argue that they can now be achieved using deep learning algorithms to integrate omics data and prior knowledge of molecular networks. Such models would have many applications in precision oncology, e.g., for identifying drug targets and biomarkers, predicting resistance mechanisms and toxicity effects of drugs, or simulating cell-cell interactions in the microenvironment.
Collapse
Affiliation(s)
- Avlant Nilsson
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biology and Biological Engineering, Chalmers University of Technology, Göteborg, Sweden
- Department of Cell and Molecular Biology, SciLifeLab, Karolinska Institutet, Stockholm, Sweden
| | - Nikolaos Meimetis
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Douglas A Lauffenburger
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
6
|
Lowe SC, Misiuk B, Xu I, Abdulazizov S, Baroi AR, Bastos AC, Best M, Ferrini V, Friedman A, Hart D, Hoegh-Guldberg O, Ierodiaconou D, Mackin-McLaughlin J, Markey K, Menandro PS, Monk J, Nemani S, O'Brien J, Oh E, Reshitnyk LY, Robert K, Roelfsema CM, Sameoto JA, Schimel ACG, Thomson JA, Wilson BR, Wong MC, Brown CJ, Trappenberg T. BenthicNet: A global compilation of seafloor images for deep learning applications. Sci Data 2025; 12:230. [PMID: 39920123 PMCID: PMC11806053 DOI: 10.1038/s41597-025-04491-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Accepted: 01/16/2025] [Indexed: 02/09/2025] Open
Abstract
Advances in underwater imaging enable collection of extensive seafloor image datasets necessary for monitoring important benthic ecosystems. The ability to collect seafloor imagery has outpaced our capacity to analyze it, hindering mobilization of this crucial environmental information. Machine learning approaches provide opportunities to increase the efficiency with which seafloor imagery is analyzed, yet large and consistent datasets to support development of such approaches are scarce. Here we present BenthicNet: a global compilation of seafloor imagery designed to support the training and evaluation of large-scale image recognition models. An initial set of over 11.4 million images was collected and curated to represent a diversity of seafloor environments using a representative subset of 1.3 million images. These are accompanied by 3.1 million annotations translated to the CATAMI scheme, which span 190,000 of the images. A large deep learning model was trained on this compilation and preliminary results suggest it has utility for automating large and small-scale image analysis tasks. The compilation and model are made openly available for reuse.
Collapse
Affiliation(s)
| | - Benjamin Misiuk
- Memorial University of Newfoundland, Department of Geography, St. John's, Newfoundland, Canada.
- Memorial University of Newfoundland, Department of Earth Sciences, St. John's, Newfoundland, Canada.
| | - Isaac Xu
- Dalhousie University, Faculty of Computer Science, Halifax, Nova Scotia, Canada
| | | | - Amit R Baroi
- Dalhousie University, School for Resource and Environmental Studies, Halifax, Nova Scotia, Canada
- Dalhousie University, DeepSense, Halifax, Nova Scotia, Canada
| | - Alex C Bastos
- Universidade Federal do Espírito Santo, Departamento de Oceanografia e Ecologia, Vitória, Brazil
| | - Merlin Best
- Fisheries and Oceans Canada, Marine Spatial Ecology and Analysis Section, Institute of Ocean Sciences, Sidney, British Columbia, Canada
| | - Vicki Ferrini
- Columbia University, Lamont-Doherty Earth Observatory, Palisades, New York, USA
| | - Ariell Friedman
- University of Sydney, Australian Centre for Field Robotics, Sydney, New South Wales, Australia
- Greybits Engineering, Sydney, New South Wales, Australia
| | - Deborah Hart
- National Oceanic and Atmospheric Administration Northeast Fisheries Science Center, Woods Hole, Massachusetts, USA
| | - Ove Hoegh-Guldberg
- University of Queensland, School of the Environment, Brisbane, Queensland, Australia
| | - Daniel Ierodiaconou
- Deakin University, School of Life and Environmental Sciences, Warrnambool, Victoria, Australia
| | | | - Kathryn Markey
- University of Queensland, School of the Environment, Brisbane, Queensland, Australia
| | - Pedro S Menandro
- Universidade Federal do Espírito Santo, Departamento de Oceanografia e Ecologia, Vitória, Brazil
| | - Jacquomo Monk
- University of Tasmania, Institute for Marine and Antarctic Studies, Hobart, Tasmania, Australia
| | - Shreya Nemani
- Fisheries and Marine Institute of Memorial University of Newfoundland, School of Ocean Technology, St. John's, Newfoundland, Canada
| | - John O'Brien
- Fisheries and Oceans Canada, Bedford Institute of Oceanography, Dartmouth, Nova Scotia, Canada
| | - Elizabeth Oh
- University of Tasmania, Institute for Marine and Antarctic Studies, Hobart, Tasmania, Australia
| | | | - Katleen Robert
- Fisheries and Marine Institute of Memorial University of Newfoundland, School of Ocean Technology, St. John's, Newfoundland, Canada
| | - Chris M Roelfsema
- University of Queensland, School of the Environment, Brisbane, Queensland, Australia
| | - Jessica A Sameoto
- Fisheries and Oceans Canada, Bedford Institute of Oceanography, Dartmouth, Nova Scotia, Canada
| | | | | | - Brittany R Wilson
- Fisheries and Oceans Canada, Bedford Institute of Oceanography, Dartmouth, Nova Scotia, Canada
| | - Melisa C Wong
- Fisheries and Oceans Canada, Bedford Institute of Oceanography, Dartmouth, Nova Scotia, Canada
| | - Craig J Brown
- Dalhousie University, Department of Oceanography, Halifax, Nova Scotia, Canada
| | - Thomas Trappenberg
- Dalhousie University, Faculty of Computer Science, Halifax, Nova Scotia, Canada
| |
Collapse
|
7
|
Wang C, Jia P, Tian X, Tang X, Hu X, Li H. Fault Diagnosis of Semi-Supervised Electromechanical Transmission Systems Under Imbalanced Unlabeled Sample Class Information Screening. ENTROPY (BASEL, SWITZERLAND) 2025; 27:175. [PMID: 40003172 PMCID: PMC11854703 DOI: 10.3390/e27020175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2024] [Revised: 01/20/2025] [Accepted: 01/23/2025] [Indexed: 02/27/2025]
Abstract
In the health monitoring of electromechanical transmission systems, the collected state data typically consist of only a minimal amount of labeled data, with a vast majority remaining unlabeled. Consequently, deep learning-based diagnostic models encounter the challenge of scarcity in labeled data and abundance in unlabeled data. Traditional semi-supervised deep learning methods based on pseudo-label self-training, while alleviating the issue of labeled data scarcity to some extent, neglect the reliability of pseudo-label information, the accuracy of feature extraction from unlabeled data, and the imbalance in sample selection. To address these issues, this paper proposes a novel semi-supervised fault diagnosis method under imbalanced unlabeled sample class information screening. Firstly, an information screening mechanism for unlabeled data based on active learning is established. This mechanism discriminates based on the variability of intrinsic feature information in fault samples, accurately screening out unlabeled samples located near decision boundaries that are difficult to separate clearly. Then, combining the maximum membership degree of these unlabeled data in the classification space of the supervised model and interacting with the active learning expert system, label information is assigned to the screened unlabeled data. Secondly, a cost-sensitive function driven by data imbalance is constructed to address the class imbalance problem in unlabeled sample screening, adaptively adjusting the weights of different class samples during model training to guide the training of the supervised model. Ultimately, through dynamic optimization of the supervised model and the feature extraction capability of unlabeled samples, the recognition ability of the diagnostic model for unlabeled samples is significantly enhanced. Validation through two datasets, encompassing a total of 12 experimental scenarios, demonstrates that in scenarios with only a small amount of labeled data, the proposed method achieves a diagnostic accuracy increment exceeding 10% compared to existing typical methods, fully validating the effectiveness and superiority of the proposed method in practical applications.
Collapse
Affiliation(s)
- Chaoge Wang
- School of Logistics Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Pengpeng Jia
- School of Logistics Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Xinyu Tian
- School of Logistics Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Xiaojing Tang
- School of Logistics Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Xiong Hu
- School of Logistics Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Hongkun Li
- School of Mechanical Engineering, Dalian University of Technology, Dalian 116024, China
| |
Collapse
|
8
|
Dai W, Lu R, Zhu J, Chen P, Yang H. Harnessing unlabeled data: Enhanced rare earth component content prediction based on BiLSTM-Deep autoencoder. ISA TRANSACTIONS 2025; 157:357-367. [PMID: 39757067 DOI: 10.1016/j.isatra.2024.12.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2024] [Revised: 12/13/2024] [Accepted: 12/13/2024] [Indexed: 01/07/2025]
Abstract
Traditional data-driven models for predicting rare earth component content are primarily developed by relying on supervised learning methods, which suffer from limitations such as a lack of labeled data, lagging, and poor usage of a major amount of unlabeled data. This paper proposes a novel prediction approach based on the BiLSTM-Deep autoencoder enhanced traditional LSSVM algorithm, termed BiLSTM-DeepAE-LSSVM. This approach thoroughly exploits the implicit information contained in copious amounts of unlabeled data in the rare earth production process, thereby improving the traditional supervised prediction method and increasing the accuracy of component content predictions. Initially, a BiLSTM autoencoder is established for unsupervised training on the rare earth production process data, enabling the extraction of inherent time series characteristics. Subsequently, boolean vectors are introduced in the Deep autoencoder training process to perform masking operations on the input data, simulating scenarios with noise and missing data. This is facilitated by their adherence to Bernoulli distributions, which allow for the random setting of certain input vector dimensions to zero. Additionally, the Deep autoencoder is capable of extracting high-dimensional implicit features from the data. After that, the conventional supervised prediction technique, least squares support vector machine (LSSVM), is fused with the implicit characteristics derived from the well-constructed BiLSTM-Deep autoencoder, culminating in the creation of a prediction model for rare earth component content. Ultimately, the simulation verification using LaCe/PrNd extraction field data demonstrates the effectiveness of the proposed approach in harnessing substantial quantities of unlabeled data from the rare earth extraction production process, thereby bolstering the accuracy of model predictions.
Collapse
Affiliation(s)
- Wenhao Dai
- School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, 330013, Jiangxi, China; Key Laboratory of Advanced Control & Optimization of Jiangxi Province, Nanchang, 330013, Jiangxi, China.
| | - Rongxiu Lu
- School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, 330013, Jiangxi, China; Key Laboratory of Advanced Control & Optimization of Jiangxi Province, Nanchang, 330013, Jiangxi, China.
| | - Jianyong Zhu
- School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, 330013, Jiangxi, China; Key Laboratory of Advanced Control & Optimization of Jiangxi Province, Nanchang, 330013, Jiangxi, China.
| | - Pengzhan Chen
- School of Intelligent Manufacture, Taizhou University, Taizhou, 318000, Zhejiang, China.
| | - Hui Yang
- School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, 330013, Jiangxi, China; Key Laboratory of Advanced Control & Optimization of Jiangxi Province, Nanchang, 330013, Jiangxi, China.
| |
Collapse
|
9
|
Xiao Z, Tong H, Qu R, Xing H, Luo S, Zhu Z, Song F, Feng L. CapMatch: Semi-Supervised Contrastive Transformer Capsule With Feature-Based Knowledge Distillation for Human Activity Recognition. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:2690-2704. [PMID: 38150344 DOI: 10.1109/tnnls.2023.3344294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2023]
Abstract
This article proposes a semi-supervised contrastive capsule transformer method with feature-based knowledge distillation (KD) that simplifies the existing semisupervised learning (SSL) techniques for wearable human activity recognition (HAR), called CapMatch. CapMatch gracefully hybridizes supervised learning and unsupervised learning to extract rich representations from input data. In unsupervised learning, CapMatch leverages the pseudolabeling, contrastive learning (CL), and feature-based KD techniques to construct similarity learning on lower and higher level semantic information extracted from two augmentation versions of the data, "weak" and "timecut," to recognize the relationships among the obtained features of classes in the unlabeled data. CapMatch combines the outputs of the weak- and timecut-augmented models to form pseudolabeling and thus CL. Meanwhile, CapMatch uses the feature-based KD to transfer knowledge from the intermediate layers of the weak-augmented model to those of the timecut-augmented model. To effectively capture both local and global patterns of HAR data, we design a capsule transformer network consisting of four capsule-based transformer blocks and one routing layer. Experimental results show that compared with a number of state-of-the-art semi-supervised and supervised algorithms, the proposed CapMatch achieves decent performance on three commonly used HAR datasets, namely, HAPT, WISDM, and UCI_HAR. With only 10% of data labeled, CapMatch achieves values of higher than 85.00% on these datasets, outperforming 14 semi-supervised algorithms. When the proportion of labeled data reaches 30%, CapMatch obtains values of no lower than 88.00% on the datasets above, which is better than several classical supervised algorithms, e.g., decision tree and -nearest neighbor (KNN).
Collapse
|
10
|
Hresko DJ, Drotar P, Ngo QC, Kumar DK. Enhanced Domain Adaptation for Foot Ulcer Segmentation Through Mixing Self-Trained Weak Labels. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2025; 38:455-466. [PMID: 39020158 PMCID: PMC11810871 DOI: 10.1007/s10278-024-01193-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 05/04/2024] [Accepted: 05/22/2024] [Indexed: 07/19/2024]
Abstract
Wound management requires the measurement of the wound parameters such as its shape and area. However, computerized analysis of the wound suffers the challenge of inexact segmentation of the wound images due to limited or inaccurate labels. It is a common scenario that the source domain provides an abundance of labeled data, while the target domain provides only limited labels. To overcome this, we propose a novel approach that combines self-training learning and mixup augmentation. The neural network is trained on the source domain to generate weak labels on the target domain via the self-training process. In the second stage, generated labels are mixed up with labels from the source domain to retrain the neural network and enhance generalization across diverse datasets. The efficacy of our approach was evaluated using the DFUC 2022, FUSeg, and RMIT datasets, demonstrating substantial improvements in segmentation accuracy and robustness across different data distributions. Specifically, in single-domain experiments, segmentation on the DFUC 2022 dataset scored a dice score of 0.711, while the score on the FUSeg dataset achieved 0.859. For domain adaptation, when these datasets were used as target datasets, the dice scores were 0.714 for DFUC 2022 and 0.561 for FUSeg.
Collapse
Affiliation(s)
- David Jozef Hresko
- IISLab, Technical University of Kosice, Letna 1/9, Kosice, 04200, Kosicky Kraj, Slovakia
| | - Peter Drotar
- IISLab, Technical University of Kosice, Letna 1/9, Kosice, 04200, Kosicky Kraj, Slovakia.
| | - Quoc Cuong Ngo
- School of Engineering, RMIT University, 80/445 Swanston St, Melbourne, 3000, VIC, Australia
| | - Dinesh Kant Kumar
- School of Engineering, RMIT University, 80/445 Swanston St, Melbourne, 3000, VIC, Australia
| |
Collapse
|
11
|
Essahraui S, Lamaakal I, El Hamly I, Maleh Y, Ouahbi I, El Makkaoui K, Filali Bouami M, Pławiak P, Alfarraj O, Abd El-Latif AA. Real-Time Driver Drowsiness Detection Using Facial Analysis and Machine Learning Techniques. SENSORS (BASEL, SWITZERLAND) 2025; 25:812. [PMID: 39943451 PMCID: PMC11819803 DOI: 10.3390/s25030812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/05/2025] [Revised: 01/25/2025] [Accepted: 01/27/2025] [Indexed: 02/16/2025]
Abstract
Drowsy driving poses a significant challenge to road safety worldwide, contributing to thousands of accidents and fatalities annually. Despite advancements in driver drowsiness detection (DDD) systems, many existing methods face limitations such as intrusiveness and delayed reaction times. This research addresses these gaps by leveraging facial analysis and state-of-the-art machine learning techniques to develop a real-time, non-intrusive DDD system. A distinctive aspect of this research is its systematic assessment of various machine and deep learning algorithms across three pivotal public datasets, the NTHUDDD, YawDD, and UTA-RLDD, known for their widespread use in drowsiness detection studies. Our evaluation covered techniques including the K-Nearest Neighbors (KNNs), support vector machines (SVMs), convolutional neural networks (CNNs), and advanced computer vision (CV) models such as YOLOv5, YOLOv8, and Faster R-CNN. Notably, the KNNs classifier reported the highest accuracy of 98.89%, a precision of 99.27%, and an F1 score of 98.86% on the UTA-RLDD. Among the CV methods, YOLOv5 and YOLOv8 demonstrated exceptional performance, achieving 100% precision and recall with mAP@0.5 values of 99.5% on the UTA-RLDD. In contrast, Faster R-CNN showed an accuracy of 81.0% and a precision of 63.4% on the same dataset. These results demonstrate the potential of our system to significantly enhance road safety by providing proactive alerts in real time.
Collapse
Affiliation(s)
- Siham Essahraui
- Multidisciplinary Faculty of Nador, Mohammed Premier University, Oujda 60000, Morocco; (S.E.); (I.L.); (I.E.H.); (I.O.); (K.E.M.); (M.F.B.)
| | - Ismail Lamaakal
- Multidisciplinary Faculty of Nador, Mohammed Premier University, Oujda 60000, Morocco; (S.E.); (I.L.); (I.E.H.); (I.O.); (K.E.M.); (M.F.B.)
| | - Ikhlas El Hamly
- Multidisciplinary Faculty of Nador, Mohammed Premier University, Oujda 60000, Morocco; (S.E.); (I.L.); (I.E.H.); (I.O.); (K.E.M.); (M.F.B.)
| | - Yassine Maleh
- Laboratory LaSTI, ENSAK, Sultan Moulay Slimane University, Khouribga 54000, Morocco
| | - Ibrahim Ouahbi
- Multidisciplinary Faculty of Nador, Mohammed Premier University, Oujda 60000, Morocco; (S.E.); (I.L.); (I.E.H.); (I.O.); (K.E.M.); (M.F.B.)
| | - Khalid El Makkaoui
- Multidisciplinary Faculty of Nador, Mohammed Premier University, Oujda 60000, Morocco; (S.E.); (I.L.); (I.E.H.); (I.O.); (K.E.M.); (M.F.B.)
| | - Mouncef Filali Bouami
- Multidisciplinary Faculty of Nador, Mohammed Premier University, Oujda 60000, Morocco; (S.E.); (I.L.); (I.E.H.); (I.O.); (K.E.M.); (M.F.B.)
| | - Paweł Pławiak
- Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Warszawska 24, 31-155 Krakow, Poland;
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Bałtycka 5, 44-100 Gliwice, Poland
| | - Osama Alfarraj
- Computer Science Department, Community College, King Saud University, Riyadh 11437, Saudi Arabia;
| | - Ahmed A. Abd El-Latif
- Jadara University Research Center, Jadara University, Irbid 21110, Jordan;
- Department of Mathematics and Computer Science, Faculty of Science, Menoufia University, Shebin El-Koom 32511, Egypt
| |
Collapse
|
12
|
Bushiri Pwesombo D, Beese C, Schmied C, Sun H. Semisupervised Contrastive Learning for Bioactivity Prediction Using Cell Painting Image Data. J Chem Inf Model 2025; 65:528-543. [PMID: 39761993 PMCID: PMC11776044 DOI: 10.1021/acs.jcim.4c00835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 12/17/2024] [Accepted: 12/26/2024] [Indexed: 01/28/2025]
Abstract
Morphological profiling has recently demonstrated remarkable potential for identifying the biological activities of small molecules. Alongside the fully supervised and self-supervised machine learning methods recently proposed for bioactivity prediction from Cell Painting image data, we introduce here a semisupervised contrastive (SemiSupCon) learning approach. This approach combines the strengths of using biological annotations in supervised contrastive learning and leveraging large unannotated image data sets with self-supervised contrastive learning. SemiSupCon enhances downstream prediction performance of classifying MeSH pharmacological classifications from PubChem, as well as mode of action and biological target annotations from the Drug Repurposing Hub across two publicly available Cell Painting data sets. Notably, our approach has effectively predicted the biological activities of several unannotated compounds, and these findings were validated through literature searches. This demonstrates that our approach can potentially expedite the exploration of biological activity based on Cell Painting image data with minimal human intervention.
Collapse
Affiliation(s)
- David Bushiri Pwesombo
- Research
Unit Structural Chemistry and Computational Biophysics, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin 13125, Germany
- Institute
of Chemistry, Technische Universität
Berlin, 10623 Berlin, Germany
| | - Carsten Beese
- Research
Unit Structural Chemistry and Computational Biophysics, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin 13125, Germany
| | - Christopher Schmied
- Research
Unit Structural Chemistry and Computational Biophysics, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin 13125, Germany
- EU-OPENSCREEN, Berlin 13125, Germany
| | - Han Sun
- Research
Unit Structural Chemistry and Computational Biophysics, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin 13125, Germany
- Institute
of Chemistry, Technische Universität
Berlin, 10623 Berlin, Germany
| |
Collapse
|
13
|
Yang Y, Cheng F. Artificial intelligence streamlines scientific discovery of drug-target interactions. Br J Pharmacol 2025. [PMID: 39843168 DOI: 10.1111/bph.17427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Revised: 10/04/2024] [Accepted: 11/01/2024] [Indexed: 01/24/2025] Open
Abstract
Drug discovery is a complicated process through which new therapeutics are identified to prevent and treat specific diseases. Identification of drug-target interactions (DTIs) stands as a pivotal aspect within the realm of drug discovery and development. The traditional process of drug discovery, especially identification of DTIs, is marked by its high costs of experimental assays and low success rates. Computational methods have emerged as indispensable tools, especially those employing artificial intelligence (AI) methods, which could streamline the process, thereby reducing costs and time consumption and potentially increasing success rates. In this review, we focus on the application of AI techniques in DTI prediction. Specifically, we commence with a comprehensive overview of drug discovery and development, along with systematic prediction and validation of DTIs. We proceed to highlight the prominent databases and toolkits used in developing AI methods for DTI prediction, as well as with methodologies for evaluating their efficacy. We further extend the exploration into three primary types of state-of-the-art AI methods used in DTI prediction, including classical machine learning, deep learning and network-based methods. Finally, we summarize the key findings and outline the current challenges and future directions that AI methods face in scientific drug discovery and development.
Collapse
Affiliation(s)
- Yuxin Yang
- Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | - Feixiong Cheng
- Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, Ohio, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| |
Collapse
|
14
|
Salmanpour MR, Gorji A, Mousavi A, Fathi Jouzdani A, Sanati N, Maghsudi M, Leung B, Ho C, Yuan R, Rahmim A. Enhanced Lung Cancer Survival Prediction Using Semi-Supervised Pseudo-Labeling and Learning from Diverse PET/CT Datasets. Cancers (Basel) 2025; 17:285. [PMID: 39858067 PMCID: PMC11763441 DOI: 10.3390/cancers17020285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2024] [Revised: 01/09/2025] [Accepted: 01/13/2025] [Indexed: 01/27/2025] Open
Abstract
OBJECTIVE This study explores a semi-supervised learning (SSL), pseudo-labeled strategy using diverse datasets such as head and neck cancer (HNCa) to enhance lung cancer (LCa) survival outcome predictions, analyzing handcrafted and deep radiomic features (HRF/DRF) from PET/CT scans with hybrid machine learning systems (HMLSs). METHODS We collected 199 LCa patients with both PET and CT images, obtained from TCIA and our local database, alongside 408 HNCa PET/CT images from TCIA. We extracted 215 HRFs and 1024 DRFs by PySERA and a 3D autoencoder, respectively, within the ViSERA 1.0.0 software, from segmented primary tumors. The supervised strategy (SL) employed an HMLS-PCA connected with six classifiers on both HRFs and DRFs. The SSL strategy expanded the datasets by adding 408 pseudo-labeled HNCa cases (labeled by the Random Forest algorithm) to 199 LCa cases, using the same HMLS techniques. Furthermore, principal component analysis (PCA) linked with four survival prediction algorithms were utilized in the survival hazard ratio analysis. RESULTS The SSL strategy outperformed the SL method (p << 0.001), achieving an average accuracy of 0.85 ± 0.05 with DRFs from PET and PCA + Multi-Layer Perceptron (MLP), compared to 0.69 ± 0.06 for the SL strategy using DRFs from CT and PCA + Light Gradient Boosting (LGB). Additionally, PCA linked with Component-wise Gradient Boosting Survival Analysis on both HRFs and DRFs, as extracted from CT, had an average C-index of 0.80, with a log rank p-value << 0.001, confirmed by external testing. CONCLUSIONS Shifting from HRFs and SL to DRFs and SSL strategies, particularly in contexts with limited data points, enabling CT or PET alone, can significantly achieve high predictive performance.
Collapse
Affiliation(s)
- Mohammad R. Salmanpour
- BC Cancer Research Institute, Vancouver, BC V5Z 1L3, Canada; (C.H.); (A.R.)
- Department of Radiology, University of British Columbia, Vancouver, BC V6T 1Z4, Canada;
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC V6L 1L7, Canada; (A.G.); (A.M.); (A.F.J.); (N.S.); (M.M.)
| | - Arman Gorji
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC V6L 1L7, Canada; (A.G.); (A.M.); (A.F.J.); (N.S.); (M.M.)
- Neuroscience and Artificial Intelligence Research Group (NAIRG), Department of Neuroscience, Hamadan University of Medical Sciences, Hamadan 6517838736, Iran
| | - Amin Mousavi
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC V6L 1L7, Canada; (A.G.); (A.M.); (A.F.J.); (N.S.); (M.M.)
| | - Ali Fathi Jouzdani
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC V6L 1L7, Canada; (A.G.); (A.M.); (A.F.J.); (N.S.); (M.M.)
- Neuroscience and Artificial Intelligence Research Group (NAIRG), Department of Neuroscience, Hamadan University of Medical Sciences, Hamadan 6517838736, Iran
| | - Nima Sanati
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC V6L 1L7, Canada; (A.G.); (A.M.); (A.F.J.); (N.S.); (M.M.)
- Neuroscience and Artificial Intelligence Research Group (NAIRG), Department of Neuroscience, Hamadan University of Medical Sciences, Hamadan 6517838736, Iran
| | - Mehdi Maghsudi
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC V6L 1L7, Canada; (A.G.); (A.M.); (A.F.J.); (N.S.); (M.M.)
| | - Bonnie Leung
- BC Cancer, Vancouver Center, Vancouver, BC V5Z 1L3, Canada;
| | - Cheryl Ho
- BC Cancer Research Institute, Vancouver, BC V5Z 1L3, Canada; (C.H.); (A.R.)
| | - Ren Yuan
- Department of Radiology, University of British Columbia, Vancouver, BC V6T 1Z4, Canada;
- BC Cancer, Vancouver Center, Vancouver, BC V5Z 1L3, Canada;
| | - Arman Rahmim
- BC Cancer Research Institute, Vancouver, BC V5Z 1L3, Canada; (C.H.); (A.R.)
- Department of Radiology, University of British Columbia, Vancouver, BC V6T 1Z4, Canada;
- Department of Physics & Astronomy, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| |
Collapse
|
15
|
Liu Z, Shang L, Huang K, Yue Z, Han AY, Wang D, Zhang H. Combining Group Contribution Method and Semisupervised Learning to Build Machine Learning Models for Predicting Hydroxyl Radical Rate Constants of Water Contaminants. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2025; 59:857-868. [PMID: 39723902 DOI: 10.1021/acs.est.4c11950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2024]
Abstract
Machine learning is an effective tool for predicting reaction rate constants for many organic compounds with the hydroxyl radical (HO•). Previously reported models have achieved relatively good performance, but due to scarce data (<1400 records), the applicability domain (AD) has been significantly limited. To address this limitation, we curated a much larger experimental data set (Primary data set), which contains 2358 kinetic records. We then employed both the group contribution method (GCM) and a semisupervised learning (SSL) strategy to add new data points, aiming to effectively expand the model's AD while improving model performance. The results indicated that GCM improved the model's performance for chemicals outside the AD, while SSL expanded the model's AD. The final model, after incorporating 147,168 new data points, achieved an R2 = 0.77, root-mean-square-error = 0.32, and mean-absolute-error = 0.24 on the test set. Importantly, the AD was expanded by 117% compared to the model developed solely based on the Primary data set, and the final model can be reliably applied to more than 560,000 chemicals from the DSSTox database. Further model interpretation results indicated that the model made predictions based on a correct "understanding" of the impact of key substituents and reactive sites toward HO•. This research provides an effective method for augmenting data sets, which is important in improving ML model performance and expanding AD. The final model has been made widely accessible through a free online predictor.
Collapse
Affiliation(s)
- Zhao Liu
- Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, Ohio 44106, United States
| | - Lanyu Shang
- School of Information Sciences, University of Illinois Urbana-Champaign, Champaign, Illinois 61820, United States
| | - Kuan Huang
- Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, Ohio 44106, United States
| | - Zhenrui Yue
- School of Information Sciences, University of Illinois Urbana-Champaign, Champaign, Illinois 61820, United States
| | - Alan Y Han
- Department of Computer Science, Cornell University, Ithaca, New York 14850, United States
| | - Dong Wang
- School of Information Sciences, University of Illinois Urbana-Champaign, Champaign, Illinois 61820, United States
| | - Huichun Zhang
- Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, Ohio 44106, United States
| |
Collapse
|
16
|
Bi H, Feng Y, Diao W, Wang P, Mao Y, Fu K, Wang H, Sun X. Prompt-and-Transfer: Dynamic Class-Aware Enhancement for Few-Shot Segmentation. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2025; 47:131-148. [PMID: 39288051 DOI: 10.1109/tpami.2024.3461779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/19/2024]
Abstract
For more efficient generalization to unseen domains (classes), most Few-shot Segmentation (FSS) would directly exploit pre-trained encoders and only fine-tune the decoder, especially in the current era of large models. However, such fixed feature encoders tend to be class-agnostic, inevitably activating objects that are irrelevant to the target class. In contrast, humans can effortlessly focus on specific objects in the line of sight. This paper mimics the visual perception pattern of human beings and proposes a novel and powerful prompt-driven scheme, called "Prompt and Transfer" (PAT), which constructs a dynamic class-aware prompting paradigm to tune the encoder for focusing on the interested object (target class) in the current task. Three key points are elaborated to enhance the prompting: 1) Cross-modal linguistic information is introduced to initialize prompts for each task. 2) Semantic Prompt Transfer (SPT) that precisely transfers the class-specific semantics within the images to prompts. 3) Part Mask Generator (PMG) that works in conjunction with SPT to adaptively generate different but complementary part prompts for different individuals. Surprisingly, PAT achieves competitive performance on 4 different tasks including standard FSS, Cross-domain FSS (e.g., CV, medical, and remote sensing domains), Weak-label FSS, and Zero-shot Segmentation, setting new state-of-the-arts on 11 benchmarks.
Collapse
|
17
|
Du Y, Luo S, Xin Y, Chen M, Feng S, Zhang M, Wang C. Multi-source fully test-time adaptation. Neural Netw 2025; 181:106661. [PMID: 39393207 DOI: 10.1016/j.neunet.2024.106661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 06/22/2024] [Accepted: 08/21/2024] [Indexed: 10/13/2024]
Abstract
Deep neural networks have significantly advanced various fields. However, these models often encounter difficulties in achieving effective generalization when the distribution of test samples varies from that of the training samples. Recently, some fully test-time adaptation methods have been proposed to adapt the trained model with the unlabeled test samples before prediction to enhance the test performance. Despite achieving remarkable results, these methods only involve one trained model, which could only provide certain side information for the test samples. In real-world scenarios, there could be multiple available trained models that are beneficial to the test samples and are complementary to each other. Consequently, to better utilize these trained models, in this paper, we propose the problem of multi-source fully test-time adaptation to adapt multiple trained models to the test samples. To address this problem, we introduce a simple yet effective method utilizing a weighted aggregation scheme and introduce two unsupervised losses. The former could adaptively assign a higher weight to a more relevant model, while the latter could jointly adapt models with online unlabeled samples. Extensive experiments on three image classification datasets show that the proposed method achieves better results than baseline methods, demonstrating the superiority in adapting to multiple models.
Collapse
Affiliation(s)
- Yuntao Du
- Beijing Institute for General Artificial Intelligence (BIGAI), Beijing, China
| | - Siqi Luo
- State Key Laboratory for Novel Software Technology at Nanjing University, Nanjing, China
| | - Yi Xin
- State Key Laboratory for Novel Software Technology at Nanjing University, Nanjing, China
| | - Mingcai Chen
- State Key Laboratory for Novel Software Technology at Nanjing University, Nanjing, China
| | - Shuai Feng
- State Key Laboratory for Novel Software Technology at Nanjing University, Nanjing, China
| | - Mujie Zhang
- State Key Laboratory for Novel Software Technology at Nanjing University, Nanjing, China
| | - Chonngjun Wang
- State Key Laboratory for Novel Software Technology at Nanjing University, Nanjing, China.
| |
Collapse
|
18
|
Wei C, Wang Z, Yuan J, Wang X, Liu H, Zhao Q. SemiHAR: Improving Semisupervised Human Activity Recognition via Multitask Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:1884-1897. [PMID: 37971917 DOI: 10.1109/tnnls.2023.3330879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2023]
Abstract
Semisupervised human activity recognition (SemiHAR) has attracted attention in recent years from various domains, such as digital health and ambient intelligence. Currently, it still faces two challenges. For one thing, discriminative features may exist among multiple sequences rather than a single sequence since activities are combinations of motions involving several body parts. For another thing, labeled data and unlabeled data suffer from distribution discrepancies due to the different behavior patterns or biological conditions of users. For that, we propose a novel SemiHAR method based on multitask learning. First, a dimension-based Markov transition field (DMTF) technique is designed to generate 2-D activity data for capturing the interactions among different dimensions. Second, we jointly consider the user recognition (UR) task and the activity recognition (AR) task to reduce the underlying discrepancy. In addition, a task relation learner (TRL) is introduced to dynamically learn task relations, which enables the primary AR task to exploit preferred knowledge from other secondary tasks. We theoretically analyze the proposed SemiHAR and provide a novel generalization result. Extensive experiments conducted on four real-world datasets demonstrate that SemiHAR outperforms other state-of-the-art methods.
Collapse
|
19
|
Seeuws N, De Vos M, Bertrand A. A Human-in-the-Loop Method for Annotation of Events in Biomedical Signals. IEEE J Biomed Health Inform 2025; 29:95-106. [PMID: 39269811 DOI: 10.1109/jbhi.2024.3460533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
OBJECTIVE Building large-scale data bases of biomedical signal recordings for training artificial-intelligence systems involves substantial human effort in data processing and annotation. In the case of event detection, experts need to exhaustively scroll through the recordings and highlight events of interest. METHODS We propose an iterative annotation support algorithm with a human in the loop to improve the efficiency of the annotation process. Our algorithm generates proposal events based on an event detection model trained on incomplete annotations. The human only needs to verify candidate events proposed by the tool instead of scrolling through the entire data set. Our algorithm iterates between proposal generation and verification to leverage the human-in-the-loop feedback to obtain a growing set of event annotations. RESULTS Our algorithm finds a substantial amount of events at a fraction of the human time spent when comparing with a benchmark method and the normal manual process, finding all events in one data set and 70% of events in another with the human-in-the-loop only viewing 20% of the data. CONCLUSION Our results show that combining human and computer effort can substantially speed up the annotation process for events in biomedical signal processing. SIGNIFICANCE Due to its simplicity and minimal reliance on task-specific information, our algorithm is broadly applicable, unlocking substantial improvements in the scalability and efficiency of biomedical signal annotation.
Collapse
|
20
|
Dominguez-Morales JP, Hernandez-Rodriguez JC, Duran-Lopez L, Conejo-Mir J, Pereyra-Rodriguez JJ. Melanoma Breslow Thickness Classification Using Ensemble-Based Knowledge Distillation With Semi-Supervised Convolutional Neural Networks. IEEE J Biomed Health Inform 2025; 29:443-455. [PMID: 39302772 DOI: 10.1109/jbhi.2024.3465929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
Melanoma is considered a global public health challenge and is responsible for more than 90% deaths related to skin cancer. Although the diagnosis of early melanoma is the main goal of dermoscopy, the discrimination between dermoscopic images of in situ and invasive melanomas can be a difficult task even for experienced dermatologists. Recent advances in artificial intelligence in the field of medical image analysis show that its application to dermoscopy with the aim of supporting and providing a second opinion to the medical expert could be of great interest. In this work, four datasets from different sources were used to train and evaluate deep learning models on in situ versus invasive melanoma classification and on Breslow thickness prediction. Supervised learning and semi-supervised learning using a multi-teacher ensemble knowledge distillation approach were considered and evaluated using a stratified 5-fold cross-validation scheme. The best models achieved AUCs of 0.80850.0242 and of 0.82320.0666 on the former and latter classification tasks, respectively. The best results were obtained using semi-supervised learning, with the best model achieving 0.8547 and 0.8768 AUC, respectively. An external test set was also evaluated, where semi-supervision achieved higher performance in all the classification tasks. The results obtained show that semi-supervised learning could improve the performance of trained models in different melanoma classification tasks compared to supervised learning. Automatic deep learning-based diagnosis systems could support medical professionals in their decision, serving as a second opinion or as a triage tool for medical centers.
Collapse
|
21
|
Saihood A, Abdulhussien WR, Alzubaid L, Manoufali M, Gu Y. Fusion-driven semi-supervised learning-based lung nodules classification with dual-discriminator and dual-generator generative adversarial network. BMC Med Inform Decis Mak 2024; 24:403. [PMID: 39716221 DOI: 10.1186/s12911-024-02820-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Accepted: 12/16/2024] [Indexed: 12/25/2024] Open
Abstract
BACKGROUND The detection and classification of lung nodules are crucial in medical imaging, as they significantly impact patient outcomes related to lung cancer diagnosis and treatment. However, existing models often suffer from mode collapse and poor generalizability, as they fail to capture the complete diversity of the data distribution. This study addresses these challenges by proposing a novel generative adversarial network (GAN) architecture tailored for semi-supervised lung nodule classification. METHODS The proposed DDDG-GAN model consists of dual generators and discriminators. Each generator specializes in benign or malignant nodules, generating diverse, high-fidelity synthetic images for each class. This dual-generator setup prevents mode collapse. The dual-discriminator framework enhances the model's generalization capability, ensuring better performance on unseen data. Feature fusion techniques are incorporated to refine the model's discriminatory power between benign and malignant nodules. The model is evaluated in two scenarios: (1) training and testing on the LIDC-IDRI dataset and (2) training on LIDC-IDRI, testing on the unseen LUNA16 dataset and the unseen LUNGx dataset. RESULTS In Scenario 1, the DDDG-GAN achieved an accuracy of 92.56%, a precision of 90.12%, a recall of 95.87%, and an F1 score of 92.77%. In Scenario 2, the model demonstrated robust performance with an accuracy of 72.6%, a precision of 72.3%, a recall of 73.82%, and an F1 score of 73.39% when testing using Luna16 and an accuracy of 71.23%, a precision of 67.56%, a recall of 73.52%, and an F1 score of 70.42% when testing using LungX. The results indicate that the proposed model outperforms state-of-the-art semi-supervised learning approaches. CONCLUSIONS The DDDG-GAN model mitigates mode collapse and improves generalizability in lung nodule classification. It demonstrates superior performance on both the LIDC-IDRI and the unseen LUNA16 and LungX datasets, offering significant potential for improving diagnostic accuracy in clinical practice.
Collapse
Affiliation(s)
- Ahmed Saihood
- College of Computer Science and Mathematics, University of Thi-Qar, Thi Qar, Iraq
| | | | - Laith Alzubaid
- School of Mechanical, Medical, and Process Engineering, Queensland University of Technology, Brisbane, QLD, 4000, Australia.
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, 4000, Australia.
- Aptium Company, Brisbane, QLD, 4059, Australia.
| | - Mohamed Manoufali
- Space & Astronomy, Commonwealth Scientific Industrial Research Organisation (CSIRO), Kensington, WA, 6151, Australia
| | - Yuantong Gu
- School of Mechanical, Medical, and Process Engineering, Queensland University of Technology, Brisbane, QLD, 4000, Australia
| |
Collapse
|
22
|
Yang J, Henao JAG, Dvornek N, He J, Bower DV, Depotter A, Bajercius H, de Mortanges AP, You C, Gange C, Ledda RE, Silva M, Dela Cruz CS, Hautz W, Bonel HM, Reyes M, Staib LH, Poellinger A, Duncan JS. Prior knowledge-guided vision-transformer-based unsupervised domain adaptation for intubation prediction in lung disease at one week. Comput Med Imaging Graph 2024; 118:102442. [PMID: 39515190 DOI: 10.1016/j.compmedimag.2024.102442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 09/05/2024] [Accepted: 09/30/2024] [Indexed: 11/16/2024]
Abstract
Data-driven approaches have achieved great success in various medical image analysis tasks. However, fully-supervised data-driven approaches require unprecedentedly large amounts of labeled data and often suffer from poor generalization to unseen new data due to domain shifts. Various unsupervised domain adaptation (UDA) methods have been actively explored to solve these problems. Anatomical and spatial priors in medical imaging are common and have been incorporated into data-driven approaches to ease the need for labeled data as well as to achieve better generalization and interpretation. Inspired by the effectiveness of recent transformer-based methods in medical image analysis, the adaptability of transformer-based models has been investigated. How to incorporate prior knowledge for transformer-based UDA models remains under-explored. In this paper, we introduce a prior knowledge-guided and transformer-based unsupervised domain adaptation (PUDA) pipeline. It regularizes the vision transformer attention heads using anatomical and spatial prior information that is shared by both the source and target domain, which provides additional insight into the similarity between the underlying data distribution across domains. Besides the global alignment of class tokens, it assigns local weights to guide the token distribution alignment via adversarial training. We evaluate our proposed method on a clinical outcome prediction task, where Computed Tomography (CT) and Chest X-ray (CXR) data are collected and used to predict the intubation status of patients in a week. Abnormal lesions are regarded as anatomical and spatial prior information for this task and are annotated in the source domain scans. Extensive experiments show the effectiveness of the proposed PUDA method.
Collapse
Affiliation(s)
- Junlin Yang
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA.
| | | | - Nicha Dvornek
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA
| | - Jianchun He
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA
| | - Danielle V Bower
- Department of Diagnostic, Interventional, and Pediatric Radiology, Inselspital Bern, University of Bern, Bern, Switzerland
| | - Arno Depotter
- Department of Diagnostic, Interventional, and Pediatric Radiology, Inselspital Bern, University of Bern, Bern, Switzerland
| | - Herkus Bajercius
- Department of Diagnostic, Interventional, and Pediatric Radiology, Inselspital Bern, University of Bern, Bern, Switzerland
| | - Aurélie Pahud de Mortanges
- Department of Diagnostic, Interventional, and Pediatric Radiology, Inselspital Bern, University of Bern, Bern, Switzerland
| | - Chenyu You
- Department of Electrical Engineering, Yale University, New Haven, CT, USA
| | - Christopher Gange
- Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA
| | | | - Mario Silva
- Section of "Scienze Radiologiche," Diagnostic Department, University Hospital of Parma, Parma, Italy; Department of Medicine and Surgery, University of Parma, Italy
| | - Charles S Dela Cruz
- Section of Pulmonary, Critical Care, and Sleep Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Wolf Hautz
- Department of Emergency Medicine, Inselspital University Hospital, University of Bern, Bern, Switzerland
| | - Harald M Bonel
- Department of Diagnostic, Interventional, and Pediatric Radiology, Inselspital Bern, University of Bern, Bern, Switzerland; Campusradiologie, Department of Radiological Diagnostics, Lindenhofspital Bern, Bern, Switzerland; Campus Stiftung Lindenhof Bern, Bern, Switzerland
| | - Mauricio Reyes
- The ARTORG Center for Biomedical Research, University of Bern, Bern, Switzerland; Department of Radiation Oncology, Inselspital, Bern University Hospital, Bern, Switzerland
| | - Lawrence H Staib
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA; Department of Electrical Engineering, Yale University, New Haven, CT, USA
| | - Alexander Poellinger
- Department of Diagnostic, Interventional, and Pediatric Radiology, Inselspital Bern, University of Bern, Bern, Switzerland.
| | - James S Duncan
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA; Department of Electrical Engineering, Yale University, New Haven, CT, USA.
| |
Collapse
|
23
|
Wu Y, Liu S, Kapelan Z. Addressing data limitations in leakage detection of water distribution systems: Data creation, data requirement reduction, and knowledge transfer. WATER RESEARCH 2024; 267:122471. [PMID: 39305529 DOI: 10.1016/j.watres.2024.122471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 09/03/2024] [Accepted: 09/16/2024] [Indexed: 11/28/2024]
Abstract
Leakage in water distribution systems is a significant problem worldwide, leading to wastage of water resources, compromised water quality and excess energy consumption. Leakage detection is essential to reduce the duration of leaks and data-driven methods are increasingly being used for this purpose. However, these models are data hungry and available observed data, especially leakage data, is limited in most cases. In addition, these data need to be manually processed to label whether leaks occur, which is time-consuming and costly. These are significant obstacles for the development and application of these methods. This article provides a comprehensive review of relevant journal papers, categorizing all data-driven methods into unsupervised anomaly detection, semi-supervised anomaly detection and supervised classification methods based on how the data are utilized for developing these methods. In addition, strategies to address data limitations are summarized from both data and model perspectives, including data creation, reduction of a model's data requirements and knowledge transfer. After detailing these strategies, research gaps are identified. Based on these, future research directions are suggested, highlighting the need for further research in data augmentation, development of semi-supervised classification methods, exploration of multi-classification methods with model updating mechanisms, and development of novel knowledge transfer methods.
Collapse
Affiliation(s)
- Yipeng Wu
- School of Environment, Tsinghua University, 100084, Beijing, China; Faculty of Civil Engineering and Geosciences, Delft University of Technology, 2628 CN Delft, the Netherlands.
| | - Shuming Liu
- School of Environment, Tsinghua University, 100084, Beijing, China
| | - Zoran Kapelan
- Faculty of Civil Engineering and Geosciences, Delft University of Technology, 2628 CN Delft, the Netherlands
| |
Collapse
|
24
|
Qi Y, Hossain MS. Semi-supervised Federated Learning for Digital Twin 6G-enabled IIoT: A Bayesian estimated approach. J Adv Res 2024; 66:47-57. [PMID: 38417576 PMCID: PMC11675063 DOI: 10.1016/j.jare.2024.02.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 01/03/2024] [Accepted: 02/17/2024] [Indexed: 03/01/2024] Open
Abstract
INTRODUCTION In recent years, the proliferation of Industrial Internet of Things (IIoT) devices has resulted in a substantial increase in data generation across various domains, including the nascent 6G networks. Digital Twins (DTs), serving as virtual replicas of physical entities, have gained popularity within the realm of IoT due to their capacity to simulate and optimize physical systems in a cost-effective manner. Nonetheless, the security of DTs and the safeguarding of the sensitive data they generate have emerged as paramount concerns. Fortunately, the Federated Fearning (FL) system has emerged as a promising solution to address the challenge of data privacy within DTs. Nonetheless, the requisite acquisition of a significant volume of labeled data for training purposes poses a formidable challenge, particularly in a DT environment that blends real and virtual data. OBJECTIVES To tackle this challenge, this study presents an innovative Semi-supervised FL (SSFL) framework designed to overcome the scarcity of labeled data through the strategic utilization of pseudo-labels. METHODS Specifically, our proposed SSFL algorithm, named SSFL-MBE, introduces a novel approach by combining Mix data augmentation and Bayesian Estimation consistency regularization loss, thereby integrating robust augmentation techniques to enhance model generalization. Furthermore, we introduce a Bayesian-estimated pseudo-label loss that leverages prior probabilistic knowledge to enhance model performance. Our investigation focuses particularly on a demanding scenario where labeled and unlabeled data are segregated across disparate locations, specifically, the server and various clients. RESULTS Comprehensive evaluations conducted on CIFAR-10 and MNIST datasets conclusively demonstrate that our proposed algorithm consistently surpasses mainstream SSFL baseline models, exhibiting an enhancement in model performance ranging from 0.5% to 1.5%. CONCLUSION Overall, this work contributes to the development of more efficient and secure approaches for model training in DT-empowered FL settings, which is crucial for the deployment of IIoTs in 6G-enabled environments.
Collapse
Affiliation(s)
- Yuanhang Qi
- School of Computer Science, University of Electronic Science and Technology of China, Zhongshan Institute, Zhongshan 528402, China.
| | - M Shamim Hossain
- Department of Software Engineering, College of Computer and Information Sciences, King Saud University, Riyadh 12372, Saudi Arabia.
| |
Collapse
|
25
|
Li Y, Wang T, Xie J, Yang J, Pan T, Yang B. A simulation data-driven semi-supervised framework based on MK-KNN graph and ESSGAT for bearing fault diagnosis. ISA TRANSACTIONS 2024; 155:261-273. [PMID: 39366891 DOI: 10.1016/j.isatra.2024.09.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 09/24/2024] [Accepted: 09/25/2024] [Indexed: 10/06/2024]
Abstract
Current supervised intelligent fault diagnosis relies on abundant labeled data. However, collecting and labeling data are typically both expensive and time-consuming. Fault diagnosis with unlabeled data remains a significant challenge. To address this issue, a simulation data-driven semi-supervised framework based on multi-kernel K-nearest neighbor (MK-KNN) and edge self-supervised graph attention network (ESSGAT) is proposed. The novel MK-KNN establishes the neighborhood relationships between simulation data and real data. The developed multi-kernel function mitigates the risks of overfitting and underfitting, thereby enhancing the robustness of the simulation-real graphs. The designed ESSGAT employs two forms of self-supervised attention to predict the presence of edges, increasing the weights of crucial neighboring nodes in the MK-KNN graph. The performance of the proposed method is evaluated using a public bearing dataset and a self-constructed dataset of high-speed train axle box bearings. The results show that the proposed method achieves better diagnostic performance compared with other state-of-the-art graph construction methods and graph convolutional networks.
Collapse
Affiliation(s)
- Yuyan Li
- College of Traffic and Transportation Engineering, Central South University, Changsha 410075, China.
| | - Tiantian Wang
- College of Traffic and Transportation Engineering, Central South University, Changsha 410075, China; College of Mechanical and Vehicle Engineering, Hunan University, Changsha 410082, China.
| | - Jingsong Xie
- College of Traffic and Transportation Engineering, Central South University, Changsha 410075, China.
| | - Jinsong Yang
- College of Traffic and Transportation Engineering, Central South University, Changsha 410075, China.
| | - Tongyang Pan
- College of Traffic and Transportation Engineering, Central South University, Changsha 410075, China.
| | - Buyao Yang
- College of Mechanical and Vehicle Engineering, Hunan University, Changsha 410082, China.
| |
Collapse
|
26
|
Liang D, Liu A, Wu L, Li C, Qian R, Chen X. Privacy-preserving multi-source semi-supervised domain adaptation for seizure prediction. Cogn Neurodyn 2024; 18:3521-3534. [PMID: 39712093 PMCID: PMC11655995 DOI: 10.1007/s11571-023-10026-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 09/22/2023] [Accepted: 10/23/2023] [Indexed: 12/24/2024] Open
Abstract
Domain adaptation (DA) has been frequently used to solve the inter-patient variability problem in EEG-based seizure prediction. However, existing DA methods require access to the existing patients' data when adapting the model, which leads to privacy concerns. Besides, most of them treat the whole existing patients' data as one single source and attempt to minimize the discrepancy with the target patient. This manner ignores the inter-patient variability among source patients, making the adaptation more difficult. Considering theses issues simultaneously, we present a novel multi-source-free semi-supervised domain adaptive seizure prediction model (MSF-SSDA-SPM). MSF-SSDA-SPM considers each source patient as one single source and generates a pretrained model from each source. Without requiring access to the source data, MSF-SSDA-SPM performs adaptation just using these pretrained source models and limited labeled target data. Specifically, we freeze the classifiers of all the source models and optimize the source feature extractors in a joint manner. Then we design a knowledge distillation strategy to integrate the knowledge of these well-adapted source models into one single target model. On the CHB-MIT dataset, MSF-SSDA-SPM attains a sensitivity of 88.6%, a FPR of 0.182/h and an AUC of 0.856; on the Kaggle dataset, it achieves 78.6%, 0.178/h and 0.784, respectively. Experimental results demonstrate that MSF-SSDA-SPM achieves both high privacy-protection and promising prediction performance.
Collapse
Affiliation(s)
- Deng Liang
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
| | - Aiping Liu
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
| | - Le Wu
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
| | - Chang Li
- Department of Biomedical Engineering, Hefei University of Technology, Hefei, 230009 Anhui China
| | - Ruobing Qian
- Department of Neurosurgery, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230001 Anhui China
| | - Xun Chen
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
- Department of Neurosurgery, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230001 Anhui China
| |
Collapse
|
27
|
Rodu J, DeJong Lempke AF, Kupperman N, Hertel J. On Leveraging Machine Learning in Sport Science in the Hypothetico-deductive Framework. SPORTS MEDICINE - OPEN 2024; 10:124. [PMID: 39541034 PMCID: PMC11564444 DOI: 10.1186/s40798-024-00788-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 10/23/2024] [Indexed: 11/16/2024]
Abstract
Supervised machine learning (ML) offers an exciting suite of algorithms that could benefit research in sport science. In principle, supervised ML approaches were designed for pure prediction, as opposed to explanation, leading to a rise in powerful, but opaque, algorithms. Recently, two subdomains of ML-explainable ML, which allows us to "peek into the black box," and interpretable ML, which encourages using algorithms that are inherently interpretable-have grown in popularity. The increased transparency of these powerful ML algorithms may provide considerable support for the hypothetico-deductive framework, in which hypotheses are generated from prior beliefs and theory, and are assessed against data collected specifically to test that hypothesis. However, this paper shows why ML algorithms are fundamentally different from statistical methods, even when using explainable or interpretable approaches. Translating potential insights from supervised ML algorithms, while in many cases seemingly straightforward, can have unanticipated challenges. While supervised ML cannot be used to replace statistical methods, we propose ways in which the sport sciences community can take advantage of supervised ML in the hypothetico-deductive framework. In this manuscript we argue that supervised machine learning can and should augment our exploratory investigations in sport science, but that leveraging potential insights from supervised ML algorithms should be undertaken with caution. We justify our position through a careful examination of supervised machine learning, and provide a useful analogy to help elucidate our findings. Three case studies are provided to demonstrate how supervised machine learning can be integrated into exploratory analysis. Supervised machine learning should be integrated into the scientific workflow with requisite caution. The approaches described in this paper provide ways to safely leverage the strengths of machine learning-like the flexibility ML algorithms can provide for fitting complex patterns-while avoiding potential pitfalls-at best, like wasted effort and money, and at worst, like misguided clinical recommendations-that may arise when trying to integrate findings from ML algorithms into domain knowledge. KEY POINTS: Some supervised machine learning algorithms and statistical models are used to solve the same problem, y = f(x) + ε, but differ fundamentally in motivation and approach. The hypothetico-deductive framework-in which hypotheses are generated from prior beliefs and theory, and are assessed against data collected specifically to test that hypothesis-is one of the core frameworks comprising the scientific method. In the hypothetico-deductive framework, supervised machine learning can be used in an exploratory capacity. However, it cannot replace the use of statistical methods, even as explainable and interpretable machine learning methods become increasingly popular. Improper use of supervised machine learning in the hypothetico-deductive framework is tantamount to p-value hacking in statistical methods.
Collapse
Affiliation(s)
- Jordan Rodu
- Department of Statistics, University of Virginia, Charlottesville, VA, USA.
| | - Alexandra F DeJong Lempke
- Department of Physical Medicine and Rehabilitation, School of Medicine, Virginia Commonwealth University, Richmond, VA, USA
| | - Natalie Kupperman
- School of Data Science, University of Virginia, Charlottesville, VA, USA
| | - Jay Hertel
- Department of Kinesiology, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
28
|
Behera SK, Karthika S, Mahanty B, Meher SK, Zafar M, Baskaran D, Rajamanickam R, Das R, Pakshirajan K, Bilyaminu AM, Rene ER. Application of artificial intelligence tools in wastewater and waste gas treatment systems: Recent advances and prospects. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 370:122386. [PMID: 39260284 DOI: 10.1016/j.jenvman.2024.122386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 08/17/2024] [Accepted: 08/31/2024] [Indexed: 09/13/2024]
Abstract
The non-linear complex relationships among the process variables in wastewater and waste gas treatment systems possess a significant challenge for real-time systems modelling. Data driven artificial intelligence (AI) tools are increasingly being adopted to predict the process performance, cost-effective process monitoring, and the control of different waste treatment systems, including those involving resource recovery. This review presents an in-depth analysis of the applications of emerging AI tools in physico-chemical and biological processes for the treatment of air pollutants, water and wastewater, and resource recovery processes. Additionally, the successful implementation of AI-controlled wastewater and waste gas treatment systems, along with real-time monitoring at the industrial scale are discussed.
Collapse
Affiliation(s)
- Shishir Kumar Behera
- Process Simulation Research Group, School of Chemical Engineering, Vellore Institute of Technology, Vellore, 632 014, Tamil Nadu, India.
| | - S Karthika
- Department of Chemical Engineering, Alagappa College of Technology, Anna University, Chennai, 600 025, Tamil Nadu, India
| | - Biswanath Mahanty
- Division of Biotechnology, Karunya Institute of Technology & Sciences, Coimbatore, 641 114, Tamil Nadu, India
| | - Saroj K Meher
- Systems Science and Informatics Unit, Indian Statistical Institute, Bangalore, 560059, India
| | - Mohd Zafar
- Department of Applied Biotechnology, College of Applied Sciences & Pharmacy, University of Technology and Applied Sciences - Sur, P.O. Box: 484, Zip Code: 411, Sur, Oman
| | - Divya Baskaran
- Department of Chemical and Biomolecular Engineering, Chonnam National University, Yeosu, Jeonnam, 59626, South Korea; Department of Biomaterials, Saveetha Dental College and Hospitals, Saveetha Institute of Medical and Technical Sciences, Chennai, 600 077, Tamil Nadu, India
| | - Ravi Rajamanickam
- Department of Chemical Engineering, Annamalai University, Chidambaram, 608002, Tamil Nadu, India
| | - Raja Das
- Department of Mathematics, School of Advanced Sciences, Vellore Institute of Technology, Vellore, 632 014, Tamil Nadu, India
| | - Kannan Pakshirajan
- Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Guwahati, 781 039, Assam, India
| | - Abubakar M Bilyaminu
- Department of Water Supply, Sanitation and Environmental Engineering, IHE Delft Institute for Water Education, P. O. Box 3015, 2601, DA Delft, the Netherlands
| | - Eldon R Rene
- Department of Water Supply, Sanitation and Environmental Engineering, IHE Delft Institute for Water Education, P. O. Box 3015, 2601, DA Delft, the Netherlands
| |
Collapse
|
29
|
Bröker F, Holt LL, Roads BD, Dayan P, Love BC. Demystifying unsupervised learning: how it helps and hurts. Trends Cogn Sci 2024; 28:974-986. [PMID: 39353836 DOI: 10.1016/j.tics.2024.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 09/06/2024] [Accepted: 09/09/2024] [Indexed: 10/04/2024]
Abstract
Humans and machines rarely have access to explicit external feedback or supervision, yet manage to learn. Most modern machine learning systems succeed because they benefit from unsupervised data. Humans are also expected to benefit and yet, mysteriously, empirical results are mixed. Does unsupervised learning help humans or not? Here, we argue that the mixed results are not conflicting answers to this question, but reflect that humans self-reinforce their predictions in the absence of supervision, which can help or hurt depending on whether predictions and task align. We use this framework to synthesize empirical results across various domains to clarify when unsupervised learning will help or hurt. This provides new insights into the fundamentals of learning with implications for instruction and lifelong learning.
Collapse
Affiliation(s)
- Franziska Bröker
- Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Tübingen, Germany; Gatsby Computational Neuroscience Unit, University College London, London, UK; Department of Psychology, Carnegie Mellon University, Pittsburgh, PA, USA; Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA.
| | - Lori L Holt
- Department of Psychology, University of Texas at Austin, Austin, TX, US
| | - Brett D Roads
- Department of Experimental Psychology, University College London, London, UK
| | - Peter Dayan
- Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Tübingen, Germany; University of Tübingen, Tübingen, Germany
| | - Bradley C Love
- Department of Experimental Psychology, University College London, London, UK
| |
Collapse
|
30
|
Li L, Xiao K, Shang X, Hu W, Yusufu M, Chen R, Wang Y, Liu J, Lai T, Guo L, Zou J, van Wijngaarden P, Ge Z, He M, Zhu Z. Advances in artificial intelligence for meibomian gland evaluation: A comprehensive review. Surv Ophthalmol 2024; 69:945-956. [PMID: 39025239 DOI: 10.1016/j.survophthal.2024.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 07/14/2024] [Accepted: 07/15/2024] [Indexed: 07/20/2024]
Abstract
Meibomian gland dysfunction (MGD) is increasingly recognized as a critical contributor to evaporative dry eye, significantly impacting visual quality. With a global prevalence estimated at 35.8 %, it presents substantial challenges for clinicians. Conventional manual evaluation techniques for MGD face limitations characterized by inefficiencies, high subjectivity, limited big data processing capabilities, and a dearth of quantitative analytical tools. With rapidly advancing artificial intelligence (AI) techniques revolutionizing ophthalmology, studies are now leveraging sophisticated AI methodologies--including computer vision, unsupervised learning, and supervised learning--to facilitate comprehensive analyses of meibomian gland (MG) evaluations. These evaluations employ various techniques, including slit lamp examination, infrared imaging, confocal microscopy, and optical coherence tomography. This paradigm shift promises enhanced accuracy and consistency in disease evaluation and severity classification. While AI has achieved preliminary strides in meibomian gland evaluation, ongoing advancements in system development and clinical validation are imperative. We review the evolution of MG evaluation, juxtapose AI-driven methods with traditional approaches, elucidate the specific roles of diverse AI technologies, and explore their practical applications using various evaluation techniques. Moreover, we delve into critical considerations for the clinical deployment of AI technologies and envisages future prospects, providing novel insights into MG evaluation and fostering technological and clinical progress in this arena.
Collapse
Affiliation(s)
- Li Li
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia; Shengli Clinical Medical College of Fujian Medical University, Fuzhou University Affiliated Provincial Hospital, Fuzhou, China
| | - Kunhong Xiao
- Department of Ophthalmology and Optometry, Fujian Medical University, Fuzhou, China
| | - Xianwen Shang
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia
| | - Wenyi Hu
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia
| | - Mayinuer Yusufu
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia
| | - Ruiye Chen
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia
| | - Yujie Wang
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia
| | - Jiahao Liu
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia
| | - Taichen Lai
- Department of Clinical Medicine, Fujian Medical University, Fuzhou, China
| | - Linling Guo
- Department of Clinical Medicine, Fujian Medical University, Fuzhou, China
| | - Jing Zou
- Department of Clinical Medicine, Fujian Medical University, Fuzhou, China
| | - Peter van Wijngaarden
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia
| | - Zongyuan Ge
- The AIM for Health Lab, Faculty of IT, Monash University, Australia
| | - Mingguang He
- School of Optometry, The Hong Kong Polytechnic University, Hong Kong Special administrative regions of China; Research Centre for SHARP Vision, The Hong Kong Polytechnic University, Kowloon, Hong Kong Special administrative regions of China.
| | - Zhuoting Zhu
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia.
| |
Collapse
|
31
|
Rouzbahani AK, Khalili-Tanha G, Rajabloo Y, Khojasteh-Leylakoohi F, Garjan HS, Nazari E, Avan A. Machine learning algorithms and biomarkers identification for pancreatic cancer diagnosis using multi-omics data integration. Pathol Res Pract 2024; 263:155602. [PMID: 39357184 DOI: 10.1016/j.prp.2024.155602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Revised: 09/21/2024] [Accepted: 09/24/2024] [Indexed: 10/04/2024]
Abstract
PURPOSE Pancreatic cancer is a lethal type of cancer with most of the cases being diagnosed in an advanced stage and poor prognosis. Developing new diagnostic and prognostic markers for pancreatic cancer can significantly improve early detection and patient outcomes. These biomarkers can potentially revolutionize medical practice by enabling personalized, more effective, targeted treatments, ultimately improving patient outcomes. METHODS The search strategy was developed following PRISMA guidelines. A comprehensive search was performed across four electronic databases: PubMed, Scopus, EMBASE, and Web of Science, covering all English publications up to September 2022. The Newcastle-Ottawa Scale (NOS) was utilized to assess bias, categorizing studies as "good," "fair," or "poor" quality based on their NOS scores. Descriptive statistics for all included studies were compiled and reviewed, along with the NOS scores for each study to indicate their quality assessment. RESULTS Our results showed that SVM and RF are the most widely used algorithms in machine learning and data analysis, particularly for biomarker identification. SVM, a supervised learning algorithm, is employed for both classification and regression by mapping data points in high-dimensional space to identify the optimal separating hyperplane between classes. CONCLUSIONS The application of machine-learning algorithms in the search for novel biomarkers in pancreatic cancer represents a significant advancement in the field. By harnessing the power of artificial intelligence, researchers are poised to make strides towards earlier detection and more effective treatment, ultimately improving patient outcomes in this challenging disease.
Collapse
Affiliation(s)
- Arian Karimi Rouzbahani
- Student Research Committee, Lorestan University of Medical Sciences, Khorramabad, Iran; USERN Office, Lorestan University of Medical Sciences, Khorramabad, Iran
| | - Ghazaleh Khalili-Tanha
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Medical Genetics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Yasamin Rajabloo
- Student Research Committee, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | | | - Hassan Shokri Garjan
- Department of Health Information Technology, School of Management University of Medical Sciences, Tabriz, Iran
| | - Elham Nazari
- Proteomics Research Center, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Amir Avan
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Medical Genetics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Basic Sciences Research Institute, Mashhad University of Medical Sciences, Mashhad, Iran.
| |
Collapse
|
32
|
Huang W, Zhang L, Wang Z, Wang L. Exploring Inherent Consistency for Semi-Supervised Anatomical Structure Segmentation in Medical Imaging. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:3731-3741. [PMID: 38743533 DOI: 10.1109/tmi.2024.3400840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Due to the exorbitant expense of obtaining labeled data in the field of medical image analysis, semi-supervised learning has emerged as a favorable method for the segmentation of anatomical structures. Although semi-supervised learning techniques have shown great potential in this field, existing methods only utilize image-level spatial consistency to impose unsupervised regularization on data in label space. Considering that anatomical structures often possess inherent anatomical properties that have not been focused on in previous works, this study introduces the inherent consistency into semi-supervised anatomical structure segmentation. First, the prediction and the ground-truth are projected into an embedding space to obtain latent representations that encapsulate the inherent anatomical properties of the structures. Then, two inherent consistency constraints are designed to leverage these inherent properties by aligning these latent representations. The proposed method is plug-and-play and can be seamlessly integrated with existing methods, thereby collaborating to improve segmentation performance and enhance the anatomical plausibility of the results. To evaluate the effectiveness of the proposed method, experiments are conducted on three public datasets (ACDC, LA, and Pancreas). Extensive experimental results demonstrate that the proposed method exhibits good generalizability and outperforms several state-of-the-art methods.
Collapse
|
33
|
Košprdić M, Prodanović N, Ljajić A, Bašaragin B, Milošević N. From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts. Artif Intell Med 2024; 156:102970. [PMID: 39197375 DOI: 10.1016/j.artmed.2024.102970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 08/23/2024] [Accepted: 08/23/2024] [Indexed: 09/01/2024]
Abstract
Supervised named entity recognition (NER) in the biomedical domain depends on large sets of annotated texts with the given named entities. The creation of such datasets can be time-consuming and expensive, while extraction of new entities requires additional annotation tasks and retraining the model. This paper proposes a method for zero- and few-shot NER in the biomedical domain to address these challenges. The method is based on transforming the task of multi-class token classification into binary token classification and pre-training on a large number of datasets and biomedical entities, which allows the model to learn semantic relations between the given and potentially novel named entity labels. We have achieved average F1 scores of 35.44% for zero-shot NER, 50.10% for one-shot NER, 69.94% for 10-shot NER, and 79.51% for 100-shot NER on 9 diverse evaluated biomedical entities with fine-tuned PubMedBERT-based model. The results demonstrate the effectiveness of the proposed method for recognizing new biomedical entities with no or limited number of examples, outperforming previous transformer-based methods, and being comparable to GPT3-based models using models with over 1000 times fewer parameters. We make models and developed code publicly available.
Collapse
Affiliation(s)
- Miloš Košprdić
- Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia
| | - Nikola Prodanović
- Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia
| | - Adela Ljajić
- Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia
| | - Bojana Bašaragin
- Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia
| | - Nikola Milošević
- Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia; Bayer A.G., Research and Development, Mullerstrasse 173, Berlin, 13342, Germany.
| |
Collapse
|
34
|
Li J, Pan Y, Tsang IW. Taming Overconfident Prediction on Unlabeled Data From Hindsight. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:14151-14163. [PMID: 37220056 DOI: 10.1109/tnnls.2023.3274845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Minimizing prediction uncertainty on unlabeled data is a key factor to achieve good performance in semi-supervised learning (SSL). The prediction uncertainty is typically expressed as the entropy computed by the transformed probabilities in output space. Most existing works distill low-entropy prediction by either accepting the determining class (with the largest probability) as the true label or suppressing subtle predictions (with the smaller probabilities). Unarguably, these distillation strategies are usually heuristic and less informative for model training. From this discernment, this article proposes a dual mechanism, named adaptive sharpening (ADS), which first applies a soft-threshold to adaptively mask out determinate and negligible predictions, and then seamlessly sharpens the informed predictions, distilling certain predictions with the informed ones only. More importantly, we theoretically analyze the traits of ADS by comparing it with various distillation strategies. Numerous experiments verify that ADS significantly improves state-of-the-art SSL methods by making it a plug-in. Our proposed ADS forges a cornerstone for future distillation-based SSL research.
Collapse
|
35
|
Sadeghi P, Karimi H, Lavafian A, Rashedi R, Samieefar N, Shafiekhani S, Rezaei N. Machine learning and artificial intelligence within pediatric autoimmune diseases: applications, challenges, future perspective. Expert Rev Clin Immunol 2024; 20:1219-1236. [PMID: 38771915 DOI: 10.1080/1744666x.2024.2359019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Accepted: 05/20/2024] [Indexed: 05/23/2024]
Abstract
INTRODUCTION Autoimmune disorders affect 4.5% to 9.4% of children, significantly reducing their quality of life. The diagnosis and prognosis of autoimmune diseases are uncertain because of the variety of onset and development. Machine learning can identify clinically relevant patterns from vast amounts of data. Hence, its introduction has been beneficial in the diagnosis and management of patients. AREAS COVERED This narrative review was conducted through searching various electronic databases, including PubMed, Scopus, and Web of Science. This study thoroughly explores the current knowledge and identifies the remaining gaps in the applications of machine learning specifically in the context of pediatric autoimmune and related diseases. EXPERT OPINION Machine learning algorithms have the potential to completely change how pediatric autoimmune disorders are identified, treated, and managed. Machine learning can assist physicians in making more precise and fast judgments, identifying new biomarkers and therapeutic targets, and personalizing treatment strategies for each patient by utilizing massive datasets and powerful analytics.
Collapse
Affiliation(s)
- Parniyan Sadeghi
- Network of Interdisciplinarity in Neonates and Infants (NINI), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- Student Research Committee, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Hanie Karimi
- Network of Interdisciplinarity in Neonates and Infants (NINI), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Atiye Lavafian
- Network of Interdisciplinarity in Neonates and Infants (NINI), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- School of Medicine, Semnan University of Medical Science, Semnan, Iran
| | - Ronak Rashedi
- Network of Interdisciplinarity in Neonates and Infants (NINI), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- USERN Office, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Noosha Samieefar
- Network of Interdisciplinarity in Neonates and Infants (NINI), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- USERN Office, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Sajad Shafiekhani
- Department of Biomedical Engineering, Buein Zahra Technical University, Qazvin, Iran
| | - Nima Rezaei
- Network of Interdisciplinarity in Neonates and Infants (NINI), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- Research Center for Immunodeficiencies, Children's Medical Center, Tehran University of Medical Sciences, Tehran, Iran
- Department of Immunology, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
36
|
Shinde A, Shahra EQ, Basurra S, Saeed F, AlSewari AA, Jabbar WA. SMS Scam Detection Application Based on Optical Character Recognition for Image Data Using Unsupervised and Deep Semi-Supervised Learning. SENSORS (BASEL, SWITZERLAND) 2024; 24:6084. [PMID: 39338829 PMCID: PMC11435858 DOI: 10.3390/s24186084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 09/11/2024] [Accepted: 09/16/2024] [Indexed: 09/30/2024]
Abstract
The growing problem of unsolicited text messages (smishing) and data irregularities necessitates stronger spam detection solutions. This paper explores the development of a sophisticated model designed to identify smishing messages by understanding the complex relationships among words, images, and context-specific factors, areas that remain underexplored in existing research. To address this, we merge a UCI spam dataset of regular text messages with real-world spam data, leveraging OCR technology for comprehensive analysis. The study employs a combination of traditional machine learning models, including K-means, Non-Negative Matrix Factorization, and Gaussian Mixture Models, along with feature extraction techniques such as TF-IDF and PCA. Additionally, deep learning models like RNN-Flatten, LSTM, and Bi-LSTM are utilized. The selection of these models is driven by their complementary strengths in capturing both the linear and non-linear relationships inherent in smishing messages. Machine learning models are chosen for their efficiency in handling structured text data, while deep learning models are selected for their superior ability to capture sequential dependencies and contextual nuances. The performance of these models is rigorously evaluated using metrics like accuracy, precision, recall, and F1 score, enabling a comparative analysis between the machine learning and deep learning approaches. Notably, the K-means feature extraction with vectorizer achieved 91.01% accuracy, and the KNN-Flatten model reached 94.13% accuracy, emerging as the top performer. The rationale behind highlighting these models is their potential to significantly improve smishing detection rates. For instance, the high accuracy of the KNN-Flatten model suggests its applicability in real-time spam detection systems, but its computational complexity might limit scalability in large-scale deployments. Similarly, while K-means with vectorizer excels in accuracy, it may struggle with the dynamic and evolving nature of smishing attacks, necessitating continual retraining.
Collapse
Affiliation(s)
| | - Essa Q. Shahra
- Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham B4 7RQ, UK; (A.S.); (S.B.); (F.S.); (A.A.A.)
| | | | | | | | - Waheb A. Jabbar
- Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham B4 7RQ, UK; (A.S.); (S.B.); (F.S.); (A.A.A.)
| |
Collapse
|
37
|
Lee D, Hwang W, Byun J, Shin B. Turbocharging protein binding site prediction with geometric attention, inter-resolution transfer learning, and homology-based augmentation. BMC Bioinformatics 2024; 25:306. [PMID: 39304807 DOI: 10.1186/s12859-024-05923-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 09/05/2024] [Indexed: 09/22/2024] Open
Abstract
BACKGROUND Locating small molecule binding sites in target proteins, in the resolution of either pocket or residue, is critical in many drug-discovery scenarios. Since it is not always easy to find such binding sites using conventional methods, different deep learning methods to predict binding sites out of protein structures have been developed in recent years. The existing deep learning based methods have several limitations, including (1) the inefficiency of the CNN-only architecture, (2) loss of information due to excessive post-processing, and (3) the under-utilization of available data sources. METHODS We present a new model architecture and training method that resolves the aforementioned problems. First, by layering geometric self-attention units on top of residue-level 3D CNN outputs, our model overcomes the problems of CNN-only architectures. Second, by configuring the fundamental units of computation as residues and pockets instead of voxels, our method reduced the information loss from post-processing. Lastly, by employing inter-resolution transfer learning and homology-based augmentation, our method maximizes the utilization of available data sources to a significant extent. RESULTS The proposed method significantly outperformed all state-of-the-art baselines regarding both resolutions-pocket and residue. An ablation study demonstrated the indispensability of our proposed architecture, as well as transfer learning and homology-based augmentation, for achieving optimal performance. We further scrutinized our model's performance through a case study involving human serum albumin, which demonstrated our model's superior capability in identifying multiple binding sites of the protein, outperforming the existing methods. CONCLUSIONS We believe that our contribution to the literature is twofold. Firstly, we introduce a novel computational method for binding site prediction with practical applications, substantiated by its strong performance across diverse benchmarks and case studies. Secondly, the innovative aspects in our method- specifically, the design of the model architecture, inter-resolution transfer learning, and homology-based augmentation-would serve as useful components for future work.
Collapse
Affiliation(s)
| | | | | | - Bonggun Shin
- Deargen, Seoul, Republic of Korea.
- SK Life Science, Inc., Paramus, NJ, USA.
| |
Collapse
|
38
|
Chakraborty C, Bhattacharya M, Lee SS, Wen ZH, Lo YH. The changing scenario of drug discovery using AI to deep learning: Recent advancement, success stories, collaborations, and challenges. MOLECULAR THERAPY. NUCLEIC ACIDS 2024; 35:102295. [PMID: 39257717 PMCID: PMC11386122 DOI: 10.1016/j.omtn.2024.102295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Due to the transformation of artificial intelligence (AI) tools and technologies, AI-driven drug discovery has come to the forefront. It reduces the time and expenditure. Due to these advantages, pharmaceutical industries are concentrating on AI-driven drug discovery. Several drug molecules have been discovered using AI-based techniques and tools, and several newly AI-discovered drug molecules have already entered clinical trials. In this review, we first present the data and their resources in the pharmaceutical sector for AI-driven drug discovery and illustrated some significant algorithms or techniques used for AI and ML which are used in this field. We gave an overview of the deep neural network (NN) models and compared them with artificial NNs. Then, we illustrate the recent advancement of the landscape of drug discovery using AI to deep learning, such as the identification of drug targets, prediction of their structure, estimation of drug-target interaction, estimation of drug-target binding affinity, design of de novo drug, prediction of drug toxicity, estimation of absorption, distribution, metabolism, excretion, toxicity; and estimation of drug-drug interaction. Moreover, we highlighted the success stories of AI-driven drug discovery and discussed several collaboration and the challenges in this area. The discussions in the article will enrich the pharmaceutical industry.
Collapse
Affiliation(s)
- Chiranjib Chakraborty
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal 700126, India
| | - Manojit Bhattacharya
- Department of Zoology, Fakir Mohan University, Vyasa Vihar, Balasore, Odisha 756020, India
| | - Sang-Soo Lee
- Institute for Skeletal Aging & Orthopedic Surgery, Hallym University-Chuncheon Sacred Heart Hospital, Chuncheon, Gangwon-Do 24252, Republic of Korea
| | - Zhi-Hong Wen
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung 80424, Taiwan
| | - Yi-Hao Lo
- Department of Family Medicine, Zuoying Armed Forces General Hospital, Kaohsiung 813204, Taiwan
- Shu-Zen Junior College of Medicine and Management, Kaohsiung 821004, Taiwan
- Institute of Medical Science and Technology, National Sun Yat-sen University, Kaohsiung 804201, Taiwan
| |
Collapse
|
39
|
Agarwal P, Mathur V, Kasturi M, Srinivasan V, Seetharam RN, S Vasanthan K. A Futuristic Development in 3D Printing Technique Using Nanomaterials with a Step Toward 4D Printing. ACS OMEGA 2024; 9:37445-37458. [PMID: 39281933 PMCID: PMC11391532 DOI: 10.1021/acsomega.4c04123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 07/27/2024] [Accepted: 08/06/2024] [Indexed: 09/18/2024]
Abstract
3D bioprinting has shown great promise in tissue engineering and regenerative medicine for creating patient-specific tissue scaffolds and medicinal devices. The quickness, accurate imaging, and design targeting of this emerging technology have excited biomedical engineers and translational medicine researchers. Recently, scaffolds made from 3D bioprinted tissue have become more clinically effective due to nanomaterials and nanotechnology. Because of quantum confinement effects and high surface area/volume ratios, nanomaterials and nanotechnological techniques have unique physical, chemical, and biological features. The use of nanomaterials and 3D bioprinting has led to scaffolds with improved physicochemical and biological properties. Nanotechnology and nanomaterials affect 3D bioprinted tissue engineered scaffolds for regenerative medicine and tissue engineering. Biomaterials and cells that respond to stimuli change the structural shape in 4D bioprinting. With such dynamic designs, tissue architecture can change morphologically. New 4D bioprinting techniques will aid in bioactuation, biorobotics, and biosensing. The potential of 4D bioprinting in biomedical technologies is also discussed in this article.
Collapse
Affiliation(s)
- Prachi Agarwal
- Manipal Centre for Biotherapeutics Research, Manipal Academy of Higher Education, Karnataka, Manipal 576104, India
| | - Vidhi Mathur
- Manipal Centre for Biotherapeutics Research, Manipal Academy of Higher Education, Karnataka, Manipal 576104, India
| | - Meghana Kasturi
- Department of Mechanical Engineering, University of Michigan, Dearborn, Michigan 48128, United States
| | - Varadharajan Srinivasan
- Manipal Institute of Technology, Manipal Academy of Higher Education, Karnataka, Manipal 576104, India
| | - Raviraja N Seetharam
- Manipal Centre for Biotherapeutics Research, Manipal Academy of Higher Education, Karnataka, Manipal 576104, India
| | - Kirthanashri S Vasanthan
- Manipal Centre for Biotherapeutics Research, Manipal Academy of Higher Education, Karnataka, Manipal 576104, India
| |
Collapse
|
40
|
Torrik A, Zarif M. Machine learning assisted sorting of active microswimmers. J Chem Phys 2024; 161:094907. [PMID: 39225539 DOI: 10.1063/5.0216862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Accepted: 08/19/2024] [Indexed: 09/04/2024] Open
Abstract
Active matter systems, being in a non-equilibrium state, exhibit complex behaviors, such as self-organization, giving rise to emergent phenomena. There are many examples of active particles with biological origins, including bacteria and spermatozoa, or with artificial origins, such as self-propelled swimmers and Janus particles. The ability to manipulate active particles is vital for their effective application, e.g., separating motile spermatozoa from nonmotile and dead ones, to increase fertilization chance. In this study, we proposed a mechanism-an apparatus-to sort and demix active particles based on their motility values (Péclet number). Initially, using Brownian simulations, we demonstrated the feasibility of sorting self-propelled particles. Following this, we employed machine learning methods, supplemented with data from comprehensive simulations that we conducted for this study, to model the complex behavior of active particles. This enabled us to sort them based on their Péclet number. Finally, we evaluated the performance of the developed models and showed their effectiveness in demixing and sorting the active particles. Our findings can find applications in various fields, including physics, biology, and biomedical science, where the sorting and manipulation of active particles play a pivotal role.
Collapse
Affiliation(s)
- Abdolhalim Torrik
- Department of Physical and Computational Chemistry, Shahid Beheshti University, Tehran 19839-9411, Iran
| | - Mahdi Zarif
- Department of Physical and Computational Chemistry, Shahid Beheshti University, Tehran 19839-9411, Iran
| |
Collapse
|
41
|
Hao J, Wong LM, Shan Z, Ai QYH, Shi X, Tsoi JKH, Hung KF. A Semi-Supervised Transformer-Based Deep Learning Framework for Automated Tooth Segmentation and Identification on Panoramic Radiographs. Diagnostics (Basel) 2024; 14:1948. [PMID: 39272733 PMCID: PMC11394203 DOI: 10.3390/diagnostics14171948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 08/26/2024] [Accepted: 08/27/2024] [Indexed: 09/15/2024] Open
Abstract
Automated tooth segmentation and identification on dental radiographs are crucial steps in establishing digital dental workflows. While deep learning networks have been developed for these tasks, their performance has been inferior in partially edentulous individuals. This study proposes a novel semi-supervised Transformer-based framework (SemiTNet), specifically designed to improve tooth segmentation and identification performance on panoramic radiographs, particularly in partially edentulous cases, and establish an open-source dataset to serve as a unified benchmark. A total of 16,317 panoramic radiographs (1589 labeled and 14,728 unlabeled images) were collected from various datasets to create a large-scale dataset (TSI15k). The labeled images were divided into training and test sets at a 7:1 ratio, while the unlabeled images were used for semi-supervised learning. The SemiTNet was developed using a semi-supervised learning method with a label-guided teacher-student knowledge distillation strategy, incorporating a Transformer-based architecture. The performance of SemiTNet was evaluated on the test set using the intersection over union (IoU), Dice coefficient, precision, recall, and F1 score, and compared with five state-of-the-art networks. Paired t-tests were performed to compare the evaluation metrics between SemiTNet and the other networks. SemiTNet outperformed other networks, achieving the highest accuracy for tooth segmentation and identification, while requiring minimal model size. SemiTNet's performance was near-perfect for fully dentate individuals (all metrics over 99.69%) and excellent for partially edentulous individuals (all metrics over 93%). In edentulous cases, SemiTNet obtained statistically significantly higher tooth identification performance than all other networks. The proposed SemiTNet outperformed previous high-complexity, state-of-the-art networks, particularly in partially edentulous cases. The established open-source TSI15k dataset could serve as a unified benchmark for future studies.
Collapse
Affiliation(s)
- Jing Hao
- Applied Oral Sciences and Community Dental Care, Faculty of Dentistry, The University of Hong Kong, Hong Kong SAR, China
| | - Lun M Wong
- Imaging and Interventional Radiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Zhiyi Shan
- Paediatric Dentistry and Orthodontics, Faculty of Dentistry, The University of Hong Kong, Hong Kong SAR, China
| | - Qi Yong H Ai
- Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China
| | - Xieqi Shi
- Section of Oral Maxillofacial Radiology, Department of Clinical Dentistry, University of Bergen, 5009 Bergen, Norway
| | - James Kit Hon Tsoi
- Applied Oral Sciences and Community Dental Care, Faculty of Dentistry, The University of Hong Kong, Hong Kong SAR, China
| | - Kuo Feng Hung
- Applied Oral Sciences and Community Dental Care, Faculty of Dentistry, The University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
42
|
Zhang Y, Chung ACS. Retinal Vessel Segmentation by a Transformer-U-Net Hybrid Model With Dual-Path Decoder. IEEE J Biomed Health Inform 2024; 28:5347-5359. [PMID: 38669172 DOI: 10.1109/jbhi.2024.3394151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2024]
Abstract
This paper introduces an effective and efficient framework for retinal vessel segmentation. First, we design a Transformer-CNN hybrid model in which a Transformer module is inserted inside the U-Net to capture long-range interactions. Second, we design a dual-path decoder in the U-Net framework, which contains two decoding paths for multi-task outputs. Specifically, we train the extra decoder to predict vessel skeletons as an auxiliary task which helps the model learn balanced features. The proposed framework, named as TSNet, not only achieves good performances in a fully supervised learning manner but also enables a rough skeleton annotation process. The annotators only need to roughly delineate vessel skeletons instead of giving precise pixel-wise vessel annotations. To learn with rough skeleton annotations plus a few precise vessel annotations, we propose a skeleton semi-supervised learning scheme. We adopt a mean teacher model to produce pseudo vessel annotations and conduct annotation correction for roughly labeled skeletons annotations. This learning scheme can achieve promising performance with fewer annotation efforts. We have evaluated TSNet through extensive experiments on five benchmarking datasets. Experimental results show that TSNet yields state-of-the-art performances on retinal vessel segmentation and provides an efficient training scheme in practice.
Collapse
|
43
|
Kohler M, Eisenbach M, Gross HM. Few-Shot Object Detection: A Comprehensive Survey. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:11958-11978. [PMID: 37067965 DOI: 10.1109/tnnls.2023.3265051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Humans are able to learn to recognize new objects even from a few examples. In contrast, training deep-learning-based object detectors requires huge amounts of annotated data. To avoid the need to acquire and annotate these huge amounts of data, few-shot object detection (FSOD) aims to learn from few object instances of new categories in the target domain. In this survey, we provide an overview of the state of the art in FSOD. We categorize approaches according to their training scheme and architectural layout. For each type of approach, we describe the general realization as well as concepts to improve the performance on novel categories. Whenever appropriate, we give short takeaways regarding these concepts in order to highlight the best ideas. Eventually, we introduce commonly used datasets and their evaluation protocols and analyze the reported benchmark results. As a result, we emphasize common challenges in evaluation and identify the most promising current trends in this emerging field of FSOD.
Collapse
|
44
|
Seinen TM, Kors JA, van Mulligen EM, Fridgeirsson EA, Verhamme KM, Rijnbeek PR. Using clinical text to refine unspecific condition codes in Dutch general practitioner EHR data. Int J Med Inform 2024; 189:105506. [PMID: 38820647 DOI: 10.1016/j.ijmedinf.2024.105506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Revised: 05/22/2024] [Accepted: 05/27/2024] [Indexed: 06/02/2024]
Abstract
OBJECTIVE Observational studies using electronic health record (EHR) databases often face challenges due to unspecific clinical codes that can obscure detailed medical information, hindering precise data analysis. In this study, we aimed to assess the feasibility of refining these unspecific condition codes into more specific codes in a Dutch general practitioner (GP) EHR database by leveraging the available clinical free text. METHODS We utilized three approaches for text classification-search queries, semi-supervised learning, and supervised learning-to improve the specificity of ten unspecific International Classification of Primary Care (ICPC-1) codes. Two text representations and three machine learning algorithms were evaluated for the (semi-)supervised models. Additionally, we measured the improvement achieved by the refinement process on all code occurrences in the database. RESULTS The classification models performed well for most codes. In general, no single classification approach consistently outperformed the others. However, there were variations in the relative performance of the classification approaches within each code and in the use of different text representations and machine learning algorithms. Class imbalance and limited training data affected the performance of the (semi-)supervised models, yet the simple search queries remained particularly effective. Ultimately, the developed models improved the specificity of over half of all the unspecific code occurrences in the database. CONCLUSIONS Our findings show the feasibility of using information from clinical text to improve the specificity of unspecific condition codes in observational healthcare databases, even with a limited range of machine-learning techniques and modest annotated training sets. Future work could investigate transfer learning, integration of structured data, alternative semi-supervised methods, and validation of models across healthcare settings. The improved level of detail enriches the interpretation of medical information and can benefit observational research and patient care.
Collapse
Affiliation(s)
- Tom M Seinen
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands.
| | - Jan A Kors
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Erik M van Mulligen
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Egill A Fridgeirsson
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Katia Mc Verhamme
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Peter R Rijnbeek
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| |
Collapse
|
45
|
Li J, Chen D, Yin X, Li Z. Performance evaluation of semi-supervised learning frameworks for multi-class weed detection. FRONTIERS IN PLANT SCIENCE 2024; 15:1396568. [PMID: 39228840 PMCID: PMC11369944 DOI: 10.3389/fpls.2024.1396568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 07/24/2024] [Indexed: 09/05/2024]
Abstract
Precision weed management (PWM), driven by machine vision and deep learning (DL) advancements, not only enhances agricultural product quality and optimizes crop yield but also provides a sustainable alternative to herbicide use. However, existing DL-based algorithms on weed detection are mainly developed based on supervised learning approaches, typically demanding large-scale datasets with manual-labeled annotations, which can be time-consuming and labor-intensive. As such, label-efficient learning methods, especially semi-supervised learning, have gained increased attention in the broader domain of computer vision and have demonstrated promising performance. These methods aim to utilize a small number of labeled data samples along with a great number of unlabeled samples to develop high-performing models comparable to the supervised learning counterpart trained on a large amount of labeled data samples. In this study, we assess the effectiveness of a semi-supervised learning framework for multi-class weed detection, employing two well-known object detection frameworks, namely FCOS (Fully Convolutional One-Stage Object Detection) and Faster-RCNN (Faster Region-based Convolutional Networks). Specifically, we evaluate a generalized student-teacher framework with an improved pseudo-label generation module to produce reliable pseudo-labels for the unlabeled data. To enhance generalization, an ensemble student network is employed to facilitate the training process. Experimental results show that the proposed approach is able to achieve approximately 76% and 96% detection accuracy as the supervised methods with only 10% of labeled data in CottonWeedDet3 and CottonWeedDet12, respectively. We offer access to the source code (https://github.com/JiajiaLi04/SemiWeeds), contributing a valuable resource for ongoing semi-supervised learning research in weed detection and beyond.
Collapse
Affiliation(s)
- Jiajia Li
- Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI, United States
| | - Dong Chen
- Environmental Institute, University of Virginia, Charlottesville, VA, United States
| | - Xunyuan Yin
- School of Chemical and Biomedical Engineering, Nanyang Technological University, Singapore, Singapore
| | - Zhaojian Li
- Department of Mechanical Engineering, Michigan State University, East Lansing, MI, United States
| |
Collapse
|
46
|
Sabiri B, Khtira A, El Asri B, Rhanoui M. Investigating Contrastive Pair Learning's Frontiers in Supervised, Semisupervised, and Self-Supervised Learning. J Imaging 2024; 10:196. [PMID: 39194985 DOI: 10.3390/jimaging10080196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 07/23/2024] [Accepted: 07/25/2024] [Indexed: 08/29/2024] Open
Abstract
In recent years, contrastive learning has been a highly favored method for self-supervised representation learning, which significantly improves the unsupervised training of deep image models. Self-supervised learning is a subset of unsupervised learning in which the learning process is supervised by creating pseudolabels from the data themselves. Using supervised final adjustments after unsupervised pretraining is one way to take the most valuable information from a vast collection of unlabeled data and teach from a small number of labeled instances. This study aims firstly to compare contrastive learning with other traditional learning models; secondly to demonstrate by experimental studies the superiority of contrastive learning during classification; thirdly to fine-tune performance using pretrained models and appropriate hyperparameter selection; and finally to address the challenge of using contrastive learning techniques to produce data representations with semantic meaning that are independent of irrelevant factors like position, lighting, and background. Relying on contrastive techniques, the model efficiently captures meaningful representations by discerning similarities and differences between modified copies of the same image. The proposed strategy, involving unsupervised pretraining followed by supervised fine-tuning, improves the robustness, accuracy, and knowledge extraction of deep image models. The results show that even with a modest 5% of data labeled, the semisupervised model achieves an accuracy of 57.72%. However, the use of supervised learning with a contrastive approach and careful hyperparameter tuning increases accuracy to 85.43%. Further adjustment of the hyperparameters resulted in an excellent accuracy of 88.70%.
Collapse
Affiliation(s)
- Bihi Sabiri
- IMS Team, ADMIR Laboratory, Rabat IT Center, ENSIAS, Mohammed V University in Rabat, Rabat 10000, Morocco
| | - Amal Khtira
- LASTIMI Laboratory, EST Salé, Mohammed V University in Rabat, Salé 11060, Morocco
| | - Bouchra El Asri
- IMS Team, ADMIR Laboratory, Rabat IT Center, ENSIAS, Mohammed V University in Rabat, Rabat 10000, Morocco
| | - Maryem Rhanoui
- Laboratory Health Systemic Process (P2S), UR4129, University Claude Bernard Lyon 1, University of Lyon, 69100 Lyon, France
- Meridian Team, LYRICA Laboratory, School of Information Sciences, Rabat 10100, Morocco
| |
Collapse
|
47
|
Ng WL, Goh GL, Goh GD, Ten JSJ, Yeong WY. Progress and Opportunities for Machine Learning in Materials and Processes of Additive Manufacturing. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2310006. [PMID: 38456831 DOI: 10.1002/adma.202310006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 03/01/2024] [Indexed: 03/09/2024]
Abstract
In recent years, there has been widespread adoption of machine learning (ML) technologies to unravel intricate relationships among diverse parameters in various additive manufacturing (AM) techniques. These ML models excel at recognizing complex patterns from extensive, well-curated datasets, thereby unveiling latent knowledge crucial for informed decision-making during the AM process. The collaborative synergy between ML and AM holds the potential to revolutionize the design and production of AM-printed parts. This review delves into the challenges and opportunities emerging at the intersection of these two dynamic fields. It provides a comprehensive analysis of the publication landscape for ML-related research in the field of AM, explores common ML applications in AM research (such as quality control, process optimization, design optimization, microstructure analysis, and material formulation), and concludes by presenting an outlook that underscores the utilization of advanced ML models, the development of emerging sensors, and ML applications in emerging AM-related fields. Notably, ML has garnered increased attention in AM due to its superior performance across various AM-related applications. It is envisioned that the integration of ML into AM processes will significantly enhance 3D printing capabilities across diverse AM-related research areas.
Collapse
Affiliation(s)
- Wei Long Ng
- Singapore Centre for 3D Printing, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
| | - Guo Liang Goh
- School of Mechanical and Aerospace Engineering, Nanyang Technological University, 50 Nanyang Ave, Singapore, 639798, Singapore
| | - Guo Dong Goh
- Singapore Institute of Manufacturing Technology (SIMTech), Agency for Science, Technology and Research (A*STAR), 5 CleanTech Loop #01-01, Singapore, 636732, Singapore
| | - Jyi Sheuan Jason Ten
- Singapore Institute of Manufacturing Technology (SIMTech), Agency for Science, Technology and Research (A*STAR), 5 CleanTech Loop #01-01, Singapore, 636732, Singapore
| | - Wai Yee Yeong
- Singapore Centre for 3D Printing, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
- School of Mechanical and Aerospace Engineering, Nanyang Technological University, 50 Nanyang Ave, Singapore, 639798, Singapore
| |
Collapse
|
48
|
Tran M, Truong S, Fernandes AFA, Kidd MT, Le N. CarcassFormer: an end-to-end transformer-based framework for simultaneous localization, segmentation and classification of poultry carcass defect. Poult Sci 2024; 103:103765. [PMID: 38925080 PMCID: PMC11255899 DOI: 10.1016/j.psj.2024.103765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 04/02/2024] [Accepted: 04/09/2024] [Indexed: 06/28/2024] Open
Abstract
In the food industry, assessing the quality of poultry carcasses during processing is a crucial step. This study proposes an effective approach for automating the assessment of carcass quality without requiring skilled labor or inspector involvement. The proposed system is based on machine learning (ML) and computer vision (CV) techniques, enabling automated defect detection and carcass quality assessment. To this end, an end-to-end framework called CarcassFormer is introduced. It is built upon a Transformer-based architecture designed to effectively extract visual representations while simultaneously detecting, segmenting, and classifying poultry carcass defects. Our proposed framework is capable of analyzing imperfections resulting from production and transport welfare issues, as well as processing plant stunner, scalder, picker, and other equipment malfunctions. To benchmark the framework, a dataset of 7,321 images was initially acquired, which contained both single and multiple carcasses per image. In this study, the performance of the CarcassFormer system is compared with other state-of-the-art (SOTA) approaches for both classification, detection, and segmentation tasks. Through extensive quantitative experiments, our framework consistently outperforms existing methods, demonstrating re- markable improvements across various evaluation metrics such as AP, AP@50, and AP@75. Furthermore, the qualitative results highlight the strengths of CarcassFormer in capturing fine details, including feathers, and accurately localizing and segmenting carcasses with high precision. To facilitate further research and collaboration, the source code and trained models will be made publicly available upon acceptance.
Collapse
Affiliation(s)
- Minh Tran
- Department of Computer Science and Computer Engineering, 1 University of Arkansas, Fayetteville, AR 72701, USA
| | - Sang Truong
- Department of Computer Science and Computer Engineering, 1 University of Arkansas, Fayetteville, AR 72701, USA
| | | | - Michael T Kidd
- Department of Poultry Science, Fayetteville, AR 72701, USA
| | - Ngan Le
- Department of Computer Science and Computer Engineering, 1 University of Arkansas, Fayetteville, AR 72701, USA.
| |
Collapse
|
49
|
Liu J, Du H, Huang L, Xie W, Liu K, Zhang X, Chen S, Zhang Y, Li D, Pan H. AI-Powered Microfluidics: Shaping the Future of Phenotypic Drug Discovery. ACS APPLIED MATERIALS & INTERFACES 2024; 16:38832-38851. [PMID: 39016521 DOI: 10.1021/acsami.4c07665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/18/2024]
Abstract
Phenotypic drug discovery (PDD), which involves harnessing biological systems directly to uncover effective drugs, has undergone a resurgence in recent years. The rapid advancement of artificial intelligence (AI) over the past few years presents numerous opportunities for augmenting phenotypic drug screening on microfluidic platforms, leveraging its predictive capabilities, data analysis, efficient data processing, etc. Microfluidics coupled with AI is poised to revolutionize the landscape of phenotypic drug discovery. By integrating advanced microfluidic platforms with AI algorithms, researchers can rapidly screen large libraries of compounds, identify novel drug candidates, and elucidate complex biological pathways with unprecedented speed and efficiency. This review provides an overview of recent advances and challenges in AI-based microfluidics and their applications in drug discovery. We discuss the synergistic combination of microfluidic systems for high-throughput screening and AI-driven analysis for phenotype characterization, drug-target interactions, and predictive modeling. In addition, we highlight the potential of AI-powered microfluidics to achieve an automated drug screening system. Overall, AI-powered microfluidics represents a promising approach to shaping the future of phenotypic drug discovery by enabling rapid, cost-effective, and accurate identification of therapeutically relevant compounds.
Collapse
Affiliation(s)
- Junchi Liu
- Department of Anesthesiology, The First Hospital of Jilin University, 71 Xinmin Street, Changchun 130012, China
| | - Hanze Du
- Department of Endocrinology, Key Laboratory of Endocrinology of National Health Commission, Translation Medicine Centre, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100730, China
- Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing 100730, China
| | - Lei Huang
- Jilin Provincial Key Laboratory of Tooth Development and Bone Remodeling, School and Hospital of Stomatology, Jilin University, 1500 Qinghua Road, Changchun 130012, China
| | - Wangni Xie
- Jilin Provincial Key Laboratory of Tooth Development and Bone Remodeling, School and Hospital of Stomatology, Jilin University, 1500 Qinghua Road, Changchun 130012, China
| | - Kexuan Liu
- Jilin Provincial Key Laboratory of Tooth Development and Bone Remodeling, School and Hospital of Stomatology, Jilin University, 1500 Qinghua Road, Changchun 130012, China
| | - Xue Zhang
- Jilin Provincial Key Laboratory of Tooth Development and Bone Remodeling, School and Hospital of Stomatology, Jilin University, 1500 Qinghua Road, Changchun 130012, China
| | - Shi Chen
- Department of Endocrinology, Key Laboratory of Endocrinology of National Health Commission, Translation Medicine Centre, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100730, China
- Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing 100730, China
| | - Yuan Zhang
- Department of Anesthesiology, The First Hospital of Jilin University, 71 Xinmin Street, Changchun 130012, China
| | - Daowei Li
- Jilin Provincial Key Laboratory of Tooth Development and Bone Remodeling, School and Hospital of Stomatology, Jilin University, 1500 Qinghua Road, Changchun 130012, China
| | - Hui Pan
- Department of Endocrinology, Key Laboratory of Endocrinology of National Health Commission, Translation Medicine Centre, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100730, China
- Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing 100730, China
| |
Collapse
|
50
|
Li L, Sun M, Wang J, Wan S. Multi-omics based artificial intelligence for cancer research. Adv Cancer Res 2024; 163:303-356. [PMID: 39271266 DOI: 10.1016/bs.acr.2024.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
With significant advancements of next generation sequencing technologies, large amounts of multi-omics data, including genomics, epigenomics, transcriptomics, proteomics, and metabolomics, have been accumulated, offering an unprecedented opportunity to explore the heterogeneity and complexity of cancer across various molecular levels and scales. One of the promising aspects of multi-omics lies in its capacity to offer a holistic view of the biological networks and pathways underpinning cancer, facilitating a deeper understanding of its development, progression, and response to treatment. However, the exponential growth of data generated by multi-omics studies present significant analytical challenges. Processing, analyzing, integrating, and interpreting these multi-omics datasets to extract meaningful insights is an ambitious task that stands at the forefront of current cancer research. The application of artificial intelligence (AI) has emerged as a powerful solution to these challenges, demonstrating exceptional capabilities in deciphering complex patterns and extracting valuable information from large-scale, intricate omics datasets. This review delves into the synergy of AI and multi-omics, highlighting its revolutionary impact on oncology. We dissect how this confluence is reshaping the landscape of cancer research and clinical practice, particularly in the realms of early detection, diagnosis, prognosis, treatment and pathology. Additionally, we elaborate the latest AI methods for multi-omics integration to provide a comprehensive insight of the complex biological mechanisms and inherent heterogeneity of cancer. Finally, we discuss the current challenges of data harmonization, algorithm interpretability, and ethical considerations. Addressing these challenges necessitates a multidisciplinary collaboration, paving the promising way for more precise, personalized, and effective treatments for cancer patients.
Collapse
Affiliation(s)
- Lusheng Li
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Mengtao Sun
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Jieqiong Wang
- Department of Neurological Sciences, University of Nebraska Medical Center, Omaha, NE, United States
| | - Shibiao Wan
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States.
| |
Collapse
|