1
|
Fang W, Reniers G, Zhou D, Yin J, Liu Z. A victim risk identification model for nature-induced urban disaster emergency response. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2024. [PMID: 39277189 DOI: 10.1111/risa.17456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/17/2024]
Abstract
In recent years, nature-induced urban disasters in high-density modern cities in China have raised great concerns. The delayed and imprecise understanding of the real-time post-disaster situation made it difficult for the decision-makers to find a suitable emergency rescue plan. To this end, this study aims to facilitate the real-time performance and accuracy of on-site victim risk identification. In this article, we propose a victim identification model based on the You Only Look Once v7-W6 (YOLOv7-W6) algorithm. This model defines the "fall-down" pose as a key feature in identifying urgent victims from the perspective of disaster medicine rescue. The results demonstrate that this model performs superior accuracy (mAP@0.5, 0.960) and inference speed (5.1 ms) on the established disaster victim database compared to other state-of-the-art object detection algorithms. Finally, a case study is illustrated to show the practical utilization of this model in a real disaster rescue scenario. This study proposes an intelligent on-site victim risk identification approach, contributing significantly to government emergency decision-making and response.
Collapse
Affiliation(s)
- Weipeng Fang
- School of Economics and Management, Tongji University, Shanghai, China
- The Institute of Disaster Medicine Engineering of Tongji University, Tongji University, Shanghai, China
- CEDON, KU Leuven, Campus Brussels, Brussels, Belgium
| | - Genserik Reniers
- CEDON, KU Leuven, Campus Brussels, Brussels, Belgium
- Faculty of Technology, Policy and Management, Safety and Security Science Group (S3G), TU Delft, Delft, The Netherlands
- University of Antwerp, Antwerp, Belgium
| | - Dan Zhou
- The Institute of Disaster Medicine Engineering of Tongji University, Tongji University, Shanghai, China
| | - Jian Yin
- The Institute of Disaster Medicine Engineering of Tongji University, Tongji University, Shanghai, China
| | - Zhongmin Liu
- School of Economics and Management, Tongji University, Shanghai, China
- The Institute of Disaster Medicine Engineering of Tongji University, Tongji University, Shanghai, China
| |
Collapse
|
2
|
Soe NN, Yu Z, Latt PM, Lee D, Samra RS, Ge Z, Rahman R, Sun J, Ong JJ, Fairley CK, Zhang L. Using AI to Differentiate Mpox From Common Skin Lesions in a Sexual Health Clinic: Algorithm Development and Validation Study. J Med Internet Res 2024; 26:e52490. [PMID: 39269753 PMCID: PMC11437223 DOI: 10.2196/52490] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 02/11/2024] [Accepted: 06/10/2024] [Indexed: 09/15/2024] Open
Abstract
BACKGROUND The 2022 global outbreak of mpox has significantly impacted health facilities, and necessitated additional infection prevention and control measures and alterations to clinic processes. Early identification of suspected mpox cases will assist in mitigating these impacts. OBJECTIVE We aimed to develop and evaluate an artificial intelligence (AI)-based tool to differentiate mpox lesion images from other skin lesions seen in a sexual health clinic. METHODS We used a data set with 2200 images, that included mpox and non-mpox lesions images, collected from Melbourne Sexual Health Centre and web resources. We adopted deep learning approaches which involved 6 different deep learning architectures to train our AI models. We subsequently evaluated the performance of each model using a hold-out data set and an external validation data set to determine the optimal model for differentiating between mpox and non-mpox lesions. RESULTS The DenseNet-121 model outperformed other models with an overall area under the receiver operating characteristic curve (AUC) of 0.928, an accuracy of 0.848, a precision of 0.942, a recall of 0.742, and an F1-score of 0.834. Implementation of a region of interest approach significantly improved the performance of all models, with the AUC for the DenseNet-121 model increasing to 0.982. This approach resulted in an increase in the correct classification of mpox images from 79% (55/70) to 94% (66/70). The effectiveness of this approach was further validated by a visual analysis with gradient-weighted class activation mapping, demonstrating a reduction in false detection within the background of lesion images. On the external validation data set, ResNet-18 and DenseNet-121 achieved the highest performance. ResNet-18 achieved an AUC of 0.990 and an accuracy of 0.947, and DenseNet-121 achieved an AUC of 0.982 and an accuracy of 0.926. CONCLUSIONS Our study demonstrated it was possible to use an AI-based image recognition algorithm to accurately differentiate between mpox and common skin lesions. Our findings provide a foundation for future investigations aimed at refining the algorithm and establishing the place of such technology in a sexual health clinic.
Collapse
Affiliation(s)
- Nyi Nyi Soe
- Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - Zhen Yu
- Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
- Augmented Intelligence and Multimodal analytics (AIM) for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Australia
| | - Phyu Mon Latt
- Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - David Lee
- Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
| | - Ranjit Singh Samra
- Department of Infectious Diseases, Alfred Hospital, Alfred Health, Melbourne, Australia
| | - Zongyuan Ge
- Augmented Intelligence and Multimodal analytics (AIM) for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Australia
| | - Rashidur Rahman
- Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
| | - Jiajun Sun
- Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - Jason J Ong
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
- Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
| | - Christopher K Fairley
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
- Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
| | - Lei Zhang
- Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
- Clinical Medical Research Center, Children's Hospital of Nanjing Medical University, Nanjing, China
| |
Collapse
|
3
|
O'Reilly JA, Zhu JD, Sowman PF. Localized estimation of event-related neural source activity from simultaneous MEG-EEG with a recurrent neural network. Neural Netw 2024; 180:106731. [PMID: 39303603 DOI: 10.1016/j.neunet.2024.106731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 09/05/2024] [Accepted: 09/10/2024] [Indexed: 09/22/2024]
Abstract
Estimating intracranial current sources underlying the electromagnetic signals observed from extracranial sensors is a perennial challenge in non-invasive neuroimaging. Established solutions to this inverse problem treat time samples independently without considering the temporal dynamics of event-related brain processes. This paper describes current source estimation from simultaneously recorded magneto- and electro-encephalography (MEEG) using a recurrent neural network (RNN) that learns sequential relationships from neural data. The RNN was trained in two phases: (1) pre-training and (2) transfer learning with L1 regularization applied to the source estimation layer. Performance of using scaled labels derived from MEEG, magnetoencephalography (MEG), or electroencephalography (EEG) were compared, as were results from volumetric source space with free dipole orientation and surface source space with fixed dipole orientation. Exact low-resolution electromagnetic tomography (eLORETA) and mixed-norm L1/L2 (MxNE) source estimation methods were also applied to these data for comparison with the RNN method. The RNN approach outperformed other methods in terms of output signal-to-noise ratio, correlation and mean-squared error metrics evaluated against reference event-related field (ERF) and event-related potential (ERP) waveforms. Using MEEG labels with fixed-orientation surface sources produced the most consistent estimates. To estimate sources of ERF and ERP waveforms, the RNN generates temporal dynamics within its internal computational units, driven by sequential structure in neural data used as training labels. It thus provides a data-driven model of computational transformations from psychophysiological events into corresponding event-related neural signals, which is unique among MEEG source reconstruction solutions.
Collapse
Affiliation(s)
- Jamie A O'Reilly
- School of Engineering, King Mongkut's Institute of Technology Ladkrabang, Bangkok, 10520, Thailand.
| | - Judy D Zhu
- School of Psychological Sciences, Macquarie University, New South Wales, 2109, Australia
| | - Paul F Sowman
- School of Psychological Sciences, Macquarie University, New South Wales, 2109, Australia; School of Clinical Sciences, Auckland University of Technology, Auckland, 1142, New Zealand
| |
Collapse
|
4
|
Niu B, Lee B, Wang L, Chen W, Johnson J. The Accurate Prediction of Antibody Deamidations by Combining High-Throughput Automated Peptide Mapping and Protein Language Model-Based Deep Learning. Antibodies (Basel) 2024; 13:74. [PMID: 39311379 PMCID: PMC11417914 DOI: 10.3390/antib13030074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 08/30/2024] [Accepted: 09/06/2024] [Indexed: 09/26/2024] Open
Abstract
Therapeutic antibodies such as monoclonal antibodies (mAbs), bispecific and multispecific antibodies are pivotal in therapeutic protein development and have transformed disease treatments across various therapeutic areas. The integrity of therapeutic antibodies, however, is compromised by sequence liabilities, notably deamidation, where asparagine (N) and glutamine (Q) residues undergo chemical degradations. Deamidation negatively impacts the efficacy, stability, and safety of diverse classes of antibodies, thus necessitating the critical need for the early and accurate identification of vulnerable sites. In this article, a comprehensive antibody deamidation-specific dataset (n = 2285) of varied modalities was created by using high-throughput automated peptide mapping followed by supervised machine learning to predict the deamidation propensities, as well as the extents, throughout the entire antibody sequences. We propose a novel chimeric deep learning model, integrating protein language model (pLM)-derived embeddings with local sequence information for enhanced deamidation predictions. Remarkably, this model requires only sequence inputs, eliminating the need for laborious feature engineering. Our approach demonstrates state-of-the-art performance, offering a streamlined workflow for high-throughput automated peptide mapping and deamidation prediction, with the potential of broader applicability to other antibody sequence liabilities.
Collapse
Affiliation(s)
- Ben Niu
- Discovery Biotherapeutics, Bristol Myers Squibb, San Diego, CA 92121, USA
| | - Benjamin Lee
- Discovery Biotherapeutics, Bristol Myers Squibb, San Diego, CA 92121, USA
| | - Lili Wang
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA
| | - Wen Chen
- Discovery Biotherapeutics, Bristol Myers Squibb, San Diego, CA 92121, USA
| | - Jeffrey Johnson
- Discovery Biotherapeutics, Bristol Myers Squibb, San Diego, CA 92121, USA
| |
Collapse
|
5
|
Varela-Vega A, Posada-Reyes AB, Méndez-Cruz CF. Automatic extraction of transcriptional regulatory interactions of bacteria from biomedical literature using a BERT-based approach. Database (Oxford) 2024; 2024:baae094. [PMID: 39213391 PMCID: PMC11363960 DOI: 10.1093/database/baae094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 08/09/2024] [Accepted: 08/14/2024] [Indexed: 09/04/2024]
Abstract
Transcriptional regulatory networks (TRNs) give a global view of the regulatory mechanisms of bacteria to respond to environmental signals. These networks are published in biological databases as a valuable resource for experimental and bioinformatics researchers. Despite the efforts to publish TRNs of diverse bacteria, many of them still lack one and many of the existing TRNs are incomplete. In addition, the manual extraction of information from biomedical literature ("literature curation") has been the traditional way to extract these networks, despite this being demanding and time-consuming. Recently, language models based on pretrained transformers have been used to extract relevant knowledge from biomedical literature. Moreover, the benefit of fine-tuning a large pretrained model with new limited data for a specific task ("transfer learning") opens roads to address new problems of biomedical information extraction. Here, to alleviate this lack of knowledge and assist literature curation, we present a new approach based on the Bidirectional Transformer for Language Understanding (BERT) architecture to classify transcriptional regulatory interactions of bacteria as a first step to extract TRNs from literature. The approach achieved a significant performance in a test dataset of sentences of Escherichia coli (F1-Score: 0.8685, Matthew's correlation coefficient: 0.8163). The examination of model predictions revealed that the model learned different ways to express the regulatory interaction. The approach was evaluated to extract a TRN of Salmonella using 264 complete articles. The evaluation showed that the approach was able to accurately extract 82% of the network and that it was able to extract interactions absent in curation data. To the best of our knowledge, the present study is the first effort to obtain a BERT-based approach to extract this specific kind of interaction. This approach is a starting point to address the limitations of reconstructing TRNs of bacteria and diseases of biological interest. Database URL: https://github.com/laigen-unam/BERT-trn-extraction.
Collapse
Affiliation(s)
- Alfredo Varela-Vega
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, UNAM, Av. Universidad S/N Col. Chamilpa, Cuernavaca, Morelos 62210, México
| | - Ali-Berenice Posada-Reyes
- Laboratorio de Microbiología, Inmunología y Salud Pública, Facultad de Estudios Superiores Cuautitlán, UNAM, Carretera Cuautitlán-Teoloyucan Km. 2.5, Xhala, Cuautitlán Izcalli, Estado de México 54714, México
| | - Carlos-Francisco Méndez-Cruz
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, UNAM, Av. Universidad S/N Col. Chamilpa, Cuernavaca, Morelos 62210, México
| |
Collapse
|
6
|
Soe NN, Yu Z, Latt PM, Lee D, Ong JJ, Ge Z, Fairley CK, Zhang L. Evaluation of artificial intelligence-powered screening for sexually transmitted infections-related skin lesions using clinical images and metadata. BMC Med 2024; 22:296. [PMID: 39020355 PMCID: PMC11256573 DOI: 10.1186/s12916-024-03512-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 07/02/2024] [Indexed: 07/19/2024] Open
Abstract
BACKGROUND Sexually transmitted infections (STIs) pose a significant global public health challenge. Early diagnosis and treatment reduce STI transmission, but rely on recognising symptoms and care-seeking behaviour of the individual. Digital health software that distinguishes STI skin conditions could improve health-seeking behaviour. We developed and evaluated a deep learning model to differentiate STIs from non-STIs based on clinical images and symptoms. METHODS We used 4913 clinical images of genital lesions and metadata from the Melbourne Sexual Health Centre collected during 2010-2023. We developed two binary classification models to distinguish STIs from non-STIs: (1) a convolutional neural network (CNN) using images only and (2) an integrated model combining both CNN and fully connected neural network (FCN) using images and metadata. We evaluated the model performance by the area under the ROC curve (AUC) and assessed metadata contributions to the Image-only model. RESULTS Our study included 1583 STI and 3330 non-STI images. Common STI diagnoses were syphilis (34.6%), genital warts (24.5%) and herpes (19.4%), while most non-STIs (80.3%) were conditions such as dermatitis, lichen sclerosis and balanitis. In both STI and non-STI groups, the most frequently observed groups were 25-34 years (48.6% and 38.2%, respectively) and heterosexual males (60.3% and 45.9%, respectively). The Image-only model showed a reasonable performance with an AUC of 0.859 (SD 0.013). The Image + Metadata model achieved a significantly higher AUC of 0.893 (SD 0.018) compared to the Image-only model (p < 0.01). Out of 21 metadata, the integration of demographic and dermatological metadata led to the most significant improvement in model performance, increasing AUC by 6.7% compared to the baseline Image-only model. CONCLUSIONS The Image + Metadata model outperformed the Image-only model in distinguishing STIs from other skin conditions. Using it as a screening tool in a clinical setting may require further development and evaluation with larger datasets.
Collapse
Affiliation(s)
- Nyi N Soe
- Melbourne Sexual Health Centre, Alfred Health, 580 Swanston Street, Carlton, Melbourne, VIC, 3053, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - Zhen Yu
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - Phyu M Latt
- Melbourne Sexual Health Centre, Alfred Health, 580 Swanston Street, Carlton, Melbourne, VIC, 3053, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - David Lee
- Melbourne Sexual Health Centre, Alfred Health, 580 Swanston Street, Carlton, Melbourne, VIC, 3053, Australia
| | - Jason J Ong
- Melbourne Sexual Health Centre, Alfred Health, 580 Swanston Street, Carlton, Melbourne, VIC, 3053, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - Zongyuan Ge
- Augmented Intelligence and Multimodal analytics (AIM) for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Australia
| | - Christopher K Fairley
- Melbourne Sexual Health Centre, Alfred Health, 580 Swanston Street, Carlton, Melbourne, VIC, 3053, Australia
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
| | - Lei Zhang
- Melbourne Sexual Health Centre, Alfred Health, 580 Swanston Street, Carlton, Melbourne, VIC, 3053, Australia.
- School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia.
- Clinical Medical Research Centre, Children's Hospital of Nanjing Medical University, Nanjing, Jiangsu Province, 210008, China.
| |
Collapse
|
7
|
Granov R, Vedad S, Wang SH, Durham A, Shah D, Pasinetti GM. The Role of the Neural Exposome as a Novel Strategy to Identify and Mitigate Health Inequities in Alzheimer's Disease and Related Dementias. Mol Neurobiol 2024:10.1007/s12035-024-04339-6. [PMID: 38967905 DOI: 10.1007/s12035-024-04339-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Accepted: 06/28/2024] [Indexed: 07/06/2024]
Abstract
With the continuous increase of the elderly population, there is an urgency to understand and develop relevant treatments for Alzheimer's disease and related dementias (ADRD). In tandem with this, the prevalence of health inequities continues to rise as disadvantaged communities fail to be included in mainstream research. The neural exposome poses as a relevant mechanistic approach and tool for investigating ADRD onset, progression, and pathology as it accounts for several different factors: exogenous, endogenous, and behavioral. Consequently, through the neural exposome, health inequities can be addressed in ADRD research. In this paper, we address how the neural exposome relates to ADRD by contributing to the discourse through defining how the neural exposome can be developed as a tool in accordance with machine learning. Through this, machine learning can allow for developing a greater insight into the application of transferring and making sense of experimental mouse models exposed to health inequities and potentially relate it to humans. The overall goal moving beyond this paper is to define a multitude of potential factors that can increase the risk of ADRD onset and integrate them to create an interdisciplinary approach to the study of ADRD and subsequently translate the findings to clinical research.
Collapse
Affiliation(s)
- Ravid Granov
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Skyler Vedad
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Shu-Han Wang
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Andrea Durham
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Divyash Shah
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Giulio Maria Pasinetti
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA.
- Geriatrics Research, Education and Clinical Center, JJ Peters VA Medical Center, Bronx, NY, 10468, USA.
| |
Collapse
|
8
|
Soares Dias Portela A, Saxena V, Rosenn E, Wang SH, Masieri S, Palmieri J, Pasinetti GM. Role of Artificial Intelligence in Multinomial Decisions and Preventative Nutrition in Alzheimer's Disease. Mol Nutr Food Res 2024; 68:e2300605. [PMID: 38175857 DOI: 10.1002/mnfr.202300605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 10/04/2023] [Indexed: 01/06/2024]
Abstract
Alzheimer's disease (AD) affects 50 million people worldwide, an increase of 35 million since 2015, and it is known for memory loss and cognitive decline. Considering the morbidity associated with AD, it is important to explore lifestyle elements influencing the chances of developing AD, with special emphasis on nutritional aspects. This review will first discuss how dietary factors have an impact in AD development and the possible role of Artificial Intelligence (AI) and Machine Learning (ML) in preventative care of AD patients through nutrition. The Mediterranean-DASH diets provide individuals with many nutrient benefits which assists the prevention of neurodegeneration by having neuroprotective roles. Lack of micronutrients, protein-energy, and polyunsaturated fatty acids increase the chance of cognitive decline, loss of memory, and synaptic dysfunction among others. ML software has the ability to design models of algorithms from data introduced to present practical solutions that are accessible and easy to use. It can give predictions for a precise medicine approach to evaluate individuals as a whole. There is no doubt the future of nutritional science lies on customizing diets for individuals to reduce dementia risk factors, maintain overall health and brain function.
Collapse
Affiliation(s)
| | - Vrinda Saxena
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Eric Rosenn
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Shu-Han Wang
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Sibilla Masieri
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Joshua Palmieri
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
| | - Giulio Maria Pasinetti
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, 10019, USA
- Geriatrics Research, Education and Clinical Center, JJ Peters VA Medical Center, Bronx, NY, 10468, USA
| |
Collapse
|
9
|
Ashayeri H, Sobhi N, Pławiak P, Pedrammehr S, Alizadehsani R, Jafarizadeh A. Transfer Learning in Cancer Genetics, Mutation Detection, Gene Expression Analysis, and Syndrome Recognition. Cancers (Basel) 2024; 16:2138. [PMID: 38893257 PMCID: PMC11171544 DOI: 10.3390/cancers16112138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2024] [Revised: 05/30/2024] [Accepted: 06/01/2024] [Indexed: 06/21/2024] Open
Abstract
Artificial intelligence (AI), encompassing machine learning (ML) and deep learning (DL), has revolutionized medical research, facilitating advancements in drug discovery and cancer diagnosis. ML identifies patterns in data, while DL employs neural networks for intricate processing. Predictive modeling challenges, such as data labeling, are addressed by transfer learning (TL), leveraging pre-existing models for faster training. TL shows potential in genetic research, improving tasks like gene expression analysis, mutation detection, genetic syndrome recognition, and genotype-phenotype association. This review explores the role of TL in overcoming challenges in mutation detection, genetic syndrome detection, gene expression, or phenotype-genotype association. TL has shown effectiveness in various aspects of genetic research. TL enhances the accuracy and efficiency of mutation detection, aiding in the identification of genetic abnormalities. TL can improve the diagnostic accuracy of syndrome-related genetic patterns. Moreover, TL plays a crucial role in gene expression analysis in order to accurately predict gene expression levels and their interactions. Additionally, TL enhances phenotype-genotype association studies by leveraging pre-trained models. In conclusion, TL enhances AI efficiency by improving mutation prediction, gene expression analysis, and genetic syndrome detection. Future studies should focus on increasing domain similarities, expanding databases, and incorporating clinical data for better predictions.
Collapse
Affiliation(s)
- Hamidreza Ashayeri
- Student Research Committee, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran;
| | - Navid Sobhi
- Nikookari Eye Center, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran; (N.S.); (A.J.)
| | - Paweł Pławiak
- Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Warszawska 24, 31-155 Krakow, Poland
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Bałtycka 5, 44-100 Gliwice, Poland
| | - Siamak Pedrammehr
- Faculty of Design, Tabriz Islamic Art University, Tabriz 5164736931, Iran;
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Burwood, VIC 3216, Australia;
| | - Roohallah Alizadehsani
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Burwood, VIC 3216, Australia;
| | - Ali Jafarizadeh
- Nikookari Eye Center, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran; (N.S.); (A.J.)
- Immunology Research Center, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran
| |
Collapse
|
10
|
Subaar C, Addai FT, Addison ECK, Christos O, Adom J, Owusu-Mensah M, Appiah-Agyei N, Abbey S. Investigating the detection of breast cancer with deep transfer learning using ResNet18 and ResNet34. Biomed Phys Eng Express 2024; 10:035029. [PMID: 38599202 DOI: 10.1088/2057-1976/ad3cdf] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Accepted: 04/10/2024] [Indexed: 04/12/2024]
Abstract
A lot of underdeveloped nations particularly in Africa struggle with cancer-related, deadly diseases. Particularly in women, the incidence of breast cancer is rising daily because of ignorance and delayed diagnosis. Only by correctly identifying and diagnosing cancer in its very early stages of development can be effectively treated. The classification of cancer can be accelerated and automated with the aid of computer-aided diagnosis and medical image analysis techniques. This research provides the use of transfer learning from a Residual Network 18 (ResNet18) and Residual Network 34 (ResNet34) architectures to detect breast cancer. The study examined how breast cancer can be identified in breast mammography pictures using transfer learning from ResNet18 and ResNet34, and developed a demo app for radiologists using the trained models with the best validation accuracy. 1, 200 datasets of breast x-ray mammography images from the National Radiological Society's (NRS) archives were employed in the study. The dataset was categorised as implant cancer negative, implant cancer positive, cancer negative and cancer positive in order to increase the consistency of x-ray mammography images classification and produce better features. For the multi-class classification of the images, the study gave an average accuracy for binary classification of benign or malignant cancer cases of 86.7% validation accuracy for ResNet34 and 92% validation accuracy for ResNet18. A prototype web application showcasing ResNet18 performance has been created. The acquired results show how transfer learning can improve the accuracy of breast cancer detection, providing invaluable assistance to medical professionals, particularly in an African scenario.
Collapse
Affiliation(s)
- Christiana Subaar
- Department of Physics, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | | | | | - Olivia Christos
- Department of Physics, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Joseph Adom
- Department of Physics, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Martin Owusu-Mensah
- Department of Physics, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Nelson Appiah-Agyei
- Department of Health Physics and Diagnostic Sciences, University of Nevada, Las Vegas, United States of America
| | - Shadrack Abbey
- Department of Physics, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| |
Collapse
|
11
|
Xiao H, Zou Y, Wang J, Wan S. A Review for Artificial Intelligence Based Protein Subcellular Localization. Biomolecules 2024; 14:409. [PMID: 38672426 PMCID: PMC11048326 DOI: 10.3390/biom14040409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 03/21/2024] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Proteins need to be located in appropriate spatiotemporal contexts to carry out their diverse biological functions. Mislocalized proteins may lead to a broad range of diseases, such as cancer and Alzheimer's disease. Knowing where a target protein resides within a cell will give insights into tailored drug design for a disease. As the gold validation standard, the conventional wet lab uses fluorescent microscopy imaging, immunoelectron microscopy, and fluorescent biomarker tags for protein subcellular location identification. However, the booming era of proteomics and high-throughput sequencing generates tons of newly discovered proteins, making protein subcellular localization by wet-lab experiments a mission impossible. To tackle this concern, in the past decades, artificial intelligence (AI) and machine learning (ML), especially deep learning methods, have made significant progress in this research area. In this article, we review the latest advances in AI-based method development in three typical types of approaches, including sequence-based, knowledge-based, and image-based methods. We also elaborately discuss existing challenges and future directions in AI-based method development in this research field.
Collapse
Affiliation(s)
- Hanyu Xiao
- Department of Genetics, Cell Biology and Anatomy, College of Medicine, University of Nebraska Medical Center, Omaha, NE 68198, USA;
| | - Yijin Zou
- College of Veterinary Medicine, China Agricultural University, Beijing 100193, China;
| | - Jieqiong Wang
- Department of Neurological Sciences, College of Medicine, University of Nebraska Medical Center, Omaha, NE 68198, USA;
| | - Shibiao Wan
- Department of Genetics, Cell Biology and Anatomy, College of Medicine, University of Nebraska Medical Center, Omaha, NE 68198, USA;
| |
Collapse
|
12
|
Safdar Ali Khan M, Husen A, Nisar S, Ahmed H, Shah Muhammad S, Aftab S. Offloading the computational complexity of transfer learning with generic features. PeerJ Comput Sci 2024; 10:e1938. [PMID: 38660182 PMCID: PMC11041970 DOI: 10.7717/peerj-cs.1938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 02/19/2024] [Indexed: 04/26/2024]
Abstract
Deep learning approaches are generally complex, requiring extensive computational resources and having high time complexity. Transfer learning is a state-of-the-art approach to reducing the requirements of high computational resources by using pre-trained models without compromising accuracy and performance. In conventional studies, pre-trained models are trained on datasets from different but similar domains with many domain-specific features. The computational requirements of transfer learning are directly dependent on the number of features that include the domain-specific and the generic features. This article investigates the prospects of reducing the computational requirements of the transfer learning models by discarding domain-specific features from a pre-trained model. The approach is applied to breast cancer detection using the dataset curated breast imaging subset of the digital database for screening mammography and various performance metrics such as precision, accuracy, recall, F1-score, and computational requirements. It is seen that discarding the domain-specific features to a specific limit provides significant performance improvements as well as minimizes the computational requirements in terms of training time (reduced by approx. 12%), processor utilization (reduced approx. 25%), and memory usage (reduced approx. 22%). The proposed transfer learning strategy increases accuracy (approx. 7%) and offloads computational complexity expeditiously.
Collapse
Affiliation(s)
- Muhammad Safdar Ali Khan
- Department of Computer Science and Information Technology, Virtual University of Pakistan, Lahore, Punjab, Pakistan
| | - Arif Husen
- Department of Computer Science and Information Technology, Virtual University of Pakistan, Lahore, Punjab, Pakistan
- Department of Computer Science, COMSATS Institute of Information Technology, Lahore, Punjab, Pakistan
| | - Shafaq Nisar
- Department of Computer Science and Information Technology, Virtual University of Pakistan, Lahore, Punjab, Pakistan
| | - Hasnain Ahmed
- Department of Computer Science and Information Technology, Virtual University of Pakistan, Lahore, Punjab, Pakistan
| | - Syed Shah Muhammad
- Department of Computer Science and Information Technology, Virtual University of Pakistan, Lahore, Punjab, Pakistan
| | - Shabib Aftab
- Department of Computer Science and Information Technology, Virtual University of Pakistan, Lahore, Punjab, Pakistan
| |
Collapse
|
13
|
Wang L, Hu Y, Gao L. Adjustment of scRNA-seq data to improve cell-type decomposition of spatial transcriptomics. Brief Bioinform 2024; 25:bbae063. [PMID: 38426323 PMCID: PMC10939420 DOI: 10.1093/bib/bbae063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 01/10/2024] [Accepted: 02/01/2024] [Indexed: 03/02/2024] Open
Abstract
Most sequencing-based spatial transcriptomics (ST) technologies do not achieve single-cell resolution where each captured location (spot) may contain a mixture of cells from heterogeneous cell types, and several cell-type decomposition methods have been proposed to estimate cell type proportions of each spot by integrating with single-cell RNA sequencing (scRNA-seq) data. However, these existing methods did not fully consider the effect of distribution difference between scRNA-seq and ST data for decomposition, leading to biased cell-type-specific genes derived from scRNA-seq for ST data. To address this issue, we develop an instance-based transfer learning framework to adjust scRNA-seq data by ST data to correctly match cell-type-specific gene expression. We evaluate the effect of raw and adjusted scRNA-seq data on cell-type decomposition by eight leading decomposition methods using both simulated and real datasets. Experimental results show that data adjustment can effectively reduce distribution difference and improve decomposition, thus enabling for a more precise depiction on spatial organization of cell types. We highlight the importance of data adjustment in integrative analysis of scRNA-seq with ST data and provide guidance for improved cell-type decomposition.
Collapse
Affiliation(s)
- Lanying Wang
- School of Computer Science and Technology, Xidian University, Xi’an 710100, China
| | - Yuxuan Hu
- School of Computer Science and Technology, Xidian University, Xi’an 710100, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi’an 710100, China
| |
Collapse
|
14
|
Dungate B, Tucker DR, Goodwin E, Yong PJ. Assessing the Utility of artificial intelligence in endometriosis: Promises and pitfalls. WOMEN'S HEALTH (LONDON, ENGLAND) 2024; 20:17455057241248121. [PMID: 38686828 PMCID: PMC11062212 DOI: 10.1177/17455057241248121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 01/29/2024] [Accepted: 03/29/2024] [Indexed: 05/02/2024]
Abstract
Endometriosis, a chronic condition characterized by the growth of endometrial-like tissue outside of the uterus, poses substantial challenges in terms of diagnosis and treatment. Artificial intelligence (AI) has emerged as a promising tool in the field of medicine, offering opportunities to address the complexities of endometriosis. This review explores the current landscape of endometriosis diagnosis and treatment, highlighting the potential of AI to alleviate some of the associated burdens and underscoring common pitfalls and challenges when employing AI algorithms in this context. Women's health research in endometriosis has suffered from underfunding, leading to limitations in diagnosis, classification, and treatment approaches. The heterogeneity of symptoms in patients with endometriosis has further complicated efforts to address this condition. New, powerful methods of analysis have the potential to uncover previously unidentified patterns in data relating to endometriosis. AI, a collection of algorithms replicating human decision-making in data analysis, has been increasingly adopted in medical research, including endometriosis studies. While AI offers the ability to identify novel patterns in data and analyze large datasets, its effectiveness hinges on data quality and quantity and the expertise of those implementing the algorithms. Current applications of AI in endometriosis range from diagnostic tools for ultrasound imaging to predicting treatment success. These applications show promise in reducing diagnostic delays, healthcare costs, and providing patients with more treatment options, improving their quality of life. AI holds significant potential in advancing the diagnosis and treatment of endometriosis, but it must be applied carefully and transparently to avoid pitfalls and ensure reproducibility. This review calls for increased scrutiny and accountability in AI research. Addressing these challenges can lead to more effective AI-driven solutions for endometriosis and other complex medical conditions.
Collapse
Affiliation(s)
- Brie Dungate
- Faculty of Medicine, The University of British Columbia, Vancouver, BC, Canada
- Department of Obstetrics and Gynecology, The University of British Columbia, Vancouver, BC, Canada
- Women’s Health Research Institute, Vancouver, BC, Canada
| | - Dwayne R Tucker
- Department of Obstetrics and Gynecology, The University of British Columbia, Vancouver, BC, Canada
- Women’s Health Research Institute, Vancouver, BC, Canada
- Centre for Pelvic Pain & Endometriosis, BC Women’s Hospital & Health Centre, Vancouver, BC, Canada
| | - Emma Goodwin
- Department of Obstetrics and Gynecology, The University of British Columbia, Vancouver, BC, Canada
- Women’s Health Research Institute, Vancouver, BC, Canada
| | - Paul J Yong
- Department of Obstetrics and Gynecology, The University of British Columbia, Vancouver, BC, Canada
- Women’s Health Research Institute, Vancouver, BC, Canada
- Centre for Pelvic Pain & Endometriosis, BC Women’s Hospital & Health Centre, Vancouver, BC, Canada
| |
Collapse
|
15
|
Chi Duong V, Minh Vu G, Khac Nguyen T, Tran The Nguyen H, Luong Pham T, S Vo N, Hong Hoang T. A rapid and reference-free imputation method for low-cost genotyping platforms. Sci Rep 2023; 13:23083. [PMID: 38155188 PMCID: PMC10754833 DOI: 10.1038/s41598-023-50086-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 12/15/2023] [Indexed: 12/30/2023] Open
Abstract
Most current genotype imputation methods are reference-based, which posed several challenges to users, such as high computational costs and reference panel inaccessibility. Thus, deep learning models are expected to create reference-free imputation methods performing with higher accuracy and shortening the running time. We proposed a imputation method using recurrent neural networks integrating with an additional discriminator network, namely GRUD. This method was applied to datasets from genotyping chips and Low-Pass Whole Genome Sequencing (LP-WGS) with the reference panels from The 1000 Genomes Project (1KGP) phase 3, the dataset of 4810 Singaporeans (SG10K), and The 1000 Vietnamese Genome Project (VN1K). Our model performed more accurately than other existing methods on multiple datasets, especially with common variants with large minor allele frequency, and shrank running time and memory usage. In summary, these results indicated that GRUD can be implemented in genomic analyses to improve the accuracy and running-time of genotype imputation.
Collapse
Affiliation(s)
- Vinh Chi Duong
- Center for Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam
- GeneStory Joint Stock Company, Hanoi, Vietnam
| | - Giang Minh Vu
- Center for Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam
- GeneStory Joint Stock Company, Hanoi, Vietnam
| | | | - Hung Tran The Nguyen
- Center for Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam
- Nanyang Technological University, Singapore, Singapore
| | | | - Nam S Vo
- Center for Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam.
- GeneStory Joint Stock Company, Hanoi, Vietnam.
| | - Tham Hong Hoang
- Center for Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam.
- GeneStory Joint Stock Company, Hanoi, Vietnam.
| |
Collapse
|
16
|
Airas J, Ding X, Zhang B. Transferable Implicit Solvation via Contrastive Learning of Graph Neural Networks. ACS CENTRAL SCIENCE 2023; 9:2286-2297. [PMID: 38161379 PMCID: PMC10755853 DOI: 10.1021/acscentsci.3c01160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/26/2023] [Accepted: 10/31/2023] [Indexed: 01/03/2024]
Abstract
Implicit solvent models are essential for molecular dynamics simulations of biomolecules, striking a balance between computational efficiency and biological realism. Efforts are underway to develop accurate and transferable implicit solvent models and coarse-grained (CG) force fields in general, guided by a bottom-up approach that matches the CG energy function with the potential of mean force (PMF) defined by the finer system. However, practical challenges arise due to the lack of analytical expressions for the PMF and algorithmic limitations in parameterizing CG force fields. To address these challenges, a machine learning-based approach is proposed, utilizing graph neural networks (GNNs) to represent the solvation free energy and potential contrasting for parameter optimization. We demonstrate the effectiveness of the approach by deriving a transferable GNN implicit solvent model using 600,000 atomistic configurations of six proteins obtained from explicit solvent simulations. The GNN model provides solvation free energy estimations much more accurately than state-of-the-art implicit solvent models, reproducing configurational distributions of explicit solvent simulations. We also demonstrate the reasonable transferability of the GNN model outside of the training data. Our study offers valuable insights for deriving systematically improvable implicit solvent models and CG force fields from a bottom-up perspective.
Collapse
Affiliation(s)
- Justin Airas
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139-4307, United
States
| | - Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139-4307, United
States
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139-4307, United
States
| |
Collapse
|
17
|
Chato L, Regentova E. Survey of Transfer Learning Approaches in the Machine Learning of Digital Health Sensing Data. J Pers Med 2023; 13:1703. [PMID: 38138930 PMCID: PMC10744730 DOI: 10.3390/jpm13121703] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 12/01/2023] [Accepted: 12/08/2023] [Indexed: 12/24/2023] Open
Abstract
Machine learning and digital health sensing data have led to numerous research achievements aimed at improving digital health technology. However, using machine learning in digital health poses challenges related to data availability, such as incomplete, unstructured, and fragmented data, as well as issues related to data privacy, security, and data format standardization. Furthermore, there is a risk of bias and discrimination in machine learning models. Thus, developing an accurate prediction model from scratch can be an expensive and complicated task that often requires extensive experiments and complex computations. Transfer learning methods have emerged as a feasible solution to address these issues by transferring knowledge from a previously trained task to develop high-performance prediction models for a new task. This survey paper provides a comprehensive study of the effectiveness of transfer learning for digital health applications to enhance the accuracy and efficiency of diagnoses and prognoses, as well as to improve healthcare services. The first part of this survey paper presents and discusses the most common digital health sensing technologies as valuable data resources for machine learning applications, including transfer learning. The second part discusses the meaning of transfer learning, clarifying the categories and types of knowledge transfer. It also explains transfer learning methods and strategies, and their role in addressing the challenges in developing accurate machine learning models, specifically on digital health sensing data. These methods include feature extraction, fine-tuning, domain adaptation, multitask learning, federated learning, and few-/single-/zero-shot learning. This survey paper highlights the key features of each transfer learning method and strategy, and discusses the limitations and challenges of using transfer learning for digital health applications. Overall, this paper is a comprehensive survey of transfer learning methods on digital health sensing data which aims to inspire researchers to gain knowledge of transfer learning approaches and their applications in digital health, enhance the current transfer learning approaches in digital health, develop new transfer learning strategies to overcome the current limitations, and apply them to a variety of digital health technologies.
Collapse
Affiliation(s)
- Lina Chato
- Department of Electrical and Computer Engineering, University of Nevada, Las Vegas, NV 89154, USA;
| | | |
Collapse
|
18
|
Rutland H, You J, Liu H, Bull L, Reynolds D. A Systematic Review of Machine-Learning Solutions in Anaerobic Digestion. Bioengineering (Basel) 2023; 10:1410. [PMID: 38136001 PMCID: PMC10740876 DOI: 10.3390/bioengineering10121410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 12/02/2023] [Accepted: 12/04/2023] [Indexed: 12/24/2023] Open
Abstract
The use of machine learning (ML) in anaerobic digestion (AD) is growing in popularity and improves the interpretation of complex system parameters for better operation and optimisation. This systematic literature review aims to explore how ML is currently employed in AD, with particular attention to the challenges of implementation and the benefits of integrating ML techniques. While both lab and industry-scale datasets have been used for model training, challenges arise from varied system designs and the different monitoring equipment used. Traditional machine-learning techniques, predominantly artificial neural networks (ANN), are the most commonly used but face difficulties in scalability and interpretability. Specifically, models trained on lab-scale data often struggle to generalize to full-scale, real-world operations due to the complexity and variability in bacterial communities and system operations. In practical scenarios, machine learning can be employed in real-time operations for predictive modelling, ensuring system stability is maintained, resulting in improved efficiency of both biogas production and waste treatment processes. Through reviewing the ML techniques employed in wider applied domains, potential future research opportunities in addressing these challenges have been identified.
Collapse
Affiliation(s)
- Harvey Rutland
- School of Computer Science, Electrical and Electronic Engineering, and Engineering Maths, University of Bristol, Bristol BS8 1QU, UK
| | - Jiseon You
- School of Engineering, University of the West of England, Bristol BS16 1QY, UK;
| | - Haixia Liu
- School of Computing and Creative Technologies, University of the West of England, Bristol BS16 1QY, UK; (H.L.); (L.B.)
| | - Larry Bull
- School of Computing and Creative Technologies, University of the West of England, Bristol BS16 1QY, UK; (H.L.); (L.B.)
| | - Darren Reynolds
- School of Applied Sciences, University of the West of England, Bristol BS16 1QY, UK;
| |
Collapse
|
19
|
Cheng PC, Chiang HHK. Diagnosis of Salivary Gland Tumors Using Transfer Learning with Fine-Tuning and Gradual Unfreezing. Diagnostics (Basel) 2023; 13:3333. [PMID: 37958229 PMCID: PMC10648910 DOI: 10.3390/diagnostics13213333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/25/2023] [Accepted: 10/27/2023] [Indexed: 11/15/2023] Open
Abstract
Ultrasound is the primary tool for evaluating salivary gland tumors (SGTs); however, tumor diagnosis currently relies on subjective features. This study aimed to establish an objective ultrasound diagnostic method using deep learning. We collected 446 benign and 223 malignant SGT ultrasound images in the training/validation set and 119 benign and 44 malignant SGT ultrasound images in the testing set. We trained convolutional neural network (CNN) models from scratch and employed transfer learning (TL) with fine-tuning and gradual unfreezing to classify malignant and benign SGTs. The diagnostic performances of these models were compared. By utilizing the pretrained ResNet50V2 with fine-tuning and gradual unfreezing, we achieved a 5-fold average validation accuracy of 0.920. The diagnostic performance on the testing set demonstrated an accuracy of 89.0%, a sensitivity of 81.8%, a specificity of 91.6%, a positive predictive value of 78.3%, and a negative predictive value of 93.2%. This performance surpasses that of other models in our study. The corresponding Grad-CAM visualizations were also presented to provide explanations for the diagnosis. This study presents an effective and objective ultrasound method for distinguishing between malignant and benign SGTs, which could assist in preoperative evaluation.
Collapse
Affiliation(s)
- Ping-Chia Cheng
- Department of Biomedical Engineering, National Yang Ming Chiao Tung University, Taipei 11221, Taiwan;
- Department of Otolaryngology Head and Neck Surgery, Far Eastern Memorial Hospital, New Taipei City 22060, Taiwan
- Department of Communication Engineering, Asia Eastern University of Science and Technology, New Taipei City 22060, Taiwan
| | - Hui-Hua Kenny Chiang
- Department of Biomedical Engineering, National Yang Ming Chiao Tung University, Taipei 11221, Taiwan;
| |
Collapse
|
20
|
Paolucci I, Lin YM, Albuquerque Marques Silva J, Brock KK, Odisio BC. Bayesian parametric models for survival prediction in medical applications. BMC Med Res Methodol 2023; 23:250. [PMID: 37884857 PMCID: PMC10605790 DOI: 10.1186/s12874-023-02059-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 10/06/2023] [Indexed: 10/28/2023] Open
Abstract
BACKGROUND Evidence-based treatment decisions in medicine are made founded on population-level evidence obtained during randomized clinical trials. In an era of personalized medicine, these decisions should be based on the predicted benefit of a treatment on a patient-level. Survival prediction models play a central role as they incorporate the time-to-event and censoring. In medical applications uncertainty is critical especially when treatments differ in their side effect profiles or costs. Additionally, models must be adapted to local populations without diminishing performance and often without the original training data available due to privacy concern. Both points are supported by Bayesian models-yet they are rarely used. The aim of this work is to evaluate Bayesian parametric survival models on public datasets including cardiology, infectious diseases, and oncology. MATERIALS AND METHODS Bayesian parametric survival models based on the Exponential and Weibull distribution were implemented as a Python package. A linear combination and a neural network were used for predicting the parameters of the distributions. A superiority design was used to assess whether Bayesian models are better than commonly used models such as Cox Proportional Hazards, Random Survival Forest, and Neural Network-based Cox Proportional Hazards. In a secondary analysis, overfitting was compared between these models. An equivalence design was used to assess whether the prediction performance of Bayesian models after model updating using Bayes rule is equivalent to retraining on the full dataset. RESULTS In this study, we found that Bayesian parametric survival models perform as good as state-of-the art models while requiring less hyperparameters to be tuned and providing a measure of the uncertainty of the predictions. In addition, these models were less prone to overfitting. Furthermore, we show that updating these models using Bayes rule yields equivalent performance compared to models trained on combined original and new datasets. CONCLUSIONS Bayesian parametric survival models are non-inferior to conventional survival models while requiring less hyperparameter tuning, being less prone to overfitting, and allowing model updating using Bayes rule. Further, the Bayesian models provide a measure of the uncertainty on the statistical inference, and, in particular, on the prediction.
Collapse
Affiliation(s)
- Iwan Paolucci
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| | - Yuan-Mao Lin
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | | | - Kristy K Brock
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Bruno C Odisio
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| |
Collapse
|
21
|
Ohashi Y, Ihara T, Oka K, Takane Y, Kikegawa Y. Machine learning analysis and risk prediction of weather-sensitive mortality related to cardiovascular disease during summer in Tokyo, Japan. Sci Rep 2023; 13:17020. [PMID: 37813975 PMCID: PMC10562479 DOI: 10.1038/s41598-023-44181-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 10/04/2023] [Indexed: 10/11/2023] Open
Abstract
Climate-sensitive diseases developing from heat or cold stress threaten human health. Therefore, the future health risk induced by climate change and the aging of society need to be assessed. We developed a prediction model for mortality due to cardiovascular diseases such as myocardial infarction and cerebral infarction, which are weather or climate sensitive, using machine learning (ML) techniques. We evaluated the daily mortality of ischaemic heart disease (IHD) and cerebrovascular disease (CEV) in Tokyo and Osaka City, Japan, during summer. The significance of delayed effects of daily maximum temperature and other weather elements on mortality was previously demonstrated using a distributed lag nonlinear model. We conducted ML by a LightGBM algorithm that included specified lag days, with several temperature- and air pressure-related elements, to assess the respective mortality risks for IHD and CEV, based on training and test data for summer 2010-2019. These models were used to evaluate the effect of climate change on the risk for IHD mortality in Tokyo by applying transfer learning (TL). ML with TL predicted that the daily IHD mortality risk in Tokyo would averagely increase by 29% and 35% at the 95th and 99th percentiles, respectively, using a high-level warming-climate scenario in 2045-2055, compared to the risk simulated using ML in 2009-2019.
Collapse
Affiliation(s)
- Yukitaka Ohashi
- Faculty of Biosphere-Geosphere Science, Okayama University of Science, Kita-Ku, Okayama City, Okayama, Japan.
| | - Tomohiko Ihara
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa City, Chiba, Japan
| | - Kazutaka Oka
- Center for Climate Change Adaptation, National Institute for Environmental Studies (NIES), Tsukuba City, Ibaraki, Japan
| | - Yuya Takane
- Environmental Management Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba City, Ibaraki, Japan
| | - Yukihiro Kikegawa
- School of Science and Engineering, Meisei University, Hino City, Tokyo, Japan
| |
Collapse
|
22
|
Ahalya RK, Almutairi FM, Snekhalatha U, Dhanraj V, Aslam SM. RANet: a custom CNN model and quanvolutional neural network for the automated detection of rheumatoid arthritis in hand thermal images. Sci Rep 2023; 13:15638. [PMID: 37730717 PMCID: PMC10511741 DOI: 10.1038/s41598-023-42111-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 09/05/2023] [Indexed: 09/22/2023] Open
Abstract
Rheumatoid arthritis is an autoimmune disease which affects the small joints. Early prediction of RA is necessary for the treatment and management of the disease. The current work presents a deep learning and quantum computing-based automated diagnostic approach for RA in hand thermal imaging. The study's goals are (i) to develop a custom RANet model and compare its performance with the pretrained models and quanvolutional neural network (QNN) to distinguish between the healthy subjects and RA patients, (ii) To validate the performance of the custom model using feature selection method and classification using machine learning (ML) classifiers. The present study developed a custom RANet model and employed pre-trained models such as ResNet101V2, InceptionResNetV2, and DenseNet201 to classify the RA patients and normal subjects. The deep features extracted from the RA Net model are fed into the ML classifiers after the feature selection process. The RANet model, RA Net+ SVM, and QNN model produced an accuracy of 95%, 97% and 93.33% respectively in the classification of healthy groups and RA patients. The developed RANet and QNN models based on thermal imaging could be employed as an accurate automated diagnostic tool to differentiate between the RA and control groups.
Collapse
Affiliation(s)
- R K Ahalya
- Department of Biomedical Engineering, College of Engineering and Technology, SRM Institute of Science and Technology, Kattankulathur, Chennai, Tamil Nadu, 603203, India
- Department of Biomedical Engineering, Easwari Engineering college, Ramapuram, Chennai, Tamil Nadu, India
| | - Fadiyah M Almutairi
- Department of Information Systems, College of Computer and Information Sciences, Majmaah University, 11952, Al Majmaah, Saudi Arabia
| | - U Snekhalatha
- Department of Biomedical Engineering, College of Engineering and Technology, SRM Institute of Science and Technology, Kattankulathur, Chennai, Tamil Nadu, 603203, India.
| | - Varun Dhanraj
- Department of Physics and Astronomy, University of Waterloo, Waterloo, ON, Canada
| | - Shabnam M Aslam
- Department of Information Technology, College of Computer and Information Sciences, Majmaah University, 11952, Al Majmaah, Saudi Arabia
| |
Collapse
|
23
|
Airas J, Ding X, Zhang B. Transferable Coarse Graining via Contrastive Learning of Graph Neural Networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.08.556923. [PMID: 37745447 PMCID: PMC10515757 DOI: 10.1101/2023.09.08.556923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Coarse-grained (CG) force fields are essential for molecular dynamics simulations of biomolecules, striking a balance between computational efficiency and biological realism. These simulations employ simplified models grouping atoms into interaction sites, enabling the study of complex biomolecular systems over biologically relevant timescales. Efforts are underway to develop accurate and transferable CG force fields, guided by a bottom-up approach that matches the CG energy function with the potential of mean force (PMF) defined by the finer system. However, practical challenges arise due to many-body effects, lack of analytical expressions for the PMF, and limitations in parameterizing CG force fields. To address these challenges, a machine learning-based approach is proposed, utilizing graph neural networks (GNNs) to represent CG force fields and potential contrasting for parameterization from atomistic simulation data. We demonstrate the effectiveness of the approach by deriving a transferable GNN implicit solvent model using 600,000 atomistic configurations of six proteins obtained from explicit solvent simulations. The GNN model provides solvation free energy estimations much more accurately than state-of-the-art implicit solvent models, reproducing configurational distributions of explicit solvent simulations. We also demonstrate the reasonable transferability of the GNN model outside the training data. Our study offers valuable insights for building accurate coarse-grained models bottom-up.
Collapse
Affiliation(s)
- Justin Airas
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
24
|
Wozniak P, Ozog D. Cross-Domain Indoor Visual Place Recognition for Mobile Robot via Generalization Using Style Augmentation. SENSORS (BASEL, SWITZERLAND) 2023; 23:6134. [PMID: 37447982 PMCID: PMC10346347 DOI: 10.3390/s23136134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 06/22/2023] [Accepted: 06/26/2023] [Indexed: 07/15/2023]
Abstract
The article presents an algorithm for the multi-domain visual recognition of an indoor place. It is based on a convolutional neural network and style randomization. The authors proposed a scene classification mechanism and improved the performance of the models based on synthetic and real data from various domains. In the proposed dataset, a domain change was defined as a camera model change. A dataset of images collected from several rooms was used to show different scenarios, human actions, equipment changes, and lighting conditions. The proposed method was tested in a scene classification problem where multi-domain data were used. The basis was a transfer learning approach with an extension style applied to various combinations of source and target data. The focus was on improving the unknown domain score and multi-domain support. The results of the experiments were analyzed in the context of data collected on a humanoid robot. The article shows that the average score was the highest for the use of multi-domain data and data style enhancement. The method of obtaining average results for the proposed method reached the level of 92.08%. The result obtained by another research team was corrected.
Collapse
Affiliation(s)
- Piotr Wozniak
- Department of Computer and Control Engineering, Faculty of Electrical and Computer Engineering, Rzeszow University of Technology, Al. Powstańców Warszawy 12, 35-959 Rzeszow, Poland;
| | | |
Collapse
|
25
|
Yuvaraj R, Baranwal A, Prince AA, Murugappan M, Mohammed JS. Emotion Recognition from Spatio-Temporal Representation of EEG Signals via 3D-CNN with Ensemble Learning Techniques. Brain Sci 2023; 13:brainsci13040685. [PMID: 37190650 DOI: 10.3390/brainsci13040685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 04/12/2023] [Accepted: 04/17/2023] [Indexed: 05/17/2023] Open
Abstract
The recognition of emotions is one of the most challenging issues in human-computer interaction (HCI). EEG signals are widely adopted as a method for recognizing emotions because of their ease of acquisition, mobility, and convenience. Deep neural networks (DNN) have provided excellent results in emotion recognition studies. Most studies, however, use other methods to extract handcrafted features, such as Pearson correlation coefficient (PCC), Principal Component Analysis, Higuchi Fractal Dimension (HFD), etc., even though DNN is capable of generating meaningful features. Furthermore, most earlier studies largely ignored spatial information between the different channels, focusing mainly on time domain and frequency domain representations. This study utilizes a pre-trained 3D-CNN MobileNet model with transfer learning on the spatio-temporal representation of EEG signals to extract features for emotion recognition. In addition to fully connected layers, hybrid models were explored using other decision layers such as multilayer perceptron (MLP), k-nearest neighbor (KNN), extreme learning machine (ELM), XGBoost (XGB), random forest (RF), and support vector machine (SVM). Additionally, this study investigates the effects of post-processing or filtering output labels. Extensive experiments were conducted on the SJTU Emotion EEG Dataset (SEED) (three classes) and SEED-IV (four classes) datasets, and the results obtained were comparable to the state-of-the-art. Based on the conventional 3D-CNN with ELM classifier, SEED and SEED-IV datasets showed a maximum accuracy of 89.18% and 81.60%, respectively. Post-filtering improved the emotional classification performance in the hybrid 3D-CNN with ELM model for SEED and SEED-IV datasets to 90.85% and 83.71%, respectively. Accordingly, spatial-temporal features extracted from the EEG, along with ensemble classifiers, were found to be the most effective in recognizing emotions compared to state-of-the-art methods.
Collapse
Affiliation(s)
- Rajamanickam Yuvaraj
- National Institute of Education, Nanyang Technological University, Singapore 637616, Singapore
| | - Arapan Baranwal
- Department of Computer Science and Information Systems, BITS Pilani, Sancoale 403726, Goa, India
| | - A Amalin Prince
- Department of Electrical and Electronics Engineering, BITS Pilani, Sancoale 403726, Goa, India
| | - M Murugappan
- Intelligent Signal Processing (ISP) Research Lab, Department of Electronics and Communication Engineering, Kuwait College of Science and Technology, Block 4, Doha 13133, Kuwait
- Department of Electronics and Communication Engineering, Faculty of Engineering, Vels Institute of Sciences, Technology, and Advanced Studies, Chennai 600117, Tamilnadu, India
- Centre for Excellence in Unmanned Aerial Systems (CoEUAS), Universiti Malaysia Perlis, Kangar 02600, Perlis, Malaysia
| | - Javeed Shaikh Mohammed
- Department of Biomedical Technology, College of Applied Medical Sciences, Prince Sattam bin Abdulaziz University, Al Kharj 11942, Saudi Arabia
| |
Collapse
|