1
|
Zhang Y, Gao Y, Wang H, Wu H, Xia Y, Wu X. A Secure High-Order Gene Interaction Detection Algorithm Based on Deep Neural Network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:619-630. [PMID: 36251904 DOI: 10.1109/tcbb.2022.3214863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Identifying high-order Single Nucleotide Polymorphism (SNP) interactions of additive genetic model is crucial for detecting complex disease gene-type and predicting pathogenic genes of various disorders. We present a novel framework for high-order gene interactions detection, not directly identifying individual site, but based on Deep Learning (DL) method with Differential Privacy (DP), termed as Deep-DPGI. Firstly, integrate loss functions including cross-entropy and focal loss function to train the model parameters that minimize the value of loss. Secondly, use the layer-wise relevance analysis method to measure relevance difference between neurons weight and outputting results. Deep-DPGI disturbs neuron weight by adaptive noising mechanism, protecting the safety of high-order gene interactions and balancing the privacy and utility. Specifically, more noise is added to gradients of neurons that is less relevance with the outputs, less noise to gradients that more relevance. Finally, Experiments on simulated and real datasets demonstrate that Deep-DPGI not only improve the power of high-order gene interactions detection in with marginal and without marginal effect of complex disease models, but also prevent the disclosure of sensitive information effectively.
Collapse
|
2
|
Hua H, Zhou Y, Li W, Zhang J, Deng Y, Khoo BL. Microfluidics-based patient-derived disease detection tool for deep learning-assisted precision medicine. BIOMICROFLUIDICS 2024; 18:014101. [PMID: 38223546 PMCID: PMC10787641 DOI: 10.1063/5.0172146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Accepted: 12/11/2023] [Indexed: 01/16/2024]
Abstract
Cancer spatial and temporal heterogeneity fuels resistance to therapies. To realize the routine assessment of cancer prognosis and treatment, we demonstrate the development of an Intelligent Disease Detection Tool (IDDT), a microfluidic-based tumor model integrated with deep learning-assisted algorithmic analysis. IDDT was clinically validated with liquid blood biopsy samples (n = 71) from patients with various types of cancers (e.g., breast, gastric, and lung cancer) and healthy donors, requiring low sample volume (∼200 μl) and a high-throughput 3D tumor culturing system (∼300 tumor clusters). To support automated algorithmic analysis, intelligent decision-making, and precise segmentation, we designed and developed an integrative deep neural network, which includes Mask Region-Based Convolutional Neural Network (Mask R-CNN), vision transformer, and Segment Anything Model (SAM). Our approach significantly reduces the manual labeling time by up to 90% with a high mean Intersection Over Union (mIoU) of 0.902 and immediate results (<2 s per image) for clinical cohort classification. The IDDT can accurately stratify healthy donors (n = 12) and cancer patients (n = 55) within their respective treatment cycle and cancer stage, resulting in high precision (∼99.3%) and high sensitivity (∼98%). We envision that our patient-centric IDDT provides an intelligent, label-free, and cost-effective approach to help clinicians make precise medical decisions and tailor treatment strategies for each patient.
Collapse
Affiliation(s)
| | - Yunlan Zhou
- Department of Clinical Laboratory, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai 200092, China
| | | | - Jing Zhang
- Department of Biomedical Engineering, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong 999077, China
| | - Yanlin Deng
- Department of Biomedical Engineering, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong 999077, China
| | - Bee Luan Khoo
- Authors to whom correspondence should be addressed:; ; and
| |
Collapse
|
3
|
Jin Z, Jin Y, Chen Z. Empirical mode decomposition using deep learning model for financial market forecasting. PeerJ Comput Sci 2022; 8:e1076. [PMID: 36262133 PMCID: PMC9575866 DOI: 10.7717/peerj-cs.1076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 08/08/2022] [Indexed: 06/16/2023]
Abstract
Financial market forecasting is an essential component of financial systems; however, predicting financial market trends is a challenging job due to noisy and non-stationary information. Deep learning is renowned for bringing out excellent abstract features from the huge volume of raw data without depending on prior knowledge, which is potentially fascinating in forecasting financial transactions. This article aims to propose a deep learning model that autonomously mines the statistical rules of data and guides the financial market transactions based on empirical mode decomposition (EMD) with back-propagation neural networks (BPNN). Through the characteristic time scale of data, the intrinsic wave pattern was obtained and then decomposed. Financial market transaction data were analyzed, optimized using PSO, and predicted. Combining the nonlinear and non-stationary financial time series can improve prediction accuracy. The predictive model of deep learning, based on the analysis of the massive financial trading data, can forecast the future trend of financial market price, forming a trading signal when particular confidence is satisfied. The empirical results show that the EMD-based deep learning model has an excellent predicting performance.
Collapse
Affiliation(s)
- Zebin Jin
- College of Management, Ocean University of China, Qingdao, Shandong, China
| | - Yixiao Jin
- Shanghai Yingcai Information Technology Ltd., Fengxian, Shanghai, China
| | | |
Collapse
|
4
|
Vogado L, Veras R, Aires K, Araújo F, Silva R, Ponti M, Tavares JMRS. Diagnosis of Leukaemia in Blood Slides Based on a Fine-Tuned and Highly Generalisable Deep Learning Model. SENSORS (BASEL, SWITZERLAND) 2021; 21:2989. [PMID: 33923209 PMCID: PMC8123151 DOI: 10.3390/s21092989] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Revised: 04/19/2021] [Accepted: 04/21/2021] [Indexed: 02/06/2023]
Abstract
Leukaemia is a dysfunction that affects the production of white blood cells in the bone marrow. Young cells are abnormally produced, replacing normal blood cells. Consequently, the person suffers problems in transporting oxygen and in fighting infections. This article proposes a convolutional neural network (CNN) named LeukNet that was inspired on convolutional blocks of VGG-16, but with smaller dense layers. To define the LeukNet parameters, we evaluated different CNNs models and fine-tuning methods using 18 image datasets, with different resolution, contrast, colour and texture characteristics. We applied data augmentation operations to expand the training dataset, and the 5-fold cross-validation led to an accuracy of 98.61%. To evaluate the CNNs generalisation ability, we applied a cross-dataset validation technique. The obtained accuracies using cross-dataset experiments on three datasets were 97.04, 82.46 and 70.24%, which overcome the accuracies obtained by current state-of-the-art methods. We conclude that using the most common and deepest CNNs may not be the best choice for applications where the images to be classified differ from those used in pre-training. Additionally, the adopted cross-dataset validation approach proved to be an excellent choice to evaluate the generalisation capability of a model, as it considers the model performance on unseen data, which is paramount for CAD systems.
Collapse
Affiliation(s)
- Luis Vogado
- Departamento de Computação, Universidade Federal do Piauí, Teresina 64049-550, Brazil; (L.V.); (R.V.); (K.A.)
| | - Rodrigo Veras
- Departamento de Computação, Universidade Federal do Piauí, Teresina 64049-550, Brazil; (L.V.); (R.V.); (K.A.)
| | - Kelson Aires
- Departamento de Computação, Universidade Federal do Piauí, Teresina 64049-550, Brazil; (L.V.); (R.V.); (K.A.)
| | - Flávio Araújo
- Curso de Bacharelado em Sistemas de Informação, Universidade Federal do Piauí, Picos 64607-670, Brazil; (F.A.); (R.S.)
| | - Romuere Silva
- Curso de Bacharelado em Sistemas de Informação, Universidade Federal do Piauí, Picos 64607-670, Brazil; (F.A.); (R.S.)
| | - Moacir Ponti
- Instituto de Ciências Matemáticas de de Computação, Universidade de São Paulo, São Carlos 13566-590, Brazil;
| | - João Manuel R. S. Tavares
- Departamento de Engenharia Mecânica, Faculdade de Engenharia, Instituto de Ciência e Inovação em Engenharia Mecânica e Engenharia Industrial, Universidade do Porto, 4200-465 Porto, Portugal
| |
Collapse
|
5
|
Zhou J, Hu L, Jiang Y, Liu L. A Correlation Analysis between SNPs and ROIs of Alzheimer's Disease Based on Deep Learning. BIOMED RESEARCH INTERNATIONAL 2021; 2021:8890513. [PMID: 33628827 PMCID: PMC7886593 DOI: 10.1155/2021/8890513] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Revised: 12/23/2020] [Accepted: 01/27/2021] [Indexed: 12/31/2022]
Abstract
Motivation. At present, the research methods for image genetics of Alzheimer's disease based on machine learning are mainly divided into three steps: the first step is to preprocess the original image and gene information into digital signals that are easy to calculate; the second step is feature selection aiming at eliminating redundant signals and obtain representative features; and the third step is to build a learning model and predict the unknown data with regression or bivariate correlation analysis. This type of method requires manual extraction of feature single-nucleotide polymorphisms (SNPs), and the extraction process relies on empirical knowledge to a certain extent, such as linkage imbalance and gene function information in a group sparse model, which puts forward certain requirements for applicable scenarios and application personnel. To solve the problems of insufficient biological significance and large errors in the previous methods of association analysis and disease diagnosis, this paper presents a method of correlation analysis and disease diagnosis between SNP and region of interest (ROI) based on a deep learning model. It is a data-driven method, which has no obvious feature selection process. Results. The deep learning method adopted in this paper has no obvious feature extraction process relying on prior knowledge and model assumptions. From the results of correlation analysis between SNP and ROI, this method is complementary to other regression model methods in application scenarios. In order to improve the disease diagnosis performance of deep learning, we use the deep learning model to integrate SNP characteristics and ROI characteristics. The SNP feature, ROI feature, and SNP-ROI joint feature were input into the deep learning model and trained by cross-validation technique. The experimental results show that the SNP-ROI joint feature describes the information of the samples from different angles, which makes the diagnosis accuracy higher.
Collapse
Affiliation(s)
- Juan Zhou
- School of Software, East China Jiaotong University, Nanchang 330013, China
| | - Linfeng Hu
- School of Software, East China Jiaotong University, Nanchang 330013, China
| | - Yu Jiang
- School of Software, East China Jiaotong University, Nanchang 330013, China
| | - Liyue Liu
- School of Software, East China Jiaotong University, Nanchang 330013, China
| |
Collapse
|
6
|
Shin H, Seo D, Choi Y. Extracellular Vesicle Identification Using Label-Free Surface-Enhanced Raman Spectroscopy: Detection and Signal Analysis Strategies. Molecules 2020; 25:E5209. [PMID: 33182340 PMCID: PMC7664897 DOI: 10.3390/molecules25215209] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 10/19/2020] [Accepted: 11/05/2020] [Indexed: 02/07/2023] Open
Abstract
Extracellular vesicles (EVs) have been widely investigated as promising biomarkers for the liquid biopsy of diseases, owing to their countless roles in biological systems. Furthermore, with the notable progress of exosome research, the use of label-free surface-enhanced Raman spectroscopy (SERS) to identify and distinguish disease-related EVs has emerged. Even in the absence of specific markers for disease-related EVs, label-free SERS enables the identification of unique patterns of disease-related EVs through their molecular fingerprints. In this review, we describe label-free SERS approaches for disease-related EV pattern identification in terms of substrate design and signal analysis strategies. We first describe the general characteristics of EVs and their SERS signals. We then present recent works on applied plasmonic nanostructures to sensitively detect EVs and notable methods to interpret complex spectral data. This review also discusses current challenges and future prospects of label-free SERS-based disease-related EV pattern identification.
Collapse
Affiliation(s)
- Hyunku Shin
- Department of Bio-convergence Engineering, Korea University, Seoul 02841, Korea; (H.S.); (D.S.)
| | - Dongkwon Seo
- Department of Bio-convergence Engineering, Korea University, Seoul 02841, Korea; (H.S.); (D.S.)
| | - Yeonho Choi
- Department of Bio-convergence Engineering, Korea University, Seoul 02841, Korea; (H.S.); (D.S.)
- School of Biomedical Engineering, Korea University, Seoul 02841, Korea
| |
Collapse
|
7
|
Sangphukieo A, Laomettachit T, Ruengjitchatchawalya M. Photosynthetic protein classification using genome neighborhood-based machine learning feature. Sci Rep 2020; 10:7108. [PMID: 32346070 PMCID: PMC7189237 DOI: 10.1038/s41598-020-64053-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Accepted: 04/07/2020] [Indexed: 11/08/2022] Open
Abstract
Identification of novel photosynthetic proteins is important for understanding and improving photosynthetic efficiency. Synergistically, genome neighborhood can provide additional useful information to identify photosynthetic proteins. We, therefore, expected that applying a computational approach, particularly machine learning (ML) with the genome neighborhood-based feature should facilitate the photosynthetic function assignment. Our results revealed a functional relationship between photosynthetic genes and their conserved neighboring genes observed by 'Phylo score', indicating their functions could be inferred from the genome neighborhood profile. Therefore, we created a new method for extracting patterns based on the genome neighborhood network (GNN) and applied them for the photosynthetic protein classification using ML algorithms. Random forest (RF) classifier using genome neighborhood-based features achieved the highest accuracy up to 87% in the classification of photosynthetic proteins and also showed better performance (Mathew's correlation coefficient = 0.718) than other available tools including the sequence similarity search (0.447) and ML-based method (0.361). Furthermore, we demonstrated the ability of our model to identify novel photosynthetic proteins compared to the other methods. Our classifier is available at http://bicep2.kmutt.ac.th/photomod_standalone, https://bit.ly/2S0I2Ox and DockerHub: https://hub.docker.com/r/asangphukieo/photomod.
Collapse
Affiliation(s)
- Apiwat Sangphukieo
- Bioinformatics and Systems Biology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi (KMUTT), Bang Khun Thian, Bangkok, 10150, Thailand
- School of Information Technology, KMUTT, Bang Mod, Thung Khru, Bangkok, 10140, Thailand
| | - Teeraphan Laomettachit
- Bioinformatics and Systems Biology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi (KMUTT), Bang Khun Thian, Bangkok, 10150, Thailand
| | - Marasri Ruengjitchatchawalya
- Bioinformatics and Systems Biology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi (KMUTT), Bang Khun Thian, Bangkok, 10150, Thailand.
- Biotechnology program, School of Bioresources and Technology, KMUTT, Bang Khun Thian, Bangkok, 10150, Thailand.
- Algal Biotechnology Research Group, Pilot Plant Development and Training Institute (PDTI), KMUTT, Bang Khun Thian, Bangkok, 10150, Thailand.
| |
Collapse
|
8
|
Genetics of Congenital Heart Disease. Biomolecules 2019; 9:biom9120879. [PMID: 31888141 PMCID: PMC6995556 DOI: 10.3390/biom9120879] [Citation(s) in RCA: 105] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Revised: 12/07/2019] [Accepted: 12/09/2019] [Indexed: 12/12/2022] Open
Abstract
Congenital heart disease (CHD) is one of the most common birth defects. Studies in animal models and humans have indicated a genetic etiology for CHD. About 400 genes have been implicated in CHD, encompassing transcription factors, cell signaling molecules, and structural proteins that are important for heart development. Recent studies have shown genes encoding chromatin modifiers, cilia related proteins, and cilia-transduced cell signaling pathways play important roles in CHD pathogenesis. Elucidating the genetic etiology of CHD will help improve diagnosis and the development of new therapies to improve patient outcomes.
Collapse
|