1
|
Chung J, Kim S, Won JH, Park H. Integrating Multimodal Neuroimaging and Genetics: A Structurally-Linked Sparse Canonical Correlation Analysis Approach. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2024; 12:659-667. [PMID: 39464624 PMCID: PMC11505868 DOI: 10.1109/jtehm.2024.3463720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 08/16/2024] [Accepted: 09/14/2024] [Indexed: 10/29/2024]
Abstract
Neuroimaging genetics represents a multivariate approach aimed at elucidating the intricate relationships between high-dimensional genetic variations and neuroimaging data. Predominantly, existing methodologies revolve around Sparse Canonical Correlation Analysis (SCCA), a framework we expand to 1) encompass multiple imaging modalities and 2) promote the simultaneous identification of structurally linked features across imaging modalities. The structurally linked brain regions were assessed using diffusion tensor imaging, which quantifies the presence of neuronal fibers, thereby grounding our approach in biologically well-founded prior knowledge within the SCCA model. In our proposed structurally linked SCCA framework, we leverage T1-weighted MRI and functional MRI (fMRI) time series data to delineate both the structural and functional characteristics of the brain. Genetic variations, specifically single nucleotide polymorphisms (SNPs), are also incorporated as a genetic modality. Validation of our methodology was conducted using a simulated dataset and large-scale normative data from the Human Connectome Project (HCP). Our approach demonstrated superior performance compared to existing methods on simulated data and revealed interpretable gene-imaging associations in the real dataset. Thus, our methodology lays the groundwork for elucidating the genetic underpinnings of brain structure and function, thereby providing novel insights into the field of neuroscience. Our code is available at https://github.com/mungegg.
Collapse
Affiliation(s)
- Jiwon Chung
- Department of Electrical and Computer EngineeringSungkyunkwan UniversitySuwon16419Republic of Korea
| | - Sunghun Kim
- Department of Electrical and Computer EngineeringSungkyunkwan UniversitySuwon16419Republic of Korea
| | - Ji Hye Won
- Department of Computer Engineering and Artificial IntelligencePukyong National UniversityBusan48513Republic of Korea
| | - Hyunjin Park
- Department of Electrical and Computer EngineeringSungkyunkwan UniversitySuwon16419Republic of Korea
- Center for Neuroscience Imaging ResearchInstitute for Basic ScienceSuwon16419Republic of Korea
| |
Collapse
|
2
|
Mondal S, Maji P. Multi-Task Learning and Sparse Discriminant Canonical Correlation Analysis for Identification of Diagnosis-Specific Genotype-Phenotype Association. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:1390-1402. [PMID: 38587960 DOI: 10.1109/tcbb.2024.3386406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/10/2024]
Abstract
The primary objective of imaging genetics research is to investigate the complex genotype-phenotype association for the disease under study. For example, to understand the impact of genetic variations over the brain functions and structure, the genotypic data such as single nucleotide polymorphism (SNP) is integrated with the phenotypic data such as imaging quantitative traits. The sparse models, based on canonical correlation analysis (CCA), are popular in this area to find the complex bi-multivariate genotype-phenotype association, as the number of features in genotypic and/or phenotypic data is significantly higher as compared to the number of samples. However, the sparse CCA based methods are, in general, unsupervised in nature, and fail to identify the diagnose-specific features those play an important role for the diagnosis and prognosis of the disease under study. In this regard, a new supervised model is proposed to study the complex genotype-phenotype association, by judiciously integrating the merits of CCA, linear discriminant analysis (LDA) and multi-task learning. The proposed model can identify the diagnose-specific as well as the diagnose-consistent features with significantly lower computational complexity. The performance of the proposed method, along with a comparison with the state-of-the-art methods, is evaluated on several synthetic data sets and one real imaging genetics data collected from Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort. In the current study, the SNP as genetic data and resting state functional MRI ( fMRI) as imaging data are integrated to find the complex genotype-phenotype association. An important finding is that the proposed method has better correlation value, improved noise resistance and stability, and also has better feature selection ability. All the results illustrate the power and capability of the proposed method to find the diagnostic group-specific imaging genetic association, which may help to understand the neurodegenerative disorder in a more comprehensive way.
Collapse
|
3
|
Huang J, Lin L, Yu F, He X, Song W, Lin J, Tang Z, Yuan K, Li Y, Huang H, Pei Z, Xian W, Yu-Chian Chen C. Parkinson's severity diagnosis explainable model based on 3D multi-head attention residual network. Comput Biol Med 2024; 170:107959. [PMID: 38215619 DOI: 10.1016/j.compbiomed.2024.107959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Revised: 12/31/2023] [Accepted: 01/01/2024] [Indexed: 01/14/2024]
Abstract
The severity evaluation of Parkinson's disease (PD) is of great significance for the treatment of PD. However, existing methods either have limitations based on prior knowledge or are invasive methods. To propose a more generalized severity evaluation model, this paper proposes an explainable 3D multi-head attention residual convolution network. First, we introduce the 3D attention-based convolution layer to extract video features. Second, features will be fed into LSTM and residual backbone networks, which can be used to capture the contextual information of the video. Finally, we design a feature compression module to condense the learned contextual features. We develop some interpretable experiments to better explain this black-box model so that it can be better generalized. Experiments show that our model can achieve state-of-the-art diagnosis performance. The proposed lightweight but effective model is expected to serve as a suitable end-to-end deep learning baseline in future research on PD video-based severity evaluation and has the potential for large-scale application in PD telemedicine. The source code is available at https://github.com/JackAILab/MARNet.
Collapse
Affiliation(s)
- Jiehui Huang
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Lishan Lin
- Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Diagnosis and Treatment of Major Neurological Diseases, National Key Clinical Department and Key Discipline of Neurology, 510080, China
| | - Fengcheng Yu
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Xuedong He
- School of Computer Science and Technology, Zhejiang Normal University, Jinhua, 321004, Zhejiang, China
| | - Wenhui Song
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Jiaying Lin
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Zhenchao Tang
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Kang Yuan
- Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Diagnosis and Treatment of Major Neurological Diseases, National Key Clinical Department and Key Discipline of Neurology, 510080, China
| | - Yucheng Li
- Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Diagnosis and Treatment of Major Neurological Diseases, National Key Clinical Department and Key Discipline of Neurology, 510080, China
| | - Haofan Huang
- Polytechnic Institute, Zhejiang University, Hangzhou 310058, China
| | - Zhong Pei
- Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Diagnosis and Treatment of Major Neurological Diseases, National Key Clinical Department and Key Discipline of Neurology, 510080, China.
| | - Wenbiao Xian
- Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Diagnosis and Treatment of Major Neurological Diseases, National Key Clinical Department and Key Discipline of Neurology, 510080, China.
| | - Calvin Yu-Chian Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China; AI for Science (AI4S)-Preferred Program, Peking University Shenzhen Graduate School, Shenzhen, 518055, Guangdong, China; School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, 518055, Guangdong, China; Department of Medical Research, China Medical University Hospital, Taichung, 40447, Taiwan; Department of Bioinformatics and Medical Engineering, Asia University, Taichung, 41354, Taiwan.
| |
Collapse
|
4
|
Choi H, Byeon K, Lee J, Hong S, Park B, Park H. Identifying subgroups of eating behavior traits unrelated to obesity using functional connectivity and feature representation learning. Hum Brain Mapp 2024; 45:e26581. [PMID: 38224537 PMCID: PMC10789215 DOI: 10.1002/hbm.26581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 12/13/2023] [Accepted: 12/20/2023] [Indexed: 01/17/2024] Open
Abstract
Eating behavior is highly heterogeneous across individuals and cannot be fully explained using only the degree of obesity. We utilized unsupervised machine learning and functional connectivity measures to explore the heterogeneity of eating behaviors measured by a self-assessment instrument using 424 healthy adults (mean ± standard deviation [SD] age = 47.07 ± 18.89 years; 67% female). We generated low-dimensional representations of functional connectivity using resting-state functional magnetic resonance imaging and estimated latent features using the feature representation capabilities of an autoencoder by nonlinearly compressing the functional connectivity information. The clustering approaches applied to latent features identified three distinct subgroups. The subgroups exhibited different levels of hunger traits, while their body mass indices were comparable. The results were replicated in an independent dataset consisting of 212 participants (mean ± SD age = 38.97 ± 19.80 years; 35% female). The model interpretation technique of integrated gradients revealed that the between-group differences in the integrated gradient maps were associated with functional reorganization in heteromodal association and limbic cortices and reward-related subcortical structures such as the accumbens, amygdala, and caudate. The cognitive decoding analysis revealed that these systems are associated with reward- and emotion-related systems. Our findings provide insights into the macroscopic brain organization of eating behavior-related subgroups independent of obesity.
Collapse
Affiliation(s)
- Hyoungshin Choi
- Department of Electrical and Computer EngineeringSungkyunkwan UniversitySuwonRepublic of Korea
- Center for Neuroscience Imaging ResearchInstitute for Basic ScienceSuwonRepublic of Korea
| | | | - Jong‐eun Lee
- Department of Electrical and Computer EngineeringSungkyunkwan UniversitySuwonRepublic of Korea
- Center for Neuroscience Imaging ResearchInstitute for Basic ScienceSuwonRepublic of Korea
| | - Seok‐Jun Hong
- Center for Neuroscience Imaging ResearchInstitute for Basic ScienceSuwonRepublic of Korea
- Center for the Developing BrainChild Mind InstituteNew YorkUSA
- Department of Biomedical EngineeringSungkyunkwan UniversitySuwonRepublic of Korea
| | - Bo‐yong Park
- Center for Neuroscience Imaging ResearchInstitute for Basic ScienceSuwonRepublic of Korea
- Department of Data ScienceInha UniversityIncheonRepublic of Korea
- Department of Statistics and Data ScienceInha UniversityIncheonRepublic of Korea
| | - Hyunjin Park
- Center for Neuroscience Imaging ResearchInstitute for Basic ScienceSuwonRepublic of Korea
- School of Electronic and Electrical EngineeringSungkyunkwan UniversitySuwonRepublic of Korea
| |
Collapse
|
5
|
Guo R, Tian X, Lin H, McKenna S, Li HD, Guo F, Liu J. Graph-Based Fusion of Imaging, Genetic and Clinical Data for Degenerative Disease Diagnosis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:57-68. [PMID: 37991907 DOI: 10.1109/tcbb.2023.3335369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2023]
Abstract
Graph learning methods have achieved noteworthy performance in disease diagnosis due to their ability to represent unstructured information such as inter-subject relationships. While it has been shown that imaging, genetic and clinical data are crucial for degenerative disease diagnosis, existing methods rarely consider how best to use their relationships. How best to utilize information from imaging, genetic and clinical data remains a challenging problem. This study proposes a novel graph-based fusion (GBF) approach to meet this challenge. To extract effective imaging-genetic features, we propose an imaging-genetic fusion module which uses an attention mechanism to obtain modality-specific and joint representations within and between imaging and genetic data. Then, considering the effectiveness of clinical information for diagnosing degenerative diseases, we propose a multi-graph fusion module to further fuse imaging-genetic and clinical features, which adopts a learnable graph construction strategy and a graph ensemble method. Experimental results on two benchmarks for degenerative disease diagnosis (Alzheimers Disease Neuroimaging Initiative and Parkinson's Progression Markers Initiative) demonstrate its effectiveness compared to state-of-the-art graph-based methods. Our findings should help guide further development of graph-based models for dealing with imaging, genetic and clinical data.
Collapse
|
6
|
Ling Q, Liu A, Li Y, Mi T, Chan P, Liu Y, Chen X. Homogeneous-Multiset-CCA-Based Brain Covariation and Contravariance Connectivity Network Modeling. IEEE Trans Neural Syst Rehabil Eng 2023; 31:3556-3565. [PMID: 37682656 DOI: 10.1109/tnsre.2023.3310340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/10/2023]
Abstract
Brain connectivity networks based on functional magnetic resonance imaging (fMRI) have expanded our understanding of brain functions in both healthy and diseased states. However, most current studies construct connectivity networks using averaged regional time courses with the strong assumption that the activities of voxels contained in each brain region are similar, ignoring their possible variations. Additionally, pairwise correlation analysis is often adopted with more attention to positive relationships, while joint interactions at the network level as well as anti-correlations are less investigated. In this paper, to provide a new strategy for regional activity representation and brain connectivity modeling, a novel homogeneous multiset canonical correlation analysis (HMCCA) model is proposed, which enforces sign constraints on the weights of voxels to guarantee homogeneity within each brain region. It is capable of obtaining regional representative signals and constructing covariation and contravariance networks simultaneously, at both group and subject levels. Validations on two sessions of fMRI data verified its reproducibility and reliability when dealing with brain connectivity networks. Further experiments on subjects with and without Parkinson's disease (PD) revealed significant alterations in brain connectivity patterns, which were further associated with clinical scores and demonstrated superior prediction ability, indicating its potential in clinical practice.
Collapse
|
7
|
Mandal A, Maji P. Multiview Regularized Discriminant Canonical Correlation Analysis: Sequential Extraction of Relevant Features From Multiblock Data. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:5497-5509. [PMID: 35417362 DOI: 10.1109/tcyb.2022.3155875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
One of the important issues associated with real-life high-dimensional data analysis is how to extract significant and relevant features from multiview data. The multiset canonical correlation analysis (MCCA) is a well-known statistical method for multiview data integration. It finds a linear subspace that maximizes the correlations among different views. However, the existing methods to find the multiset canonical variables are computationally very expensive, which restricts the application of the MCCA in real-life big data analysis. The covariance matrix of each high-dimensional view may also suffer from the singularity problem due to the limited number of samples. Moreover, the MCCA-based existing feature extraction algorithms are, in general, unsupervised in nature. In this regard, a new supervised feature extraction algorithm is proposed, which integrates multimodal multidimensional data sets by solving maximal correlation problem of the MCCA. A new block matrix representation is introduced to reduce the computational complexity for computing the canonical variables of the MCCA. The analytical formulation enables efficient computation of the multiset canonical variables under supervised ridge regression optimization technique. It deals with the "curse of dimensionality" problem associated with high-dimensional data and facilitates the sequential generation of relevant features with significantly lower computational cost. The effectiveness of the proposed multiblock data integration algorithm, along with a comparison with other existing methods, is demonstrated on several benchmark and real-life cancer data.
Collapse
|
8
|
Wang T, Chen X, Zhang J, Feng Q, Huang M. Deep multimodality-disentangled association analysis network for imaging genetics in neurodegenerative diseases. Med Image Anal 2023; 88:102842. [PMID: 37247468 DOI: 10.1016/j.media.2023.102842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 03/01/2023] [Accepted: 05/15/2023] [Indexed: 05/31/2023]
Abstract
Imaging genetics is a crucial tool that is applied to explore potentially disease-related biomarkers, particularly for neurodegenerative diseases (NDs). With the development of imaging technology, the association analysis between multimodal imaging data and genetic data is gradually being concerned by a wide range of imaging genetics studies. However, multimodal data are fused first and then correlated with genetic data in traditional methods, which leads to an incomplete exploration of their common and complementary information. In addition, the inaccurate formulation in the complex relationships between imaging and genetic data and information loss caused by missing multimodal data are still open problems in imaging genetics studies. Therefore, in this study, a deep multimodality-disentangled association analysis network (DMAAN) is proposed to solve the aforementioned issues and detect the disease-related biomarkers of NDs simultaneously. First, the imaging data are nonlinearly projected into a latent space and imaging representations can be achieved. The imaging representations are further disentangled into common and specific parts by using a multimodal-disentangled module. Second, the genetic data are encoded to achieve genetic representations, and then, the achieved genetic representations are nonlinearly mapped to the common and specific imaging representations to build nonlinear associations between imaging and genetic data through an association analysis module. Moreover, modality mask vectors are synchronously synthesized to integrate the genetic and imaging data, which helps the following disease diagnosis. Finally, the proposed method achieves reasonable diagnosis performance via a disease diagnosis module and utilizes the label information to detect the disease-related modality-shared and modality-specific biomarkers. Furthermore, the genetic representation can be used to impute the missing multimodal data with our learning strategy. Two publicly available datasets with different NDs are used to demonstrate the effectiveness of the proposed DMAAN. The experimental results show that the proposed DMAAN can identify the disease-related biomarkers, which suggests the proposed DMAAN may provide new insights into the pathological mechanism and early diagnosis of NDs. The codes are publicly available at https://github.com/Meiyan88/DMAAN.
Collapse
Affiliation(s)
- Tao Wang
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Xiumei Chen
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Jiawei Zhang
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Qianjin Feng
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou 510515, China.
| | - Meiyan Huang
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou 510515, China.
| |
Collapse
|
9
|
Kong W, Xu Y, Wang S, Wei K, Wen G, Yu Y, Zhu Y. A Novel Longitudinal Phenotype-Genotype Association Study Based on Deep Feature Extraction and Hypergraph Models for Alzheimer's Disease. Biomolecules 2023; 13:biom13050728. [PMID: 37238598 DOI: 10.3390/biom13050728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 03/30/2023] [Accepted: 04/18/2023] [Indexed: 05/28/2023] Open
Abstract
Traditional image genetics primarily uses linear models to investigate the relationship between brain image data and genetic data for Alzheimer's disease (AD) and does not take into account the dynamic changes in brain phenotype and connectivity data across time between different brain areas. In this work, we proposed a novel method that combined Deep Subspace reconstruction with Hypergraph-Based Temporally-constrained Group Sparse Canonical Correlation Analysis (DS-HBTGSCCA) to discover the deep association between longitudinal phenotypes and genotypes. The proposed method made full use of dynamic high-order correlation between brain regions. In this method, the deep subspace reconstruction technique was applied to retrieve the nonlinear properties of the original data, and hypergraphs were used to mine the high-order correlation between two types of rebuilt data. The molecular biological analysis of the experimental findings demonstrated that our algorithm was capable of extracting more valuable time series correlation from the real data obtained by the AD neuroimaging program and finding AD biomarkers across multiple time points. Additionally, we used regression analysis to verify the close relationship between the extracted top brain areas and top genes and found the deep subspace reconstruction approach with a multi-layer neural network was helpful in enhancing clustering performance.
Collapse
Affiliation(s)
- Wei Kong
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave., Shanghai 201306, China
| | - Yufang Xu
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave., Shanghai 201306, China
| | - Shuaiqun Wang
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave., Shanghai 201306, China
| | - Kai Wei
- Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Gen Wen
- Department of Orthopedic Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Yaling Yu
- Department of Orthopedic Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
- Institute of Microsurgery on Extremities, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Yuemin Zhu
- CREATIS UMR 5220, U1294, CNRS, Inserm, INSA Lyon, University Lyon, 69621 Lyon, France
| |
Collapse
|
10
|
Kong W, Xu F, Wang S, Wei K, Wen G, Yu Y. Application of orthogonal sparse joint non-negative matrix factorization based on connectivity in Alzheimer's disease research. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:9923-9947. [PMID: 37322917 DOI: 10.3934/mbe.2023435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Based on the mining of micro- and macro-relationships of genetic variation and brain imaging data, imaging genetics has been widely applied in the early diagnosis of Alzheimer's disease (AD). However, effective integration of prior knowledge remains a barrier to determining the biological mechanism of AD. This paper proposes a new connectivity-based orthogonal sparse joint non-negative matrix factorization (OSJNMF-C) method based on integrating the structural magnetic resonance image, single nucleotide polymorphism and gene expression data of AD patients; the correlation information, sparseness, orthogonal constraint and brain connectivity information between the brain image data and genetic data are designed as constraints in the proposed algorithm, which efficiently improved the accuracy and convergence through multiple iterative experiments. Compared with the competitive algorithm, OSJNMF-C has significantly smaller related errors and objective function values than the competitive algorithm, showing its good anti-noise performance. From the biological point of view, we have identified some biomarkers and statistically significant relationship pairs of AD/mild cognitive impairment (MCI), such as rs75277622 and BCL7A, which may affect the function and structure of multiple brain regions. These findings will promote the prediction of AD/MCI.
Collapse
Affiliation(s)
- Wei Kong
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Feifan Xu
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Shuaiqun Wang
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Kai Wei
- Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Gen Wen
- Department of Orthopedic Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Yaling Yu
- Department of Orthopedic Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
- Institute of Microsurgery on Extremities, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| |
Collapse
|
11
|
Hartmann K, Sadée CY, Satwah I, Carrillo-Perez F, Gevaert O. Imaging genomics: data fusion in uncovering disease heritability. Trends Mol Med 2023; 29:141-151. [PMID: 36470817 PMCID: PMC10507799 DOI: 10.1016/j.molmed.2022.11.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 10/28/2022] [Accepted: 11/03/2022] [Indexed: 12/04/2022]
Abstract
Sequencing of the human genome in the early 2000s enabled probing of the genetic basis of disease on a scale previously unimaginable. Now, two decades later, after interrogating millions of markers in thousands of individuals, a significant portion of disease heritability still remains hidden. Recent efforts to unravel this 'missing heritability' have focused on garnering new insight from merging different data types, including medical imaging. Imaging offers promising intermediate phenotypes to bridge the gap between genetic variation and disease pathology. In this review we outline this fusion and provide examples of imaging genomics in a range of diseases, from oncology to cardiovascular and neurodegenerative disease. Finally, we discuss how ongoing revolutions in data science and sharing are primed to advance the field.
Collapse
Affiliation(s)
- Katherine Hartmann
- Department of Radiology, Hospital of the University of Pennsylvania, Philadelphia, PA, USA.
| | - Christoph Y Sadée
- Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, CA, USA
| | - Ishan Satwah
- College of Medicine, Drexel University, Philadelphia, PA, USA
| | - Francisco Carrillo-Perez
- Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, CA, USA; Department of Computer Architecture and Technology, University of Granada. C.I.T.I.C., Granada, Spain
| | - Olivier Gevaert
- Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, CA, USA.
| |
Collapse
|
12
|
Zhuang J, Tian J, Xiong X, Li T, Chen Z, Chen R, Chen J, Li X. Associating brain imaging phenotypes and genetic risk factors via a hypergraph based netNMF method. Front Aging Neurosci 2023; 15:1052783. [PMID: 36936501 PMCID: PMC10017840 DOI: 10.3389/fnagi.2023.1052783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Accepted: 02/08/2023] [Indexed: 03/06/2023] Open
Abstract
Abstract Alzheimer's disease (AD) is a severe neurodegenerative disease for which there is currently no effective treatment. Mild cognitive impairment (MCI) is an early disease that may progress to AD. The effective diagnosis of AD and MCI in the early stage has important clinical significance. Methods To this end, this paper proposed a hypergraph-based netNMF (HG-netNMF) algorithm for integrating structural magnetic resonance imaging (sMRI) of AD and MCI with corresponding gene expression profiles. Results Hypergraph regularization assumes that regions of interest (ROIs) and genes were located on a non-linear low-dimensional manifold and can capture the inherent prevalence of two modalities of data and mined high-order correlation features of the two data. Further, this paper used the HG-netNMF algorithm to construct a brain structure connection network and a protein interaction network (PPI) with potential role relationships, mine the risk (ROI) and key genes of both, and conduct a series of bioinformatics analyses. Conclusion Finally, this paper used the risk ROI and key genes of the AD and MCI groups to construct diagnostic models. The AUC of the AD group and MCI group were 0.8 and 0.797, respectively.
Collapse
Affiliation(s)
- Junli Zhuang
- Department of Vascular Surgery, Renmin Hospital of Wuhan University, Wuhan, China
| | - Jinping Tian
- Faculty of Medicine, Jianghan University, Wuhan, China
| | - Xiaoxing Xiong
- Central Laboratory, Renmin Hospital of Wuhan University, Wuhan, China
- *Correspondence: Xiaoxing Xiong,
| | - Taihan Li
- Department of Clinical Laboratory, The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital, Shenzhen, China
- Taihan Li,
| | - Zhengwei Chen
- Department of Radiology, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
| | - Rong Chen
- Department of Radiology, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
| | - Jun Chen
- Department of Radiology, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
| | - Xiang Li
- School of Health, Wuhan University, Wuhan, China
| |
Collapse
|
13
|
Wang Y, Tang S, Ma R, Zamit I, Wei Y, Pan Y. Multi-modal intermediate integrative methods in neuropsychiatric disorders: A review. Comput Struct Biotechnol J 2022; 20:6149-6162. [PMID: 36420153 PMCID: PMC9674886 DOI: 10.1016/j.csbj.2022.11.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 11/04/2022] [Accepted: 11/04/2022] [Indexed: 11/09/2022] Open
Abstract
The etiology of neuropsychiatric disorders involves complex biological processes at different omics layers, such as genomics, transcriptomics, epigenetics, proteomics, and metabolomics. The advent of high-throughput technology, as well as the availability of large open-source datasets, has ushered in a new era in system biology, necessitating the integration of various types of omics data. The complexity of biological mechanisms, the limitations of integrative strategies, and the heterogeneity of multi-omics data have all presented significant challenges to computational scientists. In comparison to early and late integration, intermediate integration may transform each data type into appropriate intermediate representations using various data transformation techniques, allowing it to capture more complementary information contained in each omics and highlight new interactions across omics layers. Here, we reviewed multi-modal intermediate integrative techniques based on component analysis, matrix factorization, similarity network, multiple kernel learning, Bayesian network, artificial neural networks, and graph transformation, as well as their applications in neuropsychiatric domains. We depicted advancements in these approaches and compared the strengths and weaknesses of each method examined. We believe that our findings will aid researchers in their understanding of the transformation and integration of multi-omics data in neuropsychiatric disorders.
Collapse
Affiliation(s)
- Yanlin Wang
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China
| | - Shi Tang
- Li Chiu Kong Family Sleep Assessment Unit, Department of Psychiatry, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong Special Administrative Region
| | - Ruimin Ma
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China
| | - Ibrahim Zamit
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yanjie Wei
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China
| | - Yi Pan
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China
| |
Collapse
|
14
|
Ma Z, Chen B, Zhang Y, Zeng J, Tao J, Hu Y. Integration of RNA molecules data with prior-knowledge driven Joint Deep Semi-Negative Matrix Factorization for heart failure study. Front Genet 2022; 13:967363. [PMID: 36299595 PMCID: PMC9589260 DOI: 10.3389/fgene.2022.967363] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 09/27/2022] [Indexed: 12/04/2022] Open
Abstract
Heart failure (HF) is the main manifestation of cardiovascular disease. Recent studies have shown that various RNA molecules and their complex connections play an essential role in HF’s pathogenesis and pathological progression. This paper aims to mine key RNA molecules associated with HF. We proposed a Prior-knowledge Driven Joint Deep Semi-Negative Matrix Factorization (PD-JDSNMF) model that uses a hierarchical nonlinear feature extraction method that integrates three types of data: mRNA, lncRNA, and miRNA. The PPI information is added to the model as prior knowledge, and the Laplacian constraint is used to help the model resist the noise in the genetic data. We used the PD-JDSNMF algorithm to identify significant co-expression modules. The elements in the module are then subjected to bioinformatics analysis and algorithm performance analysis. The results show that the PD-JDSNMF algorithm can robustly select biomarkers associated with HF. Finally, we built a heart failure diagnostic model based on multiple classifiers and using the Top 13 genes in the significant module, the AUC of the internal test set was up to 0.8714, and the AUC of the external validation set was up to 0.8329, which further confirmed the effectiveness of the PD-JDSNMF algorithm.
Collapse
|
15
|
Huang M, Zhang X, Chen X, Mai Y, Wu X, Zhao J, Feng Q. Joint-Channel-Connectivity-Based Feature Selection and Classification on fNIRS for Stress Detection in Decision-Making. IEEE Trans Neural Syst Rehabil Eng 2022; 30:1858-1869. [PMID: 35788456 DOI: 10.1109/tnsre.2022.3188560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Stress is one of the contributing factors affecting decision-making. Therefore, early stress recognition is essential to improve clinicians' decision-making performance. Functional near-infrared spectroscopy (fNIRS) has shown great potential in detecting stress. However, the majority of previous studies only used fNIRS features at the individual level for classification without considering the correlations among channels corresponding to the brain, which may provide distinguishing features. Hence, this study proposes a novel joint-channel-connectivity-based feature selection and classification algorithm for fNIRS to detect stress in decision-making. Specifically, this approach integrates feature selection and classifier modeling into a sparse model, where intra- and inter-channel regularizers are designed to explore potential correlations among channels to obtain discriminating features. In this paper, we simulated the decision-making of medical students under stress through the Trier Social Stress Test and the Balloon Analog Risk Task and recorded their cerebral hemodynamic alterations by fNIRS device. Experimental results illustrated that our method with the accuracy of 0.961 is superior to other machine learning methods. Additionally, the stress correlation and connectivity of brain regions calculated by feature selection have been confirmed in previous studies, which validates the effectiveness of our method and helps optimize the channel settings of fNIRS. This work was the first attempt to utilize a sparse model that simultaneously considers the sparsity of features and the correlation of brain regions for stress detection and obtained an admirable classification performance. Thus, the proposed model might be a useful tool for medical personnel to automatically detect stress in clinical decision-making situations.
Collapse
|
16
|
Wang S, Chen H, Kong W, Ke F, Wei K. Identify Biomarkers of Alzheimer's Disease Based on Multi-task Canonical Correlation Analysis and Regression Model. J Mol Neurosci 2022; 72:1749-1763. [PMID: 35698015 DOI: 10.1007/s12031-022-02031-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 05/21/2022] [Indexed: 11/29/2022]
Abstract
Imaging genetics using imaging technology is regarded as a neuroanatomical phenotype to evaluate gene single nucleotide polymorphisms and their effects on the structure and function of different brain regions. It plays a vital role in bridging the initial understanding of the genetic basis of brain structure and dysfunction. Sparse canonical correlation analysis (SCCA) has become a widespread technique in this field because of its powerful ability to identify bivariate relationships and feature selection. Since most traditional SCCA algorithms assume that the input features are independent, this method obviously cannot be used to analyze genetic image data. The MT-SCCA model is unsupervised and cannot identify the genotype-phenotype associations for diagnostic guidance. Meanwhile, a single biological clinical index cannot fully reflect the physiological process of a comprehensive disease. Therefore, it is necessary to find biomarkers that can reflect Alzheimer's disease and physiological functions that can more comprehensively reflect the development of the disease. This article uses a multi-task sparse canonical correlation analysis and regression (MT-SCCAR) model to combine the annual depression level total score (GDSCALE), clinical dementia assessment scale (GLOBAL CDR), functional activity questionnaire (FAQ), and neuropsychiatric Symptom Questionnaire (NPI-Q) in this paper. These four clinical data are used as compensation information and embedded in the algorithm in a linear regression manner. It also reflects its superiority and robustness compared to traditional correlation analysis methods on actual and simulated data. Meanwhile, compared with MT-SCCA, the model utilized in this paper obtains a higher gene-ROI weight and identifies clearer biomarkers, which provides a practical basis for the study of complex human disease pathology.
Collapse
Affiliation(s)
- Shuaiqun Wang
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave, Shanghai, 201306, People's Republic of China.
| | - Huiqiu Chen
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave, Shanghai, 201306, People's Republic of China
| | - Wei Kong
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave, Shanghai, 201306, People's Republic of China
| | - Fengchun Ke
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave, Shanghai, 201306, People's Republic of China
| | - Kai Wei
- College of Information Engineering, Shanghai Maritime University, 1550 Haigang Ave, Shanghai, 201306, People's Republic of China
| |
Collapse
|
17
|
Structure-constrained combination-based nonlinear association analysis between incomplete multimodal imaging and genetic data for biomarker detection of neurodegenerative diseases. Med Image Anal 2022; 78:102419. [DOI: 10.1016/j.media.2022.102419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Revised: 02/15/2022] [Accepted: 03/10/2022] [Indexed: 11/18/2022]
|
18
|
Kim M, Min EJ, Liu K, Yan J, Saykin AJ, Moore JH, Long Q, Shen L. Multi-task learning based structured sparse canonical correlation analysis for brain imaging genetics. Med Image Anal 2022; 76:102297. [PMID: 34871929 PMCID: PMC8792314 DOI: 10.1016/j.media.2021.102297] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 09/08/2021] [Accepted: 10/29/2021] [Indexed: 02/03/2023]
Abstract
The advances in technologies for acquiring brain imaging and high-throughput genetic data allow the researcher to access a large amount of multi-modal data. Although the sparse canonical correlation analysis is a powerful bi-multivariate association analysis technique for feature selection, we are still facing major challenges in integrating multi-modal imaging genetic data and yielding biologically meaningful interpretation of imaging genetic findings. In this study, we propose a novel multi-task learning based structured sparse canonical correlation analysis (MTS2CCA) to deliver interpretable results and improve integration in imaging genetics studies. We perform comparative studies with state-of-the-art competing methods on both simulation and real imaging genetic data. On the simulation data, our proposed model has achieved the best performance in terms of canonical correlation coefficients, estimation accuracy, and feature selection accuracy. On the real imaging genetic data, our proposed model has revealed promising features of single-nucleotide polymorphisms and brain regions related to sleep. The identified features can be used to improve clinical score prediction using promising imaging genetic biomarkers. An interesting future direction is to apply our model to additional neurological or psychiatric cohorts such as patients with Alzheimer's or Parkinson's disease to demonstrate the generalizability of our method.
Collapse
Affiliation(s)
- Mansu Kim
- Department of Artificial Intelligence, Catholic University of Korea, Bucheon, Republic of Korea
| | - Eun Jeong Min
- College of Medicine, Catholic University of Korea, Seoul, Republic of Korea
| | - Kefei Liu
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
| | - Jingwen Yan
- School of Informatics and Computing, Indiana University, IN, USA
| | | | - Jason H Moore
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
| | - Qi Long
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
| | - Li Shen
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA.
| |
Collapse
|
19
|
Wang S, Wu X, Wei K, Kong W. An Improved Fusion Paired Group Lasso Structured Sparse Canonical Correlation Analysis Based on Brain Imaging Genetics to Identify Biomarkers of Alzheimer’s Disease. Front Aging Neurosci 2022; 13:817520. [PMID: 35069181 PMCID: PMC8770861 DOI: 10.3389/fnagi.2021.817520] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 12/14/2021] [Indexed: 01/01/2023] Open
Abstract
Brain imaging genetics can demonstrate the complicated relationship between genetic factors and the structure or function of the humankind brain. Therefore, it has become an important research topic and attracted more and more attention from scholars. The structured sparse canonical correlation analysis (SCCA) model has been widely used to identify the association between brain image data and genetic data in imaging genetics. To investigate the intricate genetic basis of cerebrum imaging phenotypes, a great deal of other standard SCCA methods combining different interested structed have now appeared. For example, some models use group lasso penalty, and some use the fused lasso or the graph/network guided fused lasso for feature selection. However, prior knowledge may not be completely available and the group lasso methods have limited capabilities in practical applications. The graph/network guided approaches can use sample correlation to define constraints, thereby overcoming this problem. Unfortunately, this also has certain limitations. The graph/network conducted methods are susceptible to the sign of the sample correlation of the data, which will affect the stability of the model. To improve the efficiency and stability of SCCA, a sparse canonical correlation analysis model with GraphNet regularization (FGLGNSCCA) is proposed in this manuscript. Based on the FGLSCCA model, the GraphNet regularization penalty is imposed in our study and an optimization algorithm is presented to optimize the model. The structural Magnetic Resonance Imaging (sMRI) and gene expression data are used in this study to find the genotype and characteristics of brain regions associated with Alzheimer’s disease (AD). Experiment results shown that the new FGLGNSCCA model proposed in this manuscript is superior or equivalent to traditional methods in both artificially synthesized neuroimaging genetics data or actual neuroimaging genetics data. It can select essential features more powerfully compared with other multivariate methods and identify significant canonical correlation coefficients as well as captures more significant typical weight patterns which demonstrated its excellent ability in finding biologically important imaging genetic relations.
Collapse
|
20
|
Kim M, Kim J, Qu J, Huang H, Long Q, Sohn KA, Kim D, Shen L. Interpretable temporal graph neural network for prognostic prediction of Alzheimer's disease using longitudinal neuroimaging data. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2021; 2021:1381-1384. [PMID: 35299717 PMCID: PMC8922159 DOI: 10.1109/bibm52615.2021.9669504] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Alzheimer's disease (AD) is a progressive neurodegenerative brain disorder characterized by memory loss and cognitive decline. Early detection and accurate prognosis of AD is an important research topic, and numerous machine learning methods have been proposed to solve this problem. However, traditional machine learning models are facing challenges in effectively integrating longitudinal neuroimaging data and biologically meaningful structure and knowledge to build accurate and interpretable prognostic predictors. To bridge this gap, we propose an interpretable graph neural network (GNN) model for AD prognostic prediction based on longitudinal neuroimaging data while embracing the valuable knowledge of structural brain connectivity. In our empirical study, we demonstrate that 1) the proposed model outperforms several competing models (i.e., DNN, SVM) in terms of prognostic prediction accuracy, and 2) our model can capture neuroanatomical contribution to the prognostic predictor and yield biologically meaningful interpretation to facilitate better mechanistic understanding of the Alzheimer's disease. Source code is available at https://github.com/JaesikKim/temporal-GNN.
Collapse
Affiliation(s)
- Mansu Kim
- Department of Artificial Intelligence, Catholic University of Korea, Bucheon, South Korea
| | - Jaesik Kim
- Department of Computer Engineering, Ajou University, Suwon, South Korea
| | - Jeffrey Qu
- School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, USA
| | - Heng Huang
- Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, USA
| | - Qi Long
- Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, USA
| | - Kyung-Ah Sohn
- Department of Artificial Intelligence, Ajou University, Suwon, South Korea
| | - Dokyoon Kim
- Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, USA
| | - Li Shen
- Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, USA
| |
Collapse
|
21
|
Associating brain imaging phenotypes and genetic in Alzheimer's disease via JSCCA approach with autocorrelation constraints. Med Biol Eng Comput 2021; 60:95-108. [PMID: 34714488 DOI: 10.1007/s11517-021-02439-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Accepted: 09/02/2021] [Indexed: 10/20/2022]
Abstract
Imaging genetics research can explore the potential correlation between imaging and genomics. Most association analysis methods cannot effectively use the prior knowledge of the original data. In this respect, we add the prior knowledge of each original data to mine more effective biomarkers. The study of imaging genetics based on the sparse canonical correlation analysis (SCCA) is helpful to mine the potential biomarkers of neurological diseases. To improve the performance and interpretability of SCCA, we proposed a penalty method based on the autocorrelation matrix for discovering the possible biological mechanism between single nucleotide polymorphisms (SNP) variations and brain regions changes of Alzheimer's disease (AD). The addition of the penalty allows the proposed algorithm to analyze the correlation between different modal features. The proposed algorithm obtains more biologically interpretable ROIs and SNPs that are significantly related to AD, which has better anti-noise performance. Compared with other SCCA-based algorithms (JCB-SCCA, JSNMNMF), the proposed algorithm can still maintain a stronger correlation with ground truth even when the noise is larger. Then, we put the regions of interest (ROI) selected by the three algorithms into the SVM classifier. The proposed algorithm has higher classification accuracy. Also, we use ridge regression with SNPs selected by three algorithms and four AD risk ROIs. The proposed algorithm has a smaller root mean square error (RMSE). It shows that proposed algorithm has a good ability in association recognition and feature selection. Furthermore, it selects important features more stably, improving the clinical diagnosis of new potential biomarkers.
Collapse
|
22
|
Identifying Biomarkers of Alzheimer's Disease via a Novel Structured Sparse Canonical Correlation Analysis Approach. J Mol Neurosci 2021; 72:323-335. [PMID: 34570360 DOI: 10.1007/s12031-021-01915-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 09/09/2021] [Indexed: 02/05/2023]
Abstract
Using correlation analysis to study the potential connection between brain genetics and imaging has become an effective method to understand neurodegenerative diseases. Sparse canonical correlation analysis (SCCA) makes it possible to study high-dimensional genetic information. The traditional SCCA methods can only process single-modal genetic and image data, which to some extent weaken the close connection of the brain's biological network. In some recently proposed multimodal SCCA methods, due to the limitations of penalty items, the pre-processed data needs to be further filtered to make the dimensions uniform, which may destroy the potential association of data in the same modal. In this research, in order to combine data between different modalities and to ensure that the chain relationship or graph network relationship within the same modality will not be destroyed, the original generalized fused lasso penalty was replaced with the fused pairwise group lasso (FGL) and the graph-guided pairwise group lasso (GGL) based on the method of joint sparse canonical correlation analysis (JSCCA). We used prior knowledge to construct a supervised bivariate learning model and use linear regression to select quantitative traits (QTs) of images that are strongly correlated with the Mini-mental State Examination (MMSE) scores. Compared with FGL-SCCA, the model we constructed obtained a higher gene-ROI correlation coefficient and identified more significant biomarkers, providing a theoretical basis for further understanding the complex pathology of neurodegenerative diseases.
Collapse
|
23
|
Simi V, Reddy Edla D, Joseph J. A no-reference metric to assess quality of denoising for Magnetic Resonance images. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
24
|
Integration of Imaging Genomics Data for the Study of Alzheimer's Disease Using Joint-Connectivity-Based Sparse Nonnegative Matrix Factorization. J Mol Neurosci 2021; 72:255-272. [PMID: 34410569 DOI: 10.1007/s12031-021-01888-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 07/06/2021] [Indexed: 10/20/2022]
Abstract
Imaging genetics reveals the connection between microscopic genetics and macroscopic imaging, enabling the identification of disease biomarkers. In this work, we make full use of prior knowledge that has significant reference value for investigating the correlation between the brain and genetics to explore more biologically substantial biomarkers. In this paper, we propose joint-connectivity-based sparse nonnegative matrix factorization (JCB-SNMF). The algorithm simultaneously projects structural magnetic resonance imaging (sMRI), single-nucleotide polymorphism sites (SNPs), and gene expression data onto a common feature space, where heterogeneous variables with large coefficients in the same projection direction form a common module. In addition, the connectivity information for each region of the brain and genetic data are added as prior knowledge to identify regions of interest (ROIs), SNPs, and gene-related risks related to Alzheimer's disease (AD) patients. GraphNet regularization increases the anti-noise performance of the algorithm and the biological interpretability of the results. The simulation results show that compared with other NMF-based algorithms (JNMF, JSNMNMF), JCB-SNMF has better anti-noise performance and can identify and predict biomarkers closely related to AD from significant modules. By constructing a protein-protein interaction (PPI) network, we identified SF3B1, RPS20, and RBM14 as potential biomarkers of AD. We also found some significant SNP-ROI and gene-ROI pairs. Among them, two SNPs rs4472239 and rs11918049 and three genes KLHL8, ZC3H11A, and OSGEPL1 may have effects on the gray matter volume of multiple brain regions. This model provides a new way to further integrate multimodal impact genetic data to identify complex disease association patterns.
Collapse
|
25
|
Huang M, Lai H, Yu Y, Chen X, Wang T, Feng Q. Deep-gated recurrent unit and diet network-based genome-wide association analysis for detecting the biomarkers of Alzheimer's disease. Med Image Anal 2021; 73:102189. [PMID: 34343841 DOI: 10.1016/j.media.2021.102189] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Revised: 05/30/2021] [Accepted: 07/16/2021] [Indexed: 01/01/2023]
Abstract
Genome-wide association analysis (GWAS) is a commonly used method to detect the potential biomarkers of Alzheimer's disease (AD). Most existing GWAS methods entail a high computational cost, disregard correlations among imaging data and correlations among genetic data, and ignore various associations between longitudinal imaging and genetic data. A novel GWAS method was proposed to identify potential AD biomarkers and address these problems. A network based on a gated recurrent unit was applied without imputing incomplete longitudinal imaging data to integrate the longitudinal data of variable lengths and extract an image representation. In this study, a modified diet network that can considerably reduce the number of parameters in the genetic network was proposed to perform GWAS between image representation and genetic data. Genetic representation can be extracted in this way. A link between genetic representation and AD was established to detect potential AD biomarkers. The proposed method was tested on a set of simulated data and a real AD dataset. Results of the simulated data showed that the proposed method can accurately detect relevant biomarkers. Moreover, the results of real AD dataset showed that the proposed method can detect some new risk-related genes of AD. Based on previous reports, no research has incorporated a deep-learning model into a GWAS framework to investigate the potential information on super-high-dimensional genetic data and longitudinal imaging data and create a link between imaging genetics and AD for detecting potential AD biomarkers. Therefore, the proposed method may provide new insights into the underlying pathological mechanism of AD.
Collapse
Affiliation(s)
- Meiyan Huang
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou 510515, China.
| | - Haoran Lai
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China.
| | - Yuwei Yu
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China.
| | - Xiumei Chen
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China.
| | - Tao Wang
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China.
| | - Qianjin Feng
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou 510515, China.
| | | |
Collapse
|
26
|
Kim M, Bao J, Liu K, Park BY, Park H, Baik JY, Shen L. A structural enriched functional network: An application to predict brain cognitive performance. Med Image Anal 2021; 71:102026. [PMID: 33848962 DOI: 10.1016/j.media.2021.102026] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 01/10/2021] [Accepted: 03/01/2021] [Indexed: 11/19/2022]
Abstract
The structure-function coupling in brain networks has emerged as an important research topic in modern neuroscience. The structural network could provide the backbone of the functional network. The integration of the functional network with structural information can help us better understand functional communication in the brain. This paper proposed a method to accurately estimate the brain functional network enriched by the structural network from diffusion magnetic resonance imaging. First, we adopted a simplex regression model with graph-constrained Elastic Net to construct the functional networks enriched by the structural network. Then, we compared the constructed network characteristics of this approach with several state-of-the-art competing functional network models. Furthermore, we evaluated whether the structural enriched functional network model improves the performance for predicting the cognitive-behavioral outcomes. The experiments have been performed on 218 participants from the Human Connectome Project database. The results demonstrated that our network model improves network consistency and its predictive performance compared with several state-of-the-art competing functional network models.
Collapse
Affiliation(s)
- Mansu Kim
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
| | - Jingxuan Bao
- School of Arts and Sciences, University of Pennsylvania, PA, USA
| | - Kefei Liu
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA
| | - Bo-Yong Park
- McConnell Brain Imaging Centre, Montreal Neurological Institute and Hospital, McGill University, Montreal, QC, Canada
| | - Hyunjin Park
- School of Electronic and Electrical Engineering, Sungkyunkwan University, Suwon, South Korea; Center for Neuroscience Imaging Research, Institute for Basic Science, Suwon, South Korea
| | - Jae Young Baik
- School of Arts and Sciences, University of Pennsylvania, PA, USA
| | - Li Shen
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, PA, USA.
| |
Collapse
|
27
|
Kim M, Bao J, Liu K, Park BY, Park H, Shen L. Structural Connectivity Enriched Functional Brain Network using Simplex Regression with GraphNet. MACHINE LEARNING IN MEDICAL IMAGING. MLMI (WORKSHOP) 2020; 12436:292-302. [PMID: 34766171 PMCID: PMC8580103 DOI: 10.1007/978-3-030-59861-7_30] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]
Abstract
The connectivity analysis is a powerful technique for investigating a hard-wired brain architecture as well as flexible, functional dynamics tied to human cognition. Recent multi-modal connectivity studies had the challenge of combining functional and structural connectivity information into one integrated network. In this paper, we proposed a simplex regression model with graph-constrained Elastic Net (GraphNet) to estimate functional networks enriched by structural connectivity in a biologically meaningful way with a low model complexity. Our model constructed the functional networks using sparse simplex regression framework and enriched structural connectivity information based on GraphNet constraint. We applied our model on the real neuroimaging datasets to show its ability for predicting a clinical score. Our results demonstrated that integrating multi-modal features could detect more sensitive and subtle brain biomarkers than using a single modality.
Collapse
Affiliation(s)
- Mansu Kim
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, USA
| | - Jingxaun Bao
- School of Arts and Sciences, University of Pennsylvania, USA
| | - Kefei Liu
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, USA
| | - Bo-Yong Park
- McConnell Brain Imaging Centre, Montreal Neurological Institute, McGill University, Canada
| | - Hyunjin Park
- School of Electronic and Electrical Engineering, Sungkyunkwan University, Korea
- Center for Neuroscience Imaging Research, Institute for Basic Science, Korea
| | - Li Shen
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, USA
| |
Collapse
|
28
|
Zhuang X, Yang Z, Cordes D. A technical review of canonical correlation analysis for neuroscience applications. Hum Brain Mapp 2020; 41:3807-3833. [PMID: 32592530 PMCID: PMC7416047 DOI: 10.1002/hbm.25090] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 05/23/2020] [Indexed: 12/11/2022] Open
Abstract
Collecting comprehensive data sets of the same subject has become a standard in neuroscience research and uncovering multivariate relationships among collected data sets have gained significant attentions in recent years. Canonical correlation analysis (CCA) is one of the powerful multivariate tools to jointly investigate relationships among multiple data sets, which can uncover disease or environmental effects in various modalities simultaneously and characterize changes during development, aging, and disease progressions comprehensively. In the past 10 years, despite an increasing number of studies have utilized CCA in multivariate analysis, simple conventional CCA dominates these applications. Multiple CCA-variant techniques have been proposed to improve the model performance; however, the complicated multivariate formulations and not well-known capabilities have delayed their wide applications. Therefore, in this study, a comprehensive review of CCA and its variant techniques is provided. Detailed technical formulation with analytical and numerical solutions, current applications in neuroscience research, and advantages and limitations of each CCA-related technique are discussed. Finally, a general guideline in how to select the most appropriate CCA-related technique based on the properties of available data sets and particularly targeted neuroscience questions is provided.
Collapse
Affiliation(s)
- Xiaowei Zhuang
- Cleveland Clinic Lou Ruvo Center for Brain HealthLas VegasNevadaUSA
| | - Zhengshi Yang
- Cleveland Clinic Lou Ruvo Center for Brain HealthLas VegasNevadaUSA
| | - Dietmar Cordes
- Cleveland Clinic Lou Ruvo Center for Brain HealthLas VegasNevadaUSA
- University of ColoradoBoulderColoradoUSA
- Department of Brain HealthUniversity of NevadaLas VegasNevadaUSA
| |
Collapse
|
29
|
Lee H, Park BY, Byeon K, Won JH, Kim M, Kim SH, Park H. Multivariate association between brain function and eating disorders using sparse canonical correlation analysis. PLoS One 2020; 15:e0237511. [PMID: 32785278 PMCID: PMC7423138 DOI: 10.1371/journal.pone.0237511] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Accepted: 07/28/2020] [Indexed: 12/26/2022] Open
Abstract
Eating disorder is highly associated with obesity and it is related to brain dysfunction as well. Still, the functional substrates of the brain associated with behavioral traits of eating disorder are underexplored. Existing neuroimaging studies have explored the association between eating disorder and brain function without using all the information provided by the eating disorder related questionnaire but by adopting summary factors. Here, we aimed to investigate the multivariate association between brain function and eating disorder at fine-grained question-level information. Our study is a retrospective secondary analysis that re-analyzed resting-state functional magnetic resonance imaging of 284 participants from the enhanced Nathan Kline Institute-Rockland Sample database. Leveraging sparse canonical correlation analysis, we associated the functional connectivity of all brain regions and all questions in the eating disorder questionnaires. We found that executive- and inhibitory control-related frontoparietal networks showed positive associations with questions of restraint eating, while brain regions involved in the reward system showed negative associations. Notably, inhibitory control-related brain regions showed a positive association with the degree of obesity. Findings were well replicated in the independent validation dataset (n = 34). The results of this study might contribute to a better understanding of brain function with respect to eating disorder.
Collapse
Affiliation(s)
- Hyebin Lee
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, Korea
- Center for Neuroscience Imaging Research, Institute for Basic Science (IBS), Suwon, Korea
| | - Bo-yong Park
- McConnell Brain Imaging Centre, Montreal Neurological Institute and Hospital, McGill University, Montreal, Quebec, Canada
| | - Kyoungseob Byeon
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, Korea
- Center for Neuroscience Imaging Research, Institute for Basic Science (IBS), Suwon, Korea
| | - Ji Hye Won
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, Korea
- Center for Neuroscience Imaging Research, Institute for Basic Science (IBS), Suwon, Korea
| | - Mansu Kim
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Se-Hong Kim
- Department of Family Medicine, St. Vincent's Hospital, College of Medicine, The Catholic University of Korea, Suwon, Korea
| | - Hyunjin Park
- Center for Neuroscience Imaging Research, Institute for Basic Science (IBS), Suwon, Korea
- School of Electronic and Electrical Engineering, Sungkyunkwan University, Suwon, Korea
- * E-mail:
| |
Collapse
|
30
|
Cao P, Gao J, Zhang Z. Multi-View Based Multi-Model Learning for MCI Diagnosis. Brain Sci 2020; 10:brainsci10030181. [PMID: 32244855 PMCID: PMC7139974 DOI: 10.3390/brainsci10030181] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 03/16/2020] [Indexed: 12/26/2022] Open
Abstract
Mild cognitive impairment (MCI) is the early stage of Alzheimer’s disease (AD). Automatic diagnosis of MCI by magnetic resonance imaging (MRI) images has been the focus of research in recent years. Furthermore, deep learning models based on 2D view and 3D view have been widely used in the diagnosis of MCI. The deep learning architecture can capture anatomical changes in the brain from MRI scans to extract the underlying features of brain disease. In this paper, we propose a multi-view based multi-model (MVMM) learning framework, which effectively combines the local information of 2D images with the global information of 3D images. First, we select some 2D slices from MRI images and extract the features representing 2D local information. Then, we combine them with the features representing 3D global information learned from 3D images to train the MVMM learning framework. We evaluate our model on the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. The experimental results show that our proposed model can effectively recognize MCI through MRI images (accuracy of 87.50% for MCI/HC and accuracy of 83.18% for MCI/AD).
Collapse
|