1
|
Liu J, Tang H, Feng R. Unraveling the evolutionary patterns of construction accidents: a risk assessment framework based on average mutual information theory. Sci Rep 2025; 15:13457. [PMID: 40251350 PMCID: PMC12008309 DOI: 10.1038/s41598-025-98229-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2024] [Accepted: 04/10/2025] [Indexed: 04/20/2025] Open
Abstract
Studies of the evolutionary law of construction accidents and the formulation of effective risk assessments are crucial for ensuring construction safety and reducing accidents. However, existing research often focuses exclusively on one of the four critical perspectives: human, facilities, environment, and management, with limited systematic theoretical analysis. Current risk assessments typically emphasized the impact of risks at specific points in time, neglecting the cumulative effects of risks over extended periods. This study has explored how hazardous and harmful factors affect the evolution of accidents, then has established a risk assessment model utilizing average mutual information theory. The findings indicate that "Command violations", "Command errors", and "Illegal operations" are the primary direct causes, while "Inadequate occupational safety and health management structure and staffing" and "Inadequate or unimplemented occupational health management system" are key management factors. In addition, it also provides practical guidance for construction enterprises to strengthen safety management from aspects such as safety production investment and safety management system integration. These contributions are expected to significantly improve the safety level of construction projects and reduce the occurrence of accidents.
Collapse
Affiliation(s)
- Jian Liu
- School of Resource and Safety Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Key Laboratory of High-Efficient Mining and Safety of Metal Mines of the Ministry of Education, University of Science and Technology Beijing, Beijing, 100083, China
| | - Hanqiang Tang
- School of Resource and Safety Engineering, University of Science and Technology Beijing, Beijing, 100083, China
| | - Rui Feng
- Research Institute of Macro-Safety Science, University of Science and Technology Beijing, Beijing, 100083, China.
- School of Resource and Safety Engineering, University of Science and Technology Beijing, Beijing, 100083, China.
| |
Collapse
|
2
|
Tang L, Zhang J, Shao Y, Wei Y, Li Y, Tian K, Yan X, Feng C, Zhang QC. Joint analysis of chromatin accessibility and gene expression in the same single cells reveals cancer-specific regulatory programs. Cell Syst 2025:101266. [PMID: 40262617 DOI: 10.1016/j.cels.2025.101266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2024] [Revised: 01/19/2025] [Accepted: 03/28/2025] [Indexed: 04/24/2025]
Abstract
Biological analyses conducted at the single-cell scale have revealed profound impacts of heterogeneity and plasticity of chromatin states and gene expression on physiology and cancer. Here, we developed Parallel-seq, a technology for simultaneously measuring chromatin accessibility and gene expression in the same single cells. By combining combinatorial cell indexing and droplet overloading, Parallel-seq generates high-quality data in an ultra-high-throughput fashion and at a cost two orders of magnitude lower than alternative technologies (10× Multiome and ISSAAC-seq). We applied Parallel-seq to 40 lung tumor and tumor-adjacent clinical samples and obtained over 200,000 high-quality joint scATAC-and-scRNA profiles. Leveraging this large dataset, we characterized copy-number variations (CNVs) and extrachromosomal circular DNA (eccDNA) heterogeneity in tumor cells, predicted hundreds of thousands of cell-type-specific regulatory events, and identified enhancer mutations affecting tumor progression. Our analyses highlight Parallel-seq's power in investigating epigenetic and genetic factors driving cancer development at the cell-type-specific level and its utility for revealing vulnerable therapeutic targets.
Collapse
Affiliation(s)
- Lei Tang
- State Key Laboratory of Membrane Biology, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Jinsong Zhang
- State Key Laboratory of Membrane Biology, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Yanqiu Shao
- State Key Laboratory of Membrane Biology, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Yifan Wei
- State Key Laboratory of Membrane Biology, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Yuzhe Li
- State Key Laboratory of Membrane Biology, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Kang Tian
- State Key Laboratory of Membrane Biology, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, Beijing 100084, China
| | - Xiang Yan
- Department of Medical Oncology, the Fifth Medical Center, Beijing 301 Hospital, Beijing 100039, China
| | - Changjiang Feng
- Department of Thoracic Surgery, the First Medical Center, Beijing 301 Hospital, Beijing 100039, China.
| | - Qiangfeng Cliff Zhang
- State Key Laboratory of Membrane Biology, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, Beijing 100084, China.
| |
Collapse
|
3
|
Xu L, Shao Y. ICRL: independent causality representation learning for domain generalization. Sci Rep 2025; 15:11771. [PMID: 40189711 PMCID: PMC11973159 DOI: 10.1038/s41598-025-96357-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2024] [Accepted: 03/27/2025] [Indexed: 04/09/2025] Open
Abstract
Domain generalization (DG) addresses the challenge of out-of-distribution (OOD) data; however, the reliance on statistical correlations during model development often introduces shortcut learning problems. Current approaches to mitigating these issues commonly involve the integration of causal inference, formalizing DG problems through a general structural causal model. Nevertheless, ensuring the independence of features when incorporating causal models is often overlooked, leading to spurious causal relationships. In this work, we design three independent feature modules using GAN variants (GAN, WGAN, and WGAN-GP) and select the best-performing WGAN module to integrate into the existing causal model framework, constructing an independent causal relationship learning (ICRL) model. Extensive experiments on widely used datasets demonstrate that our model, with independent causal representations, outperforms the original model in both performance and efficiency, thereby validating the effectiveness of our proposed approach. The code for ICRL can be accessed at: https://github.com/22Shao/ICRL.git .
Collapse
Affiliation(s)
- Liwen Xu
- College of Science, North China University of Technology, Beijing, 100144, China.
| | - Yuxuan Shao
- College of Science, North China University of Technology, Beijing, 100144, China
| |
Collapse
|
4
|
Pérez-González AP, de Anda-Jáuregui G, Hernández-Lemus E. Differential Transcriptional Programs Reveal Modular Network Rearrangements Associated with Late-Onset Alzheimer's Disease. Int J Mol Sci 2025; 26:2361. [PMID: 40076979 PMCID: PMC11900169 DOI: 10.3390/ijms26052361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2025] [Revised: 02/24/2025] [Accepted: 02/28/2025] [Indexed: 03/14/2025] Open
Abstract
Alzheimer's disease (AD) is a complex, genetically heterogeneous disorder. The diverse phenotypes associated with AD result from interactions between genetic and environmental factors, influencing multiple biological pathways throughout disease progression. Network-based approaches offer a way to assess phenotype-specific states. In this study, we calculated key network metrics to characterize the network transcriptional structure and organization in LOAD, focusing on genes and pathways implicated in AD pathology within the dorsolateral prefrontal cortex (DLPFC). Our findings revealed disease-specific coexpression markers associated with diverse metabolic functions. Additionally, significant differences were observed at both the mesoscopic and local levels between AD and control networks, along with a restructuring of gene coexpression and biological functions into distinct transcriptional modules. These results show the molecular reorganization of the transcriptional program occurring in LOAD, highlighting specific adaptations that may contribute to or result from cellular responses to pathological stressors. Our findings may support the development of a unified model for the causal mechanisms of AD, suggesting that its diverse manifestations arise from multiple pathways working together to produce the disease's complex clinical patho-phenotype.
Collapse
Affiliation(s)
- Alejandra Paulina Pérez-González
- División de Genómica Computacional, Instituto Nacional de Medicina Genómica, Mexico City 14610, Mexico;
- Programa de Doctorado en Ciencias Biomédicas, Unidad de Posgrado Edificio B Primer Piso, Ciudad Universitaria, Mexico City 04510, Mexico
- Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Mexico City 54090, Mexico
| | - Guillermo de Anda-Jáuregui
- División de Genómica Computacional, Instituto Nacional de Medicina Genómica, Mexico City 14610, Mexico;
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City 04510, Mexico
- Investigadores por M’exico, Conahcyt, Mexico City 03940, Mexico
| | - Enrique Hernández-Lemus
- División de Genómica Computacional, Instituto Nacional de Medicina Genómica, Mexico City 14610, Mexico;
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City 04510, Mexico
| |
Collapse
|
5
|
Hu K, Zhong B, Tian R, Yao J. Optimizing functional brain network analysis by incorporating nonlinear factors and frequency band selection with machine learning models. Medicine (Baltimore) 2025; 104:e41667. [PMID: 40020107 PMCID: PMC11875576 DOI: 10.1097/md.0000000000041667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Accepted: 02/07/2025] [Indexed: 03/05/2025] Open
Abstract
The accurate assessment of the brain's functional network is seen as crucial for the understanding of complex relationships between different brain regions. Hidden information within different frequency bands, which is often overlooked by traditional linear correlation-based methods such as Pearson correlation (PC) and partial correlation, fails to be revealed, leading to the neglect of more intricate nonlinear factors. These limitations were aimed to be overcome in this study by the combination of fast continuous wavelet transform and normalized mutual information (NMI) to develop a novel approach. Original time-domain signals from resting-state functional magnetic resonance imaging were decomposed into different frequency domains using fast continuous wavelet transform, and adjacency matrices were constructed to enhance feature separation across brain regions. Both linear and nonlinear aspects between brain regions were comprehensively considered through the integration of complex correlation coefficient and NMI. The construction of functional brain networks was enabled by the adaptive selection of optimal frequency band combinations. The construction of the model was facilitated by feature extraction using tree models with extreme gradient boosting. It was demonstrated through comparative analysis that the method outperformed baseline methods such as PC and NMI, achieving an area under the curve of 0.9054. The introduction of nonlinear factors was found to increase precision by 14.25% and recall by 17.14%. Importantly, the approach optimized the original data without significantly altering the feature topology. Overall, this innovation advances the understanding of brain function, offering more accurate potential for future research and clinical applications.
Collapse
Affiliation(s)
- Kaixing Hu
- School of Information Science and Engineering, Chongqing Jiaotong University, Chongqing, China
| | - Baohua Zhong
- School of Shipping and Naval Architecture, Chongqing Jiaotong University, Chongqing, China
| | - Renjie Tian
- School of Information Science and Engineering, Chongqing Jiaotong University, Chongqing, China
| | - Jiaming Yao
- School of Mathematics and Statistics, Chongqing Jiaotong University, Chongqing, China
| |
Collapse
|
6
|
Fan Z, Lv J, Zhang S, Gu B, Wang C, Zhang T. ISCAZIM: Integrated statistical correlation analysis for zero-inflated microbiome data. Heliyon 2025; 11:e41184. [PMID: 39811376 PMCID: PMC11730854 DOI: 10.1016/j.heliyon.2024.e41184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Revised: 12/05/2024] [Accepted: 12/11/2024] [Indexed: 01/16/2025] Open
Abstract
Microbiome-metabolome association analysis is critical to reveal the key pairs of gut microbiota and metabolites for discovery of the microbial biomarkers in chronic diseases. However, the characteristics of microbiome data, such as zero inflation, over dispersion, may impair the confidence of association analysis between microbiome and metabolome data. The objectives of this study are to evaluate the strengths and weaknesses of existing statistical methods and to develop a computational framework tailored to the unique characteristics of microbiome data. We designed a computational framework called Integrated Statistical Correlation Analysis for Zero-Inflated Microbiome data (ISCAZIM) that takes account of complicated microbiome data characteristics, including zero inflation rates (ZIRs), dispersion and correlation patterns. ISCAZIM first benchmarked prevalent statistical correlation methods, Pearson, Spearman, zero inflated negative binomial (ZINB) model, mutual information and Maximal Information Coefficient. ISCAZIM then classifies the correlation pattern to linear or non-linear and applies the correlation method according to the ZIRs status. Applying to multiple real-world microbiome-metabolomics data, ISCAZIM is overall more accurate than using a single method with more truly significant association pairs included. Therefore, ISCAZIM will significantly facilitate the association analysis using zero-inflated microbiome data for multi-omics integration.
Collapse
Affiliation(s)
- Zhe Fan
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, China
- National Institute of Health Data Science of China, Shandong University, Jinan, 250012, China
| | - Jiali Lv
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, China
- National Institute of Health Data Science of China, Shandong University, Jinan, 250012, China
| | - Shuai Zhang
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, China
- National Institute of Health Data Science of China, Shandong University, Jinan, 250012, China
| | - Bingbing Gu
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, China
- National Institute of Health Data Science of China, Shandong University, Jinan, 250012, China
| | - Cheng Wang
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, China
- National Institute of Health Data Science of China, Shandong University, Jinan, 250012, China
| | - Tao Zhang
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, China
- National Institute of Health Data Science of China, Shandong University, Jinan, 250012, China
| |
Collapse
|
7
|
Bhattacharya S, He Y, Chen Y, Mohanty A, Grishaev A, Kulkarni P, Salgia R, Orban J. Conformational dynamics and multi-modal interaction of Paxillin with the Focal Adhesion Targeting Domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.01.630265. [PMID: 39803547 PMCID: PMC11722443 DOI: 10.1101/2025.01.01.630265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2025]
Abstract
Paxillin (PXN) and focal adhesion kinase (FAK) are two major components of the focal adhesion complex, a multiprotein structure linking the intracellular cytoskeleton to the cell exterior. PXN interacts directly with the C-terminal targeting domain of FAK (FAT) via its intrinsically disordered N-terminal domain. This interaction is necessary and sufficient for localizing FAK to focal adhesions. Furthermore, PXN serves as a platform for recruiting other proteins that together control the dynamic changes needed for cell migration and survival. Here, we show that the PXN disordered region undergoes large-scale conformational restriction upon binding to FAT, forming a 48-kDa multi-modal complex consisting of four major interconverting states. Although the complex is flexible, each state has unique sets of contacts involving disordered regions that are both highly represented in ensembles and conserved. Moreover, conserved intramolecular contacts from glutamine-rich regions in PXN contribute to high entropy and thus stability of the FAT bound complex. As PXN is a hub protein, the results provide a structural basis for understanding how perturbations that lead to cellular network rewiring, such as ligand binding and phosphorylation, may lead to shifts in the multi-state equilibrium and phenotypic switching.
Collapse
Affiliation(s)
- Supriyo Bhattacharya
- Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, Duarte National Medical Center, CA 91010-3000, USA
- These authors contributed equally
| | - Yanan He
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, 20850, USA
- These authors contributed equally
| | - Yihong Chen
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, 20850, USA
- These authors contributed equally
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010-3000, USA
| | - Alexander Grishaev
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, 20850, USA
- National Institute of Standards and Technology, Gaithersburg, MD, 20850 USA
| | - Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010-3000, USA
- Department of Systems Biology, City of Hope National Medical Center, Duarte, CA 91010-3000, USA
| | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010-3000, USA
| | - John Orban
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, 20850, USA
- Department of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742, USA
| |
Collapse
|
8
|
Long S, Xia Y, Liang L, Yang Y, Xie H, Wang X. PyNetCor: a high-performance Python package for large-scale correlation analysis. NAR Genom Bioinform 2024; 6:lqae177. [PMID: 39703431 PMCID: PMC11655297 DOI: 10.1093/nargab/lqae177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 11/19/2024] [Accepted: 11/28/2024] [Indexed: 12/21/2024] Open
Abstract
The development of multi-omics technologies has generated an abundance of biological datasets, providing valuable resources for investigating potential relationships within complex biological systems. However, most correlation analysis tools face computational challenges when dealing with these high-dimensional datasets containing millions of features. Here, we introduce pyNetCor, a fast and scalable tool for constructing correlation networks on large-scale and high-dimensional data. PyNetCor features optimized algorithms for both full correlation coefficient matrix computation and top-k correlation search, outperforming other tools in the field in terms of runtime and memory consumption. It utilizes a linear interpolation strategy to rapidly estimate P-values and achieve false discovery rate control, demonstrating a speedup of over 110 times compared to existing methods. Overall, pyNetCor supports large-scale correlation analysis, a crucial foundational step for various bioinformatics workflows, and can be easily integrated into downstream applications to accelerate the process of extracting biological insights from data.
Collapse
Affiliation(s)
- Shibin Long
- Department of Data Science, 01Life Institute, Shenzhen 518000, China
| | - Yan Xia
- Department of Data Science, 01Life Institute, Shenzhen 518000, China
- State Key Laboratory of Pharmaceutical Biotechnology, Chemistry and Biomedicine Innovation Center (ChemBIC), School of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Lifeng Liang
- Department of Data Science, 01Life Institute, Shenzhen 518000, China
| | - Ying Yang
- Department of Data Science, 01Life Institute, Shenzhen 518000, China
| | - Hailiang Xie
- Department of Data Science, 01Life Institute, Shenzhen 518000, China
| | - Xiaokai Wang
- Department of Data Science, 01Life Institute, Shenzhen 518000, China
| |
Collapse
|
9
|
Stylianou O, Susi G, Hoffmann M, Suárez-Méndez I, López-Sanz D, Schirner M, Ritter P. Multiscale detrended cross-correlation coefficient: estimating coupling in non-stationary neurophysiological signals. Front Neurosci 2024; 18:1422085. [PMID: 39605794 PMCID: PMC11599215 DOI: 10.3389/fnins.2024.1422085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 10/23/2024] [Indexed: 11/29/2024] Open
Abstract
The brain consists of a vastly interconnected network of regions, the connectome. By estimating the statistical interdependence of neurophysiological time series, we can measure the functional connectivity (FC) of this connectome. Pearson's correlation (r P) is a common metric of coupling in FC studies. Yet r P does not account properly for the non-stationarity of the signals recorded in neuroimaging. In this study, we introduced a novel estimator of coupled dynamics termed multiscale detrended cross-correlation coefficient (MDC3). Firstly, we showed that MDC3 had higher accuracy compared to r P and lagged covariance using simulated time series with known coupling, as well as simulated functional magnetic resonance imaging (fMRI) signals with known underlying structural connectivity. Next, we computed functional brain networks based on empirical magnetoencephalography (MEG) and fMRI. We found that by using MDC3 we could construct networks of healthy populations with significantly different properties compared to r P networks. Based on our results, we believe that MDC3 is a valid alternative to r P that should be incorporated in future FC studies.
Collapse
Affiliation(s)
- Orestis Stylianou
- Berlin Institute of Health at Charité, Universitätsmedizin Berlin, Berlin, Germany
- Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt Universität zu Berlin, Department of Neurology with Experimental Neurology, Berlin, Germany
- Department of Surgery, Immanuel Clinic Rüdersdorf, University Clinic of Brandenburg Medical School, Berlin, Germany
| | - Gianluca Susi
- Department of Structure of Matter, Thermal Physics and Electronics, Complutense University of Madrid, Madrid, Spain
- Center for Cognitive and Computational Neuroscience, Complutense University of Madrid, Madrid, Spain
| | - Martin Hoffmann
- Berlin Institute of Health at Charité, Universitätsmedizin Berlin, Berlin, Germany
- Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt Universität zu Berlin, Department of Neurology with Experimental Neurology, Berlin, Germany
| | - Isabel Suárez-Méndez
- Department of Structure of Matter, Thermal Physics and Electronics, Complutense University of Madrid, Madrid, Spain
- Center for Cognitive and Computational Neuroscience, Complutense University of Madrid, Madrid, Spain
| | - David López-Sanz
- Center for Cognitive and Computational Neuroscience, Complutense University of Madrid, Madrid, Spain
- Department of Experimental Psychology, Cognitive Processes and Speech Therapy, Complutense University of Madrid, Madrid, Spain
| | - Michael Schirner
- Berlin Institute of Health at Charité, Universitätsmedizin Berlin, Berlin, Germany
- Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt Universität zu Berlin, Department of Neurology with Experimental Neurology, Berlin, Germany
- Bernstein Focus State Dependencies of Learning & Bernstein Center for Computational Neuroscience, Berlin, Germany
- Einstein Center for Neuroscience Berlin, Berlin, Germany
- Einstein Center Digital Future, Berlin, Germany
| | - Petra Ritter
- Berlin Institute of Health at Charité, Universitätsmedizin Berlin, Berlin, Germany
- Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt Universität zu Berlin, Department of Neurology with Experimental Neurology, Berlin, Germany
- Bernstein Focus State Dependencies of Learning & Bernstein Center for Computational Neuroscience, Berlin, Germany
- Einstein Center for Neuroscience Berlin, Berlin, Germany
- Einstein Center Digital Future, Berlin, Germany
| |
Collapse
|
10
|
Sun T, Sun D, Kuang J, Chao X, Guo Y, Li M, Chen T. A cross-omics data analysis strategy for metabolite-microbe pair identification. Proteomics 2024; 24:e2400035. [PMID: 38994817 DOI: 10.1002/pmic.202400035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 06/22/2024] [Accepted: 07/02/2024] [Indexed: 07/13/2024]
Abstract
Given the pivotal roles of metabolomics and microbiomics, numerous data mining approaches aim to uncover their intricate connections. However, the complex many-to-many associations between metabolome-microbiome profiles yield numerous statistically significant but biologically unvalidated candidates. To address these challenges, we introduce BiOFI, a strategic framework for identifying metabolome-microbiome correlation pairs (Bi-Omics). BiOFI employs a comprehensive scoring system, incorporating intergroup differences, effects on feature correlation networks, and organism abundance. Meanwhile, it establishes a built-in database of metabolite-microbe-KEGG functional pathway linking relationships. Furthermore, BiOFI can rank related feature pairs by combining importance scores and correlation strength. Validation on a dataset of cesarean-section infants confirms the strategy's validity and interpretability. The BiOFI R package is freely accessible at https://github.com/chentianlu/BiOFI.
Collapse
Affiliation(s)
- Tao Sun
- Center for Translational Medicine, The Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Dongnan Sun
- Center for Translational Medicine, The Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Junliang Kuang
- Center for Translational Medicine, The Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Xiaowen Chao
- Center for Translational Medicine, The Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yihan Guo
- School of Medicine, Faculty of Medicine, The University of Queensland, Brisbane, Australia
| | - Mengci Li
- Center for Translational Medicine, The Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Tianlu Chen
- Center for Translational Medicine, The Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| |
Collapse
|
11
|
Stokely AM, Votapka LW, Hock MT, Teitgen AE, McCammon JA, McCulloch AD, Amaro RE. NetSci: A Library for High Performance Biomolecular Simulation Network Analysis Computation. J Chem Inf Model 2024; 64:7966-7976. [PMID: 39364881 DOI: 10.1021/acs.jcim.4c00899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2024]
Abstract
We present the NetSci program-an open-source scientific software package designed for estimating mutual information (MI) between data sets using GPU acceleration and a k-nearest-neighbor algorithm. This approach significantly enhances calculation speed, achieving improvements of several orders of magnitude over traditional CPU-based methods, with data set size limits dictated only by available hardware. To validate NetSci, we accurately compute MI for an analytically verifiable two-dimensional Gaussian distribution and replicate the generalized correlation (GC) analysis previously conducted on the B1 domain of protein G. We also apply NetSci to molecular dynamics simulations of the Sarcoendoplasmic Reticulum Calcium-ATPase (SERCA) pump, exploring the allosteric mechanisms and pathways influenced by ATP and 2'-deoxy-ATP (dATP) binding. Our analysis reveals distinct allosteric effects induced by ATP compared to dATP, with predicted information pathways from the bound nucleotide to the calcium-binding domain differing based on the nucleotide involved. NetSci proves to be a valuable tool for estimating MI and GC in various data sets and is particularly effective for analyzing intraprotein communication and information transfer.
Collapse
Affiliation(s)
- Andrew M Stokely
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
- Mesoscale and Microscale Meteorology Laboratory, National Center for Atmospheric Research, Boulder, Colorado 80307, United States
| | - Lane W Votapka
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Marcus T Hock
- Department of Bioengineering, University of California San Diego, La Jolla, California 92093, United States
| | - Abigail E Teitgen
- Department of Bioengineering, University of California San Diego, La Jolla, California 92093, United States
| | - J Andrew McCammon
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
- Department of Pharmacology, University of California San Diego, La Jolla, California 92093, United States
| | - Andrew D McCulloch
- Department of Bioengineering, University of California San Diego, La Jolla, California 92093, United States
| | - Rommie E Amaro
- Department of Molecular Biology, University of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|
12
|
Poudel SP, Behura SK. Relevance of the regulation of the brain-placental axis to the nocturnal bottleneck of mammals. Placenta 2024; 155:11-21. [PMID: 39121583 DOI: 10.1016/j.placenta.2024.08.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Revised: 08/02/2024] [Accepted: 08/03/2024] [Indexed: 08/12/2024]
Abstract
INTRODUCTION Evolutionary theory suggests that the ancestors of all placental animals were nocturnal. Visual perceptive function of mammalian brain has evolved extensively, but nearly 70 % of today's mammals are still nocturnal. While placental influence on brain development is known, if placenta plays a role in the visual perceptive function of mammalian brain remains untested. The present study aims to test this hypothesis. METHODS In this study, single-nuclei RNA sequencing was performed to identify genes expressed in the pig placenta and fetal brain, and then compared with the orthologous genes expressed in the placenta and fetal brain cells of mouse. Differential gene expression analysis was performed to identify placental genes regulated differentially between nocturnal and diurnal animals. Phylogenetic modeling was performed to test correlated evolution between placenta type, and the nocturnal or diurnal activity among different mammals. RESULTS The results showed that genes differentially regulated in the fetal brain were related to visual perception whereas the placental genes were related to the nocturnal or diurnal activity in placental animals. Phylogenetic modeling of these genes in thirty-four diverse mammalian species showed evidence for evolutionary link between placenta and the nocturnal/diurnal activity in animals. DISCUSSION The findings of this study suggest that the placenta plays a role in the evolution of visual perceptive function of brain to shape the nocturnal or diurnal activity of placental animals.
Collapse
Affiliation(s)
- Shankar P Poudel
- Division of Animal Sciences, University of Missouri, 920 East Campus Drive, Columbia, MO, 65211, USA
| | - Susanta K Behura
- Division of Animal Sciences, University of Missouri, 920 East Campus Drive, Columbia, MO, 65211, USA; MU Institute for Data Science and Informatics, University of Missouri, 920 East Campus Drive, Columbia, MO, 65211, USA; Interdisciplinary Reproduction and Health Group, University of Missouri, 920 East Campus Drive, Columbia, MO, 65211, USA; Interdisciplinary Neuroscience Program, University of Missouri, 920 East Campus Drive, Columbia, MO, 65211, USA.
| |
Collapse
|
13
|
Chang LY, Hao TY, Wang WJ, Lin CY. Inference of single-cell network using mutual information for scRNA-seq data analysis. BMC Bioinformatics 2024; 25:292. [PMID: 39237886 PMCID: PMC11378379 DOI: 10.1186/s12859-024-05895-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Accepted: 08/08/2024] [Indexed: 09/07/2024] Open
Abstract
BACKGROUND With the advance in single-cell RNA sequencing (scRNA-seq) technology, deriving inherent biological system information from expression profiles at a single-cell resolution has become possible. It has been known that network modeling by estimating the associations between genes could better reveal dynamic changes in biological systems. However, accurately constructing a single-cell network (SCN) to capture the network architecture of each cell and further explore cell-to-cell heterogeneity remains challenging. RESULTS We introduce SINUM, a method for constructing the SIngle-cell Network Using Mutual information, which estimates mutual information between any two genes from scRNA-seq data to determine whether they are dependent or independent in a specific cell. Experiments on various scRNA-seq datasets with different cell numbers based on eight performance indexes (e.g., adjusted rand index and F-measure index) validated the accuracy and robustness of SINUM in cell type identification, superior to the state-of-the-art SCN inference method. Additionally, the SINUM SCNs exhibit high overlap with the human interactome and possess the scale-free property. CONCLUSIONS SINUM presents a view of biological systems at the network level to detect cell-type marker genes/gene pairs and investigate time-dependent changes in gene associations during embryo development. Codes for SINUM are freely available at https://github.com/SysMednet/SINUM .
Collapse
Affiliation(s)
- Lan-Yun Chang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan
| | - Ting-Yi Hao
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan
| | - Wei-Jie Wang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan
| | - Chun-Yu Lin
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Institute of Data Science and Engineering, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Center for Intelligent Drug Systems and Smart Bio-Devices, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Cancer and Immunology Research Center, National Yang Ming Chiao Tung University, Taipei, 112, Taiwan.
- School of Dentistry, Kaohsiung Medical University, Kaohsiung, 807, Taiwan.
| |
Collapse
|
14
|
Karch JD, Perez-Alonso AF, Bergsma WP. Beyond Pearson's Correlation: Modern Nonparametric Independence Tests for Psychological Research. MULTIVARIATE BEHAVIORAL RESEARCH 2024; 59:957-977. [PMID: 39097830 DOI: 10.1080/00273171.2024.2347960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/05/2024]
Abstract
When examining whether two continuous variables are associated, tests based on Pearson's, Kendall's, and Spearman's correlation coefficients are typically used. This paper explores modern nonparametric independence tests as an alternative, which, unlike traditional tests, have the ability to potentially detect any type of relationship. In addition to existing modern nonparametric independence tests, we developed and considered two novel variants of existing tests, most notably the Heller-Heller-Gorfine-Pearson (HHG-Pearson) test. We conducted a simulation study to compare traditional independence tests, such as Pearson's correlation, and the modern nonparametric independence tests in situations commonly encountered in psychological research. As expected, no test had the highest power across all relationships. However, the distance correlation and the HHG-Pearson tests were found to have substantially greater power than all traditional tests for many relationships and only slightly less power in the worst case. A similar pattern was found in favor of the HHG-Pearson test compared to the distance correlation test. However, given that distance correlation performed better for linear relationships and is more widely accepted, we suggest considering its use in place or additional to traditional methods when there is no prior knowledge of the relationship type, as is often the case in psychological research.
Collapse
Affiliation(s)
- Julian D Karch
- Methodology and Statistics Department, Institute of Psychology, Leiden University, Leiden, the Netherlands
| | - Andres F Perez-Alonso
- Department of Methodology and Statistics, Tilburg School of Social and Behavioral Sciences Tilburg University, Tilburg, the Netherlands
| | - Wicher P Bergsma
- Department of Statistics, London School of Economics and Political Science, London, United Kingdom
| |
Collapse
|
15
|
McCain JSP, Britten GL, Hackett SR, Follows MJ, Li GW. Microbial reaction rate estimation using proteins and proteomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.13.607198. [PMID: 39185172 PMCID: PMC11343155 DOI: 10.1101/2024.08.13.607198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Microbes transform their environments using diverse enzymatic reactions. However, it remains challenging to measure microbial reaction rates in natural environments. Despite advances in global quantification of enzyme abundances, the individual relationships between enzyme abundances and their reaction rates have not been systematically examined. Using matched proteomic and reaction rate data from microbial cultures, we show that enzyme abundance is often insufficient to predict its corresponding reaction rate. However, we discovered that global proteomic measurements can be used to make accurate rate predictions of individual reaction rates (median R 2 = 0.78). Accurate rate predictions required only a small number of proteins and they did not need explicit prior mechanistic knowledge or environmental context. These results indicate that proteomes are encoders of cellular reaction rates, potentially enabling proteomic measurements in situ to estimate the rates of microbially mediated reactions in natural systems.
Collapse
Affiliation(s)
- J. Scott P. McCain
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Gregory L. Britten
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA, USA
| | | | - Michael J. Follows
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Gene-Wei Li
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
16
|
Francis D, Sun F. A comparative analysis of mutual information methods for pairwise relationship detection in metagenomic data. BMC Bioinformatics 2024; 25:266. [PMID: 39143554 PMCID: PMC11323399 DOI: 10.1186/s12859-024-05883-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 07/29/2024] [Indexed: 08/16/2024] Open
Abstract
BACKGROUND Construction of co-occurrence networks in metagenomic data often employs correlation to infer pairwise relationships between microbes. However, biological systems are complex and often display qualities non-linear in nature. Therefore, the reliance on correlation alone may overlook important relationships and fail to capture the full breadth of intricacies presented in underlying interaction networks. It is of interest to incorporate metrics that are not only robust in detecting linear relationships, but non-linear ones as well. RESULTS In this paper, we explore the use of various mutual information (MI) estimation approaches for quantifying pairwise relationships in biological data and compare their performances against two traditional measures-Pearson's correlation coefficient, r, and Spearman's rank correlation coefficient, ρ. Metrics are tested on both simulated data designed to mimic pairwise relationships that may be found in ecological systems and real data from a previous study on C. diff infection. The results demonstrate that, in the case of asymmetric relationships, mutual information estimators can provide better detection ability than Pearson's or Spearman's correlation coefficients. Specifically, we find that these estimators have elevated performances in the detection of exploitative relationships, demonstrating the potential benefit of including them in future metagenomic studies. CONCLUSIONS Mutual information (MI) can uncover complex pairwise relationships in biological data that may be missed by traditional measures of association. The inclusion of such relationships when constructing co-occurrence networks can result in a more comprehensive analysis than the use of correlation alone.
Collapse
Affiliation(s)
- Dallace Francis
- Quantitative and Computational Biology Department, University of Southern California, Los Angeles, CA, 90089, USA.
| | - Fengzhu Sun
- Quantitative and Computational Biology Department, University of Southern California, Los Angeles, CA, 90089, USA
| |
Collapse
|
17
|
Gorshkov O, Ombao H. Assessment of Fractal Synchronization during an Epileptic Seizure. ENTROPY (BASEL, SWITZERLAND) 2024; 26:666. [PMID: 39202136 PMCID: PMC11353581 DOI: 10.3390/e26080666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 07/31/2024] [Accepted: 08/01/2024] [Indexed: 09/03/2024]
Abstract
In this paper, we define fractal synchronization (FS) based on the idea of stochastic synchronization and propose a mathematical apparatus for estimating FS. One major advantage of our proposed approach is that fractal synchronization makes it possible to estimate the aggregate strength of the connection on multiple time scales between two projections of the attractor, which are time series with a fractal structure. We believe that one of the promising uses of FS is the assessment of the interdependence of encephalograms. To demonstrate this approach in evaluating the cross-dependence between channels in a network of electroencephalograms, we evaluated the FS of encephalograms during an epileptic seizure. Fractal synchronization demonstrates the presence of desynchronization during an epileptic seizure.
Collapse
Affiliation(s)
- Oleg Gorshkov
- Statistics Program, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia;
| | | |
Collapse
|
18
|
Gómez-Pascual A, Rocamora-Pérez G, Ibanez L, Botía JA. Targeted co-expression networks for the study of traits. Sci Rep 2024; 14:16675. [PMID: 39030261 PMCID: PMC11271532 DOI: 10.1038/s41598-024-67329-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 07/10/2024] [Indexed: 07/21/2024] Open
Abstract
Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used approach for the generation of gene co-expression networks. However, networks generated with this tool usually create large modules with a large set of functional annotations hard to decipher. We have developed TGCN, a new method to create Targeted Gene Co-expression Networks. This method identifies the transcripts that best predict the trait of interest based on gene expression using a refinement of the LASSO regression. Then, it builds the co-expression modules around those transcripts. Algorithm properties were characterized using the expression of 13 brain regions from the Genotype-Tissue Expression project. When comparing our method with WGCNA, TGCN networks lead to more precise modules that have more specific and yet rich biological meaning. Then, we illustrate its applicability by creating an APP-TGCN on The Religious Orders Study and Memory and Aging Project dataset, aiming to identify the molecular pathways specifically associated with APP role in Alzheimer's disease. Main biological findings were further validated in two independent cohorts. In conclusion, we provide a new framework that serves to create targeted networks that are smaller, biologically relevant and useful in high throughput hypothesis driven research. The TGCN R package is available on Github: https://github.com/aliciagp/TGCN .
Collapse
Affiliation(s)
- A Gómez-Pascual
- Communications Engineering and Information Department, University of Murcia, 30100, Murcia, Spain
| | - G Rocamora-Pérez
- Department of Genetics and Genomic Medicine Research and Teaching, UCL GOS Institute of Child Health, London, WC1N 1EH, UK
| | - L Ibanez
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO, 63110, USA
- Department of Neurology, Washington University School of Medicine, Saint Louis, MO, 63110, USA
| | - J A Botía
- Communications Engineering and Information Department, University of Murcia, 30100, Murcia, Spain.
| |
Collapse
|
19
|
Roy S, Sheikh SZ, Furey TS. CoVar: A generalizable machine learning approach to identify the coordinated regulators driving variational gene expression. PLoS Comput Biol 2024; 20:e1012016. [PMID: 38630807 PMCID: PMC11057768 DOI: 10.1371/journal.pcbi.1012016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 04/29/2024] [Accepted: 03/22/2024] [Indexed: 04/19/2024] Open
Abstract
Network inference is used to model transcriptional, signaling, and metabolic interactions among genes, proteins, and metabolites that identify biological pathways influencing disease pathogenesis. Advances in machine learning (ML)-based inference models exhibit the predictive capabilities of capturing latent patterns in genomic data. Such models are emerging as an alternative to the statistical models identifying causative factors driving complex diseases. We present CoVar, an ML-based framework that builds upon the properties of existing inference models, to find the central genes driving perturbed gene expression across biological states. Unlike differentially expressed genes (DEGs) that capture changes in individual gene expression across conditions, CoVar focuses on identifying variational genes that undergo changes in their expression network interaction profiles, providing insights into changes in the regulatory dynamics, such as in disease pathogenesis. Subsequently, it finds core genes from among the nearest neighbors of these variational genes, which are central to the variational activity and influence the coordinated regulatory processes underlying the observed changes in gene expression. Through the analysis of simulated as well as yeast expression data perturbed by the deletion of the mitochondrial genome, we show that CoVar captures the intrinsic variationality and modularity in the expression data, identifying key driver genes not found through existing differential analysis methodologies.
Collapse
Affiliation(s)
- Satyaki Roy
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Shehzad Z. Sheikh
- Departments of Medicine and Genetics, Center for Gastrointestinal Biology and Disease, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Terrence S. Furey
- Departments of Genetics and Biology, Center for Gastrointestinal Biology and Disease, University of North Carolina, Chapel Hill, North Carolina, United States of America
| |
Collapse
|
20
|
Tian J, Lei J, Roeder K. From local to global gene co-expression estimation using single-cell RNA-seq data. Biometrics 2024; 80:ujae001. [PMID: 38465983 PMCID: PMC10926266 DOI: 10.1093/biomtc/ujae001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 10/01/2023] [Accepted: 01/15/2024] [Indexed: 03/12/2024]
Abstract
In genomics studies, the investigation of gene relationships often brings important biological insights. Currently, the large heterogeneous datasets impose new challenges for statisticians because gene relationships are often local. They change from one sample point to another, may only exist in a subset of the sample, and can be nonlinear or even nonmonotone. Most previous dependence measures do not specifically target local dependence relationships, and the ones that do are computationally costly. In this paper, we explore a state-of-the-art network estimation technique that characterizes gene relationships at the single cell level, under the name of cell-specific gene networks. We first show that averaging the cell-specific gene relationship over a population gives a novel univariate dependence measure, the averaged Local Density Gap (aLDG), that accumulates local dependence and can detect any nonlinear, nonmonotone relationship. Together with a consistent nonparametric estimator, we establish its robustness on both the population and empirical levels. Then, we show that averaging the cell-specific gene relationship over mini-batches determined by some external structure information (eg, spatial or temporal factor) better highlights meaningful local structure change points. We explore the application of aLDG and its minibatch variant in many scenarios, including pairwise gene relationship estimation, bifurcating point detection in cell trajectory, and spatial transcriptomics structure visualization. Both simulations and real data analysis show that aLDG outperforms existing ones.
Collapse
Affiliation(s)
- Jinjin Tian
- Department of Statistics and Data Science, Carnegie Mellon University, 15213, Pittsburgh, PA, United States
| | - Jing Lei
- Department of Statistics and Data Science, Carnegie Mellon University, 15213, Pittsburgh, PA, United States
| | - Kathryn Roeder
- Department of Statistics and Data Science, Carnegie Mellon University, 15213, Pittsburgh, PA, United States
| |
Collapse
|
21
|
Chaturvedi A, Som A. Inference of Dynamic Growth Regulatory Network in Cancer Using High-Throughput Transcriptomic Data. Methods Mol Biol 2024; 2719:51-77. [PMID: 37803112 DOI: 10.1007/978-1-0716-3461-5_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/08/2023]
Abstract
Growth is regulated by gene expression variation at different developmental stages of biological processes such as cell differentiation, disease progression, or drug response. In cancer, a stage-specific regulatory model constructed to infer the dynamic expression changes in genes contributing to tissue growth or proliferation is referred as a dynamic growth regulatory network (dGRN). Over the past decade, gene expression data has been widely used for reconstructing dGRN by computing correlations between the differentially expressed genes (DEGs). A wide variety of pipelines are available to construct the GRNs using DEGs and the choice of a particular method or tool depends on the nature of the study. In this protocol, we have outlined a step-by-step guide for the analysis of DEGs using RNA-Seq data, beginning from data acquisition, pre-processing, mapping to reference genome, and construction of a correlation-based co-expression network to further downstream analysis. We have also outlined the steps for the inclusion of publicly available interaction/regulation information into the dGRN followed by relevant topological inferences. This tutorial has been designed in a way that early researchers can refer to for an easy and comprehensive glimpse of methodologies used in the inference of dGRN using transcriptomics data.
Collapse
Affiliation(s)
- Aparna Chaturvedi
- Centre of Bioinformatics, Institute of Interdisciplinary Studies, University of Allahabad, Prayagraj, India
| | - Anup Som
- Centre of Bioinformatics, Institute of Interdisciplinary Studies, University of Allahabad, Prayagraj, India
| |
Collapse
|
22
|
Strawn M, Safranski TJ, Behura SK. Does DNA methylation in the fetal brain leave an epigenetic memory in the blood? Gene 2023; 887:147788. [PMID: 37696423 DOI: 10.1016/j.gene.2023.147788] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 08/23/2023] [Accepted: 09/07/2023] [Indexed: 09/13/2023]
Abstract
Epigenetic memory is an emerging concept that refers to the process in which epigenetic changes occurring early-in life can lead to long-term programs of gene regulation in time and space. By leveraging neural network regression modeling of DNA methylation data in pigs, we show that specific methylations in the adult blood can reliably predict methylation changes that occurred in the fetal brain. Genes associated with these methylations represented known markers of specific cell types of blood including bone marrow hematopoietic progenitor cells, and ependymal and oligodendrocyte cells of brain. This suggested that methylation changes that occurred in the developing brain were maintained as an epigenetic memory in the blood through the adult life.
Collapse
Affiliation(s)
- Monica Strawn
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, United States
| | - Timothy J Safranski
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, United States
| | - Susanta K Behura
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, United States; MU Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, United States; Interdisciplinary Neuroscience Program, University of Missouri, Columbia, MO 65211, United States.
| |
Collapse
|
23
|
Hicks AR, Reynolds RH, O’Callaghan B, García-Ruiz S, Gil-Martínez AL, Botía J, Plun-Favreau H, Ryten M. The non-specific lethal complex regulates genes and pathways genetically linked to Parkinson's disease. Brain 2023; 146:4974-4987. [PMID: 37522749 PMCID: PMC10689904 DOI: 10.1093/brain/awad246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 05/12/2023] [Accepted: 06/23/2023] [Indexed: 08/01/2023] Open
Abstract
Genetic variants conferring risks for Parkinson's disease have been highlighted through genome-wide association studies, yet exploration of their specific disease mechanisms is lacking. Two Parkinson's disease candidate genes, KAT8 and KANSL1, identified through genome-wide studies and a PINK1-mitophagy screen, encode part of the histone acetylating non-specific lethal complex. This complex localizes to the nucleus, where it plays a role in transcriptional activation, and to mitochondria, where it has been suggested to have a role in mitochondrial transcription. In this study, we sought to identify whether the non-specific lethal complex has potential regulatory relationships with other genes associated with Parkinson's disease in human brain. Correlation in the expression of non-specific lethal genes and Parkinson's disease-associated genes was investigated in primary gene co-expression networks using publicly-available transcriptomic data from multiple brain regions (provided by the Genotype-Tissue Expression Consortium and UK Brain Expression Consortium), whilst secondary networks were used to examine cell type specificity. Reverse engineering of gene regulatory networks generated regulons of the complex, which were tested for heritability using stratified linkage disequilibrium score regression. Prioritized gene targets were then validated in vitro using a QuantiGene multiplex assay and publicly-available chromatin immunoprecipitation-sequencing data. Significant clustering of non-specific lethal genes was revealed alongside Parkinson's disease-associated genes in frontal cortex primary co-expression modules, amongst other brain regions. Both primary and secondary co-expression modules containing these genes were enriched for mainly neuronal cell types. Regulons of the complex contained Parkinson's disease-associated genes and were enriched for biological pathways genetically linked to disease. When examined in a neuroblastoma cell line, 41% of prioritized gene targets showed significant changes in mRNA expression following KANSL1 or KAT8 perturbation. KANSL1 and H4K8 chromatin immunoprecipitation-sequencing data demonstrated non-specific lethal complex activity at many of these genes. In conclusion, genes encoding the non-specific lethal complex are highly correlated with and regulate genes associated with Parkinson's disease. Overall, these findings reveal a potentially wider role for this protein complex in regulating genes and pathways implicated in Parkinson's disease.
Collapse
Affiliation(s)
- Amy R Hicks
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
| | - Regina H Reynolds
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, Bloomsbury, London WC1N 1EH, UK
| | - Benjamin O’Callaghan
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
| | - Sonia García-Ruiz
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, Bloomsbury, London WC1N 1EH, UK
| | - Ana Luisa Gil-Martínez
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, Bloomsbury, London WC1N 1EH, UK
- Department of Information and Communication Engineering, University of Murcia, Murcia 30100, Spain
| | - Juan Botía
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Department of Information and Communication Engineering, University of Murcia, Murcia 30100, Spain
| | - Hélène Plun-Favreau
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
| | - Mina Ryten
- Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, Bloomsbury, London WC1N 1EH, UK
- NIHR GOSH Biomedical Research Centre, Great Ormond Street Institute of Child Health, Bloomsbury, London WC1N 1EH, UK
| |
Collapse
|
24
|
Melendez-Pastor I, Lopez-Granado OM, Navarro-Pedreño J, Hernández EI, Jordán Vidal MM, Gómez Lucas I. Environmental factors influencing DDT-DDE spatial distribution in an agricultural drainage system determined by using machine learning techniques. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2023; 45:9067-9085. [PMID: 36750542 PMCID: PMC10673731 DOI: 10.1007/s10653-023-01486-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 01/16/2023] [Indexed: 06/18/2023]
Abstract
The presence and persistence of pesticides in the environment are environmental problems of great concern due to the health implications for humans and wildlife. The persistence of DDT-DDE in a Mediterranean coastal plain where pesticides were widely used and were banned decades ago is the aim of this study. Different sources of analytical information from water and soil analysis and topography and geographical variables were combined with the purpose of analyzing which environmental factors are more likely to condition the spatial distribution of DDT-DDE in the drainage watercourses of the area. An approach combining machine learning techniques, such as Random Forest and Mutual Information (MI), for classifying DDT-DDE concentration levels based on other environmental predictive variables was applied. In addition, classification procedure was iteratively performed with different training/validation partitions in order to extract the most informative parameters denoted by the highest MI scores and larger accuracy assessment metrics. Distance to drain canals, soil electrical conductivity, and soil sand texture fraction were the most informative environmental variables for predicting DDT-DDE water concentration clusters.
Collapse
Affiliation(s)
- Ignacio Melendez-Pastor
- Department of Agrochemistry and Environment, Miguel Hernández University of Elche, Av. Universidad s/n, Edificio Alcudia, 03202, Elche, Alicante, Spain.
| | - Otoniel M Lopez-Granado
- Department of Computers Engineering, Miguel Hernández University of Elche, Av. Universidad s/n, Edificio Alcudia, 03202, Elche, Alicante, Spain
| | - Jose Navarro-Pedreño
- Department of Agrochemistry and Environment, Miguel Hernández University of Elche, Av. Universidad s/n, Edificio Alcudia, 03202, Elche, Alicante, Spain
| | - Encarni I Hernández
- Department of Agrochemistry and Environment, Miguel Hernández University of Elche, Av. Universidad s/n, Edificio Alcudia, 03202, Elche, Alicante, Spain
| | - Manuel M Jordán Vidal
- Department of Agrochemistry and Environment, Miguel Hernández University of Elche, Av. Universidad s/n, Edificio Alcudia, 03202, Elche, Alicante, Spain
| | - Ignacio Gómez Lucas
- Department of Agrochemistry and Environment, Miguel Hernández University of Elche, Av. Universidad s/n, Edificio Alcudia, 03202, Elche, Alicante, Spain
| |
Collapse
|
25
|
Islam M, Behura SK. Role of caveolin-1 in metabolic programming of fetal brain. iScience 2023; 26:107710. [PMID: 37720105 PMCID: PMC10500482 DOI: 10.1016/j.isci.2023.107710] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 05/10/2023] [Accepted: 08/23/2023] [Indexed: 09/19/2023] Open
Abstract
Mice lacking caveolin-1 (Cav1), a key protein of plasma membrane, exhibit brain aging at an early adult stage. Here, integrative analyses of metabolomics, transcriptomics, epigenetics, and single-cell data were performed to test the hypothesis that metabolic deregulation of fetal brain due to the ablation of Cav1 is linked to brain aging in these mice. The results of this study show that lack of Cav1 caused deregulation in the lipid and amino acid metabolism in the fetal brain, and genes associated with these deregulated metabolites were significantly altered in the brain upon aging. Moreover, ablation of Cav1 deregulated several metabolic genes in specific cell types of the fetal brain and impacted DNA methylation of those genes in coordination with mouse epigenetic clock. The findings of this study suggest that the aging program of brain is confounded by metabolic abnormalities in the fetal stage due to the absence of Cav1.
Collapse
Affiliation(s)
- Maliha Islam
- Division of Animal Sciences, 920 East Campus Drive, University of Missouri, Columbia, MO 65211, USA
| | - Susanta K. Behura
- Division of Animal Sciences, 920 East Campus Drive, University of Missouri, Columbia, MO 65211, USA
- MU Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
- Interdisciplinary Reproduction and Health Group, University of Missouri, Columbia, MO, USA
- Interdisciplinary Neuroscience Program, University of Missouri, Columbia, MO, USA
| |
Collapse
|
26
|
Chen X, Xu G, He B, Zhang S, Su Z, Jia Y, Zhang X, Zhao Z. Capturing synchronization with complexity measure of ordinal pattern transition network constructed by crossplot. ROYAL SOCIETY OPEN SCIENCE 2023; 10:221067. [PMID: 37388315 PMCID: PMC10300663 DOI: 10.1098/rsos.221067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 06/06/2023] [Indexed: 07/01/2023]
Abstract
To evaluate the synchronization of bivariate time series has been a hot topic, and a number of measures have been proposed. In this work, by introducing the ordinal pattern transition network into the crossplot, a new method for measuring the synchronization of bivariate time series is proposed. After the crossplot been partitioned and coded, the coded partitions are defined as network nodes and a directed weighted network is constructed based on the temporal adjacency of the nodes. The crossplot transition entropy of the network is proposed as an indicator of the synchronization between two time series. To test the characteristics and performance of the method, it is used to analyse the unidirectional coupled Lorentz model and compared it with existing methods. The results showed the new method had the advantages of easy parameter setting, efficiency, robustness, good consistency and suitability for short time series. Finally, electroencephalogram (EEG) data from auditory-evoked potential EEG-biometric dataset are investigated, and some useful and interesting results are obtained.
Collapse
Affiliation(s)
- Xiaobi Chen
- School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
| | - Guanghua Xu
- School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
- State Key Laboratory for Manufacturing Systems Engineering, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
- The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, People's Republic of China
| | - Bo He
- School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
| | - Sicong Zhang
- School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
| | - Zijvn Su
- School of Materials, Sun Yat-sen University, Shenzhen 518107, People's Republic of China
| | - Yaguang Jia
- School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
| | - Xun Zhang
- School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
| | - Zhe Zhao
- School of Microelectronics, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
| |
Collapse
|
27
|
Sun H, Lin F, Wu X, Zhang T, Li J. Normalized mutual information of fNIRS signals as a measure for accessing typical and atypical brain activity. JOURNAL OF BIOPHOTONICS 2023; 16:e202200369. [PMID: 36808258 DOI: 10.1002/jbio.202200369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 01/24/2023] [Accepted: 02/10/2023] [Indexed: 06/07/2023]
Abstract
Normalized mutual information (NMI) can be used to detect statistical correlations between time series. We showed possibility of using NMI to quantify synchronicity of information transmission in different brain regions, thus to characterize functional connections, and ultimately analyze differences in physiological states of brain. Resting-state brain signals were recorded from bilateral temporal lobes by functional near-infrared spectroscopy (fNIRS) in 19 young healthy (YH) adults, 25 children with autism spectrum disorder (ASD), and 22 children with typical development (TD). Using NMI of the fNIRS signals, common information volume was assessed for each of three groups. Results showed that mutual information of children with ASD was significantly smaller than that of TD children, while mutual information of YH adults was slightly larger than that of TD children. This study may suggest that NMI could be a measure for assessing brain activity with different development states.
Collapse
Affiliation(s)
- Huiwen Sun
- South China Academy of Advanced Optoelectronics, South China Normal University, Guangzhou, China
| | - Fang Lin
- South China Academy of Advanced Optoelectronics, South China Normal University, Guangzhou, China
| | - Xiaoyin Wu
- South China Academy of Advanced Optoelectronics, South China Normal University, Guangzhou, China
| | - Tingzhen Zhang
- South China Academy of Advanced Optoelectronics, South China Normal University, Guangzhou, China
| | - Jun Li
- South China Academy of Advanced Optoelectronics, South China Normal University, Guangzhou, China
- Key Lab for Behavioral Economic Science & Technology, South China Normal University, Guangzhou, China
| |
Collapse
|
28
|
Jefferson RE, Oggier A, Füglistaler A, Camviel N, Hijazi M, Villarreal AR, Arber C, Barth P. Computational design of dynamic receptor-peptide signaling complexes applied to chemotaxis. Nat Commun 2023; 14:2875. [PMID: 37208363 DOI: 10.1038/s41467-023-38491-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 05/04/2023] [Indexed: 05/21/2023] Open
Abstract
Engineering protein biosensors that sensitively respond to specific biomolecules by triggering precise cellular responses is a major goal of diagnostics and synthetic cell biology. Previous biosensor designs have largely relied on binding structurally well-defined molecules. In contrast, approaches that couple the sensing of flexible compounds to intended cellular responses would greatly expand potential biosensor applications. Here, to address these challenges, we develop a computational strategy for designing signaling complexes between conformationally dynamic proteins and peptides. To demonstrate the power of the approach, we create ultrasensitive chemotactic receptor-peptide pairs capable of eliciting potent signaling responses and strong chemotaxis in primary human T cells. Unlike traditional approaches that engineer static binding complexes, our dynamic structure design strategy optimizes contacts with multiple binding and allosteric sites accessible through dynamic conformational ensembles to achieve strongly enhanced signaling efficacy and potency. Our study suggests that a conformationally adaptable binding interface coupled to a robust allosteric transmission region is a key evolutionary determinant of peptidergic GPCR signaling systems. The approach lays a foundation for designing peptide-sensing receptors and signaling peptide ligands for basic and therapeutic applications.
Collapse
Affiliation(s)
- Robert E Jefferson
- Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland
| | - Aurélien Oggier
- Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland
| | - Andreas Füglistaler
- Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland
| | - Nicolas Camviel
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland
- Department of Oncology UNIL-CHUV, University Hospital Lausanne (CHUV), University of Lausanne (UNIL), Lausanne, Switzerland
| | - Mahdi Hijazi
- Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland
| | - Ana Rico Villarreal
- Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland
| | - Caroline Arber
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland
- Department of Oncology UNIL-CHUV, University Hospital Lausanne (CHUV), University of Lausanne (UNIL), Lausanne, Switzerland
| | - Patrick Barth
- Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland.
- Ludwig Institute for Cancer Research Lausanne, Lausanne, Switzerland.
| |
Collapse
|
29
|
Shachaf LI, Roberts E, Cahan P, Xiao J. Gene regulation network inference using k-nearest neighbor-based mutual information estimation: revisiting an old DREAM. BMC Bioinformatics 2023; 24:84. [PMID: 36879188 PMCID: PMC9990267 DOI: 10.1186/s12859-022-05047-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 11/08/2022] [Indexed: 03/08/2023] Open
Abstract
BACKGROUND A cell exhibits a variety of responses to internal and external cues. These responses are possible, in part, due to the presence of an elaborate gene regulatory network (GRN) in every single cell. In the past 20 years, many groups worked on reconstructing the topological structure of GRNs from large-scale gene expression data using a variety of inference algorithms. Insights gained about participating players in GRNs may ultimately lead to therapeutic benefits. Mutual information (MI) is a widely used metric within this inference/reconstruction pipeline as it can detect any correlation (linear and non-linear) between any number of variables (n-dimensions). However, the use of MI with continuous data (for example, normalized fluorescence intensity measurement of gene expression levels) is sensitive to data size, correlation strength and underlying distributions, and often requires laborious and, at times, ad hoc optimization. RESULTS In this work, we first show that estimating MI of a bi- and tri-variate Gaussian distribution using k-nearest neighbor (kNN) MI estimation results in significant error reduction as compared to commonly used methods based on fixed binning. Second, we demonstrate that implementing the MI-based kNN Kraskov-Stoögbauer-Grassberger (KSG) algorithm leads to a significant improvement in GRN reconstruction for popular inference algorithms, such as Context Likelihood of Relatedness (CLR). Finally, through extensive in-silico benchmarking we show that a new inference algorithm CMIA (Conditional Mutual Information Augmentation), inspired by CLR, in combination with the KSG-MI estimator, outperforms commonly used methods. CONCLUSIONS Using three canonical datasets containing 15 synthetic networks, the newly developed method for GRN reconstruction-which combines CMIA, and the KSG-MI estimator-achieves an improvement of 20-35% in precision-recall measures over the current gold standard in the field. This new method will enable researchers to discover new gene interactions or better choose gene candidates for experimental validations.
Collapse
Affiliation(s)
- Lior I Shachaf
- Department of Biophysics, Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD, 21218, USA.
| | - Elijah Roberts
- Department of Biophysics, Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD, 21218, USA
- 10x Genomics, 6230 Stoneridge Mall Road, Pleasanton, CA, 94588-3260, USA
| | - Patrick Cahan
- Department of Biomedical Engineering, Department of Molecular Biology and Genetics, Institute for Cell Engineering, Johns Hopkins School of Medicine, 733 N. Broadway, Baltimore, MD, 21205, USA
| | - Jie Xiao
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins School of Medicine, 725 N. Wolfe Street, WBSB 708, Baltimore, MD, 21205, USA
| |
Collapse
|
30
|
Moeller S, Unakafov AM, Fischer J, Gail A, Treue S, Kagan I. Human and macaque pairs employ different coordination strategies in a transparent decision game. eLife 2023; 12:e81641. [PMID: 36633125 PMCID: PMC9937648 DOI: 10.7554/elife.81641] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 12/28/2022] [Indexed: 01/13/2023] Open
Abstract
Many real-world decisions in social contexts are made while observing a partner's actions. To study dynamic interactions during such decisions, we developed a setup where two agents seated face-to-face to engage in game-theoretical tasks on a shared transparent touchscreen display ('transparent games'). We compared human and macaque pairs in a transparent version of the coordination game 'Bach-or-Stravinsky', which entails a conflict about which of two individually-preferred opposing options to choose to achieve coordination. Most human pairs developed coordinated behavior and adopted dynamic turn-taking to equalize the payoffs. All macaque pairs converged on simpler, static coordination. Remarkably, two animals learned to coordinate dynamically after training with a human confederate. This pair selected the faster agent's preferred option, exhibiting turn-taking behavior that was captured by modeling the visibility of the partner's action before one's own movement. Such competitive turn-taking was unlike the prosocial turn-taking in humans, who equally often initiated switches to and from their preferred option. Thus, the dynamic coordination is not restricted to humans but can occur on the background of different social attitudes and cognitive capacities in rhesus monkeys. Overall, our results illustrate how action visibility promotes the emergence and maintenance of coordination when agents can observe and time their mutual actions.
Collapse
Affiliation(s)
- Sebastian Moeller
- Cognitive Neuroscience Laboratory, German Primate Center – Leibniz Institute for Primate ResearchGöttingenGermany
- Leibniz ScienceCampus Primate CognitionGöttingenGermany
| | - Anton M Unakafov
- Cognitive Neuroscience Laboratory, German Primate Center – Leibniz Institute for Primate ResearchGöttingenGermany
- Leibniz ScienceCampus Primate CognitionGöttingenGermany
- Georg-Elias-Müller-Institute of Psychology, University of GottingenGöttingenGermany
- Max Planck Institute for Dynamics and Self-OrganizationGöttingenGermany
- Campus Institute for Dynamics of Biological NetworksGottingenGermany
| | - Julia Fischer
- Leibniz ScienceCampus Primate CognitionGöttingenGermany
- Cognitive Ethology Laboratory, German Primate Center – Leibniz Institute for Primate ResearchGöttingenGermany
- Department of Primate Cognition, Johann-Friedrich-Blumenbach Institute for Zoology and Anthropology, University of GottingenGöttingenGermany
| | - Alexander Gail
- Cognitive Neuroscience Laboratory, German Primate Center – Leibniz Institute for Primate ResearchGöttingenGermany
- Leibniz ScienceCampus Primate CognitionGöttingenGermany
- Georg-Elias-Müller-Institute of Psychology, University of GottingenGöttingenGermany
- Bernstein Center for Computational NeuroscienceGöttingenGermany
| | - Stefan Treue
- Cognitive Neuroscience Laboratory, German Primate Center – Leibniz Institute for Primate ResearchGöttingenGermany
- Leibniz ScienceCampus Primate CognitionGöttingenGermany
- Georg-Elias-Müller-Institute of Psychology, University of GottingenGöttingenGermany
- Bernstein Center for Computational NeuroscienceGöttingenGermany
| | - Igor Kagan
- Cognitive Neuroscience Laboratory, German Primate Center – Leibniz Institute for Primate ResearchGöttingenGermany
- Leibniz ScienceCampus Primate CognitionGöttingenGermany
| |
Collapse
|
31
|
Roy S, Sheikh SZ, Furey TS. CoVar: A generalizable machine learning approach to identify the coordinated regulators driving variational gene expression. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.12.523808. [PMID: 36712050 PMCID: PMC9882103 DOI: 10.1101/2023.01.12.523808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Network inference is used to model transcriptional, signaling, and metabolic interactions among genes, proteins, and metabolites that identify biological pathways influencing disease pathogenesis. Advances in machine learning (ML)-based inference models exhibit the predictive capabilities of capturing latent patterns in genomic data. Such models are emerging as an alternative to the statistical models identifying causative factors driving complex diseases. We present CoVar, an inference framework that builds upon the properties of existing inference models, to find the central genes driving perturbed gene expression across biological states. We leverage ML-based network inference to find networks that capture the strength of regulatory interactions. Our model first pinpoints a subset of genes, termed variational, whose expression variabilities typify the differences in network connectivity between the control and perturbed data. Variational genes, by being differentially expressed themselves or possessing differentially expressed neighbor genes, capture gene expression variability. CoVar then creates subnetworks comprising variational genes and their strongly connected neighbor genes and identifies core genes central to these subnetworks that influence the bulk of the variational activity. Through the analysis of yeast expression data perturbed by the deletion of the mitochondrial genome, we show that CoVar identifies key genes not found through independent differential expression analysis.
Collapse
|
32
|
Sarkar A, Hossain SKS, Sarkar R. Human activity recognition from sensor data using spatial attention-aided CNN with genetic algorithm. Neural Comput Appl 2023; 35:5165-5191. [PMID: 36311167 PMCID: PMC9596348 DOI: 10.1007/s00521-022-07911-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Accepted: 09/29/2022] [Indexed: 12/01/2022]
Abstract
Capturing time and frequency relationships of time series signals offers an inherent barrier for automatic human activity recognition (HAR) from wearable sensor data. Extracting spatiotemporal context from the feature space of the sensor reading sequence is challenging for the current recurrent, convolutional, or hybrid activity recognition models. The overall classification accuracy also gets affected by large size feature maps that these models generate. To this end, in this work, we have put forth a hybrid architecture for wearable sensor data-based HAR. We initially use Continuous Wavelet Transform to encode the time series of sensor data as multi-channel images. Then, we utilize a Spatial Attention-aided Convolutional Neural Network (CNN) to extract higher-dimensional features. To find the most essential features for recognizing human activities, we develop a novel feature selection (FS) method. In order to identify the fitness of the features for the FS, we first employ three filter-based methods: Mutual Information (MI), Relief-F, and minimum redundancy maximum relevance (mRMR). The best set of features is then chosen by removing the lower-ranked features using a modified version of the Genetic Algorithm (GA). The K-Nearest Neighbors (KNN) classifier is then used to categorize human activities. We conduct comprehensive experiments on five well-known, publicly accessible HAR datasets, namely UCI-HAR, WISDM, MHEALTH, PAMAP2, and HHAR. Our model significantly outperforms the state-of-the-art models in terms of classification performance. We also observe an improvement in overall recognition accuracy with the use of GA-based FS technique with a lower number of features. The source code of the paper is publicly available here https://github.com/apusarkar2195/HAR_WaveletTransform_SpatialAttention_FeatureSelection.
Collapse
Affiliation(s)
- Apu Sarkar
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| | - S. K. Sabbir Hossain
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| | - Ram Sarkar
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| |
Collapse
|
33
|
Wu S, Luo H, Yin S, Li K, Jiang Y. A Residual-Driven Secure Transmission and Detection Approach Against Stealthy Cyber-Physical Attacks for Accident Prevention. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2023; 18:5762-5771. [DOI: 10.1109/tifs.2023.3314194] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/03/2024]
Affiliation(s)
- Shimeng Wu
- Department of Control Science and Engineering, Harbin Institute of Technology (HIT), Harbin, China
| | - Hao Luo
- Department of Control Science and Engineering, Harbin Institute of Technology (HIT), Harbin, China
| | - Shen Yin
- Department of Mechanical and Industrial Engineering, Faculty of Engineering, Norwegian University of Science and Technology, Trondheim, Norway
| | - Kuan Li
- Shanghai Aerospace Control Technology Institute, Shanghai, China
| | - Yuchen Jiang
- Department of Control Science and Engineering, Harbin Institute of Technology (HIT), Harbin, China
| |
Collapse
|
34
|
Association Measure and Compact Prediction for Chemical Process Data from an Information-Theoretic Perspective. Processes (Basel) 2022. [DOI: 10.3390/pr10122659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Mutual information (MI) has been widely used for association mining in complex chemical processes, but how to precisely estimate MI between variables of different numerical types, discriminate their association relationships with targets and finally achieve compact and interpretable prediction has not been discussed in detail, which may limit MI in more complicated industrial applications. Therefore, this paper first reviews the existing information-based association measures and proposes a general framework, GIEF, to consistently detect associations and independence between different types of variables. Then, the study defines four mutually exclusive association relations of variables from an information-theoretic perspective to guide feature selection and compact prediction in high-dimensional processes. Based on GIEF and conditional mutual information maximization (CMIM), a new algorithm, CMIM-GIEF, is proposed and tested on a fluidized catalytic cracking (FCC) process with 217 variables, one which achieves significantly improved accuracies with fewer variables in predicting the yields of four crucial products. The compact variables identified are also consistent with the results of Shapley Additive exPlanations (SHAP) and industrial experience, proving good adaptivity of the method for chemical process data.
Collapse
|
35
|
Bayesian SIR model with change points with application to the Omicron wave in Singapore. Sci Rep 2022; 12:20864. [PMID: 36460721 PMCID: PMC9718478 DOI: 10.1038/s41598-022-25473-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 11/30/2022] [Indexed: 12/05/2022] Open
Abstract
The Omicron variant has led to a new wave of the COVID-19 pandemic worldwide, with unprecedented numbers of daily confirmed new cases in many countries and areas. To analyze the impact of society or policy changes on the development of the Omicron wave, the stochastic susceptible-infected-removed (SIR) model with change points is proposed to accommodate the situations where the transmission rate and the removal rate may vary significantly at change points. Bayesian inference based on a Markov chain Monte Carlo algorithm is developed to estimate both the locations of change points as well as the transmission rate and removal rate within each stage. Experiments on simulated data reveal the effectiveness of the proposed method, and several stages are detected in analyzing the Omicron wave data in Singapore.
Collapse
|
36
|
Wang Q, Dong A, Zhao J, Wang C, Griffin C, Gragnoli C, Xue F, Wu R. Vaginal microbiota networks as a mechanistic predictor of aerobic vaginitis. Front Microbiol 2022; 13:998813. [PMID: 36338093 PMCID: PMC9631484 DOI: 10.3389/fmicb.2022.998813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 09/09/2022] [Indexed: 09/07/2024] Open
Abstract
Aerobic vaginitis (AV) is a complex vaginal dysbiosis that is thought to be caused by the micro-ecological change of the vaginal microbiota. While most studies have focused on how changes in the abundance of individual microbes are associated with the emergence of AV, we still do not have a complete mechanistic atlas of the microbe-AV link. Network modeling is central to understanding the structure and function of any microbial community assembly. By encapsulating the abundance of microbes as nodes and ecological interactions among microbes as edges, microbial networks can reveal how each microbe functions and how one microbe cooperate or compete with other microbes to mediate the dynamics of microbial communities. However, existing approaches can only estimate either the strength of microbe-microbe link or the direction of this link, failing to capture full topological characteristics of a network, especially from high-dimensional microbial data. We combine allometry scaling law and evolutionary game theory to derive a functional graph theory that can characterize bidirectional, signed, and weighted interaction networks from any data domain. We apply our theory to characterize the causal interdependence between microbial interactions and AV. From functional networks arising from different functional modules, we find that, as the only favorable genus from Firmicutes among all identified genera, the role of Lactobacillus in maintaining vaginal microbial symbiosis is enabled by upregulation from other microbes, rather than through any intrinsic capacity. Among Lactobacillus species, the proportion of L. crispatus to L. iners is positively associated with more healthy acid vaginal ecosystems. In a less healthy alkaline ecosystem, L. crispatus establishes a contradictory relationship with other microbes, leading to population decrease relative to L. iners. We identify topological changes of vaginal microbiota networks when the menstrual cycle of women changes from the follicular to luteal phases. Our network tool provides a mechanistic approach to disentangle the internal workings of the microbiota assembly and predict its causal relationships with human diseases including AV.
Collapse
Affiliation(s)
- Qian Wang
- Department of Obstetrics and Gynecology, Tianjin Medical University General Hospital, Tianjin, China
- Tianjin Key Laboratory of Female Reproductive Health and Eugenics, Tianjin, China
| | - Ang Dong
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Jinshuai Zhao
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Chen Wang
- Department of Obstetrics and Gynecology, Tianjin Medical University General Hospital, Tianjin, China
- Tianjin Key Laboratory of Female Reproductive Health and Eugenics, Tianjin, China
| | - Christipher Griffin
- Applied Research Laboratory, The Pennsylvania State University, State College, PA, United States
| | - Claudia Gragnoli
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, United States
- Division of Endocrinology, Department of Medicine, Creighton University School of Medicine, Omaha, NE, United States
- Molecular Biology Laboratory, Bios Biotech Multi-Diagnostic Health Center, Rome, Italy
| | - Fengxia Xue
- Department of Obstetrics and Gynecology, Tianjin Medical University General Hospital, Tianjin, China
- Tianjin Key Laboratory of Female Reproductive Health and Eugenics, Tianjin, China
| | - Rongling Wu
- Center for Statistical Genetics, Department of Public Health Sciences and Statistics, The Pennsylvania State University, Hershey, PA, United States
| |
Collapse
|
37
|
Mahmoodifar S, Pangal DJ, Cardinal T, Craig D, Simon T, Tew BY, Yang W, Chang E, Yu M, Neman J, Mason J, Toga A, Salhia B, Zada G, Newton PK. A quantitative characterization of the spatial distribution of brain metastases from breast cancer and respective molecular subtypes. J Neurooncol 2022; 160:241-251. [DOI: 10.1007/s11060-022-04147-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 09/25/2022] [Indexed: 11/30/2022]
|
38
|
Estimating high-order brain functional networks by correlation-preserving embedding. Med Biol Eng Comput 2022; 60:2813-2823. [DOI: 10.1007/s11517-022-02628-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 06/07/2022] [Indexed: 11/26/2022]
|
39
|
Fan Zhang Y, Mameri S, Xie T, Sadoun A. Local similarity of activity patterns during auditory and visual processing. Neurosci Lett 2022; 790:136891. [PMID: 36181962 DOI: 10.1016/j.neulet.2022.136891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 09/20/2022] [Accepted: 09/26/2022] [Indexed: 11/29/2022]
Abstract
Neuroimaging studies have shown that brain activity is variable and changes according to stimuli and the environmental context, reflecting brain coding or information representations at different processing levels. However, little is known about activity organization that reflects coding strategies. Here, we explored and compared two different coding approaches, spatial via cross-correlation and intensity-based coding using mutual information. Using two fMRI datasets and different seeds, we searched for the spatial and intensity-based similarities with the seeds in brain activity. Our results showed that, apart from the seed regions, significant regions detected by intensity-based similarity analysis differ completely from those found using cross-correlation. These findings may indicate that information shared through spatial coding differs from that transmitted via non-spatial coding processes. Our results suggest that brain coding is organized in several different ways to optimize information processing.
Collapse
Affiliation(s)
- Yi Fan Zhang
- UMR 5549, Université de Toulouse 3, France, Centre National de la Recherche Scientifique, Toulouse, France; Centre de Recherche Cerveau et Cognition, Université de Toulouse 3, Université Paul Sabatier, Toulouse, France.
| | - Samir Mameri
- University of Bordj Bou Arreridj, Algeria; Laboratory of theoretical physics (LPT), University of Béjaïa, Algeria
| | - Ting Xie
- Centre de Recherches en Cancérologie de Toulouse (CRCT), INSERM U1037, Toulouse 31037, France; Université Paul Sabatier III, Toulouse 31400, Toulouse, France
| | | |
Collapse
|
40
|
Baikejuli M, Shi J, Hussain M. A study on the probabilistic quantification of heavy-truck crash risk under the influence of multi-factors. ACCIDENT; ANALYSIS AND PREVENTION 2022; 174:106771. [PMID: 35841687 DOI: 10.1016/j.aap.2022.106771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 01/20/2022] [Accepted: 07/07/2022] [Indexed: 06/15/2023]
Abstract
In order to manage and reduce a specific risk, its quantitative analysis is necessary. The key objective of this study is to illustrate the prevalence of multi-factors in fatal crashes involving heavy trucks and to quantify the crash risk under the influence of multi-factors. Data from a recent, nationally representative sample of fatal crashes was investigated to identify the risk factors contributing to crash causations and a novel risk index was obtained to develop a criterion for crash risk quantification. Then, based on the mutual information theory, the mutual dependence between risk factors was calculated to quantify the crash risk under different risk factor combinations. The results reveal that most heavy-truck fatal crashes are the result of co-occurring multi-factors rather than a single factor, and are mainly caused by simultaneous occurrence of two or three contributing factors. Moreover, crash risk increases with the increase of the number of risk factors influencing the driver. Furthermore, multi-factor interaction between certain risk factors, such as environmental and vehicular factors, makes incremental contribution to the crash risk by further increasing the crash probability. Specifically, when driver's aberrant behaviors (errors and/or violations) are exposed to both environmental and vehicular factors, driver's likelihood of being involved in a fatal crash increases significantly. This suggests that in addition to the number of risk factors, the crash risk also depends on the multi-factor interactions between different risk factors. Therefore, the effects of individual risk factors should be controlled at the outset to prevent the incremental effects of multi-factors on crash risk, in turn enabling risk minimization..
Collapse
Affiliation(s)
| | - Jing Shi
- Department of Civil Engineering, Tsinghua University, Beijing 100084, China.
| | - Muhammad Hussain
- Department of Civil Engineering, Tsinghua University, Beijing 100084, China
| |
Collapse
|
41
|
Yang J, Lu H, Li C, Hu X, Hu B. Data augmentation for depression detection using skeleton-based gait information. Med Biol Eng Comput 2022; 60:2665-2679. [PMID: 35829811 DOI: 10.1007/s11517-022-02595-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 04/22/2022] [Indexed: 11/27/2022]
Abstract
In recent years, the incidence of depression is rising rapidly worldwide, but large-scale depression screening is still challenging. Gait analysis provides a non-contact, low-cost, and efficient early screening method for depression. However, the early screening of depression based on gait analysis lacks sufficient effective sample data. In this paper, we propose a skeleton data augmentation method for assessing the risk of depression. First, we propose five techniques to augment skeleton data and apply them to depression and emotion datasets. Then, we divide augmentation methods into two types (non-noise augmentation and noise augmentation) based on the mutual information and the classification accuracy. Finally, we explore which augmentation strategies can capture the characteristics of human skeleton data more effectively. Experimental results show that the augmented training dataset that retains more of the raw skeleton data properties determines the performance of the detection model. Specifically, rotation augmentation and channel mask augmentation make the depression detection accuracy reach 92.15% and 91.34%, respectively.
Collapse
Affiliation(s)
- Jingjing Yang
- School of information Science and Engineering, Lanzhou University, Lanzhou, China
| | - Haifeng Lu
- School of information Science and Engineering, Lanzhou University, Lanzhou, China
| | - Chengming Li
- School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, China.
| | - Xiping Hu
- School of information Science and Engineering, Lanzhou University, Lanzhou, China. .,School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, China.
| | - Bin Hu
- School of information Science and Engineering, Lanzhou University, Lanzhou, China. .,Institute of Engineering Medicine, Beijing Institute of Technology, Beijing, China.
| |
Collapse
|
42
|
Epigenetic regulation of fetal brain development in pig. Gene 2022; 844:146823. [PMID: 35988784 DOI: 10.1016/j.gene.2022.146823] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 07/27/2022] [Accepted: 08/15/2022] [Indexed: 02/01/2023]
Abstract
How fetal brain development is regulated at the molecular level is not well understood. Due to ethical challenges associated with research on the human fetus, large animals particularly pigs are increasingly used to study development and disorders of fetal brain. The pig fetal brain grows rapidly during the last ∼ 50 days before birth which is around day 60 (d60) of pig gestation. But what regulates the onset of accelerated growth of the brain is unknown. The current study tests the hypothesis that epigenetic alteration around d60 is involved in the onset of rapid growth of fetal brain of pig. To test this hypothesis, DNA methylation changes of fetal brain was assessed in a genome-wide manner by Enzymatic Methyl-seq (EM-seq) during two gestational periods (GP): d45 vs. d60 (GP1) and d60 vs. d90 (GP2). The cytosine-guanine (CpG) methylation data was analyzed in an integrative manner with the RNA-seq data generated from the same brain samples from our earlier study. A neural network based modeling approach was implemented to learn changes in methylation patterns of the differentially expressed genes, and then predict methylations of the brain in a genome-wide manner during rapid growth. This approach identified specific methylations that changed in a mutually informative manner during rapid growth of the fetal brain. These methylations were significantly overrepresented in specific genic as well as intergenic features including CpG islands, introns, and untranslated regions. In addition, sex-bias methylations of known single nucleotide polymorphic sites were also identified in the fetal brain ide during rapid growth.
Collapse
|
43
|
Islam M, Strawn M, Behura SK. Fetal origin of sex‐bias brain aging. FASEB J 2022; 36:e22463. [DOI: 10.1096/fj.202200255rr] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 06/30/2022] [Accepted: 07/11/2022] [Indexed: 02/01/2023]
Affiliation(s)
- Maliha Islam
- Division of Animal Sciences University of Missouri Columbia Missouri USA
| | - Monica Strawn
- Division of Animal Sciences University of Missouri Columbia Missouri USA
| | - Susanta K. Behura
- Division of Animal Sciences University of Missouri Columbia Missouri USA
- MU Institute for Data Science and Informatics University of Missouri Columbia Missouri USA
- Interdisciplinary Neuroscience Program University of Missouri Columbia Missouri USA
| |
Collapse
|
44
|
Abstract
Maximal information coefficient (MIC) explores the associations between pairwise variables in complex relationships. It approaches the correlation by optimized partition on the axis. However, when the relationships meet special noise, MIC may overestimate the correlated value, which leads to the misidentification of the relationship without noiseless. In this article, a novel method of weighted information coefficient mean (WICM) is proposed to detect unbiased associations in large data sets. First, we mathematically analyze the cause of giving an abnormal correlation value to a noisy relationship. Then, the WICM is presented in two core steps. One is to detect the potential overestimation from the relationships with high value, and the other is to rectify the overestimation by calculating information coefficient mean instead of just selecting the maximum element in the characteristic matrix. Finally, experiments in functional relationships and real-world data relationships show that the overestimation can be solved by WICM with both feasibility and effectiveness.
Collapse
Affiliation(s)
- Chuanlu Liu
- Department of Data Science and Knowledge Engineering, School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
| | - Shuliang Wang
- Department of Data Science and Knowledge Engineering, School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
- Institute of E-Government, Beijing Institute of Technology, Beijing, China
| | - Hanning Yuan
- Department of Data Science and Knowledge Engineering, School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
| | - Xiaojia Liu
- Department of Data Science and Knowledge Engineering, School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
| |
Collapse
|
45
|
Zhang T, Wong G. Gene expression data analysis using Hellinger correlation in weighted gene co-expression networks (WGCNA). Comput Struct Biotechnol J 2022; 20:3851-3863. [PMID: 35891798 PMCID: PMC9307959 DOI: 10.1016/j.csbj.2022.07.018] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Revised: 07/09/2022] [Accepted: 07/09/2022] [Indexed: 12/24/2022] Open
Abstract
Weighted gene co-expression network analysis (WGCNA) is used to detect clusters with highly correlated genes. Measurements of correlation most typically rely on linear relationships. However, a linear relationship does not always model pairwise functional-related dependence between genes. In this paper, we first compared 6 different correlation methods in their ability to capture complex dependence between genes in three different tissues. Next, we compared their gene-pairwise coefficient results and corresponding WGCNA results. Finally, we applied a recently proposed correlation method, Hellinger correlation, as a more sensitive correlation measurement in WGCNA. To test this method, we constructed gene networks containing co-expression gene modules from RNA-seq data of human frontal cortex from Alzheimer's disease patients. To test the generality, we also used a microarray data set from human frontal cortex, single cell RNA-seq data from human prefrontal cortex, RNA-seq data from human temporal cortex, and GTEx data from heart. The Hellinger correlation method captures essentially similar results as other linear correlations in WGCNA, but provides additional new functional relationships as exemplified by uncovering a link between inflammation and mitochondria function. We validated the network constructed with the microarray and single cell sequencing data sets and a RNA-seq dataset of temporal cortex. We observed that this new correlation method enables the detection of non-linear biologically meaningful relationships among genes robustly and provides a complementary new approach to WGCNA. Thus, the application of Hellinger correlation to WGCNA provides a more flexible correlation approach to modelling networks in gene expression analysis that uncovers novel network relationships.
Collapse
Affiliation(s)
- Tianjiao Zhang
- Cancer Centre, Centre for Reproduction, Development and Aging, Department of Public Health and Medicinal Administration, Faculty of Health Sciences, University of Macau, Taipa 999078, Macau Special Administrative Region
| | - Garry Wong
- Cancer Centre, Centre for Reproduction, Development and Aging, Department of Public Health and Medicinal Administration, Faculty of Health Sciences, University of Macau, Taipa 999078, Macau Special Administrative Region
| |
Collapse
|
46
|
Approaches in Gene Coexpression Analysis in Eukaryotes. BIOLOGY 2022; 11:biology11071019. [PMID: 36101400 PMCID: PMC9312353 DOI: 10.3390/biology11071019] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 06/28/2022] [Accepted: 07/04/2022] [Indexed: 11/22/2022]
Abstract
Simple Summary Genes whose expression levels rise and fall similarly in a large set of samples, may be considered coexpressed. Gene coexpression analysis refers to the en masse discovery of coexpressed genes from a large variety of transcriptomic experiments. The type of biological networks that studies gene coexpression, known as Gene Coexpression Networks, consist of an undirected graph depicting genes and their coexpression relationships. Coexpressed genes are clustered in smaller subnetworks, the predominant biological roles of which can be determined through enrichment analysis. By studying well-annotated gene partners, the attribution of new roles to genes of unknown function or assumption for participation in common metabolic pathways can be achieved, through a guilt-by-association approach. In this review, we present key issues in gene coexpression analysis, as well as the most popular tools that perform it. Abstract Gene coexpression analysis constitutes a widely used practice for gene partner identification and gene function prediction, consisting of many intricate procedures. The analysis begins with the collection of primary transcriptomic data and their preprocessing, continues with the calculation of the similarity between genes based on their expression values in the selected sample dataset and results in the construction and visualisation of a gene coexpression network (GCN) and its evaluation using biological term enrichment analysis. As gene coexpression analysis has been studied extensively, we present most parts of the methodology in a clear manner and the reasoning behind the selection of some of the techniques. In this review, we offer a comprehensive and comprehensible account of the steps required for performing a complete gene coexpression analysis in eukaryotic organisms. We comment on the use of RNA-Seq vs. microarrays, as well as the best practices for GCN construction. Furthermore, we recount the most popular webtools and standalone applications performing gene coexpression analysis, with details on their methods, features and outputs.
Collapse
|
47
|
Enhancer methylation dynamics drive core transcriptional regulatory circuitry in pan-cancer. Oncogene 2022; 41:3474-3484. [PMID: 35655092 DOI: 10.1038/s41388-022-02359-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 05/11/2022] [Accepted: 05/19/2022] [Indexed: 12/16/2022]
Abstract
Accumulating evidence has demonstrated that enhancer methylation has strong and dynamic regulatory effects on gene expression. Some transcription factors (TFs) can auto- and cross-regulate in a feed-forward manner, and cooperate with their enhancers to form core transcriptional regulatory circuitries (CRCs). However, the elaborated regulatory mechanism between enhancer methylation and CRC remains the tip of the iceberg. Here, we revealed that DNA methylation could drive the tissue-specific enhancer basal transcription and target gene expression in human cancers. By integrating methylome, transcriptome, and 3D genomic data, we identified enhancer methylation triplets (enhancer methylation-enhancer transcription-target gene expression) and dissected potential regulatory patterns within them. Moreover, we observed that cancer-specific core TFs regulated by enhancers were able to shape their enhancer methylation forming the enhancer methylation-driven CRCs (emCRCs). Further parsing of clinical implications showed rewired emCRCs could serve as druggable targets and prognostic risk markers. In summary, the integrative analysis of enhancer methylation regulome would facilitate portraying the cancer epigenomics landscape and developing the epigenetic anti-cancer approaches.
Collapse
|
48
|
|
49
|
Passemiers A, Moreau Y, Raimondi D. Fast and accurate inference of gene regulatory networks through robust precision matrix estimation. Bioinformatics 2022; 38:2802-2809. [PMID: 35561176 PMCID: PMC9113237 DOI: 10.1093/bioinformatics/btac178] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 03/14/2022] [Accepted: 03/22/2022] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Transcriptional regulation mechanisms allow cells to adapt and respond to external stimuli by altering gene expression. The possible cell transcriptional states are determined by the underlying gene regulatory network (GRN), and reliably inferring such network would be invaluable to understand biological processes and disease progression. RESULTS In this article, we present a novel method for the inference of GRNs, called PORTIA, which is based on robust precision matrix estimation, and we show that it positively compares with state-of-the-art methods while being orders of magnitude faster. We extensively validated PORTIA using the DREAM and MERLIN+P datasets as benchmarks. In addition, we propose a novel scoring metric that builds on graph-theoretical concepts. AVAILABILITY AND IMPLEMENTATION The code and instructions for data acquisition and full reproduction of our results are available at https://github.com/AntoinePassemiers/PORTIA-Manuscript. PORTIA is available on PyPI as a Python package (portia-grn). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
50
|
Patowary P, Bhattacharyya DK, Barah P. SNMRS: An advanced measure for Co-expression network analysis. Comput Biol Med 2022; 143:105222. [PMID: 35121360 DOI: 10.1016/j.compbiomed.2022.105222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 01/03/2022] [Accepted: 01/03/2022] [Indexed: 11/17/2022]
Abstract
The challenge of identifying modules in a gene interaction network is important for a better understanding of the overall network architecture. In this work, we develop a novel similarity measure called Scaling-and-Shifting Normalized Mean Residue Similarity (SNMRS), based on the existing NMRS technique [1]. SNMRS yields correlation values in the range of 0 to +1 corresponding to negative and positive dependency. To study the performance of our measure, internal validation of extracted clusters resulting from different methods is carried out. Based on the performance, we choose hierarchical clustering and apply the same using the corresponding dissimilarity (distance) values of SNMRS scores, and utilize a dynamic tree cut method for extracting dense modules. The modules are validated using a literature search, KEGG pathway analysis, and gene-ontology analyses on the genes that make up the modules. Moreover, our measure can handle absolute, shifting, scaling, and shifting-and-scaling correlations and provides better performance than several other measures in terms of cluster-validity indices. Also, SNMRS based module detection method results in interesting biologically relevant patterns from gene microarray and RNA-seq dataset. A set of crucial genes having high relevance with the ESCC are also identified.
Collapse
Affiliation(s)
- Pallabi Patowary
- Department of Computer Science and Engineering, Tezpur University, Assam, India.
| | | | - Pankaj Barah
- Dept. of Molecular Biology and Biotechnology Tezpur University, Assam, India.
| |
Collapse
|