1
|
Kim M, Jang YJ, Lee M, Guo Q, Son AJ, Kakkad NA, Roland AB, Lee BK, Kim J. The transcriptional regulatory network modulating human trophoblast stem cells to extravillous trophoblast differentiation. Nat Commun 2024; 15:1285. [PMID: 38346993 PMCID: PMC10861538 DOI: 10.1038/s41467-024-45669-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 01/31/2024] [Indexed: 02/15/2024] Open
Abstract
During human pregnancy, extravillous trophoblasts play crucial roles in placental invasion into the maternal decidua and spiral artery remodeling. However, regulatory factors and their action mechanisms modulating human extravillous trophoblast specification have been unknown. By analyzing dynamic changes in transcriptome and enhancer profile during human trophoblast stem cell to extravillous trophoblast differentiation, we define stage-specific regulators, including an early-stage transcription factor, TFAP2C, and multiple late-stage transcription factors. Loss-of-function studies confirm the requirement of all transcription factors identified for adequate differentiation, and we reveal that the dynamic changes in the levels of TFAP2C are essential. Notably, TFAP2C pre-occupies the regulatory elements of the inactive extravillous trophoblast-active genes during the early stage of differentiation, and the late-stage transcription factors directly activate extravillous trophoblast-active genes, including themselves as differentiation further progresses, suggesting sequential actions of transcription factors assuring differentiation. Our results reveal stage-specific transcription factors and their inter-connected regulatory mechanisms modulating extravillous trophoblast differentiation, providing a framework for understanding early human placentation and placenta-related complications.
Collapse
Affiliation(s)
- Mijeong Kim
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Yu Jin Jang
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Muyoung Lee
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Qingqing Guo
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Albert J Son
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Nikita A Kakkad
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Abigail B Roland
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Bum-Kyu Lee
- Department of Biomedical Sciences, Cancer Research Center, University at Albany, State University of New York, Rensselaer, NY, 12144, USA
| | - Jonghwan Kim
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, 78712, USA.
| |
Collapse
|
2
|
Di Giovanni N, Meuwis MA, Louis E, Focant JF. Correlations for untargeted GC × GC-HRTOF-MS metabolomics of colorectal cancer. Metabolomics 2023; 19:85. [PMID: 37740774 DOI: 10.1007/s11306-023-02047-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Accepted: 08/28/2023] [Indexed: 09/25/2023]
Abstract
INTRODUCTION Modern comprehensive instrumentations provide an unprecedented coverage of complex matrices in the form of high-dimensional, information rich data sets. OBJECTIVES In addition to the usual biomarker research that focuses on the detection of the studied condition, we aimed to define a proper strategy to conduct a correlation analysis on an untargeted colorectal cancer case study with a data set of 102 variables corresponding to metabolites obtained from serum samples analyzed with comprehensive two-dimensional gas chromatography coupled to high-resolution time-of-flight mass spectrometry (GC × GC-HRTOF-MS). Indeed, the strength of association existing between the metabolites contains potentially valuable information about the molecular mechanisms involved and the underlying metabolic network associated to a global perturbation, at no additional analytical effort. METHODS Following Anscombe's quartet, we took particular attention to four main aspects. First, the presence of non-linear relationships through the comparison of parametric and non-parametric correlation coefficients: Pearson's r, Spearman's rho, Kendall's tau and Goodman-Kruskal's gamma. Second, the visual control of the detected associations through scatterplots and their associated regressions and angles. Third, the effect and handling of atypical samples and values. Fourth, the role of the precision of the data on the attribution of the ranks through the presence of ties. RESULTS Kendall's tau was found the method of choice for the data set at hand. Its application highlighted 17 correlations significantly altered in the active state of colorectal cancer (CRC) in comparison to matched healthy controls (HC), from which 10 were specific to this state in comparison to the remission one (R-CRC) investigated on distinct patients. 15 metabolites involved in the correlations of interest, on the 25 unique ones obtained, were annotated (Metabolomics Standards Initiative level 2). CONCLUSIONS The metabolites highlighted could be used to better understand the pathology. The systematic investigation of the methodological aspects that we expose allows to implement correlation analysis to various fields and many specific cases.
Collapse
Affiliation(s)
- Nicolas Di Giovanni
- Department of Chemistry, Organic and Biological Analytical Chemistry Group, Quartier Agora, University of Liège, Allée du Six Août,B6c, B-4000, Liège, Sart Tilman, Belgium
| | - Marie-Alice Meuwis
- GIGA Institute, Translational Gastroenterology and CHU de Liège, Hepato-Gastroenterology and Digestive Oncology, Quartier Hôpital, University of Liège, Avenue de L'Hôpital 13, B34-35, B-4000, Liège, Belgium
| | - Edouard Louis
- GIGA Institute, Translational Gastroenterology and CHU de Liège, Hepato-Gastroenterology and Digestive Oncology, Quartier Hôpital, University of Liège, Avenue de L'Hôpital 13, B34-35, B-4000, Liège, Belgium
| | - Jean-François Focant
- Department of Chemistry, Organic and Biological Analytical Chemistry Group, Quartier Agora, University of Liège, Allée du Six Août,B6c, B-4000, Liège, Sart Tilman, Belgium.
| |
Collapse
|
3
|
Qiu G, Wang H, Yan Q, Ma H, Niu R, Lei Y, Xiao Y, Zhou L, Yang H, Xu C, Zhang X, He M, Tang H, Hu Z, Pan A, Shen H, Wu T. A Lipid Signature with Perturbed Triacylglycerol Co-Regulation, Identified from Targeted Lipidomics, Predicts Risk for Type 2 Diabetes and Mediates the Risk from Adiposity in Two Prospective Cohorts of Chinese Adults. Clin Chem 2022; 68:1094-1107. [PMID: 35708664 DOI: 10.1093/clinchem/hvac090] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 04/18/2022] [Indexed: 11/13/2022]
Abstract
BACKGROUND The roles of individual and co-regulated lipid molecular species in the development of type 2 diabetes (T2D) and mediation from metabolic risk factors remain unknown. METHODS We conducted profiling of 166 plasma lipid species in 2 nested case-control studies within 2 independent cohorts of Chinese adults, the Dongfeng-Tongji and the Jiangsu non-communicable disease cohorts. After 4.61 (0.15) and 7.57 (1.13) years' follow-up, 1039 and 520 eligible participants developed T2D in these 2 cohorts, respectively, and controls were 1:1 matched to cases by age and sex. RESULTS We found 27 lipid species, including 10 novel ones, consistently associated with T2D risk in the 2 cohorts. Differential correlation network analysis revealed significant correlations of triacylglycerol (TAG) 50:3, containing at least one oleyl chain, with 6 TAGs, at least 3 of which contain the palmitoyl chain, all downregulated within cases relative to controls among the 27 lipids in both cohorts, while the networks also both identified the oleyl chain-containing TAG 50:3 as the central hub. We further found that 13 of the 27 lipids consistently mediated the association between adiposity indicators (body mass index, waist circumference, and waist-to-height ratio) and diabetes risk in both cohorts (all P < 0.05; proportion mediated: 20.00%, 17.70%, and 17.71%, and 32.50%, 28.73%, and 33.86%, respectively). CONCLUSIONS Our findings suggested notable perturbed co-regulation, inferred from differential correlation networks, between oleyl chain- and palmitoyl chain-containing TAGs before diabetes onset, with the oleyl chain-containing TAG 50:3 at the center, and provided novel etiological insight regarding lipid dysregulation in the progression from adiposity to overt T2D.
Collapse
Affiliation(s)
- Gaokun Qiu
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Hao Wang
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Qi Yan
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Hongxia Ma
- Department of Epidemiology, School of Public Health, Nanjing Medical University, Nanjing 211166, China
| | - Rundong Niu
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Yanshou Lei
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Yang Xiao
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Lue Zhou
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Handong Yang
- Department of Cardiovascular Disease, Sinopharm Dongfeng General Hospital, Hubei University of Medicine, Shiyan 442008, China
| | - Chengwei Xu
- Department of Cardiovascular Disease, Sinopharm Dongfeng General Hospital, Hubei University of Medicine, Shiyan 442008, China
| | - Xiaomin Zhang
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Meian He
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Huiru Tang
- State Key Laboratory of Genetic Engineering, Fudan University, Shanghai 200433, China.,CAS Key Laboratory of Magnetic Resonance in Biological Systems, University of Chinese Academy of Sciences, Wuhan 430071, China
| | - Zhibin Hu
- Department of Epidemiology, School of Public Health, Nanjing Medical University, Nanjing 211166, China
| | - An Pan
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Hongbing Shen
- Department of Epidemiology, School of Public Health, Nanjing Medical University, Nanjing 211166, China
| | - Tangchun Wu
- Ministry of Education and State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| |
Collapse
|
4
|
Grimes T, Datta S. SeqNet: An R Package for Generating Gene-Gene Networks and Simulating RNA-Seq Data. J Stat Softw 2021; 98:10.18637/jss.v098.i12. [PMID: 34321962 PMCID: PMC8315007 DOI: 10.18637/jss.v098.i12] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Gene expression data provide an abundant resource for inferring connections in gene regulatory networks. While methodologies developed for this task have shown success, a challenge remains in comparing the performance among methods. Gold-standard datasets are scarce and limited in use. And while tools for simulating expression data are available, they are not designed to resemble the data obtained from RNA-seq experiments. SeqNet is an R package that provides tools for generating a rich variety of gene network structures and simulating RNA-seq data from them. This produces in silico RNA-seq data for benchmarking and assessing gene network inference methods. The package is available on CRAN and on GitHub at https://github.com/tgrimes/SeqNet.
Collapse
Affiliation(s)
- Tyler Grimes
- Univeristy of Florida, Department of Biostatistics
| | | |
Collapse
|
5
|
Davenport KM, Massa AT, Bhattarai S, McKay SD, Mousel MR, Herndon MK, White SN, Cockett NE, Smith TPL, Murdoch BM. Characterizing Genetic Regulatory Elements in Ovine Tissues. Front Genet 2021; 12:628849. [PMID: 34093640 PMCID: PMC8173140 DOI: 10.3389/fgene.2021.628849] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 03/25/2021] [Indexed: 12/11/2022] Open
Abstract
The Ovine Functional Annotation of Animal Genomes (FAANG) project, part of the broader livestock species FAANG initiative, aims to identify and characterize gene regulatory elements in domestic sheep. Regulatory element annotation is essential for identifying genetic variants that affect health and production traits in this important agricultural species, as greater than 90% of variants underlying genetic effects are estimated to lie outside of transcribed regions. Histone modifications that distinguish active or repressed chromatin states, CTCF binding, and DNA methylation were used to characterize regulatory elements in liver, spleen, and cerebellum tissues from four yearling sheep. Chromatin immunoprecipitation with sequencing (ChIP-seq) was performed for H3K4me3, H3K27ac, H3K4me1, H3K27me3, and CTCF. Nine chromatin states including active promoters, active enhancers, poised enhancers, repressed enhancers, and insulators were characterized in each tissue using ChromHMM. Whole-genome bisulfite sequencing (WGBS) was performed to determine the complement of whole-genome DNA methylation with the ChIP-seq data. Hypermethylated and hypomethylated regions were identified across tissues, and these locations were compared with chromatin states to better distinguish and validate regulatory elements in these tissues. Interestingly, chromatin states with the poised enhancer mark H3K4me1 in the spleen and cerebellum and CTCF in the liver displayed the greatest number of hypermethylated sites. Not surprisingly, active enhancers in the liver and spleen, and promoters in the cerebellum, displayed the greatest number of hypomethylated sites. Overall, chromatin states defined by histone marks and CTCF occupied approximately 22% of the genome in all three tissues. Furthermore, the liver and spleen displayed in common the greatest percent of active promoter (65%) and active enhancer (81%) states, and the liver and cerebellum displayed in common the greatest percent of poised enhancer (53%), repressed enhancer (68%), hypermethylated sites (75%), and hypomethylated sites (73%). In addition, both known and de novo CTCF-binding motifs were identified in all three tissues, with the highest number of unique motifs identified in the cerebellum. In summary, this study has identified the regulatory regions of genes in three tissues that play key roles in defining health and economically important traits and has set the precedent for the characterization of regulatory elements in ovine tissues using the Rambouillet reference genome.
Collapse
Affiliation(s)
- Kimberly M. Davenport
- Department of Animal, Veterinary, and Food Science, University of Idaho, Moscow, ID, United States
| | - Alisha T. Massa
- Department of Veterinary Microbiology and Pathology, Washington State University, Pullman, WA, United States
| | | | | | - Michelle R. Mousel
- USDA, ARS, Animal Disease Research Unit, Pullman, WA, United States
- Paul G. Allen School for Global Animal Health, Washington State University, Pullman, WA, United States
| | - Maria K. Herndon
- Department of Veterinary Microbiology and Pathology, Washington State University, Pullman, WA, United States
| | - Stephen N. White
- Department of Veterinary Microbiology and Pathology, Washington State University, Pullman, WA, United States
- USDA, ARS, Animal Disease Research Unit, Pullman, WA, United States
- Center for Reproductive Biology, Washington State University, Pullman, WA, United States
| | | | - Timothy P. L. Smith
- USDA, ARS, U.S. Meat Animal Research Center (USMARC), Clay Center, NE, United States
| | - Brenda M. Murdoch
- Department of Animal, Veterinary, and Food Science, University of Idaho, Moscow, ID, United States
- Center for Reproductive Biology, Washington State University, Pullman, WA, United States
| | | |
Collapse
|
6
|
Arbet J, Zhuang Y, Litkowski E, Saba L, Kechris K. Comparing Statistical Tests for Differential Network Analysis of Gene Modules. Front Genet 2021; 12:630215. [PMID: 34093641 PMCID: PMC8170128 DOI: 10.3389/fgene.2021.630215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 04/19/2021] [Indexed: 11/13/2022] Open
Abstract
Genes often work together to perform complex biological processes, and "networks" provide a versatile framework for representing the interactions between multiple genes. Differential network analysis (DiNA) quantifies how this network structure differs between two or more groups/phenotypes (e.g., disease subjects and healthy controls), with the goal of determining whether differences in network structure can help explain differences between phenotypes. In this paper, we focus on gene co-expression networks, although in principle, the methods studied can be used for DiNA for other types of features (e.g., metabolome, epigenome, microbiome, proteome, etc.). Three common applications of DiNA involve (1) testing whether the connections to a single gene differ between groups, (2) testing whether the connection between a pair of genes differs between groups, or (3) testing whether the connections within a "module" (a subset of 3 or more genes) differs between groups. This article focuses on the latter, as there is a lack of studies comparing statistical methods for identifying differentially co-expressed modules (DCMs). Through extensive simulations, we compare several previously proposed test statistics and a new p-norm difference test (PND). We demonstrate that the true positive rate of the proposed PND test is competitive with and often higher than the other methods, while controlling the false positive rate. The R package discoMod (differentially co-expressed modules) implements the proposed method and provides a full pipeline for identifying DCMs: clustering tools to derive gene modules, tests to identify DCMs, and methods for visualizing the results.
Collapse
Affiliation(s)
- Jaron Arbet
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Yaxu Zhuang
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Elizabeth Litkowski
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Laura Saba
- Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora CO, United States
| | - Katerina Kechris
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| |
Collapse
|
7
|
Xu B, Su H, Wang R, Wang Y, Zhang W. Metabolic networks of plasma and joint fluid base on differential correlation. PLoS One 2021; 16:e0247191. [PMID: 33617578 PMCID: PMC7899361 DOI: 10.1371/journal.pone.0247191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 02/02/2021] [Indexed: 11/18/2022] Open
Abstract
Whether osteoarthritis (OA) is a systemic metabolic disorder remains controversial. The aim of this study was to investigate the metabolic characteristics between plasma and knee joint fluid (JF) of patients with advanced OA using a differential correlation metabolic (DCM) networks approach. Plasma and JF were collected during the joint replacement surgery of patients with knee OA. The biological samples were pretreated with standard procedures for metabolite analysis. The metabolic profiling was conducted by means of liquid mass spectrometry coupled with a AbsoluteIDQ kit. A DCM network approach was adopted for analyzing the metabolomics data between the plasma and JF. The variation in the correlation of the pairwise metabolites was quantified across the plasma and JF samples, and networks analysis was used to characterize the difference in the correlations of the metabolites from the two sample types. Core metabolites that played an important role in the DCM networks were identified via topological analysis. One hundred advanced OA patients (50 men and 50 women) were included in this study, with an average age of 65.0 ± 7.6 years (65.6 ± 7.1 years for females and 64.4 ± 8.1 years for males) and a mean BMI of 32.6 ± 5.8 kg/m2 (33.4 ± 6.3 kg/m2 for females and 31.7 ± 5.3 kg/m2 for males). Age and BMI matched between the male and female groups. One hundred and forty-five nodes, 567 edges, and 131 nodes, 407 edges were found in the DCM networks (p < 0.05) of the female and male groups, respectively. Six metabolites in the female group and 5 metabolites in the male group were identified as key nodes in the network. There was a significant difference in the differential correlation metabolism networks of plasma and JF that may be related to local joint metabolism. Focusing on these key metabolites may help uncover the pathogenesis of knee OA. In addition, the differential metabolic correlation between plasma and JF mostly overlapped, indicating that these common correlations of pairwise metabolites may be a reflection of systemic characteristics of JF and that most significant correlation variations were just a result of "housekeeping” biological reactions.
Collapse
Affiliation(s)
- Bingyong Xu
- School of Pharmaceutical Sciences, Jilin University, Changchun, Jilin, China
- Hangzhou Heze Pharmaceutical Technology CO.,LTD, Hangzhou, Zhejiang, China
| | - Hong Su
- School of Pharmaceutical Sciences, Jilin University, Changchun, Jilin, China
- Department of Pharmacy and Examination, Daqing Medical College, Daqing, Heilongjiang, China
| | - Ruya Wang
- School of Pharmaceutical Sciences, Jilin University, Changchun, Jilin, China
| | - Yixiao Wang
- School of Pharmaceutical Sciences, Jilin University, Changchun, Jilin, China
| | - Weidong Zhang
- School of Pharmaceutical Sciences, Jilin University, Changchun, Jilin, China
- * E-mail:
| |
Collapse
|
8
|
Bloch NI, Corral‐López A, Buechel SD, Kotrschal A, Kolm N, Mank JE. Different mating contexts lead to extensive rewiring of female brain coexpression networks in the guppy. GENES BRAIN AND BEHAVIOR 2020; 20:e12697. [DOI: 10.1111/gbb.12697] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 08/10/2020] [Accepted: 08/29/2020] [Indexed: 12/19/2022]
Affiliation(s)
- Natasha I. Bloch
- Department of Biomedical Engineering Universidad de Los Andes Bogotá D.C. Colombia
| | - Alberto Corral‐López
- Department of Zoology/Ethology Stockholm University Stockholm Sweden
- Department of Genetics, Evolution and Environment University College London UK
| | | | - Alexander Kotrschal
- Department of Zoology/Ethology Stockholm University Stockholm Sweden
- Wageningen University Behavioral Ecology Group Wageningen Netherlands
| | - Niclas Kolm
- Department of Zoology/Ethology Stockholm University Stockholm Sweden
| | - Judith E. Mank
- University of British Columbia Department of Zoology and Biodiversity Research Centre Vancouver Canada
- Department of Genetics, Evolution and Environment University College London UK
| |
Collapse
|
9
|
Li J, Yao Z, Duan M, Liu S, Li F, Zhu H, Xia Z, Huang L, Zhou F. MuscNet, a Weighted Voting Model of Multi-Source Connectivity Networks to Predict Mild Cognitive Impairment Using Resting-State Functional MRI. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:174023-174031. [PMID: 35548102 PMCID: PMC9090182 DOI: 10.1109/access.2020.3025828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The neurological disorder mild cognitive impairment (MCI) demonstrates minor impacts on the patient's daily activities and may be ignored as the status of normal aging. But some of the MCI patients may further develop into severe statuses like Alzheimer's disease (AD). The brain functional connectivity network (BFCN) was usually constructed from the resting-state functional magnetic resonance imaging (rs-fMRI) data. This technology has been widely used to detect the neurodegenerative dementia and to reveal the intrinsic mechanism of neural activities. The BFCN edge was usually determined by the pairwise correlation between the brain regions. This study proposed a weighted voting model of multi-source connectivity networks (MuscNet) by integrating multiple BFCNs of different correlation coefficients. Our model was further improved by removing redundant features. The experimental data demonstrated that different BFCNs contributed complementary information to each other and MuscNet outperformed the existing models on detecting MCI patients. The previous study suggested the existence of multiple solutions with similarly good performance for a machine learning problem. The proposed model MuscNet utilized a weighted voting strategy to slightly outperform the existing studies, suggesting an effective way to fuse multiple base models. The reason may need further theoretical investigations about why different base models contribute to each other for the MCI prediction.
Collapse
Affiliation(s)
- Jialiang Li
- BioKnow Health Informatics Laboratory, College of Software, Jilin University, Changchun 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
| | - Zhaomin Yao
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
- BioKnow Health Informatics Laboratory, College of Computer Science and Technology, Jilin University, Changchun 130012, China
- Cancer Systems Biology Center, China-Japan Union Hospital of Jilin University, Changchun 130012, China
| | - Meiyu Duan
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
- BioKnow Health Informatics Laboratory, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Shuai Liu
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
- BioKnow Health Informatics Laboratory, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Fei Li
- BioKnow Health Informatics Laboratory, College of Software, Jilin University, Changchun 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
| | - Haiyang Zhu
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
- BioKnow Health Informatics Laboratory, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Zhiqiang Xia
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
- BioKnow Health Informatics Laboratory, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Lan Huang
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
- BioKnow Health Informatics Laboratory, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Fengfeng Zhou
- BioKnow Health Informatics Laboratory, College of Software, Jilin University, Changchun 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
- BioKnow Health Informatics Laboratory, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| |
Collapse
|
10
|
Chowdhury HA, Bhattacharyya DK, Kalita JK. (Differential) Co-Expression Analysis of Gene Expression: A Survey of Best Practices. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:1154-1173. [PMID: 30668502 DOI: 10.1109/tcbb.2019.2893170] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Analysis of gene expression data is widely used in transcriptomic studies to understand functions of molecules inside a cell and interactions among molecules. Differential co-expression analysis studies diseases and phenotypic variations by finding modules of genes whose co-expression patterns vary across conditions. We review the best practices in gene expression data analysis in terms of analysis of (differential) co-expression, co-expression network, differential networking, and differential connectivity considering both microarray and RNA-seq data along with comparisons. We highlight hurdles in RNA-seq data analysis using methods developed for microarrays. We include discussion of necessary tools for gene expression analysis throughout the paper. In addition, we shed light on scRNA-seq data analysis by including preprocessing and scRNA-seq in co-expression analysis along with useful tools specific to scRNA-seq. To get insights, biological interpretation and functional profiling is included. Finally, we provide guidelines for the analyst, along with research issues and challenges which should be addressed.
Collapse
|
11
|
Ramirez R, Chiu YC, Hererra A, Mostavi M, Ramirez J, Chen Y, Huang Y, Jin YF. Classification of Cancer Types Using Graph Convolutional Neural Networks. FRONTIERS IN PHYSICS 2020; 8:203. [PMID: 33437754 PMCID: PMC7799442 DOI: 10.3389/fphy.2020.00203] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
BACKGROUND Cancer has been a leading cause of death in the United States with significant health care costs. Accurate prediction of cancers at an early stage and understanding the genomic mechanisms that drive cancer development are vital to the improvement of treatment outcomes and survival rates, thus resulting in significant social and economic impacts. Attempts have been made to classify cancer types with machine learning techniques during the past two decades and deep learning approaches more recently. RESULTS In this paper, we established four models with graph convolutional neural network (GCNN) that use unstructured gene expressions as inputs to classify different tumor and non-tumor samples into their designated 33 cancer types or as normal. Four GCNN models based on a co-expression graph, co-expression+singleton graph, protein-protein interaction (PPI) graph, and PPI+singleton graph have been designed and implemented. They were trained and tested on combined 10,340 cancer samples and 731 normal tissue samples from The Cancer Genome Atlas (TCGA) dataset. The established GCNN models achieved excellent prediction accuracies (89.9-94.7%) among 34 classes (33 cancer types and a normal group). In silico gene-perturbation experiments were performed on four models based on co-expression graph, co-expression+singleton, PPI graph, and PPI+singleton graphs. The co-expression GCNN model was further interpreted to identify a total of 428 markers genes that drive the classification of 33 cancer types and normal. The concordance of differential expressions of these markers between the represented cancer type and others are confirmed. Successful classification of cancer types and a normal group regardless of normal tissues' origin suggested that the identified markers are cancer-specific rather than tissue-specific. CONCLUSION Novel GCNN models have been established to predict cancer types or normal tissue based on gene expression profiles. We demonstrated the results from the TCGA dataset that these models can produce accurate classification (above 94%), using cancer-specific markers genes. The models and the source codes are publicly available and can be readily adapted to the diagnosis of cancer and other diseases by the data-driven modeling research community.
Collapse
Affiliation(s)
- Ricardo Ramirez
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, Texas 78249, USA
| | - Yu-Chiao Chiu
- Greehey Children’s Cancer Research Institute, The University of Texas Health San Antonio, San Antonio, TX, 78229, USA
| | - Allen Hererra
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, Texas 78249, USA
| | - Milad Mostavi
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, Texas 78249, USA
| | - Joshua Ramirez
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, Texas 78249, USA
| | - Yidong Chen
- Greehey Children’s Cancer Research Institute, The University of Texas Health San Antonio, San Antonio, TX, 78229, USA
- Department of Population Health Sciences, The University of Texas Health San Antonio, San Antonio, Texas 78229, USA
| | - Yufei Huang
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, Texas 78249, USA
- Department of Population Health Sciences, The University of Texas Health San Antonio, San Antonio, Texas 78229, USA
| | - Yu-Fang Jin
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, Texas 78249, USA
| |
Collapse
|
12
|
Hippocampal Subregion Transcriptomic Profiles Reflect Strategy Selection during Cognitive Aging. J Neurosci 2020; 40:4888-4899. [PMID: 32376783 DOI: 10.1523/jneurosci.2944-19.2020] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Revised: 04/08/2020] [Accepted: 04/30/2020] [Indexed: 12/18/2022] Open
Abstract
Age-related cognitive impairments are associated with differentially expressed genes (DEGs) linked to defined neural systems; however, studies examining multiple regions of the hippocampus fail to find links between behavior and transcription in the dentate gyrus (DG). We hypothesized that use of a task requiring intact DG function would emphasize molecular signals in the DG associated with a decline in performance. We used a water maze beacon discrimination task to characterize young and middle-age male F344 rats, followed by a spatial reference memory probe trial test. Middle-age rats showed increased variability in discriminating two identical beacons. Use of an allocentric strategy and formation of a spatial reference memory were not different between age groups; however, older animals compensated for impaired beacon discrimination through greater reliance on spatial reference memory. mRNA sequencing of hippocampal subregions indicated DEGs in the DG of middle-age rats, linked to synaptic function and neurogenesis, correlated with beacon discrimination performance, suggesting that senescence of the DG underlies the impairment. Few genes correlated with spatial memory across age groups, with a greater number in region CA1. Age-related CA1 DEGs, correlated with spatial memory, were linked to regulation of neural activity. These results indicate that the beacon task is sensitive to impairment in middle age, and distinct gene profiles are observed in neural circuits that underlie beacon discrimination performance and allocentric memory. The use of different strategies in older animals and associated transcriptional profiles could provide an animal model for examining cognitive reserve and neural compensation of aging.SIGNIFICANCE STATEMENT Hippocampal subregions are thought to differentially contribute to memory. We took advantage of age-related variability in performance on a water maze beacon task and next-generation sequencing to test the hypothesis that aging of the dentate gyrus is linked to impaired beacon discrimination and compensatory use of allocentric memory. The dentate gyrus expressed synaptic function and neurogenesis genes correlated with beacon discrimination in middle-age animals. Spatial reference memory was associated with CA1 transcriptional correlates linked to regulation of neural activity and use of an allocentric strategy. This is the first study examining transcriptomes of multiple hippocampal subregions to link age-related impairments associated with discrimination of feature overlap and alternate response strategies to gene expression in specific hippocampal subregions.
Collapse
|
13
|
Ghazanfar S, Strbenac D, Ormerod JT, Yang JYH, Patrick E. DCARS: differential correlation across ranked samples. Bioinformatics 2019; 35:823-829. [PMID: 30102408 DOI: 10.1093/bioinformatics/bty698] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Revised: 07/19/2018] [Accepted: 08/07/2018] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION Genes act as a system and not in isolation. Thus, it is important to consider coordinated changes of gene expression rather than single genes when investigating biological phenomena such as the aetiology of cancer. We have developed an approach for quantifying how changes in the association between pairs of genes may inform the outcome of interest called Differential Correlation across Ranked Samples (DCARS). Modelling gene correlation across a continuous sample ranking does not require the dichotomisation of samples into two distinct classes and can identify differences in gene correlation across early, mid or late stages of the outcome of interest. RESULTS When we evaluated DCARS against the typical Fisher Z-transformation test for differential correlation, as well as a typical approach testing for interaction within a linear model, on real TCGA data, DCARS significantly ranked gene pairs containing known cancer genes more highly across several cancers. Similar results are found with our simulation study. DCARS was applied to 13 cancers datasets in TCGA, revealing several distinct relationships for which survival ranking was found to be associated with a change in correlation between genes. Furthermore, we demonstrated that DCARS can be used in conjunction with network analysis techniques to extract biological meaning from multi-layered and complex data. AVAILABILITY AND IMPLEMENTATION DCARS R package and sample data are available at https://github.com/shazanfar/DCARS. Publicly available data from The Cancer Genome Atlas (TCGA) was used using the TCGABiolinks R package. Supplementary Files and DCARS R package is available at https://github.com/shazanfar/DCARS. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shila Ghazanfar
- The Judith and David Coffey Life Lab, Charles Perkins Centre, University of Sydney, Sydney, NSW, Australia.,School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia
| | - Dario Strbenac
- School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia
| | - John T Ormerod
- School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia.,ARC Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS), Richard Berry Building, The University of Melbourne, Melbourne, Parkville, VIC, Australia
| | - Jean Y H Yang
- The Judith and David Coffey Life Lab, Charles Perkins Centre, University of Sydney, Sydney, NSW, Australia.,School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia
| | - Ellis Patrick
- School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia.,Westmead Institute for Medical Research, University of Sydney, Westmead, NSW, Australia
| |
Collapse
|
14
|
Bhuva DD, Cursons J, Smyth GK, Davis MJ. Differential co-expression-based detection of conditional relationships in transcriptional data: comparative analysis and application to breast cancer. Genome Biol 2019; 20:236. [PMID: 31727119 PMCID: PMC6857226 DOI: 10.1186/s13059-019-1851-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Accepted: 10/02/2019] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Elucidation of regulatory networks, including identification of regulatory mechanisms specific to a given biological context, is a key aim in systems biology. This has motivated the move from co-expression to differential co-expression analysis and numerous methods have been developed subsequently to address this task; however, evaluation of methods and interpretation of the resulting networks has been hindered by the lack of known context-specific regulatory interactions. RESULTS In this study, we develop a simulator based on dynamical systems modelling capable of simulating differential co-expression patterns. With the simulator and an evaluation framework, we benchmark and characterise the performance of inference methods. Defining three different levels of "true" networks for each simulation, we show that accurate inference of causation is difficult for all methods, compared to inference of associations. We show that a z-score-based method has the best general performance. Further, analysis of simulation parameters reveals five network and simulation properties that explained the performance of methods. The evaluation framework and inference methods used in this study are available in the dcanr R/Bioconductor package. CONCLUSIONS Our analysis of networks inferred from simulated data show that hub nodes are more likely to be differentially regulated targets than transcription factors. Based on this observation, we propose an interpretation of the inferred differential network that can reconstruct a putative causal network.
Collapse
Affiliation(s)
- Dharmesh D Bhuva
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia.,School of Mathematics and Statistics, Faculty of Science, University of Melbourne, Melbourne, VIC, 3010, Australia
| | - Joseph Cursons
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia.,Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne, VIC, 3010, Australia
| | - Gordon K Smyth
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia.,School of Mathematics and Statistics, Faculty of Science, University of Melbourne, Melbourne, VIC, 3010, Australia
| | - Melissa J Davis
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia. .,Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne, VIC, 3010, Australia. .,Department of Clinical Pathology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne, VIC, 3010, Australia.
| |
Collapse
|
15
|
Civita P, Franceschi S, Aretini P, Ortenzi V, Menicagli M, Lessi F, Pasqualetti F, Naccarato AG, Mazzanti CM. Laser Capture Microdissection and RNA-Seq Analysis: High Sensitivity Approaches to Explain Histopathological Heterogeneity in Human Glioblastoma FFPE Archived Tissues. Front Oncol 2019; 9:482. [PMID: 31231613 PMCID: PMC6568189 DOI: 10.3389/fonc.2019.00482] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 05/21/2019] [Indexed: 12/21/2022] Open
Abstract
Laser capture microdissection (LCM) coupled with RNA-seq is a powerful tool to identify genes that are differentially expressed in specific histological tumor subtypes. To better understand the role of single tumor cell populations in the complex heterogeneity of glioblastoma, we paired microdissection and NGS technology to study intra-tumoral differences into specific histological regions and cells of human GBM FFPE tumors. We here isolated astrocytes, neurons and endothelial cells in 6 different histological contexts: tumor core astrocytes, pseudopalisading astrocytes, perineuronal astrocytes in satellitosis, neurons with satellitosis, tumor blood vessels, and normal blood vessels. A customized protocol was developed for RNA amplification, library construction, and whole transcriptome analysis of each single portion. We first validated our protocol comparing the obtained RNA expression pattern with the gene expression levels of RNA-seq raw data experiments from the BioProject NCBI database, using Spearman's correlation coefficients calculation. We found a good concordance for pseudopalisading and tumor core astrocytes compartments (0.5 Spearman correlation) and a high concordance for perineuronal astrocytes, neurons, normal, and tumor endothelial cells compartments (0.7 Spearman correlation). Then, Principal Component Analysis and differential expression analysis were employed to find differences between tumor compartments and control tissue and between same cell types into distinct tumor contexts. Data consistent with the literature emerged, in which multiple therapeutic targets significant for glioblastoma (such as Integrins, Extracellular Matrix, transmembrane transport, and metabolic processes) play a fundamental role in the disease progression. Moreover, specific cellular processes have been associated with certain cellular subtypes within the tumor. Our results are promising and suggest a compelling method for studying glioblastoma heterogeneity in FFPE samples and its application in both prospective and retrospective studies.
Collapse
Affiliation(s)
| | | | | | - Valerio Ortenzi
- Department of Translational Research and New Technologies in Medicine and Surgery, Pisa University Hospital, Pisa, Italy
| | | | | | | | - Antonio Giuseppe Naccarato
- Department of Translational Research and New Technologies in Medicine and Surgery, Pisa University Hospital, Pisa, Italy
| | | |
Collapse
|
16
|
Yu D, Zhang Z, Glass K, Su J, DeMeo DL, Tantisira K, Weiss ST, Qiu W. New Statistical Methods for Constructing Robust Differential Correlation Networks to characterize the interactions among microRNAs. Sci Rep 2019; 9:3499. [PMID: 30837613 PMCID: PMC6401044 DOI: 10.1038/s41598-019-40167-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Accepted: 02/11/2019] [Indexed: 02/06/2023] Open
Abstract
The interplay among microRNAs (miRNAs) plays an important role in the developments of complex human diseases. Co-expression networks can characterize the interactions among miRNAs. Differential correlation network is a powerful tool to investigate the differences of co-expression networks between cases and controls. To construct a differential correlation network, the Fisher's Z-transformation test is usually used. However, the Fisher's Z-transformation test requires the normality assumption, the violation of which would result in inflated Type I error rate. Several bootstrapping-based improvements for Fisher's Z test have been proposed. However, these methods are too computationally intensive to be used to construct differential correlation networks for high-throughput genomic data. In this article, we proposed six novel robust equal-correlation tests that are computationally efficient. The systematic simulation studies and a real microRNA data analysis showed that one of the six proposed tests (ST5) overall performed better than other methods.
Collapse
Affiliation(s)
- Danyang Yu
- Department of Information and Computing Science, College of Mathematics and Econometrics, Hunan University, Hunan, China
| | - Zeyu Zhang
- Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, China
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, USA
| | - Jessica Su
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, USA
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, USA
| | - Kelan Tantisira
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, USA
| | - Scott T Weiss
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, USA
| | - Weiliang Qiu
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, USA.
| |
Collapse
|