1
|
Sun G, Yu H, Wang P, Lopez-Guerrero M, Mural RV, Mizero ON, Grzybowski M, Song B, van Dijk K, Schachtman DP, Zhang C, Schnable JC. A role for heritable transcriptomic variation in maize adaptation to temperate environments. Genome Biol 2023; 24:55. [PMID: 36964601 PMCID: PMC10037803 DOI: 10.1186/s13059-023-02891-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 03/06/2023] [Indexed: 03/26/2023] Open
Abstract
Background Transcription bridges genetic information and phenotypes. Here, we evaluated how changes in transcriptional regulation enable maize (Zea mays), a crop originally domesticated in the tropics, to adapt to temperate environments. Result We generated 572 unique RNA-seq datasets from the roots of 340 maize genotypes. Genes involved in core processes such as cell division, chromosome organization and cytoskeleton organization showed lower heritability of gene expression, while genes involved in anti-oxidation activity exhibited higher expression heritability. An expression genome-wide association study (eGWAS) identified 19,602 expression quantitative trait loci (eQTLs) associated with the expression of 11,444 genes. A GWAS for alternative splicing identified 49,897 splicing QTLs (sQTLs) for 7614 genes. Genes harboring both cis-eQTLs and cis-sQTLs in linkage disequilibrium were disproportionately likely to encode transcription factors or were annotated as responding to one or more stresses. Independent component analysis of gene expression data identified loci regulating co-expression modules involved in oxidation reduction, response to water deprivation, plastid biogenesis, protein biogenesis, and plant-pathogen interaction. Several genes involved in cell proliferation, flower development, DNA replication, and gene silencing showed lower gene expression variation explained by genetic factors between temperate and tropical maize lines. A GWAS of 27 previously published phenotypes identified several candidate genes overlapping with genomic intervals showing signatures of selection during adaptation to temperate environments. Conclusion Our results illustrate how maize transcriptional regulatory networks enable changes in transcriptional regulation to adapt to temperate regions. Supplementary information The online version contains supplementary material available at 10.1186/s13059-023-02891-3.
Collapse
Affiliation(s)
- Guangchao Sun
- grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
| | - Huihui Yu
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, USA
| | - Peng Wang
- grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
| | - Martha Lopez-Guerrero
- grid.24434.350000 0004 1937 0060Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, USA
| | - Ravi V. Mural
- grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
| | - Olivier N. Mizero
- grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
| | - Marcin Grzybowski
- grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
| | - Baoxing Song
- grid.5386.8000000041936877XInstitute for Genomic Diversity, Cornell University, Ithaca, USA
| | - Karin van Dijk
- grid.24434.350000 0004 1937 0060Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, USA
| | - Daniel P. Schachtman
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
| | - Chi Zhang
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, USA
| | - James C. Schnable
- grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA
- grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
| |
Collapse
|
2
|
Lüleci HB, Yılmaz A. Robust and rigorous identification of tissue-specific genes by statistically extending tau score. BioData Min 2022; 15:31. [PMID: 36494766 PMCID: PMC9733102 DOI: 10.1186/s13040-022-00315-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 11/11/2022] [Indexed: 12/13/2022] Open
Abstract
OBJECTIVES In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm. INTRODUCTION Tissue-specific genes are a class of genes whose functions and expressions are preferred in one or several tissues restrictedly. Identification of tissue-specific genes is essential for discovering multi-cellular biological processes such as tissue-specific molecular regulations, tissue development, physiology, and the pathogenesis of tissue-associated diseases. MATERIALS AND METHODS Gene expression data derived from five large RNA sequencing (RNA-seq) projects, spanning 96 different human tissues, were retrieved from ArrayExpress and ExpressionAtlas. The first step is categorizing genes using significant filters and tau score as a specificity index. After calculating tau for each gene in all datasets separately, statistical distance from the maximum expression level was estimated using a new meaningful procedure. Specific expression of a gene in one or several tissues was calculated after the integration of tau and statistical distance estimation, which is called as extended tau approach. Obtained tissue-specific genes for 96 different human tissues were functionally annotated, and some comparisons were carried out to show the effectiveness of the extended tau method. RESULTS AND DISCUSSION Categorization of genes based on expression level and identification of tissue-specific genes for a large number of tissues/organs were executed. Genes were successfully assigned to multiple tissues by generating the extended tau approach as opposed to the original tau score, which can assign tissue specificity to single tissue only.
Collapse
Affiliation(s)
- Hatice Büşra Lüleci
- grid.448834.70000 0004 0595 7127Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey
| | - Alper Yılmaz
- grid.38575.3c0000 0001 2337 3561Department of Bioengineering, Yildiz Technical University, Istanbul, Turkey
| |
Collapse
|
3
|
Heskett MB, Vouzas AE, Smith LG, Yates PA, Boniface C, Bouhassira EE, Spellman PT, Gilbert DM, Thayer MJ. Epigenetic control of chromosome-associated lncRNA genes essential for replication and stability. Nat Commun 2022; 13:6301. [PMID: 36273230 PMCID: PMC9588035 DOI: 10.1038/s41467-022-34099-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 10/13/2022] [Indexed: 01/18/2023] Open
Abstract
ASARs are long noncoding RNA genes that control replication timing of entire human chromosomes in cis. The three known ASAR genes are located on human chromosomes 6 and 15, and are essential for chromosome integrity. To identify ASARs on all human chromosomes we utilize a set of distinctive ASAR characteristics that allow for the identification of hundreds of autosomal loci with epigenetically controlled, allele-restricted behavior in expression and replication timing of coding and noncoding genes, and is distinct from genomic imprinting. Disruption of noncoding RNA genes at five of five tested loci result in chromosome-wide delayed replication and chromosomal instability, validating their ASAR activity. In addition to the three known essential cis-acting chromosomal loci, origins, centromeres, and telomeres, we propose that all mammalian chromosomes also contain "Inactivation/Stability Centers" that display allele-restricted epigenetic regulation of protein coding and noncoding ASAR genes that are essential for replication and stability of each chromosome.
Collapse
Affiliation(s)
- Michael B Heskett
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, 94305, USA
- Department of Molecular and Medical Genetics Oregon Health & Science University, Portland, OR, 97239, USA
| | - Athanasios E Vouzas
- Department of Biological Science, Florida State University, Tallahassee, FL, 32306, USA
| | - Leslie G Smith
- Department of Chemical Physiology and Biochemistry Oregon Health & Science University, Portland, OR, 97239, USA
| | - Phillip A Yates
- Department of Chemical Physiology and Biochemistry Oregon Health & Science University, Portland, OR, 97239, USA
| | - Christopher Boniface
- Cancer Early Detection Advanced Research Center, Knight Cancer Institute Oregon Health & Science University, Portland, OR, 97239, USA
| | - Eric E Bouhassira
- Department of Cell Biology and Department of Medicine, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
| | - Paul T Spellman
- Department of Molecular and Medical Genetics Oregon Health & Science University, Portland, OR, 97239, USA
- Cancer Early Detection Advanced Research Center, Knight Cancer Institute Oregon Health & Science University, Portland, OR, 97239, USA
| | - David M Gilbert
- San Diego Biomedical Research Institute, San Diego, CA, 92121, USA
| | - Mathew J Thayer
- Department of Chemical Physiology and Biochemistry Oregon Health & Science University, Portland, OR, 97239, USA.
| |
Collapse
|
4
|
Munro D, Wang T, Chitre AS, Polesskaya O, Ehsan N, Gao J, Gusev A, Woods LS, Saba L, Chen H, Palmer A, Mohammadi P. The regulatory landscape of multiple brain regions in outbred heterogeneous stock rats. Nucleic Acids Res 2022; 50:10882-10895. [PMID: 36263809 PMCID: PMC9638908 DOI: 10.1093/nar/gkac912] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 08/17/2022] [Accepted: 10/05/2022] [Indexed: 11/14/2022] Open
Abstract
Heterogeneous Stock (HS) rats are a genetically diverse outbred rat population that is widely used for studying genetics of behavioral and physiological traits. Mapping Quantitative Trait Loci (QTL) associated with transcriptional changes would help to identify mechanisms underlying these traits. We generated genotype and transcriptome data for five brain regions from 88 HS rats. We identified 21 392 cis-QTLs associated with expression and splicing changes across all five brain regions and validated their effects using allele specific expression data. We identified 80 cases where eQTLs were colocalized with genome-wide association study (GWAS) results from nine physiological traits. Comparing our dataset to human data from the Genotype-Tissue Expression (GTEx) project, we found that the HS rat data yields twice as many significant eQTLs as a similarly sized human dataset. We also identified a modest but highly significant correlation between genetic regulatory variation among orthologous genes. Surprisingly, we found less genetic variation in gene regulation in HS rats relative to humans, though we still found eQTLs for the orthologs of many human genes for which eQTLs had not been found. These data are available from the RatGTEx data portal (RatGTEx.org) and will enable new discoveries of the genetic influences of complex traits.
Collapse
Affiliation(s)
- Daniel Munro
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA,Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, CA, USA
| | - Tengfei Wang
- Department of Pharmacology, Addiction Science and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Apurva S Chitre
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | - Oksana Polesskaya
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | - Nava Ehsan
- Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, CA, USA
| | - Jianjun Gao
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | - Alexander Gusev
- Division of Population Sciences, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA
| | - Leah C Solberg Woods
- Section of Molecular Medicine, Department of Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Laura M Saba
- Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Hao Chen
- Department of Pharmacology, Addiction Science and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Abraham A Palmer
- Correspondence may also be addressed to Abraham A. Palmer. Tel: +1 858 534 2093;
| | - Pejman Mohammadi
- To whom correspondence should be addressed. Tel: +1 858 784 8746;
| |
Collapse
|
5
|
Gewirtz AD, Townes FW, Engelhardt BE. Telescoping bimodal latent Dirichlet allocation to identify expression QTLs across tissues. Life Sci Alliance 2022; 5:e202101297. [PMID: 35977827 PMCID: PMC9387650 DOI: 10.26508/lsa.202101297] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 07/15/2022] [Accepted: 07/18/2022] [Indexed: 11/24/2022] Open
Abstract
Expression quantitative trait loci (eQTLs), or single-nucleotide polymorphisms that affect average gene expression levels, provide important insights into context-specific gene regulation. Classic eQTL analyses use one-to-one association tests, which test gene-variant pairs individually and ignore correlations induced by gene regulatory networks and linkage disequilibrium. Probabilistic topic models, such as latent Dirichlet allocation, estimate latent topics for a collection of count observations. Prior multimodal frameworks that bridge genotype and expression data assume matched sample numbers between modalities. However, many data sets have a nested structure where one individual has several associated gene expression samples and a single germline genotype vector. Here, we build a telescoping bimodal latent Dirichlet allocation (TBLDA) framework to learn shared topics across gene expression and genotype data that allows multiple RNA sequencing samples to correspond to a single individual's genotype. By using raw count data, our model avoids possible adulteration via normalization procedures. Ancestral structure is captured in a genotype-specific latent space, effectively removing it from shared components. Using GTEx v8 expression data across 10 tissues and genotype data, we show that the estimated topics capture meaningful and robust biological signal in both modalities and identify associations within and across tissue types. We identify 4,645 cis-eQTLs and 995 trans-eQTLs by conducting eQTL mapping between the most informative features in each topic. Our TBLDA model is able to identify associations using raw sequencing count data when the samples in two separate data modalities are matched one-to-many, as is often the case in biological data. Our code is freely available at https://github.com/gewirtz/TBLDA.
Collapse
Affiliation(s)
- Ariel Dh Gewirtz
- Lewis-Sigler Institute of Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - F William Townes
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Barbara E Engelhardt
- Department of Computer Science, Princeton University, Princeton, NJ, USA
- Gladstone Institutes, San Francisco, CA, USA
| |
Collapse
|
6
|
Identifying causal genes for depression via integration of the proteome and transcriptome from brain and blood. Mol Psychiatry 2022; 27:2849-2857. [PMID: 35296807 DOI: 10.1038/s41380-022-01507-9] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2021] [Revised: 02/17/2022] [Accepted: 02/22/2022] [Indexed: 12/15/2022]
Abstract
Genome-wide association studies (GWASs) have identified numerous risk genes for depression. Nevertheless, genes crucial for understanding the molecular mechanisms of depression and effective antidepressant drug targets are largely unknown. Addressing this, we aimed to highlight potentially causal genes by systematically integrating the brain and blood protein and expression quantitative trait loci (QTL) data with a depression GWAS dataset via a statistical framework including Mendelian randomization (MR), Bayesian colocalization, and Steiger filtering analysis. In summary, we identified three candidate genes (TMEM106B, RAB27B, and GMPPB) based on brain data and two genes (TMEM106B and NEGR1) based on blood data with consistent robust evidence at both the protein and transcriptional levels. Furthermore, the protein-protein interaction (PPI) network provided new insights into the interaction between brain and blood in depression. Collectively, four genes (TMEM106B, RAB27B, GMPPB, and NEGR1) affect depression by influencing protein and gene expression level, which could guide future researches on candidate genes investigations in animal studies as well as prioritize antidepressant drug targets.
Collapse
|
7
|
Smith LB, Anderson CV, Withangage MHH, Koch A, Roberts TJ, Liebl AL. Relationship between gene expression networks and muscle contractile physiology differences in Anolis lizards. J Comp Physiol B 2022; 192:489-499. [PMID: 35596083 DOI: 10.1007/s00360-022-01441-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Revised: 04/11/2022] [Accepted: 04/26/2022] [Indexed: 12/12/2022]
Abstract
Muscles facilitate most animal behavior, from eating to fleeing. However, to generate the variation in behavior necessary for survival, different muscles must perform differently; for instance, sprinting requires multiple rapid muscle contractions, whereas biting may require fewer contractions but greater force. Here, we use a transcriptomic approach to identify genes associated with variation in muscle contractile physiology among different muscles from the same individual. We measured differential gene expression between a leg and jaw muscle of Anolis lizards known to differ in muscle contractile physiology and performance. For each individual, one muscle was used to measure muscle contractile physiology, including contractile velocity (Vmax and V40), specific tension, power ratio, and twitch time, whereas the contralateral muscle was used to extract RNA for transcriptomic sequencing. Using the transcriptomic data, we found clear clustering of muscle type. Expression of genes clustered in gene ontology (GO) terms related to muscle contraction and extracellular matrix was, on average, negatively correlated with Vmax and slower twitch times but positively correlated to power ratio and V40. Conversely, genes related to the GO terms related to aerobic respiration were downregulated in muscles with higher power ratio and V40, and over-expressed as twitch time decreased. Determining the molecular mechanisms that underlie variation in muscle contractile physiology can begin to explain how organisms are able to optimize behavior under variable conditions. Future studies pursuing the effects of differential gene expression across muscle types in different environments might inform researchers about how differences develop across species, populations, and individuals varying in ecological history.
Collapse
Affiliation(s)
- Luke B Smith
- Department of Biology, University of South Dakota, Vermillion, SD, USA.,Sanford School of Medicine, University of South Dakota, Vermillion, SD, USA
| | | | - Miyuraj H Hikkaduwa Withangage
- Department of Biology, University of South Dakota, Vermillion, SD, USA.,College of Dentistry and Dental Clinics, University of Iowa, Iowa City, IA, USA
| | - Andrew Koch
- Department of Biology, University of South Dakota, Vermillion, SD, USA
| | - Thomas J Roberts
- Department of Ecology and Evolutionary Biology, Brown University, Providence, RI, USA
| | - Andrea L Liebl
- Department of Biology, University of South Dakota, Vermillion, SD, USA.
| |
Collapse
|
8
|
Senko AN, Overall RW, Silhavy J, Mlejnek P, Malínská H, Hüttl M, Marková I, Fabel KS, Lu L, Stuchlik A, Williams RW, Pravenec M, Kempermann G. Systems genetics in the rat HXB/BXH family identifies Tti2 as a pleiotropic quantitative trait gene for adult hippocampal neurogenesis and serum glucose. PLoS Genet 2022; 18:e1009638. [PMID: 35377872 PMCID: PMC9060359 DOI: 10.1371/journal.pgen.1009638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 05/02/2022] [Accepted: 03/07/2022] [Indexed: 11/19/2022] Open
Abstract
Neurogenesis in the adult hippocampus contributes to learning and memory in the healthy brain but is dysregulated in metabolic and neurodegenerative diseases. The molecular relationships between neural stem cell activity, adult neurogenesis, and global metabolism are largely unknown. Here we applied unbiased systems genetics methods to quantify genetic covariation among adult neurogenesis and metabolic phenotypes in peripheral tissues of a genetically diverse family of rat strains, derived from a cross between the spontaneously hypertensive (SHR/OlaIpcv) strain and Brown Norway (BN-Lx/Cub). The HXB/BXH family is a very well established model to dissect genetic variants that modulate metabolic and cardiovascular diseases and we have accumulated deep phenome and transcriptome data in a FAIR-compliant resource for systematic and integrative analyses. Here we measured rates of precursor cell proliferation, survival of new neurons, and gene expression in the hippocampus of the entire HXB/BXH family, including both parents. These data were combined with published metabolic phenotypes to detect a neurometabolic quantitative trait locus (QTL) for serum glucose and neuronal survival on Chromosome 16: 62.1-66.3 Mb. We subsequently fine-mapped the key phenotype to a locus that includes the Telo2-interacting protein 2 gene (Tti2)-a chaperone that modulates the activity and stability of PIKK kinases. To verify the hypothesis that differences in neurogenesis and glucose levels are caused by a polymorphism in Tti2, we generated a targeted frameshift mutation on the SHR/OlaIpcv background. Heterozygous SHR-Tti2+/- mutants had lower rates of hippocampal neurogenesis and hallmarks of dysglycemia compared to wild-type littermates. Our findings highlight Tti2 as a causal genetic link between glucose metabolism and structural brain plasticity. In humans, more than 800 genomic variants are linked to TTI2 expression, seven of which have associations to protein and blood stem cell factor concentrations, blood pressure and frontotemporal dementia.
Collapse
Affiliation(s)
- Anna N. Senko
- German Center for Neurodegenerative Diseases (DZNE) Dresden, Germany
- CRTD–Center for Regenerative Therapies Dresden, Technische Universität Dresden, Germany
| | - Rupert W. Overall
- German Center for Neurodegenerative Diseases (DZNE) Dresden, Germany
- CRTD–Center for Regenerative Therapies Dresden, Technische Universität Dresden, Germany
| | - Jan Silhavy
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Petr Mlejnek
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Hana Malínská
- Institute for Clinical and Experimental Medicine, Prague, Czech Republic
| | - Martina Hüttl
- Institute for Clinical and Experimental Medicine, Prague, Czech Republic
| | - Irena Marková
- Institute for Clinical and Experimental Medicine, Prague, Czech Republic
| | - Klaus S. Fabel
- German Center for Neurodegenerative Diseases (DZNE) Dresden, Germany
- CRTD–Center for Regenerative Therapies Dresden, Technische Universität Dresden, Germany
| | - Lu Lu
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Ales Stuchlik
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Robert W. Williams
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Michal Pravenec
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Gerd Kempermann
- German Center for Neurodegenerative Diseases (DZNE) Dresden, Germany
- CRTD–Center for Regenerative Therapies Dresden, Technische Universität Dresden, Germany
| |
Collapse
|
9
|
Dahale S, Ruiz-Orera J, Silhavy J, Hübner N, van Heesch S, Pravenec M, Atanur SS. Cap analysis of gene expression reveals alternative promoter usage in a rat model of hypertension. Life Sci Alliance 2022; 5:5/4/e202101234. [PMID: 34996843 PMCID: PMC8742872 DOI: 10.26508/lsa.202101234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 12/26/2021] [Accepted: 12/28/2021] [Indexed: 11/24/2022] Open
Abstract
The role of alternative promoter usage in tissue-specific gene expression has been well established; however, its role in complex diseases is poorly understood. We performed cap analysis of gene expression (CAGE) sequencing from the left ventricle of a rat model of hypertension, the spontaneously hypertensive rat (SHR), and a normotensive strain, Brown Norway to understand the role of alternative promoter usage in complex disease. We identified 26,560 CAGE-defined transcription start sites in the rat left ventricle, including 1,970 novel cardiac transcription start sites. We identified 28 genes with alternative promoter usage between SHR and Brown Norway, which could lead to protein isoforms differing at the amino terminus between two strains and 475 promoter switching events altering the length of the 5' UTR. We found that the shift in Insr promoter usage was significantly associated with insulin levels and blood pressure within a panel of HXB/BXH recombinant inbred rat strains, suggesting that hyperinsulinemia due to insulin resistance might lead to hypertension in SHR. Our study provides a preliminary evidence of alternative promoter usage in complex diseases.
Collapse
Affiliation(s)
- Sonal Dahale
- Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, UK.,Department of Microbial Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, UK
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - Jan Silhavy
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Norbert Hübner
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany.,Charité -Universitätsmedizin, Berlin, Germany.,DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, Berlin, Germany
| | | | - Michal Pravenec
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Santosh S Atanur
- Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, UK .,The National Institute for Health Research, Imperial Biomedical Research Centre, ITMAT Data Science Group, Imperial College London, London, UK
| |
Collapse
|
10
|
Escoto-Sandoval C, Ochoa-Alejo N, Martínez O. Inheritance of gene expression throughout fruit development in chili pepper. Sci Rep 2021; 11:22647. [PMID: 34811443 PMCID: PMC8609037 DOI: 10.1038/s41598-021-02151-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 11/10/2021] [Indexed: 12/13/2022] Open
Abstract
Gene expression is the primary molecular phenotype and can be estimated in specific organs or tissues at particular times. Here we analyzed genome-wide inheritance of gene expression in fruits of chili pepper (Capsicum annuum L.) in reciprocal crosses between a domesticated and a wild accession, estimating this parameter during fruit development. We defined a general hierarchical schema to classify gene expression inheritance which can be employed for any quantitative trait. We found that inheritance of gene expression is affected by both, the time of fruit development as well as the direction of the cross, and propose that such variations could be common in many developmental processes. We conclude that classification of inheritance patterns is important to have a better understanding of the mechanisms underlying gene expression regulation, and demonstrate that sets of genes with specific inheritance pattern at particular times of fruit development are enriched in different biological processes, molecular functions and cell components. All curated data and functions for analysis and visualization are publicly available as an R package.
Collapse
Affiliation(s)
- Christian Escoto-Sandoval
- Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Unidad de Genómica Avanzada (Langebio), Irapuato Guanajuato, 36824, México
| | - Neftalí Ochoa-Alejo
- Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Departamento de Ingeniería Genética, Unidad Irapuato, Irapuato Guanajuato, 36824, México
| | - Octavio Martínez
- Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Unidad de Genómica Avanzada (Langebio), Irapuato Guanajuato, 36824, México.
| |
Collapse
|
11
|
Witte F, Ruiz-Orera J, Mattioli CC, Blachut S, Adami E, Schulz JF, Schneider-Lunitz V, Hummel O, Patone G, Mücke MB, Šilhavý J, Heinig M, Bottolo L, Sanchis D, Vingron M, Chekulaeva M, Pravenec M, Hubner N, van Heesch S. A trans locus causes a ribosomopathy in hypertrophic hearts that affects mRNA translation in a protein length-dependent fashion. Genome Biol 2021; 22:191. [PMID: 34183069 PMCID: PMC8240307 DOI: 10.1186/s13059-021-02397-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 06/02/2021] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Little is known about the impact of trans-acting genetic variation on the rates with which proteins are synthesized by ribosomes. Here, we investigate the influence of such distant genetic loci on the efficiency of mRNA translation and define their contribution to the development of complex disease phenotypes within a panel of rat recombinant inbred lines. RESULTS We identify several tissue-specific master regulatory hotspots that each control the translation rates of multiple proteins. One of these loci is restricted to hypertrophic hearts, where it drives a translatome-wide and protein length-dependent change in translational efficiency, altering the stoichiometric translation rates of sarcomere proteins. Mechanistic dissection of this locus across multiple congenic lines points to a translation machinery defect, characterized by marked differences in polysome profiles and misregulation of the small nucleolar RNA SNORA48. Strikingly, from yeast to humans, we observe reproducible protein length-dependent shifts in translational efficiency as a conserved hallmark of translation machinery mutants, including those that cause ribosomopathies. Depending on the factor mutated, a pre-existing negative correlation between protein length and translation rates could either be enhanced or reduced, which we propose to result from mRNA-specific imbalances in canonical translation initiation and reinitiation rates. CONCLUSIONS We show that distant genetic control of mRNA translation is abundant in mammalian tissues, exemplified by a single genomic locus that triggers a translation-driven molecular mechanism. Our work illustrates the complexity through which genetic variation can drive phenotypic variability between individuals and thereby contribute to complex disease.
Collapse
Affiliation(s)
- Franziska Witte
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
- Present Address: NUVISAN ICB GmbH, Lead Discovery-Structrual Biology, 13353, Berlin, Germany
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
| | - Camilla Ciolli Mattioli
- Berlin Institute for Medical Systems Biology (BIMSB), Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 10115, Berlin, Germany
- Present Address: Department of Biological Regulation, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Susanne Blachut
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
| | - Eleonora Adami
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
- Present Address: Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - Jana Felicitas Schulz
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
| | - Valentin Schneider-Lunitz
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
| | - Oliver Hummel
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
| | - Giannino Patone
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
| | - Michael Benedikt Mücke
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347, Berlin, Germany
- Charité-Universitätsmedizin, 10117, Berlin, Germany
| | - Jan Šilhavý
- Institute of Physiology of the Czech Academy of Sciences, 4, 142 20, Praha, Czech Republic
| | - Matthias Heinig
- Institute of Computational Biology (ICB), HMGU, Ingolstaedter Landstr. 1, 85764 Neuherberg, Munich, Germany
- Department of Informatics, Technische Universitaet Muenchen (TUM), Boltzmannstr. 3, 85748 Garching, Munich, Germany
| | - Leonardo Bottolo
- Department of Medical Genetics, University of Cambridge, Cambridge, CB2 0QQ, UK
- The Alan Turing Institute, London, NW1 2DB, UK
- MRC Biostatistics Unit, University of Cambridge, Cambridge, CB2 0SR, UK
| | - Daniel Sanchis
- Institut de Recerca Biomedica de Lleida (IRBLLEIDA), Universitat de Lleida, Edifici Biomedicina-I. Av. Rovira Roure, 80, 25198, Lleida, Spain
| | - Martin Vingron
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany
| | - Marina Chekulaeva
- Berlin Institute for Medical Systems Biology (BIMSB), Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 10115, Berlin, Germany
| | - Michal Pravenec
- Institute of Physiology of the Czech Academy of Sciences, 4, 142 20, Praha, Czech Republic
| | - Norbert Hubner
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany.
- DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347, Berlin, Germany.
- Charité-Universitätsmedizin, 10117, Berlin, Germany.
| | - Sebastiaan van Heesch
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125, Berlin, Germany.
- Present Address: The Princess Máxima Center for Pediatric Oncology, Utrecht, the Netherlands.
| |
Collapse
|
12
|
Variable expression quantitative trait loci analysis of breast cancer risk variants. Sci Rep 2021; 11:7192. [PMID: 33785833 PMCID: PMC8009949 DOI: 10.1038/s41598-021-86690-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 03/12/2021] [Indexed: 01/08/2023] Open
Abstract
Genome wide association studies (GWAS) have identified more than 180 variants associated with breast cancer risk, however the underlying functional mechanisms and biological pathways which confer disease susceptibility remain largely unknown. As gene expression traits are under genetic regulation we hypothesise that differences in gene expression variability may identify causal breast cancer susceptibility genes. We performed variable expression quantitative trait loci (veQTL) analysis using tissue-specific expression data from the Genotype-Tissue Expression (GTEx) Common Fund Project. veQTL analysis identified 70 associations (p < 5 × 10–8) consisting of 60 genes and 27 breast cancer risk variants, including 55 veQTL that were observed in breast tissue only. Pathway analysis of genes associated with breast-specific veQTL revealed an enrichment of four genes (CYP11B1, CYP17A1 HSD3B2 and STAR) involved in the C21-steroidal biosynthesis pathway that converts cholesterol to breast-related hormones (e.g. oestrogen). Each of these four genes were significantly more variable in individuals homozygous for rs11075995 (A/A) breast cancer risk allele located in the FTO gene, which encodes an RNA demethylase. The A/A allele was also found associated with reduced expression of FTO, suggesting an epi-transcriptomic mechanism may underlie the dysregulation of genes involved in hormonal biosynthesis leading to an increased risk of breast cancer. These findings provide evidence that genetic variants govern high levels of expression variance in breast tissue, thus building a more comprehensive insight into the underlying biology of breast cancer risk loci.
Collapse
|
13
|
Heskett MB, Smith LG, Spellman P, Thayer MJ. Reciprocal monoallelic expression of ASAR lncRNA genes controls replication timing of human chromosome 6. RNA (NEW YORK, N.Y.) 2020; 26:724-738. [PMID: 32144193 PMCID: PMC7266157 DOI: 10.1261/rna.073114.119] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 02/22/2020] [Indexed: 06/10/2023]
Abstract
DNA replication occurs on mammalian chromosomes in a cell-type distinctive temporal order known as the replication timing program. We previously found that disruption of the noncanonical lncRNA genes ASAR6 and ASAR15 results in delayed replication timing and delayed mitotic chromosome condensation of human chromosomes 6 and 15, respectively. ASAR6 and ASAR15 display random monoallelic expression and display asynchronous replication between alleles that is coordinated with other random monoallelic genes on their respective chromosomes. Disruption of the expressed allele, but not the silent allele, of ASAR6 leads to delayed replication, activation of the previously silent alleles of linked monoallelic genes, and structural instability of human chromosome 6. In this report, we describe a second lncRNA gene (ASAR6-141) on human chromosome 6 that when disrupted results in delayed replication timing in cisASAR6-141 is subject to random monoallelic expression and asynchronous replication and is expressed from the opposite chromosome 6 homolog as ASAR6 ASAR6-141 RNA, like ASAR6 and ASAR15 RNAs, contains a high L1 content and remains associated with the chromosome territory where it is transcribed. Three classes of cis-acting elements control proper chromosome function in mammals: origins of replication, centromeres, and telomeres, which are responsible for replication, segregation, and stability of all chromosomes. Our work supports a fourth type of essential chromosomal element, the "Inactivation/Stability Center," which expresses ASAR lncRNAs responsible for proper replication timing, monoallelic expression, and structural stability of each chromosome.
Collapse
Affiliation(s)
- Michael B Heskett
- Department of Molecular and Medical Genetics, Oregon Health & Science University, Portland, Oregon 97239, USA
| | - Leslie G Smith
- Department of Chemical Physiology and Biochemistry, Oregon Health & Science University, Portland, Oregon 97239, USA
| | - Paul Spellman
- Department of Molecular and Medical Genetics, Oregon Health & Science University, Portland, Oregon 97239, USA
| | - Mathew J Thayer
- Department of Chemical Physiology and Biochemistry, Oregon Health & Science University, Portland, Oregon 97239, USA
| |
Collapse
|
14
|
Morgan MD, Patin E, Jagla B, Hasan M, Quintana-Murci L, Marioni JC. Quantitative genetic analysis deciphers the impact of cis and trans regulation on cell-to-cell variability in protein expression levels. PLoS Genet 2020; 16:e1008686. [PMID: 32168362 PMCID: PMC7094872 DOI: 10.1371/journal.pgen.1008686] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 03/25/2020] [Accepted: 02/19/2020] [Indexed: 11/19/2022] Open
Abstract
Identifying the factors that shape protein expression variability in complex multi-cellular organisms has primarily focused on promoter architecture and regulation of single-cell expression in cis. However, this targeted approach has to date been unable to identify major regulators of cell-to-cell gene expression variability in humans. To address this, we have combined single-cell protein expression measurements in the human immune system using flow cytometry with a quantitative genetics analysis. For the majority of proteins whose variability in expression has a heritable component, we find that genetic variants act in trans, with notably fewer variants acting in cis. Furthermore, we highlight using Mendelian Randomization that these variability-Quantitative Trait Loci might be driven by the cis regulation of upstream genes. This indicates that natural selection may balance the impact of gene regulation in cis with downstream impacts on expression variability in trans.
Collapse
Affiliation(s)
- Michael D. Morgan
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
- Cancer Research UK–Cambridge Institute, Robinson Way, Cambridge, United Kingdom
| | - Etienne Patin
- Human Evolutionary Genetics Unit, Institut Pasteur, CNRS UMR2000, Paris, France
| | - Bernd Jagla
- Cytometry and Biomarkers UTechS, Institut Pasteur, Paris, France
- Hub Bioinformatique et Biostatisque, Départment de Biologie Computationalle—USR 3756 CNRS, Institut Pasteur, Paris, France
| | - Milena Hasan
- Cytometry and Biomarkers UTechS, Institut Pasteur, Paris, France
| | - Lluís Quintana-Murci
- Human Evolutionary Genetics Unit, Institut Pasteur, CNRS UMR2000, Paris, France
- Human Genomics and Evolution, Collège de France, Paris, France
| | - John C. Marioni
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
- Cancer Research UK–Cambridge Institute, Robinson Way, Cambridge, United Kingdom
- EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
15
|
Otto GW, Kaisaki PJ, Brial F, Le Lay A, Cazier JB, Mott R, Gauguier D. Conserved properties of genetic architecture of renal and fat transcriptomes in rat models of insulin resistance. Dis Model Mech 2019; 12:dmm.038539. [PMID: 31213483 PMCID: PMC6679378 DOI: 10.1242/dmm.038539] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Accepted: 05/20/2019] [Indexed: 12/19/2022] Open
Abstract
To define renal molecular mechanisms that are affected by permanent hyperglycaemia and might promote phenotypes relevant to diabetic nephropathy, we carried out linkage analysis of genome-wide gene transcription in the kidneys of F2 offspring from the Goto-Kakizaki (GK) rat model of type 2 diabetes and normoglycaemic Brown Norway (BN) rats. We mapped 2526 statistically significant expression quantitative trait loci (eQTLs) in the cross. More than 40% of eQTLs mapped in the close vicinity of the linked transcripts, underlying possible cis-regulatory mechanisms of gene expression. We identified eQTL hotspots on chromosomes 5 and 9 regulating the expression of 80-165 genes, sex or cross direction effects, and enriched metabolic and immunological processes by segregating GK alleles. Comparative analysis with adipose tissue eQTLs in the same cross showed that 496 eQTLs, in addition to the top enriched biological pathways, are conserved in the two tissues. Extensive similarities in eQTLs mapped in the GK rat and in the spontaneously hypertensive rat (SHR) suggest a common aetiology of disease phenotypes common to the two strains, including insulin resistance, which is a prominent pathophysiological feature in both GK rats and SHRs. Our data shed light on shared and tissue-specific molecular mechanisms that might underlie aetiological aspects of insulin resistance in the context of spontaneously occurring hyperglycaemia and hypertension. Summary: Kidney and fat expression QTL mapping in rat models of spontaneously occurring insulin resistance associated with either diabetes or hypertension reveals conserved gene expression regulation, suggesting shared aetiology of disease phenotypes.
Collapse
Affiliation(s)
- Georg W Otto
- Genetics and Genomic Medicine, University College London Institute of Child Health, 30 Guilford Street, London WC1N 1EH, United Kingdom
| | - Pamela J Kaisaki
- The Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Headington, Oxford OX3 7BN, United Kingdom
| | - Francois Brial
- University Paris Descartes, INSERM UMR 1124, 45 rue des Saint-Pères, 75006 Paris, France
| | - Aurélie Le Lay
- University Paris Descartes, INSERM UMR 1124, 45 rue des Saint-Pères, 75006 Paris, France
| | - Jean-Baptiste Cazier
- Centre for Computational Biology, Medical School, University of Birmingham, Birmingham B15 2TT, United Kingdom
| | - Richard Mott
- University College London Genetics Institute, Gower Street, London WC1E 6BT, United Kingdom
| | - Dominique Gauguier
- The Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Headington, Oxford OX3 7BN, United Kingdom .,University Paris Descartes, INSERM UMR 1124, 45 rue des Saint-Pères, 75006 Paris, France.,McGill University and Genome Quebec Innovation Centre, 740 Doctor Penfield Avenue, Montreal, QC H3A 0G1, Canada
| |
Collapse
|
16
|
Mulligan MK, Lu L, Cavigelli SA, Mormède P, Terenina E, Zhao W, Williams RW, Jones BC. Impact of Genetic Variation on Stress-Related Ethanol Consumption. Alcohol Clin Exp Res 2019; 43:1391-1402. [PMID: 31034606 DOI: 10.1111/acer.14073] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Accepted: 04/23/2019] [Indexed: 01/03/2023]
Abstract
BACKGROUND The effect of stress on alcohol consumption in humans is highly variable, and the underlying processes are not yet understood. Attempts to model a positive relationship between stress and increased ethanol (EtOH) consumption in animals have been only modestly successful. Our hypothesis is that individual differences in stress effects on EtOH consumption are mediated by genetics. METHODS We measured alcohol consumption, using the drinking-in-the-dark (DID) paradigm in females from 2 inbred mouse strains, C57BL/6J (B6) and DBA/2J (D2), and 35 of their inbred progeny (the BXD family). A control group was maintained in normal housing and a stress group was exposed to chronic mild stress (CMS), consisting of unpredictable stressors over 7 weeks. These included predator, social, and environmental perturbations. Alcohol intake was measured over 16 weeks in both groups during baseline (preceding 5-week period), CMS (intervening 7-week period), and post-CMS (final 4-week period). RESULTS We detected a strong effect of CMS on alcohol intake. A few strains demonstrated CMS-related increased alcohol consumption; however, most showed decreased intake. We identified 1 nearly significant quantitative trait locus on chromosome 5 that contains the neuronal nitric oxide synthase gene (Nos1). The expression of Nos1 is frequently changed following alcohol exposure, and variants in this gene segregating among the BXD population may modulate alcohol intake in response to stress. CONCLUSIONS The results we present here represent the first study to combine chronic stress and alcohol consumption in a genetic reference population of mice. Differences in susceptibility to the effects of stressful environments vis-à-vis alcohol use disorders would suggest that the differences have at least some basis in genetic constitution. We have also nominated a likely candidate gene underlying the large individual differences in effects of stress on alcohol consumption.
Collapse
Affiliation(s)
- Megan K Mulligan
- The University of Tennessee Health Science Center, Memphis, Tennessee
| | - Lu Lu
- The University of Tennessee Health Science Center, Memphis, Tennessee
| | | | - Pierre Mormède
- GenPhySE, Université de Toulouse, INRA, ENVT, Castanet-Tolosan, France
| | - Elena Terenina
- GenPhySE, Université de Toulouse, INRA, ENVT, Castanet-Tolosan, France
| | - Wenyuan Zhao
- The University of Tennessee Health Science Center, Memphis, Tennessee
| | - Robert W Williams
- The University of Tennessee Health Science Center, Memphis, Tennessee
| | - Byron C Jones
- The University of Tennessee Health Science Center, Memphis, Tennessee
| |
Collapse
|
17
|
Liu X, Li YI, Pritchard JK. Trans Effects on Gene Expression Can Drive Omnigenic Inheritance. Cell 2019; 177:1022-1034.e6. [PMID: 31051098 PMCID: PMC6553491 DOI: 10.1016/j.cell.2019.04.014] [Citation(s) in RCA: 286] [Impact Index Per Article: 57.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Revised: 12/18/2018] [Accepted: 04/07/2019] [Indexed: 01/02/2023]
Abstract
Early genome-wide association studies (GWASs) led to the surprising discovery that, for typical complex traits, most of the heritability is due to huge numbers of common variants with tiny effect sizes. Previously, we argued that new models are needed to understand these patterns. Here, we provide a formal model in which genetic contributions to complex traits are partitioned into direct effects from core genes and indirect effects from peripheral genes acting in trans. We propose that most heritability is driven by weak trans-eQTL SNPs, whose effects are mediated through peripheral genes to impact the expression of core genes. In particular, if the core genes for a trait tend to be co-regulated, then the effects of peripheral variation can be amplified such that nearly all of the genetic variance is driven by weak trans effects. Thus, our model proposes a framework for understanding key features of the architecture of complex traits.
Collapse
Affiliation(s)
- Xuanyao Liu
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA.
| | - Yang I Li
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA; Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| | - Jonathan K Pritchard
- Departments of Biology and Genetics and Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
18
|
Abstract
Recent advance in high-throughput proteome analysis has enabled genome-wide and proteome-wide analyses of associations between single nucleotide polymorphisms and protein expression levels. Protein quantitative trait locus (pQTL) studies using cerebrospinal fluid (CSF) and DNA samples may provide valuable insights into the genetic basis and molecular mechanisms regulating protein expression in the central nervous system. In this chapter, we describe a step-by-step procedures of CSF collection and pQTL analysis, by using PLINK and R software.
Collapse
|
19
|
Adriaens ME, Lodder EM, Moreno‐Moral A, Šilhavý J, Heinig M, Glinge C, Belterman C, Wolswinkel R, Petretto E, Pravenec M, Remme CA, Bezzina CR. Systems Genetics Approaches in Rat Identify Novel Genes and Gene Networks Associated With Cardiac Conduction. J Am Heart Assoc 2018; 7:e009243. [PMID: 30608189 PMCID: PMC6404199 DOI: 10.1161/jaha.118.009243] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Accepted: 08/03/2018] [Indexed: 01/20/2023]
Abstract
Background Electrocardiographic ( ECG ) parameters are regarded as intermediate phenotypes of cardiac arrhythmias. Insight into the genetic underpinnings of these parameters is expected to contribute to the understanding of cardiac arrhythmia mechanisms. Here we used HXB / BXH recombinant inbred rat strains to uncover genetic loci and candidate genes modulating ECG parameters. Methods and Results RR interval, PR interval, QRS duration, and QT c interval were measured from ECG s obtained in 6 male rats from each of the 29 available HXB / BXH recombinant inbred strains. Genes at loci displaying significant quantitative trait loci (QTL) effects were prioritized by assessing the presence of protein-altering variants, and by assessment of cis expression QTL ( eQTL ) effects and correlation of transcript abundance to the respective trait in the heart. Cardiac RNA -seq data were additionally used to generate gene co-expression networks. QTL analysis of ECG parameters identified 2 QTL for PR interval, respectively, on chromosomes 10 and 17. At the chromosome 10 QTL , cis- eQTL effects were identified for Acbd4, Cd300lg, Fam171a2, and Arhgap27; the transcript abundance in the heart of these 4 genes was correlated with PR interval. At the chromosome 17 QTL , a cis- eQTL was uncovered for Nhlrc1 candidate gene; the transcript abundance of this gene was also correlated with PR interval. Co-expression analysis furthermore identified 50 gene networks, 6 of which were correlated with PR interval or QRS duration, both parameters of cardiac conduction. Conclusions These newly identified genetic loci and gene networks associated with the ECG parameters of cardiac conduction provide a starting point for future studies with the potential of identifying novel mechanisms underlying cardiac electrical function.
Collapse
Affiliation(s)
- Michiel E. Adriaens
- Department of Experimental CardiologyHeart CentreAcademic Medical Center AmsterdamAmsterdamThe Netherlands
- Maastricht Centre for Systems BiologyMaastricht UniversityMaastrichtThe Netherlands
| | - Elisabeth M. Lodder
- Department of Experimental CardiologyHeart CentreAcademic Medical Center AmsterdamAmsterdamThe Netherlands
| | | | - Jan Šilhavý
- Institute of PhysiologyAcademy of Sciences of the Czech RepublicPragueCzech Republic
| | - Matthias Heinig
- Institute of Computational BiologyHelmholtz Zentrum MünchenMünchenGermany
| | - Charlotte Glinge
- Department of Experimental CardiologyHeart CentreAcademic Medical Center AmsterdamAmsterdamThe Netherlands
| | - Charly Belterman
- Department of Experimental CardiologyHeart CentreAcademic Medical Center AmsterdamAmsterdamThe Netherlands
| | - Rianne Wolswinkel
- Department of Experimental CardiologyHeart CentreAcademic Medical Center AmsterdamAmsterdamThe Netherlands
| | - Enrico Petretto
- The MRC London Institute of Medical SciencesImperial College LondonLondonUnited Kingdom
- Duke‐NUS Medical SchoolSingapore
| | - Michal Pravenec
- Institute of PhysiologyAcademy of Sciences of the Czech RepublicPragueCzech Republic
| | - Carol Ann Remme
- Department of Experimental CardiologyHeart CentreAcademic Medical Center AmsterdamAmsterdamThe Netherlands
| | - Connie R. Bezzina
- Department of Experimental CardiologyHeart CentreAcademic Medical Center AmsterdamAmsterdamThe Netherlands
| |
Collapse
|
20
|
Huang L, Dai L, Xu W, Zhang S, Yan D, Shi X. Identification of expression quantitative trait loci of MTOR associated with the progression of glioma. Oncol Lett 2018; 15:665-671. [PMID: 29387238 DOI: 10.3892/ol.2017.7319] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Accepted: 09/22/2017] [Indexed: 11/05/2022] Open
Abstract
Mechanistic target of rapamycin (MTOR) encodes a key modulator of cell growth, proliferation, and apoptosis. Previous studies have demonstrated that the dysregulation of MTOR is involved in the development and progression of several types of cancer, including glioma. In the present study, a comprehensive analysis was conducted to examine whether the expression quantitative trait loci (eQTLs) of MTOR are associated with the progression of glioma. Candidate eQTLs of MTOR were obtained from the Genotype-Tissue Expression eQTL Browser. The Kaplan-Meier method and multivariate Cox model were used to analyze the progression-free survival time of glioma patients. Based on the analysis of 138 glioma patients, one eQTL of MTOR, rs4845964, was demonstrated to be significantly associated with the progression of glioma in a dominant manner. The adjusted hazard ratios (HRs) for patients with the AG or AA genotype at rs4845964 were 2.82 [95% confidence interval (CI), 1.27-6.27; P=0.0111] and 2.79 (95% CI, 1.10-7.07; P=0.0312), respectively, compared with those with the GG genotype. When the rs4845964 AG and AA genotypes were combined for analysis, the HR was 2.70 (95% CI, 1.25-5.82; P=0.0114) vs. the GG genotype. Stratified analyses revealed similar associations between the rs4845964 genotypes and the progression of glioma in all subgroups (following stratification by age, sex and tumor grade). These results demonstrate for the first time that the MTOR eQTL rs4845964 is associated with the progression of glioma.
Collapse
Affiliation(s)
- Liming Huang
- The First Department of Chemotherapy, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350005, P.R. China
| | - Lian Dai
- Department of Medicine, The Third Affiliated People's Hospital, Fujian University of Traditional Chinese Medicine, Fuzhou, Fujian 350108, P.R. China
| | - Wenshen Xu
- Department of Laboratory Medicine, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350005, P.R. China
| | - Shu Zhang
- The First Department of Chemotherapy, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350005, P.R. China
| | - Danfang Yan
- Department of Radiation Oncology, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, Zhejiang 310003, P.R. China
| | - Xi Shi
- The First Department of Chemotherapy, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian 350005, P.R. China
| |
Collapse
|
21
|
He X, Houde ALS, Neff BD, Heath DD. Transcriptome response of Atlantic salmon ( Salmo salar) to competition with ecologically similar non-native species. Ecol Evol 2018; 8:1769-1777. [PMID: 29435251 PMCID: PMC5792521 DOI: 10.1002/ece3.3798] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Revised: 10/18/2017] [Accepted: 12/06/2017] [Indexed: 01/01/2023] Open
Abstract
Non-native species may be introduced either intentionally or unintentionally, and their impact can range from benign to highly disruptive. Non-native salmonids were introduced into Lake Ontario, Canada, to provide recreational fishing opportunities; however, the establishment of those species has been proposed as a significant barrier to the reintroduction of native Atlantic salmon (Salmo salar) due to intense interspecific competition. In this study, we compared population differences of Atlantic salmon in transcriptome response to interspecific competition. We reared Atlantic salmon from two populations (LaHave River and Sebago Lake) with fish of each of three non-native salmonids (Chinook salmon Oncorhynchus tshawytscha, rainbow trout O. mykiss, and brown trout S. trutta) in artificial streams. We used RNA-seq to assess transcriptome differences between the Atlantic salmon populations and the responses of these populations to the interspecific competition treatments after 10 months of competition in the stream tanks. We found that population differences in gene expression were generally greater than the effects of interspecific competition. Interestingly, we found that the two Atlantic salmon populations exhibited similar responses to interspecific competition based on functional gene ontologies, but the specific genes within those ontologies were different. Our transcriptome analyses suggest that the most stressful competitor (as measured by the highest number of differentially expressed genes) differs between the two study populations. Our transcriptome characterization highlights the importance of source population selection for conservation applications, as organisms with different evolutionary histories can possess different transcriptional responses to the same biotic stressors. The results also indicate that generalized predictions of the response of native species to interactions with introduced species may not be appropriate without incorporating potential population-specific response to introduced species.
Collapse
Affiliation(s)
- Xiaoping He
- Great Lakes Institute for Environmental ResearchUniversity of WindsorWindsorONCanada
- Present address:
Pacific Biological Station, Fisheries and Oceans CanadaNanaimoBCCanada
| | | | - Bryan D. Neff
- Department of BiologyWestern UniversityLondonONCanada
| | - Daniel D. Heath
- Great Lakes Institute for Environmental ResearchUniversity of WindsorWindsorONCanada
| |
Collapse
|
22
|
Lee PH, Lee C, Li X, Wee B, Dwivedi T, Daly M. Principles and methods of in-silico prioritization of non-coding regulatory variants. Hum Genet 2018; 137:15-30. [PMID: 29288389 PMCID: PMC5892192 DOI: 10.1007/s00439-017-1861-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 12/14/2017] [Indexed: 12/13/2022]
Abstract
Over a decade of genome-wide association, studies have made great strides toward the detection of genes and genetic mechanisms underlying complex traits. However, the majority of associated loci reside in non-coding regions that are functionally uncharacterized in general. Now, the availability of large-scale tissue and cell type-specific transcriptome and epigenome data enables us to elucidate how non-coding genetic variants can affect gene expressions and are associated with phenotypic changes. Here, we provide an overview of this emerging field in human genomics, summarizing available data resources and state-of-the-art analytic methods to facilitate in-silico prioritization of non-coding regulatory mutations. We also highlight the limitations of current approaches and discuss the direction of much-needed future research.
Collapse
Affiliation(s)
- Phil H Lee
- Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Simches Research Building, 185 Cambridge St, Boston, MA, 02114, USA.
- Quantitative Genomics Program, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
| | - Christian Lee
- Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Simches Research Building, 185 Cambridge St, Boston, MA, 02114, USA
- Department of Life Sciences, Harvard University, Cambridge, MA, USA
| | - Xihao Li
- Quantitative Genomics Program, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Brian Wee
- Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Simches Research Building, 185 Cambridge St, Boston, MA, 02114, USA
| | - Tushar Dwivedi
- Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Simches Research Building, 185 Cambridge St, Boston, MA, 02114, USA
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
| | - Mark Daly
- Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Simches Research Building, 185 Cambridge St, Boston, MA, 02114, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| |
Collapse
|
23
|
Maroilley T, Lemonnier G, Lecardonnel J, Esquerré D, Ramayo-Caldas Y, Mercat MJ, Rogel-Gaillard C, Estellé J. Deciphering the genetic regulation of peripheral blood transcriptome in pigs through expression genome-wide association study and allele-specific expression analysis. BMC Genomics 2017; 18:967. [PMID: 29237423 PMCID: PMC5729405 DOI: 10.1186/s12864-017-4354-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Accepted: 11/28/2017] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Efforts to improve sustainability in livestock production systems have focused on two objectives: investigating the genetic control of immune function as it pertains to robustness and disease resistance, and finding predictive markers for use in breeding programs. In this context, the peripheral blood transcriptome represents an important source of biological information about an individual's health and immunological status, and has been proposed for use as an intermediate phenotype to measure immune capacity. The objective of this work was to study the genetic architecture of variation in gene expression in the blood of healthy young pigs using two approaches: an expression genome-wide association study (eGWAS) and allele-specific expression (ASE) analysis. RESULTS The blood transcriptomes of 60-day-old Large White pigs were analyzed by expression microarrays for eGWAS (242 animals) and by RNA-Seq for ASE analysis (38 animals). Using eGWAS, the expression levels of 1901 genes were found to be associated with expression quantitative trait loci (eQTLs). We recovered 2839 local and 1752 distant associations (Single Nucleotide Polymorphism or SNP located less or more than 1 Mb from expression probe, respectively). ASE analyses confirmed the extensive cis-regulation of gene transcription in blood, and revealed allelic imbalance in 2286 SNPs, which affected 763 genes. eQTLs and ASE-genes were widely distributed on all chromosomes. By analyzing mutually overlapping eGWAS results, we were able to describe putative regulatory networks, which were further refined using ASE data. At the functional level, genes with genetically controlled expression that were detected by eGWAS and/or ASE analyses were significantly enriched in biological processes related to RNA processing and immune function. Indeed, numerous distant and local regulatory relationships were detected within the major histocompatibility complex region on chromosome 7, revealing ASE for most class I and II genes. CONCLUSIONS This study represents, to the best of our knowledge, the first genome-wide map of the genetic control of gene expression in porcine peripheral blood. These results represent an interesting resource for the identification of genetic markers and blood biomarkers associated with variations in immunity traits in pigs, as well as any other complex traits for which blood is an appropriate surrogate tissue.
Collapse
Affiliation(s)
- T Maroilley
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.
| | - G Lemonnier
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - J Lecardonnel
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - D Esquerré
- GenPhySE, INRA, INPT, ENVT, Université de Toulouse, 31326, Castanet-Tolosan, France
| | - Y Ramayo-Caldas
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - M J Mercat
- IFIP - Institut du porc/BIOPORC, La Motte au Vicomte, BP 35104, 35651, Le Rheu, France
| | - C Rogel-Gaillard
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.
| | - J Estellé
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.
| |
Collapse
|
24
|
Müller B, Schaadt G, Boltze J, Emmrich F, Skeide MA, Neef NE, Kraft I, Brauer J, Friederici AD, Kirsten H, Wilcke A. ATP2C2 and DYX1C1 are putative modulators of dyslexia-related MMR. Brain Behav 2017; 7:e00851. [PMID: 29201552 PMCID: PMC5698869 DOI: 10.1002/brb3.851] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Revised: 08/23/2017] [Accepted: 09/01/2017] [Indexed: 11/07/2022] Open
Abstract
BACKGROUND Dyslexia is a specific learning disorder affecting reading and spelling abilities. Its prevalence is ~5% in German-speaking individuals. Although the etiology of dyslexia largely remains to be determined, comprehensive evidence supports deficient phonological processing as a major contributing factor. An important prerequisite for phonological processing is auditory discrimination and, thus, essential for acquiring reading and spelling skills. The event-related potential Mismatch Response (MMR) is an indicator for auditory discrimination capabilities with dyslexics showing an altered late component of MMR in response to auditory input. METHODS In this study, we comprehensively analyzed associations of dyslexia-specific late MMRs with genetic variants previously reported to be associated with dyslexia-related phenotypes in multiple studies comprising 25 independent single-nucleotide polymorphisms (SNPs) within 10 genes. RESULTS First, we demonstrated validity of these SNPs for dyslexia in our sample by showing that additional inclusion of a polygenic risk score improved prediction of impaired writing compared with a model that used MMR alone. Secondly, a multifactorial regression analysis was conducted to uncover the subset of the 25 SNPs that is associated with the dyslexia-specific late component of MMR. In total, four independent SNPs within DYX1C1 and ATP2C2 were found to be associated with MMR stronger than expected from multiple testing. To explore potential pathomechanisms, we annotated these variants with functional data including tissue-specific expression analysis and eQTLs. CONCLUSION Our findings corroborate the late component of MMR as a potential endophenotype for dyslexia and support tripartite relationships between dyslexia-related SNPs, the late component of MMR and dyslexia.
Collapse
Affiliation(s)
- Bent Müller
- Fraunhofer Institute for Cell Therapy and Immunology Leipzig Germany
| | - Gesa Schaadt
- Department of Neuropsychology Max Planck Institute for Human Cognitive and Brain Sciences Leipzig Germany.,Department of Psychology Humboldt-Universität zu Berlin Berlin Germany
| | - Johannes Boltze
- Fraunhofer Institute for Cell Therapy and Immunology Leipzig Germany.,Department of Medical Cell Technology Fraunhofer Research Institution for Marine Biotechnology Lübeck Germany.,Institute for Medical and Marine Biotechnology University of Lübeck Lübeck Germany
| | - Frank Emmrich
- Fraunhofer Institute for Cell Therapy and Immunology Leipzig Germany
| | | | - Michael A Skeide
- Department of Neuropsychology Max Planck Institute for Human Cognitive and Brain Sciences Leipzig Germany
| | - Nicole E Neef
- Department of Neuropsychology Max Planck Institute for Human Cognitive and Brain Sciences Leipzig Germany
| | - Indra Kraft
- Department of Neuropsychology Max Planck Institute for Human Cognitive and Brain Sciences Leipzig Germany
| | - Jens Brauer
- Department of Neuropsychology Max Planck Institute for Human Cognitive and Brain Sciences Leipzig Germany
| | - Angela D Friederici
- Department of Neuropsychology Max Planck Institute for Human Cognitive and Brain Sciences Leipzig Germany
| | - Holger Kirsten
- Fraunhofer Institute for Cell Therapy and Immunology Leipzig Germany.,Institute for Medical Informatics Statistics and Epidemiology University of Leipzig Leipzig Germany.,LIFE-Leipzig Research Center for Civilization Diseases University of Leipzig Leipzig Germany
| | - Arndt Wilcke
- Fraunhofer Institute for Cell Therapy and Immunology Leipzig Germany
| |
Collapse
|
25
|
Wong ES, Schmitt BM, Kazachenka A, Thybert D, Redmond A, Connor F, Rayner TF, Feig C, Ferguson-Smith AC, Marioni JC, Odom DT, Flicek P. Interplay of cis and trans mechanisms driving transcription factor binding and gene expression evolution. Nat Commun 2017; 8:1092. [PMID: 29061983 PMCID: PMC5653656 DOI: 10.1038/s41467-017-01037-x] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 08/09/2017] [Indexed: 12/23/2022] Open
Abstract
Noncoding regulatory variants play a central role in the genetics of human diseases and in evolution. Here we measure allele-specific transcription factor binding occupancy of three liver-specific transcription factors between crosses of two inbred mouse strains to elucidate the regulatory mechanisms underlying transcription factor binding variations in mammals. Our results highlight the pre-eminence of cis-acting variants on transcription factor occupancy divergence. Transcription factor binding differences linked to cis-acting variants generally exhibit additive inheritance, while those linked to trans-acting variants are most often dominantly inherited. Cis-acting variants lead to local coordination of transcription factor occupancies that decay with distance; distal coordination is also observed and may be modulated by long-range chromatin contacts. Our results reveal the regulatory mechanisms that interplay to drive transcription factor occupancy, chromatin state, and gene expression in complex mammalian cell states.
Collapse
Affiliation(s)
- Emily S Wong
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Bianca M Schmitt
- University of Cambridge, Cancer Research UK-Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | | | - David Thybert
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Aisling Redmond
- University of Cambridge, Cancer Research UK-Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | - Frances Connor
- University of Cambridge, Cancer Research UK-Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | - Tim F Rayner
- University of Cambridge, Cancer Research UK-Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | - Christine Feig
- University of Cambridge, Cancer Research UK-Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | | | - John C Marioni
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- University of Cambridge, Cancer Research UK-Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | - Duncan T Odom
- University of Cambridge, Cancer Research UK-Cambridge Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK.
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| |
Collapse
|
26
|
Reconstructing the Molecular Function of Genetic Variation in Regulatory Networks. Genetics 2017; 207:1699-1709. [PMID: 29046401 DOI: 10.1534/genetics.117.300381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 09/11/2017] [Indexed: 11/18/2022] Open
Abstract
Over the past decade, genetic studies have recognized hundreds of polymorphic DNA loci called response QTLs (reQTLs) as potential contributors to interindividual variation in transcriptional responses to stimulations. Such reQTLs commonly affect the transduction of signals along the regulatory network that controls gene transcription. Identifying the pathways through which reQTLs perturb the underlying network has been a major challenge. Here, we present GEVIN ("Genome-wide Embedding of Variation In Networks"), a methodology that simultaneously identifies a reQTL and the particular pathway in which the reQTL affects downstream signal transduction along the network. Using synthetic data, we show that this algorithm outperforms existing pathway identification and reQTL identification methods. We applied GEVIN to the analysis of murine and human dendritic cells in response to pathogenic components. These analyses revealed significant reQTLs together with their perturbed Toll-like receptor signaling pathways. GEVIN thus offers a powerful framework that renders a comprehensive picture of disease-related DNA loci and their molecular functions within regulatory networks.
Collapse
|
27
|
Sá ACC, Sadee W, Johnson JA. Whole Transcriptome Profiling: An RNA-Seq Primer and Implications for Pharmacogenomics Research. Clin Transl Sci 2017; 11:153-161. [PMID: 28945944 PMCID: PMC5866981 DOI: 10.1111/cts.12511] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 09/03/2017] [Indexed: 12/16/2022] Open
Affiliation(s)
- Ana Caroline C Sá
- Center for Pharmacogenomics & Department of Pharmacotherapy and Translational Research, College of Pharmacy, University of Florida, Gainesville, Florida, USA.,Genetics & Genomic Graduate Program, Genetics Institute, University of Florida, Gainesville, Florida, USA
| | - Wolfgang Sadee
- Center for Pharmacogenomics, Department of Cancer Biology and Genetic, College of Medicine, Ohio State University, Columbus, Ohio, USA
| | - Julie A Johnson
- Center for Pharmacogenomics & Department of Pharmacotherapy and Translational Research, College of Pharmacy, University of Florida, Gainesville, Florida, USA.,Genetics & Genomic Graduate Program, Genetics Institute, University of Florida, Gainesville, Florida, USA.,Division of Cardiovascular Medicine, Colleges of Pharmacy and Medicine, University of Florida, Gainesville, Florida, USA
| |
Collapse
|
28
|
Sasayama D, Hattori K, Ogawa S, Yokota Y, Matsumura R, Teraishi T, Hori H, Ota M, Yoshida S, Kunugi H. Genome-wide quantitative trait loci mapping of the human cerebrospinal fluid proteome. Hum Mol Genet 2017; 26:44-51. [PMID: 28031287 DOI: 10.1093/hmg/ddw366] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Accepted: 10/21/2016] [Indexed: 11/12/2022] Open
Abstract
Cerebrospinal fluid (CSF) is virtually the only one accessible source of proteins derived from the central nervous system (CNS) of living humans and possibly reflects the pathophysiology of a variety of neuropsychiatric diseases. However, little is known regarding the genetic basis of variation in protein levels of human CSF. We examined CSF levels of 1,126 proteins in 133 subjects and performed a genome-wide association analysis of 514,227 single nucleotide polymorphisms (SNPs) to detect protein quantitative trait loci (pQTLs). To be conservative, Spearman's correlation was used to identify an association between genotypes of SNPs and protein levels. A total of 421 cis and 25 trans SNP-protein pairs were significantly correlated at a false discovery rate (FDR) of less than 0.01 (nominal P < 7.66 × 10-9). Cis-only analysis revealed additional 580 SNP-protein pairs with FDR < 0.01 (nominal P < 2.13 × 10-5). pQTL SNPs were more likely, compared to non-pQTL SNPs, to be a disease/trait-associated variants identified by previous genome-wide association studies. The present findings suggest that genetic variations play an important role in the regulation of protein expression in the CNS. The obtained database may serve as a valuable resource to understand the genetic bases for CNS protein expression pattern in humans.
Collapse
Affiliation(s)
- Daimei Sasayama
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan.,Department of Psychiatry, Shinshu University School of Medicine, Matsumoto, Nagano, Japan
| | - Kotaro Hattori
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan.,Translational Medical Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
| | - Shintaro Ogawa
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan
| | - Yuuki Yokota
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan.,Translational Medical Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
| | - Ryo Matsumura
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan.,Translational Medical Center, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
| | - Toshiya Teraishi
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan
| | - Hiroaki Hori
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan
| | - Miho Ota
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan
| | - Sumiko Yoshida
- Department of Psychiatry, National Center Hospital, National Center of Neurology and Psychiatry, Kodaira, Tokyo, Japan
| | - Hiroshi Kunugi
- Department of Mental Disorder Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Ogawahigashi, Kodaira, Tokyo, Japan
| |
Collapse
|
29
|
Wingo TS, Duong DM, Zhou M, Dammer EB, Wu H, Cutler DJ, Lah JJ, Levey AI, Seyfried NT. Integrating Next-Generation Genomic Sequencing and Mass Spectrometry To Estimate Allele-Specific Protein Abundance in Human Brain. J Proteome Res 2017; 16:3336-3347. [PMID: 28691493 DOI: 10.1021/acs.jproteome.7b00324] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Gene expression contributes to phenotypic traits and human disease. To date, comparatively less is known about regulators of protein abundance, which is also under genetic control and likely influences clinical phenotypes. However, identifying and quantifying allele-specific protein abundance by bottom-up proteomics is challenging since single nucleotide variants (SNVs) that alter protein sequence are not considered in standard human protein databases. To address this, we developed the GenPro software and used it to create personalized protein databases (PPDs) to identify single amino acid variants (SAAVs) at the protein level from whole exome sequencing. In silico assessment of PPDs generated by GenPro revealed only a 1% increase in tryptic search space compared to a direct translation of all human transcripts and an equivalent search space compared to the UniProtKB reference database. To identify a large unbiased number of SAAV peptides, we performed high-resolution mass spectrometry-based proteomics for two human post-mortem brain samples and searched the collected MS/MS spectra against their respective PPD. We found an average of ∼117 000 unique peptides mapping to ∼9300 protein groups for each sample, and of these, 977 were unique variant peptides. We found that over 400 reference and SAAV peptide pairs were, on average, equally abundant in human brain by label-free ion intensity measurements and confirmed the absolute levels of three reference and SAAV peptide pairs using heavy labeled peptides standards coupled with parallel reaction monitoring (PRM). Our results highlight the utility of integrating genomic and proteomic sequencing data to identify sample-specific SAAV peptides and support the hypothesis that most alleles are equally expressed in human brain.
Collapse
Affiliation(s)
- Thomas S Wingo
- Division of Neurology, Department of Veterans Affairs Medical Center , Decatur, Georgia 30033, United States
| | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Kumar J, Gupta DS, Gupta S, Dubey S, Gupta P, Kumar S. Quantitative trait loci from identification to exploitation for crop improvement. PLANT CELL REPORTS 2017; 36:1187-1213. [PMID: 28352970 DOI: 10.1007/s00299-017-2127-y] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 03/09/2017] [Indexed: 05/24/2023]
Abstract
Advancement in the field of genetics and genomics after the discovery of Mendel's laws of inheritance has led to map the genes controlling qualitative and quantitative traits in crop plant species. Mapping of genomic regions controlling the variation of quantitatively inherited traits has become routine after the advent of different types of molecular markers. Recently, the next generation sequencing methods have accelerated the research on QTL analysis. These efforts have led to the identification of more closely linked molecular markers with gene/QTLs and also identified markers even within gene/QTL controlling the trait of interest. Efforts have also been made towards cloning gene/QTLs or identification of potential candidate genes responsible for a trait. Further new concepts like crop QTLome and QTL prioritization have accelerated precise application of QTLs for genetic improvement of complex traits. In the past years, efforts have also been made in exploitation of a number of QTL for improving grain yield or other agronomic traits in various crops through markers assisted selection leading to cultivation of these improved varieties at farmers' field. In present article, we reviewed QTLs from their identification to exploitation in plant breeding programs and also reviewed that how improved cultivars developed through introgression of QTLs have improved the yield productivity in many crops.
Collapse
Affiliation(s)
- Jitendra Kumar
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India.
| | - Debjyoti Sen Gupta
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Sunanda Gupta
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Sonali Dubey
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Priyanka Gupta
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Shiv Kumar
- International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat-Institutes, B.P. 6299, Rabat, Morocco
| |
Collapse
|
31
|
Benowitz KM, McKinney EC, Cunningham CB, Moore AJ. Relating quantitative variation within a behavior to variation in transcription. Evolution 2017; 71:1999-2009. [PMID: 28542920 DOI: 10.1111/evo.13273] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Revised: 05/27/2017] [Accepted: 04/21/2017] [Indexed: 12/14/2022]
Abstract
Many studies have shown that variation in transcription is associated with changes in behavioral state, or with variation within a state, but little has been done to address if the same genes are involved in both. Here, we investigate the transcriptional basis of variation in parental provisioning using two species of burying beetle, Nicrophorus orbicollis and Nicrophorus vespilloides. We used RNA-seq to compare transcription in parents that provided high amounts of provisioning behavior versus low amounts in males and females of each species. We found no overarching transcriptional patterns distinguishing high from low caring parents, and no informative transcripts that displayed particularly large expression differences in either sex. However, we did find subtler gene expression differences between high and low provisioning parents that are consistent across both sexes and species. Furthermore, we show that transcripts previously implicated in transitioning into parental care in N. vespilloides had high variance in the levels of transcription and were unusually likely to display differential expression between high and low provisioning parents. Thus, quantitative behavioral variation appears to reflect many transcriptional differences of small effect. Furthermore, the same transcripts required for the transition between behavioral states are also related to variation within a behavioral state.
Collapse
Affiliation(s)
- Kyle M Benowitz
- Department of Genetics, University of Georgia, Athens, Georgia, 30602
| | | | - Christopher B Cunningham
- Department of Genetics, University of Georgia, Athens, Georgia, 30602.,Department of Biosciences, Swansea University, Swansea, SA2 8PP, UK
| | - Allen J Moore
- Department of Genetics, University of Georgia, Athens, Georgia, 30602
| |
Collapse
|
32
|
Ju JH, Shenoy SA, Crystal RG, Mezey JG. An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci. PLoS Comput Biol 2017; 13:e1005537. [PMID: 28505156 PMCID: PMC5448815 DOI: 10.1371/journal.pcbi.1005537] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Revised: 05/30/2017] [Accepted: 04/28/2017] [Indexed: 11/19/2022] Open
Abstract
Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly valuable for network modeling and disease analysis. To enable the identification of such broad impact eQTL, we introduce CONFETI: Confounding Factor Estimation Through Independent component analysis. CONFETI is designed to address two conflicting issues when searching for broad impact eQTL: the need to account for non-genetic confounding factors that can lower the power of the analysis or produce broad impact eQTL false positives, and the tendency of methods that account for confounding factors to model broad impact eQTL as non-genetic variation. The key advance of the CONFETI framework is the use of Independent Component Analysis (ICA) to identify variation likely caused by broad impact eQTL when constructing the sample covariance matrix used for the random effect in a mixed model. We show that CONFETI has better performance than other mixed model confounding factor methods when considering broad impact eQTL recovery from synthetic data. We also used the CONFETI framework and these same confounding factor methods to identify eQTL that replicate between matched twin pair datasets in the Multiple Tissue Human Expression Resource (MuTHER), the Depression Genes Networks study (DGN), the Netherlands Study of Depression and Anxiety (NESDA), and multiple tissue types in the Genotype-Tissue Expression (GTEx) consortium. These analyses identified both cis-eQTL and trans-eQTL impacting individual genes, and CONFETI had better or comparable performance to other mixed model confounding factor analysis methods when identifying such eQTL. In these analyses, we were able to identify and replicate a few broad impact eQTL although the overall number was small even when applying CONFETI. In light of these results, we discuss the broad impact eQTL that have been previously reported from the analysis of human data and suggest that considerable caution should be exercised when making biological inferences based on these reported eQTL.
Collapse
Affiliation(s)
- Jin Hyun Ju
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, United States of America
| | - Sushila A. Shenoy
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
| | - Ronald G. Crystal
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
| | - Jason G. Mezey
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, United States of America
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, United States of America
- * E-mail:
| |
Collapse
|
33
|
Coan PM, Hummel O, Garcia Diaz A, Barrier M, Alfazema N, Norsworthy PJ, Pravenec M, Petretto E, Hübner N, Aitman TJ. Genetic, physiological and comparative genomic studies of hypertension and insulin resistance in the spontaneously hypertensive rat. Dis Model Mech 2017; 10:297-306. [PMID: 28130354 PMCID: PMC5374317 DOI: 10.1242/dmm.026716] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Accepted: 01/23/2017] [Indexed: 12/18/2022] Open
Abstract
We previously mapped hypertension-related insulin resistance quantitative trait loci (QTLs) to rat chromosomes 4, 12 and 16 using adipocytes from F2 crosses between spontaneously hypertensive (SHR) and Wistar Kyoto (WKY) rats, and subsequently identified Cd36 as the gene underlying the chromosome 4 locus. The identity of the chromosome 12 and 16 genes remains unknown. To identify whole-body phenotypes associated with the chromosome 12 and 16 linkage regions, we generated and characterised new congenic strains, with WKY donor segments introgressed onto an SHR genetic background, for the chromosome 12 and 16 linkage regions. We found a >50% increase in insulin sensitivity in both the chromosome 12 and 16 strains. Blood pressure and left ventricular mass were reduced in the two congenic strains consistent with the congenic segments harbouring SHR genes for insulin resistance, hypertension and cardiac hypertrophy. Integrated genomic analysis, using physiological and whole-genome sequence data across 42 rat strains, identified variants within the congenic regions in Upk3bl, RGD1565131 and AABR06087018.1 that were associated with blood pressure, cardiac mass and insulin sensitivity. Quantitative trait transcript analysis across 29 recombinant inbred strains showed correlation between expression of Hspb1, Zkscan5 and Pdgfrl with adipocyte volume, systolic blood pressure and cardiac mass, respectively. Comparative genome analysis showed a marked enrichment of orthologues for human GWAS-associated genes for insulin resistance within the syntenic regions of both the chromosome 12 and 16 congenic intervals. Our study defines whole-body phenotypes associated with the SHR chromosome 12 and 16 insulin-resistance QTLs, identifies candidate genes for these SHR QTLs and finds human orthologues of rat genes in these regions that associate with related human traits. Further study of these genes in the congenic strains will lead to robust identification of the underlying genes and cellular mechanisms. Summary: Comparative genome analyses identify candidate genes for hypertension and insulin resistance on rat chromosomes 12 and 16, and marked enrichment of insulin resistance genes in the syntenic regions of the human genome.
Collapse
Affiliation(s)
- Philip M Coan
- Centre for Genomic and Experimental Medicine & Centre for Cardiovascular Science, Queen's Medical Research Institute, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Oliver Hummel
- Cardiovascular and Metabolic Sciences, Max-Delbrück-Center for Molecular Medicine (MDC), 13125 Berlin, Germany
| | - Ana Garcia Diaz
- Department of Medicine, Imperial College London, London SW7 2AZ, UK
| | - Marjorie Barrier
- Centre for Genomic and Experimental Medicine & Centre for Cardiovascular Science, Queen's Medical Research Institute, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Neza Alfazema
- Centre for Genomic and Experimental Medicine & Centre for Cardiovascular Science, Queen's Medical Research Institute, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Penny J Norsworthy
- MRC Clinical Sciences Centre, Imperial College London, London W12 0NN, UK
| | - Michal Pravenec
- Department of Model Diseases, Institute of Physiology, Czech Academy of Sciences, 142 20 Prague, Czech Republic
| | - Enrico Petretto
- MRC Clinical Sciences Centre, Imperial College London, London W12 0NN, UK.,Duke-NUS Medical School, Singapore 169857, Republic of Singapore
| | - Norbert Hübner
- Cardiovascular and Metabolic Sciences, Max-Delbrück-Center for Molecular Medicine (MDC), 13125 Berlin, Germany.,DZHK (German Centre for Cardiovascular Research), partner site, 13316 Berlin, Germany.,Charité-Universitätsmedizin, 10117 Berlin, Germany
| | - Timothy J Aitman
- Centre for Genomic and Experimental Medicine & Centre for Cardiovascular Science, Queen's Medical Research Institute, University of Edinburgh, Edinburgh EH4 2XU, UK.,Department of Medicine, Imperial College London, London SW7 2AZ, UK
| |
Collapse
|
34
|
|
35
|
Moreno-Moral A, Pesce F, Behmoaras J, Petretto E. Systems Genetics as a Tool to Identify Master Genetic Regulators in Complex Disease. Methods Mol Biol 2017; 1488:337-362. [PMID: 27933533 DOI: 10.1007/978-1-4939-6427-7_16] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Systems genetics stems from systems biology and similarly employs integrative modeling approaches to describe the perturbations and phenotypic effects observed in a complex system. However, in the case of systems genetics the main source of perturbation is naturally occurring genetic variation, which can be analyzed at the systems-level to explain the observed variation in phenotypic traits. In contrast with conventional single-variant association approaches, the success of systems genetics has been in the identification of gene networks and molecular pathways that underlie complex disease. In addition, systems genetics has proven useful in the discovery of master trans-acting genetic regulators of functional networks and pathways, which in many cases revealed unexpected gene targets for disease. Here we detail the central components of a fully integrated systems genetics approach to complex disease, starting from assessment of genetic and gene expression variation, linking DNA sequence variation to mRNA (expression QTL mapping), gene regulatory network analysis and mapping the genetic control of regulatory networks. By summarizing a few illustrative (and successful) examples, we highlight how different data-modeling strategies can be effectively integrated in a systems genetics study.
Collapse
Affiliation(s)
- Aida Moreno-Moral
- Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore
| | - Francesco Pesce
- National Heart and Lung Institute, Faculty of Medicine, Imperial College London, Hammersmith Campus, Imperial Centre for Translational and Experimental Medicine, London, UK
| | - Jacques Behmoaras
- Centre for Complement and Inflammation Research, Imperial College London, Hammersmith Hospital, Du Cane Road, London, W12 0NN, UK
| | - Enrico Petretto
- Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore.
| |
Collapse
|
36
|
Transcriptome Profiling in Rat Inbred Strains and Experimental Cross Reveals Discrepant Genetic Architecture of Genome-Wide Gene Expression. G3-GENES GENOMES GENETICS 2016; 6:3671-3683. [PMID: 27646706 PMCID: PMC5100866 DOI: 10.1534/g3.116.033274] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
To test the impact of genetic heterogeneity on cis- and trans-mediated mechanisms of gene expression regulation, we profiled the transcriptome of adipose tissue in 20 inbred congenic strains derived from diabetic Goto-Kakizaki (GK) rats and Brown-Norway (BN) controls, which contain well-defined blocks (1-183 Mb) of genetic polymorphisms, and in 123 genetically heterogeneous rats of an (GK × BN)F2 offspring. Within each congenic we identified 73-1351 differentially expressed genes (DEGs), only 7.7% of which mapped within the congenic blocks, and which may be regulated in cis The remainder localized outside the blocks, and therefore must be regulated in trans Most trans-regulated genes exhibited approximately twofold expression changes, consistent with monoallelic expression. Altered biological pathways were replicated between congenic strains sharing blocks of genetic polymorphisms, but polymorphisms at different loci also had redundant effects on transcription of common distant genes and pathways. We mapped 2735 expression quantitative trait loci (eQTL) in the F2 cross, including 26% predominantly cis-regulated genes, which validated DEGs in congenic strains. A hotspot of >300 eQTL in a 10 cM region of chromosome 1 was enriched in DEGs in a congenic strain. However, many DEGs among GK, BN and congenic strains did not replicate as eQTL in F2 hybrids, demonstrating distinct mechanisms of gene expression when alleles segregate in an outbred population or are fixed homozygous across the entire genome or in short genomic regions. Our analysis provides conceptual advances in our understanding of the complex architecture of genome expression and pathway regulation, and suggests a prominent impact of epistasis and monoallelic expression on gene transcription.
Collapse
|
37
|
Pitsillides AN, Choi SH, Hogan JD, Hong J, Lin H. Association of genetic variations and gene expression in a family-based study. BMC Proc 2016; 10:109-112. [PMID: 27980620 PMCID: PMC5133483 DOI: 10.1186/s12919-016-0014-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Expression quantitative trait locus (eQTL) maps are considered a valuable resource in studying complex diseases. The availability of gene expression data from the Genetic Analysis Workshop 19 (GAW19) provides a great opportunity to investigate the association of gene expression with genetic variants in blood. METHODS A total of 267 samples with gene expression and whole genome sequencing data were employed in this study. We used linear mixed models with genetic random effects along with a permutation procedure to create an eQTL map. The eQTL map was further tested in terms of functional implication, including the enrichment in disease-related variants and in regulatory regions. RESULTS We identified 22,869 significant eQTLs from the GAW19 data set. These eQTLs were highly enriched with genetic loci associated with blood pressure and DNase hypersensitive regions. In addition, the majority of genes associated with eQTLs showed moderate to high heritability (h2 > 0.4). CONCLUSIONS We successfully created an eQTL map from the GAW19 data set. Our study indicated that the eQTLs were enriched within regulatory regions, and tended to have relatively high heritability.
Collapse
Affiliation(s)
- Achilleas N. Pitsillides
- National Heart Lung and Blood Institute’s Framingham Heart Study, 73 Mt. Wayte Avenue, Suite 2, Framingham, MA 01702 USA
| | - Seung-Hoan Choi
- Department of Biostatistics, Boston University School of Public Health, 677 Huntington Avenue, Boston, MA 02115 USA
| | - John D. Hogan
- Program in Bioinformatics, Boston University, Boston, MA 02215 USA
| | - Jaeyoung Hong
- Department of Biostatistics, Boston University School of Public Health, 677 Huntington Avenue, Boston, MA 02115 USA
| | - Honghuang Lin
- National Heart Lung and Blood Institute’s Framingham Heart Study, 73 Mt. Wayte Avenue, Suite 2, Framingham, MA 01702 USA
- Department of Medicine, Boston University School of Medicine, 72 East Concord St, Boston, MA 02118 USA
| |
Collapse
|
38
|
Moreno-Moral A, Petretto E. From integrative genomics to systems genetics in the rat to link genotypes to phenotypes. Dis Model Mech 2016; 9:1097-1110. [PMID: 27736746 PMCID: PMC5087832 DOI: 10.1242/dmm.026104] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Complementary to traditional gene mapping approaches used to identify the hereditary components of complex diseases, integrative genomics and systems genetics have emerged as powerful strategies to decipher the key genetic drivers of molecular pathways that underlie disease. Broadly speaking, integrative genomics aims to link cellular-level traits (such as mRNA expression) to the genome to identify their genetic determinants. With the characterization of several cellular-level traits within the same system, the integrative genomics approach evolved into a more comprehensive study design, called systems genetics, which aims to unravel the complex biological networks and pathways involved in disease, and in turn map their genetic control points. The first fully integrated systems genetics study was carried out in rats, and the results, which revealed conserved trans-acting genetic regulation of a pro-inflammatory network relevant to type 1 diabetes, were translated to humans. Many studies using different organisms subsequently stemmed from this example. The aim of this Review is to describe the most recent advances in the fields of integrative genomics and systems genetics applied in the rat, with a focus on studies of complex diseases ranging from inflammatory to cardiometabolic disorders. We aim to provide the genetics community with a comprehensive insight into how the systems genetics approach came to life, starting from the first integrative genomics strategies [such as expression quantitative trait loci (eQTLs) mapping] and concluding with the most sophisticated gene network-based analyses in multiple systems and disease states. Although not limited to studies that have been directly translated to humans, we will focus particularly on the successful investigations in the rat that have led to primary discoveries of genes and pathways relevant to human disease.
Collapse
Affiliation(s)
- Aida Moreno-Moral
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore (NUS) Medical School, Singapore
| | - Enrico Petretto
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore (NUS) Medical School, Singapore
| |
Collapse
|
39
|
He X, Johansson ML, Heath DD. Role of genomics and transcriptomics in selection of reintroduction source populations. CONSERVATION BIOLOGY : THE JOURNAL OF THE SOCIETY FOR CONSERVATION BIOLOGY 2016; 30:1010-1018. [PMID: 26756292 DOI: 10.1111/cobi.12674] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Revised: 01/02/2016] [Accepted: 01/05/2016] [Indexed: 06/05/2023]
Abstract
The use and importance of reintroduction as a conservation tool to return a species to its historical range from which it has been extirpated will increase as climate change and human development accelerate habitat loss and population extinctions. Although the number of reintroduction attempts has increased rapidly over the past 2 decades, the success rate is generally low. As a result of population differences in fitness-related traits and divergent responses to environmental stresses, population performance upon reintroduction is highly variable, and it is generally agreed that selecting an appropriate source population is a critical component of a successful reintroduction. Conservation genomics is an emerging field that addresses long-standing challenges in conservation, and the potential for using novel molecular genetic approaches to inform and improve conservation efforts is high. Because the successful establishment and persistence of reintroduced populations is highly dependent on the functional genetic variation and environmental stress tolerance of the source population, we propose the application of conservation genomics and transcriptomics to guide reintroduction practices. Specifically, we propose using genome-wide functional loci to estimate genetic variation of source populations. This estimate can then be used to predict the potential for adaptation. We also propose using transcriptional profiling to measure the expression response of fitness-related genes to environmental stresses as a proxy for acclimation (tolerance) capacity. Appropriate application of conservation genomics and transcriptomics has the potential to dramatically enhance reintroduction success in a time of rapidly declining biodiversity and accelerating environmental change.
Collapse
Affiliation(s)
- Xiaoping He
- Great Lakes Institute for Environmental Research, University of Windsor, Windsor, ON, N9B 3P4, Canada
| | - Mattias L Johansson
- Great Lakes Institute for Environmental Research, University of Windsor, Windsor, ON, N9B 3P4, Canada
| | - Daniel D Heath
- Great Lakes Institute for Environmental Research, University of Windsor, Windsor, ON, N9B 3P4, Canada.
| |
Collapse
|
40
|
Richardson TG, Shihab HA, Hemani G, Zheng J, Hannon E, Mill J, Carnero-Montoro E, Bell JT, Lyttleton O, McArdle WL, Ring SM, Rodriguez S, Campbell C, Smith GD, Relton CL, Timpson NJ, Gaunt TR. Collapsed methylation quantitative trait loci analysis for low frequency and rare variants. Hum Mol Genet 2016; 25:4339-4349. [PMID: 27559110 PMCID: PMC5291201 DOI: 10.1093/hmg/ddw283] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Revised: 06/30/2016] [Accepted: 08/12/2016] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Single variant approaches have been successful in identifying DNA methylation quantitative trait loci (mQTL), although as with complex traits they lack the statistical power to identify the effects from rare genetic variants. We have undertaken extensive analyses to identify regions of low frequency and rare variants that are associated with DNA methylation levels. METHODS We used repeated measurements of DNA methylation from five different life stages in human blood, taken from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort. Variants were collapsed across CpG islands and their flanking regions to identify variants collectively associated with methylation, where no single variant was individually responsible for the observed signal. All analyses were undertaken using the sequence kernel association test. RESULTS For loci where no individual variant mQTL was observed based on a single variant analysis, we identified 95 unique regions where the combined effect of low frequency variants (MAF ≤ 5%) provided strong evidence of association with methylation. For loci where there was previous evidence of an individual variant mQTL, a further 3 regions provided evidence of association between multiple low frequency variants and methylation levels. Effects were observed consistently across 5 different time points in the lifecourse and evidence of replication in the TwinsUK and Exeter cohorts was also identified. CONCLUSION We have demonstrated the potential of this novel approach to mQTL analysis by analysing the combined effect of multiple low frequency or rare variants. Future studies should benefit from applying this approach as a complementary follow up to single variant analyses.
Collapse
Affiliation(s)
- Tom G Richardson
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| | - Hashem A Shihab
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| | - Gibran Hemani
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| | - Jie Zheng
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| | - Eilis Hannon
- University of Exeter Medical School, University of Exeter, Exeter, UK
| | - Jonathan Mill
- University of Exeter Medical School, University of Exeter, Exeter, UK
- Institute of Psychiatry, King's College London, London, UK
| | - Elena Carnero-Montoro
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Jordana T Bell
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Oliver Lyttleton
- Avon Longitudinal Study of Parents and Children (ALSPAC) & School of Social and Community Medicine, University of Bristol, Bristol, UK
| | - Wendy L McArdle
- Avon Longitudinal Study of Parents and Children (ALSPAC) & School of Social and Community Medicine, University of Bristol, Bristol, UK
| | - Susan M Ring
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
- Avon Longitudinal Study of Parents and Children (ALSPAC) & School of Social and Community Medicine, University of Bristol, Bristol, UK
| | - Santiago Rodriguez
- Bristol Genetic Epidemiology Laboratories, School of Social and Community Medicine, University of Bristol, Bristol, UK
| | - Colin Campbell
- Intelligent Systems Laboratory, University of Bristol, Bristol, UK
| | - George Davey Smith
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| | - Caroline L Relton
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| | - Nicholas J Timpson
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| | - Tom R Gaunt
- MRC Integrative Epidemiology Unit (IEU), School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
| |
Collapse
|
41
|
Guo CC, Wei N, Liang SH, Wang BL, Sha SM, Wu KC. Population-specific genome-wide mapping of expression quantitative trait loci in the colon of Han Chinese. J Dig Dis 2016; 17:600-609. [PMID: 27534592 DOI: 10.1111/1751-2980.12399] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/05/2016] [Revised: 08/06/2016] [Accepted: 08/14/2016] [Indexed: 12/11/2022]
Abstract
OBJECTIVE To establish the colonic expression quantitative trait locus map in Han Chinese population and provide a functional reference for interpreting genetic associations of diseases such as inflammatory bowel disease (IBD). METHODS Colonic mucosal biopsies and peripheral blood samples were obtained from 48 Chinese Han individuals (24 ulcerative colitis patients and 24 healthy controls). Transcription profiling was performed using human whole genome expression array. Genotyping was done using a population-specific genotype array. Imputation was performed using IMPUTE2. Association between genotypes and gene expression was analyzed using a Matrix Expression Quantitative Trait Loci (eQTL) R package to identify eQTL. We used ChIPpeakAnno R package for annotation of the eQTL. Linkage disequilibrium between the eQTL and IBD risk loci was also investigated. RESULTS We identified 6 377 single nucleotide polymorphism-transcript interactions (cis-eQTL) in the colon of the Chinese participants. Most of the eQTL located near the transcription starting sites and overlapped with histone modification marks on the genome. A significant proportion of the eQTL were found to be within transcription factor-binding sites. Two IBD risk loci were found to be colon cis-eQTL in Chinese individuals, and 51 cis-eQTL were identified in another 18 IBD risk loci. CONCLUSIONS This study defined a population-specific catalogue of colon eQTL in the Chinese population. Potential functional variants of IBD association signals were identified. We provided a useful reference dataset for fine mapping IBD risk loci and identifying causal variants in the Chinese Han population.
Collapse
Affiliation(s)
- Chang Cun Guo
- State Key Lab of Cancer Biology, Fourth Military Medical University, Xi'an, Shaanxi Province, China
| | - Ni Wei
- Department of Gastroenterology, Xijing Hospital, Fourth Military Medical University, Xi'an, Shaanxi Province, China
| | - Shu Hui Liang
- State Key Lab of Cancer Biology, Fourth Military Medical University, Xi'an, Shaanxi Province, China
| | - Biao Luo Wang
- Department of Gastroenterology, Xijing Hospital, Fourth Military Medical University, Xi'an, Shaanxi Province, China
| | - Su Mei Sha
- Department of Gastroenterology, Xijing Hospital, Fourth Military Medical University, Xi'an, Shaanxi Province, China
| | - Kai Chun Wu
- Department of Gastroenterology, Xijing Hospital, Fourth Military Medical University, Xi'an, Shaanxi Province, China
| |
Collapse
|
42
|
Genetic Regulation of Transcriptional Variation in Natural Arabidopsis thaliana Accessions. G3-GENES GENOMES GENETICS 2016; 6:2319-28. [PMID: 27226169 PMCID: PMC4978887 DOI: 10.1534/g3.116.030874] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
An increased knowledge of the genetic regulation of expression in Arabidopsis thaliana is likely to provide important insights about the basis of the plant’s extensive phenotypic variation. Here, we reanalyzed two publicly available datasets with genome-wide data on genetic and transcript variation in large collections of natural A. thaliana accessions. Transcripts from more than half of all genes were detected in the leaves of all accessions, and from nearly all annotated genes in at least one accession. Thousands of genes had high transcript levels in some accessions, but no transcripts at all in others, and this pattern was correlated with the genome-wide genotype. In total, 2669 eQTL were mapped in the largest population, and 717 of them were replicated in the other population. A total of 646 cis-eQTL-regulated genes that lacked detectable transcripts in some accessions was found, and for 159 of these we identified one, or several, common structural variants in the populations that were shown to be likely contributors to the lack of detectable RNA transcripts for these genes. This study thus provides new insights into the overall genetic regulation of global gene expression diversity in the leaf of natural A. thaliana accessions. Further, it also shows that strong cis-acting polymorphisms, many of which are likely to be structural variations, make important contributions to the transcriptional variation in the worldwide A. thaliana population.
Collapse
|
43
|
Von Willebrand Factor Gene Variants Associate with Herpes simplex Encephalitis. PLoS One 2016; 11:e0155832. [PMID: 27224245 PMCID: PMC4880288 DOI: 10.1371/journal.pone.0155832] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2016] [Accepted: 05/04/2016] [Indexed: 01/24/2023] Open
Abstract
Herpes simplex encephalitis (HSE) is a rare complication of Herpes simplex virus type-1 infection. It results in severe parenchymal damage in the brain. Although viral latency in neurons is very common in the population, it remains unclear why certain individuals develop HSE. Here we explore potential host genetic variants predisposing to HSE. In order to investigate this we used a rat HSE model comparing the HSE susceptible SHR (Spontaneously Hypertensive Rats) with the asymptomatic infection of BN (Brown Norway). Notably, both strains have HSV-1 spread to the CNS at four days after infection. A genome wide linkage analysis of 29 infected HXB/BXH RILs (recombinant inbred lines-generated from the prior two strains), displayed variable susceptibility to HSE enabling the definition of a significant QTL (quantitative trait locus) named Hse6 towards the end of chromosome 4 (160.89-174Mb) containing the Vwf (von Willebrand factor) gene. This was the only gene in the QTL with both cis-regulation in the brain and included several non-synonymous SNPs (single nucleotide polymorphism). Intriguingly, in human chromosome 12 several SNPs within the intronic region between exon 43 and 44 of the VWF gene were associated with human HSE pathogenesis. In particular, rs917859 is nominally associated with an odds ratio of 1.5 (95% CI 1.11-2.02; p-value = 0.008) after genotyping in 115 HSE cases and 428 controls. Although there are possibly several genetic and environmental factors involved in development of HSE, our study identifies variants of the VWF gene as candidates for susceptibility in experimental and human HSE.
Collapse
|
44
|
Gutierrez-Arcelus M, Rich SS, Raychaudhuri S. Autoimmune diseases - connecting risk alleles with molecular traits of the immune system. Nat Rev Genet 2016; 17:160-74. [PMID: 26907721 PMCID: PMC4896831 DOI: 10.1038/nrg.2015.33] [Citation(s) in RCA: 139] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Genome-wide strategies have driven the discovery of more than 300 susceptibility loci for autoimmune diseases. However, for almost all loci, understanding of the mechanisms leading to autoimmunity remains limited, and most variants that are likely to be causal are in non-coding regions of the genome. A critical next step will be to identify the in vivo and ex vivo immunophenotypes that are affected by risk variants. To do this, key cell types and cell states that are implicated in autoimmune diseases will need to be defined. Functional genomic annotations from these cell types and states can then be used to resolve candidate genes and causal variants. Together with longitudinal studies, this approach may yield pivotal insights into how autoimmunity is triggered.
Collapse
Affiliation(s)
- Maria Gutierrez-Arcelus
- Division of Genetics, and Division of Rheumatology, Immunology and Allergy, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts 02142, USA
- Partners Center for Personalized Genetic Medicine, Boston, Massachusetts 02115, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia 22908, USA
| | - Soumya Raychaudhuri
- Division of Genetics, and Division of Rheumatology, Immunology and Allergy, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts 02142, USA
- Partners Center for Personalized Genetic Medicine, Boston, Massachusetts 02115, USA
- Faculty of Medical and Human Sciences, University of Manchester, Manchester M13 9PL, UK
- Department of Medicine, Karolinska Institutet and Karolinska University Hospital Solna, Stockholm SE-171 77, Sweden
| |
Collapse
|
45
|
Rakitsch B, Stegle O. Modelling local gene networks increases power to detect trans-acting genetic effects on gene expression. Genome Biol 2016; 17:33. [PMID: 26911988 PMCID: PMC4765046 DOI: 10.1186/s13059-016-0895-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2015] [Accepted: 02/09/2016] [Indexed: 01/05/2023] Open
Abstract
Expression quantitative trait loci (eQTL) mapping is a widely used tool to study the genetics of gene expression. Confounding factors and the burden of multiple testing limit the ability to map distal trans eQTLs, which is important to understand downstream genetic effects on genes and pathways. We propose a two-stage linear mixed model that first learns local directed gene-regulatory networks to then condition on the expression levels of selected genes. We show that this covariate selection approach controls for confounding factors and regulatory context, thereby increasing eQTL detection power and improving the consistency between studies. GNet-LMM is available at: https://github.com/PMBio/GNetLMM.
Collapse
Affiliation(s)
- Barbara Rakitsch
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| | - Oliver Stegle
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| |
Collapse
|
46
|
Repnik K, Potočnik U. eQTL analysis links inflammatory bowel disease associated 1q21 locus to ECM1 gene. J Appl Genet 2016; 57:363-72. [PMID: 26738999 DOI: 10.1007/s13353-015-0334-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2015] [Revised: 12/16/2015] [Accepted: 12/18/2015] [Indexed: 12/11/2022]
Abstract
Genome-wide association studies (GWAS) have been highly successful in inflammatory bowel disease (IBD) with 163 confirmed associations so far. We used expression quantitative trait loci (eQTL) mapping to analyze IBD associated regions for which causative gene from the region is still unknown. First, we performed an extensive literature search and in silico analysis of published GWAS in IBD and eQTL studies and extracted 402 IBD associated SNPs assigned to 208 candidate loci, and 9562 eQTL correlations. When crossing GWA and eQTL data we found that for 50 % of loci there is no eQTL gene, while for 31.2 % we can determine one gene, for 11.1 % two genes and for the remaining 7.7 % three or more genes. Based on that we selected loci with one, two, and three or more eQTL genes and analyzed them in peripheral blood lymphocytes and intestine tissue samples of 606 Slovene patients with IBD and in 449 controls. Association analysis of selected SNPs showed statistical significance for three (rs2631372 and rs1050152 on 5q locus and rs13294 on 1q locus) out of six selected SNPs with at least one phenotype. Furthermore, with eQTL analysis of selected chromosomal regions, we confirmed a link between SNP and gene for four (SLC22A5 on 5q, ECM1 on 1q, ORMDL3 on 17q, and PUS10 on 2p locus) out of five selected regions. For 1q21 loci, we confirmed gene ECM1 as the most plausible gene from this region to be involved in pathogenesis of IBD and thereby contributed new eQTL correlation from this genomic region.
Collapse
Affiliation(s)
- Katja Repnik
- Faculty of Medicine, Center for Human Molecular Genetics and Pharmacogenomics, University of Maribor, Taborska ulica 8, 2000, Maribor, Slovenia.,Faculty for Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
| | - Uroš Potočnik
- Faculty of Medicine, Center for Human Molecular Genetics and Pharmacogenomics, University of Maribor, Taborska ulica 8, 2000, Maribor, Slovenia. .,Faculty for Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia.
| |
Collapse
|
47
|
Serin EAR, Nijveen H, Hilhorst HWM, Ligterink W. Learning from Co-expression Networks: Possibilities and Challenges. FRONTIERS IN PLANT SCIENCE 2016; 7:444. [PMID: 27092161 PMCID: PMC4825623 DOI: 10.3389/fpls.2016.00444] [Citation(s) in RCA: 186] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 03/21/2016] [Indexed: 05/18/2023]
Abstract
Plants are fascinating and complex organisms. A comprehensive understanding of the organization, function and evolution of plant genes is essential to disentangle important biological processes and to advance crop engineering and breeding strategies. The ultimate aim in deciphering complex biological processes is the discovery of causal genes and regulatory mechanisms controlling these processes. The recent surge of omics data has opened the door to a system-wide understanding of the flow of biological information underlying complex traits. However, dealing with the corresponding large data sets represents a challenging endeavor that calls for the development of powerful bioinformatics methods. A popular approach is the construction and analysis of gene networks. Such networks are often used for genome-wide representation of the complex functional organization of biological systems. Network based on similarity in gene expression are called (gene) co-expression networks. One of the major application of gene co-expression networks is the functional annotation of unknown genes. Constructing co-expression networks is generally straightforward. In contrast, the resulting network of connected genes can become very complex, which limits its biological interpretation. Several strategies can be employed to enhance the interpretation of the networks. A strategy in coherence with the biological question addressed needs to be established to infer reliable networks. Additional benefits can be gained from network-based strategies using prior knowledge and data integration to further enhance the elucidation of gene regulatory relationships. As a result, biological networks provide many more applications beyond the simple visualization of co-expressed genes. In this study we review the different approaches for co-expression network inference in plants. We analyse integrative genomics strategies used in recent studies that successfully identified candidate genes taking advantage of gene co-expression networks. Additionally, we discuss promising bioinformatics approaches that predict networks for specific purposes.
Collapse
Affiliation(s)
- Elise A. R. Serin
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
| | - Harm Nijveen
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
- Laboratory of Bioinformatics, Wageningen UniversityWageningen, Netherlands
| | - Henk W. M. Hilhorst
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
| | - Wilco Ligterink
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
- *Correspondence: Wilco Ligterink
| |
Collapse
|
48
|
Lewin A, Saadi H, Peters JE, Moreno-Moral A, Lee JC, Smith KGC, Petretto E, Bottolo L, Richardson S. MT-HESS: an efficient Bayesian approach for simultaneous association detection in OMICS datasets, with application to eQTL mapping in multiple tissues. Bioinformatics 2015; 32:523-32. [PMID: 26504141 PMCID: PMC4743623 DOI: 10.1093/bioinformatics/btv568] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Accepted: 09/03/2015] [Indexed: 01/22/2023] Open
Abstract
MOTIVATION Analysing the joint association between a large set of responses and predictors is a fundamental statistical task in integrative genomics, exemplified by numerous expression Quantitative Trait Loci (eQTL) studies. Of particular interest are the so-called ': hotspots ': , important genetic variants that regulate the expression of many genes. Recently, attention has focussed on whether eQTLs are common to several tissues, cell-types or, more generally, conditions or whether they are specific to a particular condition. RESULTS We have implemented MT-HESS, a Bayesian hierarchical model that analyses the association between a large set of predictors, e.g. SNPs, and many responses, e.g. gene expression, in multiple tissues, cells or conditions. Our Bayesian sparse regression algorithm goes beyond ': one-at-a-time ': association tests between SNPs and responses and uses a fully multivariate model search across all linear combinations of SNPs, coupled with a model of the correlation between condition/tissue-specific responses. In addition, we use a hierarchical structure to leverage shared information across different genes, thus improving the detection of hotspots. We show the increase of power resulting from our new approach in an extensive simulation study. Our analysis of two case studies highlights new hotspots that would remain undetected by standard approaches and shows how greater prediction power can be achieved when several tissues are jointly considered. AVAILABILITY AND IMPLEMENTATION C[Formula: see text] source code and documentation including compilation instructions are available under GNU licence at http://www.mrc-bsu.cam.ac.uk/software/.
Collapse
Affiliation(s)
- Alex Lewin
- Department of Mathematics, Brunel University London
| | - Habib Saadi
- Department of Epidemiology and Biostatistics, Imperial College London, London
| | - James E Peters
- Cambridge Institute for Medical Research, University of Cambridge, Cambridge, MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge
| | | | - James C Lee
- Cambridge Institute for Medical Research, University of Cambridge, Cambridge
| | - Kenneth G C Smith
- Cambridge Institute for Medical Research, University of Cambridge, Cambridge
| | - Enrico Petretto
- MRC Clinical Sciences Centre, Imperial College London, London, UK, Duke-NUS Graduate Medical School, Singapore, Singapore
| | - Leonardo Bottolo
- Department of Mathematics, Imperial College London, London, UK and Department of Medical Genetics, University of Cambridge
| | - Sylvia Richardson
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge
| |
Collapse
|
49
|
Ulcerative Colitis Database: An Integrated Database and Toolkit for Gene Function and Medication Involved in Ulcerative Colitis. Inflamm Bowel Dis 2015. [PMID: 26199991 DOI: 10.1097/mib.0000000000000411] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
BACKGROUND Over the last decade, a massive amount of well-annotated genomic data has been accumulated on the pathogenesis and therapies for ulcerative colitis (UC). However, a comprehensive repository is not available yet. METHODS Ulcerative Colitis Database (UCDB) was constructed using text mining followed by manually curating on the literature to collect the reliable information of UC-related genes, drugs, and susceptibility loci. UC DNA microarray data were collected. R packages were used to implement gene expression analysis toolkit. RESULTS UCDB includes 4 separate but closely related components: "UC GENE," "UC DRUG," "UC LOCUS," and "UC ANALYSIS." The UC GENE contains comprehensive information for 1151 UC-related genes manually curated from 2919 publications. The UC DRUG includes information for 248 drugs manually curated from 2344 publications. "UC LOCUS" includes 110 UC susceptibility SNP loci, which were collected from 12 Genome-Wide Association Studies. A comprehensive expression quantitative trait loci browser was also implemented. The UC ANALYSIS is an expression analysis toolkit for 37 UC expression array data sets, which contains 1098 samples. The toolkit can be used to do gene expression correlation, clustering, differentially expressed, and Gene Set Enrichment Analysis (GSEA). CONCLUSIONS UCDB provides a comprehensive collection of well-curated UC-related genes and drugs, and straightforward interfaces for gene expression analyses. UCDB is a useful leading resource for both basic and clinical research and will benefit UC community worldwide. UCDB is freely accessible at http://seiwertlab.uchicago.edu/UCDB.
Collapse
|
50
|
Shea CJA, Carhuatanta KAK, Wagner J, Bechmann N, Moore R, Herman JP, Jankord R. Variable impact of chronic stress on spatial learning and memory in BXD mice. Physiol Behav 2015; 150:69-77. [PMID: 26079812 DOI: 10.1016/j.physbeh.2015.06.022] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Revised: 06/04/2015] [Accepted: 06/11/2015] [Indexed: 12/15/2022]
Abstract
The effects of chronic stress on learning are highly variable across individuals. This variability stems from gene-environment interactions. However, the mechanisms by which stress affects genetic predictors of learning are unclear. Thus, we aim to determine whether the genetic pathways that predict spatial memory performance are altered by previous exposure to chronic stress. Sixty-two BXD recombinant inbred strains of mice, as well as parent strains C57BL/6J and DBA/2J, were randomly assigned as behavioral control or to a chronic variable stress paradigm and then underwent behavioral testing to assess spatial memory and learning performance using the Morris water maze. Quantitative trait loci (QTL) mapping was completed for average escape latency times for both control and stress animals. Loci on chromosomes 5 and 10 were found in both control and stress environmental populations; eight additional loci were found to be unique to either the control or stress environment. In sum, results indicate that certain genetic loci predict spatial memory performance regardless of prior stress exposure, while exposure to stress also reveals unique genetic predictors of training during the memory task. Thus, we find that genetic predictors contributing to spatial learning and memory are susceptible to the presence of chronic stress.
Collapse
Affiliation(s)
- Chloe J A Shea
- Applied Neuroscience, 711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson AFB, OH 45433, United States
| | - Kimberly A K Carhuatanta
- Applied Neuroscience, 711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson AFB, OH 45433, United States; Research Associate Program, National Research Council, National Academies of Science, Washington DC 20001, United States
| | - Jessica Wagner
- Applied Neuroscience, 711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson AFB, OH 45433, United States
| | - Naomi Bechmann
- Applied Neuroscience, 711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson AFB, OH 45433, United States; Infoscitex, Inc., Dayton, OH 45435, United States
| | - Raquel Moore
- Applied Neuroscience, 711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson AFB, OH 45433, United States; Infoscitex, Inc., Dayton, OH 45435, United States
| | - James P Herman
- Department of Psychiatry and Behavioral Neuroscience, University of Cincinnati, Cincinnati, OH 45267, United States
| | - Ryan Jankord
- Applied Neuroscience, 711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson AFB, OH 45433, United States.
| |
Collapse
|