Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Munger SC, Raghupathy N, Choi K, Simons AK, Gatti DM, Hinerfeld DA, Svenson KL, Keller MP, Attie AD, Hibbs MA, Graber JH, Chesler EJ, Churchill GA. RNA-Seq alignment to individualized genomes improves transcript abundance estimates in multiparent populations. Genetics 2014;198:59-73. [PMID: 25236449 DOI: 10.1534/genetics.114.165886] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open

For:	Munger SC, Raghupathy N, Choi K, Simons AK, Gatti DM, Hinerfeld DA, Svenson KL, Keller MP, Attie AD, Hibbs MA, Graber JH, Chesler EJ, Churchill GA. RNA-Seq alignment to individualized genomes improves transcript abundance estimates in multiparent populations. Genetics 2014;198:59-73. [PMID: 25236449 DOI: 10.1534/genetics.114.165886] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open

Number

Cited by Other Article(s)

de Jong TV, Pan Y, Rastas P, Munro D, Tutaj M, Akil H, Benner C, Chen D, Chitre AS, Chow W, Colonna V, Dalgard CL, Demos WM, Doris PA, Garrison E, Geurts AM, Gunturkun HM, Guryev V, Hourlier T, Howe K, Huang J, Kalbfleisch T, Kim P, Li L, Mahaffey S, Martin FJ, Mohammadi P, Ozel AB, Polesskaya O, Pravenec M, Prins P, Sebat J, Smith JR, Solberg Woods LC, Tabakoff B, Tracey A, Uliano-Silva M, Villani F, Wang H, Sharp BM, Telese F, Jiang Z, Saba L, Wang X, Murphy TD, Palmer AA, Kwitek AE, Dwinell MR, Williams RW, Li JZ, Chen H. A revamped rat reference genome improves the discovery of genetic diversity in laboratory rats. CELL GENOMICS 2024;4:100527. [PMID: 38537634 PMCID: PMC11019364 DOI: 10.1016/j.xgen.2024.100527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 12/26/2023] [Accepted: 02/29/2024] [Indexed: 04/09/2024]

Affiliation(s)

Tristan V de Jong Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
Yanchao Pan Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
Pasi Rastas Institute of Biotechnology, University of Helsinki, Helsinki, Finland
Daniel Munro Department of Psychiatry, University of California San Diego, San Diego, CA, USA; Department of Integrative Structural and Computational Biology, Scripps Research, San Diego, CA, USA
Monika Tutaj Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA; Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
Huda Akil Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA
Chris Benner Department of Medicine, University of California San Diego, San Diego, CA, USA
Denghui Chen Department of Psychiatry, University of California San Diego, San Diego, CA, USA
Apurva S Chitre Department of Psychiatry, University of California San Diego, San Diego, CA, USA
William Chow Tree of Life, Wellcome Sanger Institute, Cambridge, UK
Vincenza Colonna Institute of Genetics and Biophysics, National Research Council, Naples, Italy; Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Clifton L Dalgard Department of Anatomy, Physiology & Genetics, The American Genome Center, Uniformed Services University of the Health Sciences, Bethesda, MD, USA
Wendy M Demos Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA; Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
Peter A Doris The Brown Foundation Institute of Molecular Medicine, Center for Human Genetics, University of Texas Health Science Center, Houston, TX, USA
Erik Garrison Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Aron M Geurts Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
Hakan M Gunturkun Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
Victor Guryev Genome Structure and Ageing, University of Groningen, UMC, Groningen, the Netherlands
Thibaut Hourlier European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus in Hinxton, Cambridgeshire, UK
Kerstin Howe Tree of Life, Wellcome Sanger Institute, Cambridge, UK
Jun Huang Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
Ted Kalbfleisch Gluck Equine Research Center, Department of Veterinary Science, University of Kentucky, Louisville, KY, USA
Panjun Kim Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Ling Li Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA; Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, TN, USA
Spencer Mahaffey Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Fergal J Martin European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus in Hinxton, Cambridgeshire, UK
Pejman Mohammadi Center for Immunity and Immunotherapies, Seattle Children's Research Institute, Seattle, WA, USA; Department of Pediatrics, University of Washington School of Medicine, Seattle, WA, USA
Ayse Bilge Ozel Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
Oksana Polesskaya Department of Psychiatry, University of California San Diego, San Diego, CA, USA
Michal Pravenec Institute of Physiology, Czech Academy of Sciences, Prague, Czechia
Pjotr Prins Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Jonathan Sebat Department of Psychiatry, University of California San Diego, San Diego, CA, USA
Jennifer R Smith Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA; Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
Leah C Solberg Woods Department of Internal Medicine, Section on Molecular Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
Boris Tabakoff Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Alan Tracey Tree of Life, Wellcome Sanger Institute, Cambridge, UK
Marcela Uliano-Silva Tree of Life, Wellcome Sanger Institute, Cambridge, UK
Flavia Villani Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Hongyang Wang Department of Animal Sciences, Washington State University, Pullman, WA, USA
Burt M Sharp Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Francesca Telese Department of Psychiatry, University of California San Diego, San Diego, CA, USA
Zhihua Jiang Department of Animal Sciences, Washington State University, Pullman, WA, USA
Laura Saba Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Xusheng Wang Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA; Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, TN, USA
Terence D Murphy National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
Abraham A Palmer Department of Psychiatry, University of California San Diego, San Diego, CA, USA; Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
Anne E Kwitek Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA; Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
Melinda R Dwinell Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA; Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
Robert W Williams Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Jun Z Li Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
Hao Chen Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA.

Collapse

Coombes B, Lux T, Akhunov E, Hall A. Introgressions lead to reference bias in wheat RNA-seq analysis. BMC Biol 2024;22:56. [PMID: 38454464 PMCID: PMC10921782 DOI: 10.1186/s12915-024-01853-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 02/21/2024] [Indexed: 03/09/2024] Open

Ball RL, Bogue MA, Liang H, Srivastava A, Ashbrook DG, Lamoureux A, Gerring MW, Hatoum AS, Kim MJ, He H, Emerson J, Berger AK, Walton DO, Sheppard K, El Kassaby B, Castellanos F, Kunde-Ramamoorthy G, Lu L, Bluis J, Desai S, Sundberg BA, Peltz G, Fang Z, Churchill GA, Williams RW, Agrawal A, Bult CJ, Philip VM, Chesler EJ. GenomeMUSter mouse genetic variation service enables multitrait, multipopulation data integration and analysis. Genome Res 2024;34:145-159. [PMID: 38290977 PMCID: PMC10903950 DOI: 10.1101/gr.278157.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 01/10/2024] [Indexed: 02/01/2024]

Abstract

Hundreds of inbred mouse strains and intercross populations have been used to characterize the function of genetic variants that contribute to disease. Thousands of disease-relevant traits have been characterized in mice and made publicly available. New strains and populations including consomics, the collaborative cross, expanded BXD, and inbred wild-derived strains add to existing complex disease mouse models, mapping populations, and sensitized backgrounds for engineered mutations. The genome sequences of inbred strains, along with dense genotypes from others, enable integrated analysis of trait-variant associations across populations, but these analyses are hampered by the sparsity of genotypes available. Moreover, the data are not readily interoperable with other resources. To address these limitations, we created a uniformly dense variant resource by harmonizing multiple data sets. Missing genotypes were imputed using the Viterbi algorithm with a data-driven technique that incorporates local phylogenetic information, an approach that is extendable to other model organisms. The result is a web- and programmatically accessible data service called GenomeMUSter, comprising single-nucleotide variants covering 657 strains at 106.8 million segregating sites. Interoperation with phenotype databases, analytic tools, and other resources enable a wealth of applications, including multitrait, multipopulation meta-analysis. We show this in cross-species comparisons of type 2 diabetes and substance use disorder meta-analyses, leveraging mouse data to characterize the likely role of human variant effects in disease. Other applications include refinement of mapped loci and prioritization of strain backgrounds for disease modeling to further unlock extant mouse diversity for genetic and genomic studies in health and disease.

Collapse

Affiliation(s)

Robyn L Ball The Jackson Laboratory, Bar Harbor, Maine 04609, USA;
Molly A Bogue The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Hongping Liang The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Anuj Srivastava The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
David G Ashbrook University of Tennessee Health Science Center, Memphis, Tennessee 38163, USA
Anna Lamoureux The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Matthew W Gerring The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Alexander S Hatoum Psychological and Brain Sciences, Washington University in St. Louis, St. Louis, Missouri 63130, USA Artificial Intelligence and the Internet of Things Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA
Matthew J Kim University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada
Hao He The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Jake Emerson The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Alexander K Berger The Jackson Laboratory, Bar Harbor, Maine 04609, USA
David O Walton The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Keith Sheppard The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Baha El Kassaby The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Francisco Castellanos The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
Govindarajan Kunde-Ramamoorthy The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Lu Lu University of Tennessee Health Science Center, Memphis, Tennessee 38163, USA
John Bluis The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Sejal Desai The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Beth A Sundberg The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Gary Peltz Department of Anesthesia, Pain and Perioperative Medicine, Stanford University School of Medicine, Stanford, California 94305, USA
Zhuoqing Fang Department of Anesthesia, Pain and Perioperative Medicine, Stanford University School of Medicine, Stanford, California 94305, USA
Gary A Churchill The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Robert W Williams University of Tennessee Health Science Center, Memphis, Tennessee 38163, USA
Arpana Agrawal Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri 63110, USA
Carol J Bult The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Vivek M Philip The Jackson Laboratory, Bar Harbor, Maine 04609, USA
Elissa J Chesler The Jackson Laboratory, Bar Harbor, Maine 04609, USA

Collapse

Meade RK, Long JE, Jinich A, Rhee KY, Ashbrook DG, Williams RW, Sassetti CM, Smith CM. Genome-wide screen identifies host loci that modulate Mycobacterium tuberculosis fitness in immunodivergent mice. G3 (BETHESDA, MD.) 2023;13:jkad147. [PMID: 37405387 PMCID: PMC10468300 DOI: 10.1093/g3journal/jkad147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 06/05/2023] [Accepted: 06/27/2023] [Indexed: 07/06/2023]

Abstract

Genetic differences among mammalian hosts and among strains of Mycobacterium tuberculosis (Mtb) are well-established determinants of tuberculosis (TB) patient outcomes. The advent of recombinant inbred mouse panels and next-generation transposon mutagenesis and sequencing approaches has enabled dissection of complex host-pathogen interactions. To identify host and pathogen genetic determinants of Mtb pathogenesis, we infected members of the highly diverse BXD family of strains with a comprehensive library of Mtb transposon mutants (TnSeq). Members of the BXD family segregate for Mtb-resistant C57BL/6J (B6 or B) and Mtb-susceptible DBA/2J (D2 or D) haplotypes. The survival of each bacterial mutant was quantified within each BXD host, and we identified those bacterial genes that were differentially required for Mtb fitness across BXD genotypes. Mutants that varied in survival among the host family of strains were leveraged as reporters of "endophenotypes," each bacterial fitness profile directly probing specific components of the infection microenvironment. We conducted quantitative trait loci (QTL) mapping of these bacterial fitness endophenotypes and identified 140 host-pathogen QTL (hpQTL). We located a QTL hotspot on chromosome 6 (75.97-88.58 Mb) associated with the genetic requirement of multiple Mtb genes: Rv0127 (mak), Rv0359 (rip2), Rv0955 (perM), and Rv3849 (espR). Together, this screen reinforces the utility of bacterial mutant libraries as precise reporters of the host immunological microenvironment during infection and highlights specific host-pathogen genetic interactions for further investigation. To enable downstream follow-up for both bacterial and mammalian genetic research communities, all bacterial fitness profiles have been deposited into GeneNetwork.org and added into the comprehensive collection of TnSeq libraries in MtbTnDB.

Collapse

Wu EY, Singh NP, Choi K, Zakeri M, Vincent M, Churchill GA, Ackert-Bicknell CL, Patro R, Love MI. SEESAW: detecting isoform-level allelic imbalance accounting for inferential uncertainty. Genome Biol 2023;24:165. [PMID: 37438847 PMCID: PMC10337143 DOI: 10.1186/s13059-023-03003-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 06/29/2023] [Indexed: 07/14/2023] Open

Huynh K, Smith BR, Macdonald SJ, Long AD. Genetic variation in chromatin state across multiple tissues in Drosophila melanogaster. PLoS Genet 2023;19:e1010439. [PMID: 37146087 PMCID: PMC10191298 DOI: 10.1371/journal.pgen.1010439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 05/17/2023] [Accepted: 04/20/2023] [Indexed: 05/07/2023] Open

Perez BC, Bink MCAM, Svenson KL, Churchill GA, Calus MPL. Adding gene transcripts into genomic prediction improves accuracy and reveals sampling time dependence. G3 (BETHESDA, MD.) 2022;12:jkac258. [PMID: 36161485 PMCID: PMC9635642 DOI: 10.1093/g3journal/jkac258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Accepted: 09/07/2022] [Indexed: 06/16/2023]

Gobet N, Jan M, Franken P, Xenarios I. Towards mouse genetic-specific RNA-sequencing read mapping. PLoS Comput Biol 2022;18:e1010552. [PMID: 36155976 PMCID: PMC9536569 DOI: 10.1371/journal.pcbi.1010552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 10/06/2022] [Accepted: 09/07/2022] [Indexed: 11/18/2022] Open

Abstract

Genetic variations affect behavior and cause disease but understanding how these variants drive complex traits is still an open question. A common approach is to link the genetic variants to intermediate molecular phenotypes such as the transcriptome using RNA-sequencing (RNA-seq). Paradoxically, these variants between the samples are usually ignored at the beginning of RNA-seq analyses of many model organisms. This can skew the transcriptome estimates that are used later for downstream analyses, such as expression quantitative trait locus (eQTL) detection. Here, we assessed the impact of reference-based analysis on the transcriptome and eQTLs in a widely-used mouse genetic population: the BXD panel of recombinant inbred lines. We highlight existing reference bias in the transcriptome data analysis and propose practical solutions which combine available genetic variants, genotypes, and genome reference sequence. The use of custom BXD line references improved downstream analysis compared to classical genome reference. These insights would likely benefit genetic studies with a transcriptomic component and demonstrate that genome references need to be reassessed and improved.

To understand how genetic variations affect behavior and cause disease it is common to quantify expression of transcripts by sequencing. Transcripts are extracted, fragmented, and the sequence of the fragments read. An important step for their quantification is to virtually assign the different fragments to the transcript they originate from using a reference genome. Reference genomes are costly to build, so usually only one high-quality reference per animal model species is available. When comparing genetically different individuals, using a single reference may introduce a bias because it might be more similar to some individuals than to others. Paradoxically, the variations at the core of genetic studies are thus ignored at the start of the analysis. We built customized references with known genetic variants for each of the mouse lines we had and quantified the impact of the reference at different levels of the bioinformatic analysis. We found that using customized references reduced the bias compared to using a single reference. Our study uses publicly available data and tools, so others can easily implement this improvement in their analyses.

Collapse

Guo W, Coulter M, Waugh R, Zhang R. The value of genotype-specific reference for transcriptome analyses in barley. Life Sci Alliance 2022;5:5/8/e202101255. [PMID: 35459738 PMCID: PMC9034525 DOI: 10.26508/lsa.202101255] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 04/10/2022] [Accepted: 04/11/2022] [Indexed: 12/31/2022] Open

Thomas SM, Ackert-Bicknell CL, Zuscik MJ, Payne KA. Understanding the Transcriptomic Landscape to Drive New Innovations in Musculoskeletal Regenerative Medicine. Curr Osteoporos Rep 2022;20:141-152. [PMID: 35156183 DOI: 10.1007/s11914-022-00726-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/18/2022] [Indexed: 11/03/2022]

Zhou X, Sam TW, Lee AY, Leung D. Mouse strain-specific polymorphic provirus functions as cis-regulatory element leading to epigenomic and transcriptomic variations. Nat Commun 2021;12:6462. [PMID: 34753915 PMCID: PMC8578388 DOI: 10.1038/s41467-021-26630-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 10/14/2021] [Indexed: 12/27/2022] Open

Spaulding EL, Hines TJ, Bais P, Tadenev ALD, Schneider R, Jewett D, Pattavina B, Pratt SL, Morelli KH, Stum MG, Hill DP, Gobet C, Pipis M, Reilly MM, Jennings MJ, Horvath R, Bai Y, Shy ME, Alvarez-Castelao B, Schuman EM, Bogdanik LP, Storkebaum E, Burgess RW. The integrated stress response contributes to tRNA synthetase-associated peripheral neuropathy. Science 2021;373:1156-1161. [PMID: 34516839 PMCID: PMC8908546 DOI: 10.1126/science.abb3414] [Citation(s) in RCA: 66] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Affiliation(s)

E. L. Spaulding The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA Graduate School of Biomedical Science and Engineering, University of Maine, Orono, ME 04469, USA
T. J. Hines The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
P. Bais The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
A. L. D. Tadenev The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
R. Schneider The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
D. Jewett The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
B. Pattavina The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
S. L. Pratt The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA Neuroscience Program, Graduate School of Biomedical Sciences, Tufts University, Boston, MA, 02111 USA
K. H. Morelli The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA Graduate School of Biomedical Science and Engineering, University of Maine, Orono, ME 04469, USA
M. G. Stum The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
D. P. Hill The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
C. Gobet School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
M. Pipis MRC Centre for Neuromuscular Diseases, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK
M. M. Reilly MRC Centre for Neuromuscular Diseases, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK
M. J. Jennings Department of Clinical Neuroscience, University of Cambridge, Cambridge, UK
R. Horvath Department of Clinical Neuroscience, University of Cambridge, Cambridge, UK
Y. Bai Department of Neurology, Carver College of Medicine, University of Iowa, Iowa City, Iowa, USA
M. E. Shy Department of Neurology, Carver College of Medicine, University of Iowa, Iowa City, Iowa, USA
B. Alvarez-Castelao Max Planck Institute for Brain Research, Frankfurt, Germany
E. M. Schuman Max Planck Institute for Brain Research, Frankfurt, Germany
L. P. Bogdanik The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
E. Storkebaum Department of Molecular Neurobiology, Donders Institute for Brain, Cognition and Behaviour and Faculty of Science, Radboud University, Nijmegen, Netherlands
R. W. Burgess The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA Graduate School of Biomedical Science and Engineering, University of Maine, Orono, ME 04469, USA Neuroscience Program, Graduate School of Biomedical Sciences, Tufts University, Boston, MA, 02111 USA

Collapse

Que E, James KL, Coffey AR, Smallwood TL, Albright J, Huda MN, Pomp D, Sethupathy P, Bennett BJ. Genetic architecture modulates diet-induced hepatic mRNA and miRNA expression profiles in Diversity Outbred mice. Genetics 2021;218:6321522. [PMID: 34849860 PMCID: PMC8757298 DOI: 10.1093/genetics/iyab068] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 07/27/2020] [Indexed: 11/30/2022] Open

Boatwright JL, Yeh CT, Hu HC, Susanna A, Soltis DE, Soltis PS, Schnable PS, Barbazuk WB. Trajectories of Homoeolog-Specific Expression in Allotetraploid Tragopogon castellanus Populations of Independent Origins. FRONTIERS IN PLANT SCIENCE 2021;12:679047. [PMID: 34249049 PMCID: PMC8261302 DOI: 10.3389/fpls.2021.679047] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 05/20/2021] [Indexed: 06/13/2023]

Abstract

Polyploidization can have a significant ecological and evolutionary impact by providing substantially more genetic material that may result in novel phenotypes upon which selection may act. While the effects of polyploidization are broadly reviewed across the plant tree of life, the reproducibility of these effects within naturally occurring, independently formed polyploids is poorly characterized. The flowering plant genus Tragopogon (Asteraceae) offers a rare glimpse into the intricacies of repeated allopolyploid formation with both nascent (< 90 years old) and more ancient (mesopolyploids) formations. Neo- and mesopolyploids in Tragopogon have formed repeatedly and have extant diploid progenitors that facilitate the comparison of genome evolution after polyploidization across a broad span of evolutionary time. Here, we examine four independently formed lineages of the mesopolyploid Tragopogon castellanus for homoeolog expression changes and fractionation after polyploidization. We show that expression changes are remarkably similar among these independently formed polyploid populations with large convergence among expressed loci, moderate convergence among loci lost, and stochastic silencing. We further compare and contrast these results for T. castellanus with two nascent Tragopogon allopolyploids. While homoeolog expression bias was balanced in both nascent polyploids and T. castellanus, the degree of additive expression was significantly different, with the mesopolyploid populations demonstrating more non-additive expression. We suggest that gene dosage and expression noise minimization may play a prominent role in regulating gene expression patterns immediately after allopolyploidization as well as deeper into time, and these patterns are conserved across independent polyploid lineages.

Collapse

Miller BR, Morse AM, Borgert JE, Liu Z, Sinclair K, Gamble G, Zou F, Newman JRB, León-Novelo LG, Marroni F, McIntyre LM. Testcrosses are an efficient strategy for identifying cis-regulatory variation: Bayesian analysis of allele-specific expression (BayesASE). G3 (BETHESDA, MD.) 2021;11:jkab096. [PMID: 33772539 PMCID: PMC8104932 DOI: 10.1093/g3journal/jkab096] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Accepted: 03/10/2021] [Indexed: 12/30/2022]

Abstract

Allelic imbalance (AI) occurs when alleles in a diploid individual are differentially expressed and indicates cis acting regulatory variation. What is the distribution of allelic effects in a natural population? Are all alleles the same? Are all alleles distinct? The approach described applies to any technology generating allele-specific sequence counts, for example for chromatin accessibility and can be applied generally including to comparisons between tissues or environments for the same genotype. Tests of allelic effect are generally performed by crossing individuals and comparing expression between alleles directly in the F1. However, a crossing scheme that compares alleles pairwise is a prohibitive cost for more than a handful of alleles as the number of crosses is at least (n2-n)/2 where n is the number of alleles. We show here that a testcross design followed by a hypothesis test of AI between testcrosses can be used to infer differences between nontester alleles, allowing n alleles to be compared with n crosses. Using a mouse data set where both testcrosses and direct comparisons have been performed, we show that the predicted differences between nontester alleles are validated at levels of over 90% when a parent-of-origin effect is present and of 60%-80% overall. Power considerations for a testcross, are similar to those in a reciprocal cross. In all applications, the testing for AI involves several complex bioinformatics steps. BayesASE is a complete bioinformatics pipeline that incorporates state-of-the-art error reduction techniques and a flexible Bayesian approach to estimating AI and formally comparing levels of AI between conditions. The modular structure of BayesASE has been packaged in Galaxy, made available in Nextflow and as a collection of scripts for the SLURM workload manager on github (https://github.com/McIntyre-Lab/BayesASE).

Collapse

Zhan S, Griswold C, Lukens L. Zea mays RNA-seq estimated transcript abundances are strongly affected by read mapping bias. BMC Genomics 2021;22:285. [PMID: 33874908 PMCID: PMC8056621 DOI: 10.1186/s12864-021-07577-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 03/30/2021] [Indexed: 11/27/2022] Open

Abstract

Background

Genetic variation for gene expression is a source of phenotypic variation for natural and agricultural species. The common approach to map and to quantify gene expression from genetically distinct individuals is to assign their RNA-seq reads to a single reference genome. However, RNA-seq reads from alleles dissimilar to this reference genome may fail to map correctly, causing transcript levels to be underestimated. Presently, the extent of this mapping problem is not clear, particularly in highly diverse species. We investigated if mapping bias occurred and if chromosomal features associated with mapping bias. Zea mays presents a model species to assess these questions, given it has genotypically distinct and well-studied genetic lines.

Results

In Zea mays, the inbred B73 genome is the standard reference genome and template for RNA-seq read assignments. In the absence of mapping bias, B73 and a second inbred line, Mo17, would each have an approximately equal number of regulatory alleles that increase gene expression. Remarkably, Mo17 had 2–4 times fewer such positively acting alleles than did B73 when RNA-seq reads were aligned to the B73 reference genome. Reciprocally, over one-half of the B73 alleles that increased gene expression were not detected when reads were aligned to the Mo17 genome template. Genes at dissimilar chromosomal ends were strongly affected by mapping bias, and genes at more similar pericentromeric regions were less affected. Biased transcript estimates were higher in untranslated regions and lower in splice junctions. Bias occurred across software and alignment parameters.

Conclusions

Mapping bias very strongly affects gene transcript abundance estimates in maize, and bias varies across chromosomal features. Individual genome or transcriptome templates are likely necessary for accurate transcript estimation across genetically variable individuals in maize and other species.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-021-07577-3.

Collapse

Patro R, Salmela L. Algorithms meet sequencing technologies - 10th edition of the RECOMB-Seq workshop. iScience 2021;24:101956. [PMID: 33437938 PMCID: PMC7788091 DOI: 10.1016/j.isci.2020.101956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Melia T, Waxman DJ. Genetic factors contributing to extensive variability of sex-specific hepatic gene expression in Diversity Outbred mice. PLoS One 2020;15:e0242665. [PMID: 33264334 PMCID: PMC7710091 DOI: 10.1371/journal.pone.0242665] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2020] [Accepted: 11/09/2020] [Indexed: 12/12/2022] Open

Abstract

Sex-specific transcription characterizes hundreds of genes in mouse liver, many implicated in sex-differential drug and lipid metabolism and disease susceptibility. While the regulation of liver sex differences by growth hormone-activated STAT5 is well established, little is known about autosomal genetic factors regulating the sex-specific liver transcriptome. Here we show, using genotyping and expression data from a large population of Diversity Outbred mice, that genetic factors work in tandem with growth hormone to control the individual variability of hundreds of sex-biased genes, including many long non-coding RNA genes. Significant associations between single nucleotide polymorphisms and sex-specific gene expression were identified as expression quantitative trait loci (eQTLs), many of which showed strong sex-dependent associations. Remarkably, autosomal genetic modifiers of sex-specific genes were found to account for more than 200 instances of gain or loss of sex-specificity across eight Diversity Outbred mouse founder strains. Sex-biased STAT5 binding sites and open chromatin regions with strain-specific variants were significantly enriched at eQTL regions regulating correspondingly sex-specific genes, supporting the proposed functional regulatory nature of the eQTL regions identified. Binding of the male-biased, growth hormone-regulated repressor BCL6 was most highly enriched at trans-eQTL regions controlling female-specific genes. Co-regulated gene clusters defined by overlapping eQTLs included sets of highly correlated genes from different chromosomes, further supporting trans-eQTL action. These findings elucidate how an unexpectedly large number of autosomal factors work in tandem with growth hormone signaling pathways to regulate the individual variability associated with sex differences in liver metabolism and disease.

Collapse

Srivastava A, Malik L, Sarkar H, Zakeri M, Almodaresi F, Soneson C, Love MI, Kingsford C, Patro R. Alignment and mapping methodology influence transcript abundance estimation. Genome Biol 2020;21:239. [PMID: 32894187 PMCID: PMC7487471 DOI: 10.1186/s13059-020-02151-8] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 08/19/2020] [Indexed: 01/23/2023] Open

Liang ZS, Cimino I, Yalcin B, Raghupathy N, Vancollie VE, Ibarra-Soria X, Firth HV, Rimmington D, Farooqi IS, Lelliott CJ, Munger SC, O’Rahilly S, Ferguson-Smith AC, Coll AP, Logan DW. Trappc9 deficiency causes parent-of-origin dependent microcephaly and obesity. PLoS Genet 2020;16:e1008916. [PMID: 32877400 PMCID: PMC7467316 DOI: 10.1371/journal.pgen.1008916] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Accepted: 06/08/2020] [Indexed: 11/30/2022] Open

Affiliation(s)

Zhengzheng S. Liang Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, United Kingdom
Irene Cimino MRC Metabolic Diseases Unit, Wellcome Trust-Medical Research Council Institute of Metabolic Science, University of Cambridge, Cambridge, United Kingdom
Binnaz Yalcin Institut de Génétique et de Biologie Moléculaire et Cellulaire, Centre National de la Recherche Scientifique, Institut National de la Santé et de la Recherche Médicale, Université de Strasbourg, France
Narayanan Raghupathy The Jackson Laboratory, Bar Harbor, Maine, United States of America
Valerie E. Vancollie Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, United Kingdom
Ximena Ibarra-Soria Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, United Kingdom
Helen V. Firth Department of Clinical Genetics, Addenbrooke’s Hospital, Cambridge, United Kingdom
Debra Rimmington MRC Metabolic Diseases Unit, Wellcome Trust-Medical Research Council Institute of Metabolic Science, University of Cambridge, Cambridge, United Kingdom
I. Sadaf Farooqi University of Cambridge Metabolic Research Laboratories and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge, United Kingdom
Christopher J. Lelliott Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, United Kingdom
Steven C. Munger The Jackson Laboratory, Bar Harbor, Maine, United States of America
Stephen O’Rahilly MRC Metabolic Diseases Unit, Wellcome Trust-Medical Research Council Institute of Metabolic Science, University of Cambridge, Cambridge, United Kingdom
Anne C. Ferguson-Smith Department of Genetics, University of Cambridge, Cambridge, United Kingdom
Anthony P. Coll MRC Metabolic Diseases Unit, Wellcome Trust-Medical Research Council Institute of Metabolic Science, University of Cambridge, Cambridge, United Kingdom
Darren W. Logan Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, United Kingdom

Collapse

Que E, James KL, Coffey AR, Smallwood TL, Albright J, Huda MN, Pomp D, Sethupathy P, Bennett BJ. Genetic Architecture Modulates Diet-Induced Hepatic mRNA and miRNA Expression Profiles in Diversity Outbred Mice. Genetics 2020;216:241-259. [PMID: 32763908 PMCID: PMC7463293 DOI: 10.1534/genetics.120.303481] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 07/27/2020] [Indexed: 02/07/2023] Open

Groza C, Kwan T, Soranzo N, Pastinen T, Bourque G. Personalized and graph genomes reveal missing signal in epigenomic data. Genome Biol 2020;21:124. [PMID: 32450900 PMCID: PMC7249353 DOI: 10.1186/s13059-020-02038-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Accepted: 05/08/2020] [Indexed: 12/21/2022] Open

Raghupathy N, Choi K, Vincent MJ, Beane GL, Sheppard KS, Munger SC, Korstanje R, Pardo-Manual de Villena F, Churchill GA. Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression. Bioinformatics 2019;34:2177-2184. [PMID: 29444201 DOI: 10.1093/bioinformatics/bty078] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 02/09/2018] [Indexed: 02/06/2023] Open

Abstract

Motivation

Allele-specific expression (ASE) refers to the differential abundance of the allelic copies of a transcript. RNA sequencing (RNA-seq) can provide quantitative estimates of ASE for genes with transcribed polymorphisms. When short-read sequences are aligned to a diploid transcriptome, read-mapping ambiguities confound our ability to directly count reads. Multi-mapping reads aligning equally well to multiple genomic locations, isoforms or alleles can comprise the majority (>85%) of reads. Discarding them can result in biases and substantial loss of information. Methods have been developed that use weighted allocation of read counts but these methods treat the different types of multi-reads equivalently. We propose a hierarchical approach to allocation of read counts that first resolves ambiguities among genes, then among isoforms, and lastly between alleles. We have implemented our model in EMASE software (Expectation-Maximization for Allele Specific Expression) to estimate total gene expression, isoform usage and ASE based on this hierarchical allocation.

Results

Methods that align RNA-seq reads to a diploid transcriptome incorporating known genetic variants improve estimates of ASE and total gene expression compared to methods that use reference genome alignments. Weighted allocation methods outperform methods that discard multi-reads. Hierarchical allocation of reads improves estimation of ASE even when data are simulated from a non-hierarchical model. Analysis of RNA-seq data from F1 hybrid mice using EMASE reveals widespread ASE associated with cis-acting polymorphisms and a small number of parent-of-origin effects.

Availability and implementation

EMASE software is available at https://github.com/churchill-lab/emase.

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Bioinformatic methods for cancer neoantigen prediction. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2019;164:25-60. [PMID: 31383407 DOI: 10.1016/bs.pmbts.2019.06.016] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Skelly DA, Raghupathy N, Robledo RF, Graber JH, Chesler EJ. Reference Trait Analysis Reveals Correlations Between Gene Expression and Quantitative Traits in Disjoint Samples. Genetics 2019;212:919-929. [PMID: 31113812 PMCID: PMC6614885 DOI: 10.1534/genetics.118.301865] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 05/14/2019] [Indexed: 12/21/2022] Open

Abstract

Systems genetic analysis of complex traits involves the integrated analysis of genetic, genomic, and disease-related measures. However, these data are often collected separately across multiple study populations, rendering direct correlation of molecular features to complex traits impossible. Recent transcriptome-wide association studies (TWAS) have harnessed gene expression quantitative trait loci (eQTL) to associate unmeasured gene expression with a complex trait in genotyped individuals, but this approach relies primarily on strong eQTL. We propose a simple and powerful alternative strategy for correlating independently obtained sets of complex traits and molecular features. In contrast to TWAS, our approach gains precision by correlating complex traits through a common set of continuous phenotypes instead of genetic predictors, and can identify transcript-trait correlations for which the regulation is not genetic. In our approach, a set of multiple quantitative "reference" traits is measured across all individuals, while measures of the complex trait of interest and transcriptional profiles are obtained in disjoint subsamples. A conventional multivariate statistical method, canonical correlation analysis, is used to relate the reference traits and traits of interest to identify gene expression correlates. We evaluate power and sample size requirements of this methodology, as well as performance relative to other methods, via extensive simulation and analysis of a behavioral genetics experiment in 258 Diversity Outbred mice involving two independent sets of anxiety-related behaviors and hippocampal gene expression. After splitting the data set and hiding one set of anxiety-related traits in half the samples, we identified transcripts correlated with the hidden traits using the other set of anxiety-related traits and exploiting the highest canonical correlation (R = 0.69) between the trait data sets. We demonstrate that this approach outperforms TWAS in identifying associated transcripts. Together, these results demonstrate the validity, reliability, and power of reference trait analysis for identifying relations between complex traits and their molecular substrates.

Collapse

Melia T, Waxman DJ. Sex-Biased lncRNAs Inversely Correlate With Sex-Opposite Gene Coexpression Networks in Diversity Outbred Mouse Liver. Endocrinology 2019;160:989-1007. [PMID: 30840070 PMCID: PMC6449536 DOI: 10.1210/en.2018-00949] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Accepted: 02/27/2019] [Indexed: 01/05/2023]

Abstract

Sex differences in liver gene expression are determined by pituitary growth hormone secretion patterns, which regulate sex-dependent liver transcription factors and establish sex-specific chromatin states. Hypophysectomy (hypox) identifies two major classes of liver sex-biased genes, defined by their sex-dependent positive or negative responses to pituitary hormone ablation. However, the mechanisms that underlie each hypox-response class are unknown. We sought to discover candidate, regulatory, long noncoding RNAs (lncRNAs) controlling responsiveness to hypox. We characterized gene structures and expression patterns for 15,558 mouse liver-expressed lncRNAs, including many sex-specific lncRNAs regulated during postnatal development or subject to circadian regulation. Using the high natural allelic variance of Diversity Outbred (DO) mice, we discovered tightly coexpressed clusters of sex-specific protein-coding genes (gene modules) in male and female DO liver. Remarkably, many gene modules were strongly enriched for sex-specific genes within a single hypox-response class, indicating that the genetic heterogeneity of DO mice encompasses responsiveness to hypox. Moreover, several distant gene modules were enriched for gene subsets of the same hypox-response class, highlighting the complex regulation of hypox-responsiveness. Finally, we identified eight sex-specific lncRNAs with strong negative regulatory potential, as indicated by their strong negative correlation of expression across DO mouse livers with that of protein-coding gene modules enriched for genes of the opposite sex bias and inverse hypox-response class. These findings reveal an important role for genetic factors in regulating responsiveness to hypox, and present testable hypotheses for the roles of sex-biased liver lncRNAs in controlling the sex-bias of liver gene expression.

Collapse

Zhao C, Xie S, Wu H, Luan Y, Hu S, Ni J, Lin R, Zhao S, Zhang D, Li X. Quantification of allelic differential expression using a simple Fluorescence primer PCR-RFLP-based method. Sci Rep 2019;9:6334. [PMID: 31004110 PMCID: PMC6474871 DOI: 10.1038/s41598-019-42815-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 03/29/2019] [Indexed: 12/04/2022] Open

Affiliation(s)

Changzhi Zhao Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Shengsong Xie Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China.,The Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Hui Wu Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Yu Luan Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Suqin Hu Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Juan Ni Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Ruiyi Lin Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Shuhong Zhao Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China.,The Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, 430070, P.R. China
Dingxiao Zhang Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China. .,The Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, 430070, P.R. China.
Xinyun Li Key Laboratory of Agricultural Animal Genetics, Breeding, and Reproduction of the Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, 430070, P.R. China. .,The Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, 430070, P.R. China.

Collapse

Qu W, Gurdziel K, Pique-Regi R, Ruden DM. Lead Modulates trans- and cis-Expression Quantitative Trait Loci (eQTLs) in Drosophila melanogaster Heads. Front Genet 2018;9:395. [PMID: 30294342 PMCID: PMC6158337 DOI: 10.3389/fgene.2018.00395] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 08/30/2018] [Indexed: 11/13/2022] Open

A Robust Methodology for Assessing Differential Homeolog Contributions to the Transcriptomes of Allopolyploids. Genetics 2018;210:883-894. [PMID: 30213855 DOI: 10.1534/genetics.118.301564] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 09/07/2018] [Indexed: 12/18/2022] Open

Liu X, MacLeod JN, Liu J. iMapSplice: Alleviating reference bias through personalized RNA-seq alignment. PLoS One 2018;13:e0201554. [PMID: 30096157 PMCID: PMC6086400 DOI: 10.1371/journal.pone.0201554] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Accepted: 07/17/2018] [Indexed: 11/19/2022] Open

Winter JM, Curry NL, Gildea DM, Williams KA, Lee M, Hu Y, Crawford NPS. Modifier locus mapping of a transgenic F2 mouse population identifies CCDC115 as a novel aggressive prostate cancer modifier gene in humans. BMC Genomics 2018;19:450. [PMID: 29890952 PMCID: PMC5996485 DOI: 10.1186/s12864-018-4827-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Accepted: 05/25/2018] [Indexed: 12/16/2022] Open

Abstract

BACKGROUND

It is well known that development of prostate cancer (PC) can be attributed to somatic mutations of the genome, acquired within proto-oncogenes or tumor-suppressor genes. What is less well understood is how germline variation contributes to disease aggressiveness in PC patients. To map germline modifiers of aggressive neuroendocrine PC, we generated a genetically diverse F2 intercross population using the transgenic TRAMP mouse model and the wild-derived WSB/EiJ (WSB) strain. The relevance of germline modifiers of aggressive PC identified in these mice was extensively correlated in human PC datasets and functionally validated in cell lines.

RESULTS

Aggressive PC traits were quantified in a population of 30 week old (TRAMP x WSB) F2 mice (n = 307). Correlation of germline genotype with aggressive disease phenotype revealed seven modifier loci that were significantly associated with aggressive disease. RNA-seq were analyzed using cis-eQTL and trait correlation analyses to identify candidate genes within each of these loci. Analysis of 92 (TRAMP x WSB) F2 prostates revealed 25 candidate genes that harbored both a significant cis-eQTL and mRNA expression correlations with an aggressive PC trait. We further delineated these candidate genes based on their clinical relevance, by interrogating human PC GWAS and PC tumor gene expression datasets. We identified four genes (CCDC115, DNAJC10, RNF149, and STYXL1), which encompassed all of the following characteristics: 1) one or more germline variants associated with aggressive PC traits; 2) differential mRNA levels associated with aggressive PC traits; and 3) differential mRNA expression between normal and tumor tissue. Functional validation studies of these four genes using the human LNCaP prostate adenocarcinoma cell line revealed ectopic overexpression of CCDC115 can significantly impede cell growth in vitro and tumor growth in vivo. Furthermore, CCDC115 human prostate tumor expression was associated with better survival outcomes.

CONCLUSION

We have demonstrated how modifier locus mapping in mouse models of PC, coupled with in silico analyses of human PC datasets, can reveal novel germline modifier genes of aggressive PC. We have also characterized CCDC115 as being associated with less aggressive PC in humans, placing it as a potential prognostic marker of aggressive PC.

Collapse

Genetic Drivers of Pancreatic Islet Function. Genetics 2018;209:335-356. [PMID: 29567659 DOI: 10.1534/genetics.118.300864] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 03/19/2018] [Indexed: 01/03/2023] Open

Abstract

The majority of gene loci that have been associated with type 2 diabetes play a role in pancreatic islet function. To evaluate the role of islet gene expression in the etiology of diabetes, we sensitized a genetically diverse mouse population with a Western diet high in fat (45% kcal) and sucrose (34%) and carried out genome-wide association mapping of diabetes-related phenotypes. We quantified mRNA abundance in the islets and identified 18,820 expression QTL. We applied mediation analysis to identify candidate causal driver genes at loci that affect the abundance of numerous transcripts. These include two genes previously associated with monogenic diabetes (PDX1 and HNF4A), as well as three genes with nominal association with diabetes-related traits in humans (FAM83E, IL6ST, and SAT2). We grouped transcripts into gene modules and mapped regulatory loci for modules enriched with transcripts specific for α-cells, and another specific for δ-cells. However, no single module enriched for β-cell-specific transcripts, suggesting heterogeneity of gene expression patterns within the β-cell population. A module enriched in transcripts associated with branched-chain amino acid metabolism was the most strongly correlated with physiological traits that reflect insulin resistance. Although the mice in this study were not overtly diabetic, the analysis of pancreatic islet gene expression under dietary-induced stress enabled us to identify correlated variation in groups of genes that are functionally linked to diabetes-associated physiological traits. Our analysis suggests an expected degree of concordance between diabetes-associated loci in the mouse and those found in human populations, and demonstrates how the mouse can provide evidence to support nominal associations found in human genome-wide association mapping.

Collapse

Direct Testing for Allele-Specific Expression Differences Between Conditions. G3-GENES GENOMES GENETICS 2018;8:447-460. [PMID: 29167272 PMCID: PMC5919738 DOI: 10.1534/g3.117.300139] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Sunde RA. Selenium regulation of selenoprotein enzyme activity and transcripts in a pilot study with Founder strains from the Collaborative Cross. PLoS One 2018;13:e0191449. [PMID: 29338053 PMCID: PMC5770059 DOI: 10.1371/journal.pone.0191449] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 01/04/2018] [Indexed: 12/02/2022] Open

Abstract

Rodents and humans have 24–25 selenoproteins, and these proteins contain the 21^st amino acid, selenocysteine, incorporated co-translationally into the peptide backbone in a series of reactions dependent on at least 6 unique gene products. In selenium (Se) deficiency, there is differential regulation of selenoprotein expression, whereby levels of some selenoproteins and their transcripts decrease dramatically in Se deficiency, but other selenoprotein transcripts are spared this decrease; the underlying mechanism, however, is not fully understood. To begin explore the genetic basis for this variation in regulation by Se status in a pilot study, we fed Se-deficient or Se-adequate diets (0.005 or 0.2 μg Se/g, respectively) for eight weeks to the eight Founder strains of the Collaborative Cross. We found rather uniform expression of selenoenzyme activity for glutathione peroxidase (Gpx) 3 in plasma, Gpx1 in red blood cells, and Gpx1, Gpx4, and thioredoxin reductase in liver. In Founder mice, Se deficiency decreased each of these activities to a similar extent. Regulation of selenoprotein transcript expression by Se status was also globally retained intact, with dramatic down-regulation of Gpx1, Selenow, and Selenoh transcripts in all 8 strains of Founder mice. These results indicate that differential regulation of selenoprotein expression by Se status is an essential aspect of Se metabolism and selenoprotein function. A few lone differences in Se regulation were observed for individual selenoproteins in this pilot study, but these differences did not single-out one strain or one selenoprotein that consistently had unique Se regulation of selenoprotein expression. These differences should be affirmed in larger studies; use of the Diversity Outbred and Collaborative Cross strains may help to better define the functions of these selenoproteins.

Collapse

Chen A, Liu Y, Williams SM, Morris N, Buchner DA. Widespread epistasis regulates glucose homeostasis and gene expression. PLoS Genet 2017;13:e1007025. [PMID: 28961251 PMCID: PMC5636166 DOI: 10.1371/journal.pgen.1007025] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Revised: 10/11/2017] [Accepted: 09/17/2017] [Indexed: 02/07/2023] Open

Abstract

The relative contributions of additive versus non-additive interactions in the regulation of complex traits remains controversial. This may be in part because large-scale epistasis has traditionally been difficult to detect in complex, multi-cellular organisms. We hypothesized that it would be easier to detect interactions using mouse chromosome substitution strains that simultaneously incorporate allelic variation in many genes on a controlled genetic background. Analyzing metabolic traits and gene expression levels in the offspring of a series of crosses between mouse chromosome substitution strains demonstrated that inter-chromosomal epistasis was a dominant feature of these complex traits. Epistasis typically accounted for a larger proportion of the heritable effects than those due solely to additive effects. These epistatic interactions typically resulted in trait values returning to the levels of the parental CSS host strain. Due to the large epistatic effects, analyses that did not account for interactions consistently underestimated the true effect sizes due to allelic variation or failed to detect the loci controlling trait variation. These studies demonstrate that epistatic interactions are a common feature of complex traits and thus identifying these interactions is key to understanding their genetic regulation.

Most complex traits and diseases are regulated by the combined influence of multiple genetic variants. However, it remains controversial whether these genetic variants independently influence complex traits, and therefore the impact of each variant could be simply added together (additivity), or whether the variants work together to influence trait variation, in which case the combined impact of multiple variants would differ from the summed impact of each individual variant (epistasis). In this study in mice, we discovered that the genetic regulation of blood sugar levels and gene expression in the liver were predominantly controlled by non-additive interactions, whereas body weight was predominantly controlled by additive interactions. Remarkably, the expression level of nearly 25% of all genes in the liver was controlled by non-additive interactions. The non-additive interactions typically acted to return trait values to the levels detected in control mice, thus contributing to a reduction in trait variation. We also demonstrated that not accounting for non-additive interactions significantly underestimated the phenotypic effect of a genetic variant on a particular genetic background, suggesting that many previously identified risk loci may have significantly larger effects on disease susceptibility in a subset of individuals. These studies highlight the importance of understanding interactions between genetic variants to better understand disease risk and personalize clinical care.

Collapse

Epistatic Networks Jointly Influence Phenotypes Related to Metabolic Disease and Gene Expression in Diversity Outbred Mice. Genetics 2017;206:621-639. [PMID: 28592500 PMCID: PMC5499176 DOI: 10.1534/genetics.116.198051] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 04/03/2017] [Indexed: 12/20/2022] Open

Abstract

In this study, Tyler et al. analyzed the complex genetic architecture of metabolic disease-related traits using the Diversity Outbred mouse population

Genetic studies of multidimensional phenotypes can potentially link genetic variation, gene expression, and physiological data to create multi-scale models of complex traits. The challenge of reducing these data to specific hypotheses has become increasingly acute with the advent of genome-scale data resources. Multi-parent populations derived from model organisms provide a resource for developing methods to understand this complexity. In this study, we simultaneously modeled body composition, serum biomarkers, and liver transcript abundances from 474 Diversity Outbred mice. This population contained both sexes and two dietary cohorts. Transcript data were reduced to functional gene modules with weighted gene coexpression network analysis (WGCNA), which were used as summary phenotypes representing enriched biological processes. These module phenotypes were jointly analyzed with body composition and serum biomarkers in a combined analysis of pleiotropy and epistasis (CAPE), which inferred networks of epistatic interactions between quantitative trait loci that affect one or more traits. This network frequently mapped interactions between alleles of different ancestries, providing evidence of both genetic synergy and redundancy between haplotypes. Furthermore, a number of loci interacted with sex and diet to yield sex-specific genetic effects and alleles that potentially protect individuals from the effects of a high-fat diet. Although the epistatic interactions explained small amounts of trait variance, the combination of directional interactions, allelic specificity, and high genomic resolution provided context to generate hypotheses for the roles of specific genes in complex traits. Our approach moves beyond the cataloging of single loci to infer genetic networks that map genetic etiology by simultaneously modeling all phenotypes.

Collapse

Ibarra-Soria X, Nakahara TS, Lilue J, Jiang Y, Trimmer C, Souza MA, Netto PH, Ikegami K, Murphy NR, Kusma M, Kirton A, Saraiva LR, Keane TM, Matsunami H, Mainland J, Papes F, Logan DW. Variation in olfactory neuron repertoires is genetically controlled and environmentally modulated. eLife 2017;6. [PMID: 28438259 PMCID: PMC5404925 DOI: 10.7554/elife.21476] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Accepted: 03/21/2017] [Indexed: 12/28/2022] Open

Abstract

The mouse olfactory sensory neuron (OSN) repertoire is composed of 10 million cells and each expresses one olfactory receptor (OR) gene from a pool of over 1000. Thus, the nose is sub-stratified into more than a thousand OSN subtypes. Here, we employ and validate an RNA-sequencing-based method to quantify the abundance of all OSN subtypes in parallel, and investigate the genetic and environmental factors that contribute to neuronal diversity. We find that the OSN subtype distribution is stereotyped in genetically identical mice, but varies extensively between different strains. Further, we identify cis-acting genetic variation as the greatest component influencing OSN composition and demonstrate independence from OR function. However, we show that olfactory stimulation with particular odorants results in modulation of dozens of OSN subtypes in a subtle but reproducible, specific and time-dependent manner. Together, these mechanisms generate a highly individualized olfactory sensory system by promoting neuronal diversity.

DOI:http://dx.doi.org/10.7554/eLife.21476.001

Smells are simply chemicals in the air that are recognized by nerves in our nose. Each nerve has a receptor that can identify a limited number of chemicals, and the nerve then relays this information to the brain. Animals have hundreds to thousands of different types of these nerves meaning that they can detect a wide array of smells.

Smell receptors are proteins, and the genes that encode these proteins can be very different in two unrelated people. This could partly explain, for example, why some people find certain odors intense and unpleasant while others do not. However, having different genes for smell receptors does not by itself completely explain why some people are more sensitive than others to particular smells. The amounts of each nerve type in the nose might also differ between people and have an effect, but to date it has not been possible to accurately count them all.

Ibarra-Soria et al. have now devised a new method to essentially count the number of each nerve type in the noses of mice from different breeds. The method makes use of a technique called RNA-sequencing, which can reveal which genes are active at any one time, and thus show how many nerves are producing each type of smell receptor. Ibarra-Soria et al. learned that different breeds of mice had remarkably different compositions of nerves in their noses. Further analysis revealed that this was due to changes to the DNA code near to the genes that encode the smell receptor.

Next, Ibarra-Soria et al. sought to find out how the amount of each nerve type is controlled by giving mice water with different smells for weeks and looking how this affected their noses. These experiments revealed that a small number of the nerve types became more or less common after exposure to a smell. The altered nerves were directly involved in recognizing the smells, proving that the very act of smelling can change the make-up of nerves in a mouse’s nose.

These results confirm that the diversity in the nose of each individual is not only dictated by the types of receptors found in there, but also by the number of each nerve type. The next challenge is to understand better how these differences change the way people perceive smells.

DOI:http://dx.doi.org/10.7554/eLife.21476.002

Collapse

Schughart K, Williams RW. The Collaborative Cross Resource for Systems Genetics Research of Infectious Diseases. Methods Mol Biol 2017;1488:579-596. [PMID: 27933545 PMCID: PMC7120135 DOI: 10.1007/978-1-4939-6427-7_28] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Baud A, Mulligan MK, Casale FP, Ingels JF, Bohl CJ, Callebert J, Launay JM, Krohn J, Legarra A, Williams RW, Stegle O. Genetic Variation in the Social Environment Contributes to Health and Disease. PLoS Genet 2017;13:e1006498. [PMID: 28121987 PMCID: PMC5266220 DOI: 10.1371/journal.pgen.1006498] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 11/21/2016] [Indexed: 11/29/2022] Open

Winter JM, Gildea DE, Andreas JP, Gatti DM, Williams KA, Lee M, Hu Y, Zhang S, Mullikin JC, Wolfsberg TG, McDonnell SK, Fogarty ZC, Larson MC, French AJ, Schaid DJ, Thibodeau SN, Churchill GA, Crawford NPS. Mapping Complex Traits in a Diversity Outbred F1 Mouse Population Identifies Germline Modifiers of Metastasis in Human Prostate Cancer. Cell Syst 2016;4:31-45.e6. [PMID: 27916600 DOI: 10.1016/j.cels.2016.10.018] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Revised: 09/08/2016] [Accepted: 10/20/2016] [Indexed: 01/02/2023]

Affiliation(s)

Jean M Winter Genetics and Molecular Biology Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
Derek E Gildea Computational and Statistical Genomics Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
Jonathan P Andreas Genetics and Molecular Biology Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
Daniel M Gatti The Jackson Laboratory, Bar Harbor, ME 04609, USA
Kendra A Williams Genetics and Molecular Biology Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
Minnkyong Lee Genetics and Molecular Biology Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
Ying Hu Center for Biomedical Informatics and Information Technology, National Cancer Institute, NIH, Rockville, MD 20892, USA
Suiyuan Zhang Computational and Statistical Genomics Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
NIH Intramural Sequencing Center, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
James C Mullikin NIH Intramural Sequencing Center, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
Tyra G Wolfsberg Computational and Statistical Genomics Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
Shannon K McDonnell Department of Health Sciences Research, Mayo Clinic College of Medicine, 200 First Street SW, Rochester, MN 55905, USA
Zachary C Fogarty Department of Health Sciences Research, Mayo Clinic College of Medicine, 200 First Street SW, Rochester, MN 55905, USA
Melissa C Larson Department of Health Sciences Research, Mayo Clinic College of Medicine, 200 First Street SW, Rochester, MN 55905, USA
Amy J French Department of Laboratory Medicine and Pathology, Mayo Clinic College of Medicine, 200 First Street SW, Rochester, MN 55905, USA
Daniel J Schaid Department of Health Sciences Research, Mayo Clinic College of Medicine, 200 First Street SW, Rochester, MN 55905, USA
Stephen N Thibodeau Department of Laboratory Medicine and Pathology, Mayo Clinic College of Medicine, 200 First Street SW, Rochester, MN 55905, USA
Gary A Churchill The Jackson Laboratory, Bar Harbor, ME 04609, USA
Nigel P S Crawford Genetics and Molecular Biology Branch, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA.

Collapse

Dowell R, Odell A, Richmond P, Malmer D, Halper-Stromberg E, Bennett B, Larson C, Leach S, Radcliffe RA. Genome characterization of the selected long- and short-sleep mouse lines. Mamm Genome 2016;27:574-586. [PMID: 27651241 PMCID: PMC5110614 DOI: 10.1007/s00335-016-9663-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Accepted: 08/22/2016] [Indexed: 01/29/2023]

Hodgkinson A, Grenier JC, Gbeha E, Awadalla P. A haplotype-based normalization technique for the analysis and detection of allele specific expression. BMC Bioinformatics 2016;17:364. [PMID: 27618913 PMCID: PMC5020486 DOI: 10.1186/s12859-016-1238-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2016] [Accepted: 09/02/2016] [Indexed: 12/17/2022] Open

Abstract

Background

Allele specific expression (ASE) has become an important phenotype, being utilized for the detection of cis-regulatory variation, nonsense mediated decay and imprinting in the personal genome, and has been used to both identify disease loci and consider the penetrance of damaging alleles. The detection of ASE using high throughput technologies relies on aligning short-read sequencing data, a process that has inherent biases, and there is still a need to develop fast and accurate methods to detect ASE given the unprecedented growth of sequencing information in big data projects.

Results

Here, we present a new approach to normalize RNA sequencing data in order to call ASE events with high precision in a short time-frame. Using simulated datasets we find that our approach dramatically improves reference allele quantification at heterozygous sites versus default mapping methods and also performs well compared to existing techniques for ASE detection, such as filtering methods and mapping to parental genomes, without the need for complex and time consuming manipulation. Finally, by sequencing the exomes and transcriptomes of 96 well-phenotyped individuals of the CARTaGENE cohort, we characterise the levels of ASE across individuals and find a significant association between the proportion of sites undergoing ASE within the genome and smoking.

Conclusions

The correct treatment and analysis of RNA sequencing data is vital to control for mapping biases and detect genuine ASE signals. By normalising RNA sequencing information after mapping, we show that this approach can be used to identify biologically relevant signals in personal genomes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-1238-8) contains supplementary material, which is available to authorized users.

Collapse

Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63. Proc Natl Acad Sci U S A 2016;113:E5163-71. [PMID: 27535938 DOI: 10.1073/pnas.1611012113] [Citation(s) in RCA: 155] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

Morton NM, Beltram J, Carter RN, Michailidou Z, Gorjanc G, Fadden CM, Barrios-Llerena ME, Rodriguez-Cuenca S, Gibbins MTG, Aird RE, Moreno-Navarrete JM, Munger SC, Svenson KL, Gastaldello A, Ramage L, Naredo G, Zeyda M, Wang ZV, Howie AF, Saari A, Sipilä P, Stulnig TM, Gudnason V, Kenyon CJ, Seckl JR, Walker BR, Webster SP, Dunbar DR, Churchill GA, Vidal-Puig A, Fernandez-Real JM, Emilsson V, Horvat S. Genetic identification of thiosulfate sulfurtransferase as an adipocyte-expressed antidiabetic target in mice selected for leanness. Nat Med 2016;22:771-9. [PMID: 27270587 PMCID: PMC5524189 DOI: 10.1038/nm.4115] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2015] [Accepted: 04/29/2016] [Indexed: 12/13/2022]

Affiliation(s)

Nicholas M. Morton University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Jasmina Beltram Biotechnical Faculty, Animal Science Department, University of Ljubljana, Ljubljana, Slovenia
Roderick N. Carter University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Zoi Michailidou University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Gregor Gorjanc Biotechnical Faculty, Animal Science Department, University of Ljubljana, Ljubljana, Slovenia
Clare Mc Fadden University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Martin E. Barrios-Llerena University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Sergio Rodriguez-Cuenca Metabolic Research Laboratories, Level 4, Wellcome Trust-MRC Institute of Metabolic Science, Addenbrookes Hospital, Cambridge, UK
Matthew T. G. Gibbins University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Rhona E. Aird University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
José Maria Moreno-Navarrete Department of Diabetes, Endocrinology and Nutrition, Institut d'Investigació Biomédica de Girona; Department of Medicine, University of Girona Centro de Investigación Biomédica en Red de Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, Girona, Spain
Steven C. Munger The Jackson Laboratory, Bar Harbor, Maine, USA
Karen L. Svenson The Jackson Laboratory, Bar Harbor, Maine, USA
Annalisa Gastaldello University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Lynne Ramage University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Gregorio Naredo University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Maximilian Zeyda Clinical Division of Endocrinology and Metabolism, Department of Medicine III, Medical University of Vienna, Vienna, Austria
Zhao V. Wang Department of Internal Medicine, Touchstone Diabetes Center University of Texas Southwestern Medical Center, Dallas, Texas, USA
Alexander F. Howie The MRC Centre for Reproductive Health, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Aila Saari Department of Physiology, Institute of Biomedicine, University of Turku, Turku, Finland
Petra Sipilä Central Animal Laboratory, University of Turku, Turku, Finland
Thomas M. Stulnig Clinical Division of Endocrinology and Metabolism, Department of Medicine III, Medical University of Vienna, Vienna, Austria
Vilmundur Gudnason Icelandic Heart Association, Kopavogur, Iceland
Christopher J. Kenyon University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Jonathan R. Seckl University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Brian R. Walker University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Scott P. Webster University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Donald R. Dunbar University/British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Queen’s Medical Research Institute, Edinburgh, UK
Gary A. Churchill The Jackson Laboratory, Bar Harbor, Maine, USA
Antonio Vidal-Puig Metabolic Research Laboratories, Level 4, Wellcome Trust-MRC Institute of Metabolic Science, Addenbrookes Hospital, Cambridge, UK
José Manuel Fernandez-Real Department of Diabetes, Endocrinology and Nutrition, Institut d'Investigació Biomédica de Girona; Department of Medicine, University of Girona Centro de Investigación Biomédica en Red de Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, Girona, Spain
Valur Emilsson Icelandic Heart Association, Kopavogur, Iceland Faculty of Pharmaceutical Sciences, University of Iceland, Reykjavik, Iceland
Simon Horvat Biotechnical Faculty, Animal Science Department, University of Ljubljana, Ljubljana, Slovenia National Institute of Chemistry, Ljubljana, Slovenia

Collapse

Chick JM, Munger SC, Simecek P, Huttlin EL, Choi K, Gatti DM, Raghupathy N, Svenson KL, Churchill GA, Gygi SP. Defining the consequences of genetic variation on a proteome-wide scale. Nature 2016;534:500-5. [PMID: 27309819 PMCID: PMC5292866 DOI: 10.1038/nature18270] [Citation(s) in RCA: 249] [Impact Index Per Article: 31.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 04/13/2016] [Indexed: 12/11/2022]

Chen Z, Hagen DE, Wang J, Elsik CG, Ji T, Siqueira LG, Hansen PJ, Rivera RM. Global assessment of imprinted gene expression in the bovine conceptus by next generation sequencing. Epigenetics 2016;11:501-16. [PMID: 27245094 PMCID: PMC4939914 DOI: 10.1080/15592294.2016.1184805] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Buffering of Genetic Regulatory Networks in Drosophila melanogaster. Genetics 2016;203:1177-90. [PMID: 27194752 DOI: 10.1534/genetics.116.188797] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 05/17/2016] [Indexed: 01/01/2023] Open

Genetic Architectures of Quantitative Variation in RNA Editing Pathways. Genetics 2015;202:787-98. [PMID: 26614740 DOI: 10.1534/genetics.115.179481] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 11/17/2015] [Indexed: 11/18/2022] Open

Stein S, Lu ZX, Bahrami-Samani E, Park JW, Xing Y. Discover hidden splicing variations by mapping personal transcriptomes to personal genomes. Nucleic Acids Res 2015;43:10612-22. [PMID: 26578562 PMCID: PMC4678817 DOI: 10.1093/nar/gkv1099] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 10/09/2015] [Indexed: 01/27/2023] Open

Yang C, Wu PY, Tong L, Phan JH, Wang MD. The impact of RNA-seq aligners on gene expression estimation. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2015;2015:462-471. [PMID: 27583310 DOI: 10.1145/2808719.2808767] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]