1
|
James C, Trevisan-Herraz M, Juan D, Rico D. Evolutionary analysis of gene ages across TADs associates chromatin topology with whole-genome duplications. Cell Rep 2024; 43:113895. [PMID: 38517894 DOI: 10.1016/j.celrep.2024.113895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Revised: 11/03/2023] [Accepted: 02/16/2024] [Indexed: 03/24/2024] Open
Abstract
Topologically associated domains (TADs) are interaction subnetworks of chromosomal regions in 3D genomes. TAD boundaries frequently coincide with genome breaks while boundary deletion is under negative selection, suggesting that TADs may facilitate genome rearrangements and evolution. We show that genes co-localize by evolutionary age in humans and mice, resulting in TADs having different proportions of younger and older genes. We observe a major transition in the age co-localization patterns between the genes born during vertebrate whole-genome duplications (WGDs) or before and those born afterward. We also find that genes recently duplicated in primates and rodents are more frequently essential when they are located in old-enriched TADs and interact with genes that last duplicated during the WGD. Therefore, the evolutionary relevance of recent genes may increase when located in TADs with established regulatory networks. Our data suggest that TADs could play a role in organizing ancestral functions and evolutionary novelty.
Collapse
Affiliation(s)
- Caelinn James
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Scotland's Rural College (SRUC), The Roslin Institute Building, Easter Bush, Midlothian, UK
| | - Marco Trevisan-Herraz
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Translational and Clinical Research Institute, Newcastle University, Newcastle Upon Tyne, UK
| | - David Juan
- Institut de Biologia Evolutiva, Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain; Systems Biology Department, Spanish National Centre for Biotechnology (CNB-CSIC), Madrid, Spain
| | - Daniel Rico
- Biosciences Institute, Newcastle University, Newcastle Upon Tyne, UK; Centro Andaluz de Biología Molecular y Medicina Regenerativa (CABIMER), CSIC-Universidad de Sevilla-Universidad Pablo de Olavide-Junta de Andalucía, Seville, Spain.
| |
Collapse
|
2
|
Gilbertson EN, Brand CM, McArthur E, Rinker DC, Kuang S, Pollard KS, Capra JA. Machine learning reveals the diversity of human 3D chromatin contact patterns. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.22.573104. [PMID: 38187606 PMCID: PMC10769343 DOI: 10.1101/2023.12.22.573104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
Understanding variation in chromatin contact patterns across human populations is critical for interpreting non-coding variants and their ultimate effects on gene expression and phenotypes. However, experimental determination of chromatin contacts at a population-scale is prohibitively expensive. To overcome this challenge, we develop and validate a machine learning method to quantify the diversity 3D chromatin contacts at 2 kilobase resolution from genome sequence alone. We then apply this approach to thousands of diverse modern humans and the inferred human-archaic hominin ancestral genome. While patterns of 3D contact divergence genome-wide are qualitatively similar to patterns of sequence divergence, we find that 3D divergence in local 1-megabase genomic windows does not follow sequence divergence. In particular, we identify 392 windows with significantly greater 3D divergence than expected from sequence. Moreover, 26% of genomic windows have rare 3D contact variation observed in a small number of individuals. Using in silico mutagenesis we find that most sequence changes to do not result in changes to 3D chromatin contacts. However in windows with substantial 3D divergence, just one or a few variants can lead to divergent 3D chromatin contacts without the individuals carrying those variants having high sequence divergence. In summary, inferring 3D chromatin contact maps across human populations reveals diverse contact patterns. We anticipate that these genetically diverse maps of 3D chromatin contact will provide a reference for future work on the function and evolution of 3D chromatin contact variation across human populations.
Collapse
Affiliation(s)
- Erin N Gilbertson
- Biomedical Informatics Graduate Program, University of California San Francisco, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
| | - Colin M Brand
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA
| | - Evonne McArthur
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN
- Department of Medicine, University of Washington, Seattle, WA
| | - David C Rinker
- Department of Biological Sciences, Vanderbilt University, Nashville, TN
| | - Shuzhen Kuang
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA
| | - Katherine S Pollard
- Biomedical Informatics Graduate Program, University of California San Francisco, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA
- Chan Zuckerberg Biohub SF, San Francisco, CA
| | - John A Capra
- Biomedical Informatics Graduate Program, University of California San Francisco, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA
| |
Collapse
|
3
|
Wang D, Perera D, He J, Cao C, Kossinna P, Li Q, Zhang W, Guo X, Platt A, Wu J, Zhang Q. cLD: Rare-variant linkage disequilibrium between genomic regions identifies novel genomic interactions. PLoS Genet 2023; 19:e1011074. [PMID: 38109434 PMCID: PMC10758262 DOI: 10.1371/journal.pgen.1011074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 01/01/2024] [Accepted: 11/20/2023] [Indexed: 12/20/2023] Open
Abstract
Linkage disequilibrium (LD) is a fundamental concept in genetics; critical for studying genetic associations and molecular evolution. However, LD measurements are only reliable for common genetic variants, leaving low-frequency variants unanalyzed. In this work, we introduce cumulative LD (cLD), a stable statistic that captures the rare-variant LD between genetic regions, which reflects more biological interactions between variants, in addition to lack of recombination. We derived the theoretical variance of cLD using delta methods to demonstrate its higher stability than LD for rare variants. This property is also verified by bootstrapped simulations using real data. In application, we find cLD reveals an increased genetic association between genes in 3D chromatin interactions, a phenomenon recently reported negatively by calculating standard LD between common variants. Additionally, we show that cLD is higher between gene pairs reported in interaction databases, identifies unreported protein-protein interactions, and reveals interacting genes distinguishing case/control samples in association studies.
Collapse
Affiliation(s)
- Dinghao Wang
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
| | - Deshan Perera
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Alberta, Canada
| | - Jingni He
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Alberta, Canada
| | - Chen Cao
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Alberta, Canada
| | - Pathum Kossinna
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Alberta, Canada
| | - Qing Li
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Alberta, Canada
| | - William Zhang
- The Harker School, San Jose, California, United States of America
| | - Xingyi Guo
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Alexander Platt
- Department of Genetics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Jingjing Wu
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
| | - Qingrun Zhang
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Alberta, Canada
- Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, Alberta, Canada
| |
Collapse
|
4
|
Rozowsky J, Gao J, Borsari B, Yang YT, Galeev T, Gürsoy G, Epstein CB, Xiong K, Xu J, Li T, Liu J, Yu K, Berthel A, Chen Z, Navarro F, Sun MS, Wright J, Chang J, Cameron CJF, Shoresh N, Gaskell E, Drenkow J, Adrian J, Aganezov S, Aguet F, Balderrama-Gutierrez G, Banskota S, Corona GB, Chee S, Chhetri SB, Cortez Martins GC, Danyko C, Davis CA, Farid D, Farrell NP, Gabdank I, Gofin Y, Gorkin DU, Gu M, Hecht V, Hitz BC, Issner R, Jiang Y, Kirsche M, Kong X, Lam BR, Li S, Li B, Li X, Lin KZ, Luo R, Mackiewicz M, Meng R, Moore JE, Mudge J, Nelson N, Nusbaum C, Popov I, Pratt HE, Qiu Y, Ramakrishnan S, Raymond J, Salichos L, Scavelli A, Schreiber JM, Sedlazeck FJ, See LH, Sherman RM, Shi X, Shi M, Sloan CA, Strattan JS, Tan Z, Tanaka FY, Vlasova A, Wang J, Werner J, Williams B, Xu M, Yan C, Yu L, Zaleski C, Zhang J, Ardlie K, Cherry JM, Mendenhall EM, Noble WS, Weng Z, Levine ME, Dobin A, Wold B, Mortazavi A, Ren B, Gillis J, Myers RM, Snyder MP, Choudhary J, Milosavljevic A, Schatz MC, Bernstein BE, Guigó R, Gingeras TR, Gerstein M. The EN-TEx resource of multi-tissue personal epigenomes & variant-impact models. Cell 2023; 186:1493-1511.e40. [PMID: 37001506 PMCID: PMC10074325 DOI: 10.1016/j.cell.2023.02.018] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 10/16/2022] [Accepted: 02/10/2023] [Indexed: 04/03/2023]
Abstract
Understanding how genetic variants impact molecular phenotypes is a key goal of functional genomics, currently hindered by reliance on a single haploid reference genome. Here, we present the EN-TEx resource of 1,635 open-access datasets from four donors (∼30 tissues × ∼15 assays). The datasets are mapped to matched, diploid genomes with long-read phasing and structural variants, instantiating a catalog of >1 million allele-specific loci. These loci exhibit coordinated activity along haplotypes and are less conserved than corresponding, non-allele-specific ones. Surprisingly, a deep-learning transformer model can predict the allele-specific activity based only on local nucleotide-sequence context, highlighting the importance of transcription-factor-binding motifs particularly sensitive to variants. Furthermore, combining EN-TEx with existing genome annotations reveals strong associations between allele-specific and GWAS loci. It also enables models for transferring known eQTLs to difficult-to-profile tissues (e.g., from skin to heart). Overall, EN-TEx provides rich data and generalizable models for more accurate personal functional genomics.
Collapse
Affiliation(s)
- Joel Rozowsky
- Section on Biomedical Informatics and Data Science, Yale University, New Haven, CT, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jiahao Gao
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Beatrice Borsari
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
| | - Yucheng T Yang
- Institute of Science and Technology for Brain-Inspired Intelligence; MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence; MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Timur Galeev
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Gamze Gürsoy
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | | | - Kun Xiong
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jinrui Xu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Tianxiao Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jason Liu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Keyang Yu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Ana Berthel
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Zhanlin Chen
- Department of Statistics and Data Science, Yale University, New Haven, CT, USA
| | - Fabio Navarro
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Maxwell S Sun
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | | | - Justin Chang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Christopher J F Cameron
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Noam Shoresh
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Jorg Drenkow
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jessika Adrian
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Sergey Aganezov
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | | | | | | | | | - Sora Chee
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Surya B Chhetri
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Gabriel Conte Cortez Martins
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Cassidy Danyko
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Carrie A Davis
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Daniel Farid
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | | | - Idan Gabdank
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Yoel Gofin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - David U Gorkin
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Mengting Gu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Vivian Hecht
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Benjamin C Hitz
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Robbyn Issner
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Yunzhe Jiang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Melanie Kirsche
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Xiangmeng Kong
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Bonita R Lam
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Shantao Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Bian Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Xiqi Li
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Khine Zin Lin
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Ruibang Luo
- Department of Computer Science, The University of Hong Kong, Hong Kong, CHN
| | - Mark Mackiewicz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Ran Meng
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jill E Moore
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Jonathan Mudge
- European Bioinformatics Institute, Cambridge, Cambridgeshire, GB
| | | | - Chad Nusbaum
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ioann Popov
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Henry E Pratt
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Yunjiang Qiu
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Srividya Ramakrishnan
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Joe Raymond
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Leonidas Salichos
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Department of Biological and Chemical Sciences, New York Institute of Technology, Old Westbury, NY, USA
| | - Alexandra Scavelli
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jacob M Schreiber
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Fritz J Sedlazeck
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA; Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Lei Hoon See
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Rachel M Sherman
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Xu Shi
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Minyi Shi
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Cricket Alicia Sloan
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - J Seth Strattan
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Zhen Tan
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Forrest Y Tanaka
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Anna Vlasova
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain; Comparative Genomics Group, Life Science Programme, Barcelona Supercomputing Centre, Barcelona, Spain; Institute of Research in Biomedicine, Barcelona, Spain
| | - Jun Wang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jonathan Werner
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Brian Williams
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - Min Xu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Chengfei Yan
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Lu Yu
- Institute of Cancer Research, London, UK
| | - Christopher Zaleski
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jing Zhang
- Department of Computer Science, University of California, Irvine, Irvine, CA, USA
| | | | - J Michael Cherry
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | | | - William S Noble
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Morgan E Levine
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - Alexander Dobin
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Barbara Wold
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Jesse Gillis
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Department of Physiology, University of Toronto, Toronto, ON, Canada
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Michael P Snyder
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | | | | | - Michael C Schatz
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA; Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| | - Bradley E Bernstein
- Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Roderic Guigó
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain; Universitat Pompeu Fabra, Barcelona, Catalonia, Spain.
| | - Thomas R Gingeras
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| | - Mark Gerstein
- Section on Biomedical Informatics and Data Science, Yale University, New Haven, CT, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Department of Statistics and Data Science, Yale University, New Haven, CT, USA; Department of Computer Science, Yale University, New Haven, CT, USA.
| |
Collapse
|
5
|
Li Q, Perera D, Cao C, He J, Bian J, Chen X, Azeem F, Howe A, Au B, Wu J, Yan J, Long Q. Interaction-integrated linear mixed model reveals 3D-genetic basis underlying Autism. Genomics 2023; 115:110575. [PMID: 36758877 DOI: 10.1016/j.ygeno.2023.110575] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 01/16/2023] [Accepted: 02/03/2023] [Indexed: 02/10/2023]
Abstract
Genetic interactions play critical roles in genotype-phenotype associations. We developed a novel interaction-integrated linear mixed model (ILMM) that integrates a priori knowledge into linear mixed models. ILMM enables statistical integration of genetic interactions upfront and overcomes the problems of searching for combinations. To demonstrate its utility, with 3D genomic interactions (assessed by Hi-C experiments) as a priori, we applied ILMM to whole-genome sequencing data for Autism Spectrum Disorders (ASD) and brain transcriptome data, revealing the 3D-genetic basis of ASD and 3D-expression quantitative loci (3D-eQTLs) for brain tissues. Notably, we reported a potential mechanism involving distal regulation between FOXP2 and DNMT3A, conferring the risk of ASD.
Collapse
Affiliation(s)
- Qing Li
- Department of Biochemistry and Molecular Biology, University of Calgary, Alberta T2N 1N4, Canada
| | - Deshan Perera
- Department of Biochemistry and Molecular Biology, University of Calgary, Alberta T2N 1N4, Canada
| | - Chen Cao
- Department of Biochemistry and Molecular Biology, University of Calgary, Alberta T2N 1N4, Canada
| | - Jingni He
- Department of Biochemistry and Molecular Biology, University of Calgary, Alberta T2N 1N4, Canada
| | - Jiayi Bian
- Department of Mathematics and Statistics, University of Calgary, Alberta T2N 1N4, Canada
| | - Xingyu Chen
- Department of Biochemistry and Molecular Biology, University of Calgary, Alberta T2N 1N4, Canada
| | - Feeha Azeem
- Department of Biochemistry and Molecular Biology, University of Calgary, Alberta T2N 1N4, Canada
| | - Aaron Howe
- Heritage Youth Researcher Summer Program, University of Calgary, Alberta T2N 1N4, Canada
| | - Billie Au
- Department of Medical Genetics, University of Calgary, Alberta T2N 1N4, Canada; Alberta Children's Hospital Research Institute, University of Calgary, Alberta T2N 1N4, Canada
| | - Jingjing Wu
- Department of Mathematics and Statistics, University of Calgary, Alberta T2N 1N4, Canada
| | - Jun Yan
- Department of Physiology and Pharmacology, University of Calgary, Alberta T2N 1N4, Canada; Hotchkiss Brain Institute, University of Calgary, Alberta T2N 1N4, Canada.
| | - Quan Long
- Department of Biochemistry and Molecular Biology, University of Calgary, Alberta T2N 1N4, Canada; Department of Medical Genetics, University of Calgary, Alberta T2N 1N4, Canada; Department of Mathematics and Statistics, University of Calgary, Alberta T2N 1N4, Canada; Alberta Children's Hospital Research Institute, University of Calgary, Alberta T2N 1N4, Canada; Hotchkiss Brain Institute, University of Calgary, Alberta T2N 1N4, Canada.
| |
Collapse
|
6
|
Chua EHZ, Yasar S, Harmston N. The importance of considering regulatory domains in genome-wide analyses - the nearest gene is often wrong! Biol Open 2022; 11:274931. [PMID: 35377406 PMCID: PMC9002814 DOI: 10.1242/bio.059091] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The expression of a large number of genes is regulated by regulatory elements that are located far away from their promoters. Identifying which gene is the target of a specific regulatory element or is affected by a non-coding mutation is often accomplished by assigning these regions to the nearest gene in the genome. However, this heuristic ignores key features of genome organisation and gene regulation; in that the genome is partitioned into regulatory domains, which at some loci directly coincide with the span of topologically associated domains (TADs), and that genes are regulated by enhancers located throughout these regions, even across intervening genes. In this review, we examine the results from genome-wide studies using chromosome conformation capture technologies and from those dissecting individual gene regulatory domains, to highlight that the phenomenon of enhancer skipping is pervasive and affects multiple types of genes. We discuss how simply assigning a genomic region of interest to its nearest gene is problematic and often leads to incorrect predictions and highlight that where possible information on both the conservation and topological organisation of the genome should be used to generate better hypotheses. The article has an associated Future Leader to Watch interview. Summary: Identifying which gene is the target of an enhancer is often accomplished by assigning it to the nearest gene, here we discuss how this heuristic can lead to incorrect predictions.
Collapse
Affiliation(s)
| | - Samen Yasar
- Science Division, Yale-NUS College, Singapore 138527, Singapore
| | - Nathan Harmston
- Science Division, Yale-NUS College, Singapore 138527, Singapore.,Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, Singapore 169857, Singapore
| |
Collapse
|
7
|
Jablonski KP, Carron L, Mozziconacci J, Forné T, Hütt MT, Lesne A. Contribution of 3D genome topological domains to genetic risk of cancers: a genome-wide computational study. Hum Genomics 2022; 16:2. [PMID: 35016721 PMCID: PMC8753905 DOI: 10.1186/s40246-022-00375-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 01/02/2022] [Indexed: 01/31/2023] Open
Abstract
Background Genome-wide association studies have identified statistical associations between various diseases, including cancers, and a large number of single-nucleotide polymorphisms (SNPs). However, they provide no direct explanation of the mechanisms underlying the association. Based on the recent discovery that changes in three-dimensional genome organization may have functional consequences on gene regulation favoring diseases, we investigated systematically the genome-wide distribution of disease-associated SNPs with respect to a specific feature of 3D genome organization: topologically associating domains (TADs) and their borders. Results For each of 449 diseases, we tested whether the associated SNPs are present in TAD borders more often than observed by chance, where chance (i.e., the null model in statistical terms) corresponds to the same number of pointwise loci drawn at random either in the entire genome, or in the entire set of disease-associated SNPs listed in the GWAS catalog. Our analysis shows that a fraction of diseases displays such a preferential localization of their risk loci. Moreover, cancers are relatively more frequent among these diseases, and this predominance is generally enhanced when considering only intergenic SNPs. The structure of SNP-based diseasome networks confirms that localization of risk loci in TAD borders differs between cancers and non-cancer diseases. Furthermore, different TAD border enrichments are observed in embryonic stem cells and differentiated cells, consistent with changes in topological domains along embryogenesis and delineating their contribution to disease risk. Conclusions Our results suggest that, for certain diseases, part of the genetic risk lies in a local genetic variation affecting the genome partitioning in topologically insulated domains. Investigating this possible contribution to genetic risk is particularly relevant in cancers. This study thus opens a way of interpreting genome-wide association studies, by distinguishing two types of disease-associated SNPs: one with an effect on an individual gene, the other acting in interplay with 3D genome organization. Supplementary Information The online version contains supplementary material available at 10.1186/s40246-022-00375-2.
Collapse
Affiliation(s)
- Kim Philipp Jablonski
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland
| | - Leopold Carron
- Laboratoire de Physique Théorique de la Matière Condensée, LPTMC, CNRS, Sorbonne Université, Paris, France.,Laboratory of Computational and Quantitative Biology, LCQB, Sorbonne Université, Paris, France
| | - Julien Mozziconacci
- Laboratoire de Physique Théorique de la Matière Condensée, LPTMC, CNRS, Sorbonne Université, Paris, France.,Structure et Instabilité des Génomes, Muséum National d'Histoire Naturelle, Paris, France
| | - Thierry Forné
- Institut de Génétique Moléculaire de Montpellier, IGMM, CNRS, Univ. Montpellier, Montpellier, France
| | - Marc-Thorsten Hütt
- Department of Life Sciences and Chemistry, Jacobs University Bremen, Bremen, Germany.
| | - Annick Lesne
- Laboratoire de Physique Théorique de la Matière Condensée, LPTMC, CNRS, Sorbonne Université, Paris, France. .,Institut de Génétique Moléculaire de Montpellier, IGMM, CNRS, Univ. Montpellier, Montpellier, France.
| |
Collapse
|
8
|
Li B, Ritchie MD. From GWAS to Gene: Transcriptome-Wide Association Studies and Other Methods to Functionally Understand GWAS Discoveries. Front Genet 2021; 12:713230. [PMID: 34659337 PMCID: PMC8515949 DOI: 10.3389/fgene.2021.713230] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Accepted: 07/27/2021] [Indexed: 12/12/2022] Open
Abstract
Since their inception, genome-wide association studies (GWAS) have identified more than a hundred thousand single nucleotide polymorphism (SNP) loci that are associated with various complex human diseases or traits. The majority of GWAS discoveries are located in non-coding regions of the human genome and have unknown functions. The valley between non-coding GWAS discoveries and downstream affected genes hinders the investigation of complex disease mechanism and the utilization of human genetics for the improvement of clinical care. Meanwhile, advances in high-throughput sequencing technologies reveal important genomic regulatory roles that non-coding regions play in the transcriptional activities of genes. In this review, we focus on data integrative bioinformatics methods that combine GWAS with functional genomics knowledge to identify genetically regulated genes. We categorize and describe two types of data integrative methods. First, we describe fine-mapping methods. Fine-mapping is an exploratory approach that calibrates likely causal variants underneath GWAS signals. Fine-mapping methods connect GWAS signals to potentially causal genes through statistical methods and/or functional annotations. Second, we discuss gene-prioritization methods. These are hypothesis generating approaches that evaluate whether genetic variants regulate genes via certain genetic regulatory mechanisms to influence complex traits, including colocalization, mendelian randomization, and the transcriptome-wide association study (TWAS). TWAS is a gene-based association approach that investigates associations between genetically regulated gene expression and complex diseases or traits. TWAS has gained popularity over the years due to its ability to reduce multiple testing burden in comparison to other variant-based analytic approaches. Multiple types of TWAS methods have been developed with varied methodological designs and biological hypotheses over the past 5 years. We dive into discussions of how TWAS methods differ in many aspects and the challenges that different TWAS methods face. Overall, TWAS is a powerful tool for identifying complex trait-associated genes. With the advent of single-cell sequencing, chromosome conformation capture, gene editing technologies, and multiplexing reporter assays, we are expecting a more comprehensive understanding of genomic regulation and genetically regulated genes underlying complex human diseases and traits in the future.
Collapse
Affiliation(s)
- Binglan Li
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, United States.,Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
9
|
Golson ML. Islet Epigenetic Impacts on β-Cell Identity and Function. Compr Physiol 2021; 11:1961-1978. [PMID: 34061978 DOI: 10.1002/cphy.c200004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The development and maintenance of differentiation is vital to the function of mature cells. Terminal differentiation is achieved by locking in the expression of genes essential for the function of those cells. Gene expression and its memory through generations of cell division is controlled by transcription factors and a host of epigenetic marks. In type 2 diabetes, β cells have altered gene expression compared to controls, accompanied by altered chromatin marks. Mutations, diet, and environment can all disrupt the implementation and preservation of the distinctive β-cell transcriptional signature. Understanding of the full complement of genomic control in β cells is still nascent. This article describes the known effects of histone marks and variants, DNA methylation, how they are regulated in the β cell, and how they affect cell-fate specification, maintenance, and lineage propagation. © 2021 American Physiological Society. Compr Physiol 11:1-18, 2021.
Collapse
Affiliation(s)
- Maria L Golson
- Department of Medicine, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| |
Collapse
|
10
|
McArthur E, Capra JA. Topologically associating domain boundaries that are stable across diverse cell types are evolutionarily constrained and enriched for heritability. Am J Hum Genet 2021; 108:269-283. [PMID: 33545030 PMCID: PMC7895846 DOI: 10.1016/j.ajhg.2021.01.001] [Citation(s) in RCA: 75] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 12/29/2020] [Indexed: 12/22/2022] Open
Abstract
Topologically associating domains (TADs) are fundamental units of three-dimensional (3D) nuclear organization. The regions bordering TADs-TAD boundaries-contribute to the regulation of gene expression by restricting interactions of cis-regulatory sequences to their target genes. TAD and TAD-boundary disruption have been implicated in rare-disease pathogenesis; however, we have a limited framework for integrating TADs and their variation across cell types into the interpretation of common-trait-associated variants. Here, we investigate an attribute of 3D genome architecture-the stability of TAD boundaries across cell types-and demonstrate its relevance to understanding how genetic variation in TADs contributes to complex disease. By synthesizing TAD maps across 37 diverse cell types with 41 genome-wide association studies (GWASs), we investigate the differences in disease association and evolutionary pressure on variation in TADs versus TAD boundaries. We demonstrate that genetic variation in TAD boundaries contributes more to complex-trait heritability, especially for immunologic, hematologic, and metabolic traits. We also show that TAD boundaries are more evolutionarily constrained than TADs. Next, stratifying boundaries by their stability across cell types, we find substantial variation. Compared to boundaries unique to a specific cell type, boundaries stable across cell types are further enriched for complex-trait heritability, evolutionary constraint, CTCF binding, and housekeeping genes. Thus, considering TAD boundary stability across cell types provides valuable context for understanding the genome's functional landscape and enabling variant interpretation that takes 3D structure into account.
Collapse
Affiliation(s)
- Evonne McArthur
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37235, USA
| | - John A Capra
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37235, USA; Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, 94158; Bakar Institute for Computational Health Sciences, University of California, San Francisco, CA, 94158.
| |
Collapse
|
11
|
Wamsley B, Geschwind DH. Functional genomics links genetic origins to pathophysiology in neurodegenerative and neuropsychiatric disease. Curr Opin Genet Dev 2020; 65:117-125. [PMID: 32634676 PMCID: PMC8171040 DOI: 10.1016/j.gde.2020.05.032] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 05/24/2020] [Indexed: 12/30/2022]
Abstract
Neurodegenerative and neuropsychiatric disorders are pervasive and debilitating conditions characterized by diverse clinical syndromes and comorbidities, whose origins are as complex and heterogeneous as their associated phenotypes. Risk for these disorders involves substantial genetic liability, which has fueled large-scale genetic studies that have led to a flood of discoveries. In turn, these discoveries have exposed substantial gaps in our knowledge with regards to the complicated genetic architecture of each disorder and the substantial amount of genetic overlap among disorders, which implies some degree of shared pathophysiology underlying these clinically distinct, multifactorial disorders. Understanding the role of specific genetic variants will involve resolving the connections between molecular pathways, heterogeneous cell types, specific circuits and disease pathogenesis at the tissue and patient level. We consider the current known genetic basis of these disorders and highlight the utility of molecular systems approaches that establish the function of genetic variation in the context of specific neurobiological networks, cell-types, and life stages. Beyond expanding our knowledge of disease mechanisms, understanding these relationships provides promise for early detection and potential therapeutic interventions.
Collapse
Affiliation(s)
- Brie Wamsley
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Daniel H Geschwind
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA; Program in Neurobehavioral Genetics and Center for Autism Research and Treatment Semel Institute and Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
12
|
Im C, Qin N, Wang Z, Qiu W, Howell CR, Sapkota Y, Moon W, Chemaitilly W, Gibson TM, Mulrooney DA, Ness KK, Wilson CL, Morton LM, Armstrong GT, Bhatia S, Zhang J, Hudson MM, Robison LL, Yasui Y. Generalizability of "GWAS Hits" in Clinical Populations: Lessons from Childhood Cancer Survivors. Am J Hum Genet 2020; 107:636-653. [PMID: 32946765 PMCID: PMC7536574 DOI: 10.1016/j.ajhg.2020.08.014] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 08/19/2020] [Indexed: 12/15/2022] Open
Abstract
With mounting interest in translating genome-wide association study (GWAS) hits from large meta-analyses (meta-GWAS) in diverse clinical settings, evaluating their generalizability in target populations is crucial. Here, we consider long-term survivors of childhood cancers from the St. Jude Lifetime Cohort Study, and we show the limited generalizability of 1,376 robust SNP associations reported in the general population across 12 complex anthropometric and cardiometabolic phenotypes (n = 2,231; observed-to-expected replication ratio = 0.70, p = 6.2 × 10-8). An examination of five comparable phenotypes in a second independent cohort of survivors from the Childhood Cancer Survivor Study corroborated the overall limited generalizability of meta-GWAS hits to survivors (n = 4,212; observed-to-expected replication ratio = 0.55, p = 5.6 × 10-15). Finally, in direct comparisons of survivor samples against independent equivalently powered general population samples from the UK Biobank, we consistently observed lower meta-GWAS hit replication rates and poorer polygenic risk score predictive performance in survivor samples for multiple phenotypes. As a possible explanation, we found that meta-GWAS hits were less likely to be replicated in survivors who had been exposed to cancer therapies that are associated with phenotype risk. Examination of complementary DNA methylation data in a subset of survivors revealed that treatment-related methylation patterns at genomic sites linked to meta-GWAS hits may disrupt established genetic signals in survivors.
Collapse
|
13
|
Dwyer DS. Genomic Chaos Begets Psychiatric Disorder. Complex Psychiatry 2020; 6:20-29. [PMID: 34883501 PMCID: PMC7673594 DOI: 10.1159/000507988] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Accepted: 04/06/2020] [Indexed: 12/21/2022] Open
Abstract
The processes that created the primordial genome are inextricably linked to current day vulnerability to developing a psychiatric disorder as summarized in this review article. Chaos and dynamic forces including duplication, transposition, and recombination generated the protogenome. To survive early stages of genome evolution, self-organization emerged to curb chaos. Eventually, the human genome evolved through a delicate balance of chaos/instability and organization/stability. However, recombination coldspots, silencing of transposable elements, and other measures to limit chaos also led to retention of variants that increase risk for disease. Moreover, ongoing dynamics in the genome creates various new mutations that determine liability for psychiatric disorders. Homologous recombination, long-range gene regulation, and gene interactions were all guided by spooky action-at-a-distance, which increased variability in the system. A probabilistic system of life was required to deal with a changing environment. This ensured the generation of outliers in the population, which enhanced the probability that some members would survive unfavorable environmental impacts. Some of the outliers produced through this process in man are ill suited to cope with the complex demands of modern life. Genomic chaos and mental distress from the psychological challenges of modern living will inevitably converge to produce psychiatric disorders in man.
Collapse
Affiliation(s)
- Donard S. Dwyer
- Departments of Psychiatry and Behavioral Medicine and Pharmacology, Toxicology and Neuroscience, LSU Health Shreveport, Shreveport, Louisiana, USA
| |
Collapse
|
14
|
Rajarajan P, Akbarian S. Use of the epigenetic toolbox
to contextualize common variants associated with schizophrenia risk
. DIALOGUES IN CLINICAL NEUROSCIENCE 2020; 21:407-416. [PMID: 31949408 PMCID: PMC6952750 DOI: 10.31887/dcns.2019.21.4/sakbarian] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Schizophrenia is a debilitating psychiatric disorder with a complex genetic architecture and limited understanding of its neuropathology, reflected by the lack of diagnostic measures and effective pharmacological treatments. Geneticists have recently identified more than 145 risk loci comprising hundreds of common variants of small effect sizes, most of which lie in noncoding genomic regions. This review will discuss how the epigenetic toolbox can be applied to contextualize genetic findings in schizophrenia. Progress in next-generation sequencing, along with increasing methodological complexity, has led to the compilation of genome-wide maps of DNA methylation, histone modifications, RNA expression, and more. Integration of chromatin conformation datasets is one of the latest efforts in deciphering schizophrenia risk, allowing the identification of genes in contact with regulatory variants across 100s of kilobases. Large-scale multiomics studies will facilitate the prioritization of putative causal risk variants and gene networks that contribute to schizophrenia etiology, informing clinical diagnostics and treatment downstream.
.
Collapse
Affiliation(s)
- Prashanth Rajarajan
- Graduate School of Biomedical Sciences; Department of Psychiatry; Friedman Brain Institute; Icahn School of Medicine at Mount Sinai, New York, NY, US
| | - Schahram Akbarian
- Department of Psychiatry; Friedman Brain Institute; Icahn School of Medicine at Mount Sinai, New York, NY, US
| |
Collapse
|
15
|
Bocher O, Génin E. Rare variant association testing in the non-coding genome. Hum Genet 2020; 139:1345-1362. [PMID: 32500240 DOI: 10.1007/s00439-020-02190-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 05/29/2020] [Indexed: 12/25/2022]
Abstract
The development of next-generation sequencing technologies has opened-up some new possibilities to explore the contribution of genetic variants to human diseases and in particular that of rare variants. Statistical methods have been developed to test for association with rare variants that require the definition of testing units and, in these testing units, the selection of qualifying variants to include in the test. In the coding regions of the genome, testing units are usually the different genes and qualifying variants are selected based on their functional effects on the encoded proteins. Extending these tests to the non-coding regions of the genome is challenging. Testing units are difficult to define as the non-coding genome organisation is still rather unknown. Qualifying variants are difficult to select as the functional impact of non-coding variants on gene expression is hard to predict. These difficulties could explain why very few investigators so far have analysed the non-coding parts of their whole genome sequencing data. These non-coding parts yet represent the vast majority of the genome and some studies suggest that they could play a major role in disease susceptibility. In this review, we discuss recent experimental and statistical developments to gain knowledge on the non-coding genome and how this knowledge could be used to include rare non-coding variants in association tests. We describe the few studies that have considered variants from the non-coding genome in association tests and how they managed to define testing units and select qualifying variants.
Collapse
Affiliation(s)
- Ozvan Bocher
- Génétique, Génomique Fonctionnelle Et Biotechnologies, Faculté de Médecine, Univ Brest, Inserm, Inserm UMR1078, Bâtiment E-IBRBS 2ieme étage, 22 avenue Camille Desmoulins, 29238, Brest Cedex 3, France.
| | - Emmanuelle Génin
- Génétique, Génomique Fonctionnelle Et Biotechnologies, Faculté de Médecine, Univ Brest, Inserm, Inserm UMR1078, Bâtiment E-IBRBS 2ieme étage, 22 avenue Camille Desmoulins, 29238, Brest Cedex 3, France.
- CHU Brest, Brest, France.
| |
Collapse
|
16
|
Mah W, Won H. The three-dimensional landscape of the genome in human brain tissue unveils regulatory mechanisms leading to schizophrenia risk. Schizophr Res 2020; 217:17-25. [PMID: 30894290 PMCID: PMC6748876 DOI: 10.1016/j.schres.2019.03.007] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Revised: 02/27/2019] [Accepted: 03/06/2019] [Indexed: 12/18/2022]
Abstract
Recent advances in our understanding of the genetic architecture of schizophrenia have shed light on the schizophrenia etiology. While common variation is one of the major genetic contributors, the majority of common variation reside in non-coding genome, posing a significant challenge in understanding the functional impact of this class of genetic variation. Functional genomic datasets that range from expression quantitative trait loci (eQTL) to chromatin interactions are critical to identify the potential target genes and functional consequences of non-coding variation. In this review, we discuss how three-dimensional chromatin landscape, identified by a technique called Hi-C, has facilitated the identification of potential target genes impacting schizophrenia risk. We outline key steps for Hi-C driven gene mapping, and compare Hi-C defined schizophrenia risk genes defined across developmental epochs and cell types, which offer rich insights into the temporal window and cellular etiology of schizophrenia. In contrast with a neurodevelopmental hypothesis in schizophrenia, Hi-C defined schizophrenia risk genes are postnatally enriched, suggesting that postnatal development is also important for schizophrenia pathogenesis. Moreover, Hi-C defined schizophrenia risk genes are highly expressed in excitatory neurons, highlighting excitatory neurons as a central cell type for schizophrenia. Further characterization of Hi-C defined schizophrenia risk genes demonstrated enrichment for genes that harbor loss-of-function variation in neurodevelopmental disorders, suggesting a shared genetic etiology between schizophrenia and neurodevelopmental disorders. Collectively, moving the search space from risk variants to the target genes lays a foundation to understand the neurobiological basis of schizophrenia.
Collapse
Affiliation(s)
- Won Mah
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Hyejung Won
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA.
| |
Collapse
|
17
|
Espeso-Gil S, Halene T, Bendl J, Kassim B, Ben Hutta G, Iskhakova M, Shokrian N, Auluck P, Javidfar B, Rajarajan P, Chandrasekaran S, Peter CJ, Cote A, Birnbaum R, Liao W, Borrman T, Wiseman J, Bell A, Bannon MJ, Roussos P, Crary JF, Weng Z, Marenco S, Lipska B, Tsankova NM, Huckins L, Jiang Y, Akbarian S. A chromosomal connectome for psychiatric and metabolic risk variants in adult dopaminergic neurons. Genome Med 2020; 12:19. [PMID: 32075678 PMCID: PMC7031924 DOI: 10.1186/s13073-020-0715-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 01/30/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Midbrain dopaminergic neurons (MDN) represent 0.0005% of the brain's neuronal population and mediate cognition, food intake, and metabolism. MDN are also posited to underlay the neurobiological dysfunction of schizophrenia (SCZ), a severe neuropsychiatric disorder that is characterized by psychosis as well as multifactorial medical co-morbidities, including metabolic disease, contributing to markedly increased morbidity and mortality. Paradoxically, however, the genetic risk sequences of psychosis and traits associated with metabolic disease, such as body mass, show very limited overlap. METHODS We investigated the genomic interaction of SCZ with medical conditions and traits, including body mass index (BMI), by exploring the MDN's "spatial genome," including chromosomal contact landscapes as a critical layer of cell type-specific epigenomic regulation. Low-input Hi-C protocols were applied to 5-10 × 103 dopaminergic and other cell-specific nuclei collected by fluorescence-activated nuclei sorting from the adult human midbrain. RESULTS The Hi-C-reconstructed MDN spatial genome revealed 11 "Euclidean hot spots" of clustered chromatin domains harboring risk sequences for SCZ and elevated BMI. Inter- and intra-chromosomal contacts interconnecting SCZ and BMI risk sequences showed massive enrichment for brain-specific expression quantitative trait loci (eQTL), with gene ontologies, regulatory motifs and proteomic interactions related to adipogenesis and lipid regulation, dopaminergic neurogenesis and neuronal connectivity, and reward- and addiction-related pathways. CONCLUSIONS We uncovered shared nuclear topographies of cognitive and metabolic risk variants. More broadly, our PsychENCODE sponsored Hi-C study offers a novel genomic approach for the study of psychiatric and medical co-morbidities constrained by limited overlap of their respective genetic risk architectures on the linear genome.
Collapse
Affiliation(s)
- Sergio Espeso-Gil
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Tobias Halene
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- J.J. Peters Veterans Affairs Hospital, Bronx, NY, USA
| | - Jaroslav Bendl
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Pamela Sklar Division of Psychiatric Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Bibi Kassim
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Gabriella Ben Hutta
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Marina Iskhakova
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Neda Shokrian
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Pavan Auluck
- Human Brain Collection Core, National Institute of Mental Health, Bethesda, MD, USA
| | - Behnam Javidfar
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Prashanth Rajarajan
- MDPhD Program in the Graduate School of Biomedical Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Sandhya Chandrasekaran
- MDPhD Program in the Graduate School of Biomedical Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Cyril J Peter
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alanna Cote
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Pamela Sklar Division of Psychiatric Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Rebecca Birnbaum
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Will Liao
- New York Genome Center, New York, NY, 10013, USA
| | - Tyler Borrman
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, 01605, USA
| | - Jennifer Wiseman
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Aaron Bell
- Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Michael J Bannon
- Department of Pharmacology, Wayne State University, Detroit, MI, USA
| | - Panagiotis Roussos
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- J.J. Peters Veterans Affairs Hospital, Bronx, NY, USA
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Pamela Sklar Division of Psychiatric Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - John F Crary
- Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, 01605, USA
| | - Stefano Marenco
- Human Brain Collection Core, National Institute of Mental Health, Bethesda, MD, USA
| | - Barbara Lipska
- Human Brain Collection Core, National Institute of Mental Health, Bethesda, MD, USA
| | - Nadejda M Tsankova
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Laura Huckins
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Pamela Sklar Division of Psychiatric Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Yan Jiang
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Schahram Akbarian
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Friedman Brain Institute, Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
18
|
Mora A. Gene set analysis methods for the functional interpretation of non-mRNA data—Genomic range and ncRNA data. Brief Bioinform 2019; 21:1495-1508. [DOI: 10.1093/bib/bbz090] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 05/30/2019] [Accepted: 06/28/2019] [Indexed: 12/31/2022] Open
Abstract
Abstract
Gene set analysis (GSA) is one of the methods of choice for analyzing the results of current omics studies; however, it has been mainly developed to analyze mRNA (microarray, RNA-Seq) data. The following review includes an update regarding general methods and resources for GSA and then emphasizes GSA methods and tools for non-mRNA omics datasets, specifically genomic range data (ChIP-Seq, SNP and methylation) and ncRNA data (miRNAs, lncRNAs and others). In the end, the state of the GSA field for non-mRNA datasets is discussed, and some current challenges and trends are highlighted, especially the use of network approaches to face complexity issues.
Collapse
Affiliation(s)
- Antonio Mora
- Joint School of Life Sciences, Guangzhou Medical University and Guangzhou Institutes of Biomedicine and Health - Chinese Academy of Sciences
| |
Collapse
|
19
|
Higgins GA, Williams AM, Ade AS, Alam HB, Athey BD. Druggable Transcriptional Networks in the Human Neurogenic Epigenome. Pharmacol Rev 2019; 71:520-538. [PMID: 31530573 PMCID: PMC6750186 DOI: 10.1124/pr.119.017681] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Chromosome conformation capture methods have revealed the dynamics of genome architecture which is spatially organized into topologically associated domains, with gene regulation mediated by enhancer-promoter pairs in chromatin space. New evidence shows that endogenous hormones and several xenobiotics act within circumscribed topological domains of the spatial genome, impacting subsets of the chromatin contacts of enhancer-gene promoter pairs in cis and trans Results from the National Institutes of Health-funded PsychENCODE project and the study of chromatin remodeling complexes have converged to provide a clearer understanding of the organization of the neurogenic epigenome in humans. Neuropsychiatric diseases, including schizophrenia, bipolar spectrum disorder, autism spectrum disorder, attention deficit hyperactivity disorder, and other neuropsychiatric disorders are significantly associated with mutations in neurogenic transcriptional networks. In this review, we have reanalyzed the results from publications of the PsychENCODE Consortium using pharmacoinformatics network analysis to better understand druggable targets that control neurogenic transcriptional networks. We found that valproic acid and other psychotropic drugs directly alter these networks, including chromatin remodeling complexes, transcription factors, and other epigenetic modifiers. We envision a new generation of CNS therapeutics targeted at neurogenic transcriptional control networks, including druggable parts of chromatin remodeling complexes and master transcription factor-controlled pharmacogenomic networks. This may provide a route to the modification of interconnected gene pathways impacted by disease in patients with neuropsychiatric and neurodegenerative disorders. Direct and indirect therapeutic strategies to modify the master regulators of neurogenic transcriptional control networks may ultimately help extend the life span of CNS neurons impacted by disease.
Collapse
Affiliation(s)
- Gerald A Higgins
- Departments of Computational Medicine and Bioinformatics (G.A.H., A.S.A., B.D.A.), Surgery (A.M.W., H.B.A.), and Psychiatry (B.D.A.), University of Michigan Medical School, Ann Arbor, Michigan
| | - Aaron M Williams
- Departments of Computational Medicine and Bioinformatics (G.A.H., A.S.A., B.D.A.), Surgery (A.M.W., H.B.A.), and Psychiatry (B.D.A.), University of Michigan Medical School, Ann Arbor, Michigan
| | - Alex S Ade
- Departments of Computational Medicine and Bioinformatics (G.A.H., A.S.A., B.D.A.), Surgery (A.M.W., H.B.A.), and Psychiatry (B.D.A.), University of Michigan Medical School, Ann Arbor, Michigan
| | - Hasan B Alam
- Departments of Computational Medicine and Bioinformatics (G.A.H., A.S.A., B.D.A.), Surgery (A.M.W., H.B.A.), and Psychiatry (B.D.A.), University of Michigan Medical School, Ann Arbor, Michigan
| | - Brian D Athey
- Departments of Computational Medicine and Bioinformatics (G.A.H., A.S.A., B.D.A.), Surgery (A.M.W., H.B.A.), and Psychiatry (B.D.A.), University of Michigan Medical School, Ann Arbor, Michigan
| |
Collapse
|
20
|
Nandakumar SK, McFarland SK, Mateyka LM, Lareau CA, Ulirsch JC, Ludwig LS, Agarwal G, Engreitz JM, Przychodzen B, McConkey M, Cowley GS, Doench JG, Maciejewski JP, Ebert BL, Root DE, Sankaran VG. Gene-centric functional dissection of human genetic variation uncovers regulators of hematopoiesis. eLife 2019; 8:44080. [PMID: 31070582 PMCID: PMC6534380 DOI: 10.7554/elife.44080] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Accepted: 05/08/2019] [Indexed: 02/06/2023] Open
Abstract
Genome-wide association studies (GWAS) have identified thousands of variants associated with human diseases and traits. However, the majority of GWAS-implicated variants are in non-coding regions of the genome and require in depth follow-up to identify target genes and decipher biological mechanisms. Here, rather than focusing on causal variants, we have undertaken a pooled loss-of-function screen in primary hematopoietic cells to interrogate 389 candidate genes contained in 75 loci associated with red blood cell traits. Using this approach, we identify 77 genes at 38 GWAS loci, with most loci harboring 1-2 candidate genes. Importantly, the hit set was strongly enriched for genes validated through orthogonal genetic approaches. Genes identified by this approach are enriched in specific and relevant biological pathways, allowing regulators of human erythropoiesis and modifiers of blood diseases to be defined. More generally, this functional screen provides a paradigm for gene-centric follow up of GWAS for a variety of human diseases and traits.
Collapse
Affiliation(s)
- Satish K Nandakumar
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States
| | - Sean K McFarland
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States
| | - Laura M Mateyka
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States.,Biochemistry Center (BZH), Ruprecht-Karls-University Heidelberg, Heidelberg, Germany
| | - Caleb A Lareau
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States.,Program in Biological and Medical Sciences, Harvard Medical School, Boston, United States
| | - Jacob C Ulirsch
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States.,Program in Biological and Medical Sciences, Harvard Medical School, Boston, United States
| | - Leif S Ludwig
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States
| | - Gaurav Agarwal
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States.,University of Oxford, Oxford, United Kingdom.,Harvard Stem Cell Institute, Cambridge, United States
| | - Jesse M Engreitz
- Broad Institute of MIT and Harvard, Cambridge, United States.,Harvard Society of Fellows, Harvard University, Cambridge, United States
| | - Bartlomiej Przychodzen
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, United States
| | - Marie McConkey
- Division of Hematology, Brigham and Women's Hospital, Boston, United States
| | - Glenn S Cowley
- Broad Institute of MIT and Harvard, Cambridge, United States
| | - John G Doench
- Broad Institute of MIT and Harvard, Cambridge, United States
| | - Jaroslaw P Maciejewski
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, United States
| | - Benjamin L Ebert
- Broad Institute of MIT and Harvard, Cambridge, United States.,Division of Hematology, Brigham and Women's Hospital, Boston, United States.,Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, United States.,Howard Hughes Medical Institute, Chevy Chase, United States
| | - David E Root
- Broad Institute of MIT and Harvard, Cambridge, United States
| | - Vijay G Sankaran
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, United States.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, United States.,Broad Institute of MIT and Harvard, Cambridge, United States.,Harvard Stem Cell Institute, Cambridge, United States
| |
Collapse
|