Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Arbiza L, Gronau I, Aksoy BA, Hubisz MJ, Gulko B, Keinan A, Siepel A. Genome-wide inference of natural selection on human transcription factor binding sites. Nat Genet 2013;45:723-9. [PMID: 23749186 DOI: 10.1038/ng.2658] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2013] [Accepted: 05/08/2013] [Indexed: 11/09/2022]

Cited by Other Article(s)

Chen Y, Liu S, Ren Z, Wang F, Liang Q, Jiang Y, Dai R, Duan F, Han C, Ning Z, Xia Y, Li M, Yuan K, Qiu W, Yan XX, Dai J, Kopp RF, Huang J, Xu S, Tang B, Wu L, Gamazon ER, Bigdeli T, Gershon E, Huang H, Ma C, Liu C, Chen C. Cross-ancestry analysis of brain QTLs enhances interpretation of schizophrenia genome-wide association studies. Am J Hum Genet 2024:S0002-9297(24)00336-7. [PMID: 39362218 DOI: 10.1016/j.ajhg.2024.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Revised: 09/04/2024] [Accepted: 09/06/2024] [Indexed: 10/05/2024] Open

Affiliation(s)

Yu Chen MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China; Broad Institute of MIT and Harvard, Cambridge, MA, USA
Sihan Liu MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China; Institute of Rare Diseases, West China Hospital, Sichuan University, Chengdu, China
Zongyao Ren MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Feiran Wang MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Qiuman Liang MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Yi Jiang MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Rujia Dai Department of Psychiatry, SUNY Upstate Medical University, Syracuse, NY, USA
Fangyuan Duan MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Cong Han MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Zhilin Ning Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
Yan Xia Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
Miao Li MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Kai Yuan Broad Institute of MIT and Harvard, Cambridge, MA, USA
Wenying Qiu Institute of Basic Medical Sciences, Neuroscience Center, National Human Brain Bank for Development and Function, Chinese Academy of Medical Sciences, Department of Human Anatomy, Histology and Embryology, School of Basic Medicine, Peking Union Medical College, Beijing, China
Xiao-Xin Yan Department of Human Anatomy and Neurobiology, Xiangya School of Medicine, Central South University, Changsha, China
Jiapei Dai Wuhan Institute for Neuroscience and Engineering, South-Central University for Nationalities, Wuhan, China
Richard F Kopp Department of Psychiatry, SUNY Upstate Medical University, Syracuse, NY, USA
Jufang Huang Department of Human Anatomy and Neurobiology, Xiangya School of Medicine, Central South University, Changsha, China
Shuhua Xu State Key Laboratory of Genetic Engineering, Center for Evolutionary Biology, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
Beisha Tang National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
Lingqian Wu MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China
Eric R Gamazon Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA
Tim Bigdeli Institute for Genomics in Health, SUNY Downstate Health Sciences University, Brooklyn, NY, USA
Elliot Gershon Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, IL, USA
Hailiang Huang Broad Institute of MIT and Harvard, Cambridge, MA, USA
Chao Ma Institute of Basic Medical Sciences, Neuroscience Center, National Human Brain Bank for Development and Function, Chinese Academy of Medical Sciences, Department of Human Anatomy, Histology and Embryology, School of Basic Medicine, Peking Union Medical College, Beijing, China
Chunyu Liu MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China; Department of Psychiatry, SUNY Upstate Medical University, Syracuse, NY, USA
Chao Chen MOE Key Laboratory of Rare Pediatric Diseases & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan 410000, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China; Hunan Key Laboratory of Animal Models for Human Diseases, Central South University, Changsha, China

Chen Y, Liu S, Ren Z, Wang F, Jiang Y, Dai R, Duan F, Han C, Ning Z, Xia Y, Li M, Yuan K, Qiu W, Yan XX, Dai J, Kopp RF, Huang J, Xu S, Tang B, Gamazon ER, Bigdeli T, Gershon E, Huang H, Ma C, Liu C, Chen C. Brain eQTLs of European, African American, and Asian ancestry improve interpretation of schizophrenia GWAS. medRxiv 2024:2024.02.13.24301833. [PMID: 38405973 PMCID: PMC10888997 DOI: 10.1101/2024.02.13.24301833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]

Abstract

Research on brain expression quantitative trait loci (eQTLs) has illuminated the genetic underpinnings of schizophrenia (SCZ). Yet, the majority of these studies have been centered on European populations, leading to a constrained understanding of population diversities and disease risks. To address this gap, we examined genotype and RNA-seq data from African Americans (AA, n=158), Europeans (EUR, n=408), and East Asians (EAS, n=217). When comparing eQTLs between EUR and non-EUR populations, we observed concordant patterns of genetic regulatory effect, particularly in terms of the effect sizes of the eQTLs. However, 343,737 cis-eQTLs (representing ~17% of all eQTL pairs) linked to 1,276 genes (about 10% of all eGenes) and 198,769 SNPs (approximately 16% of all eSNPs) were identified only in the non-EUR populations. Over 90% of observed population differences in eQTLs could be traced back to differences in allele frequency. Furthermore, 35% of these eQTLs were notably rare (MAF < 0.05) in the EUR population. Integrating brain eQTLs with SCZ signals from diverse populations, we observed a higher disease heritability enrichment of brain eQTLs in matched populations compared to mismatched ones. Prioritization analysis identified seven new risk genes (SFXN2, RP11-282018.3, CYP17A1, VPS37B, DENR, FTCDNL1, and NT5DC2), and three potential novel regulatory variants in known risk genes (CNNM2, C12orf65, and MPHOSPH9) that were missed in the EUR dataset. Our findings underscore that increasing genetic ancestral diversity is more efficient for power improvement than merely increasing the sample size within single-ancestry eQTL datasets. Such a strategy will not only improve our understanding of the biological underpinnings of population structures but also pave the way for the identification of novel risk genes in SCZ.

Wang X, Ingvarsson PK. Quantifying adaptive evolution and the effects of natural selection across the Norway spruce genome. Mol Ecol 2023;32:5288-5304. [PMID: 37622583 DOI: 10.1111/mec.17106] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 08/07/2023] [Accepted: 08/09/2023] [Indexed: 08/26/2023]

Liu Z, Samee M. Structural underpinnings of mutation rate variations in the human genome. Nucleic Acids Res 2023;51:7184-7197. [PMID: 37395403 PMCID: PMC10415140 DOI: 10.1093/nar/gkad551] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 06/06/2023] [Accepted: 06/15/2023] [Indexed: 07/04/2023] Open

Traniello IM, Bukhari SA, Dibaeinia P, Serrano G, Avalos A, Ahmed AC, Sankey AL, Hernaez M, Sinha S, Zhao SD, Catchen J, Robinson GE. Single-cell dissection of aggression in honeybee colonies. Nat Ecol Evol 2023;7:1232-1244. [PMID: 37264201 DOI: 10.1038/s41559-023-02090-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 05/09/2023] [Indexed: 06/03/2023]

Tuncay IO, DeVries D, Gogate A, Kaur K, Kumar A, Xing C, Goodspeed K, Seyoum-Tesfa L, Chahrour MH. The genetics of autism spectrum disorder in an East African familial cohort. Cell Genom 2023;3:100322. [PMID: 37492102 PMCID: PMC10363748 DOI: 10.1016/j.xgen.2023.100322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 03/09/2023] [Accepted: 04/16/2023] [Indexed: 07/27/2023]

Affiliation(s)

Islam Oguz Tuncay Department of Neuroscience, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
Darlene DeVries Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
Ashlesha Gogate Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
Kiran Kaur Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
Ashwani Kumar Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
Chao Xing Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
Kimberly Goodspeed Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Department of Neurology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
Leah Seyoum-Tesfa Reaching Families Advocacy and Support Group, Dallas, TX 75243, USA
Maria H Chahrour Department of Neuroscience, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Peter O'Donnell Jr. Brain Institute, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA

Mehta TK, Man A, Ciezarek A, Ranson K, Penman D, Di-Palma F, Haerty W. Chromatin accessibility in gill tissue identifies candidate genes and loci associated with aquaculture relevant traits in tilapia. Genomics 2023;115:110633. [PMID: 37121445 DOI: 10.1016/j.ygeno.2023.110633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 04/25/2023] [Accepted: 04/26/2023] [Indexed: 05/02/2023]

Zhang X, Fang B, Huang YF. Transcription factor binding sites are frequently under accelerated evolution in primates. Nat Commun 2023;14:783. [PMID: 36774380 PMCID: PMC9922303 DOI: 10.1038/s41467-023-36421-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 01/31/2023] [Indexed: 02/13/2023] Open

Linker SB, Narvaiza I, Hsu JY, Wang M, Qiu F, Mendes APD, Oefner R, Kottilil K, Sharma A, Randolph-Moore L, Mejia E, Santos R, Marchetto MC, Gage FH. Human-specific regulation of neural maturation identified by cross-primate transcriptomics. Curr Biol 2022;32:4797-4807.e5. [PMID: 36228612 DOI: 10.1016/j.cub.2022.09.028] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 07/08/2022] [Accepted: 09/14/2022] [Indexed: 11/06/2022]

Affiliation(s)

Sara B Linker Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Iñigo Narvaiza Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Jonathan Y Hsu Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Meiyan Wang Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Fan Qiu Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Ana P D Mendes Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Ruth Oefner Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Kalyani Kottilil Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Amandeep Sharma Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Lynne Randolph-Moore Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Eunice Mejia Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA
Renata Santos Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA; Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, Laboratory of Dynamics of Neuronal Structure in Health and Disease, 102 rue de la Santé, 75014 Paris, France; Institut des Sciences Biologiques, CNRS, 16 rue Pierre et Marie Curie, 75005 Paris, France
Maria C Marchetto Department of Anthropology, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA; Center for Academic Research and Training in Anthropogeny (CARTA), University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
Fred H Gage Laboratory of Genetics, Salk Institute for Biological Studies, 10010 North Pines Road, La Jolla, CA 92037, USA

Exploration of Tools for the Interpretation of Human Non-Coding Variants. Int J Mol Sci 2022;23:ijms232112977. [PMID: 36361767 PMCID: PMC9654743 DOI: 10.3390/ijms232112977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 10/17/2022] [Accepted: 10/23/2022] [Indexed: 02/01/2023] Open

Ramstein GP, Buckler ES. Prediction of evolutionary constraint by genomic annotations improves functional prioritization of genomic variants in maize. Genome Biol 2022;23:183. [PMID: 36050782 PMCID: PMC9438327 DOI: 10.1186/s13059-022-02747-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 08/15/2022] [Indexed: 11/10/2022] Open

Abstract

Background

Crop improvement through cross-population genomic prediction and genome editing requires identification of causal variants at high resolution, within fewer than hundreds of base pairs. Most genetic mapping studies have generally lacked such resolution. In contrast, evolutionary approaches can detect genetic effects at high resolution, but they are limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Here we use genomic annotations to accurately predict nucleotide conservation across angiosperms, as a proxy for fitness effect of mutations.

Results

Using only sequence analysis, we annotate nonsynonymous mutations in 25,824 maize gene models, with information from bioinformatics and deep learning. Our predictions are validated by experimental information: within-species conservation, chromatin accessibility, and gene expression. According to gene ontology and pathway enrichment analyses, predicted nucleotide conservation points to genes in central carbon metabolism. Importantly, it improves genomic prediction for fitness-related traits such as grain yield, in elite maize panels, by stringent prioritization of fewer than 1% of single-site variants.

Conclusions

Our results suggest that predicting nucleotide conservation across angiosperms may effectively prioritize sites most likely to impact fitness-related traits in crops, without being limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Our approach—Prediction of mutation Impact by Calibrated Nucleotide Conservation (PICNC)—could be useful to select polymorphisms for accurate genomic prediction, and candidate mutations for efficient base editing. The trained PICNC models and predicted nucleotide conservation at protein-coding SNPs in maize are publicly available in CyVerse (10.25739/hybz-2957).

Supplementary Information

The online version contains supplementary material available at 10.1186/s13059-022-02747-2.

Dukler N, Mughal MR, Ramani R, Huang YF, Siepel A. Extreme purifying selection against point mutations in the human genome. Nat Commun 2022;13:4312. [PMID: 35879308 PMCID: PMC9314448 DOI: 10.1038/s41467-022-31872-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 07/07/2022] [Indexed: 12/13/2022] Open

Systems biology analysis of human genomes points to key pathways conferring spina bifida risk. Proc Natl Acad Sci U S A 2021;118:2106844118. [PMID: 34916285 PMCID: PMC8713748 DOI: 10.1073/pnas.2106844118] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/20/2021] [Indexed: 12/15/2022] Open

Abstract

Genetic investigations of most structural birth defects, including spina bifida (SB), congenital heart disease, and craniofacial anomalies, have been underpowered for genome-wide association studies because of their rarity, genetic heterogeneity, incomplete penetrance, and environmental influences. Our systems biology strategy to investigate SB predisposition controls for population stratification and avoids much of the bias inherent in candidate gene searches that are pervasive in the field. We examine both protein coding and noncoding regions of whole genomes to analyze sequence variants, collapsed by gene or regulatory region, and apply machine learning, gene enrichment, and pathway analyses to elucidate molecular pathways and genes contributing to human SB.

Spina bifida (SB) is a debilitating birth defect caused by multiple gene and environment interactions. Though SB shows non-Mendelian inheritance, genetic factors contribute to an estimated 70% of cases. Nevertheless, identifying human mutations conferring SB risk is challenging due to its relative rarity, genetic heterogeneity, incomplete penetrance, and environmental influences that hamper genome-wide association studies approaches to untargeted discovery. Thus, SB genetic studies may suffer from population substructure and/or selection bias introduced by typical candidate gene searches. We report a population based, ancestry-matched whole-genome sequence analysis of SB genetic predisposition using a systems biology strategy to interrogate 298 case-control subject genomes (149 pairs). Genes that were enriched in likely gene disrupting (LGD), rare protein-coding variants were subjected to machine learning analysis to identify genes in which LGD variants occur with a different frequency in cases versus controls and so discriminate between these groups. Those genes with high discriminatory potential for SB significantly enriched pathways pertaining to carbon metabolism, inflammation, innate immunity, cytoskeletal regulation, and essential transcriptional regulation consistent with their having impact on the pathogenesis of human SB. Additionally, an interrogation of conserved noncoding sequences identified robust variant enrichment in regulatory regions of several transcription factors critical to embryonic development. This genome-wide perspective offers an effective approach to the interrogation of coding and noncoding sequence variant contributions to rare complex genetic disorders.

Joshi M, Kapopoulou A, Laurent S. Impact of Genetic Variation in Gene Regulatory Sequences: A Population Genomics Perspective. Front Genet 2021;12:660899. [PMID: 34276769 PMCID: PMC8282999 DOI: 10.3389/fgene.2021.660899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 05/31/2021] [Indexed: 01/06/2023] Open

Zhou Y, Lauschke VM. Computational Tools to Assess the Functional Consequences of Rare and Noncoding Pharmacogenetic Variability. Clin Pharmacol Ther 2021;110:626-636. [PMID: 33998671 DOI: 10.1002/cpt.2289] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 05/07/2021] [Indexed: 12/19/2022]

Zrimec J, Buric F, Kokina M, Garcia V, Zelezniak A. Learning the Regulatory Code of Gene Expression. Front Mol Biosci 2021;8:673363. [PMID: 34179082 PMCID: PMC8223075 DOI: 10.3389/fmolb.2021.673363] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Accepted: 05/24/2021] [Indexed: 11/13/2022] Open

Tseng CC, Wong MC, Liao WT, Chen CJ, Lee SC, Yen JH, Chang SJ. Genetic Variants in Transcription Factor Binding Sites in Humans: Triggered by Natural Selection and Triggers of Diseases. Int J Mol Sci 2021;22:ijms22084187. [PMID: 33919522 PMCID: PMC8073710 DOI: 10.3390/ijms22084187] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 04/15/2021] [Accepted: 04/16/2021] [Indexed: 12/14/2022] Open

Zrimec J, Börlin CS, Buric F, Muhammad AS, Chen R, Siewers V, Verendel V, Nielsen J, Töpel M, Zelezniak A. Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure. Nat Commun 2020;11:6141. [PMID: 33262328 PMCID: PMC7708451 DOI: 10.1038/s41467-020-19921-4] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 11/02/2020] [Indexed: 12/31/2022] Open

Affiliation(s)

Jan Zrimec Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Christoph S Börlin Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden; Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Filip Buric Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Azam Sheikh Muhammad Computer Science and Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Rhongzen Chen Computer Science and Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Verena Siewers Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden; Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Vilhelm Verendel Computer Science and Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Jens Nielsen Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden; Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
Mats Töpel Department of Marine Sciences, University of Gothenburg, Box 461, SE-405 30, Gothenburg, Sweden; Gothenburg Global Biodiversity Center (GGBC), Box 461, 40530, Gothenburg, Sweden
Aleksej Zelezniak Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden; Science for Life Laboratory, Tomtebodavägen 23a, SE-171 65, Stockholm, Sweden

Liu J, Robinson-Rechavi M. Robust inference of positive selection on regulatory sequences in the human brain. Sci Adv 2020;6:eabc9863. [PMID: 33246961 PMCID: PMC7695467 DOI: 10.1126/sciadv.abc9863] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Accepted: 10/16/2020] [Indexed: 05/07/2023]

Zhang H, Shi X, Huang T, Zhao X, Chen W, Gu N, Zhang R. Dynamic landscape and evolution of m6A methylation in human. Nucleic Acids Res 2020;48:6251-6264. [PMID: 32406913 PMCID: PMC7293016 DOI: 10.1093/nar/gkaa347] [Citation(s) in RCA: 181] [Impact Index Per Article: 45.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2020] [Revised: 04/23/2020] [Accepted: 04/24/2020] [Indexed: 01/03/2023] Open

Selection against archaic hominin genetic variation in regulatory regions. Nat Ecol Evol 2020;4:1558-1566. [PMID: 32839541 DOI: 10.1038/s41559-020-01284-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 07/21/2020] [Indexed: 01/20/2023]

Huang YF. Unified inference of missense variant effects and gene constraints in the human genome. PLoS Genet 2020;16:e1008922. [PMID: 32667917 PMCID: PMC7384676 DOI: 10.1371/journal.pgen.1008922] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 07/27/2020] [Accepted: 06/09/2020] [Indexed: 01/25/2023] Open

Abstract

A challenge in medical genomics is to identify variants and genes associated with severe genetic disorders. Based on the premise that severe, early-onset disorders often result in a reduction of evolutionary fitness, several statistical methods have been developed to predict pathogenic variants or constrained genes based on the signatures of negative selection in human populations. However, we currently lack a statistical framework to jointly predict deleterious variants and constrained genes from both variant-level features and gene-level selective constraints. Here we present such a unified approach, UNEECON, based on deep learning and population genetics. UNEECON treats the contributions of variant-level features and gene-level constraints as a variant-level fixed effect and a gene-level random effect, respectively. The sum of the fixed and random effects is then combined with an evolutionary model to infer the strength of negative selection at both variant and gene levels. Compared with previously published methods, UNEECON shows improved performance in predicting missense variants and protein-coding genes associated with autosomal dominant disorders, and feature importance analysis suggests that both gene-level selective constraints and variant-level predictors are important for accurate variant prioritization. Furthermore, based on UNEECON, we observe a low correlation between gene-level intolerance to missense mutations and that to loss-of-function mutations, which can be partially explained by the prevalence of disordered protein regions that are highly tolerant to missense mutations. Finally, we show that genes intolerant to both missense and loss-of-function mutations play key roles in the central nervous system and the autism spectrum disorders. Overall, UNEECON is a promising framework for both variant and gene prioritization.

Numerous statistical methods have been developed to predict deleterious missense variants or constrained genes in the human genome, but unified prioritization methods that utilize both variant- and gene-level information are underdeveloped. Here we present UNEECON, an evolution-based deep learning framework for unified variant and gene prioritization. By integrating variant-level predictors and gene-level selective constraints, UNEECON outperforms existing methods in predicting missense variants and protein-coding genes associated with dominant disorders. Based on UNEECON, we show that disordered proteins are tolerant to missense mutations but not to loss-of-function mutations. In addition, we find that genes under strong selective constraints at both missense and loss-of-function levels are strongly associated with the central nervous system and the autism spectrum disorders, highlighting the need to investigate the function of these highly constrained genes in future studies.

Dukler N, Huang YF, Siepel A. Phylogenetic Modeling of Regulatory Element Turnover Based on Epigenomic Data. Mol Biol Evol 2020;37:2137-2152. [PMID: 32176292 PMCID: PMC7306682 DOI: 10.1093/molbev/msaa073] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Takeda JI, Nanatsue K, Yamagishi R, Ito M, Haga N, Hirata H, Ogi T, Ohno K. InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution. NAR Genom Bioinform 2020;2:lqaa038. [PMID: 33543123 PMCID: PMC7671370 DOI: 10.1093/nargab/lqaa038] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Revised: 03/03/2020] [Accepted: 05/13/2020] [Indexed: 12/15/2022] Open

Russell LE, Schwarz UI. Variant discovery using next-generation sequencing and its future role in pharmacogenetics. Pharmacogenomics 2020;21:471-486. [DOI: 10.2217/pgs-2019-0190] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Walker RL, Ramaswami G, Hartl C, Mancuso N, Gandal MJ, de la Torre-Ubieta L, Pasaniuc B, Stein JL, Geschwind DH. Genetic Control of Expression and Splicing in Developing Human Brain Informs Disease Mechanisms. Cell 2019;179:750-771.e22. [PMID: 31626773 PMCID: PMC8963725 DOI: 10.1016/j.cell.2019.09.021] [Citation(s) in RCA: 145] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Revised: 06/06/2019] [Accepted: 09/20/2019] [Indexed: 02/08/2023]

Affiliation(s)

Rebecca L Walker Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA; Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, CA 90095, USA
Gokul Ramaswami Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA
Christopher Hartl Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA; Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, CA 90095, USA
Nicholas Mancuso Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90024, USA
Michael J Gandal Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA
Luis de la Torre-Ubieta Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA; Department of Psychiatry, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA
Bogdan Pasaniuc Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90024, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
Jason L Stein Department of Genetics and UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
Daniel H Geschwind Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA; Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA 90095, USA.

Further Defining the Phenotypic Spectrum of B3GAT3 Mutations and Literature Review on Linkeropathy Syndromes. Genes (Basel) 2019;10:genes10090631. [PMID: 31438591 PMCID: PMC6770791 DOI: 10.3390/genes10090631] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 08/09/2019] [Accepted: 08/19/2019] [Indexed: 11/29/2022] Open

Lipan O, Wu E. A stochastic switch with different phases. CHAOS (WOODBURY, N.Y.) 2019;29:083107. [PMID: 31472510 DOI: 10.1063/1.5096778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 07/16/2019] [Indexed: 06/10/2023]

Berger MJ, Wenger AM, Guturu H, Bejerano G. Independent erosion of conserved transcription factor binding sites points to shared hindlimb, vision and external testes loss in different mammals. Nucleic Acids Res 2019;46:9299-9308. [PMID: 30137416 PMCID: PMC6182171 DOI: 10.1093/nar/gky741] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Accepted: 08/21/2018] [Indexed: 02/05/2023] Open

Huang YF, Siepel A. Estimation of allele-specific fitness effects across human protein-coding sequences and implications for disease. Genome Res 2019;29:1310-1321. [PMID: 31249063 PMCID: PMC6673719 DOI: 10.1101/gr.245522.118] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Accepted: 06/20/2019] [Indexed: 12/16/2022]

Gain of transcription factor binding sites is associated to changes in the expression signature of human brain and testis and is correlated to genes with higher expression breadth. SCIENCE CHINA-LIFE SCIENCES 2019;62:526-534. [PMID: 30919278 DOI: 10.1007/s11427-018-9454-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Accepted: 10/15/2018] [Indexed: 11/26/2022]

Walter Costa MB, Höner zu Siederdissen C, Dunjić M, Stadler PF, Nowick K. SSS-test: a novel test for detecting positive selection on RNA secondary structure. BMC Bioinformatics 2019;20:151. [PMID: 30898084 PMCID: PMC6429701 DOI: 10.1186/s12859-019-2711-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 03/03/2019] [Indexed: 12/23/2022] Open

Abstract

BACKGROUND

Long non-coding RNAs (lncRNAs) play an important role in regulating gene expression and are thus important for determining phenotypes. Most attempts to measure selection in lncRNAs have focused on the primary sequence. The majority of small RNAs and at least some parts of lncRNAs must fold into specific structures to perform their biological function. Comprehensive assessments of selection acting on RNAs therefore must also encompass structure. Selection pressures acting on the structure of non-coding genes can be detected within multiple sequence alignments. Approaches of this type, however, have so far focused on negative selection. Thus, a computational method for identifying ncRNAs under positive selection is needed.

RESULTS

We introduce the SSS-test (test for Selection on Secondary Structure) to identify positive selection and thus adaptive evolution. Benchmarks with biological as well as synthetic controls yield coherent signals for both negative and positive selection, demonstrating the functionality of the test. A survey of a lncRNA collection comprising 15,443 families resulted in 110 candidates that appear to be under positive selection in human. In 26 lncRNAs that have been associated with psychiatric disorders we identified local structures that have signs of positive selection in the human lineage.

CONCLUSIONS

It is feasible to assay positive selection acting on RNA secondary structures on a genome-wide scale. The detection of human-specific positive selection in lncRNAs associated with cognitive disorder provides a set of candidate genes for further experimental testing and may provide insights into the evolution of cognitive abilities in humans.

AVAILABILITY

The SSS-test and related software is available at: https://github.com/waltercostamb/SSS-test . The databases used in this work are available at: http://www.bioinf.uni-leipzig.de/Software/SSS-test/ .

Affiliation(s)

Maria Beatriz Walter Costa Embrapa Agroenergia, Parque Estação Biológica (PqEB), Asa Norte, Brasília, DF, 70770-901 Brazil; Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, 04107 Germany
Christian Höner zu Siederdissen Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, 04107 Germany
Marko Dunjić Human Biology Group, Institute for Biology, Department of Biology, Chemistry, Pharmacy, Freie Universitaet Berlin, Königin-Luise-Straße 1-3, Berlin, 14195 Germany; Center for Human Molecular Genetics, Faculty of Biology, University of Belgrade, Studentski trg 16, PO box 43, Belgrade, 11000 Serbia
Peter F. Stadler Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, 04107 Germany; German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig & Competence Center for Scalable Data Services and Solutions Dresden-Leipzig & Leipzig Research Center for Civilization Diseases, University Leipzig, Leipzig, 04107 Germany; Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, 04103 Germany; Department of Theoretical Chemistry, University of Vienna, Währinger Straße 17, Vienna, A-1090 Austria; Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark; Facultad de Ciencias, Universidad Nacional de Colombia, Sede Bogotá, Ciudad Universitaria, Bogotá, D.C., COL-111321 Colombia; Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM 87501 USA
Katja Nowick Human Biology Group, Institute for Biology, Department of Biology, Chemistry, Pharmacy, Freie Universitaet Berlin, Königin-Luise-Straße 1-3, Berlin, 14195 Germany; TFome Research Group, Bioinformatics Group, Interdisciplinary Center of Bioinformatics, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, Leipzig, 04107 Germany; Paul-Flechsig-Institute for Brain Research, University of Leipzig, Liebigstraße 19. Haus C, Leipzig, 04103 Germany; Bioinformatics, Faculty of Agricultural Sciences, Institute of Animal Science, University of Hohenheim, Garbenstraße 13, Stuttgart, 70593 Germany

Gulko B, Siepel A. An evolutionary framework for measuring epigenomic information and estimating cell-type-specific fitness consequences. Nat Genet 2018;51:335-342. [PMID: 30559490 DOI: 10.1038/s41588-018-0300-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 10/30/2018] [Indexed: 01/22/2023]

Zhou Y, Fujikura K, Mkrtchian S, Lauschke VM. Computational Methods for the Pharmacogenetic Interpretation of Next Generation Sequencing Data. Front Pharmacol 2018;9:1437. [PMID: 30564131 PMCID: PMC6288784 DOI: 10.3389/fphar.2018.01437] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Accepted: 11/20/2018] [Indexed: 12/21/2022] Open

Reshef YA, Finucane HK, Kelley DR, Gusev A, Kotliar D, Ulirsch JC, Hormozdiari F, Nasser J, O'Connor L, van de Geijn B, Loh PR, Grossman SR, Bhatia G, Gazal S, Palamara PF, Pinello L, Patterson N, Adams RP, Price AL. Detecting genome-wide directional effects of transcription factor binding on polygenic disease risk. Nat Genet 2018;50:1483-1493. [PMID: 30177862 PMCID: PMC6202062 DOI: 10.1038/s41588-018-0196-7] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Accepted: 07/11/2018] [Indexed: 12/19/2022]

Affiliation(s)

Yakir A Reshef Department of Computer Science, Harvard University, Cambridge, MA, USA. Harvard/MIT MD/PhD Program, Boston, MA, USA. Broad Institute of MIT and Harvard, Cambridge, MA, USA.
Hilary K Finucane Broad Institute of MIT and Harvard, Cambridge, MA, USA
David R Kelley California Life Sciences LLC, South San Francisco, CA, USA
Alexander Gusev Dana Farber Cancer Institute, Boston, MA, USA
Dylan Kotliar Broad Institute of MIT and Harvard, Cambridge, MA, USA
Jacob C Ulirsch Broad Institute of MIT and Harvard, Cambridge, MA, USA; Dana Farber Cancer Institute, Boston, MA, USA; Boston Children's Hospital, Boston, MA, USA
Farhad Hormozdiari Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Joseph Nasser Broad Institute of MIT and Harvard, Cambridge, MA, USA
Luke O'Connor Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Program in Bioinformatics and Integrative Genomics, Harvard University, Cambridge, MA, USA
Bryce van de Geijn Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Po-Ru Loh Broad Institute of MIT and Harvard, Cambridge, MA, USA; Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
Sharon R Grossman Harvard/MIT MD/PhD Program, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA
Gaurav Bhatia Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Steven Gazal Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Pier Francesco Palamara Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Statistics, University of Oxford, Oxford, UK
Luca Pinello Broad Institute of MIT and Harvard, Cambridge, MA, USA; Massachusetts General Hospital, Charlestown, MA, USA; Department of Pathology, Harvard Medical School, Boston, MA, USA
Nick Patterson Broad Institute of MIT and Harvard, Cambridge, MA, USA
Ryan P Adams Google Brain, New York, NY, USA; Department of Computer Science, Princeton University, Princeton, NJ, USA
Alkes L Price Broad Institute of MIT and Harvard, Cambridge, MA, USA. Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA. Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.

Davis GE, Lowell WE. Solar energy at birth and human lifespan. JOURNAL OF PHOTOCHEMISTRY AND PHOTOBIOLOGY B-BIOLOGY 2018;186:59-68. [DOI: 10.1016/j.jphotobiol.2018.07.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Revised: 06/29/2018] [Accepted: 07/04/2018] [Indexed: 01/03/2023]

Niu M, Tabari E, Ni P, Su Z. Towards a map of cis-regulatory sequences in the human genome. Nucleic Acids Res 2018;46:5395-5409. [PMID: 29733395 PMCID: PMC6009671 DOI: 10.1093/nar/gky338] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Revised: 04/14/2018] [Accepted: 04/19/2018] [Indexed: 01/10/2023] Open

Lee KS, Chatterjee P, Choi EY, Sung MK, Oh J, Won H, Park SM, Kim YJ, Yi SV, Choi JK. Selection on the regulation of sympathetic nervous activity in humans and chimpanzees. PLoS Genet 2018;14:e1007311. [PMID: 29672586 PMCID: PMC5908061 DOI: 10.1371/journal.pgen.1007311] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2017] [Accepted: 03/17/2018] [Indexed: 12/31/2022] Open

Sheep genome functional annotation reveals proximal regulatory elements contributed to the evolution of modern breeds. Nat Commun 2018;9:859. [PMID: 29491421 PMCID: PMC5830443 DOI: 10.1038/s41467-017-02809-1] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 12/03/2017] [Indexed: 12/30/2022] Open

Harakalova M, Asselbergs FW. Systems analysis of dilated cardiomyopathy in the next generation sequencing era. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2018;10:e1419. [PMID: 29485202 DOI: 10.1002/wsbm.1419] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Revised: 12/31/2017] [Accepted: 01/17/2018] [Indexed: 12/17/2022]

Abstract

Dilated cardiomyopathy (DCM) is a form of severe failure of cardiac muscle caused by a long list of etiologies ranging from myocardial infarction, DNA mutations in cardiac genes, to toxics. Systems analysis integrating next-generation sequencing (NGS)-based omics approaches, such as the sequencing of DNA, RNA, and chromatin, provide valuable insights into DCM mechanisms. The outcome and interpretation of NGS methods can be affected by the localization of cardiac biopsy, level of tissue degradation, and variable ratios of different cell populations, especially in the presence of fibrosis. Heart tissue composition may even differ between sexes, or siblings carrying the same disease causing mutation. Therefore, before planning any experiments, it is important to fully appreciate the complexities of DCM, and the selection of samples suitable for given research question should be an interdisciplinary effort involving clinicians and biologists. The list of NGS omics datasets in DCM to date is short. More studies have to be performed to contribute to public data repositories and facilitate systems analysis. In addition, proper data integration is a difficult task requiring complex computational approaches. Despite these complications, there are multiple promising implications of systems analysis in DCM. By combining various types of datasets, for example, RNA-seq, ChIP-seq, or 4C, deep insights into cardiac biology, and possible biomarkers and treatment targets, can be gained. Systems analysis can also facilitate the annotation of noncoding mutations in cardiac-specific DNA regulatory regions that play a substantial role in maintaining the tissue- and cell-specific transcriptional programs in the heart. This article is categorized under: Physiology > Mammalian Physiology in Health and Disease; Laboratory Methods and Technologies > Genetic/Genomic Methods; Laboratory Methods and Technologies > RNA Methods.

Redundant regulation. Nat Ecol Evol 2018;2:418-419. [PMID: 29379186 DOI: 10.1038/s41559-018-0479-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Dynamic evolution of regulatory element ensembles in primate CD4⁺ T cells. Nat Ecol Evol 2018;2:537-548. [PMID: 29379187 DOI: 10.1038/s41559-017-0447-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Accepted: 12/08/2017] [Indexed: 12/12/2022]

Li X, Kim Y, Tsang EK, Davis JR, Damani FN, Chiang C, Hess GT, Zappala Z, Strober BJ, Scott AJ, Li A, Ganna A, Bassik MC, Merker JD, Hall IM, Battle A, Montgomery SB. The impact of rare variation on gene expression across tissues. Nature 2017;550:239-243. [PMID: 29022581 PMCID: PMC5877409 DOI: 10.1038/nature24267] [Citation(s) in RCA: 159] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Accepted: 09/13/2017] [Indexed: 12/24/2022]

Affiliation(s)

Xin Li Department of Pathology, Stanford University, Stanford, California 94305, USA
Yungil Kim Department of Computer Science, Johns Hopkins University, Baltimore 21218, Maryland, USA
Emily K Tsang Department of Pathology, Stanford University, Stanford, California 94305, USA; Biomedical Informatics Program, Stanford University, Stanford, California 94305, USA
Joe R Davis Department of Pathology, Stanford University, Stanford, California 94305, USA; Department of Genetics, Stanford University, Stanford, California 94305, USA
Farhan N Damani Department of Computer Science, Johns Hopkins University, Baltimore 21218, Maryland, USA
Colby Chiang McDonnell Genome Institute, Washington University School of Medicine, St Louis, Missouri 63108, USA
Gaelen T Hess Department of Genetics, Stanford University, Stanford, California 94305, USA
Zachary Zappala Department of Pathology, Stanford University, Stanford, California 94305, USA; Department of Genetics, Stanford University, Stanford, California 94305, USA
Benjamin J Strober Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, USA
Alexandra J Scott McDonnell Genome Institute, Washington University School of Medicine, St Louis, Missouri 63108, USA
Amy Li Department of Genetics, Stanford University, Stanford, California 94305, USA
Andrea Ganna Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
Michael C Bassik Department of Genetics, Stanford University, Stanford, California 94305, USA
Jason D Merker Department of Pathology, Stanford University, Stanford, California 94305, USA
Ira M Hall McDonnell Genome Institute, Washington University School of Medicine, St Louis, Missouri 63108, USA; Department of Medicine, Washington University School of Medicine, St Louis, Missouri 63110, USA; Department of Genetics, Washington University School of Medicine, St Louis, Missouri 63110, USA
Alexis Battle Department of Computer Science, Johns Hopkins University, Baltimore 21218, Maryland, USA
Stephen B Montgomery Department of Pathology, Stanford University, Stanford, California 94305, USA; Department of Genetics, Stanford University, Stanford, California 94305, USA

Gursky VV, Kozlov KN, Kulakovskiy IV, Zubair A, Marjoram P, Lawrie DS, Nuzhdin SV, Samsonova MG. Translating natural genetic variation to gene expression in a computational model of the Drosophila gap gene regulatory network. PLoS One 2017;12:e0184657. [PMID: 28898266 PMCID: PMC5595321 DOI: 10.1371/journal.pone.0184657] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 08/28/2017] [Indexed: 11/18/2022] Open

Abstract

Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects.

Kober KM, Pogson GH. Genome-wide signals of positive selection in strongylocentrotid sea urchins. BMC Genomics 2017;18:555. [PMID: 28732465 PMCID: PMC5521101 DOI: 10.1186/s12864-017-3944-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Accepted: 07/13/2017] [Indexed: 12/21/2022] Open

Abstract

Background

Comparative genomics studies investigating the signals of positive selection among groups of closely related species are still rare and limited in taxonomic breadth. Such studies show great promise in advancing our knowledge about the proportion and the identity of genes experiencing diversifying selection. However, methodological challenges have led to high levels of false positives in past studies. Here, we use the well-annotated genome of the purple sea urchin, Strongylocentrotus purpuratus, as a reference to investigate the signals of positive selection at 6520 single-copy orthologs from nine sea urchin species belonging to the family Strongylocentrotidae paying careful attention to minimizing false positives.

Results

We identified 1008 (15.5%) candidate positive selection genes (PSGs). Tests for positive selection along the nine terminal branches of the phylogeny identified 824 genes that showed lineage-specific adaptive diversification (1.67% of branch-sites tests performed). Positively selected codons were not enriched at exon borders or near regions containing missing data, suggesting a limited contribution of false positives caused by alignment or annotation errors. Alignments were validated at 10 loci with re-sequencing using Sanger methods. No differences were observed in the rates of synonymous substitution (d_S), GC content, and codon bias between the candidate PSGs and those not showing positive selection. However, the candidate PSGs had 68% higher rates of nonsynonymous substitution (d_N) and 33% lower levels of heterozygosity, consistent with selective sweeps and opposite to that expected by a relaxation of selective constraint. Although positive selection was identified at reproductive proteins and innate immunity genes, the strongest signals of adaptive diversification were observed at extracellular matrix proteins, cell adhesion molecules, membrane receptors, and ion channels. Many candidate PSGs have been widely implicated as targets of pathogen binding, inactivation, mimicry, or exploitation in other groups (notably mammals).

Conclusions

Our study confirmed the widespread action of positive selection across sea urchin genomes and allowed us to reject the possibility that annotation and alignment errors (including paralogs) were responsible for creating false signals of adaptive molecular divergence. The candidate PSGs identified in our study represent promising targets for future research into the selective agents responsible for their adaptive diversification and their contribution to speciation.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-017-3944-7) contains supplementary material, which is available to authorized users.

Li A, Hooli B, Mullin K, Tate RE, Bubnys A, Kirchner R, Chapman B, Hofmann O, Hide W, Tanzi RE. Silencing of the Drosophila ortholog of SOX5 leads to abnormal neuronal development and behavioral impairment. Hum Mol Genet 2017;26:1472-1482. [PMID: 28186563 DOI: 10.1093/hmg/ddx051] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 02/07/2017] [Indexed: 01/27/2023] Open

Affiliation(s)

Airong Li Genetics and Aging Research Unit, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, MassGeneral Institute for Neurodegenerative Diseases, Charlestown, MA 02129, USA
Basavaraj Hooli Genetics and Aging Research Unit, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, MassGeneral Institute for Neurodegenerative Diseases, Charlestown, MA 02129, USA
Kristina Mullin Genetics and Aging Research Unit, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, MassGeneral Institute for Neurodegenerative Diseases, Charlestown, MA 02129, USA
Rebecca E Tate Genetics and Aging Research Unit, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, MassGeneral Institute for Neurodegenerative Diseases, Charlestown, MA 02129, USA
Adele Bubnys Genetics and Aging Research Unit, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, MassGeneral Institute for Neurodegenerative Diseases, Charlestown, MA 02129, USA
Rory Kirchner Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA
Brad Chapman Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA
Oliver Hofmann Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Center for Cancer Research, University of Melbourne, Melbourne 3000, Australia
Winston Hide Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Department of Neuroscience, Sheffield Institute for Translational Neuroscience, University of Sheffield, Sheffield S10 2HQ, UK
Rudolph E Tanzi Genetics and Aging Research Unit, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, MassGeneral Institute for Neurodegenerative Diseases, Charlestown, MA 02129, USA

Savisaar R, Hurst LD. Estimating the prevalence of functional exonic splice regulatory information. Hum Genet 2017;136:1059-1078. [PMID: 28405812 PMCID: PMC5602102 DOI: 10.1007/s00439-017-1798-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Received: 01/26/2017] [Accepted: 04/04/2017] [Indexed: 12/14/2022]

Huang YF, Gulko B, Siepel A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat Genet 2017;49:618-624. [PMID: 28288115 PMCID: PMC5395419 DOI: 10.1038/ng.3810] [Citation(s) in RCA: 221] [Impact Index Per Article: 31.6] [Received: 08/15/2016] [Accepted: 02/13/2017] [Indexed: 12/17/2022]

Evolution of Brain Active Gene Promoters in Human Lineage Towards the Increased Plasticity of Gene Regulation. Mol Neurobiol 2017;55:1871-1904. [PMID: 28233272 DOI: 10.1007/s12035-017-0427-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 10/10/2016] [Accepted: 01/26/2017] [Indexed: 01/31/2023]

Schor IE, Degner JF, Harnett D, Cannavò E, Casale FP, Shim H, Garfield DA, Birney E, Stephens M, Stegle O, Furlong EEM. Promoter shape varies across populations and affects promoter evolution and expression noise. Nat Genet 2017;49:550-558. [PMID: 28191888 DOI: 10.1038/ng.3791] [Citation(s) in RCA: 58] [Impact Index Per Article: 8.3] [Received: 09/14/2016] [Accepted: 01/20/2017] [Indexed: 12/29/2022]