1
|
Crossley ER, Fedorova L, Mulyar O, Freeman R, Khuder S, Fedorov A. Computational identification of ultra-conserved elements in the human genome: a hypothesis on homologous DNA pairing. NAR Genom Bioinform 2024; 6:lqae074. [PMID: 38962254 PMCID: PMC11217675 DOI: 10.1093/nargab/lqae074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/29/2024] [Accepted: 06/19/2024] [Indexed: 07/05/2024] Open
Abstract
Thousands of prolonged sequences of human ultra-conserved non-coding elements (UCNEs) share only one common feature: peculiarities in the unique composition of their dinucleotides. Here we investigate whether the numerous weak signals emanating from these dinucleotide arrangements can be used for computational identification of UCNEs within the human genome. For this purpose, we analyzed 4272 UCNE sequences, encompassing 1 393 448 nucleotides, alongside equally sized control samples of randomly selected human genomic sequences. Our research identified nine different features of dinucleotide arrangements that enable differentiation of UCNEs from the rest of the genome. We employed these nine features, implementing three Machine Learning techniques - Support Vector Machine, Random Forest, and Artificial Neural Networks - to classify UCNEs, achieving an accuracy rate of 82-84%, with specific conditions allowing for over 90% accuracy. Notably, the strongest feature for UCNE identification was the frequency ratio between GpC dinucleotides and the sum of GpG and CpC dinucleotides. Additionally, we investigated the entire pool of 31 046 SNPs located within UCNEs for their representation in the ClinVar database, which catalogs human SNPs with known phenotypic effects. The presence of UCNE-associated SNPs in ClinVar aligns with the expectation of a random distribution, emphasizing the enigmatic nature of UCNE phenotypic manifestation.
Collapse
Affiliation(s)
- Emily R Crossley
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA
| | | | | | | | - Sadik Khuder
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA
| | - Alexei Fedorov
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA
- CRI Genetics LLC, Santa Monica, CA 90404, USA
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA
| |
Collapse
|
2
|
Balard A, Baltazar-Soares M, Eizaguirre C, Heckwolf MJ. An epigenetic toolbox for conservation biologists. Evol Appl 2024; 17:e13699. [PMID: 38832081 PMCID: PMC11146150 DOI: 10.1111/eva.13699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 04/22/2024] [Accepted: 04/24/2024] [Indexed: 06/05/2024] Open
Abstract
Ongoing climatic shifts and increasing anthropogenic pressures demand an efficient delineation of conservation units and accurate predictions of populations' resilience and adaptive potential. Molecular tools involving DNA sequencing are nowadays routinely used for these purposes. Yet, most of the existing tools focusing on sequence-level information have shortcomings in detecting signals of short-term ecological relevance. Epigenetic modifications carry valuable information to better link individuals, populations, and species to their environment. Here, we discuss a series of epigenetic monitoring tools that can be directly applied to various conservation contexts, complementing already existing molecular monitoring frameworks. Focusing on DNA sequence-based methods (e.g. DNA methylation, for which the applications are readily available), we demonstrate how (a) the identification of epi-biomarkers associated with age or infection can facilitate the determination of an individual's health status in wild populations; (b) whole epigenome analyses can identify signatures of selection linked to environmental conditions and facilitate estimating the adaptive potential of populations; and (c) epi-eDNA (epigenetic environmental DNA), an epigenetic-based conservation tool, presents a non-invasive sampling method to monitor biological information beyond the mere presence of individuals. Overall, our framework refines conservation strategies, ensuring a comprehensive understanding of species' adaptive potential and persistence on ecologically relevant timescales.
Collapse
Affiliation(s)
- Alice Balard
- School of Biological and Behavioural Sciences Queen Mary University of London London UK
| | | | - Christophe Eizaguirre
- School of Biological and Behavioural Sciences Queen Mary University of London London UK
| | - Melanie J Heckwolf
- Department of Ecology Leibniz Centre for Tropical Marine Research Bremen Germany
| |
Collapse
|
3
|
Paired CRISPR screening libraries for studying the function of the non-coding genome at scale. Nat Biomed Eng 2024:10.1038/s41551-024-01215-5. [PMID: 38822173 DOI: 10.1038/s41551-024-01215-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2024]
|
4
|
Li Y, Tan M, Akkari-Henić A, Zhang L, Kip M, Sun S, Sepers JJ, Xu N, Ariyurek Y, Kloet SL, Davis RP, Mikkers H, Gruber JJ, Snyder MP, Li X, Pang B. Genome-wide Cas9-mediated screening of essential non-coding regulatory elements via libraries of paired single-guide RNAs. Nat Biomed Eng 2024:10.1038/s41551-024-01204-8. [PMID: 38778183 DOI: 10.1038/s41551-024-01204-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 03/27/2024] [Indexed: 05/25/2024]
Abstract
The functions of non-coding regulatory elements (NCREs), which constitute a major fraction of the human genome, have not been systematically studied. Here we report a method involving libraries of paired single-guide RNAs targeting both ends of an NCRE as a screening system for the Cas9-mediated deletion of thousands of NCREs genome-wide to study their functions in distinct biological contexts. By using K562 and 293T cell lines and human embryonic stem cells, we show that NCREs can have redundant functions, and that many ultra-conserved elements have silencer activity and play essential roles in cell growth and in cellular responses to drugs (notably, the ultra-conserved element PAX6_Tarzan may be critical for heart development, as removing it from human embryonic stem cells led to defects in cardiomyocyte differentiation). The high-throughput screen, which is compatible with single-cell sequencing, may allow for the identification of druggable NCREs.
Collapse
Affiliation(s)
- Yufeng Li
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Minkang Tan
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Almira Akkari-Henić
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Limin Zhang
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Maarten Kip
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Shengnan Sun
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Jorian J Sepers
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Ningning Xu
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Yavuz Ariyurek
- Leiden Genome Technology Center, Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Susan L Kloet
- Leiden Genome Technology Center, Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Richard P Davis
- Department of Anatomy and Embryology, The Novo Nordisk Foundation Center for Stem Cell Medicine (reNEW), Leiden University Medical Center, Leiden, the Netherlands
| | - Harald Mikkers
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Joshua J Gruber
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | | | - Xiao Li
- Department of Biochemistry, The Center for RNA Science and Therapeutics, Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH, USA.
| | - Baoxu Pang
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands.
| |
Collapse
|
5
|
Woronowicz KC, Esin EV, Markevich GN, Martinez CS, McMenamin SK, Daane JM, Harris MP, Shkil FN. Phylogenomic analysis of the Lake Kronotskoe species flock of Dolly Varden charr reveals genetic and developmental signatures of sympatric radiation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.24.529919. [PMID: 38712299 PMCID: PMC11071292 DOI: 10.1101/2023.02.24.529919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Recent adaptive radiations provide evolutionary case studies, which provide the context to parse the relationship between genomic variation and the origins of distinct phenotypes. Sympatric radiations of the charr complex (genus Salvelinus) present a trove for phylogenetics as charrs have repeatedly diversified into multiple morphs with distinct feeding specializations. However, species flocks normally comprise only two to three lineages. Dolly Varden charr inhabiting Lake Kronotske represent the most extensive radiation described for the charr genus, containing at least seven lineages, each with defining morphological and ecological traits. Here, we perform the first genome-wide analysis of this species flock to parse the foundations of adaptive change. Our data support distinct, reproductively isolated lineages with little evidence of hybridization. We also find that specific selection on thyroid signaling and craniofacial genes forms a genomic basis for the radiation. Thyroid hormone is further implicated in subsequent lineage partitioning events. These results delineate a clear genetic basis for the diversification of specialized lineages, and highlight the role of developmental mechanisms in shaping the forms generated during adaptive radiation.
Collapse
Affiliation(s)
- Katherine C Woronowicz
- Department of Orthopaedics, Boston Children's Hospital and Department of Genetics, Harvard Medical School, Boston, MA 02115
| | - Evgeny V Esin
- AN Severtsov Institute of Ecology and Evolution, RAS; Leninskiy-33, 119071 Moscow, Russian Federation
| | - Grigorii N Markevich
- Kronotsky Nature Biosphere Reserve; Ryabikova-48, 68400 Yelizovo, Russian Federation
| | | | | | - Jacob M Daane
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204
| | - Matthew P Harris
- Department of Orthopaedics, Boston Children's Hospital and Department of Genetics, Harvard Medical School, Boston, MA 02115
| | - Fedor N Shkil
- AN Severtsov Institute of Ecology and Evolution, RAS; Leninskiy-33, 119071 Moscow, Russian Federation
- NK Koltzov Institute of Developmental Biology, RAS; Vavilova-26, 119334 Moscow, Russian Federation
| |
Collapse
|
6
|
Lopez Soriano V, Dueñas Rey A, Mukherjee R, Coppieters F, Bauwens M, Willaert A, De Baere E. Multi-omics analysis in human retina uncovers ultraconserved cis-regulatory elements at rare eye disease loci. Nat Commun 2024; 15:1600. [PMID: 38383453 PMCID: PMC10881467 DOI: 10.1038/s41467-024-45381-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Accepted: 01/19/2024] [Indexed: 02/23/2024] Open
Abstract
Cross-species genome comparisons have revealed a substantial number of ultraconserved non-coding elements (UCNEs). Several of these elements have proved to be essential tissue- and cell type-specific cis-regulators of developmental gene expression. Here, we characterize a set of UCNEs as candidate CREs (cCREs) during retinal development and evaluate the contribution of their genomic variation to rare eye diseases, for which pathogenic non-coding variants are emerging. Integration of bulk and single-cell retinal multi-omics data reveals 594 genes under potential cis-regulatory control of UCNEs, of which 45 are implicated in rare eye disease. Mining of candidate cis-regulatory UCNEs in WGS data derived from the rare eye disease cohort of Genomics England reveals 178 ultrarare variants within 84 UCNEs associated with 29 disease genes. Overall, we provide a comprehensive annotation of ultraconserved non-coding regions acting as cCREs during retinal development which can be targets of non-coding variation underlying rare eye diseases.
Collapse
Affiliation(s)
- Victor Lopez Soriano
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | - Alfredo Dueñas Rey
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | | | - Frauke Coppieters
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
- Department of Pharmaceutics, Ghent University, Ghent, Belgium
| | - Miriam Bauwens
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | - Andy Willaert
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | - Elfride De Baere
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium.
| |
Collapse
|
7
|
Zimmermann B, Montenegro JD, Robb SMC, Fropf WJ, Weilguny L, He S, Chen S, Lovegrove-Walsh J, Hill EM, Chen CY, Ragkousi K, Praher D, Fredman D, Schultz D, Moran Y, Simakov O, Genikhovich G, Gibson MC, Technau U. Topological structures and syntenic conservation in sea anemone genomes. Nat Commun 2023; 14:8270. [PMID: 38092765 PMCID: PMC10719294 DOI: 10.1038/s41467-023-44080-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 11/29/2023] [Indexed: 12/17/2023] Open
Abstract
There is currently little information about the evolution of gene clusters, genome architectures and karyotypes in early branching animals. Slowly evolving anthozoan cnidarians can be particularly informative about the evolution of these genome features. Here we report chromosome-level genome assemblies of two related anthozoans, the sea anemones Nematostella vectensis and Scolanthus callimorphus. We find a robust set of 15 chromosomes with a clear one-to-one correspondence between the two species. Both genomes show chromosomal conservation, allowing us to reconstruct ancestral cnidarian and metazoan chromosomal blocks, consisting of at least 19 and 16 ancestral linkage groups, respectively. We show that, in contrast to Bilateria, the Hox and NK clusters of investigated cnidarians are largely disintegrated, despite the presence of staggered hox/gbx expression in Nematostella. This loss of microsynteny conservation may be facilitated by shorter distances between cis-regulatory sequences and their cognate transcriptional start sites. We find no clear evidence for topologically associated domains, suggesting fundamental differences in long-range gene regulation compared to vertebrates. These data suggest that large sets of ancestral metazoan genes have been retained in ancestral linkage groups of some extant lineages; yet, higher order gene regulation with associated 3D architecture may have evolved only after the cnidarian-bilaterian split.
Collapse
Affiliation(s)
- Bob Zimmermann
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
- Research platform SinCeReSt, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Juan D Montenegro
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
- Research platform SinCeReSt, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Sofia M C Robb
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA
| | - Whitney J Fropf
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA
| | - Lukas Weilguny
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Shuonan He
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA
| | - Shiyuan Chen
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA
| | - Jessica Lovegrove-Walsh
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Eric M Hill
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA
| | - Cheng-Yi Chen
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA
| | - Katerina Ragkousi
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA
- Department of Biology, Amherst College, Amherst, MA, 01002, USA
| | - Daniela Praher
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - David Fredman
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Darrin Schultz
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Yehu Moran
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
- The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Jerusalem, 9190401, Israel
| | - Oleg Simakov
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
- Research platform SinCeReSt, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Grigory Genikhovich
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria
| | - Matthew C Gibson
- Stowers Institute for Medical Research, Kansas City, MO, 64110, USA.
| | - Ulrich Technau
- Department of Neurosciences and Developmental Biology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria.
- Research platform SinCeReSt, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria.
- Max Perutz laboratories, University of Vienna, Dr. Bohrgasse 5, 1030, Vienna, Austria.
| |
Collapse
|
8
|
Fedorova L, Crossley ER, Mulyar OA, Qiu S, Freeman R, Fedorov A. Profound Non-Randomness in Dinucleotide Arrangements within Ultra-Conserved Non-Coding Elements and the Human Genome. BIOLOGY 2023; 12:1125. [PMID: 37627009 PMCID: PMC10452674 DOI: 10.3390/biology12081125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 08/09/2023] [Accepted: 08/11/2023] [Indexed: 08/27/2023]
Abstract
Long human ultra-conserved non-coding elements (UCNEs) do not have any sequence similarity to each other or other characteristics that make them unalterable during vertebrate evolution. We hypothesized that UCNEs have unique dinucleotide (DN) composition and arrangements compared to the rest of the genome. A total of 4272 human UCNE sequences were analyzed computationally and compared with the whole genomes of human, chicken, zebrafish, and fly. Statistical analysis was performed to assess the non-randomness in DN spacing arrangements within the entire human genome and within UCNEs. Significant non-randomness in DN spacing arrangements was observed in the entire human genome. Additionally, UCNEs exhibited distinct patterns in DN arrangements compared to the rest of the genome. Approximately 83% of all DN pairs within UCNEs showed significant (>10%) non-random genomic arrangements at short distances (2-6 nucleotides) relative to each other. At the extremes, non-randomness in DN spacing distances deviated up to 40% from expected values and were frequently associated with GpC, CpG, ApT, and GpG/CpC dinucleotides. The described peculiarities in DN arrangements have persisted for hundreds of millions of years in vertebrates. These distinctive patterns may suggest that UCNEs have specific DNA conformations.
Collapse
Affiliation(s)
- Larisa Fedorova
- CRI Genetics LLC, Santa Monica, CA 90404, USA; (L.F.); (O.A.M.); (R.F.)
| | - Emily R. Crossley
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA;
| | - Oleh A. Mulyar
- CRI Genetics LLC, Santa Monica, CA 90404, USA; (L.F.); (O.A.M.); (R.F.)
| | - Shuhao Qiu
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA;
| | - Ryan Freeman
- CRI Genetics LLC, Santa Monica, CA 90404, USA; (L.F.); (O.A.M.); (R.F.)
| | - Alexei Fedorov
- CRI Genetics LLC, Santa Monica, CA 90404, USA; (L.F.); (O.A.M.); (R.F.)
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA;
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA;
| |
Collapse
|
9
|
Qi XG, Wu J, Zhao L, Wang L, Guang X, Garber PA, Opie C, Yuan Y, Diao R, Li G, Wang K, Pan R, Ji W, Sun H, Huang ZP, Xu C, Witarto AB, Jia R, Zhang C, Deng C, Qiu Q, Zhang G, Grueter CC, Wu D, Li B. Adaptations to a cold climate promoted social evolution in Asian colobine primates. Science 2023; 380:eabl8621. [PMID: 37262163 DOI: 10.1126/science.abl8621] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 07/06/2022] [Indexed: 06/03/2023]
Abstract
The biological mechanisms that underpin primate social evolution remain poorly understood. Asian colobines display a range of social organizations, which makes them good models for investigating social evolution. By integrating ecological, geological, fossil, behavioral, and genomic analyses, we found that colobine primates that inhabit colder environments tend to live in larger, more complex groups. Specifically, glacial periods during the past 6 million years promoted the selection of genes involved in cold-related energy metabolism and neurohormonal regulation. More-efficient dopamine and oxytocin pathways developed in odd-nosed monkeys, which may have favored the prolongation of maternal care and lactation, increasing infant survival in cold environments. These adaptive changes appear to have strengthened interindividual affiliation, increased male-male tolerance, and facilitated the stepwise aggregation from independent one-male groups to large multilevel societies.
Collapse
Affiliation(s)
- Xiao-Guang Qi
- College of Life Sciences, Northwest University, Xi'an, China
| | - Jinwei Wu
- College of Life Sciences, Northwest University, Xi'an, China
| | - Lan Zhao
- College of Life Sciences, Northwest University, Xi'an, China
| | - Lu Wang
- College of Life Sciences, Northwest University, Xi'an, China
| | | | - Paul A Garber
- Department of Anthropology, University of Illinois, Urbana, IL, USA
| | - Christopher Opie
- Department of Anthropology and Archaeology, University of Bristol, Bristol, UK
| | - Yuan Yuan
- College of Ecological and Environmental Sciences, Northwestern Polytechnical University, Xi'an, China
| | - Runjie Diao
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Gang Li
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
| | - Kun Wang
- College of Ecological and Environmental Sciences, Northwestern Polytechnical University, Xi'an, China
| | - Ruliang Pan
- College of Life Sciences, Northwest University, Xi'an, China
| | - Weihong Ji
- School of Natural and Computational Sciences, Massey University, Auckland, New Zealand
| | | | - Zhi-Pang Huang
- College of Life Sciences, Northwest University, Xi'an, China
| | - Chunzhong Xu
- Shanghai Wild Animal Park Development Co., Shanghai, China
| | - Arief B Witarto
- Faculty of Medicine, Universitas Pertahanan, Jabodetabek, Indonesia
| | - Rui Jia
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
| | | | - Cheng Deng
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Qiang Qiu
- College of Ecological and Environmental Sciences, Northwestern Polytechnical University, Xi'an, China
| | - Guojie Zhang
- Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - Cyril C Grueter
- School of Human Sciences, The University of Western Australia, Perth, WA, Australia
| | - Dongdong Wu
- Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - Baoguo Li
- College of Life Sciences, Northwest University, Xi'an, China
| |
Collapse
|
10
|
Treaster S, Deelen J, Daane JM, Murabito J, Karasik D, Harris MP. Convergent genomics of longevity in rockfishes highlights the genetics of human life span variation. SCIENCE ADVANCES 2023; 9:eadd2743. [PMID: 36630509 PMCID: PMC9833670 DOI: 10.1126/sciadv.add2743] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 12/09/2022] [Indexed: 05/16/2023]
Abstract
Longevity is a defining, heritable trait that varies dramatically between species. To resolve the genetic regulation of this trait, we have mined genomic variation in rockfishes, which range in longevity from 11 to over 205 years. Multiple shifts in rockfish longevity have occurred independently and in a short evolutionary time frame, thus empowering convergence analyses. Our analyses reveal a common network of genes under convergent evolution, encompassing established aging regulators such as insulin signaling, yet also identify flavonoid (aryl-hydrocarbon) metabolism as a pathway modulating longevity. The selective pressures on these pathways indicate the ancestral state of rockfishes was long lived and that the changes in short-lived lineages are adaptive. These pathways were also used to explore genome-wide association studies of human longevity, identifying the aryl-hydrocarbon metabolism pathway to be significantly associated with human survival to the 99th percentile. This evolutionary intersection defines and cross-validates a previously unappreciated genetic architecture that associates with the evolution of longevity across vertebrates.
Collapse
Affiliation(s)
- Stephen Treaster
- Department of Orthopaedic Surgery, Boston Children’s Hospital, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Joris Deelen
- Max Planck Institute for Biology of Ageing, Joseph-Stelzmann-Str. 9b, D-50931 Köln, Germany
- Molecular Epidemiology, Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
- Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne, Cologne, Germany
| | - Jacob M. Daane
- Department of Biology and Biochemistry, University of Houston, Houston TX, USA
| | - Joanne Murabito
- Section of General Internal Medicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
- Framingham Heart Study, Framingham, MA, USA
| | - David Karasik
- Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel
- Marcus Institute for Aging Research, Hebrew Senior Life, Boston, MA, USA
| | - Matthew P. Harris
- Department of Orthopaedic Surgery, Boston Children’s Hospital, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
11
|
Fedorova L, Mulyar OA, Lim J, Fedorov A. Nucleotide Composition of Ultra-Conserved Elements Shows Excess of GpC and Depletion of GG and CC Dinucleotides. Genes (Basel) 2022; 13:2053. [PMID: 36360290 PMCID: PMC9690913 DOI: 10.3390/genes13112053] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 10/25/2022] [Accepted: 11/03/2022] [Indexed: 08/27/2023] Open
Abstract
The public UCNEbase database, comprising 4273 human ultra-conserved noncoding elements (UCNEs), was thoroughly investigated with the aim to find any nucleotide signals or motifs that have made these DNA sequences practically unchanged over three hundred million years of evolution. Each UCNE comprises over 200 nucleotides and has at least 95% identity between humans and chickens. A total of 31,046 SNPs were found within the UCNE database. We demonstrated that every human has over 300 mutations within 4273 UCNEs. No association of UCNEs with non-coding RNAs, nor preference of a particular meiotic recombination rate within them were found. No sequence motifs associated with UCNEs nor their flanking regions have been found. However, we demonstrated that UCNEs have strong nucleotide and dinucleotide sequence abnormalities compared to genome averages. Specifically, UCNEs are depleted for CC and GG dinucleotides, while GC dinucleotides are in excess of 28%. Importantly, GC dinucleotides have extraordinarily strong stacking free-energy inside the DNA helix and unique resistance to dissociation. Based on the adjacent nucleotide stacking abnormalities within UCNEs, we conjecture that peculiarities in dinucleotide distribution within UCNEs may create unique 3D conformation and specificity to bind proteins. We also discuss the strange dynamics of multiple SNPs inside UCNEs and reasons why these sequences are extraordinarily conserved.
Collapse
Affiliation(s)
| | | | - Jan Lim
- CRI Genetics LLC, Santa Monica, CA 90404, USA
| | - Alexei Fedorov
- CRI Genetics LLC, Santa Monica, CA 90404, USA
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA
| |
Collapse
|
12
|
Multi-omics approach dissects cis-regulatory mechanisms underlying North Carolina macular dystrophy, a retinal enhanceropathy. Am J Hum Genet 2022; 109:2029-2048. [PMID: 36243009 PMCID: PMC9674966 DOI: 10.1016/j.ajhg.2022.09.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 09/28/2022] [Indexed: 01/26/2023] Open
Abstract
North Carolina macular dystrophy (NCMD) is a rare autosomal-dominant disease affecting macular development. The disease is caused by non-coding single-nucleotide variants (SNVs) in two hotspot regions near PRDM13 and by duplications in two distinct chromosomal loci, overlapping DNase I hypersensitive sites near either PRDM13 or IRX1. To unravel the mechanisms by which these variants cause disease, we first established a genome-wide multi-omics retinal database, RegRet. Integration of UMI-4C profiles we generated on adult human retina then allowed fine-mapping of the interactions of the PRDM13 and IRX1 promoters and the identification of eighteen candidate cis-regulatory elements (cCREs), the activity of which was investigated by luciferase and Xenopus enhancer assays. Next, luciferase assays showed that the non-coding SNVs located in the two hotspot regions of PRDM13 affect cCRE activity, including two NCMD-associated non-coding SNVs that we identified herein. Interestingly, the cCRE containing one of these SNVs was shown to interact with the PRDM13 promoter, demonstrated in vivo activity in Xenopus, and is active at the developmental stage when progenitor cells of the central retina exit mitosis, suggesting that this region is a PRDM13 enhancer. Finally, mining of single-cell transcriptional data of embryonic and adult retina revealed the highest expression of PRDM13 and IRX1 when amacrine cells start to synapse with retinal ganglion cells, supporting the hypothesis that altered PRDM13 or IRX1 expression impairs interactions between these cells during retinogenesis. Overall, this study provides insight into the cis-regulatory mechanisms of NCMD and supports that this condition is a retinal enhanceropathy.
Collapse
|
13
|
Olivares‐Zambrano D, Daane J, Hyde J, Sandel MW, Aguilar A. Speciation genomics and the role of depth in the divergence of rockfishes (
Sebastes
) revealed through Pool‐seq analysis of enriched sequences. Ecol Evol 2022; 12:e9341. [PMID: 36188524 PMCID: PMC9502067 DOI: 10.1002/ece3.9341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 08/22/2022] [Accepted: 08/30/2022] [Indexed: 11/07/2022] Open
Abstract
Speciation in the marine environment is challenged by the wide geographic distribution of many taxa and potential for high rates of gene flow through larval dispersal mechanisms. Depth has recently been proposed as a potential driver of ecological divergence in fishes, and yet it is unclear how adaptation along these gradients' shapes genomic divergence. The genus Sebastes contains numerous species pairs that are depth‐segregated and can provide a better understanding of the mode and tempo of genomic diversification. Here, we present exome data on two species pairs of rockfishes that are depth‐segregated and have different degrees of divergence: S. chlorostictus–S. rosenblatti and S. crocotulus–S. miniatus. We were able to reliably identify “islands of divergence” in the species pair with more recent divergence (S. chlorostictus–S. rosenblatti) and discovered a number of genes associated with neurosensory function, suggesting a role for this pathway in the early speciation process. We also reconstructed demographic histories of divergence and found the best supported model was isolation followed by asymmetric secondary contact for both species pairs. These results suggest past ecological/geographic isolation followed by asymmetric secondary contact of deep to shallow species. Our results provide another example of using rockfish as a model for studying speciation and support the role of depth as an important mechanism for diversification in the marine environment.
Collapse
Affiliation(s)
- Daniel Olivares‐Zambrano
- Department of Biological SciencesCalifornia State University Los AngelesLos AngelesCaliforniaUSA
- Present address:
Department of Marine and Environmental BiologyUniversity of Southern CaliforniaLos AngelesCaliforniaUSA
| | - Jacob Daane
- Department of Biology and BiochemistryUniversity of HoustonHoustonTexasUSA
| | - John Hyde
- National Oceanic and Atmospheric Administration, National Marine Fisheries ServiceNational Marine Fisheries ServiceSouthwest Fisheries Science CenterLa JollaCaliforniaUSA
| | - Michael W. Sandel
- Biological and Environmental SciencesUniversity of West AlabamaLivingstonAlabamaUSA
- Department of WIldlifeFisheries, and Aquaculture, Mississippi State UniversityMississippi StateMississippiUSA
| | - Andres Aguilar
- Department of Biological SciencesCalifornia State University Los AngelesLos AngelesCaliforniaUSA
| |
Collapse
|
14
|
Extreme purifying selection against point mutations in the human genome. Nat Commun 2022; 13:4312. [PMID: 35879308 PMCID: PMC9314448 DOI: 10.1038/s41467-022-31872-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 07/07/2022] [Indexed: 12/13/2022] Open
Abstract
Large-scale genome sequencing has enabled the measurement of strong purifying selection in protein-coding genes. Here we describe a new method, called ExtRaINSIGHT, for measuring such selection in noncoding as well as coding regions of the human genome. ExtRaINSIGHT estimates the prevalence of “ultraselection” by the fractional depletion of rare single-nucleotide variants, after controlling for variation in mutation rates. Applying ExtRaINSIGHT to 71,702 whole genome sequences from gnomAD v3, we find abundant ultraselection in evolutionarily ancient miRNAs and neuronal protein-coding genes, as well as at splice sites. By contrast, we find much less ultraselection in other noncoding RNAs and transcription factor binding sites, and only modest levels in ultraconserved elements. We estimate that ~0.4–0.7% of the human genome is ultraselected, implying ~ 0.26–0.51 strongly deleterious mutations per generation. Overall, our study sheds new light on the genome-wide distribution of fitness effects by combining deep sequencing data and classical theory from population genetics. Previous work has investigated selection in the coding genome, but it is not as well characterized in the non-coding genome. By analyzing rare variants in 70k genome sequences from gnomAD, the authors detect very strong purifying selection ("ultraselection”) across the human genome, finding it in some microRNAs and coding sequences but generally rare in regulatory sequences.
Collapse
|
15
|
The genomic origins of the world's first farmers. Cell 2022; 185:1842-1859.e18. [PMID: 35561686 PMCID: PMC9166250 DOI: 10.1016/j.cell.2022.04.008] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 03/04/2022] [Accepted: 04/06/2022] [Indexed: 11/24/2022]
Abstract
The precise genetic origins of the first Neolithic farming populations in Europe and Southwest Asia, as well as the processes and the timing of their differentiation, remain largely unknown. Demogenomic modeling of high-quality ancient genomes reveals that the early farmers of Anatolia and Europe emerged from a multiphase mixing of a Southwest Asian population with a strongly bottlenecked western hunter-gatherer population after the last glacial maximum. Moreover, the ancestors of the first farmers of Europe and Anatolia went through a period of extreme genetic drift during their westward range expansion, contributing highly to their genetic distinctiveness. This modeling elucidates the demographic processes at the root of the Neolithic transition and leads to a spatial interpretation of the population history of Southwest Asia and Europe during the late Pleistocene and early Holocene.
Collapse
|
16
|
Chronic Exposure to the Food Additive tBHQ Modulates Expression of Genes Related to SARS-CoV-2 and Influenza Viruses. Life (Basel) 2022; 12:life12050642. [PMID: 35629310 PMCID: PMC9147452 DOI: 10.3390/life12050642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 04/20/2022] [Accepted: 04/23/2022] [Indexed: 11/16/2022] Open
Abstract
Background. tert-butylhydroquinone (tBHQ) is an antioxidant commonly used as a food additive. Studies suggest that tBHQ could modulate immune responses to influenza and SARS-CoV-2 infection. In our transcriptomic analysis we explored the molecular mechanisms behind tBHQ’s modulatory properties and the relationships to respiratory viral infections. Methods. tBHQ was administered per os to BALB/c mice (1.5% [w/w]) for 20 days. Splenic T cells were isolated with magnetic separation and subjected to transcriptomic analysis. Gene-set enrichment analysis and g:Profiler was conducted to provide a functional interpretation of significantly changed genes. Further analysis for AHR/NRF2 binding sites was performed with GeneHancer. Results. In CD4+ cells, we found significantly altered expression of 269 genes by tBHQ. Of them, many had relevance in influenza infection such as genes responsible for virus entry (Anxa1/2, Cd14), interferon signaling (Dusp10, Tnfsf13), or prostaglandin synthesis (Ptgs1/2). In SARS-CoV-2 infections, interferon signaling (Ifitm1), proteolytic enzymes (CtsB), and also cell-surface proteins (Cd14, Cd151) were among the prominent alterations after tBHQ exposure. Of these genes, many had one or more binding sites for AHR and NRF2, two major xenosensors triggered by tBHQ. Conclusions. Our results strongly suggest that a common food additive, tBHQ, can modulate virus-dependent processes in both influenza and SARS-CoV-2 infections.
Collapse
|
17
|
Ni P, Su Z. PCRMS: a database of predicted cis-regulatory modules and constituent transcription factor binding sites in genomes. Database (Oxford) 2022; 2022:6572594. [PMID: 35452518 PMCID: PMC9216522 DOI: 10.1093/database/baac024] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Revised: 02/20/2022] [Accepted: 04/12/2022] [Indexed: 01/13/2023]
Abstract
More accurate and more complete predictions of cis-regulatory modules (CRMs) and constituent transcription factor (TF) binding sites (TFBSs) in genomes can facilitate characterizing functions of regulatory sequences. Here, we developed a database predicted cis-regulatory modules (PCRMS) (https://cci-bioinfo.uncc.edu) that stores highly accurate and unprecedentedly complete maps of predicted CRMs and TFBSs in the human and mouse genomes. The web interface allows the user to browse CRMs and TFBSs in an organism, find the closest CRMs to a gene, search CRMs around a gene and find all TFBSs of a TF. PCRMS can be a useful resource for the research community to characterize regulatory genomes. Database URL: https://cci-bioinfo.uncc.edu/.
Collapse
Affiliation(s)
- Pengyu Ni
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| | - Zhengchang Su
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| |
Collapse
|
18
|
Functional Validation of Osteoporosis Genetic Findings Using Small Fish Models. Genes (Basel) 2022; 13:genes13020279. [PMID: 35205324 PMCID: PMC8872034 DOI: 10.3390/genes13020279] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 01/26/2022] [Accepted: 01/27/2022] [Indexed: 12/11/2022] Open
Abstract
The advancement of human genomics has revolutionized our understanding of the genetic architecture of many skeletal diseases, including osteoporosis. However, interpreting results from human association studies remains a challenge, since index variants often reside in non-coding regions of the genome and do not possess an obvious regulatory function. To bridge the gap between genetic association and causality, a systematic functional investigation is necessary, such as the one offered by animal models. These models enable us to identify causal mechanisms, clarify the underlying biology, and apply interventions. Over the past several decades, small teleost fishes, mostly zebrafish and medaka, have emerged as powerful systems for modeling the genetics of human diseases. Due to their amenability to genetic intervention and the highly conserved genetic and physiological features, fish have become indispensable for skeletal genomic studies. The goal of this review is to summarize the evidence supporting the utility of Zebrafish (Danio rerio) for accelerating our understanding of human skeletal genomics and outlining the remaining gaps in knowledge. We provide an overview of zebrafish skeletal morphophysiology and gene homology, shedding light on the advantages of human skeletal genomic exploration and validation. Knowledge of the biology underlying osteoporosis through animal models will lead to the translation into new, better and more effective therapeutic approaches.
Collapse
|
19
|
Lucena-Perez M, Kleinman-Ruiz D, Marmesat E, Saveljev AP, Schmidt K, Godoy JA. Bottleneck-associated changes in the genomic landscape of genetic diversity in wild lynx populations. Evol Appl 2021; 14:2664-2679. [PMID: 34815746 PMCID: PMC8591332 DOI: 10.1111/eva.13302] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 08/17/2021] [Accepted: 09/08/2021] [Indexed: 01/06/2023] Open
Abstract
Demographic bottlenecks generally reduce genetic diversity through more intense genetic drift, but their net effect may vary along the genome due to the random nature of genetic drift and to local effects of recombination, mutation, and selection. Here, we analyzed the changes in genetic diversity following a bottleneck by comparing whole-genome diversity patterns in populations with and without severe recent documented declines of Iberian (Lynx pardinus, n = 31) and Eurasian lynx (Lynx lynx, n = 29). As expected, overall genomic diversity correlated negatively with bottleneck intensity and/or duration. Correlations of genetic diversity with divergence, chromosome size, gene or functional site content, GC content, or recombination were observed in nonbottlenecked populations, but were weaker in bottlenecked populations. Also, functional features under intense purifying selection and the X chromosome showed an increase in the observed density of variants, even resulting in higher θ W diversity than in nonbottlenecked populations. Increased diversity seems to be related to both a higher mutational input in those regions creating a large collection of low-frequency variants, a few of which increase in frequency during the bottleneck to the point they become detectable with our limited sample, and the reduced efficacy of purifying selection, which affects not only protein structure and function but also the regulation of gene expression. The results of this study alert to the possible reduction of fitness and adaptive potential associated with the genomic erosion in regulatory elements. Further, the detection of a gain of diversity in ultra-conserved elements can be used as a sensitive and easy-to-apply signature of genetic erosion in wild populations.
Collapse
Affiliation(s)
- Maria Lucena-Perez
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
| | - Daniel Kleinman-Ruiz
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
- Departamento de Genética Facultad de Biología Universidad Complutense Madrid Spain
| | - Elena Marmesat
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
| | - Alexander P Saveljev
- Department of Animal Ecology Russian Research Institute of Game Management and Fur Farming Kirov Russia
| | - Krzysztof Schmidt
- Mammal Research Institute Polish Academy of Sciences Białowieża Poland
| | - José A Godoy
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
| |
Collapse
|
20
|
Modulation of bioelectric cues in the evolution of flying fishes. Curr Biol 2021; 31:5052-5061.e8. [PMID: 34534441 DOI: 10.1016/j.cub.2021.08.054] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 07/09/2021] [Accepted: 08/20/2021] [Indexed: 01/03/2023]
Abstract
Changes to allometry, or the relative proportions of organs and tissues within organisms, is a common means for adaptive character change in evolution. However, little is understood about how relative size is specified during development and shaped during evolution. Here, through a phylogenomic analysis of genome-wide variation in 35 species of flying fishes and relatives, we identify genetic signatures in both coding and regulatory regions underlying the convergent evolution of increased paired fin size and aerial gliding behaviors. To refine our analysis, we intersected convergent phylogenomic signatures with mutants with altered fin size identified in distantly related zebrafish. Through these paired approaches, we identify a surprising role for an L-type amino acid transporter, lat4a, and the potassium channel, kcnh2a, in the regulation of fin proportion. We show that interaction between these genetic loci in zebrafish closely phenocopies the observed fin proportions of flying fishes. The congruence of experimental and phylogenomic findings point to conserved, non-canonical signaling integrating bioelectric cues and amino acid transport in the establishment of relative size in development and evolution.
Collapse
|
21
|
Smits JJ, de Bruijn SE, Lanting CP, Oostrik J, O'Gorman L, Mantere T, Cremers FPM, Roosing S, Yntema HG, de Vrieze E, Derks R, Hoischen A, Pegge SAH, Neveling K, Pennings RJE, Kremer H. Exploring the missing heritability in subjects with hearing loss, enlarged vestibular aqueducts, and a single or no pathogenic SLC26A4 variant. Hum Genet 2021; 141:465-484. [PMID: 34410491 PMCID: PMC9035008 DOI: 10.1007/s00439-021-02336-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Accepted: 08/09/2021] [Indexed: 12/11/2022]
Abstract
Pathogenic variants in SLC26A4 have been associated with autosomal recessive hearing loss (arHL) and a unilateral or bilateral enlarged vestibular aqueduct (EVA). SLC26A4 is the second most frequently mutated gene in arHL. Despite the strong genotype–phenotype correlation, a significant part of cases remains genetically unresolved. In this study, we investigated a cohort of 28 Dutch index cases diagnosed with HL in combination with an EVA but without (M0) or with a single (M1) pathogenic variant in SLC26A4. To explore the missing heritability, we first determined the presence of the previously described EVA-associated haplotype (Caucasian EVA (CEVA)), characterized by 12 single nucleotide variants located upstream of SLC26A4. We found this haplotype and a delimited V1-CEVA haplotype to be significantly enriched in our M1 patient cohort (10/16 cases). The CEVA haplotype was also present in two M0 cases (2/12). Short- and long-read whole genome sequencing and optical genome mapping could not prioritize any of the variants present within the CEVA haplotype as the likely pathogenic defect. Short-read whole-genome sequencing of the six M1 cases without this haplotype and the two M0/CEVA cases only revealed previously overlooked or misinterpreted splice-altering SLC26A4 variants in two cases, who are now genetically explained. No deep-intronic or structural variants were identified in any of the M1 subjects. With this study, we have provided important insights that will pave the way for elucidating the missing heritability in M0 and M1 SLC26A4 cases. For pinpointing the pathogenic effect of the CEVA haplotype, additional analyses are required addressing defect(s) at the RNA, protein, or epigenetic level.
Collapse
Affiliation(s)
- Jeroen J Smits
- Hearing and Genes, Department of Otorhinolaryngology, Radboud University Medical Center, Nijmegen, The Netherlands.,Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Suzanne E de Bruijn
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Cornelis P Lanting
- Hearing and Genes, Department of Otorhinolaryngology, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Jaap Oostrik
- Hearing and Genes, Department of Otorhinolaryngology, Radboud University Medical Center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Luke O'Gorman
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands
| | - Tuomo Mantere
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands.,Laboratory of Cancer Genetics and Tumor Biology, Cancer and Translational Medicine Research Unit and Biocenter Oulu, University of Oulu, Oulu, Finland
| | | | - Frans P M Cremers
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Susanne Roosing
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Helger G Yntema
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands
| | - Erik de Vrieze
- Hearing and Genes, Department of Otorhinolaryngology, Radboud University Medical Center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Ronny Derks
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands
| | - Alexander Hoischen
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands.,Center for Infectious Diseases (RCI), Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands.,Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands.,Radboud Expertise Center for Immunodeficiency and Autoinflammation and Center for Infectious Disease (RCI), Radboud University Medical Center, Nijmegen, The Netherlands
| | - Sjoert A H Pegge
- Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Kornelia Neveling
- Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands
| | - Ronald J E Pennings
- Hearing and Genes, Department of Otorhinolaryngology, Radboud University Medical Center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Hannie Kremer
- Hearing and Genes, Department of Otorhinolaryngology, Radboud University Medical Center, Nijmegen, The Netherlands. .,Department of Human Genetics, Radboud University Medical Center, Internal Postal Code 855, P.O. Box 9101, 6500 HB, Nijmegen, The Netherlands. .,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands.
| |
Collapse
|
22
|
Abstract
We developed dbCNS (http://yamasati.nig.ac.jp/dbcns), a new database for conserved noncoding sequences (CNSs). CNSs exist in many eukaryotes and are assumed to be involved in protein expression control. Version 1 of dbCNS, introduced here, includes a powerful and precise CNS identification pipeline for multiple vertebrate genomes. Mutations in CNSs may induce morphological changes and cause genetic diseases. For this reason, many vertebrate CNSs have been identified, with special reference to primate genomes. We integrated ∼6.9 million CNSs from many vertebrate genomes into dbCNS, which allows users to extract CNSs near genes of interest using keyword searches. In addition to CNSs, dbCNS contains published genome sequences of 161 species. With purposeful taxonomic sampling of genomes, users can employ CNSs as queries to reconstruct CNS alignments and phylogenetic trees, to evaluate CNS modifications, acquisitions, and losses, and to roughly identify species with CNSs having accelerated substitution rates. dbCNS also produces links to dbSNP for searching pathogenic single-nucleotide polymorphisms in human CNSs. Thus, dbCNS connects morphological changes with genetic diseases. A test analysis using 38 gnathostome genomes was accomplished within 30 s. dbCNS results can evaluate CNSs identified by other stand-alone programs using genome-scale data.
Collapse
Affiliation(s)
- Jun Inoue
- Population Genetics Laboratory, Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Japan.,Center for Earth Surface System Dynamics, Atmosphere and Ocean Research Institute, University of Tokyo, Kashiwa, Japan
| | - Naruya Saitou
- Population Genetics Laboratory, Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Japan.,Department of Okinawa Bioinformation Bank, Faculty of Medicine, University of the Ryukyus, Okinawa, Japan
| |
Collapse
|
23
|
Joshi M, Kapopoulou A, Laurent S. Impact of Genetic Variation in Gene Regulatory Sequences: A Population Genomics Perspective. Front Genet 2021; 12:660899. [PMID: 34276769 PMCID: PMC8282999 DOI: 10.3389/fgene.2021.660899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 05/31/2021] [Indexed: 01/06/2023] Open
Abstract
The unprecedented rise of high-throughput sequencing and assay technologies has provided a detailed insight into the non-coding sequences and their potential role as gene expression regulators. These regulatory non-coding sequences are also referred to as cis-regulatory elements (CREs). Genetic variants occurring within CREs have been shown to be associated with altered gene expression and phenotypic changes. Such variants are known to occur spontaneously and ultimately get fixed, due to selection and genetic drift, in natural populations and, in some cases, pave the way for speciation. Hence, the study of genetic variation at CREs has improved our overall understanding of the processes of local adaptation and evolution. Recent advances in high-throughput sequencing and better annotations of CREs have enabled the evaluation of the impact of such variation on gene expression, phenotypic alteration and fitness. Here, we review recent research on the evolution of CREs and concentrate on studies that have investigated genetic variation occurring in these regulatory sequences within the context of population genetics.
Collapse
Affiliation(s)
- Manas Joshi
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | | | - Stefan Laurent
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| |
Collapse
|
24
|
Bella F, Campo S. Long non-coding RNAs and their involvement in bipolar disorders. Gene 2021; 796-797:145803. [PMID: 34175394 DOI: 10.1016/j.gene.2021.145803] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 06/22/2021] [Indexed: 01/22/2023]
Abstract
Non-coding RNAs (nc-RNAs) can be defined as RNA molecules that are not translated into proteins. Although the functional meaning of many nc-RNAs remains still to be verified, several of these molecules have a clear biological importance, which goes from translation of mRNAs to DNA replication. Indeed, regulatory nc-RNAs can be classified into two groups: short non-coding RNAs (sncRNAs) and long-non coding RNAs (lncRNAs). In the last years, lncRNAs have gained increasing importance in the study of gene regulation, helping authors understand the molecular mechanisms underlying cellular physiology and pathology. LncRNAs are greater than 200 bp and accumulate in nucleus, cytoplasm and exosomes with high tissue specificity, acting in cis or in trans in order to exert enhancer or silencer modulation on gene expression. Such regulatory features, which are widespread in human cells and tissues, can be disrupted in several morbid states. Recent evidences may suggest a disruption of lncRNAs in bipolar disorders, a cluster of severe, chronic and disabling psychiatric diseases, which are characterized by major depressive states cyclically alternating with manic episodes. Here, the authors reviewed genes, classification, biogenesis, structures, functions and databases regarding lncRNAs, and also focused on bipolar disorders, in which some lncRNAs, especially those involved in inflammation and neuronal development, has reported to be dysregulated.
Collapse
Affiliation(s)
- Fabrizio Bella
- Department of Biomedical and Dental Sciences and Morphofunctional Images, University of Messina, via Consolare Valeria, 1, Messina 98125 Italy
| | - Salvatore Campo
- Department of Biomedical and Dental Sciences and Morphofunctional Images, University of Messina, via Consolare Valeria, 1, Messina 98125 Italy.
| |
Collapse
|
25
|
Steely CJ, Russell KL, Feusier JE, Qiao Y, Tavtigian SV, Marth G, Jorde LB. Mobile element insertions and associated structural variants in longitudinal breast cancer samples. Sci Rep 2021; 11:13020. [PMID: 34158539 PMCID: PMC8219704 DOI: 10.1038/s41598-021-92444-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/07/2021] [Indexed: 02/05/2023] Open
Abstract
While mobile elements are largely inactive in healthy somatic tissues, increased activity has been found in cancer tissues, with significant variation among different cancer types. In addition to insertion events, mobile elements have also been found to mediate many structural variation events in the genome. Here, to better understand the timing and impact of mobile element insertions and associated structural variants in cancer, we examined their activity in longitudinal samples of four metastatic breast cancer patients. We identified 11 mobile element insertions or associated structural variants and found that the majority of these occurred early in tumor progression. Most of the variants impact intergenic regions; however, we identified a translocation interrupting MAP2K4 involving Alu elements and a deletion in YTHDF2 involving mobile elements that likely inactivate reported tumor suppressor genes. The high variant allele fraction of the translocation, the loss of the other copy of MAP2K4, the recurrent loss-of-function mutations found in this gene in other cancers, and the important function of MAP2K4 indicate that this translocation is potentially a driver mutation. Overall, using a unique longitudinal dataset, we find that most variants are likely passenger mutations in the four patients we examined, but some variants impact tumor progression.
Collapse
Affiliation(s)
- Cody J Steely
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA.
| | - Kristi L Russell
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
| | - Julie E Feusier
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
| | - Yi Qiao
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| | - Sean V Tavtigian
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, UT, 84112, USA
| | - Gabor Marth
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| |
Collapse
|
26
|
Ni P, Su Z. Accurate prediction of cis-regulatory modules reveals a prevalent regulatory genome of humans. NAR Genom Bioinform 2021; 3:lqab052. [PMID: 34159315 PMCID: PMC8210889 DOI: 10.1093/nargab/lqab052] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 05/01/2021] [Accepted: 06/14/2021] [Indexed: 02/07/2023] Open
Abstract
cis-regulatory modules(CRMs) formed by clusters of transcription factor (TF) binding sites (TFBSs) are as important as coding sequences in specifying phenotypes of humans. It is essential to categorize all CRMs and constituent TFBSs in the genome. In contrast to most existing methods that predict CRMs in specific cell types using epigenetic marks, we predict a largely cell type agonistic but more comprehensive map of CRMs and constituent TFBSs in the gnome by integrating all available TF ChIP-seq datasets. Our method is able to partition 77.47% of genome regions covered by available 6092 datasets into a CRM candidate (CRMC) set (56.84%) and a non-CRMC set (43.16%). Intriguingly, the predicted CRMCs are under strong evolutionary constraints, while the non-CRMCs are largely selectively neutral, strongly suggesting that the CRMCs are likely cis-regulatory, while the non-CRMCs are not. Our predicted CRMs are under stronger evolutionary constraints than three state-of-the-art predictions (GeneHancer, EnhancerAtlas and ENCODE phase 3) and substantially outperform them for recalling VISTA enhancers and non-coding ClinVar variants. We estimated that the human genome might encode about 1.47M CRMs and 68M TFBSs, comprising about 55% and 22% of the genome, respectively; for both of which, we predicted 80%. Therefore, the cis-regulatory genome appears to be more prevalent than originally thought.
Collapse
Affiliation(s)
- Pengyu Ni
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| | - Zhengchang Su
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| |
Collapse
|
27
|
Ewing AD, Cheetham SW, McGill JJ, Sharkey M, Walker R, West JA, West MJ, Summers KM. Microdeletion of 9q22.3: A patient with minimal deletion size associated with a severe phenotype. Am J Med Genet A 2021; 185:2070-2083. [PMID: 33960642 PMCID: PMC8251932 DOI: 10.1002/ajmg.a.62224] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 03/17/2021] [Accepted: 04/02/2021] [Indexed: 01/20/2023]
Abstract
Basal cell nevus syndrome (also known as Gorlin Syndrome; MIM109400) is an autosomal dominant disorder characterized by recurrent pathological features such as basal cell carcinomas and odontogenic keratocysts as well as skeletal abnormalities. Most affected individuals have point mutations or small insertions or deletions within the PTCH1 gene on human chromosome 9, but there are some cases with more extensive deletion of the region, usually including the neighboring FANCC and/or ERCC6L2 genes. We report a 16‐year‐old patient with a deletion of approximately 400,000 bases which removes only PTCH1 and some non‐coding RNA genes but leaves FANCC and ERCC6L2 intact. In spite of the small amount of DNA for which he is haploid, his phenotype is more extreme than many individuals with longer deletions in the region. This includes early presentation with a large number of basal cell nevi and other skin lesions, multiple jaw keratocysts, and macrosomia. We found that the deletion was in the paternal chromosome, in common with other macrosomia cases. Using public databases, we have examined possible interactions between sequences within and outside the deletion and speculate that a regulatory relationship exists with flanking genes, which is unbalanced by the deletion, resulting in abnormal activation or repression of the target genes and hence the severity of the phenotype.
Collapse
Affiliation(s)
- Adam D Ewing
- Mater Research Institute-University of Queensland, Translational Research Institute, Woolloongabba, Queensland, Australia
| | - Seth W Cheetham
- Mater Research Institute-University of Queensland, Translational Research Institute, Woolloongabba, Queensland, Australia
| | - James J McGill
- Department of Chemical Pathology, Royal Brisbane and Women's Hospital, Herston, Queensland, Australia
| | - Michael Sharkey
- Paddington Dermatology Specialist Clinic, Paddington, Queensland, Australia
| | - Rick Walker
- QLD Youth Cancer Service, Queensland Children's Hospital, South Brisbane, Queensland, Australia.,School of Clinical Medicine, The University of Queensland, Herston, Queensland, Australia
| | - Jennifer A West
- Northside Clinical School, Prince Charles Hospital, The University of Queensland, Chermside, Queensland, Australia
| | - Malcolm J West
- Northside Clinical School, Prince Charles Hospital, The University of Queensland, Chermside, Queensland, Australia
| | - Kim M Summers
- Mater Research Institute-University of Queensland, Translational Research Institute, Woolloongabba, Queensland, Australia
| |
Collapse
|
28
|
Kohl S, Llavona P, Sauer A, Reuter P, Weisschuh N, Kempf M, Dehmelt FA, Arrenberg AB, Sliesoraityte I, Zrenner E, van Schooneveld MJ, Rudolph G, Kühlewein L, Wissinger B. A duplication on chromosome 16q12 affecting the IRXB gene cluster is associated with autosomal dominant cone dystrophy with early tritanopic color vision defect. Hum Mol Genet 2021; 30:1218-1229. [PMID: 33891002 PMCID: PMC8212766 DOI: 10.1093/hmg/ddab117] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 04/15/2021] [Accepted: 04/15/2021] [Indexed: 02/06/2023] Open
Abstract
Cone dystrophies are a rare subgroup of inherited retinal dystrophies and hallmarked by color vision defects, low or decreasing visual acuity and central vision loss, nystagmus and photophobia. Applying genome-wide linkage analysis and array comparative genome hybridization, we identified a locus for autosomal dominant cone dystrophy on chromosome 16q12 in four independent multigeneration families. The locus is defined by duplications of variable size with a smallest region of overlap of 608 kb affecting the IRXB gene cluster and encompasses the genes IRX5 and IRX6. IRX5 and IRX6 belong to the Iroquois (Iro) protein family of homeodomain-containing transcription factors involved in patterning and regionalization of embryonic tissue in vertebrates, including the eye and the retina. All patients presented with a unique progressive cone dystrophy phenotype hallmarked by early tritanopic color vision defects. We propose that the disease underlies a misregulation of the IRXB gene cluster on chromosome 16q12 and demonstrate that overexpression of Irx5a and Irx6a, the two orthologous genes in zebrafish, results in visual impairment in 5-day-old zebrafish larvae.
Collapse
Affiliation(s)
- Susanne Kohl
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany
| | - Pablo Llavona
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany
| | - Alexandra Sauer
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany
| | - Peggy Reuter
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany
| | - Nicole Weisschuh
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany
| | - Melanie Kempf
- University Eye Hospital, Centre for Ophthalmology, University of Tübingen, Universitätsklinikum Tübingen, Tübingen 72076, Germany.,Center for Rare Eye Diseases, University of Tübingen, Tübingen 72076, Germany
| | - Florian Alexander Dehmelt
- Werner Reichardt Centre for Integrative Neuroscience and Institute of Neurobiology, University of Tübingen, Tübingen 72076, Germany
| | - Aristides B Arrenberg
- Werner Reichardt Centre for Integrative Neuroscience and Institute of Neurobiology, University of Tübingen, Tübingen 72076, Germany
| | - Ieva Sliesoraityte
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany
| | - Eberhart Zrenner
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany.,Werner Reichardt Centre for Integrative Neuroscience and Institute of Neurobiology, University of Tübingen, Tübingen 72076, Germany
| | - Mary J van Schooneveld
- Department of Ophthalmology, Amsterdam University Medical Centre, Amsterdam 1100 DD, The Netherlands.,Bartiméus Diagnostic Department, Zeist, The Netherlands
| | - Günther Rudolph
- Department of Ophthalmology, University Hospital, LMU Munich, München 80336, Germany
| | - Laura Kühlewein
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany.,University Eye Hospital, Centre for Ophthalmology, University of Tübingen, Universitätsklinikum Tübingen, Tübingen 72076, Germany
| | - Bernd Wissinger
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tübingen, Tübingen 72076, Germany
| |
Collapse
|
29
|
Fu W, Wang R, Yu J, Hu D, Cai Y, Shao J, Jiang Y. GGVD: A goat genome variation database for tracking the dynamic evolutionary process of selective signatures and ancient introgressions. J Genet Genomics 2021; 48:248-256. [PMID: 33965348 DOI: 10.1016/j.jgg.2021.03.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 03/12/2021] [Accepted: 03/13/2021] [Indexed: 11/20/2022]
Abstract
Understanding the evolutionary history and adaptive process depends on the knowledge that we can acquire from both ancient and modern genomic data. With the availability of a deluge of whole-genome sequencing data from ancient and modern goat samples, a user-friendly database making efficient reuse of these important resources is needed. Here, we use the genomes of 208 modern domestic goats, 24 bezoars, 46 wild ibexes, and 82 ancient goats to present a comprehensive goat genome variation database (GGVD). GGVD hosts a total of ∼41.44 million SNPs, ∼5.14 million indels, 6,193 selected loci, and 112 introgression regions. Users can freely visualize the frequency of genomic variations in geographical maps, selective sweeps in interactive tables, Manhattan plots, or line charts, as well as the heatmap patterns of the SNP genotype. Ancient data can be shown in haplotypes to track the state of genetic variants of selection and introgression events in the early, middle, and late stages. For facilitating access to sequence features, the UCSC Genome Browser, BLAT, BLAST, LiftOver, and pcadapt are also integrated into GGVD. GGVD will be a convenient tool for population genetic studies and molecular marker designing in goat breeding programs, and it is publicly available at http://animal.nwsuaf.edu.cn/GoatVar.
Collapse
Affiliation(s)
- Weiwei Fu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Rui Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Jiantao Yu
- College of Information Engineering, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Dexiang Hu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yudong Cai
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Junjie Shao
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China.
| |
Collapse
|
30
|
Vitsios D, Dhindsa RS, Middleton L, Gussow AB, Petrovski S. Prioritizing non-coding regions based on human genomic constraint and sequence context with deep learning. Nat Commun 2021; 12:1504. [PMID: 33686085 PMCID: PMC7940646 DOI: 10.1038/s41467-021-21790-4] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 02/12/2021] [Indexed: 11/14/2022] Open
Abstract
Elucidating functionality in non-coding regions is a key challenge in human genomics. It has been shown that intolerance to variation of coding and proximal non-coding sequence is a strong predictor of human disease relevance. Here, we integrate intolerance to variation, functional genomic annotations and primary genomic sequence to build JARVIS: a comprehensive deep learning model to prioritize non-coding regions, outperforming other human lineage-specific scores. Despite being agnostic to evolutionary conservation, JARVIS performs comparably or outperforms conservation-based scores in classifying pathogenic single-nucleotide and structural variants. In constructing JARVIS, we introduce the genome-wide residual variation intolerance score (gwRVIS), applying a sliding-window approach to whole genome sequencing data from 62,784 individuals. gwRVIS distinguishes Mendelian disease genes from more tolerant CCDS regions and highlights ultra-conserved non-coding elements as the most intolerant regions in the human genome. Both JARVIS and gwRVIS capture previously inaccessible human-lineage constraint information and will enhance our understanding of the non-coding genome. Intolerance to variation is a strong indicator of disease relevance for coding regions of the human genome. Here, the authors present JARVIS, a deep learning method integrating intolerance to variation in non-coding regions and sequence-specific annotations to infer non-coding variant pathogenicity.
Collapse
Affiliation(s)
- Dimitrios Vitsios
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK.
| | - Ryan S Dhindsa
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Lawrence Middleton
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Ayal B Gussow
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA
| | - Slavé Petrovski
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK.
| |
Collapse
|
31
|
Hayes K, Barton HJ, Zeng K. A Study of Faster-Z Evolution in the Great Tit (Parus major). Genome Biol Evol 2021; 12:210-222. [PMID: 32119100 PMCID: PMC7144363 DOI: 10.1093/gbe/evaa044] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/26/2020] [Indexed: 12/17/2022] Open
Abstract
Sex chromosomes contribute substantially to key evolutionary processes such as speciation and adaptation. Several theories suggest that evolution could occur more rapidly on sex chromosomes, but currently our understanding of whether and how this occurs is limited. Here, we present an analysis of the great tit (Parus major) genome, aiming to detect signals of faster-Z evolution. We find mixed evidence of faster divergence on the Z chromosome than autosomes, with significantly higher divergence being found in ancestral repeats, but not at 4- or 0-fold degenerate sites. Interestingly, some 4-fold sites appear to be selectively constrained, which may mislead analyses that use these sites as the neutral reference (e.g., dN/dS). Consistent with other studies in birds, the mutation rate is significantly higher in males than females, and the long-term Z-to-autosome effective population size ratio is only 0.5, significantly lower than the expected value of 0.75. These are indicative of male-driven evolution and high variance in male reproductive success, respectively. We find no evidence for an increased efficacy of positive selection on the Z chromosome. In contrast, the Z chromosome in great tits appears to be affected by increased genetic drift, which has led to detectable signals of weakened intensity of purifying selection. These results provide further evidence that the Z chromosome often has a low effective population size, and that this has important consequences for its evolution. They also highlight the importance of considering multiple factors that can affect the rate of evolution and effective population sizes of sex chromosomes.
Collapse
Affiliation(s)
- Kai Hayes
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| | - Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom.,Organismal and Evolutionary Biology Research Program, University of Helsinki, Finland
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| |
Collapse
|
32
|
Yang H, Luan Y, Liu T, Lee HJ, Fang L, Wang Y, Wang X, Zhang B, Jin Q, Ang KC, Xing X, Wang J, Xu J, Song F, Sriranga I, Khunsriraksakul C, Salameh T, Li D, Choudhary MNK, Topczewski J, Wang K, Gerhard GS, Hardison RC, Wang T, Cheng KC, Yue F. A map of cis-regulatory elements and 3D genome structures in zebrafish. Nature 2020; 588:337-343. [PMID: 33239788 DOI: 10.1038/s41586-020-2962-9] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2019] [Accepted: 09/17/2020] [Indexed: 01/08/2023]
Abstract
The zebrafish (Danio rerio) has been widely used in the study of human disease and development, and about 70% of the protein-coding genes are conserved between the two species1. However, studies in zebrafish remain constrained by the sparse annotation of functional control elements in the zebrafish genome. Here we performed RNA sequencing, assay for transposase-accessible chromatin using sequencing (ATAC-seq), chromatin immunoprecipitation with sequencing, whole-genome bisulfite sequencing, and chromosome conformation capture (Hi-C) experiments in up to eleven adult and two embryonic tissues to generate a comprehensive map of transcriptomes, cis-regulatory elements, heterochromatin, methylomes and 3D genome organization in the zebrafish Tübingen reference strain. A comparison of zebrafish, human and mouse regulatory elements enabled the identification of both evolutionarily conserved and species-specific regulatory sequences and networks. We observed enrichment of evolutionary breakpoints at topologically associating domain boundaries, which were correlated with strong histone H3 lysine 4 trimethylation (H3K4me3) and CCCTC-binding factor (CTCF) signals. We performed single-cell ATAC-seq in zebrafish brain, which delineated 25 different clusters of cell types. By combining long-read DNA sequencing and Hi-C, we assembled the sex-determining chromosome 4 de novo. Overall, our work provides an additional epigenomic anchor for the functional annotation of vertebrate genomes and the study of evolutionarily conserved elements of 3D genome organization.
Collapse
Affiliation(s)
- Hongbo Yang
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | - Yu Luan
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | - Tingting Liu
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | - Hyung Joo Lee
- Department of Genetics, The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Li Fang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yanli Wang
- Bioinformatics and Genomics Program, The Pennsylvania State University, State College, PA, USA
| | - Xiaotao Wang
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | - Bo Zhang
- Bioinformatics and Genomics Program, The Pennsylvania State University, State College, PA, USA
| | - Qiushi Jin
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | - Khai Chung Ang
- Department of Pathology and Penn State Zebrafish Functional Genomics Core, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
| | - Xiaoyun Xing
- Department of Genetics, The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Juan Wang
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | - Jie Xu
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | - Fan Song
- Bioinformatics and Genomics Program, The Pennsylvania State University, State College, PA, USA
| | - Iyyanki Sriranga
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA
| | | | - Tarik Salameh
- Bioinformatics and Genomics Program, The Pennsylvania State University, State College, PA, USA
| | - Daofeng Li
- Department of Genetics, The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Mayank N K Choudhary
- Department of Genetics, The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Jacek Topczewski
- Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.,Stanley Manne Children's Research Institute, Ann and Robert H. Lurie Children's Hospital of Chicago, Chicago, IL, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Glenn S Gerhard
- Department of Medical Genetics and Molecular Biochemistry, Lewis Katz School of Medicine at Temple University, Philadelphia, PA, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA
| | - Ting Wang
- Department of Genetics, The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Keith C Cheng
- Department of Pathology and Penn State Zebrafish Functional Genomics Core, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
| | - Feng Yue
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, IL, USA. .,Robert H. Lurie Comprehensive Cancer Center of Northwestern University, Chicago, IL, USA.
| |
Collapse
|
33
|
Low Prevalence of Lactase Persistence in Bronze Age Europe Indicates Ongoing Strong Selection over the Last 3,000 Years. Curr Biol 2020; 30:4307-4315.e13. [DOI: 10.1016/j.cub.2020.08.033] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 07/07/2020] [Accepted: 08/07/2020] [Indexed: 11/20/2022]
|
34
|
Babenko V, Babenko R, Orlov Y. Analyzing a putative enhancer of optic disc morphology. BMC Genet 2020; 21:73. [PMID: 33092545 PMCID: PMC7583307 DOI: 10.1186/s12863-020-00873-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 06/23/2020] [Indexed: 01/06/2023] Open
Abstract
Background Genome-wide association studies have identified the CDC7-TGFBR3 intergenic region on chromosome 1 to be strongly associated with optic disc area size. The mechanism of its function remained unclear until new data on eQTL markers emerged from the Genotype-Tissue Expression project. The target region was found to contain a strong silencer of the distal (800 kb) Transcription Factor (TF) gene GFI1 (Growth Factor Independent Transcription Repressor 1) specifically in neuroendocrine cells (pituitary gland). GFI1 has also been reported to be involved in the development of sensory neurons and hematopoiesis. Therefore, GFI1, being a developmental gene, is likely to affect optic disc area size by altering the expression of the associated genes via long-range interactions. Results Distribution of haplotypes in the putative enhancer region has been assessed using the data on four continental supergroups generated by the 1000 Genomes Project. The East Asian (EAS) populations were shown to manifest a highly homogenous unimodal haplotype distribution pattern within the region with the major haplotype occurring with the frequency of 0.9. Another European specific haplotype was observed with the frequency of 0.21. The major haplotype appears to be involved in silencing GFI1repressor gene expression, which might be the cause of increased optic disc area characteristic of the EAS populations. The enhancer/eQTL region overlaps AluJo element, which implies that this particular regulatory element is primate-specific and confined to few tissues. Conclusion Population specific distribution of GFI1 enhancer alleles may predispose certain ethnic groups to glaucoma.
Collapse
Affiliation(s)
- Vladimir Babenko
- Institute of Cytology and Genetics, Lavrentyeva 10, Novosibirsk, 630090, Russia. .,Novosibirsk State University, Pirogova Str 2, Novosibirsk, 630090, Russia.
| | - Roman Babenko
- Institute of Cytology and Genetics, Lavrentyeva 10, Novosibirsk, 630090, Russia.,Novosibirsk State University, Pirogova Str 2, Novosibirsk, 630090, Russia
| | - Yuri Orlov
- Institute of Cytology and Genetics, Lavrentyeva 10, Novosibirsk, 630090, Russia.,Novosibirsk State University, Pirogova Str 2, Novosibirsk, 630090, Russia.,I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), Trubetskaya 8-2, Moscow, 119991, Russia
| |
Collapse
|
35
|
An Evolutionary Cancer Epigenetic Approach Revealed DNA Hypermethylation of Ultra-Conserved Non-Coding Elements in Squamous Cell Carcinoma of Different Mammalian Species. Cells 2020; 9:cells9092092. [PMID: 32933205 PMCID: PMC7565279 DOI: 10.3390/cells9092092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Revised: 09/09/2020] [Accepted: 09/10/2020] [Indexed: 12/15/2022] Open
Abstract
Background: Ultra-conserved non-coding elements (UCNEs) are genomic sequences that exhibit > 95% sequence identity between humans, mammals, birds, reptiles, and fish. Recent findings reported their functional role in cancer. The aim of this study was to evaluate the DNA methylation modifications of UNCEs in squamous cell carcinoma (SCC) from different mammal species. Methods: Fifty SCCs from 26 humans, 17 cats, 3 dogs, 1 horse, 1 bovine, 1 badger, and 1 porcupine were investigated. Fourteen feline stomatitis and normal samples from 36 healthy human donors, 7 cats, 5 dogs, 5 horses, 2 bovines and 1 badger were collected as normal controls. Bisulfite next generation sequencing evaluated the DNA methylation level from seven UCNEs (uc.160, uc.283, uc.416, uc.339, uc.270, uc.299, and uc.328). Results: 57/59 CpGs were significantly different according to the Kruskal–Wallis test (p < 0.05) comparing normal samples with SCC. A common DNA hypermethylation pattern was observed in SCCs from all the species evaluated in this study, with an increasing trend of hypermethylation starting from normal mucosa, through stomatitis to SCC. Conclusions: Our findings indicate that UCNEs are hypermethylated in human SCC, and this behavior is also conserved among different species of mammals.
Collapse
|
36
|
Vormstein-Schneider D, Lin JD, Pelkey KA, Chittajallu R, Guo B, Arias-Garcia MA, Allaway K, Sakopoulos S, Schneider G, Stevenson O, Vergara J, Sharma J, Zhang Q, Franken TP, Smith J, Ibrahim LA, M Astro KJ, Sabri E, Huang S, Favuzzi E, Burbridge T, Xu Q, Guo L, Vogel I, Sanchez V, Saldi GA, Gorissen BL, Yuan X, Zaghloul KA, Devinsky O, Sabatini BL, Batista-Brito R, Reynolds J, Feng G, Fu Z, McBain CJ, Fishell G, Dimidschstein J. Viral manipulation of functionally distinct interneurons in mice, non-human primates and humans. Nat Neurosci 2020; 23:1629-1636. [PMID: 32807948 DOI: 10.1038/s41593-020-0692-9] [Citation(s) in RCA: 113] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 07/10/2020] [Indexed: 12/21/2022]
Abstract
Recent success in identifying gene-regulatory elements in the context of recombinant adeno-associated virus vectors has enabled cell-type-restricted gene expression. However, within the cerebral cortex these tools are largely limited to broad classes of neurons. To overcome this limitation, we developed a strategy that led to the identification of multiple new enhancers to target functionally distinct neuronal subtypes. By investigating the regulatory landscape of the disease gene Scn1a, we discovered enhancers selective for parvalbumin (PV) and vasoactive intestinal peptide-expressing interneurons. Demonstrating the functional utility of these elements, we show that the PV-specific enhancer allowed for the selective targeting and manipulation of these neurons across vertebrate species, including humans. Finally, we demonstrate that our selection method is generalizable and characterizes additional PV-specific enhancers with exquisite specificity within distinct brain regions. Altogether, these viral tools can be used for cell-type-specific circuit manipulation and hold considerable promise for use in therapeutic interventions.
Collapse
Affiliation(s)
| | - Jessica D Lin
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Kenneth A Pelkey
- National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
| | - Ramesh Chittajallu
- National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
| | - Baolin Guo
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Mario A Arias-Garcia
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Kathryn Allaway
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA.,NYU Langone Medical Center, New York University, New York, NY, USA
| | - Sofia Sakopoulos
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Gates Schneider
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Olivia Stevenson
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Josselyn Vergara
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Jitendra Sharma
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Qiangge Zhang
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Tom P Franken
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Jared Smith
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Leena A Ibrahim
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Kevin J M Astro
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Ehsan Sabri
- Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Shuhan Huang
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Emilia Favuzzi
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Timothy Burbridge
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Qing Xu
- Center for Genomics & Systems Biology, New York University, Abu Dhabi, UAE
| | - Lihua Guo
- Center for Genomics & Systems Biology, New York University, Abu Dhabi, UAE
| | - Ian Vogel
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Vanessa Sanchez
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Giuseppe A Saldi
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Bram L Gorissen
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Xiaoqing Yuan
- National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
| | - Kareem A Zaghloul
- National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA
| | - Orrin Devinsky
- Comprehensive Epilepsy Center, New York University School of Medicine, New York, NY, USA
| | - Bernardo L Sabatini
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | | | - John Reynolds
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Guoping Feng
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Zhanyan Fu
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Chris J McBain
- National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
| | - Gord Fishell
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Jordane Dimidschstein
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
| |
Collapse
|
37
|
Lee CM, Barber GP, Casper J, Clawson H, Diekhans M, Gonzalez JN, Hinrichs AS, Lee BT, Nassar LR, Powell CC, Raney BJ, Rosenbloom KR, Schmelter D, Speir ML, Zweig AS, Haussler D, Haeussler M, Kuhn RM, Kent WJ. UCSC Genome Browser enters 20th year. Nucleic Acids Res 2020; 48:D756-D761. [PMID: 31691824 PMCID: PMC7145642 DOI: 10.1093/nar/gkz1012] [Citation(s) in RCA: 93] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 10/16/2019] [Accepted: 10/25/2019] [Indexed: 12/27/2022] Open
Abstract
The University of California Santa Cruz Genome Browser website (https://genome.ucsc.edu) enters its 20th year of providing high-quality genomics data visualization and genome annotations to the research community. In the past year, we have added a new option to our web BLAT tool that allows search against all genomes, a single-cell expression viewer (https://cells.ucsc.edu), a ‘lollipop’ plot display mode for high-density variation data, a RESTful API for data extraction and a custom-track backup feature. New datasets include Tabula Muris single-cell expression data, GeneHancer regulatory annotations, The Cancer Genome Atlas Pan-Cancer variants, Genome Reference Consortium Patch sequences, new ENCODE transcription factor binding site peaks and clusters, the Database of Genomic Variants Gold Standard Variants, Genomenon Mastermind variants and three new multi-species alignment tracks.
Collapse
Affiliation(s)
- Christopher M Lee
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Galt P Barber
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Jonathan Casper
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Hiram Clawson
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Mark Diekhans
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | | | - Angie S Hinrichs
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Brian T Lee
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Luis R Nassar
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Conner C Powell
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Brian J Raney
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Kate R Rosenbloom
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Daniel Schmelter
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Matthew L Speir
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Ann S Zweig
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - David Haussler
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA.,Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Maximilian Haeussler
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Robert M Kuhn
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - W James Kent
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| |
Collapse
|
38
|
Li X, Shi L, Wang Y, Zhong J, Zhao X, Teng H, Shi X, Yang H, Ruan S, Li M, Sun ZS, Zhan Q, Mao F. OncoBase: a platform for decoding regulatory somatic mutations in human cancers. Nucleic Acids Res 2020; 47:D1044-D1055. [PMID: 30445567 PMCID: PMC6323961 DOI: 10.1093/nar/gky1139] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 11/11/2018] [Indexed: 12/16/2022] Open
Abstract
Whole-exome and whole-genome sequencing have revealed millions of somatic mutations associated with different human cancers, and the vast majority of them are located outside of coding sequences, making it challenging to directly interpret their functional effects. With the rapid advances in high-throughput sequencing technologies, genome-scale long-range chromatin interactions were detected, and distal target genes of regulatory elements were determined using three-dimensional (3D) chromatin looping. Herein, we present OncoBase (http://www.oncobase.biols.ac.cn/), an integrated database for annotating 81 385 242 somatic mutations in 68 cancer types from more than 120 cancer projects by exploring their roles in distal interactions between target genes and regulatory elements. OncoBase integrates local chromatin signatures, 3D chromatin interactions in different cell types and reconstruction of enhancer-target networks using state-of-the-art algorithms. It employs informative visualization tools to display the integrated local and 3D chromatin signatures and effects of somatic mutations on regulatory elements. Enhancer-promoter interactions estimated from chromatin interactions are integrated into a network diffusion system that quantitatively prioritizes somatic mutations and target genes from a large pool. Thus, OncoBase is a useful resource for the functional annotation of regulatory noncoding regions and systematically benchmarking the regulatory effects of embedded noncoding somatic mutations in human carcinogenesis.
Collapse
Affiliation(s)
- Xianfeng Li
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China.,Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| | - Leisheng Shi
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yan Wang
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
| | - Jianing Zhong
- Key Laboratory of Prevention and Treatment of Cardiovascular and Cerebrovascular Diseases of Ministry of Education, Gannan Medical University, Ganzhou 341000,China
| | - Xiaolu Zhao
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Huajing Teng
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| | - Xiaohui Shi
- Sino-Danish college, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Haonan Yang
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Shasha Ruan
- Department of Clinical Oncology, Renmin Hospital of Wuhan University, Wuhan, Hubei 430072, China
| | - MingKun Li
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhong Sheng Sun
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| | - Qimin Zhan
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
| | - Fengbiao Mao
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
39
|
Lee KS, Bang H, Choi JK, Kim K. Accelerated Evolution of the Regulatory Sequences of Brain Development in the Human Genome. Mol Cells 2020; 43:331-339. [PMID: 32235023 PMCID: PMC7191052 DOI: 10.14348/molcells.2020.2282] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 03/03/2020] [Accepted: 03/08/2020] [Indexed: 12/14/2022] Open
Abstract
Genetic modifications in noncoding regulatory regions are likely critical to human evolution. Human-accelerated noncoding elements are highly conserved noncoding regions among vertebrates but have large differences across humans, which implies human-specific regulatory potential. In this study, we found that human-accelerated noncoding elements were frequently coupled with DNase I hypersensitive sites (DHSs), together with monomethylated and trimethylated histone H3 lysine 4, which are active regulatory markers. This coupling was particularly pronounced in fetal brains relative to adult brains, non-brain fetal tissues, and embryonic stem cells. However, fetal brain DHSs were also specifically enriched in deeply conserved sequences, implying coexistence of universal maintenance and human-specific fitness in human brain development. We assessed whether this coexisting pattern was a general one by quantitatively measuring evolutionary rates of DHSs. As a result, fetal brain DHSs showed a mixed but distinct signature of regional conservation and outlier point acceleration as compared to other DHSs. This finding suggests that brain developmental sequences are selectively constrained in general, whereas specific nucleotides are under positive selection or constraint relaxation simultaneously. Hence, we hypothesize that human- or primate-specific changes to universally conserved regulatory codes of brain development may drive the accelerated, and most likely adaptive, evolution of the regulatory network of the human brain.
Collapse
Affiliation(s)
- Kang Seon Lee
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 344, Korea
| | - Hyoeun Bang
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 344, Korea
| | - Jung Kyoon Choi
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 344, Korea
| | - Kwoneel Kim
- Department of Biology, Kyung Hee University, Seoul 0447, Korea
| |
Collapse
|
40
|
Yasuoka Y, Matsumoto M, Yagi K, Okazaki Y. Evolutionary History of GLIS Genes Illuminates Their Roles in Cell Reprograming and Ciliogenesis. Mol Biol Evol 2020; 37:100-109. [PMID: 31504761 PMCID: PMC6984359 DOI: 10.1093/molbev/msz205] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The GLIS family transcription factors, GLIS1 and GLIS3, potentiate generation of induced pluripotent stem cells (iPSCs). In contrast, another GLIS family member, GLIS2, suppresses cell reprograming. To understand how these disparate roles arose, we examined evolutionary origins and genomic organization of GLIS genes. Comprehensive phylogenetic analysis shows that GLIS1 and GLIS3 originated during vertebrate whole genome duplication, whereas GLIS2 is a sister group to the GLIS1/3 and GLI families. This result is consistent with their opposing functions in cell reprograming. Glis1 evolved faster than Glis3, losing many protein-interacting motifs. This suggests that Glis1 acquired new functions under weakened evolutionary constraints. In fact, GLIS1 induces induced pluripotent stem cells more strongly. Transcriptomic data from various animal embryos demonstrate that glis1 is maternally expressed in some tetrapods, whereas vertebrate glis3 and invertebrate glis1/3 genes are rarely expressed in oocytes, suggesting that vertebrate (or tetrapod) Glis1 acquired a new expression domain and function as a maternal factor. Furthermore, comparative genomic analysis reveals that glis1/3 is part of a bilaterian-specific gene cluster, together with rfx3, ndc1, hspb11, and lrrc42. Because known functions of these genes are related to cilia formation and function, the last common ancestor of bilaterians may have acquired this cluster by shuffling gene order to establish more sophisticated epithelial tissues involving cilia. This evolutionary study highlights the significance of GLIS1/3 for cell reprograming, development, and diseases in ciliated organs such as lung, kidney, and pancreas.
Collapse
Affiliation(s)
- Yuuri Yasuoka
- Laboratory for Comprehensive Genomic Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Masahito Matsumoto
- Laboratory for Comprehensive Genomic Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Tokyo, Japan.,Advanced Diabetic Therapeutics, Department of Metabolic Endocrinology, Graduate School of Medicine, Juntendo University, Tokyo, Japan.,Department of Biofunction Research, Institute of Biomaterials and Bioengineering, Tokyo Medical and Dental University, Tokyo, Japan
| | - Ken Yagi
- Laboratory for Comprehensive Genomic Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yasushi Okazaki
- Laboratory for Comprehensive Genomic Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Tokyo, Japan
| |
Collapse
|
41
|
Rheinbay E, Nielsen MM, Abascal F, Wala JA, Shapira O, Tiao G, Hornshøj H, Hess JM, Juul RI, Lin Z, Feuerbach L, Sabarinathan R, Madsen T, Kim J, Mularoni L, Shuai S, Lanzós A, Herrmann C, Maruvka YE, Shen C, Amin SB, Bandopadhayay P, Bertl J, Boroevich KA, Busanovich J, Carlevaro-Fita J, Chakravarty D, Chan CWY, Craft D, Dhingra P, Diamanti K, Fonseca NA, Gonzalez-Perez A, Guo Q, Hamilton MP, Haradhvala NJ, Hong C, Isaev K, Johnson TA, Juul M, Kahles A, Kahraman A, Kim Y, Komorowski J, Kumar K, Kumar S, Lee D, Lehmann KV, Li Y, Liu EM, Lochovsky L, Park K, Pich O, Roberts ND, Saksena G, Schumacher SE, Sidiropoulos N, Sieverling L, Sinnott-Armstrong N, Stewart C, Tamborero D, Tubio JMC, Umer HM, Uusküla-Reimand L, Wadelius C, Wadi L, Yao X, Zhang CZ, Zhang J, Haber JE, Hobolth A, Imielinski M, Kellis M, Lawrence MS, von Mering C, Nakagawa H, Raphael BJ, Rubin MA, Sander C, Stein LD, Stuart JM, Tsunoda T, Wheeler DA, Johnson R, Reimand J, Gerstein M, Khurana E, Campbell PJ, López-Bigas N, Weischenfeldt J, Beroukhim R, Martincorena I, Pedersen JS, Getz G. Analyses of non-coding somatic drivers in 2,658 cancer whole genomes. Nature 2020; 578:102-111. [PMID: 32025015 PMCID: PMC7054214 DOI: 10.1038/s41586-020-1965-x] [Citation(s) in RCA: 346] [Impact Index Per Article: 86.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Accepted: 12/02/2019] [Indexed: 01/28/2023]
Abstract
The discovery of drivers of cancer has traditionally focused on protein-coding genes1-4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available.
Collapse
Affiliation(s)
- Esther Rheinbay
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Morten Muhlig Nielsen
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark
| | | | - Jeremiah A Wala
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Bioinformatics and Integrative Genomics, Harvard University, Cambridge, MA, USA
| | - Ofer Shapira
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Grace Tiao
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Henrik Hornshøj
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark
| | - Julian M Hess
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Randi Istrup Juul
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark
| | - Ziao Lin
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Harvard University, Cambridge, MA, USA
| | - Lars Feuerbach
- Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Radhakrishnan Sabarinathan
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
| | - Tobias Madsen
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark
| | - Jaegil Kim
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Loris Mularoni
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
| | - Shimin Shuai
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
- Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Andrés Lanzós
- Department for BioMedical Research, University of Bern, Bern, Switzerland
- Graduate School of Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland
- Department of Medical Oncology, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Carl Herrmann
- Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Bioquant Center, Institute of Pharmacy and Molecular Biotechnology, University of Heidelberg, Heidelberg, Germany
| | - Yosef E Maruvka
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA
| | - Ciyue Shen
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
- cBio Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Samirkumar B Amin
- Department of Genomic Medicine, University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, TX, USA
| | - Pratiti Bandopadhayay
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Johanna Bertl
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark
| | - Keith A Boroevich
- Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - John Busanovich
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Joana Carlevaro-Fita
- Department for BioMedical Research, University of Bern, Bern, Switzerland
- Graduate School of Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland
- Department of Medical Oncology, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Dimple Chakravarty
- Department of Genitourinary Medical Oncology - Research, Division of Cancer Medicine, University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Department of Urology, Icahn school of Medicine at Mount Sinai, New York, NY, USA
| | - Calvin Wing Yiu Chan
- Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
| | - David Craft
- Department of Radiation Oncology, Harvard Medical School, Massachusetts General Hospital, Boston, MA, USA
| | - Priyanka Dhingra
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
- Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Klev Diamanti
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Nuno A Fonseca
- European Bioinformatics Institute, European Molecular Biology Laboratory, Hinxton, UK
| | - Abel Gonzalez-Perez
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
| | - Qianyun Guo
- Bioinformatics Research Centre (BiRC), Aarhus University, Aarhus, Denmark
| | - Mark P Hamilton
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, USA
| | - Nicholas J Haradhvala
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA
| | - Chen Hong
- Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
| | - Keren Isaev
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Todd A Johnson
- Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Malene Juul
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark
| | - Andre Kahles
- Division of Computational Biology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Abdullah Kahraman
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Youngwook Kim
- Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Jan Komorowski
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
- Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
| | - Kiran Kumar
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Sushant Kumar
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Donghoon Lee
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Kjong-Van Lehmann
- Division of Computational Biology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Yilong Li
- SBGD Inc, Cambridge, MA, USA
- Department of Haematology, University of Cambridge, Cambridge, UK
| | - Eric Minwei Liu
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
- Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Lucas Lochovsky
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Keunchil Park
- Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Oriol Pich
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
| | - Nicola D Roberts
- Department of Haematology, University of Cambridge, Cambridge, UK
| | - Gordon Saksena
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Steven E Schumacher
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Nikos Sidiropoulos
- Biotech Research & Innovation Centre (BRIC), The Finsen Laboratory, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
| | - Lina Sieverling
- Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
| | | | - Chip Stewart
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - David Tamborero
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
| | - Jose M C Tubio
- Department of Zoology, Genetics and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- The Biomedical Research Centre (CINBIO), Universidade de Vigo, Vigo, Spain
| | - Husen M Umer
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
- Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institute, Stockholm, Sweden
| | - Liis Uusküla-Reimand
- Genetics and Genome Biology Program, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Gene Technology, Tallinn University of Technology, Tallinn, Estonia
| | - Claes Wadelius
- Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Lina Wadi
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | | | - Cheng-Zhong Zhang
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Jing Zhang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - James E Haber
- Department of Biology and Rosenstiel Basic Medical Sciences Research Center, Brandeis University, Waltham, MA, USA
| | - Asger Hobolth
- Bioinformatics Research Centre (BiRC), Aarhus University, Aarhus, Denmark
| | - Marcin Imielinski
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, and Englander Institute for Precision Medicine, and Institute for Computational Biomedicine, and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Manolis Kellis
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA
| | - Michael S Lawrence
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA
| | - Christian von Mering
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Hidewaki Nakagawa
- Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Sciences, Tokyo, Japan
| | - Benjamin J Raphael
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Mark A Rubin
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, USA
- Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Chris Sander
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
- cBio Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Lincoln D Stein
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
- Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Joshua M Stuart
- Center for Biomolecular Science and Engineering, University of California at Santa Cruz, Santa Cruz, CA, USA
| | - Tatsuhiko Tsunoda
- Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Department of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
- Laboratory for Medical Science Mathematics, Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
| | - David A Wheeler
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Rory Johnson
- Department for BioMedical Research, University of Bern, Bern, Switzerland
- Department of Medical Oncology, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Jüri Reimand
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
- Department of Computer Science, Yale University, New Haven, CT, USA
| | - Ekta Khurana
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
- Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, USA
- Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Peter J Campbell
- Wellcome Trust Sanger Institute, Hinxton, UK
- Department of Haematology, University of Cambridge, Cambridge, UK
| | - Núria López-Bigas
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
| | - Joachim Weischenfeldt
- Biotech Research & Innovation Centre (BRIC), The Finsen Laboratory, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark.
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.
| | - Rameen Beroukhim
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Bioinformatics and Integrative Genomics, Harvard University, Cambridge, MA, USA.
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.
| | | | - Jakob Skou Pedersen
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark.
- Bioinformatics Research Centre (BiRC), Aarhus University, Aarhus, Denmark.
| | - Gad Getz
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA.
- Harvard Medical School, Boston, MA, USA.
- Department of Pathology, Massachusetts General Hospital, Boston, MA, USA.
| |
Collapse
|
42
|
Ayad LAK, Pissis SP, Polychronopoulos D. CNEFinder: finding conserved non-coding elements in genomes. Bioinformatics 2019; 34:i743-i747. [PMID: 30423090 PMCID: PMC6129273 DOI: 10.1093/bioinformatics/bty601] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Motivation Conserved non-coding elements (CNEs) represent an enigmatic class of genomic elements which, despite being extremely conserved across evolution, do not encode for proteins. Their functions are still largely unknown. Thus, there exists a need to systematically investigate their roles in genomes. Towards this direction, identifying sets of CNEs in a wide range of organisms is an important first step. Currently, there are no tools published in the literature for systematically identifying CNEs in genomes. Results We fill this gap by presenting CNEFinder; a tool for identifying CNEs between two given DNA sequences with user-defined criteria. The results presented here show the tool’s ability of identifying CNEs accurately and efficiently. CNEFinder is based on a k-mer technique for computing maximal exact matches. The tool thus does not require or compute whole-genome alignments or indexes, such as the suffix array or the Burrows Wheeler Transform (BWT), which makes it flexible to use on a wide scale. Availability and implementation Free software under the terms of the GNU GPL (https://github.com/lorrainea/CNEFinder).
Collapse
Affiliation(s)
| | - Solon P Pissis
- Department of Informatics, King's College London, London, UK
| | | |
Collapse
|
43
|
Daane JM, Dornburg A, Smits P, MacGuigan DJ, Brent Hawkins M, Near TJ, William Detrich Iii H, Harris MP. Historical contingency shapes adaptive radiation in Antarctic fishes. Nat Ecol Evol 2019; 3:1102-1109. [PMID: 31182814 PMCID: PMC7147983 DOI: 10.1038/s41559-019-0914-2] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Accepted: 05/02/2019] [Indexed: 12/02/2022]
Abstract
Adaptive radiation illustrates links between ecological opportunity, natural selection and the generation of biodiversity. Central to adaptive radiation is the association between a diversifying lineage and the evolution of phenotypic variation that facilitates the use of new environments or resources. However, is not clear whether adaptive evolution or historical contingency is more important for the origin of key phenotypic traits in adaptive radiation. Here we use targeted sequencing of >250,000 loci across 46 species to examine hypotheses concerning the origin and diversification of key traits in the adaptive radiation of Antarctic notothenioid fishes. Contrary to expectations of adaptive evolution, we show that notothenioids experienced a punctuated burst of genomic diversification and evolved key skeletal modifications before the onset of polar conditions in the Southern Ocean. We show that diversifying selection in pathways associated with human skeletal dysplasias facilitates ecologically important variation in buoyancy among Antarctic notothenioid species, and demonstrate the sufficiency of altered trip11, col1a2 and col1a1a function in zebrafish (Danio rerio) to phenocopy skeletal reduction in Antarctic notothenioids. Rather than adaptation being driven by the cooling of the Antarctic, our results highlight the role of historical contingency in shaping the adaptive radiation of notothenioids. Understanding the historical and environmental context for the origin of key traits in adaptive radiations extends beyond reconstructing events that result in evolutionary innovation, as it also provides a context in forecasting the effects of climate change on the stability and evolvability of natural populations.
Collapse
Affiliation(s)
- Jacob M Daane
- Department of Marine and Environmental Sciences, Northeastern University Marine Science Center, Nahant, MA, USA.
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC, USA
| | - Patrick Smits
- Orthopaedic Research Laboratories, Department of Orthopaedic Surgery, Boston Children's Hospital, Boston, MA, USA
| | - Daniel J MacGuigan
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
| | - M Brent Hawkins
- Orthopaedic Research Laboratories, Department of Orthopaedic Surgery, Boston Children's Hospital, Boston, MA, USA
- Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Thomas J Near
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Peabody Museum of Natural History, Yale University, New Haven, CT, USA
| | - H William Detrich Iii
- Department of Marine and Environmental Sciences, Northeastern University Marine Science Center, Nahant, MA, USA.
| | - Matthew P Harris
- Orthopaedic Research Laboratories, Department of Orthopaedic Surgery, Boston Children's Hospital, Boston, MA, USA.
- Department of Genetics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
44
|
Mookerjee-Basu J, Hua X, Ge L, Nicolas E, Li Q, Czyzewicz P, Zhongping D, Peri S, FuxmanBass JI, Walhout AJM, Kappes DJ. Functional Conservation of a Developmental Switch in Mammals since the Jurassic Age. Mol Biol Evol 2019; 36:39-53. [PMID: 30295892 DOI: 10.1093/molbev/msy191] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
ThPOK is a "master regulator" of T lymphocyte lineage choice, whose presence or absence is sufficient to dictate development to the CD4 or CD8 lineages, respectively. Induction of ThPOK is transcriptionally regulated, via a lineage-specific silencer element, SilThPOK. Here, we take advantage of the available genome sequence data as well as site-specific gene targeting technology, to evaluate the functional conservation of ThPOK regulation across mammalian evolution, and assess the importance of motif grammar (order and orientation of TF binding sites) on SilThPOK function in vivo. We make three important points: First, the SilThPOK is present in marsupial and placental mammals, but is not found in available genome assemblies of nonmammalian vertebrates, indicating that it arose after divergence of mammals from other vertebrates. Secondly, by replacing the murine SilThPOK in situ with its marsupial equivalent using a knockin approach, we demonstrate that the marsupial SilThPOK supports correct CD4 T lymphocyte lineage-specification in mice. To our knowledge, this is the first in vivo demonstration of functional equivalency for a silencer element between marsupial and placental mammals using a definitive knockin approach. Finally, we show that alteration of the position/orientation of a highly conserved region within the murine SilThPOK is sufficient to destroy silencer activity in vivo, demonstrating that motif grammar of this "solid" synteny block is critical for silencer function. Dependence of SilThPOK function on motif grammar conserved since the mid-Jurassic age, 165 Ma, suggests that the SilThPOK operates as a silenceosome, by analogy with the previously proposed enhanceosome model.
Collapse
Affiliation(s)
- Jayati Mookerjee-Basu
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Xiang Hua
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Lu Ge
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Emmanuelle Nicolas
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Qin Li
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Philip Czyzewicz
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Dai Zhongping
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Suraj Peri
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| | - Juan I FuxmanBass
- Program in Systems Biology, Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, MA
| | - Albertha J M Walhout
- Program in Systems Biology, Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, MA
| | - Dietmar J Kappes
- Blood Cell Development and Cancer Program, Fox Chase Cancer Center, Philadelphia, PA
| |
Collapse
|
45
|
Flasch DA, Macia Á, Sánchez L, Ljungman M, Heras SR, García-Pérez JL, Wilson TE, Moran JV. Genome-wide de novo L1 Retrotransposition Connects Endonuclease Activity with Replication. Cell 2019; 177:837-851.e28. [PMID: 30955886 DOI: 10.1016/j.cell.2019.02.050] [Citation(s) in RCA: 77] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Revised: 01/10/2019] [Accepted: 02/25/2019] [Indexed: 12/18/2022]
Abstract
L1 retrotransposon-derived sequences comprise approximately 17% of the human genome. Darwinian selective pressures alter L1 genomic distributions during evolution, confounding the ability to determine initial L1 integration preferences. Here, we generated high-confidence datasets of greater than 88,000 engineered L1 insertions in human cell lines that act as proxies for cells that accommodate retrotransposition in vivo. Comparing these insertions to a null model, in which L1 endonuclease activity is the sole determinant dictating L1 integration preferences, demonstrated that L1 insertions are not significantly enriched in genes, transcribed regions, or open chromatin. By comparison, we provide compelling evidence that the L1 endonuclease disproportionately cleaves predominant lagging strand DNA replication templates, while lagging strand 3'-hydroxyl groups may prime endonuclease-independent L1 retrotransposition in a Fanconi anemia cell line. Thus, acquisition of an endonuclease domain, in conjunction with the ability to integrate into replicating DNA, allowed L1 to become an autonomous, interspersed retrotransposon.
Collapse
Affiliation(s)
- Diane A Flasch
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan, 48109, USA.
| | - Ángela Macia
- Department of Genomic Medicine, GENYO: Centre for Genomics and Oncology (Pfizer-University of Granada and Andalusian Regional Government), PTS Granada, 18016, Spain
| | - Laura Sánchez
- Department of Genomic Medicine, GENYO: Centre for Genomics and Oncology (Pfizer-University of Granada and Andalusian Regional Government), PTS Granada, 18016, Spain
| | - Mats Ljungman
- Department of Radiation Oncology, University of Michigan Comprehensive Cancer Center, Translational Oncology Program and Center for RNA Biomedicine, University of Michigan, Ann Arbor, Michigan, 48109, USA; Department of Environmental Health Sciences, School of Public Health, University of Michigan, Ann Arbor, Michigan, 48109, USA
| | - Sara R Heras
- Department of Genomic Medicine, GENYO: Centre for Genomics and Oncology (Pfizer-University of Granada and Andalusian Regional Government), PTS Granada, 18016, Spain
| | - José L García-Pérez
- Department of Genomic Medicine, GENYO: Centre for Genomics and Oncology (Pfizer-University of Granada and Andalusian Regional Government), PTS Granada, 18016, Spain; Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine (IGMM), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
| | - Thomas E Wilson
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan, 48109, USA; Department of Pathology, University of Michigan Medical School, Ann Arbor, Michigan, 48109, USA.
| | - John V Moran
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan, 48109, USA; Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, Michigan, 48109, USA.
| |
Collapse
|
46
|
Babenko V, Babenko R, Gamieldien J, Markel A. FTO haplotyping underlines high obesity risk for European populations. BMC Med Genomics 2019; 12:46. [PMID: 30871540 PMCID: PMC6417164 DOI: 10.1186/s12920-019-0491-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Background Fat mass and obesity-associated (FTO) gene has been under close investigation since the discovery of its high impact on the obesity status in 2007 by a range of publications. Recent report on its implication in adipocytes underscored its molecular and functional mechanics in pathology. Still, the population specific features of the locus structure have not been approached in detail. Methods We analyzed the population specific haplotype profiles of FTO genomic locus identified by Genome Wide Association Studies (GWAS) for the high obesity risk by examining eighteen 1000G populations from 4 continental groups. The GWAS SNPs cluster is located in the FTO gene intron 1 spanning around 70 kb. Results We reconstructed the ancestral state of the locus, which comprised low-risk major allele found in all populations, and two minor risk-associated alleles, each one specific for African and European populations, correspondingly. The locus structure and its allele frequency distribution underscore the high risk allele frequency specifically for the European population. South Asian populations have the second highest frequency of risk alleles, while East Asian populations have the lowest. African population-specific minor allele was only partially risk-associated. All of the GWAS SNPs considered are manifested by low risk alleles as reference (major) ones (p > 0.5) in each of the continental groups. Strikingly, rs1421085, recently reported as a causal SNP, was found to be monomorphic in ancestral (African) populations, implying possible selection sweep in the course of its rapid fixation, as reported previously. Conclusion The observations underscore varying FTO -linked risk in the manifestation of population specific epidemiology of genetically bound obesity. The results imply that the FTO locus is one of the major genetic determinants for obesity risk from GWAS SNPs set. Electronic supplementary material The online version of this article (10.1186/s12920-019-0491-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Vladimir Babenko
- Federal Research Center Institute of Cytology and Genetics SB RAS, 10 Lavrentieva Ave, Novosibirsk, Russian Federation, 630090. .,Novosibirsk State University, 2 Pirogova Str, Novosibirsk, Russian Federation, 630090.
| | - Roman Babenko
- Federal Research Center Institute of Cytology and Genetics SB RAS, 10 Lavrentieva Ave, Novosibirsk, Russian Federation, 630090.,Novosibirsk State University, 2 Pirogova Str, Novosibirsk, Russian Federation, 630090
| | - Junaid Gamieldien
- South African National Bioinformatics Institute/SAMRC Unit for Bioinformatics Capacity Development, University of the Western Cape, Cape Town, 7535, South Africa
| | - Arcady Markel
- Federal Research Center Institute of Cytology and Genetics SB RAS, 10 Lavrentieva Ave, Novosibirsk, Russian Federation, 630090.,Novosibirsk State University, 2 Pirogova Str, Novosibirsk, Russian Federation, 630090
| |
Collapse
|
47
|
Weirick T, Militello G, Ponomareva Y, John D, Döring C, Dimmeler S, Uchida S. Logic programming to infer complex RNA expression patterns from RNA-seq data. Brief Bioinform 2019; 19:199-209. [PMID: 28011754 DOI: 10.1093/bib/bbw117] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2016] [Indexed: 12/15/2022] Open
Abstract
To meet the increasing demand in the field, numerous long noncoding RNA (lncRNA) databases are available. Given many lncRNAs are specifically expressed in certain cell types and/or time-dependent manners, most lncRNA databases fall short of providing such profiles. We developed a strategy using logic programming to handle the complex organization of organs, their tissues and cell types as well as gender and developmental time points. To showcase this strategy, we introduce 'RenalDB' (http://renaldb.uni-frankfurt.de), a database providing expression profiles of RNAs in major organs focusing on kidney tissues and cells. RenalDB uses logic programming to describe complex anatomy, sample metadata and logical relationships defining expression, enrichment or specificity. We validated the content of RenalDB with biological experiments and functionally characterized two long intergenic noncoding RNAs: LOC440173 is important for cell growth or cell survival, whereas PAXIP1-AS1 is a regulator of cell death. We anticipate RenalDB will be used as a first step toward functional studies of lncRNAs in the kidney.
Collapse
Affiliation(s)
- Tyler Weirick
- Institute of Cardiovascular Regeneration, Centre for Molecular Medicine, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, Germany.,German Center for Cardiovascular Research, Partner side Rhein-Main, Frankfurt am Main, Germany.,Cardiovascular Innovation Institute, University of Louisville, 302 E Muhammad Ali Blvd, Louisville, KY, U.S.A
| | - Giuseppe Militello
- Institute of Cardiovascular Regeneration, Centre for Molecular Medicine, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, Germany.,German Center for Cardiovascular Research, Partner side Rhein-Main, Frankfurt am Main, Germany.,Cardiovascular Innovation Institute, University of Louisville, 302 E Muhammad Ali Blvd, Louisville, KY, U.S.A
| | - Yuliya Ponomareva
- Institute of Cardiovascular Regeneration, Centre for Molecular Medicine, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, Germany.,German Center for Cardiovascular Research, Partner side Rhein-Main, Frankfurt am Main, Germany
| | - David John
- Institute of Cardiovascular Regeneration, Centre for Molecular Medicine, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, Germany.,German Center for Cardiovascular Research, Partner side Rhein-Main, Frankfurt am Main, Germany
| | - Claudia Döring
- Dr. Senckenberg Institute of Pathology, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, Germany
| | - Stefanie Dimmeler
- Institute of Cardiovascular Regeneration, Centre for Molecular Medicine, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, Germany.,German Center for Cardiovascular Research, Partner side Rhein-Main, Frankfurt am Main, Germany
| | - Shizuka Uchida
- Institute of Cardiovascular Regeneration, Centre for Molecular Medicine, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, Germany.,German Center for Cardiovascular Research, Partner side Rhein-Main, Frankfurt am Main, Germany.,Cardiovascular Innovation Institute, University of Louisville, 302 E Muhammad Ali Blvd, Louisville, KY, U.S.A
| |
Collapse
|
48
|
Wang M, Lemos B. Ribosomal DNA harbors an evolutionarily conserved clock of biological aging. Genome Res 2019; 29:325-333. [PMID: 30765617 PMCID: PMC6396418 DOI: 10.1101/gr.241745.118] [Citation(s) in RCA: 75] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Accepted: 01/22/2019] [Indexed: 01/17/2023]
Abstract
The ribosomal DNA (rDNA) is the most evolutionarily conserved segment of the genome and gives origin to the nucleolus, an energy intensive nuclear organelle and major hub influencing myriad molecular processes from cellular metabolism to epigenetic states of the genome. The rDNA/nucleolus has been directly and mechanistically implicated in aging and longevity in organisms as diverse as yeasts, Drosophila, and humans. The rDNA is also a significant target of DNA methylation that silences supernumerary rDNA units and regulates nucleolar activity. Here, we introduce an age clock built exclusively with CpG methylation within the rDNA. The ribosomal clock is sufficient to accurately estimate individual age within species, is responsive to genetic and environmental interventions that modulate life-span, and operates across species as distant as humans, mice, and dogs. Further analyses revealed a significant excess of age-associated hypermethylation in the rDNA relative to other segments of the genome, and which forms the basis of the rDNA clock. Our observations identified an evolutionarily conserved marker of aging that is easily ascertained, grounded on nucleolar biology, and could serve as a universal marker to gauge individual age and response to interventions in humans as well as laboratory and wild organisms across a wide diversity of species.
Collapse
Affiliation(s)
- Meng Wang
- Department of Environmental Health, Program in Molecular and Integrative Physiological Sciences, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Bernardo Lemos
- Department of Environmental Health, Program in Molecular and Integrative Physiological Sciences, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA.,Broad Institute of Harvard and MIT, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
49
|
Abstract
One of the most important resources for researchers of noncoding RNAs is the information available in public databases spread over the internet. However, the effective exploration of this data can represent a daunting task, given the large amount of databases available and the variety of stored data. This chapter describes a classification of databases based on information source, type of RNA, source organisms, data formats, and the mechanisms for information retrieval, detailing the relevance of each of these classifications and its usability by researchers. This classification is used to update a 2012 review, indexing now more than 229 public databases. This review will include an assessment of the new trends for ncRNA research based on the information that is being offered by the databases. Additionally, we will expand the previous analysis focusing on the usability and application of these databases in pathogen and disease research. Finally, this chapter will analyze how currently available database schemas can help the development of new and improved web resources.
Collapse
|
50
|
Wirthlin M, Lima NCB, Guedes RLM, Soares AER, Almeida LGP, Cavaleiro NP, Loss de Morais G, Chaves AV, Howard JT, Teixeira MDM, Schneider PN, Santos FR, Schatz MC, Felipe MS, Miyaki CY, Aleixo A, Schneider MPC, Jarvis ED, Vasconcelos ATR, Prosdocimi F, Mello CV. Parrot Genomes and the Evolution of Heightened Longevity and Cognition. Curr Biol 2018; 28:4001-4008.e7. [PMID: 30528582 DOI: 10.1016/j.cub.2018.10.050] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Revised: 08/14/2018] [Accepted: 10/23/2018] [Indexed: 10/27/2022]
Abstract
Parrots are one of the most distinct and intriguing groups of birds, with highly expanded brains [1], highly developed cognitive [2] and vocal communication [3] skills, and a long lifespan compared to other similar-sized birds [4]. Yet the genetic basis of these traits remains largely unidentified. To address this question, we have generated a high-coverage, annotated assembly of the genome of the blue-fronted Amazon (Amazona aestiva) and carried out extensive comparative analyses with 30 other avian species, including 4 additional parrots. We identified several genomic features unique to parrots, including parrot-specific novel genes and parrot-specific modifications to coding and regulatory sequences of existing genes. We also discovered genomic features under strong selection in parrots and other long-lived birds, including genes previously associated with lifespan determination as well as several hundred new candidate genes. These genes support a range of cellular functions, including telomerase activity; DNA damage repair; control of cell proliferation, cancer, and immunity; and anti-oxidative mechanisms. We also identified brain-expressed, parrot-specific paralogs with known functions in neural development or vocal-learning brain circuits. Intriguingly, parrot-specific changes in conserved regulatory sequences were overwhelmingly associated with genes that are linked to cognitive abilities and have undergone similar selection in the human lineage, suggesting convergent evolution. These findings bring novel insights into the genetics and evolution of longevity and cognition, as well as provide novel targets for exploring the mechanistic basis of these traits.
Collapse
Affiliation(s)
- Morgan Wirthlin
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239, USA
| | - Nicholas C B Lima
- Laboratório de Genômica e Biodiversidade, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ 21941-902, Brazil
| | - Rafael Lucas Muniz Guedes
- Laboratório Nacional de Computação Científica, Rua Getúlio Vargas 333, Quitandinha, Petrópolis, RJ 25651-070, Brazil
| | - André E R Soares
- Laboratório Nacional de Computação Científica, Rua Getúlio Vargas 333, Quitandinha, Petrópolis, RJ 25651-070, Brazil
| | - Luiz Gonzaga P Almeida
- Laboratório Nacional de Computação Científica, Rua Getúlio Vargas 333, Quitandinha, Petrópolis, RJ 25651-070, Brazil
| | - Nathalia P Cavaleiro
- Laboratório Nacional de Computação Científica, Rua Getúlio Vargas 333, Quitandinha, Petrópolis, RJ 25651-070, Brazil
| | - Guilherme Loss de Morais
- Laboratório Nacional de Computação Científica, Rua Getúlio Vargas 333, Quitandinha, Petrópolis, RJ 25651-070, Brazil
| | - Anderson V Chaves
- Programa de Pós-graduação em Manejo e Conservação de Ecossistemas Naturais e Agrários, Instituto de Ciências Biológicas e da Saúde, Universidade Federal de Viçosa, Florestal, Minas Gerais, Brazil
| | - Jason T Howard
- Laboratory of Neurogenetics of Language, Rockefeller University, New York, NY 10065, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Marcus de Melo Teixeira
- Núcleo de Medicina Tropical, Faculdade de Medicina, Universidade de Brasília, Brasília, DF 70910-900, Brazil
| | - Patricia N Schneider
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, PA, Brazil
| | - Fabrício R Santos
- Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Michael C Schatz
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Maria Sueli Felipe
- Programa de Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília e Depto. de Biologia Celular, Universidade de Brasilia, Brasilia, DF, Brazil
| | - Cristina Y Miyaki
- Instituto de Biociências, Universidade de São Paulo, R. do Matão, 277, São Paulo, SP 05508-090, Brazil
| | - Alexandre Aleixo
- Coordenação de Zoologia, Museu Paraense Emilio Goeldi, Belém, PA 66040-170, Brazil
| | - Maria P C Schneider
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, PA, Brazil
| | - Erich D Jarvis
- Laboratory of Neurogenetics of Language, Rockefeller University, New York, NY 10065, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Ana Tereza R Vasconcelos
- Laboratório Nacional de Computação Científica, Rua Getúlio Vargas 333, Quitandinha, Petrópolis, RJ 25651-070, Brazil
| | - Francisco Prosdocimi
- Laboratório de Genômica e Biodiversidade, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ 21941-902, Brazil.
| | - Claudio V Mello
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239, USA.
| |
Collapse
|