1
|
Romanenko SA, Kliver SF, Serdyukova NA, Perelman PL, Trifonov VA, Seluanov A, Gorbunova V, Azpurua J, Pereira JC, Ferguson-Smith MA, Graphodatsky AS. Integration of fluorescence in situ hybridization and chromosome-length genome assemblies revealed synteny map for guinea pig, naked mole-rat, and human. Sci Rep 2023; 13:21055. [PMID: 38030702 PMCID: PMC10687270 DOI: 10.1038/s41598-023-46595-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 11/02/2023] [Indexed: 12/01/2023] Open
Abstract
Descriptions of karyotypes of many animal species are currently available. In addition, there has been a significant increase in the number of sequenced genomes and an ever-improving quality of genome assembly. To close the gap between genomic and cytogenetic data we applied fluorescent in situ hybridization (FISH) and Hi-C technology to make the first full chromosome-level genome comparison of the guinea pig (Cavia porcellus), naked mole-rat (Heterocephalus glaber), and human. Comparative chromosome maps obtained by FISH with chromosome-specific probes link genomic scaffolds to individual chromosomes and orient them relative to centromeres and heterochromatic blocks. Hi-C assembly made it possible to close all gaps on the comparative maps and to reveal additional rearrangements that distinguish the karyotypes of the three species. As a result, we integrated the bioinformatic and cytogenetic data and adjusted the previous comparative maps and genome assemblies of the guinea pig, naked mole-rat, and human. Syntenic associations in the two hystricomorphs indicate features of their putative ancestral karyotype. We postulate that the two approaches applied in this study complement one another and provide complete information about the organization of these genomes at the chromosome level.
Collapse
Affiliation(s)
- Svetlana A Romanenko
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Siberian Branch, Novosibirsk, Russia.
| | - Sergei F Kliver
- Center for Evolutionary Hologenomics, The Globe Institute, The University of Copenhagen, Copenhagen, Denmark
| | - Natalia A Serdyukova
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Siberian Branch, Novosibirsk, Russia
| | - Polina L Perelman
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Siberian Branch, Novosibirsk, Russia
| | - Vladimir A Trifonov
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Siberian Branch, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Andrei Seluanov
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Vera Gorbunova
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Jorge Azpurua
- Department of Biochemistry and Molecular Medicine, The George Washington University, Washington, DC, USA
| | - Jorge C Pereira
- Animal and Veterinary Research Centre, University of Trás-os-Montes and Alto Douro, Vila Real, Portugal
- Cambridge Resource Centre for Comparative Genomics, Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - Malcolm A Ferguson-Smith
- Cambridge Resource Centre for Comparative Genomics, Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - Alexander S Graphodatsky
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Siberian Branch, Novosibirsk, Russia
| |
Collapse
|
2
|
Chen X, Wang Z, Zhang C, Hu J, Lu Y, Zhou H, Mei Y, Cong Y, Guo F, Wang Y, He K, Liu Y, Li F. Unraveling the complex evolutionary history of lepidopteran chromosomes through ancestral chromosome reconstruction and novel chromosome nomenclature. BMC Biol 2023; 21:265. [PMID: 37981687 PMCID: PMC10658929 DOI: 10.1186/s12915-023-01762-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 11/06/2023] [Indexed: 11/21/2023] Open
Abstract
BACKGROUND Lepidoptera is one of the most species-rich animal groups, with substantial karyotype variations among species due to chromosomal rearrangements. Knowledge of the evolutionary patterns of lepidopteran chromosomes still needs to be improved. RESULTS Here, we used chromosome-level genome assemblies of 185 lepidopteran insects to reconstruct an ancestral reference genome and proposed a new chromosome nomenclature. Thus, we renamed over 5000 extant chromosomes with this system, revealing the historical events of chromosomal rearrangements and their features. Additionally, our findings indicate that, compared with autosomes, the Z chromosome in Lepidoptera underwent a fast loss of conserved genes, rapid acquisition of lineage-specific genes, and a low rate of gene duplication. Moreover, we presented evidence that all available 67 W chromosomes originated from a common ancestor chromosome, with four neo-W chromosomes identified, including one generated by fusion with an autosome and three derived through horizontal gene transfer. We also detected nearly 4000 inter-chromosomal gene movement events. Notably, Geminin is transferred from the autosome to the Z chromosome. When located on the autosome, Geminin shows female-biased expression, but on the Z chromosome, it exhibits male-biased expression. This contributes to the sexual dimorphism of body size in silkworms. CONCLUSIONS Our study sheds light on the complex evolutionary history of lepidopteran chromosomes based on ancestral chromosome reconstruction and novel chromosome nomenclature.
Collapse
Affiliation(s)
- Xi Chen
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Zuoqi Wang
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Chaowei Zhang
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Jingheng Hu
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yueqi Lu
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Hang Zhou
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yang Mei
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yuyang Cong
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Fangyuan Guo
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Yaqin Wang
- State Key Laboratory of Rice Biology, Institute of Biotechnology, Zhejiang University, Hangzhou, China
| | - Kang He
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Ying Liu
- Key Laboratory of Green Prevention and Control of Agricultural Transboundary Pests of Yunnan Province and Agricultural Environment/ Agriculture Environment and Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming, China
| | - Fei Li
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China.
| |
Collapse
|
3
|
Skorupski J, Brandes F, Seebass C, Festl W, Śmietana P, Balacco J, Jain N, Tilley T, Abueg L, Wood J, Sims Y, Formenti G, Fedrigo O, Jarvis ED. Prioritizing Endangered Species in Genome Sequencing: Conservation Genomics in Action with the First Platinum-Standard Reference-Quality Genome of the Critically Endangered European Mink Mustela lutreola L., 1761. Int J Mol Sci 2023; 24:14816. [PMID: 37834264 PMCID: PMC10573602 DOI: 10.3390/ijms241914816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 09/23/2023] [Accepted: 09/26/2023] [Indexed: 10/15/2023] Open
Abstract
The European mink Mustela lutreola (Mustelidae) ranks among the most endangered mammalian species globally, experiencing a rapid and severe decline in population size, density, and distribution. Given the critical need for effective conservation strategies, understanding its genomic characteristics becomes paramount. To address this challenge, the platinum-quality, chromosome-level reference genome assembly for the European mink was successfully generated under the project of the European Mink Centre consortium. Leveraging PacBio HiFi long reads, we obtained a 2586.3 Mbp genome comprising 25 scaffolds, with an N50 length of 154.1 Mbp. Through Hi-C data, we clustered and ordered the majority of the assembly (>99.9%) into 20 chromosomal pseudomolecules, including heterosomes, ranging from 6.8 to 290.1 Mbp. The newly sequenced genome displays a GC base content of 41.9%. Additionally, we successfully assembled the complete mitochondrial genome, spanning 16.6 kbp in length. The assembly achieved a BUSCO (Benchmarking Universal Single-Copy Orthologs) completeness score of 98.2%. This high-quality reference genome serves as a valuable genomic resource for future population genomics studies concerning the European mink and related taxa. Furthermore, the newly assembled genome holds significant potential in addressing key conservation challenges faced by M. lutreola. Its applications encompass potential revision of management units, assessment of captive breeding impacts, resolution of phylogeographic questions, and facilitation of monitoring and evaluating the efficiency and effectiveness of dedicated conservation strategies for the European mink. This species serves as an example that highlights the paramount importance of prioritizing endangered species in genome sequencing projects due to the race against time, which necessitates the comprehensive exploration and characterization of their genomic resources before their populations face extinction.
Collapse
Affiliation(s)
- Jakub Skorupski
- Institute of Marine and Environmental Sciences, University of Szczecin, Wąska 13 St., 71-415 Szczecin, Poland
- Polish Society for Conservation Genetics LUTREOLA, Maciejkowa 21 St., 71-784 Szczecin, Poland
| | - Florian Brandes
- Wildtier- und Artenschutzstation e.V., Hohe Warte 1, 31553 Sachsenhagen, Germany
| | | | - Wolfgang Festl
- EuroNerz e.V., Kleine Gildewart 3, 49074 Osnabrück, Germany
| | - Przemysław Śmietana
- Institute of Marine and Environmental Sciences, University of Szczecin, Wąska 13 St., 71-415 Szczecin, Poland
- Polish Society for Conservation Genetics LUTREOLA, Maciejkowa 21 St., 71-784 Szczecin, Poland
| | - Jennifer Balacco
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Nivesh Jain
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Tatiana Tilley
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Linelle Abueg
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Jonathan Wood
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Ying Sims
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| | - Erich D. Jarvis
- Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA
| |
Collapse
|
4
|
Kliver S, Houck ML, Perelman PL, Totikov A, Tomarovsky A, Dudchenko O, Omer AD, Colaric Z, Weisz D, Aiden EL, Chan S, Hastie A, Komissarov A, Ryder OA, Graphodatsky A, Johnson WE, Maldonado JE, Pukazhenthi BS, Marinari PE, Wildt DE, Koepfli KP. Chromosome-length genome assembly and karyotype of the endangered black-footed ferret (Mustela nigripes). J Hered 2023; 114:539-548. [PMID: 37249392 PMCID: PMC10848218 DOI: 10.1093/jhered/esad035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 05/27/2023] [Indexed: 05/31/2023] Open
Abstract
The black-footed ferret (Mustela nigripes) narrowly avoided extinction to become an oft-cited example of the benefits of intensive management, research, and collaboration to save a species through ex situ conservation breeding and reintroduction into its former range. However, the species remains at risk due to possible inbreeding, disease susceptibility, and multiple fertility challenges. Here, we report the de novo genome assembly of a male black-footed ferret generated through a combination of linked-read sequencing, optical mapping, and Hi-C proximity ligation. In addition, we report the karyotype for this species, which was used to anchor and assign chromosome numbers to the chromosome-length scaffolds. The draft assembly was ~2.5 Gb in length, with 95.6% of it anchored to 19 chromosome-length scaffolds, corresponding to the 2n = 38 chromosomes revealed by the karyotype. The assembly has contig and scaffold N50 values of 148.8 kbp and 145.4 Mbp, respectively, and is up to 96% complete based on BUSCO analyses. Annotation of the assembly, including evidence from RNA-seq data, identified 21,406 protein-coding genes and a repeat content of 37.35%. Phylogenomic analyses indicated that the black-footed ferret diverged from the European polecat/domestic ferret lineage 1.6 million yr ago. This assembly will enable research on the conservation genomics of black-footed ferrets and thereby aid in the further restoration of this endangered species.
Collapse
Affiliation(s)
- Sergei Kliver
- Center for Evolutionary Hologenomics, The Globe Institute, The University of Copenhagen, Copenhagen, Denmark
| | - Marlys L Houck
- Beckman Center for Conservation Research, San Diego Zoo Wildlife Alliance, Escondido, CA, United States
| | - Polina L Perelman
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia
| | - Azamat Totikov
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia
| | - Andrey Tomarovsky
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia
| | - Olga Dudchenko
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States
- Center for Theoretical Biological Physics and Department of Computer Science, Rice University, Houston, TX, United States
| | - Arina D Omer
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States
| | - Zane Colaric
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States
| | - David Weisz
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States
| | - Erez Lieberman Aiden
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States
- Center for Theoretical Biological Physics and Department of Computer Science, Rice University, Houston, TX, United States
- Broad Institute of MIT and Harvard, Cambridge, MA, United States
| | - Saki Chan
- Department of Research and Development, Bionano Genomics, San Diego, CA, United States
| | - Alex Hastie
- Department of Research and Development, Bionano Genomics, San Diego, CA, United States
| | - Aleksey Komissarov
- Applied Genomics Laboratory, SCAMT Institute, ITMO University, Saint Petersburg, Russia
| | - Oliver A Ryder
- Beckman Center for Conservation Research, San Diego Zoo Wildlife Alliance, Escondido, CA, United States
| | - Alexander Graphodatsky
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia
| | - Warren E Johnson
- Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States
- The Walter Reed Biosystematics Unit, Museum Support Center MRC-534, Smithsonian Institution, Suitland, MD, United States
- Walter Reed Army Institute of Research, Silver Spring, MD, United States
- Loyola University Maryland, Baltimore, MD, United States
| | - Jesús E Maldonado
- Center for Conservation Genomics, Smithsonian’s National Zoo and Conservation Biology Institute, Washington, DC, United States
| | - Budhan S Pukazhenthi
- Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States
| | - Paul E Marinari
- Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States
| | - David E Wildt
- Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States
| | - Klaus-Peter Koepfli
- Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States
- Smithsonian-Mason School of Conservation, George Mason University, Front Royal, VA, United States
| |
Collapse
|
5
|
Biltueva LS, Vorobieva NV, Lemskya NA, Perelman PL, Trifonov VA, Panov VV, Abramov AV, Kawada SI, Serdukova NA, Graphodatsky AS. Chromosomal Evolution of the Talpinae. Genes (Basel) 2023; 14:1472. [PMID: 37510376 PMCID: PMC10379030 DOI: 10.3390/genes14071472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 07/14/2023] [Accepted: 07/15/2023] [Indexed: 07/30/2023] Open
Abstract
In recent years, the number of mole species with species status confirmed by genetic methods has been continuously increasing. Unfortunately, cytogenetic data are not yet available for all species. Here, for the first time, a GTG-banded karyotype of the small-toothed mole from Vietnam, Euroscaptor parvidens, a representative of the Eastern clade of the genus Euroscaptor, has been described. Through comparative analysis of available Euroscaptor (Euroscaptor parvidens, Euroscaptor klossi, and Euroscaptor malayana) and Oreoscaptor (Oreoscaptor mizura) karyotypes, we found cytogenetic signatures for each of the studied species. Zoo-FISH with sorted chromosomes of the Siberian mole (Talpa altaica) on chromosome sets of the small-toothed mole (E. parvidens), the small Japanese mole (Mogera imaizumii) from the closely related genus, and the Japanese shrew mole (Urotrichus talpoides) from the tribe Urotrichini made it possible to identify syntenic regions between these species. We propose a possible ancestral karyotype of the tribe and, based on it, traced the features of chromosomal rearrangements accompanying the divergence of moles. The low rates of chromosomal evolution within the species of the genus Talpa-T. altaica and T. europaea-and the high rates of karyotypic reshuffling within the Asian genera of the tribe were confirmed. The karyotype of the Japanese mountain mole O. mizura seems to be the most conserved among the Asian moles. The most frequently occurring types of chromosomal rearrangements in moles are the pericentric inversions and amplification of heterochromatin. The pericentric inversions on four pairs of autosomes are shared between the closely related genera Euroscaptor, Oreoscaptor, and Mogera, while many more apomorphic rearrangements have occurred in each lineage additionally. The highest rate of chromosomal changes, with five rearrangements occurring over approximately 7 million years, was recorded in the lineage of the small-toothed mole.
Collapse
Affiliation(s)
- Larisa S Biltueva
- Institute of Molecular and Cellular Biology SB RAS, Lavrentiev Ave., 8/2, 630090 Novosibirsk, Russia
| | - Nadezhda V Vorobieva
- Institute of Molecular and Cellular Biology SB RAS, Lavrentiev Ave., 8/2, 630090 Novosibirsk, Russia
| | - Natalya A Lemskya
- Institute of Molecular and Cellular Biology SB RAS, Lavrentiev Ave., 8/2, 630090 Novosibirsk, Russia
| | - Polina L Perelman
- Institute of Molecular and Cellular Biology SB RAS, Lavrentiev Ave., 8/2, 630090 Novosibirsk, Russia
| | - Vladimir A Trifonov
- Institute of Molecular and Cellular Biology SB RAS, Lavrentiev Ave., 8/2, 630090 Novosibirsk, Russia
| | - Victor V Panov
- Institute of Systematics and Ecology of Animals SB RAS, Frunze st.11, 630091 Novosibirsk, Russia
| | - Alexey V Abramov
- Zoological Institute RAS, 199034 Saint Petersburg, Russia
- Joint Vietnamese-Russian Tropical Research and Technological Centre, Nguyen Van Huyen, Nghia Do, Cau Giay, Hanoi 650000, Vietnam
| | - Shin-Ichiro Kawada
- Department of Zoology, National Museum of Nature and Science, 4-1-1, Amakubo, Tsukuba 305-0005, Ibaraki, Japan
| | - Natalya A Serdukova
- Institute of Molecular and Cellular Biology SB RAS, Lavrentiev Ave., 8/2, 630090 Novosibirsk, Russia
| | - Alexandr S Graphodatsky
- Institute of Molecular and Cellular Biology SB RAS, Lavrentiev Ave., 8/2, 630090 Novosibirsk, Russia
| |
Collapse
|
6
|
Yakupova A, Tomarovsky A, Totikov A, Beklemisheva V, Logacheva M, Perelman PL, Komissarov A, Dobrynin P, Krasheninnikova K, Tamazian G, Serdyukova NA, Rayko M, Bulyonkova T, Cherkasov N, Pylev V, Peterfeld V, Penin A, Balanovska E, Lapidus A, OBrien SJ, Graphodatsky A, Koepfli KP, Kliver S. Chromosome-Length Assembly of the Baikal Seal (Pusa sibirica) Genome Reveals a Historically Large Population Prior to Isolation in Lake Baikal. Genes (Basel) 2023; 14:genes14030619. [PMID: 36980891 PMCID: PMC10048373 DOI: 10.3390/genes14030619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 01/31/2023] [Accepted: 02/24/2023] [Indexed: 03/05/2023] Open
Abstract
Pusa sibirica, the Baikal seal, is the only extant, exclusively freshwater, pinniped species. The pending issue is, how and when they reached their current habitat—the rift lake Baikal, more than three thousand kilometers away from the Arctic Ocean. To explore the demographic history and genetic diversity of this species, we generated a de novo chromosome-length assembly, and compared it with three closely related marine pinniped species. Multiple whole genome alignment of the four species compared with their karyotypes showed high conservation of chromosomal features, except for three large inversions on chromosome VI. We found the mean heterozygosity of the studied Baikal seal individuals was relatively low (0.61 SNPs/kbp), but comparable to other analyzed pinniped samples. Demographic reconstruction of seals revealed differing trajectories, yet remarkable variations in Ne occurred during approximately the same time periods. The Baikal seal showed a significantly more severe decline relative to other species. This could be due to the difference in environmental conditions encountered by the earlier populations of Baikal seals, as ice sheets changed during glacial–interglacial cycles. We connect this period to the time of migration to Lake Baikal, which occurred ~3–0.3 Mya, after which the population stabilized, indicating balanced habitat conditions.
Collapse
Affiliation(s)
- Aliya Yakupova
- Computer Technologies Laboratory, ITMO University, 19701 Saint Petersburg, Russia
- Correspondence: (A.Y.); (A.G.)
| | - Andrey Tomarovsky
- Computer Technologies Laboratory, ITMO University, 19701 Saint Petersburg, Russia
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, 630090 Novosibirsk, Russia
| | - Azamat Totikov
- Computer Technologies Laboratory, ITMO University, 19701 Saint Petersburg, Russia
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, 630090 Novosibirsk, Russia
| | - Violetta Beklemisheva
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, 630090 Novosibirsk, Russia
| | - Maria Logacheva
- Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
| | - Polina L. Perelman
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, 630090 Novosibirsk, Russia
| | - Aleksey Komissarov
- Applied Genomics Laboratory, SCAMT Institute, ITMO University, 9 Ulitsa Lomonosova, 191002 Saint Petersburg, Russia
| | - Pavel Dobrynin
- Computer Technologies Laboratory, ITMO University, 19701 Saint Petersburg, Russia
- Human Genetics Laboratory, Vavilov Institute of General Genetics RAS, 119991 Moscow, Russia
| | | | - Gaik Tamazian
- Centre for Computational Biology, Peter the Great Saint Petersburg Polytechnic University, 195251 St. Petersburg, Russia
| | - Natalia A. Serdyukova
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, 630090 Novosibirsk, Russia
| | - Mike Rayko
- Center for Bioinformatics and Algorithmic Biotechnology, St. Petersburg State University, 199034 St. Petersburg, Russia
| | - Tatiana Bulyonkova
- Laboratory of Mixed Computations, A.P. Ershov Institute of Informatics Systems SB RAS, 630090 Novosibirsk, Russia
| | - Nikolay Cherkasov
- Centre for Computational Biology, Peter the Great Saint Petersburg Polytechnic University, 195251 St. Petersburg, Russia
| | - Vladimir Pylev
- Laboratory of Human Population Genetics, Research Centre for Medical Genetics, 115522 Moscow, Russia
| | - Vladimir Peterfeld
- Baikal Branch of State Research and Industrial Center of Fisheries, 670034 Ulan-Ude, Russia
| | - Aleksey Penin
- Institute for Information Transmission Problems of the Russian Academy of Sciences, 127051 Moscow, Russia
| | - Elena Balanovska
- Laboratory of Human Population Genetics, Research Centre for Medical Genetics, 115522 Moscow, Russia
| | - Alla Lapidus
- Center for Bioinformatics and Algorithmic Biotechnology, St. Petersburg State University, 199034 St. Petersburg, Russia
| | - DNA Zoo Consortium
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Stephen J. OBrien
- Guy Harvey Oceanographic Center, Halmos College of Arts and Sciences, NOVA Southeastern University, Fort Lauderdale, FL 33004, USA
| | - Alexander Graphodatsky
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, 630090 Novosibirsk, Russia
- Correspondence: (A.Y.); (A.G.)
| | - Klaus-Peter Koepfli
- Smithsonian-Mason School of Conservation, George Mason University, 1500 Remount Road, Front Royal, VA 22630, USA
- Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, 1500 Remount Road, Front Royal, VA 22630, USA
| | - Sergei Kliver
- Center for Evolutionary Hologenomics, The Globe Institute, The University of Copenhagen, 5A, Oester Farimagsgade, 1353 Copenhagen, Denmark
| |
Collapse
|
7
|
Dmitriev AA, Pushkova EN, Melnikova NV. Plant Genome Sequencing: Modern Technologies and Novel Opportunities for Breeding. Mol Biol 2022. [DOI: 10.1134/s0026893322040045] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
|
8
|
Derežanin L, Blažytė A, Dobrynin P, Duchêne DA, Grau JH, Jeon S, Kliver S, Koepfli KP, Meneghini D, Preick M, Tomarovsky A, Totikov A, Fickel J, Förster DW. Multiple types of genomic variation contribute to adaptive traits in the mustelid subfamily Guloninae. Mol Ecol 2022; 31:2898-2919. [PMID: 35334142 DOI: 10.1111/mec.16443] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 03/06/2022] [Accepted: 03/14/2022] [Indexed: 11/28/2022]
Abstract
Species of the mustelid subfamily Guloninae inhabit diverse habitats on multiple continents, and occupy a variety of ecological niches. They differ in feeding ecologies, reproductive strategies and morphological adaptations. To identify candidate loci associated with adaptations to their respective environments, we generated a de novo assembly of the tayra (Eira barbara), the earliest diverging species in the subfamily, and compared this with the genomes available for the wolverine (Gulo gulo) and the sable (Martes zibellina). Our comparative genomic analyses included searching for signs of positive selection, examining changes in gene family sizes, as well as searching for species-specific structural variants (SVs). Among candidate loci associated with phenotypic traits, we observed many related to diet, body condition and reproduction. For example, for the tayra, which has an atypical gulonine reproductive strategy of aseasonal breeding, we observe species-specific changes in many pregnancy-related genes. For the wolverine, a circumpolar hypercarnivore that must cope with seasonal food scarcity, we observed many changes in genes associated with diet and body condition. All types of genomic variation examined (single nucleotide polymorphisms, gene family expansions, structural variants) contributed substantially to the identification of candidate loci. This strongly argues for consideration of variation other than single nucleotide polymorphisms in comparative genomics studies aiming to identify loci of adaptive significance.
Collapse
Affiliation(s)
- Lorena Derežanin
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany
| | - Asta Blažytė
- Department of Biomedical Engineering, College of Information and Biotechnology, Ulsan National Institute of Science and Technology (UNIST, Ulsan, 44919, Republic of Korea
| | - Pavel Dobrynin
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia
| | - David A Duchêne
- Center for Evolutionary Hologenomics, The GLOBE Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Øster Farimagsgade 5, 1353, Copenhagen, Denmark
| | - José Horacio Grau
- amedes Genetics, amedes Medizinische Dienstleistungen GmbH, Jägerstr. 61, 10117, Berlin, Germany
| | - Sungwon Jeon
- Department of Biomedical Engineering, College of Information and Biotechnology, Ulsan National Institute of Science and Technology (UNIST, Ulsan, 44919, Republic of Korea.,Clinomics Inc, Ulsan, 44919, Republic of Korea
| | - Sergei Kliver
- Institute of Molecular and Cellular Biology, SB RAS, 8/2 Acad. Lavrentiev Ave, Novosibirsk, 630090, Russia
| | - Klaus-Peter Koepfli
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia.,Smithsonian-Mason School of Conservation, 1500 Remount Road, Front Royal, VA, 22630, USA.,Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, 1500 Remount Road, Front Royal, VA, 22630, USA
| | - Dorina Meneghini
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany
| | - Michaela Preick
- Institute for Biochemistry and Biology, Faculty of Mathematics and Natural Sciences, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476, OT, Germany
| | - Andrey Tomarovsky
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia.,Institute of Molecular and Cellular Biology, SB RAS, 8/2 Acad. Lavrentiev Ave, Novosibirsk, 630090, Russia.,Novosibirsk State University, 1 Pirogova str, Novosibirsk, 630090, Russia
| | - Azamat Totikov
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia.,Institute of Molecular and Cellular Biology, SB RAS, 8/2 Acad. Lavrentiev Ave, Novosibirsk, 630090, Russia.,Novosibirsk State University, 1 Pirogova str, Novosibirsk, 630090, Russia
| | - Jörns Fickel
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany.,Institute for Biochemistry and Biology, Faculty of Mathematics and Natural Sciences, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476, OT, Germany
| | - Daniel W Förster
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany
| |
Collapse
|
9
|
Abstract
We found the three-dimensional (3D) structure of chromatin at the chromosome level to be highly conserved for more than 50 million y of carnivore evolution. Intrachromosomal contacts were maintained even after chromosome rearrangements within carnivore lineages, demonstrating that the maintenance of 3D chromatin architecture is essential for conserved genome functions. These discoveries enabled the identification of orthologous chromosomal DNA segments among related species, a method we call 3D comparative scaffotyping. The method has application for putative chromosome assignment of chromosome-scale DNA sequence scaffolds produced by de novo genome sequencing. Broadly applied to biodiversity genome sequencing efforts, the approach can reduce costs associated with karyotyping and the physical mapping of DNA segments to chromosomes. High throughput chromatin conformation capture (Hi-C) of leukocyte DNA was used to investigate the evolutionary stability of chromatin conformation at the chromosomal level in 11 species from three carnivore families: Felidae, Canidae, and Ursidae. Chromosome-scale scaffolds (C-scaffolds) of each species were initially used for whole-genome alignment to a reference genome within each family. This approach established putative orthologous relationships between C-scaffolds among the different species. Hi-C contact maps for all C-scaffolds were then visually compared and found to be distinct for a given reference chromosome or C-scaffold within a species and indistinguishable for orthologous C-scaffolds having a 1:1 relationship within a family. The visual patterns within families were strongly supported by eigenvectors from the Hi-C contact maps. Analysis of Hi-C contact maps and eigenvectors across the three carnivore families revealed that most cross-family orthologous subchromosomal fragments have a conserved three-dimensional (3D) chromatin structure and thus have been under strong evolutionary constraint for ∼54 My of carnivore evolution. The most pronounced differences in chromatin conformation were observed for the X chromosome and the red fox genome, whose chromosomes have undergone extensive rearrangements relative to other canids. We also demonstrate that Hi-C contact map pattern analysis can be used to accurately identify orthologous relationships between C-scaffolds and chromosomes, a method we termed “3D comparative scaffotyping.” This method provides a powerful means for estimating karyotypes in de novo sequenced species that have unknown karyotype and no physical mapping information.
Collapse
|
10
|
Pucker B, Irisarri I, de Vries J, Xu B. Plant genome sequence assembly in the era of long reads: Progress, challenges and future directions. QUANTITATIVE PLANT BIOLOGY 2022; 3:e5. [PMID: 37077982 PMCID: PMC10095996 DOI: 10.1017/qpb.2021.18] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 11/24/2021] [Accepted: 12/21/2021] [Indexed: 05/03/2023]
Abstract
Third-generation long-read sequencing is transforming plant genomics. Oxford Nanopore Technologies and Pacific Biosciences are offering competing long-read sequencing technologies and enable plant scientists to investigate even large and complex plant genomes. Sequencing projects can be conducted by single research groups and sequences of smaller plant genomes can be completed within days. This also resulted in an increased investigation of genomes from multiple species in large scale to address fundamental questions associated with the origin and evolution of land plants. Increased accessibility of sequencing devices and user-friendly software allows more researchers to get involved in genomics. Current challenges are accurately resolving diploid or polyploid genome sequences and better accounting for the intra-specific diversity by switching from the use of single reference genome sequences to a pangenome graph.
Collapse
Affiliation(s)
- Boas Pucker
- Department of Plant Sciences, University of Cambridge, Cambridge, United Kingdom
- Institute of Plant Biology & Braunschweig Integrated Centre of Systems Biology (BRICS), TU Braunschweig, Braunschweig, Germany
- Author for correspondence: Boas Pucker E-mail:
| | - Iker Irisarri
- Department of Applied Bioinformatics, Institute for Microbiology and Genetics, University of Goettingen, Göttingen, Germany
- Campus Institute Data Science (CIDAS), University of Goettingen, Göttingen, Germany
| | - Jan de Vries
- Department of Applied Bioinformatics, Institute for Microbiology and Genetics, University of Goettingen, Göttingen, Germany
- Campus Institute Data Science (CIDAS), University of Goettingen, Göttingen, Germany
- Department of Applied Bioinformatics, Göttingen Center for Molecular Biosciences (GZMB), University of Goettingen, Göttingen, Germany
| | - Bo Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
11
|
Vozdova M, Kubickova S, Cernohorska H, Fröhlich J, Rubes J. Anchoring the CerEla1.0 Genome Assembly to Red Deer ( Cervus elaphus) and Cattle ( Bos taurus) Chromosomes and Specification of Evolutionary Chromosome Rearrangements in Cervidae. Animals (Basel) 2021; 11:ani11092614. [PMID: 34573579 PMCID: PMC8465983 DOI: 10.3390/ani11092614] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 08/31/2021] [Accepted: 09/02/2021] [Indexed: 11/18/2022] Open
Abstract
Simple Summary The red deer (Cervus elaphus) de novo genome assembly (CerEla1.0) has provided a great resource for genetic studies in various deer species. In this study, we used gene order comparisons between C. elaphus CerEla1.0 and B. taurus ARS-UCD1.2 genome assemblies and fluorescence in situ hybridization (FISH) with bovine BAC probes to verify the red deer-bovine chromosome relationships and anchor the CerEla1.0 C-scaffolds to karyotypes of both species. We showed the homology between bovine and deer chromosomes and determined the centromere-telomere orientation of the CerEla1.0 C-scaffolds. Using a set of BAC probes, we were able to narrow the positions of evolutionary chromosome breakpoints defining the family Cervidae. In addition, we revealed several errors in the current CerEla1.0 genome assembly. Finally, we expanded our analysis to other Cervidae and confirmed the locations of the cervid evolutionary fissions and orientation of the fused chromosomes in eight cervid species. Our results can serve as a basis for necessary improvements of the red deer genome assembly and provide support to other genetic studies in Cervidae. Abstract The family Cervidae groups a range of species with an increasing economic significance. Their karyotypes share 35 evolutionary conserved chromosomal segments with cattle (Bos taurus). Recent publication of the annotated red deer (Cervus elaphus) whole genome assembly (CerEla1.0) has provided a basis for advanced genetic studies. In this study, we compared the red deer CerEla1.0 and bovine ARS-UCD1.2 genome assembly and used fluorescence in situ hybridization with bovine BAC probes to verify the homology between bovine and deer chromosomes, determined the centromere-telomere orientation of the CerEla1.0 C-scaffolds and specified positions of the cervid evolutionary chromosome breakpoints. In addition, we revealed several incongruences between the current deer and bovine genome assemblies that were shown to be caused by errors in the CerEla1.0 assembly. Finally, we verified the centromere-to-centromere orientation of evolutionarily fused chromosomes in seven additional deer species, giving a support to previous studies on their chromosome evolution.
Collapse
|
12
|
Chromosome-Level Genome Assemblies Expand Capabilities of Genomics for Conservation Biology. Genes (Basel) 2021; 12:genes12091336. [PMID: 34573318 PMCID: PMC8466942 DOI: 10.3390/genes12091336] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 08/20/2021] [Accepted: 08/25/2021] [Indexed: 11/26/2022] Open
Abstract
Genome assemblies are in the process of becoming an increasingly important tool for understanding genetic diversity in threatened species. Unfortunately, due to limited budgets typical for the area of conservation biology, genome assemblies of threatened species, when available, tend to be highly fragmented, represented by tens of thousands of scaffolds not assigned to chromosomal locations. The recent advent of high-throughput chromosome conformation capture (Hi-C) enables more contiguous assemblies containing scaffolds spanning the length of entire chromosomes for little additional cost. These inexpensive contiguous assemblies can be generated using Hi-C scaffolding of existing short-read draft assemblies, where N50 of the draft contigs is larger than 0.1% of the estimated genome size and can greatly improve analyses and facilitate visualization of genome-wide features including distribution of genetic diversity in markers along chromosomes or chromosome-length scaffolds. We compared distribution of genetic diversity along chromosomes of eight mammalian species, including six listed as threatened by IUCN, where both draft genome assemblies and newer chromosome-level assemblies were available. The chromosome-level assemblies showed marked improvement in localization and visualization of genetic diversity, especially where the distribution of low heterozygosity across the genomes of threatened species was not uniform.
Collapse
|
13
|
Yamaguchi K, Kadota M, Nishimura O, Ohishi Y, Naito Y, Kuraku S. Technical considerations in Hi-C scaffolding and evaluation of chromosome-scale genome assemblies. Mol Ecol 2021; 30:5923-5934. [PMID: 34432923 PMCID: PMC9292758 DOI: 10.1111/mec.16146] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 07/28/2021] [Accepted: 08/18/2021] [Indexed: 12/15/2022]
Abstract
The recent development of ecological studies has been fueled by the introduction of massive information based on chromosome‐scale genome sequences, even for species for which genetic linkage is not accessible. This was enabled mainly by the application of Hi‐C, a method for genome‐wide chromosome conformation capture that was originally developed for investigating the long‐range interaction of chromatins. Performing genomic scaffolding using Hi‐C data is highly resource‐demanding and employs elaborate laboratory steps for sample preparation. It starts with building a primary genome sequence assembly as an input, which is followed by computation for genome scaffolding using Hi‐C data, requiring careful validation. This article presents technical considerations for obtaining optimal Hi‐C scaffolding results and provides a test case of its application to a reptile species, the Madagascar ground gecko (Paroedura picta). Among the metrics that are frequently used for evaluating scaffolding results, we investigate the validity of the completeness assessment of chromosome‐scale genome assemblies using single‐copy reference orthologues.
Collapse
Affiliation(s)
- Kazuaki Yamaguchi
- Laboratory for Phyloinformatics, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| | - Mitsutaka Kadota
- Laboratory for Phyloinformatics, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| | - Osamu Nishimura
- Laboratory for Phyloinformatics, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| | - Yuta Ohishi
- Laboratory for Phyloinformatics, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| | - Yuki Naito
- Database Center for Life Science (DBCLS), Mishima, Japan
| | - Shigehiro Kuraku
- Laboratory for Phyloinformatics, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan.,Molecular Life History Laboratory, National Institute of Genetics, Mishima, Japan.,Department of Genetics, Sokendai (Graduate University for Advanced Studies), Mishima, Japan
| |
Collapse
|
14
|
Rhie A, McCarthy SA, Fedrigo O, Damas J, Formenti G, Koren S, Uliano-Silva M, Chow W, Fungtammasan A, Kim J, Lee C, Ko BJ, Chaisson M, Gedman GL, Cantin LJ, Thibaud-Nissen F, Haggerty L, Bista I, Smith M, Haase B, Mountcastle J, Winkler S, Paez S, Howard J, Vernes SC, Lama TM, Grutzner F, Warren WC, Balakrishnan CN, Burt D, George JM, Biegler MT, Iorns D, Digby A, Eason D, Robertson B, Edwards T, Wilkinson M, Turner G, Meyer A, Kautt AF, Franchini P, Detrich HW, Svardal H, Wagner M, Naylor GJP, Pippel M, Malinsky M, Mooney M, Simbirsky M, Hannigan BT, Pesout T, Houck M, Misuraca A, Kingan SB, Hall R, Kronenberg Z, Sović I, Dunn C, Ning Z, Hastie A, Lee J, Selvaraj S, Green RE, Putnam NH, Gut I, Ghurye J, Garrison E, Sims Y, Collins J, Pelan S, Torrance J, Tracey A, Wood J, Dagnew RE, Guan D, London SE, Clayton DF, Mello CV, Friedrich SR, Lovell PV, Osipova E, Al-Ajli FO, Secomandi S, Kim H, Theofanopoulou C, Hiller M, Zhou Y, Harris RS, Makova KD, Medvedev P, Hoffman J, Masterson P, Clark K, Martin F, Howe K, Flicek P, Walenz BP, Kwak W, Clawson H, Diekhans M, Nassar L, Paten B, Kraus RHS, Crawford AJ, Gilbert MTP, Zhang G, Venkatesh B, Murphy RW, Koepfli KP, Shapiro B, Johnson WE, Di Palma F, Marques-Bonet T, Teeling EC, Warnow T, Graves JM, Ryder OA, Haussler D, O'Brien SJ, Korlach J, Lewin HA, Howe K, Myers EW, Durbin R, Phillippy AM, Jarvis ED. Towards complete and error-free genome assemblies of all vertebrate species. Nature 2021; 592:737-746. [PMID: 33911273 PMCID: PMC8081667 DOI: 10.1038/s41586-021-03451-0] [Citation(s) in RCA: 698] [Impact Index Per Article: 232.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 03/12/2021] [Indexed: 02/02/2023]
Abstract
High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.
Collapse
Affiliation(s)
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Shane A McCarthy
- Department of Genetics, University of Cambridge, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Olivier Fedrigo
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
| | - Joana Damas
- The Genome Center, University of California Davis, Davis, CA, USA
| | - Giulio Formenti
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Marcela Uliano-Silva
- Leibniz Institute for Zoo and Wildlife Research, Department of Evolutionary Genetics, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
| | | | | | - Juwan Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Chul Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Byung June Ko
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
| | - Mark Chaisson
- University of Southern California, Los Angeles, CA, USA
| | - Gregory L Gedman
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Lindsey J Cantin
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Francoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Iliana Bista
- Department of Genetics, University of Cambridge, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | | | - Bettina Haase
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
| | | | - Sylke Winkler
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- DRESDEN-concept Genome Center, Dresden, Germany
| | - Sadye Paez
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | | | - Sonja C Vernes
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
- School of Biology, University of St Andrews, St Andrews, UK
| | - Tanya M Lama
- University of Massachusetts Cooperative Fish and Wildlife Research Unit, Amherst, MA, USA
| | - Frank Grutzner
- School of Biological Science, The Environment Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Wesley C Warren
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | | | - Dave Burt
- UQ Genomics, University of Queensland, Brisbane, Queensland, Australia
| | - Julia M George
- Department of Biological Sciences, Clemson University, Clemson, SC, USA
| | - Matthew T Biegler
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - David Iorns
- The Genetic Rescue Foundation, Wellington, New Zealand
| | - Andrew Digby
- Kākāpō Recovery, Department of Conservation, Invercargill, New Zealand
| | - Daryl Eason
- Kākāpō Recovery, Department of Conservation, Invercargill, New Zealand
| | - Bruce Robertson
- Department of Zoology, University of Otago, Dunedin, New Zealand
| | | | - Mark Wilkinson
- Department of Life Sciences, Natural History Museum, London, UK
| | - George Turner
- School of Natural Sciences, Bangor University, Gwynedd, UK
| | - Axel Meyer
- Department of Biology, University of Konstanz, Konstanz, Germany
| | - Andreas F Kautt
- Department of Biology, University of Konstanz, Konstanz, Germany
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Paolo Franchini
- Department of Biology, University of Konstanz, Konstanz, Germany
| | - H William Detrich
- Department of Marine and Environmental Sciences, Northeastern University Marine Science Center, Nahant, MA, USA
| | - Hannes Svardal
- Department of Biology, University of Antwerp, Antwerp, Belgium
- Naturalis Biodiversity Center, Leiden, The Netherlands
| | - Maximilian Wagner
- Institute of Biology, Karl-Franzens University of Graz, Graz, Austria
| | - Gavin J P Naylor
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
| | - Martin Pippel
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology, Dresden, Germany
| | - Milan Malinsky
- Wellcome Sanger Institute, Cambridge, UK
- Zoological Institute, University of Basel, Basel, Switzerland
| | | | | | | | - Trevor Pesout
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | | | | | | | | | | | - Ivan Sović
- Pacific Biosciences, Menlo Park, CA, USA
- Digital BioLogic, Ivanić-Grad, Croatia
| | | | - Zemin Ning
- Wellcome Sanger Institute, Cambridge, UK
| | | | - Joyce Lee
- Bionano Genomics, San Diego, CA, USA
| | | | - Richard E Green
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
- Dovetail Genomics, Santa Cruz, CA, USA
| | | | - Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Jay Ghurye
- Dovetail Genomics, Santa Cruz, CA, USA
- Department of Computer Science, University of Maryland College Park, College Park, MD, USA
| | - Erik Garrison
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Ying Sims
- Wellcome Sanger Institute, Cambridge, UK
| | | | | | | | | | | | | | - Dengfeng Guan
- Department of Genetics, University of Cambridge, Cambridge, UK
- School of Computer Science and Technology, Center for Bioinformatics, Harbin Institute of Technology, Harbin, China
| | - Sarah E London
- Department of Psychology, Institute for Mind and Biology, University of Chicago, Chicago, IL, USA
| | - David F Clayton
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| | - Claudio V Mello
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA
| | - Samantha R Friedrich
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA
| | - Peter V Lovell
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA
| | - Ekaterina Osipova
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
| | - Farooq O Al-Ajli
- Monash University Malaysia Genomics Facility, School of Science, Selangor Darul Ehsan, Malaysia
- Tropical Medicine and Biology Multidisciplinary Platform, Monash University Malaysia, Selangor Darul Ehsan, Malaysia
- Qatar Falcon Genome Project, Doha, Qatar
| | | | - Heebal Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
- eGnome, Inc., Seoul, Republic of Korea
| | | | - Michael Hiller
- LOEWE Centre for Translational Biodiversity Genomics, Frankfurt, Germany
- Senckenberg Research Institute, Frankfurt, Germany
- Goethe-University, Faculty of Biosciences, Frankfurt, Germany
| | | | - Robert S Harris
- Department of Biology, Pennsylvania State University, University Park, PA, USA
| | - Kateryna D Makova
- Department of Biology, Pennsylvania State University, University Park, PA, USA
- Center for Medical Genomics, Pennsylvania State University, University Park, PA, USA
- Center for Computational Biology and Bioinformatics, Pennsylvania State University, University Park, PA, USA
| | - Paul Medvedev
- Center for Medical Genomics, Pennsylvania State University, University Park, PA, USA
- Center for Computational Biology and Bioinformatics, Pennsylvania State University, University Park, PA, USA
- Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, USA
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA
| | - Jinna Hoffman
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
| | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
| | - Karen Clark
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
| | - Fergal Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Kevin Howe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Brian P Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Woori Kwak
- eGnome, Inc., Seoul, Republic of Korea
- Hoonygen, Seoul, Korea
| | - Hiram Clawson
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Luis Nassar
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Robert H S Kraus
- Department of Biology, University of Konstanz, Konstanz, Germany
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, Germany
| | - Andrew J Crawford
- Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - M Thomas P Gilbert
- Center for Evolutionary Hologenomics, The GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
- University Museum, NTNU, Trondheim, Norway
| | - Guojie Zhang
- China National Genebank, BGI-Shenzhen, Shenzhen, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Byrappa Venkatesh
- Institute of Molecular and Cell Biology, A*STAR, Biopolis, Singapore, Singapore
| | - Robert W Murphy
- Centre for Biodiversity, Royal Ontario Museum, Toronto, Ontario, Canada
| | - Klaus-Peter Koepfli
- Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, DC, USA
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Warren E Johnson
- Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, DC, USA
- The Walter Reed Biosystematics Unit, Museum Support Center MRC-534, Smithsonian Institution, Suitland, MD, USA
- Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Federica Di Palma
- Department of Biological Sciences, Earlham Institute, University of East Anglia, Norwich, UK
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Emma C Teeling
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Tandy Warnow
- Department of Computer Science, The University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | | | - Oliver A Ryder
- San Diego Zoo Global, Escondido, CA, USA
- Department of Evolution, Behavior, and Ecology, University of California San Diego, La Jolla, CA, USA
| | - David Haussler
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Stephen J O'Brien
- Laboratory of Genomics Diversity-Center for Computer Technologies, ITMO University, St. Petersburg, Russian Federation
- Guy Harvey Oceanographic Center, Halmos College of Natural Sciences and Oceanography, Nova Southeastern University, Fort Lauderdale, FL, USA
| | | | - Harris A Lewin
- The Genome Center, University of California Davis, Davis, CA, USA
- Department of Evolution and Ecology, University of California Davis, Davis, CA, USA
- John Muir Institute for the Environment, University of California Davis, Davis, CA, USA
| | | | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Center for Systems Biology, Dresden, Germany.
- Faculty of Computer Science, Technical University Dresden, Dresden, Germany.
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge, UK.
- Wellcome Sanger Institute, Cambridge, UK.
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
| | - Erich D Jarvis
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA.
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| |
Collapse
|
15
|
Iannucci A, Makunin AI, Lisachov AP, Ciofi C, Stanyon R, Svartman M, Trifonov VA. Bridging the Gap between Vertebrate Cytogenetics and Genomics with Single-Chromosome Sequencing (ChromSeq). Genes (Basel) 2021; 12:124. [PMID: 33478118 PMCID: PMC7835784 DOI: 10.3390/genes12010124] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/10/2021] [Accepted: 01/15/2021] [Indexed: 01/23/2023] Open
Abstract
The study of vertebrate genome evolution is currently facing a revolution, brought about by next generation sequencing technologies that allow researchers to produce nearly complete and error-free genome assemblies. Novel approaches however do not always provide a direct link with information on vertebrate genome evolution gained from cytogenetic approaches. It is useful to preserve and link cytogenetic data with novel genomic discoveries. Sequencing of DNA from single isolated chromosomes (ChromSeq) is an elegant approach to determine the chromosome content and assign genome assemblies to chromosomes, thus bridging the gap between cytogenetics and genomics. The aim of this paper is to describe how ChromSeq can support the study of vertebrate genome evolution and how it can help link cytogenetic and genomic data. We show key examples of ChromSeq application in the refinement of vertebrate genome assemblies and in the study of vertebrate chromosome and karyotype evolution. We also provide a general overview of the approach and a concrete example of genome refinement using this method in the species Anolis carolinensis.
Collapse
Affiliation(s)
- Alessio Iannucci
- Department of Biology, University of Florence, 50019 Sesto Fiorentino, Italy; (C.C.); (R.S.)
| | - Alexey I. Makunin
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK;
- Institute of Molecular and Cellular Biology SB RAS, 630090 Novosibirsk, Russia;
| | - Artem P. Lisachov
- Institute of Environmental and Agricultural Biology (X-BIO), University of Tyumen, 625003 Tyumen, Russia;
- Institute of Cytology and Genetics SB RAS, 630090 Novosibirsk, Russia
| | - Claudio Ciofi
- Department of Biology, University of Florence, 50019 Sesto Fiorentino, Italy; (C.C.); (R.S.)
| | - Roscoe Stanyon
- Department of Biology, University of Florence, 50019 Sesto Fiorentino, Italy; (C.C.); (R.S.)
| | - Marta Svartman
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil;
| | | |
Collapse
|
16
|
Karyotype Evolution in 10 Pinniped Species: Variability of Heterochromatin versus High Conservatism of Euchromatin as Revealed by Comparative Molecular Cytogenetics. Genes (Basel) 2020; 11:genes11121485. [PMID: 33321928 PMCID: PMC7763226 DOI: 10.3390/genes11121485] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Revised: 12/04/2020] [Accepted: 12/04/2020] [Indexed: 11/19/2022] Open
Abstract
Pinnipedia karyotype evolution was studied here using human, domestic dog, and stone marten whole-chromosome painting probes to obtain comparative chromosome maps among species of Odobenidae (Odobenus rosmarus), Phocidae (Phoca vitulina, Phoca largha, Phoca hispida, Pusa sibirica, Erignathus barbatus), and Otariidae (Eumetopias jubatus, Callorhinus ursinus, Phocarctos hookeri, and Arctocephalus forsteri). Structural and functional chromosomal features were assessed with telomere repeat and ribosomal-DNA probes and by CBG (C-bands revealed by barium hydroxide treatment followed by Giemsa staining) and CDAG (Chromomycin A3-DAPI after G-banding) methods. We demonstrated diversity of heterochromatin among pinniped karyotypes in terms of localization, size, and nucleotide composition. For the first time, an intrachromosomal rearrangement common for Otariidae and Odobenidae was revealed. We postulate that the order of evolutionarily conserved segments in the analyzed pinnipeds is the same as the order proposed for the ancestral Carnivora karyotype (2n = 38). The evolution of conserved genomes of pinnipeds has been accompanied by few fusion events (less than one rearrangement per 10 million years) and by novel intrachromosomal changes including the emergence of new centromeres and pericentric inversion/centromere repositioning. The observed interspecific diversity of pinniped karyotypes driven by constitutive heterochromatin variation likely has played an important role in karyotype evolution of pinnipeds, thereby contributing to the differences of pinnipeds’ chromosome sets.
Collapse
|
17
|
O'Neill RJ. Seq'ing identity and function in a repeat-derived noncoding RNA world. Chromosome Res 2020; 28:111-127. [PMID: 32146545 PMCID: PMC7393779 DOI: 10.1007/s10577-020-09628-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 02/07/2020] [Accepted: 02/14/2020] [Indexed: 01/06/2023]
Abstract
Innovations in high-throughout sequencing approaches are being marshaled to both reveal the composition of the abundant and heterogeneous noncoding RNAs that populate cell nuclei and lend insight to the mechanisms by which noncoding RNAs influence chromosome biology and gene expression. This review focuses on some of the recent technological developments that have enabled the isolation of nascent transcripts and chromatin-associated and DNA-interacting RNAs. Coupled with emerging genome assembly and analytical approaches, the field is poised to achieve a comprehensive catalog of nuclear noncoding RNAs, including those derived from repetitive regions within eukaryotic genomes. Herein, particular attention is paid to the challenges and advances in the sequence analyses of repeat and transposable element-derived noncoding RNAs and in ascribing specific function(s) to such RNAs.
Collapse
Affiliation(s)
- Rachel J O'Neill
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, 06269, USA.
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269, USA.
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT, 06030, USA.
| |
Collapse
|