1
|
Secomandi S, Gallo GR, Rossi R, Rodríguez Fernandes C, Jarvis ED, Bonisoli-Alquati A, Gianfranceschi L, Formenti G. Pangenome graphs and their applications in biodiversity genomics. Nat Genet 2025:10.1038/s41588-024-02029-6. [PMID: 39779953 DOI: 10.1038/s41588-024-02029-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 11/08/2024] [Indexed: 01/11/2025]
Abstract
Complete datasets of genetic variants are key to biodiversity genomic studies. Long-read sequencing technologies allow the routine assembly of highly contiguous, haplotype-resolved reference genomes. However, even when complete, reference genomes from a single individual may bias downstream analyses and fail to adequately represent genetic diversity within a population or species. Pangenome graphs assembled from aligned collections of high-quality genomes can overcome representation bias by integrating sequence information from multiple genomes from the same population, species or genus into a single reference. Here, we review the available tools and data structures to build, visualize and manipulate pangenome graphs while providing practical examples and discussing their applications in biodiversity and conservation genomics across the tree of life.
Collapse
Affiliation(s)
- Simona Secomandi
- Laboratory of Neurogenetics of Language, the Rockefeller University, New York, NY, USA
| | | | - Riccardo Rossi
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
| | - Carlos Rodríguez Fernandes
- Centre for Ecology, Evolution and Environmental Changes (CE3C) and CHANGE, Global Change and Sustainability Institute, Departamento de Biologia Animal, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
- Faculdade de Psicologia, Universidade de Lisboa, Lisboa, Portugal
| | - Erich D Jarvis
- Laboratory of Neurogenetics of Language, the Rockefeller University, New York, NY, USA
- The Vertebrate Genome Laboratory, New York, NY, USA
| | - Andrea Bonisoli-Alquati
- Department of Biological Sciences, California State Polytechnic University, Pomona, Pomona, CA, USA
| | | | | |
Collapse
|
2
|
Orozco-Arias S, Sierra P, Durbin R, González J. MCHelper automatically curates transposable element libraries across eukaryotic species. Genome Res 2024; 34:2256-2268. [PMID: 39653419 PMCID: PMC11694758 DOI: 10.1101/gr.278821.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 09/18/2024] [Indexed: 12/25/2024]
Abstract
The number of species with high-quality genome sequences continues to increase, in part due to the scaling up of multiple large-scale biodiversity sequencing projects. While the need to annotate genic sequences in these genomes is widely acknowledged, the parallel need to annotate transposable element (TE) sequences that have been shown to alter genome architecture, rewire gene regulatory networks, and contribute to the evolution of host traits is becoming ever more evident. However, accurate genome-wide annotation of TE sequences is still technically challenging. Several de novo TE identification tools are now available, but manual curation of the libraries produced by these tools is needed to generate high-quality genome annotations. Manual curation is time-consuming, and thus impractical for large-scale genomic studies, and lacks reproducibility. In this work, we present the Manual Curator Helper tool MCHelper, which automates the TE library curation process. By leveraging MCHelper's fully automated mode with the outputs from three de novo TE identification tools, RepeatModeler2, EDTA, and REPET, in the fruit fly, rice, hooded crow, zebrafish, maize, and human, we show a substantial improvement in the quality of the TE libraries and genome annotations. MCHelper libraries are less redundant, with up to 65% reduction in the number of consensus sequences, have up to 11.4% fewer false positive sequences, and up to ∼48% fewer "unclassified/unknown" TE consensus sequences. Genome-wide TE annotations are also improved, including larger unfragmented insertions. Moreover, MCHelper is an easy-to-install and easy-to-use tool.
Collapse
Affiliation(s)
| | - Pío Sierra
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Josefa González
- Institute of Evolutionary Biology, CSIC, UPF, 08003 Barcelona, Spain;
- Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Spain
| |
Collapse
|
3
|
Mykhailenko A, Zieliński P, Bednarz A, Schlyter F, Andersson MN, Antunes B, Borowski Z, Krokene P, Melin M, Morales-García J, Müller J, Nowak Z, Schebeck M, Stauffer C, Viiri H, Zaborowska J, Babik W, Nadachowska-Brzyska K. Complex Genomic Landscape of Inversion Polymorphism in Europe's Most Destructive Forest Pest. Genome Biol Evol 2024; 16:evae263. [PMID: 39656753 DOI: 10.1093/gbe/evae263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2024] [Revised: 11/29/2024] [Accepted: 12/02/2024] [Indexed: 12/17/2024] Open
Abstract
In many species, polymorphic genomic inversions underlie complex phenotypic polymorphisms and facilitate local adaptation in the face of gene flow. Multiple polymorphic inversions can co-occur in a genome, but the prevalence, evolutionary significance, and limits to complexity of genomic inversion landscapes remain poorly understood. Here, we examine genome-wide genetic variation in one of Europe's most destructive forest pests, the spruce bark beetle Ips typographus, scan for polymorphic inversions, and test whether inversions are associated with key traits in this species. We analyzed 240 individuals from 18 populations across the species' European range and, using a whole-genome resequencing approach, identified 27 polymorphic inversions covering ∼28% of the genome. The inversions vary in size and in levels of intra-inversion recombination, are highly polymorphic across the species range, and often overlap, forming a complex genomic architecture. We found no support for mechanisms such as directional selection, overdominance, and associative overdominance that are often invoked to explain the presence of large inversion polymorphisms in the genome. This suggests that inversions are either neutral or maintained by the combined action of multiple evolutionary forces. We also found that inversions are enriched in odorant receptor genes encoding elements of recognition pathways for host plants, mates, and symbiotic fungi. Our results indicate that the genome of this major forest pest of growing social, political, and economic importance harbors one of the most complex inversion landscapes described to date and raise questions about the limits of intraspecific genomic architecture complexity.
Collapse
Affiliation(s)
- Anastasiia Mykhailenko
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, 30-348 Kraków, Poland
| | - Piotr Zieliński
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | - Aleksandra Bednarz
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | - Fredrik Schlyter
- Chemical Ecology, Department of Plant Protection Biology, Swedish University of Agricultural Sciences Alnarp, 234 22 Lomma, Sweden
- ETM, Faculty of Forestry and Wood Sciences, Czech University of Life Sciences Prague, 165 00 Praha, Czechia
| | | | - Bernardo Antunes
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | - Zbigniew Borowski
- Departament of Forest Ecology, Forest Research Institute, 05-090 Raszyn, Poland
| | - Paal Krokene
- Division of Biotechnology and Plant Health, Norwegian Institute of Bioeconomy Research, 1433 Ås, Norway
| | - Markus Melin
- Forest Health and Bidiversity Group, Natural Resources Institute Finland, 80100 Joensuu, Finland
| | - Julia Morales-García
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, 30-348 Kraków, Poland
| | - Jörg Müller
- Field Station Fabrikschleichach, Animal Ecology and Tropical Biology, Biocenter, University of Würzburg, 96181 Rauhenebrach, Germany
- Bavarian Forest National Park, 94481 Grafenau, Germany
| | - Zuzanna Nowak
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | - Martin Schebeck
- Institute of Forest Entomology, Forest Pathology and Forest Protection, Department of Forest and Soil Sciences, University of Natural Resources and Life Sciences Vienna (BOKU), 1190 Vienna, Austria
| | - Christian Stauffer
- Institute of Forest Entomology, Forest Pathology and Forest Protection, Department of Forest and Soil Sciences, University of Natural Resources and Life Sciences Vienna (BOKU), 1190 Vienna, Austria
| | - Heli Viiri
- UPM Forest, UPM-Kymmene, 33100 Tampere, Finland
| | - Julia Zaborowska
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | - Wiesław Babik
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, 30-387 Kraków, Poland
| | | |
Collapse
|
4
|
Chhina AK, Abhari N, Mooers A, Lewthwaite JMM. Linking the spatial and genomic structure of adaptive potential for conservation management: a review. Genome 2024; 67:403-423. [PMID: 39083766 DOI: 10.1139/gen-2024-0036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/02/2024]
Abstract
We unified the recent literature with the goal to contribute to the discussion on how genetic diversity might best be conserved. We argue that this decision will be guided by how genomic variation is distributed among manageable populations (i.e., its spatial structure), the degree to which adaptive potential is best predicted by variation across the entire genome or the subset of that variation that is identified as putatively adaptive (i.e., its genomic structure), and whether we are managing species as single entities or as collections of diversifying lineages. The distribution of genetic variation and our ultimate goal will have practical implications for on-the-ground management. If adaptive variation is largely polygenic or responsive to change, its spatial structure might be broadly governed by the forces determining genome-wide variation (linked selection, drift, and gene flow), making measurement and prioritization straightforward. If we are managing species as single entities, then population-level prioritization schemes are possible so as to maximize future pooled genetic variation. We outline one such scheme based on the popular Shapley value from cooperative game theory that considers the relative genetic contribution of a population to an unknown future collection of populations.
Collapse
Affiliation(s)
- Avneet K Chhina
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Niloufar Abhari
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
| | - Arne Mooers
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Jayme M M Lewthwaite
- Marine and Environmental Biology, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
5
|
Smolka M, Paulin LF, Grochowski CM, Horner DW, Mahmoud M, Behera S, Kalef-Ezra E, Gandhi M, Hong K, Pehlivan D, Scholz SW, Carvalho CMB, Proukakis C, Sedlazeck FJ. Detection of mosaic and population-level structural variants with Sniffles2. Nat Biotechnol 2024; 42:1571-1580. [PMID: 38168980 PMCID: PMC11217151 DOI: 10.1038/s41587-023-02024-y] [Citation(s) in RCA: 52] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 10/11/2023] [Indexed: 01/05/2024]
Abstract
Calling structural variations (SVs) is technically challenging, but using long reads remains the most accurate way to identify complex genomic alterations. Here we present Sniffles2, which improves over current methods by implementing a repeat aware clustering coupled with a fast consensus sequence and coverage-adaptive filtering. Sniffles2 is 11.8 times faster and 29% more accurate than state-of-the-art SV callers across different coverages (5-50×), sequencing technologies (ONT and HiFi) and SV types. Furthermore, Sniffles2 solves the problem of family-level to population-level SV calling to produce fully genotyped VCF files. Across 11 probands, we accurately identified causative SVs around MECP2, including highly complex alleles with three overlapping SVs. Sniffles2 also enables the detection of mosaic SVs in bulk long-read data. As a result, we identified multiple mosaic SVs in brain tissue from a patient with multiple system atrophy. The identified SV showed a remarkable diversity within the cingulate cortex, impacting both genes involved in neuron function and repetitive elements.
Collapse
Affiliation(s)
- Moritz Smolka
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
| | - Luis F Paulin
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
| | | | - Dominic W Horner
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Queen Square Institute of Neurology, University College London, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Medhat Mahmoud
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Sairam Behera
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
| | - Ester Kalef-Ezra
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Queen Square Institute of Neurology, University College London, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Mira Gandhi
- Pacific Northwest Research Institute (PNRI), Seattle, WA, USA
| | - Karl Hong
- Bionano Genomics, San Diego, CA, USA
| | - Davut Pehlivan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Division of Neurology and Developmental Neuroscience, Department of Pediatrics, Baylor College of Medicine, Houston, TX, USA
| | - Sonja W Scholz
- Neurodegenerative Diseases Research Unit, National Institute of Neurological Disorders and Stroke, Bethesda, MD, USA
- Department of Neurology, Johns Hopkins University Medical Center, Baltimore, MD, USA
| | - Claudia M B Carvalho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Pacific Northwest Research Institute (PNRI), Seattle, WA, USA
| | - Christos Proukakis
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Queen Square Institute of Neurology, University College London, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA.
- Department of Computer Science, Rice University, Houston, TX, USA.
| |
Collapse
|
6
|
Romero FG, Beaudry FEG, Hovmand Warner E, Nguyen TN, Fitzpatrick JW, Chen N. A new high-quality genome assembly and annotation for the threatened Florida Scrub-Jay (Aphelocoma coerulescens). G3 (BETHESDA, MD.) 2024; 14:jkae232. [PMID: 39328063 PMCID: PMC11631490 DOI: 10.1093/g3journal/jkae232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 08/20/2024] [Accepted: 09/25/2024] [Indexed: 09/28/2024]
Abstract
The Florida Scrub-Jay (Aphelocoma coerulescens), a Federally Threatened, cooperatively-breeding bird, is an emerging model system in evolutionary biology and ecology. Extensive individual-based monitoring and genetic sampling for decades has yielded a wealth of data, allowing for the detailed study of social behavior, demography, and population genetics of this natural population. Here, we report a linkage map and a chromosome-level genome assembly and annotation for a female Florida Scrub-Jay made with long-read sequencing technology, chromatin conformation data, and the linkage map. We constructed a linkage map comprising 4,468 SNPs that had 34 linkage groups and a total sex-averaged autosomal genetic map length of 2446.78 cM. The new genome assembly is 1.33 Gb in length, consisting of 33 complete or near-complete autosomes and the sex chromosomes (ZW). This highly contiguous assembly has an NG50 of 68 Mb and a Benchmarking Universal Single-Copy Orthologs (BUSCO) completeness score of 97.1% with respect to the Aves database. The annotated gene set has a BUSCO transcriptome completeness score of 95.5% and 17,964 identified protein-coding genes, 92.5% of which have associated functional annotations. This new, high-quality genome assembly and linkage map of the Florida Scrub-Jay provides valuable tools for future research into the evolutionary dynamics of small, natural populations of conservation concern.
Collapse
Affiliation(s)
- Faye G Romero
- Department of Biology, University of Rochester, Rochester, NY 14620, USA
| | - Felix E G Beaudry
- Department of Biology, University of Rochester, Rochester, NY 14620, USA
- Clinical Translation, Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada
| | | | - Tram N Nguyen
- Department of Ecology & Evolutionary Biology, Cornell University, Ithaca, NY 14853, USA
| | - John W Fitzpatrick
- Department of Ecology & Evolutionary Biology, Cornell University, Ithaca, NY 14853, USA
- Cornell Lab of Ornithology, Cornell University, Ithaca, NY 14850, USA
| | - Nancy Chen
- Department of Biology, University of Rochester, Rochester, NY 14620, USA
| |
Collapse
|
7
|
Canesin LEC, Vilaça ST, Oliveira RRM, Al-Ajli F, Tracey A, Sims Y, Formenti G, Fedrigo O, Banhos A, Sanaiotti TM, Farias IP, Jarvis ED, Oliveira G, Hrbek T, Solferini V, Aleixo A. A reference genome for the Harpy Eagle reveals steady demographic decline and chromosomal rearrangements in the origin of Accipitriformes. Sci Rep 2024; 14:19925. [PMID: 39261501 PMCID: PMC11390914 DOI: 10.1038/s41598-024-70305-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 08/14/2024] [Indexed: 09/13/2024] Open
Abstract
The Harpy Eagle (Harpia harpyja) is an iconic species that inhabits forested landscapes in Neotropical regions, with decreasing population trends mainly due to habitat loss, and currently classified as vulnerable. Here, we report on a chromosome-scale genome assembly for a female individual combining long reads, optical mapping, and chromatin conformation capture reads. The final assembly spans 1.35 Gb, with N50scaffold equal to 58.1 Mb and BUSCO completeness of 99.7%. We built the first extensive transposable element (TE) library for the Accipitridae to date and identified 7,228 intact TEs. We found a burst of an unknown TE ~ 13-22 million years ago (MYA), coincident with the split of the Harpy Eagle from other Harpiinae eagles. We also report a burst of solo-LTRs and CR1 retrotransposons ~ 31-33 MYA, overlapping with the split of the ancestor to all Harpiinae from other Accipitridae subfamilies. Comparative genomics with other Accipitridae, the closely related Cathartidae and Galloanserae revealed major chromosome-level rearrangements at the basal Accipitriformes genome, in contrast to a conserved ancient genome architecture for the latter two groups. A historical demography reconstruction showed a rapid decline in effective population size over the last 20,000 years. This reference genome serves as a crucial resource for future conservation efforts towards the Harpy Eagle.
Collapse
Affiliation(s)
| | - Sibelle T Vilaça
- Instituto Tecnológico Vale - Desenvolvimento Sustentável (ITV-DS), Belém, Brazil
| | - Renato R M Oliveira
- Instituto Tecnológico Vale - Desenvolvimento Sustentável (ITV-DS), Belém, Brazil
| | - Farooq Al-Ajli
- Rockefeller University, New York, USA
- Katara Biodiversity Genomics Program, Katara Cultural Village Foundation, Doha, Qatar
| | | | - Ying Sims
- Rockefeller University, New York, USA
| | | | | | - Aureo Banhos
- Universidade Federal do Espírito Santo (UFES), Alegre, Brazil
| | | | | | - Erich D Jarvis
- Rockefeller University, New York, USA
- Howard Hughes Medical Institute (HHMI), New York, USA
| | - Guilherme Oliveira
- Instituto Tecnológico Vale - Desenvolvimento Sustentável (ITV-DS), Belém, Brazil
| | - Tomas Hrbek
- Universidade Federal do Amazonas (UFAM), Manaus, Brazil
- Trinity University, San Antonio, USA
| | - Vera Solferini
- Universidade Estadual de Campinas (Unicamp), Campinas, Brazil
| | - Alexandre Aleixo
- Instituto Tecnológico Vale - Desenvolvimento Sustentável (ITV-DS), Belém, Brazil.
| |
Collapse
|
8
|
Shogren EH, Sardell JM, Muirhead CA, Martí E, Cooper EA, Moyle RG, Presgraves DC, Uy JAC. Recent secondary contact, genome-wide admixture, and asymmetric introgression of neo-sex chromosomes between two Pacific island bird species. PLoS Genet 2024; 20:e1011360. [PMID: 39172766 PMCID: PMC11340901 DOI: 10.1371/journal.pgen.1011360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 06/28/2024] [Indexed: 08/24/2024] Open
Abstract
Secondary contact between closely related taxa represents a "moment of truth" for speciation-an opportunity to test the efficacy of reproductive isolation that evolved in allopatry and to identify the genetic, behavioral, and/or ecological barriers that separate species in sympatry. Sex chromosomes are known to rapidly accumulate differences between species, an effect that may be exacerbated for neo-sex chromosomes that are transitioning from autosomal to sex-specific inheritance. Here we report that, in the Solomon Islands, two closely related bird species in the honeyeater family-Myzomela cardinalis and Myzomela tristrami-carry neo-sex chromosomes and have come into recent secondary contact after ~1.1 my of geographic isolation. Hybrids of the two species were first observed in sympatry ~100 years ago. To determine the genetic consequences of hybridization, we use population genomic analyses of individuals sampled in allopatry and in sympatry to characterize gene flow in the contact zone. Using genome-wide estimates of diversity, differentiation, and divergence, we find that the degree and direction of introgression varies dramatically across the genome. For sympatric birds, autosomal introgression is bidirectional, with phenotypic hybrids and phenotypic parentals of both species showing admixed ancestry. In other regions of the genome, however, the story is different. While introgression on the Z/neo-Z-linked sequence is limited, introgression of W/neo-W regions and mitochondrial sequence (mtDNA) is highly asymmetric, moving only from the invading M. cardinalis to the resident M. tristrami. The recent hybridization between these species has thus enabled gene flow in some genomic regions but the interaction of admixture, asymmetric mate choice, and/or natural selection has led to the variation in the amount and direction of gene flow at sex-linked regions of the genome.
Collapse
Affiliation(s)
- Elsie H. Shogren
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Jason M. Sardell
- PrecisionLife Ltd, Hanborough Business Park, Long Hanborough, Witney, Oxon, United Kingdom
| | - Christina A. Muirhead
- Department of Biology, University of Rochester, Rochester, New York, United States of America
- The Ronin Institute, Montclair, New Jersey, United States of America
| | - Emiliano Martí
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Elizabeth A. Cooper
- Department of Bioinformatics & Genomics, University of North Carolina, Charlotte, North Carolina, United States of America
| | - Robert G. Moyle
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, United States of America
| | - Daven C. Presgraves
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - J. Albert C. Uy
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| |
Collapse
|
9
|
Pan C, Reinert K. Leaf: an ultrafast filter for population-scale long-read SV detection. Genome Biol 2024; 25:155. [PMID: 38872200 PMCID: PMC11170821 DOI: 10.1186/s13059-024-03297-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 06/04/2024] [Indexed: 06/15/2024] Open
Abstract
Advances in sequencing technology have facilitated population-scale long-read structural variant (SV) detection. Arguably, one of the main challenges in population-scale analysis is developing effective computational pipelines. Here, we present a new filter-based pipeline for population-scale long-read SV detection. It better captures SV signals at an early stage than conventional assembly-based or alignment-based pipelines. Assessments in this work suggest that the filter-based pipeline helps better resolve intra-read rearrangements. Moreover, it is also more computationally efficient than conventional pipelines and thus may facilitate population-scale long-read applications.
Collapse
Affiliation(s)
- Chenxu Pan
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, 14195, Berlin, Germany.
| | - Knut Reinert
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, 14195, Berlin, Germany
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, 14195, Germany
| |
Collapse
|
10
|
Recuerda M, Campagna L. How structural variants shape avian phenotypes: Lessons from model systems. Mol Ecol 2024; 33:e17364. [PMID: 38651830 DOI: 10.1111/mec.17364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 04/04/2024] [Accepted: 04/09/2024] [Indexed: 04/25/2024]
Abstract
Despite receiving significant recent attention, the relevance of structural variation (SV) in driving phenotypic diversity remains understudied, although recent advances in long-read sequencing, bioinformatics and pangenomic approaches have enhanced SV detection. We review the role of SVs in shaping phenotypes in avian model systems, and identify some general patterns in SV type, length and their associated traits. We found that most of the avian SVs so far identified are short indels in chickens, which are frequently associated with changes in body weight and plumage colouration. Overall, we found that relatively short SVs are more frequently detected, likely due to a combination of their prevalence compared to large SVs, and a detection bias, stemming primarily from the widespread use of short-read sequencing and associated analytical methods. SVs most commonly involve non-coding regions, especially introns, and when patterns of inheritance were reported, SVs associated primarily with dominant discrete traits. We summarise several examples of phenotypic convergence across different species, mediated by different SVs in the same or different genes and different types of changes in the same gene that can lead to various phenotypes. Complex rearrangements and supergenes, which can simultaneously affect and link several genes, tend to have pleiotropic phenotypic effects. Additionally, SVs commonly co-occur with single-nucleotide polymorphisms, highlighting the need to consider all types of genetic changes to understand the basis of phenotypic traits. We end by summarising expectations for when long-read technologies become commonly implemented in non-model birds, likely leading to an increase in SV discovery and characterisation. The growing interest in this subject suggests an increase in our understanding of the phenotypic effects of SVs in upcoming years.
Collapse
Affiliation(s)
- María Recuerda
- Fuller Evolutionary Biology Program, Cornell Lab of Ornithology, Ithaca, New York, USA
| | - Leonardo Campagna
- Fuller Evolutionary Biology Program, Cornell Lab of Ornithology, Ithaca, New York, USA
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York, USA
| |
Collapse
|
11
|
Boman J, Qvarnström A, Mugal CF. Regulatory and evolutionary impact of DNA methylation in two songbird species and their naturally occurring F 1 hybrids. BMC Biol 2024; 22:124. [PMID: 38807214 PMCID: PMC11134931 DOI: 10.1186/s12915-024-01920-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 05/15/2024] [Indexed: 05/30/2024] Open
Abstract
BACKGROUND Regulation of transcription by DNA methylation in 5'-CpG-3' context is a widespread mechanism allowing differential expression of genetically identical cells to persist throughout development. Consequently, differences in DNA methylation can reinforce variation in gene expression among cells, tissues, populations, and species. Despite a surge in studies on DNA methylation, we know little about the importance of DNA methylation in population differentiation and speciation. Here we investigate the regulatory and evolutionary impact of DNA methylation in five tissues of two Ficedula flycatcher species and their naturally occurring F1 hybrids. RESULTS We show that the density of CpG in the promoters of genes determines the strength of the association between DNA methylation and gene expression. The impact of DNA methylation on gene expression varies among tissues with the brain showing unique patterns. Differentially expressed genes between parental species are predicted by genetic and methylation differentiation in CpG-rich promoters. However, both these factors fail to predict hybrid misexpression suggesting that promoter mismethylation is not a main determinant of hybrid misexpression in Ficedula flycatchers. Using allele-specific methylation estimates in hybrids, we also determine the genome-wide contribution of cis- and trans effects in DNA methylation differentiation. These distinct mechanisms are roughly balanced in all tissues except the brain, where trans differences predominate. CONCLUSIONS Overall, this study provides insight on the regulatory and evolutionary impact of DNA methylation in songbirds.
Collapse
Affiliation(s)
- Jesper Boman
- Department of Ecology and Genetics (IEG), Division of Evolutionary Biology, Uppsala University, Norbyvägen 18D, Uppsala, SE-752 36, Sweden.
| | - Anna Qvarnström
- Department of Ecology and Genetics (IEG), Division of Animal Ecology, Uppsala University, Norbyvägen 18D, Uppsala, SE-752 36, Sweden
| | - Carina F Mugal
- Department of Ecology and Genetics (IEG), Division of Evolutionary Biology, Uppsala University, Norbyvägen 18D, Uppsala, SE-752 36, Sweden.
- CNRS, Laboratory of Biometry and Evolutionary Biology (LBBE), UMR 5558, University of Lyon 1, Villeurbanne, France.
| |
Collapse
|
12
|
Hu H, Gao R, Gao W, Gao B, Jiang Z, Zhou M, Wang G, Jiang T. SVDF: enhancing structural variation detect from long-read sequencing via automatic filtering strategies. Brief Bioinform 2024; 25:bbae336. [PMID: 38980375 PMCID: PMC11232458 DOI: 10.1093/bib/bbae336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 06/03/2024] [Accepted: 06/27/2024] [Indexed: 07/10/2024] Open
Abstract
Structural variation (SV) is an important form of genomic variation that influences gene function and expression by altering the structure of the genome. Although long-read data have been proven to better characterize SVs, SVs detected from noisy long-read data still include a considerable portion of false-positive calls. To accurately detect SVs in long-read data, we present SVDF, a method that employs a learning-based noise filtering strategy and an SV signature-adaptive clustering algorithm, for effectively reducing the likelihood of false-positive events. Benchmarking results from multiple orthogonal experiments demonstrate that, across different sequencing platforms and depths, SVDF achieves higher calling accuracy for each sample compared to several existing general SV calling tools. We believe that, with its meticulous and sensitive SV detection capability, SVDF can bring new opportunities and advancements to cutting-edge genomic research.
Collapse
Affiliation(s)
- Heng Hu
- College of Life Sciences, Northeast Forestry University, Harbin 150000, China
| | - Runtian Gao
- College of Life Sciences, Northeast Forestry University, Harbin 150000, China
| | - Wentao Gao
- College of Life Sciences, Northeast Forestry University, Harbin 150000, China
| | - Bo Gao
- Department of Radiology, The Second Affiliated Hospital of Harbin Medical University, Harbin 150000, China
| | - Zhongjun Jiang
- College of Life Sciences, Northeast Forestry University, Harbin 150000, China
| | - Murong Zhou
- College of Life Sciences, Northeast Forestry University, Harbin 150000, China
| | - Guohua Wang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150000, China
- State Key Laboratory of Tree Genetics and Breeding, Harbin 150000, China
| | - Tao Jiang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150000, China
| |
Collapse
|
13
|
Ferguson S, Jones A, Murray K, Andrew R, Schwessinger B, Borevitz J. Plant genome evolution in the genus Eucalyptus is driven by structural rearrangements that promote sequence divergence. Genome Res 2024; 34:606-619. [PMID: 38589251 PMCID: PMC11146599 DOI: 10.1101/gr.277999.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 03/22/2024] [Indexed: 04/10/2024]
Abstract
Genomes have a highly organized architecture (nonrandom organization of functional and nonfunctional genetic elements within chromosomes) that is essential for many biological functions, particularly gene expression and reproduction. Despite the need to conserve genome architecture, a high level of structural variation has been observed within species. As species separate and diverge, genome architecture also diverges, becoming increasingly poorly conserved as divergence time increases. However, within plant genomes, the processes of genome architecture divergence are not well described. Here we use long-read sequencing and de novo assembly of 33 phylogenetically diverse, wild and naturally evolving Eucalyptus species, covering 1-50 million years of diverging genome evolution to measure genome architectural conservation and describe architectural divergence. The investigation of these genomes revealed that following lineage divergence, genome architecture is highly fragmented by rearrangements. As genomes continue to diverge, the accumulation of mutations and the subsequent divergence beyond recognition of rearrangements become the primary driver of genome divergence. The loss of syntenic regions also contribute to genome divergence but at a slower pace than that of rearrangements. We hypothesize that duplications and translocations are potentially the greatest contributors to Eucalyptus genome divergence.
Collapse
Affiliation(s)
- Scott Ferguson
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, 2601, Australia;
| | - Ashley Jones
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, 2601, Australia;
| | - Kevin Murray
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, 2601, Australia
- Weigel Department, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Rose Andrew
- Botany & N.C.W. Beadle Herbarium, School of Environmental and Rural Science, University of New England, Armidale, New South Wales 2351, Australia
| | - Benjamin Schwessinger
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, 2601, Australia
| | - Justin Borevitz
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, 2601, Australia
| |
Collapse
|
14
|
Zheng W, Gojobori J, Suh A, Satta Y. Different Host-Endogenous Retrovirus Relationships between Mammals and Birds Reflected in Genome-Wide Evolutionary Interaction Patterns. Genome Biol Evol 2024; 16:evae065. [PMID: 38527852 PMCID: PMC11005779 DOI: 10.1093/gbe/evae065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 02/25/2024] [Accepted: 03/21/2024] [Indexed: 03/27/2024] Open
Abstract
Mammals and birds differ largely in their average endogenous retrovirus loads, namely the proportion of endogenous retrovirus in the genome. The host-endogenous retrovirus relationships, including conflict and co-option, have been hypothesized among the causes of this difference. However, there has not been studies about the genomic evolutionary signal of constant host-endogenous retrovirus interactions in a long-term scale and how such interactions could lead to the endogenous retrovirus load difference. Through a phylogeny-controlled correlation analysis on ∼5,000 genes between the dN/dS ratio of each gene and the load of endogenous retrovirus in 12 mammals and 21 birds, separately, we detected genes that may have evolved in association with endogenous retrovirus loads. Birds have a higher proportion of genes with strong correlation between dN/dS and the endogenous retrovirus load than mammals. Strong evidence of association is found between the dN/dS of the coding gene for leucine-rich repeat-containing protein 23 and endogenous retrovirus load in birds. Gene set enrichment analysis shows that gene silencing rather than immunity and DNA recombination may have a larger contribution to the association between dN/dS and the endogenous retrovirus load for both mammals and birds. The above results together showing different evolutionary patterns between bird and mammal genes can partially explain the apparently lower endogenous retrovirus loads of birds, while gene silencing may be a universal mechanism that plays a remarkable role in the evolutionary interaction between the host and endogenous retrovirus. In summary, our study presents signals that the host genes might have driven or responded to endogenous retrovirus load changes in long-term evolution.
Collapse
Affiliation(s)
- Wanjing Zheng
- Department of Evolutionary Studies of Biosystems, School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Kanagawa 240-0193, Japan
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Jun Gojobori
- Department of Evolutionary Studies of Biosystems, School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Kanagawa 240-0193, Japan
- Research Center for Integrative Evolutionary Science, SOKENDAI (The Graduate University for Advanced Studies), Kanagawa 240-0193, Japan
| | - Alexander Suh
- Department of Organismal Biology—Systematic Biology, Evolutionary Biology Centre (EBC), Uppsala University, Uppsala 75236, Sweden
- School of Biological Sciences—Organisms and the Environment, University of East Anglia, Norwich, UK
| | - Yoko Satta
- Department of Evolutionary Studies of Biosystems, School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Kanagawa 240-0193, Japan
- Research Center for Integrative Evolutionary Science, SOKENDAI (The Graduate University for Advanced Studies), Kanagawa 240-0193, Japan
| |
Collapse
|
15
|
David G, Bertolotti A, Layer R, Scofield D, Hayward A, Baril T, Burnett HA, Gudmunds E, Jensen H, Husby A. Calling Structural Variants with Confidence from Short-Read Data in Wild Bird Populations. Genome Biol Evol 2024; 16:evae049. [PMID: 38489588 PMCID: PMC11018544 DOI: 10.1093/gbe/evae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/28/2024] [Accepted: 03/07/2024] [Indexed: 03/17/2024] Open
Abstract
Comprehensive characterization of structural variation in natural populations has only become feasible in the last decade. To investigate the population genomic nature of structural variation, reproducible and high-confidence structural variation callsets are first required. We created a population-scale reference of the genome-wide landscape of structural variation across 33 Nordic house sparrows (Passer domesticus). To produce a consensus callset across all samples using short-read data, we compare heuristic-based quality filtering and visual curation (Samplot/PlotCritic and Samplot-ML) approaches. We demonstrate that curation of structural variants is important for reducing putative false positives and that the time invested in this step outweighs the potential costs of analyzing short-read-discovered structural variation data sets that include many potential false positives. We find that even a lenient manual curation strategy (e.g. applied by a single curator) can reduce the proportion of putative false positives by up to 80%, thus enriching the proportion of high-confidence variants. Crucially, in applying a lenient manual curation strategy with a single curator, nearly all (>99%) variants rejected as putative false positives were also classified as such by a more stringent curation strategy using three additional curators. Furthermore, variants rejected by manual curation failed to reflect the expected population structure from SNPs, whereas variants passing curation did. Combining heuristic-based quality filtering with rapid manual curation of structural variants in short-read data can therefore become a time- and cost-effective first step for functional and population genomic studies requiring high-confidence structural variation callsets.
Collapse
Affiliation(s)
- Gabriel David
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | | | - Ryan Layer
- BioFrontiers Institute, University of Colorado, Boulder, CO, USA
- Department of Computer Science, University of Colorado, Boulder, CO, USA
| | - Douglas Scofield
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, UK
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, UK
| | - Hamish A Burnett
- Centre for Biodiversity Dynamics, Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
| | - Erik Gudmunds
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Henrik Jensen
- Centre for Biodiversity Dynamics, Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
| | - Arild Husby
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| |
Collapse
|
16
|
Carpinteyro-Ponce J, Machado CA. The Complex Landscape of Structural Divergence Between the Drosophila pseudoobscura and D. persimilis Genomes. Genome Biol Evol 2024; 16:evae047. [PMID: 38482945 PMCID: PMC10980976 DOI: 10.1093/gbe/evae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/07/2024] [Indexed: 04/01/2024] Open
Abstract
Structural genomic variants are key drivers of phenotypic evolution. They can span hundreds to millions of base pairs and can thus affect large numbers of genetic elements. Although structural variation is quite common within and between species, its characterization depends upon the quality of genome assemblies and the proportion of repetitive elements. Using new high-quality genome assemblies, we report a complex and previously hidden landscape of structural divergence between the genomes of Drosophila persimilis and D. pseudoobscura, two classic species in speciation research, and study the relationships among structural variants, transposable elements, and gene expression divergence. The new assemblies confirm the already known fixed inversion differences between these species. Consistent with previous studies showing higher levels of nucleotide divergence between fixed inversions relative to collinear regions of the genome, we also find a significant overrepresentation of INDELs inside the inversions. We find that transposable elements accumulate in regions with low levels of recombination, and spatial correlation analyses reveal a strong association between transposable elements and structural variants. We also report a strong association between differentially expressed (DE) genes and structural variants and an overrepresentation of DE genes inside the fixed chromosomal inversions that separate this species pair. Interestingly, species-specific structural variants are overrepresented in DE genes involved in neural development, spermatogenesis, and oocyte-to-embryo transition. Overall, our results highlight the association of transposable elements with structural variants and their importance in driving evolutionary divergence.
Collapse
Affiliation(s)
| | - Carlos A Machado
- Department of Biology, University of Maryland, College Park, MD, USA
| |
Collapse
|
17
|
Hooper R, Maher K, Moore K, McIvor G, Hosken D, Thornton A. Ultimate drivers of forced extra-pair copulations in birds lacking a penis: jackdaws as a case-study. ROYAL SOCIETY OPEN SCIENCE 2024; 11:231226. [PMID: 38545615 PMCID: PMC10966391 DOI: 10.1098/rsos.231226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 12/14/2023] [Accepted: 02/01/2024] [Indexed: 04/26/2024]
Abstract
Forced copulation is common, presumably because it can increase male reproductive success. Forced extra-pair copulation (FEPC) occurs in birds, even though most species lack a penis and are widely thought to require female cooperation for fertilization. How FEPC persists, despite a presumed lack of siring success and likely non-negligible costs to the male, is unknown. Using the jackdaw (Corvus monedula) as a case study, we use SNPs to quantify the extra-pair paternity rate through FEPC and evaluate explanations for the persistence of FEPC in species without a penis. We then collate evidence for FEPC across penis-lacking birds. Combining genetic and behavioural analyses, our study suggests that the most likely explanations for the maintenance of FEPC in jackdaws are that it provides a selective advantage to males or it is a relic. Our literature review shows that across birds lacking a penis, FEPC is taxonomically widespread, and yet, little is known about its evolution. A broader implementation of the approach used here, combining both genetic and behavioural data, may shed light on why this widespread sexual behaviour persists. Additional work is necessary to understand whether a penis is needed for paternity through forced copulation and to quantify the costs of FEPC.
Collapse
Affiliation(s)
- Rebecca Hooper
- Centre for Research in Animal Behaviour, University of Exeter, Exeter, UK
| | - Kathryn Maher
- NERC Environmental Omics Facility, School of Biosciences, The University of Sheffield, Sheffield, UK
| | - Karen Moore
- Faculty of Health and Life Sciences, University of Exeter, Exeter, UK
| | - Guillam McIvor
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, UK
| | - David Hosken
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, UK
| | - Alex Thornton
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, UK
| |
Collapse
|
18
|
Wang YL, Li L, Paudel BR, Zhao JL. Genomic Insights into High-Altitude Adaptation: A Comparative Analysis of Roscoea alpina and R. purpurea in the Himalayas. Int J Mol Sci 2024; 25:2265. [PMID: 38396942 PMCID: PMC10889555 DOI: 10.3390/ijms25042265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 02/05/2024] [Accepted: 02/09/2024] [Indexed: 02/25/2024] Open
Abstract
Environmental stress at high altitudes drives the development of distinct adaptive mechanisms in plants. However, studies exploring the genetic adaptive mechanisms of high-altitude plant species are scarce. In the present study, we explored the high-altitude adaptive mechanisms of plants in the Himalayas through whole-genome resequencing. We studied two widespread members of the Himalayan endemic alpine genus Roscoea (Zingiberaceae): R. alpina (a selfing species) and R. purpurea (an outcrossing species). These species are distributed widely in the Himalayas with distinct non-overlapping altitude distributions; R. alpina is distributed at higher elevations, and R. purpurea occurs at lower elevations. Compared to R. purpurea, R. alpina exhibited higher levels of linkage disequilibrium, Tajima's D, and inbreeding coefficient, as well as lower recombination rates and genetic diversity. Approximately 96.3% of the genes in the reference genome underwent significant genetic divergence (FST ≥ 0.25). We reported 58 completely divergent genes (FST = 1), of which only 17 genes were annotated with specific functions. The functions of these genes were primarily related to adapting to the specific characteristics of high-altitude environments. Our findings provide novel insights into how evolutionary innovations promote the adaptation of mountain alpine species to high altitudes and harsh habitats.
Collapse
Affiliation(s)
- Ya-Li Wang
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming 650504, China; (Y.-L.W.); (L.L.)
| | - Li Li
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming 650504, China; (Y.-L.W.); (L.L.)
| | - Babu Ram Paudel
- Research Centre for Applied Science and Technology, Tribhuvan University, Kirtipur 44613, Nepal
| | - Jian-Li Zhao
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming 650504, China; (Y.-L.W.); (L.L.)
| |
Collapse
|
19
|
Delmore K, Justen H, Kay KM, Kitano J, Moyle LC, Stelkens R, Streisfeld MA, Yamasaki YY, Ross J. Genomic Approaches Are Improving Taxonomic Representation in Genetic Studies of Speciation. Cold Spring Harb Perspect Biol 2024; 16:a041438. [PMID: 37848243 PMCID: PMC10835617 DOI: 10.1101/cshperspect.a041438] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2023]
Abstract
Until recently, our understanding of the genetics of speciation was limited to a narrow group of model species with a specific set of characteristics that made genetic analysis feasible. Rapidly advancing genomic technologies are eliminating many of the distinctions between laboratory and natural systems. In light of these genomic developments, we review the history of speciation genetics, advances that have been gleaned from model and non-model organisms, the current state of the field, and prospects for broadening the diversity of taxa included in future studies. Responses to a survey of speciation scientists across the world reveal the ongoing division between the types of questions that are addressed in model and non-model organisms. To bridge this gap, we suggest integrating genetic studies from model systems that can be reared in the laboratory or greenhouse with genomic studies in related non-models where extensive ecological knowledge exists.
Collapse
Affiliation(s)
- Kira Delmore
- Department of Biology, Texas A&M University, College Station, Texas 77843, USA
| | - Hannah Justen
- Department of Biology, Texas A&M University, College Station, Texas 77843, USA
| | - Kathleen M Kay
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, California 95060, USA
| | - Jun Kitano
- Ecological Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Leonie C Moyle
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Rike Stelkens
- Division of Population Genetics, Department of Zoology, Stockholm University, 106 91 Stockholm, Sweden
| | - Matthew A Streisfeld
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon 97403, USA
| | - Yo Y Yamasaki
- Ecological Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Joseph Ross
- Department of Biology, California State University, Fresno, California 93740, USA
| |
Collapse
|
20
|
Delmore KE, Van Doren BM, Ullrich K, Curk T, van der Jeugd HP, Liedvogel M. Structural genomic variation and migratory behavior in a wild songbird. Evol Lett 2023; 7:401-412. [PMID: 38045725 PMCID: PMC10693001 DOI: 10.1093/evlett/qrad040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 07/31/2023] [Accepted: 09/01/2023] [Indexed: 12/05/2023] Open
Abstract
Structural variants (SVs) are a major source of genetic variation; and descriptions in natural populations and connections with phenotypic traits are beginning to accumulate in the literature. We integrated advances in genomic sequencing and animal tracking to begin filling this knowledge gap in the Eurasian blackcap. Specifically, we (a) characterized the genome-wide distribution, frequency, and overall fitness effects of SVs using haplotype-resolved assemblies for 79 birds, and (b) used these SVs to study the genetics of seasonal migration. We detected >15 K SVs. Many SVs overlapped repetitive regions and exhibited evidence of purifying selection suggesting they have overall deleterious effects on fitness. We used estimates of genomic differentiation to identify SVs exhibiting evidence of selection in blackcaps with different migratory strategies. Insertions and deletions dominated the SVs we identified and were associated with genes that are either directly (e.g., regulatory motifs that maintain circadian rhythms) or indirectly (e.g., through immune response) related to migration. We also broke migration down into individual traits (direction, distance, and timing) using existing tracking data and tested if genetic variation at the SVs we identified could account for phenotypic variation at these traits. This was only the case for 1 trait-direction-and 1 specific SV (a deletion on chromosome 27) accounted for much of this variation. Our results highlight the evolutionary importance of SVs in natural populations and provide insight into the genetic basis of seasonal migration.
Collapse
Affiliation(s)
- Kira E Delmore
- MPRG Behavioural Genomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Department of Biology, Texas A&M University, 3528 TAMU, College Station, TX, United States
| | - Benjamin M Van Doren
- MPRG Behavioural Genomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Department of Zoology, Edward Grey Institute, University of Oxford, Oxford, United Kingdom
- Cornell Lab of Ornithology, Cornell University, Ithaca, NY, United States
| | - Kristian Ullrich
- MPRG Behavioural Genomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Teja Curk
- Vogeltrekstation—Dutch Centre for Avian Migration and Demography, Netherlands Institute of Ecology (NIOO-KNAW), 6700 AB Wageningen, The Netherlands
- Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany
| | - Henk P van der Jeugd
- Vogeltrekstation—Dutch Centre for Avian Migration and Demography, Netherlands Institute of Ecology (NIOO-KNAW), 6700 AB Wageningen, The Netherlands
| | - Miriam Liedvogel
- MPRG Behavioural Genomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Institute of Avian Research “Vogelwarte Helgoland,”Wilhelmshaven, Germany
| |
Collapse
|
21
|
Dallaire X, Bouchard R, Hénault P, Ulmo-Diaz G, Normandeau E, Mérot C, Bernatchez L, Moore JS. Widespread Deviant Patterns of Heterozygosity in Whole-Genome Sequencing Due to Autopolyploidy, Repeated Elements, and Duplication. Genome Biol Evol 2023; 15:evad229. [PMID: 38085037 PMCID: PMC10752349 DOI: 10.1093/gbe/evad229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/30/2023] [Indexed: 12/28/2023] Open
Abstract
Most population genomic tools rely on accurate single nucleotide polymorphism (SNP) calling and filtering to meet their underlying assumptions. However, genomic complexity, resulting from structural variants, paralogous sequences, and repetitive elements, presents significant challenges in assembling contiguous reference genomes. Consequently, short-read resequencing studies can encounter mismapping issues, leading to SNPs that deviate from Mendelian expected patterns of heterozygosity and allelic ratio. In this study, we employed the ngsParalog software to identify such deviant SNPs in whole-genome sequencing (WGS) data with low (1.5×) to intermediate (4.8×) coverage for four species: Arctic Char (Salvelinus alpinus), Lake Whitefish (Coregonus clupeaformis), Atlantic Salmon (Salmo salar), and the American Eel (Anguilla rostrata). The analyses revealed that deviant SNPs accounted for 22% to 62% of all SNPs in salmonid datasets and approximately 11% in the American Eel dataset. These deviant SNPs were particularly concentrated within repetitive elements and genomic regions that had recently undergone rediploidization in salmonids. Additionally, narrow peaks of elevated coverage were ubiquitous along all four reference genomes, encompassed most deviant SNPs, and could be partially associated with transposons and tandem repeats. Including these deviant SNPs in genomic analyses led to highly distorted site frequency spectra, underestimated pairwise FST values, and overestimated nucleotide diversity. Considering the widespread occurrence of deviant SNPs arising from a variety of sources, their important impact in estimating population parameters, and the availability of effective tools to identify them, we propose that excluding deviant SNPs from WGS datasets is required to improve genomic inferences for a wide range of taxa and sequencing depths.
Collapse
Affiliation(s)
- Xavier Dallaire
- Institut de biologie intégrative et des systèmes, Université Laval, Québec, Canada
- Centre d'Études Nordiques, Université Laval, Québec, Canada
| | - Raphael Bouchard
- Institut de biologie intégrative et des systèmes, Université Laval, Québec, Canada
- Ressources Aquatique Québec, Université de Rimouski, Rimouski, Canada
| | - Philippe Hénault
- Institut de biologie intégrative et des systèmes, Université Laval, Québec, Canada
- Ressources Aquatique Québec, Université de Rimouski, Rimouski, Canada
| | - Gabriela Ulmo-Diaz
- Institut de biologie intégrative et des systèmes, Université Laval, Québec, Canada
- Ressources Aquatique Québec, Université de Rimouski, Rimouski, Canada
| | - Eric Normandeau
- Institut de biologie intégrative et des systèmes, Université Laval, Québec, Canada
- Ressources Aquatique Québec, Université de Rimouski, Rimouski, Canada
- Plateforme de bio-informatique de l’IBIS, Université Laval, Québec, Canada
| | - Claire Mérot
- CNRS, UMR 6553 ECOBIO, Université de Rennes, Rennes, France
| | - Louis Bernatchez
- Institut de biologie intégrative et des systèmes, Université Laval, Québec, Canada
- Ressources Aquatique Québec, Université de Rimouski, Rimouski, Canada
| | - Jean-Sébastien Moore
- Institut de biologie intégrative et des systèmes, Université Laval, Québec, Canada
- Centre d'Études Nordiques, Université Laval, Québec, Canada
- Ressources Aquatique Québec, Université de Rimouski, Rimouski, Canada
| |
Collapse
|
22
|
Lucek K, Giménez MD, Joron M, Rafajlović M, Searle JB, Walden N, Westram AM, Faria R. The Impact of Chromosomal Rearrangements in Speciation: From Micro- to Macroevolution. Cold Spring Harb Perspect Biol 2023; 15:a041447. [PMID: 37604585 PMCID: PMC10626258 DOI: 10.1101/cshperspect.a041447] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/23/2023]
Abstract
Chromosomal rearrangements (CRs) have been known since almost the beginning of genetics. While an important role for CRs in speciation has been suggested, evidence primarily stems from theoretical and empirical studies focusing on the microevolutionary level (i.e., on taxon pairs where speciation is often incomplete). Although the role of CRs in eukaryotic speciation at a macroevolutionary level has been supported by associations between species diversity and rates of evolution of CRs across phylogenies, these findings are limited to a restricted range of CRs and taxa. Now that more broadly applicable and precise CR detection approaches have become available, we address the challenges in filling some of the conceptual and empirical gaps between micro- and macroevolutionary studies on the role of CRs in speciation. We synthesize what is known about the macroevolutionary impact of CRs and suggest new research avenues to overcome the pitfalls of previous studies to gain a more comprehensive understanding of the evolutionary significance of CRs in speciation across the tree of life.
Collapse
Affiliation(s)
- Kay Lucek
- Biodiversity Genomics Laboratory, Institute of Biology, University of Neuchâtel, 2000 Neuchâtel, Switzerland
| | - Mabel D Giménez
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Instituto de Genética Humana de Misiones (IGeHM), Parque de la Salud de la Provincia de Misiones "Dr. Ramón Madariaga," N3300KAZ Posadas, Misiones, Argentina
- Facultad de Ciencias Exactas Químicas y Naturales, Universidad Nacional de Misiones, N3300LQH Posadas, Misiones, Argentina
| | - Mathieu Joron
- Centre d'Ecologie Fonctionnelle et Evolutive, Université de Montpellier, CNRS, EPHE, IRD, 34293 Montpellier, France
| | - Marina Rafajlović
- Department of Marine Sciences, University of Gothenburg, 405 30 Gothenburg, Sweden
- Centre for Marine Evolutionary Biology, University of Gothenburg, 405 30 Gothenburg, Sweden
| | - Jeremy B Searle
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York 14853, USA
| | - Nora Walden
- Centre for Organismal Studies, University of Heidelberg, 69117 Heidelberg, Germany
| | - Anja Marie Westram
- Institute of Science and Technology Austria (ISTA), 3400 Klosterneuburg, Austria
- Faculty of Biosciences and Aquaculture, Nord University, 8026 Bodø, Norway
| | - Rui Faria
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado;
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal
| |
Collapse
|
23
|
Zinevich L, Prommer M, Laczkó L, Rozhkova D, Sorokin A, Karyakin I, Bagyura J, Cserkész T, Sramkó G. Phylogenomic insights into the polyphyletic nature of Altai falcons within eastern sakers (Falco cherrug) and the origins of gyrfalcons (Falco rusticolus). Sci Rep 2023; 13:17800. [PMID: 37853004 PMCID: PMC10584951 DOI: 10.1038/s41598-023-44534-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 10/10/2023] [Indexed: 10/20/2023] Open
Abstract
The Altai falcon from Central Asia always attracted the attention of humans. Long considered a totemic bird in its native area, modern falconers still much appreciated this large-bodied and mighty bird of prey due to its rarity and unique look. The peculiar body characteristics halfway between the saker falcon (Falco cherrug) and the gyrfalcon (F. rusticolus) triggered debates about its contentious taxonomy. The weak phylogenetic signal associated with traditional genetic methods could not resolve this uncertainty. Here, we address the controversial evolutionary origin of Altai falcons by means of a genome-wide approach, Restriction-site Associated DNA sequencing, using sympatric eastern sakers falcons, allopatric western saker falcons and gyrfalcons as outgroup. This approach provided an unprecedented insight into the phylogenetic relationships of the studied populations by delivering 17,095 unlinked SNPs shedding light on the polyphyletic nature of Altai falcons within eastern sakers. Thus we concluded that the former must correspond to a low taxonomic rank, probably an ecotype or form of the latter. Also, we found that eastern sakers are paraphyletic without gyrfalcons, thus, these latter birds are best regarded as the direct sister lineage of the eastern sakers. This evolutionary relationship, corroborated also by re-analyzing the dataset with the inclusion of outgroup samples (F. biarmicus and F. peregrinus), put eastern sakers into a new light as the potential ancestral genetic source of high latitude and altitude adaptation in descendent populations. Finally, conservation genomic values hint at the stable genetic background of the studied saker populations.
Collapse
Affiliation(s)
- Liudmila Zinevich
- Koltzov Institute of Developmental Biology Russian Academy of Sciences, Moscow, Russian Federation
- All-Russian Research Institute for Environmental Protection, Moscow, Russian Federation
| | | | - Levente Laczkó
- HUN-REN-UD Conservation Biology Research Group, Egyetem tér 1, Debrecen, 4032, Hungary
- Evolutionary Genomics Research Group, Department of Botany, University of Debrecen, Egyetem tér 1, Debrecen, 4032, Hungary
| | - Daria Rozhkova
- Koltzov Institute of Developmental Biology Russian Academy of Sciences, Moscow, Russian Federation
- All-Russian Research Institute for Environmental Protection, Moscow, Russian Federation
| | - Alexander Sorokin
- All-Russian Research Institute for Environmental Protection, Moscow, Russian Federation
| | | | - János Bagyura
- MME - BirdLife Hungary, Költő utca 21, Budapest, 1121, Hungary
| | - Tamás Cserkész
- Hungarian Natural History Museum, Baross utca 13, Budapest, 1088, Hungary
| | - Gábor Sramkó
- HUN-REN-UD Conservation Biology Research Group, Egyetem tér 1, Debrecen, 4032, Hungary.
- Evolutionary Genomics Research Group, Department of Botany, University of Debrecen, Egyetem tér 1, Debrecen, 4032, Hungary.
| |
Collapse
|
24
|
Liu X, Liu W, Lenstra JA, Zheng Z, Wu X, Yang J, Li B, Yang Y, Qiu Q, Liu H, Li K, Liang C, Guo X, Ma X, Abbott RJ, Kang M, Yan P, Liu J. Evolutionary origin of genomic structural variations in domestic yaks. Nat Commun 2023; 14:5617. [PMID: 37726270 PMCID: PMC10509194 DOI: 10.1038/s41467-023-41220-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 08/23/2023] [Indexed: 09/21/2023] Open
Abstract
Yak has been subject to natural selection, human domestication and interspecific introgression during its evolution. However, genetic variants favored by each of these processes have not been distinguished previously. We constructed a graph-genome for 47 genomes of 7 cross-fertile bovine species. This allowed detection of 57,432 high-resolution structural variants (SVs) within and across the species, which were genotyped in 386 individuals. We distinguished the evolutionary origins of diverse SVs in domestic yaks by phylogenetic analyses. We further identified 334 genes overlapping with SVs in domestic yaks that bore potential signals of selection from wild yaks, plus an additional 686 genes introgressed from cattle. Nearly 90% of the domestic yaks were introgressed by cattle. Introgression of an SV spanning the KIT gene triggered the breeding of white domestic yaks. We validated a significant association of the selected stratified SVs with gene expression, which contributes to phenotypic variations. Our results highlight that SVs of different origins contribute to the phenotypic diversity of domestic yaks.
Collapse
Affiliation(s)
- Xinfeng Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China
- Academy of Plateau Science and Sustainability, Qinghai Normal University, Xining, 810016, China
| | - Wenyu Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Johannes A Lenstra
- Faculty of Veterinary Medicine, Utrecht University, Utrecht, 3508 TD, The Netherlands
| | - Zeyu Zheng
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Xiaoyun Wu
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China
| | - Jiao Yang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Bowen Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Yongzhi Yang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Qiang Qiu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Hongyu Liu
- Anhui Provincial Laboratory of Local Livestock and Poultry Genetical Resource Conservation and Breeding, College of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China
| | - Kexin Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Chunnian Liang
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China
| | - Xian Guo
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China
| | - Xiaoming Ma
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China
| | - Richard J Abbott
- School of Biology, University of St Andrews, St Andrews, KY16 9AJ, UK
| | - Minghui Kang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China.
| | - Ping Yan
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China.
| | - Jianquan Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystem, College of Ecology, Lanzhou University, Lanzhou, 730000, China.
- Academy of Plateau Science and Sustainability, Qinghai Normal University, Xining, 810016, China.
| |
Collapse
|
25
|
Catalán A, Merondun J, Knief U, Wolf JBW. Chromatin accessibility, not 5mC methylation covaries with partial dosage compensation in crows. PLoS Genet 2023; 19:e1010901. [PMID: 37747941 PMCID: PMC10575545 DOI: 10.1371/journal.pgen.1010901] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 10/13/2023] [Accepted: 08/07/2023] [Indexed: 09/27/2023] Open
Abstract
The evolution of genetic sex determination is often accompanied by degradation of the sex-limited chromosome. Male heterogametic systems have evolved convergent, epigenetic mechanisms restoring the resulting imbalance in gene dosage between diploid autosomes (AA) and the hemizygous sex chromosome (X). Female heterogametic systems (AAf Zf, AAm ZZm) tend to only show partial dosage compensation (0.5 < Zf:AAf < 1) and dosage balance (0.5
Collapse
Affiliation(s)
- Ana Catalán
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Uppsala, Sweden
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
| | - Justin Merondun
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
| | - Ulrich Knief
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
- Evolutionary Biology & Ecology,Faculty of Biology, University of Freiburg, Freiburg, Germany
| | - Jochen B. W. Wolf
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Uppsala, Sweden
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
| |
Collapse
|
26
|
Benham PM, Cicero C, DeRaad DA, McCormack JE, Wayne RK, Escalona M, Beraut E, Marimuthu MPA, Nguyen O, Nachman MW, Bowie RCK. A highly contiguous reference genome for the Steller's jay (Cyanocitta stelleri). J Hered 2023; 114:549-560. [PMID: 37395718 PMCID: PMC10445514 DOI: 10.1093/jhered/esad042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 06/29/2023] [Indexed: 07/04/2023] Open
Abstract
The Steller's jay is a familiar bird of western forests from Alaska south to Nicaragua. Here, we report a draft reference assembly for the species generated from PacBio HiFi long-read and Omni-C chromatin-proximity sequencing data as part of the California Conservation Genomics Project (CCGP). Sequenced reads were assembled into 352 scaffolds totaling 1.16 Gb in length. Assembly metrics indicate a highly contiguous and complete assembly with a contig N50 of 7.8 Mb, scaffold N50 of 25.8 Mb, and BUSCO completeness score of 97.2%. Repetitive elements span 16.6% of the genome including nearly 90% of the W chromosome. Compared with high-quality assemblies from other members of the family Corvidae, the Steller's jay genome contains a larger proportion of repetitive elements than 4 crow species (Corvus), but a lower proportion of repetitive elements than the California scrub-jay (Aphelocoma californica). This reference genome will serve as an essential resource for future studies on speciation, local adaptation, phylogeography, and conservation genetics in this species of significant biological interest.
Collapse
Affiliation(s)
- Phred M Benham
- Museum of Vertebrate Zoology, University of California Berkeley, Berkeley, CA, United States
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA, United States
| | - Carla Cicero
- Museum of Vertebrate Zoology, University of California Berkeley, Berkeley, CA, United States
| | - Devon A DeRaad
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, United States
| | - John E McCormack
- Moore Laboratory of Zoology, Occidental College, Los Angeles, CA, United States
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States
| | - Merly Escalona
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, United States
| | - Eric Beraut
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, United States
| | - Mohan P A Marimuthu
- DNA Technologies and Expression Analysis Core Laboratory, Genome Center, University of California-Davis, Davis, CA, United States
| | - Oanh Nguyen
- DNA Technologies and Expression Analysis Core Laboratory, Genome Center, University of California-Davis, Davis, CA, United States
| | - Michael W Nachman
- Museum of Vertebrate Zoology, University of California Berkeley, Berkeley, CA, United States
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA, United States
| | - Rauri C K Bowie
- Museum of Vertebrate Zoology, University of California Berkeley, Berkeley, CA, United States
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA, United States
| |
Collapse
|
27
|
Shinde SS, Sharma A, Vijay N. Decoding the fibromelanosis locus complex chromosomal rearrangement of black-bone chicken: genetic differentiation, selective sweeps and protein-coding changes in Kadaknath chicken. Front Genet 2023; 14:1180658. [PMID: 37424723 PMCID: PMC10325862 DOI: 10.3389/fgene.2023.1180658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 06/05/2023] [Indexed: 07/11/2023] Open
Abstract
Black-bone chicken (BBC) meat is popular for its distinctive taste and texture. A complex chromosomal rearrangement at the fibromelanosis (Fm) locus on the 20th chromosome results in increased endothelin-3 (EDN3) gene expression and is responsible for melanin hyperpigmentation in BBC. We use public long-read sequencing data of the Silkie breed to resolve high-confidence haplotypes at the Fm locus spanning both Dup1 and Dup2 regions and establish that the Fm_2 scenario is correct of the three possible scenarios of the complex chromosomal rearrangement. The relationship between Chinese and Korean BBC breeds with Kadaknath native to India is underexplored. Our data from whole-genome re-sequencing establish that all BBC breeds, including Kadaknath, share the complex chromosomal rearrangement junctions at the fibromelanosis (Fm) locus. We also identify two Fm locus proximal regions (∼70 Kb and ∼300 Kb) with signatures of selection unique to Kadaknath. These regions harbor several genes with protein-coding changes, with the bactericidal/permeability-increasing-protein-like gene having two Kadaknath-specific changes within protein domains. Our results indicate that protein-coding changes in the bactericidal/permeability-increasing-protein-like gene hitchhiked with the Fm locus in Kadaknath due to close physical linkage. Identifying this Fm locus proximal selective sweep sheds light on the genetic distinctiveness of Kadaknath compared to other BBC.
Collapse
Affiliation(s)
| | | | - Nagarjun Vijay
- Computational Evolutionary Genomics Lab, Department of Biological Sciences, IISER Bhopal, Bhauri, Madhya Pradesh, India
| |
Collapse
|
28
|
Mueller JC, Schlebusch SA, Pei Y, Poignet M, Vontzou N, Ruiz-Ruano FJ, Albrecht T, Reifová R, Forstmeier W, Suh A, Kempenaers B. Micro Germline-Restricted Chromosome in Blue Tits: Evidence for Meiotic Functions. Mol Biol Evol 2023; 40:msad096. [PMID: 37116210 PMCID: PMC10172847 DOI: 10.1093/molbev/msad096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 04/19/2023] [Accepted: 04/21/2023] [Indexed: 04/30/2023] Open
Abstract
The germline-restricted chromosome (GRC) is likely present in all songbird species but differs widely in size and gene content. This extra chromosome has been described as either a microchromosome with only limited basic gene content or a macrochromosome with enriched gene functions related to female gonad and embryo development. Here, we assembled, annotated, and characterized the first micro-GRC in the blue tit (Cyanistes caeruleus) using high-fidelity long-read sequencing data. Although some genes on the blue tit GRC show signals of pseudogenization, others potentially have important functions, either currently or in the past. We highlight the GRC gene paralog BMP15, which is among the highest expressed GRC genes both in blue tits and in zebra finches (Taeniopygia guttata) and is known to play a role in oocyte and follicular maturation in other vertebrates. The GRC genes of the blue tit are further enriched for functions related to the synaptonemal complex. We found a similar functional enrichment when analyzing published data on GRC genes from two nightingale species (Luscinia spp.). We hypothesize that these genes play a role in maintaining standard maternal inheritance or in recombining maternal and paternal GRCs during potential episodes of biparental inheritance.
Collapse
Affiliation(s)
- Jakob C Mueller
- Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Biological Intelligence, Seewiesen, Germany
| | | | - Yifan Pei
- Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Biological Intelligence, Seewiesen, Germany
| | - Manon Poignet
- Department of Zoology, Charles University, Prague, Czech Republic
| | - Niki Vontzou
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Francisco J Ruiz-Ruano
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
- Department of Organismal Biology, Evolutionary Biology Centre, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Tomáš Albrecht
- Department of Zoology, Charles University, Prague, Czech Republic
- Institute of Vertebrate Biology, Academy of Sciences, Brno, Czech Republic
| | - Radka Reifová
- Department of Zoology, Charles University, Prague, Czech Republic
| | - Wolfgang Forstmeier
- Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Biological Intelligence, Seewiesen, Germany
| | - Alexander Suh
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
- Department of Organismal Biology, Evolutionary Biology Centre, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Bart Kempenaers
- Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Biological Intelligence, Seewiesen, Germany
| |
Collapse
|
29
|
Mérot C, Stenløkk KSR, Venney C, Laporte M, Moser M, Normandeau E, Árnyasi M, Kent M, Rougeux C, Flynn JM, Lien S, Bernatchez L. Genome assembly, structural variants, and genetic differentiation between lake whitefish young species pairs (Coregonus sp.) with long and short reads. Mol Ecol 2023; 32:1458-1477. [PMID: 35416336 DOI: 10.1111/mec.16468] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Revised: 03/24/2022] [Accepted: 04/01/2022] [Indexed: 11/26/2022]
Abstract
Nascent pairs of ecologically differentiated species offer an opportunity to get a better glimpse at the genetic architecture of speciation. Of particular interest is our recent ability to consider a wider range of genomic variants, not only single-nucleotide polymorphisms (SNPs), thanks to long-read sequencing technology. We can now identify structural variants (SVs) such as insertions, deletions and other rearrangements, allowing further insights into the genetic architecture of speciation and how different types of variants are involved in species differentiation. Here, we investigated genomic patterns of differentiation between sympatric species pairs (Dwarf and Normal) belonging to the lake whitefish (Coregonus clupeaformis) species complex. We assembled the first reference genomes for both C. clupeaformis sp. Normal and C. clupeaformis sp. Dwarf, annotated the transposable elements and analysed the genomes in the light of related coregonid species. Next, we used a combination of long- and short-read sequencing to characterize SVs and genotype them at the population scale using genome-graph approaches, showing that SVs cover five times more of the genome than SNPs. We then integrated both SNPs and SVs to investigate the genetic architecture of species differentiation in two different lakes and highlighted an excess of shared outliers of differentiation. In particular, a large fraction of SVs differentiating the two species correspond to insertions or deletions of transposable elements (TEs), suggesting that TE accumulation may represent a key component of genetic divergence between the Dwarf and Normal species. Together, our results suggest that SVs may play an important role in speciation and that, by combining second- and third-generation sequencing, we now have the ability to integrate SVs into speciation genomics.
Collapse
Affiliation(s)
- Claire Mérot
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada.,UMR 6553 Ecobio, OSUR, CNRS, Université de Rennes, Rennes, France
| | - Kristina S R Stenløkk
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Clare Venney
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
| | - Martin Laporte
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada.,Ministère des Forêts, de la Faune et des Parcs (MFFP) du Québec, Québec, Québec, Canada
| | - Michel Moser
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Eric Normandeau
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
| | - Mariann Árnyasi
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Matthew Kent
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Clément Rougeux
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
| | - Jullien M Flynn
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, USA
| | - Sigbjørn Lien
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Louis Bernatchez
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
| |
Collapse
|
30
|
Peona V, Kutschera VE, Blom MPK, Irestedt M, Suh A. Satellite DNA evolution in Corvoidea inferred from short and long reads. Mol Ecol 2023; 32:1288-1305. [PMID: 35488497 DOI: 10.1111/mec.16484] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 04/11/2022] [Accepted: 04/17/2022] [Indexed: 11/29/2022]
Abstract
Satellite DNA (satDNA) is a fast-evolving portion of eukaryotic genomes. The homogeneous and repetitive nature of such satDNA causes problems during the assembly of genomes, and therefore it is still difficult to study it in detail in nonmodel organisms as well as across broad evolutionary timescales. Here, we combined the use of short- and long-read data to explore the diversity and evolution of satDNA between individuals of the same species and between genera of birds spanning ~40 millions of years of bird evolution using birds-of-paradise (Paradisaeidae) and crow (Corvus) species. These avian species highlighted the presence of a GC-rich Corvoidea satellitome composed of 61 satellite families and provided a set of candidate satDNA monomers for being centromeric on the basis of length, abundance, homogeneity and transcription. Surprisingly, we found that the satDNA of crow species rapidly diverged between closely related species while the satDNA appeared more similar between birds-of-paradise species belonging to different genera.
Collapse
Affiliation(s)
- Valentina Peona
- Department of Organismal Biology - Systematic Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Verena E Kutschera
- Department of Biochemistry and Biophysics, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Stockholm University, Solna, Sweden
| | - Mozes P K Blom
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden.,Museum für Naturkunde, Leibniz Institut für Evolutions- und Biodiversitätsforschung, Berlin, Germany
| | - Martin Irestedt
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - Alexander Suh
- Department of Organismal Biology - Systematic Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.,School of Biological Sciences-Organisms and the Environment, University of East Anglia, Norwich, UK
| |
Collapse
|
31
|
Zhang L, Chaturvedi S, Nice CC, Lucas LK, Gompert Z. Population genomic evidence of selection on structural variants in a natural hybrid zone. Mol Ecol 2023; 32:1497-1514. [PMID: 35398939 DOI: 10.1111/mec.16469] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 03/18/2022] [Accepted: 03/28/2022] [Indexed: 12/01/2022]
Abstract
Structural variants (SVs) can promote speciation by directly causing reproductive isolation or by suppressing recombination across large genomic regions. Whereas examples of each mechanism have been documented, systematic tests of the role of SVs in speciation are lacking. Here, we take advantage of long-read (Oxford nanopore) whole-genome sequencing and a hybrid zone between two Lycaeides butterfly taxa (L. melissa and Jackson Hole Lycaeides) to comprehensively evaluate genome-wide patterns of introgression for SVs and relate these patterns to hypotheses about speciation. We found >100,000 SVs segregating within or between the two hybridizing species. SVs and SNPs exhibited similar levels of genetic differentiation between species, with the exception of inversions, which were more differentiated. We detected credible variation in patterns of introgression among SV loci in the hybrid zone, with 562 of 1419 ancestry-informative SVs exhibiting genomic clines that deviated from null expectations based on genome-average ancestry. Overall, hybrids exhibited a directional shift towards Jackson Hole Lycaeides ancestry at SV loci, consistent with the hypothesis that these loci experienced more selection on average than SNP loci. Surprisingly, we found that deletions, rather than inversions, showed the highest skew towards excess ancestry from Jackson Hole Lycaeides. Excess Jackson Hole Lycaeides ancestry in hybrids was also especially pronounced for Z-linked SVs and inversions containing many genes. In conclusion, our results show that SVs are ubiquitous and suggest that SVs in general, but especially deletions, might disproportionately affect hybrid fitness and thus contribute to reproductive isolation.
Collapse
Affiliation(s)
- Linyi Zhang
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada.,Department of Biology, Utah State University, Logan, Utah, USA
| | - Samridhi Chaturvedi
- Department of Integrative Biology, University of California, Berkeley, California, USA
| | - Chris C Nice
- Department of Biology, Texas State University, San Marcos, Texas, USA
| | - Lauren K Lucas
- Department of Biology, Utah State University, Logan, Utah, USA
| | | |
Collapse
|
32
|
Jun G, English AC, Metcalf GA, Yang J, Chaisson MJP, Pankratz N, Menon VK, Salerno WJ, Krasheninina O, Smith AV, Lane JA, Blackwell T, Kang HM, Salvi S, Meng Q, Shen H, Pasham D, Bhamidipati S, Kottapalli K, Arnett DK, Ashley-Koch A, Auer PL, Beutel KM, Bis JC, Blangero J, Bowden DW, Brody JA, Cade BE, Chen YDI, Cho MH, Curran JE, Fornage M, Freedman BI, Fingerlin T, Gelb BD, Hou L, Hung YJ, Kane JP, Kaplan R, Kim W, Loos RJ, Marcus GM, Mathias RA, McGarvey ST, Montgomery C, Naseri T, Nouraie SM, Preuss MH, Palmer ND, Peyser PA, Raffield LM, Ratan A, Redline S, Reupena S, Rotter JI, Rich SS, Rienstra M, Ruczinski I, Sankaran VG, Schwartz DA, Seidman CE, Seidman JG, Silverman EK, Smith JA, Stilp A, Taylor KD, Telen MJ, Weiss ST, Williams LK, Wu B, Yanek LR, Zhang Y, Lasky-Su J, Gingras MC, Dutcher SK, Eichler EE, Gabriel S, Germer S, Kim R, Viaud-Martinez KA, Nickerson DA, Luo J, Reiner A, Gibbs RA, Boerwinkle E, Abecasis G, Sedlazeck FJ. Structural variation across 138,134 samples in the TOPMed consortium. RESEARCH SQUARE 2023:rs.3.rs-2515453. [PMID: 36778386 PMCID: PMC9915771 DOI: 10.21203/rs.3.rs-2515453/v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hematologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.
Collapse
Affiliation(s)
- Goo Jun
- Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston
| | - Adam C English
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Ginger A Metcalf
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Jianzhi Yang
- University of Southern California, Los Angeles, CA, USA
| | | | | | - Vipin K Menon
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | | | | | - Albert V Smith
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - John A Lane
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Tom Blackwell
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Hyun Min Kang
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Sejal Salvi
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Qingchang Meng
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Hua Shen
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Divya Pasham
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Sravya Bhamidipati
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Kavya Kottapalli
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Donna K. Arnett
- Department of Epidemiology, University of Kentucky College of Public Health
| | - Allison Ashley-Koch
- Department of Medicine, Duke University Medical Center, Durham, NC
- Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC
| | - Paul L. Auer
- Division of Biostatistics and Cancer Center, Medical College of Wisconsin, Milwaukee WI
| | | | - Joshua C. Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas, Rio Grande Valley School of Medicine, Brownsville, TX
| | - Donald W. Bowden
- Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Jennifer A. Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Brian E. Cade
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA
| | - Yii-Der Ida Chen
- Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA
| | - Michael H. Cho
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, USA
| | - Joanne E. Curran
- Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Myriam Fornage
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX
| | - Barry I. Freedman
- Department of Internal Medicine, Section on Nephrology, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Tasha Fingerlin
- Center for Genes, Environment and Health, National Jewish Health, 1400 Jackson St., Denver, CO, 80206, USA
| | - Bruce D. Gelb
- Mindich Child Health and Development Institute and the Departments of Pediatrics and Genetics & Genomic Sciences, Icahn School of Medicine at Mount Sinai
| | | | - Yi-Jen Hung
- Institute of Preventive Medicine, National Defense Medical Center, Taiwan
| | - John P Kane
- Cardiovascular Research Institute, University of California, San Francisco
| | - Robert Kaplan
- Department of epidemiology and population health, Albert Einstein College of Medicine, Bronx NY USA
| | - Wonji Kim
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
| | - Ruth J.F. Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Gregory M Marcus
- Division of Cardiology, University of California, San Francisco CA
| | - Rasika A. Mathias
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Stephen T. McGarvey
- Department of Epidemiology, International Health Institute and Department of Anthropology, Brown University
| | - Courtney Montgomery
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation
| | - Take Naseri
- Ministry of Health, Government of Samoa, Apia, Samoa
| | - S. Mehdi Nouraie
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213
| | - Michael H. Preuss
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | | | - Patricia A. Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI USA
| | | | - Aakrosh Ratan
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA
| | | | - Jerome I. Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA
- Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Stephen S. Rich
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI USA
| | - Michiel Rienstra
- Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Ingo Ruczinski
- Department of Biostatistics, Johns Hopkins University Bloomberg, School of Public Health, Baltimore, MD, USA
| | - Vijay G. Sankaran
- Division of Hematology/Oncology, Boston Children’s Hospital and Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
| | | | - Christine E. Seidman
- Department of Genetics, Harvard Medical School
- Cardiovascular Division, Brigham & Women’s Hospital, Harvard University
- Howard Hughes Medical Institute, Harvard University
| | | | - Edwin K. Silverman
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA
| | - Jennifer A. Smith
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213
| | - Adrienne Stilp
- Department of Biostatistics, University of Washington, Seattle, WA
| | - Kent D. Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA USA
| | - Marilyn J. Telen
- Department of Medicine, Duke University Medical Center, Durham, NC
| | - Scott T. Weiss
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
| | - L. Keoki Williams
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, Michigan, United States of America
| | - Baojun Wu
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, Michigan, United States of America
| | - Lisa R. Yanek
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Yingze Zhang
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213
| | - Jessica Lasky-Su
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
| | | | - Susan K. Dutcher
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington, USA
| | | | | | - Ryan Kim
- Psomagen, Inc.,Rockville, Maryland, USA
| | | | | | | | - James Luo
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Alex Reiner
- Department of Epidemiology, University of Washington, Seattle, WA 98109, USA
| | - Richard A Gibbs
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Eric Boerwinkle
- Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Goncalo Abecasis
- Regeneron Genetics Center
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Fritz J Sedlazeck
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
- Department of Computer Science, Rice University, 6100 Main Street, Houston, TX, 77005, USA
| |
Collapse
|
33
|
Jun G, English AC, Metcalf GA, Yang J, Chaisson MJP, Pankratz N, Menon VK, Salerno WJ, Krasheninina O, Smith AV, Lane JA, Blackwell T, Kang HM, Salvi S, Meng Q, Shen H, Pasham D, Bhamidipati S, Kottapalli K, Arnett DK, Ashley-Koch A, Auer PL, Beutel KM, Bis JC, Blangero J, Bowden DW, Brody JA, Cade BE, Chen YDI, Cho MH, Curran JE, Fornage M, Freedman BI, Fingerlin T, Gelb BD, Hou L, Hung YJ, Kane JP, Kaplan R, Kim W, Loos RJ, Marcus GM, Mathias RA, McGarvey ST, Montgomery C, Naseri T, Nouraie SM, Preuss MH, Palmer ND, Peyser PA, Raffield LM, Ratan A, Redline S, Reupena S, Rotter JI, Rich SS, Rienstra M, Ruczinski I, Sankaran VG, Schwartz DA, Seidman CE, Seidman JG, Silverman EK, Smith JA, Stilp A, Taylor KD, Telen MJ, Weiss ST, Williams LK, Wu B, Yanek LR, Zhang Y, Lasky-Su J, Gingras MC, Dutcher SK, Eichler EE, Gabriel S, Germer S, Kim R, Viaud-Martinez KA, Nickerson DA, Luo J, Reiner A, Gibbs RA, Boerwinkle E, Abecasis G, Sedlazeck FJ. Structural variation across 138,134 samples in the TOPMed consortium. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.25.525428. [PMID: 36747810 PMCID: PMC9900832 DOI: 10.1101/2023.01.25.525428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hemotologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.
Collapse
Affiliation(s)
- Goo Jun
- Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston
| | - Adam C English
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Ginger A Metcalf
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Jianzhi Yang
- University of Southern California, Los Angeles, CA, USA
| | | | | | - Vipin K Menon
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | | | | | - Albert V Smith
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - John A Lane
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Tom Blackwell
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Hyun Min Kang
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Sejal Salvi
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Qingchang Meng
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Hua Shen
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Divya Pasham
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Sravya Bhamidipati
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Kavya Kottapalli
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Donna K. Arnett
- Department of Epidemiology, University of Kentucky College of Public Health
| | - Allison Ashley-Koch
- Department of Medicine, Duke University Medical Center, Durham, NC
- Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC
| | - Paul L. Auer
- Division of Biostatistics and Cancer Center, Medical College of Wisconsin, Milwaukee WI
| | | | - Joshua C. Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas, Rio Grande Valley School of Medicine, Brownsville, TX
| | - Donald W. Bowden
- Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Jennifer A. Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Brian E. Cade
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA
| | - Yii-Der Ida Chen
- Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA
| | - Michael H. Cho
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Joanne E. Curran
- Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Myriam Fornage
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX
| | - Barry I. Freedman
- Department of Internal Medicine, Section on Nephrology, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Tasha Fingerlin
- Center for Genes, Environment and Health, National Jewish Health, 1400 Jackson St., Denver, CO, 80206, USA
| | - Bruce D. Gelb
- Mindich Child Health and Development Institute and the Departments of Pediatrics and Genetics & Genomic Sciences, Icahn School of Medicine at Mount Sinai
| | | | - Yi-Jen Hung
- Institute of Preventive Medicine, National Defense Medical Center, Taiwan
| | - John P Kane
- Cardiovascular Research Institute, University of California, San Francisco
| | - Robert Kaplan
- Department of epidemiology and population health, Albert Einstein College of Medicine, Bronx NY USA
| | - Wonji Kim
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Ruth J.F. Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Gregory M Marcus
- Division of Cardiology, University of California, San Francisco CA
| | - Rasika A. Mathias
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Stephen T. McGarvey
- Department of Epidemiology, International Health Institute and Department of Anthropology, Brown University
| | - Courtney Montgomery
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation
| | - Take Naseri
- Ministry of Health, Government of Samoa, Apia, Samoa
| | - S. Mehdi Nouraie
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213
| | - Michael H. Preuss
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | | | - Patricia A. Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI USA
| | | | - Aakrosh Ratan
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA
| | | | - Jerome I. Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA
- Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Stephen S. Rich
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI USA
| | - Michiel Rienstra
- Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Ingo Ruczinski
- Department of Biostatistics, Johns Hopkins University Bloomberg, School of Public Health, Baltimore, MD, USA
| | - Vijay G. Sankaran
- Division of Hematology/Oncology, Boston Children's Hospital and Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
| | | | - Christine E. Seidman
- Department of Genetics, Harvard Medical School
- Cardiovascular Division, Brigham & Women’s Hospital, Harvard University
- Howard Hughes Medical Institute, Harvard University
| | | | - Edwin K. Silverman
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA
| | - Jennifer A. Smith
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213
| | - Adrienne Stilp
- Department of Biostatistics, University of Washington, Seattle, WA
| | - Kent D. Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA USA
| | - Marilyn J. Telen
- Department of Medicine, Duke University Medical Center, Durham, NC
| | - Scott T. Weiss
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - L. Keoki Williams
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, Michigan, United States of America
| | - Baojun Wu
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, Michigan, United States of America
| | - Lisa R. Yanek
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Yingze Zhang
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213
| | - Jessica Lasky-Su
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | | | - Susan K. Dutcher
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington, USA
| | | | | | - Ryan Kim
- Psomagen, Inc.,Rockville, Maryland, USA
| | | | | | | | - James Luo
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Alex Reiner
- Department of Epidemiology, University of Washington, Seattle, WA 98109, USA
| | - Richard A Gibbs
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Eric Boerwinkle
- Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
| | - Goncalo Abecasis
- Regeneron Genetics Center
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Fritz J Sedlazeck
- Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
- Department of Computer Science, Rice University, 6100 Main Street, Houston, TX, 77005, USA
| |
Collapse
|
34
|
Chen Y, Wang AY, Barkley CA, Zhang Y, Zhao X, Gao M, Edmonds MD, Chong Z. Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak. Nat Commun 2023; 14:283. [PMID: 36650186 PMCID: PMC9845341 DOI: 10.1038/s41467-023-35996-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 01/11/2023] [Indexed: 01/19/2023] Open
Abstract
Long-read sequencing has demonstrated great potential for characterizing all types of structural variations (SVs). However, existing algorithms have insufficient sensitivity and precision. To address these limitations, we present DeBreak, a computational method for comprehensive and accurate SV discovery. Based on alignment results, DeBreak employs a density-based approach for clustering SV candidates together with a local de novo assembly approach for reconstructing long insertions. A partial order alignment algorithm ensures precise SV breakpoints with single base-pair resolution, and a k-means clustering method can report multi-allele SV events. DeBreak outperforms existing tools on both simulated and real long-read sequencing data from both PacBio and Nanopore platforms. An important application of DeBreak is analyzing cancer genomes for potentially tumor-driving SVs. DeBreak can also be used for supplementing whole-genome assembly-based SV discovery.
Collapse
Affiliation(s)
- Yu Chen
- Department of Genetics, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
- Informatics Institute, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
| | - Amy Y Wang
- Informatics Institute, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
- Department of Medicine, Division of General Internal Medicine, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
| | - Courtney A Barkley
- Department of Genetics, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
| | - Yixin Zhang
- Department of Computer Science, College of Arts and Sciences, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
| | - Xinyang Zhao
- Department of Biochemistry and Molecular Genetics, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
| | - Min Gao
- Informatics Institute, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
- Department of Medicine, Division of Cardiovascular Disease, Heersink School of Medicine, University of Alabama at Birmingham, AL, 35233, Birmingham, USA
| | - Mick D Edmonds
- Department of Genetics, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA
| | - Zechen Chong
- Department of Genetics, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA.
- Informatics Institute, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35294, USA.
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA.
| |
Collapse
|
35
|
López-Cortegano E, Craig RJ, Chebib J, Balogun EJ, Keightley PD. Rates and spectra of de novo structural mutations in Chlamydomonas reinhardtii. Genome Res 2023; 33:45-60. [PMID: 36617667 PMCID: PMC9977147 DOI: 10.1101/gr.276957.122] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022]
Abstract
Genetic variation originates from several types of spontaneous mutation, including single-nucleotide substitutions, short insertions and deletions (indels), and larger structural changes. Structural mutations (SMs) drive genome evolution and are thought to play major roles in evolutionary adaptation, speciation, and genetic disease, including cancers. Sequencing of mutation accumulation (MA) lines has provided estimates of rates and spectra of single-nucleotide and indel mutations in many species, yet the rate of new SMs is largely unknown. Here, we use long-read sequencing to determine the full mutation spectrum in MA lines derived from two strains (CC-1952 and CC-2931) of the green alga Chlamydomonas reinhardtii The SM rate is highly variable between strains and between MA lines, and SMs represent a substantial proportion of all mutations in both strains (CC-1952 6%; CC-2931 12%). The SM spectra differ considerably between the two strains, with almost all inversions and translocations occurring in CC-2931 MA lines. This variation is associated with heterogeneity in the number and type of active transposable elements (TEs), which comprise major proportions of SMs in both strains (CC-1952 22%; CC-2931 38%). In CC-2931, a Crypton and a previously undescribed type of DNA element have caused 71% of chromosomal rearrangements, whereas in CC-1952, a Dualen LINE is associated with 87% of duplications. Other SMs, notably large duplications in CC-2931, are likely products of various double-strand break repair pathways. Our results show that diverse types of SMs occur at substantial rates, and support prominent roles for SMs and TEs in evolution.
Collapse
Affiliation(s)
- Eugenio López-Cortegano
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Rory J Craig
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- California Institute for Quantitative Biosciences, UC Berkeley, Berkeley, California 94720, USA
| | - Jobran Chebib
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Eniolaye J Balogun
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario ON M5S 3B2, Canada
- Department of Biology, University of Toronto Mississauga, Mississauga ON L5L 1C6, Canada
| | - Peter D Keightley
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| |
Collapse
|
36
|
Tigano A, Russello MA. The genomic basis of reproductive and migratory behaviour in a polymorphic salmonid. Mol Ecol 2022; 31:6588-6604. [PMID: 36208020 DOI: 10.1111/mec.16724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 09/21/2022] [Accepted: 09/29/2022] [Indexed: 01/13/2023]
Abstract
Recent ecotypic differentiation provides unique opportunities to investigate the genomic basis and architecture of local adaptation, while offering insights into how species form and persist. Sockeye salmon (Oncorhynchus nerka) exhibit migratory and resident ("kokanee") ecotypes, which are further distinguished into shore-spawning and stream-spawning reproductive ecotypes. Here, we analysed 36 sockeye (stream-spawning) and kokanee (stream- and shore-spawning) genomes from a system where they co-occur and have recent common ancestry (Okanagan Lake/River in British Columbia, Canada) to investigate the genomic basis of reproductive and migratory behaviour. Examination of the genomic landscape of differentiation, differences in allele frequencies and genotype-phenotype associations revealed three main blocks of sequence differentiation on chromosomes 7, 12 and 20, associated with migratory behaviour, spawning location and spawning timing. Structural variants identified in these same areas suggest they could contribute to ecotypic differentiation directly as causal variants or via maintenance of their genomic architecture through recombination suppression mechanisms. Genes in these regions were related to spatial memory and swimming endurance (SYNGAP, TPM3), as well as eye and brain development (including SIX6), potentially associated with differences in migratory behaviour and visual habitats across spawning locations, respectively. Additional genes (GREB1L, ROCK1) identified here have been associated with timing of migration in other salmonids and could explain variation in timing of O. nerka spawning. Together, these results based on the joint analysis of sequence and structural variation represent a significant advance in our understanding of the genomic landscape of ecotypic differentiation at different stages in the speciation continuum.
Collapse
Affiliation(s)
- Anna Tigano
- Department of Biology, The University of British Columbia, Kelowna, British Columbia, Canada
| | - Michael A Russello
- Department of Biology, The University of British Columbia, Kelowna, British Columbia, Canada
| |
Collapse
|
37
|
Pokrovac I, Pezer Ž. Recent advances and current challenges in population genomics of structural variation in animals and plants. Front Genet 2022; 13:1060898. [PMID: 36523759 PMCID: PMC9745067 DOI: 10.3389/fgene.2022.1060898] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 11/15/2022] [Indexed: 05/02/2024] Open
Abstract
The field of population genomics has seen a surge of studies on genomic structural variation over the past two decades. These studies witnessed that structural variation is taxonomically ubiquitous and represent a dominant form of genetic variation within species. Recent advances in technology, especially the development of long-read sequencing platforms, have enabled the discovery of structural variants (SVs) in previously inaccessible genomic regions which unlocked additional structural variation for population studies and revealed that more SVs contribute to evolution than previously perceived. An increasing number of studies suggest that SVs of all types and sizes may have a large effect on phenotype and consequently major impact on rapid adaptation, population divergence, and speciation. However, the functional effect of the vast majority of SVs is unknown and the field generally lacks evidence on the phenotypic consequences of most SVs that are suggested to have adaptive potential. Non-human genomes are heavily under-represented in population-scale studies of SVs. We argue that more research on other species is needed to objectively estimate the contribution of SVs to evolution. We discuss technical challenges associated with SV detection and outline the most recent advances towards more representative reference genomes, which opens a new era in population-scale studies of structural variation.
Collapse
Affiliation(s)
| | - Željka Pezer
- Laboratory for Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| |
Collapse
|
38
|
Campagna L, Toews DP. The genomics of adaptation in birds. Curr Biol 2022; 32:R1173-R1186. [DOI: 10.1016/j.cub.2022.07.076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
39
|
Ruggieri AA, Livraghi L, Lewis JJ, Evans E, Cicconardi F, Hebberecht L, Ortiz-Ruiz Y, Montgomery SH, Ghezzi A, Rodriguez-Martinez JA, Jiggins CD, McMillan WO, Counterman BA, Papa R, Van Belleghem SM. A butterfly pan-genome reveals that a large amount of structural variation underlies the evolution of chromatin accessibility. Genome Res 2022; 32:1862-1875. [PMID: 36109150 PMCID: PMC9712634 DOI: 10.1101/gr.276839.122] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Accepted: 09/13/2022] [Indexed: 01/16/2023]
Abstract
Despite insertions and deletions being the most common structural variants (SVs) found across genomes, not much is known about how much these SVs vary within populations and between closely related species, nor their significance in evolution. To address these questions, we characterized the evolution of indel SVs using genome assemblies of three closely related Heliconius butterfly species. Over the relatively short evolutionary timescales investigated, up to 18.0% of the genome was composed of indels between two haplotypes of an individual Heliconius charithonia butterfly and up to 62.7% included lineage-specific SVs between the genomes of the most distant species (11 Mya). Lineage-specific sequences were mostly characterized as transposable elements (TEs) inserted at random throughout the genome and their overall distribution was similarly affected by linked selection as single nucleotide substitutions. Using chromatin accessibility profiles (i.e., ATAC-seq) of head tissue in caterpillars to identify sequences with potential cis-regulatory function, we found that out of the 31,066 identified differences in chromatin accessibility between species, 30.4% were within lineage-specific SVs and 9.4% were characterized as TE insertions. These TE insertions were localized closer to gene transcription start sites than expected at random and were enriched for sites with significant resemblance to several transcription factor binding sites with known function in neuron development in Drosophila We also identified 24 TE insertions with head-specific chromatin accessibility. Our results show high rates of structural genome evolution that were previously overlooked in comparative genomic studies and suggest a high potential for structural variation to serve as raw material for adaptive evolution.
Collapse
Affiliation(s)
- Angelo A Ruggieri
- Department of Biology, University of Puerto Rico-Rio Piedras, San Juan PR 00931, Puerto Rico
| | - Luca Livraghi
- Department of Biological Sciences, The George Washington University, Washington, DC 20052, USA
- Smithsonian Tropical Research Institute, Apartado 0843-03092 Panamá, Panama
| | - James J Lewis
- Department of Zoology, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada
| | - Elizabeth Evans
- Department of Biology, University of Puerto Rico-Rio Piedras, San Juan PR 00931, Puerto Rico
| | - Francesco Cicconardi
- School of Biological Sciences, Bristol University, Bristol BS8 1QU, United Kingdom
| | - Laura Hebberecht
- School of Biological Sciences, Bristol University, Bristol BS8 1QU, United Kingdom
| | - Yadira Ortiz-Ruiz
- Department of Biology, University of Puerto Rico-Rio Piedras, San Juan PR 00931, Puerto Rico
- Molecular Sciences and Research Center, University of Puerto Rico, San Juan 00926, Puerto Rico
| | - Stephen H Montgomery
- School of Biological Sciences, Bristol University, Bristol BS8 1QU, United Kingdom
| | - Alfredo Ghezzi
- Department of Biology, University of Puerto Rico-Rio Piedras, San Juan PR 00931, Puerto Rico
| | | | - Chris D Jiggins
- Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, United Kingdom
| | - W Owen McMillan
- Smithsonian Tropical Research Institute, Apartado 0843-03092 Panamá, Panama
| | - Brian A Counterman
- Department of Biological Sciences, Auburn University, Auburn, Alabama 36849, USA
| | - Riccardo Papa
- Department of Biology, University of Puerto Rico-Rio Piedras, San Juan PR 00931, Puerto Rico
- Molecular Sciences and Research Center, University of Puerto Rico, San Juan 00926, Puerto Rico
| | - Steven M Van Belleghem
- Department of Biology, University of Puerto Rico-Rio Piedras, San Juan PR 00931, Puerto Rico
- Ecology, Evolution and Conservation Biology, Biology Department, KU Leuven, 3000 Leuven, Belgium
| |
Collapse
|
40
|
Schikora-Tamarit MÀ, Gabaldón T. PerSVade: personalized structural variant detection in any species of interest. Genome Biol 2022; 23:175. [PMID: 35974382 PMCID: PMC9380391 DOI: 10.1186/s13059-022-02737-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 07/22/2022] [Indexed: 11/12/2022] Open
Abstract
Structural variants (SVs) underlie genomic variation but are often overlooked due to difficult detection from short reads. Most algorithms have been tested on humans, and it remains unclear how applicable they are in other organisms. To solve this, we develop perSVade (personalized structural variation detection), a sample-tailored pipeline that provides optimally called SVs and their inferred accuracy, as well as small and copy number variants. PerSVade increases SV calling accuracy on a benchmark of six eukaryotes. We find no universal set of optimal parameters, underscoring the need for sample-specific parameter optimization. PerSVade will facilitate SV detection and study across diverse organisms.
Collapse
Affiliation(s)
- Miquel Àngel Schikora-Tamarit
- Barcelona Supercomputing Centre (BSC-CNS), Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028, Barcelona, Spain
| | - Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS), Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain.
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028, Barcelona, Spain.
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain.
- Centro Investigación Biomédica En Red de Enfermedades Infecciosas, Barcelona, Spain.
| |
Collapse
|
41
|
Eriksson MC, Mandáková T, McCann J, Temsch EM, Chase MW, Hedrén M, Weiss-Schneeweiss H, Paun O. Repeat Dynamics across Timescales: A Perspective from Sibling Allotetraploid Marsh Orchids (Dactylorhiza majalis s.l.). Mol Biol Evol 2022; 39:msac167. [PMID: 35904928 PMCID: PMC9366187 DOI: 10.1093/molbev/msac167] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
To provide insights into the fate of transposable elements (TEs) across timescales in a post-polyploidization context, we comparatively investigate five sibling Dactylorhiza allotetraploids (Orchidaceae) formed independently and sequentially between 500 and 100K generations ago by unidirectional hybridization between diploids D. fuchsii and D. incarnata. Our results first reveal that the paternal D. incarnata genome shows a marked increased content of LTR retrotransposons compared to the maternal species, reflected in its larger genome size and consistent with a previously hypothesized bottleneck. With regard to the allopolyploids, in the youngest D. purpurella both genome size and TE composition appear to be largely additive with respect to parents, whereas for polyploids of intermediate ages we uncover rampant genome expansion on a magnitude of multiple entire genomes of some plants such as Arabidopsis. The oldest allopolyploids in the series are not larger than the intermediate ones. A putative tandem repeat, potentially derived from a non-autonomous miniature inverted-repeat TE (MITE) drives much of the genome dynamics in the allopolyploids. The highly dynamic MITE-like element is found in higher proportions in the maternal diploid, D. fuchsii, but is observed to increase in copy number in both subgenomes of the allopolyploids. Altogether, the fate of repeats appears strongly regulated and therefore predictable across multiple independent allopolyploidization events in this system. Apart from the MITE-like element, we consistently document a mild genomic shock following the allopolyploidizations investigated here, which may be linked to their relatively large genome sizes, possibly associated with strong selection against further genome expansions.
Collapse
Affiliation(s)
- Mimmi C Eriksson
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria
- Vienna Graduate School of Population Genetics, Veterinärplatz 1, A-1210 Vienna, Austria
| | - Terezie Mandáková
- Plant Cytogenomics Research Group, CEITEC−Central−European Institute of Technology, Masaryk University, Brno 62500, Czech Republic
- Central European Institute of Technology, Masaryk University, Brno 62500, Czech Republic
- Institute of Experimental Biology, Faculty of Science, Masaryk University, Brno 62500, Czech Republic
| | - Jamie McCann
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria
| | - Eva M Temsch
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria
| | - Mark W Chase
- Royal Botanic Gardens Kew, London TW9 3AE, United Kingdom
- Department of Environment and Agriculture, Curtin University, Perth, Western Australia, Australia
| | - Mikael Hedrén
- Department of Biology, University of Lund, Sölvegatan 37, SE-223 62 Lund, Sweden
| | - Hanna Weiss-Schneeweiss
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria
| | - Ovidiu Paun
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria
| |
Collapse
|
42
|
Du Z, D’Alessandro E, Zheng Y, Wang M, Chen C, Wang X, Song C. Retrotransposon Insertion Polymorphisms (RIPs) in Pig Coat Color Candidate Genes. Animals (Basel) 2022; 12:ani12080969. [PMID: 35454216 PMCID: PMC9031378 DOI: 10.3390/ani12080969] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/28/2022] [Accepted: 04/05/2022] [Indexed: 12/17/2022] Open
Abstract
The diversity of livestock coat color results from human positive selection and represents an indispensable part of breed identity. As an important biodiversity resource, pigs have many special characteristics, including the most visualized feature, coat color, and excellent adaptation, and the coat color represents an important phenotypic characteristic of the pig breed. Exploring the genetic mechanisms of phenotypic characteristics and the melanocortin system is of considerable interest in domestic animals because their energy metabolism and pigmentation have been under strong selection. In this study, 20 genes related to coat color in mammals were selected, and the structural variations (SVs) in these genic regions were identified by sequence alignment across 17 assembled pig genomes, from representing different types of pigs (miniature, lean, and fat type). A total of 167 large structural variations (>50 bp) of coat-color genes, which overlap with retrotransposon insertions (>50 bp), were obtained and designated as putative RIPs. Finally, 42 RIPs were confirmed by PCR detection. Additionally, eleven RIP sites were further evaluated for their genotypic distributions by PCR in more individuals of eleven domesticated breeds representing different coat color groups. Differential distributions of these RIPs were observed across populations, and some RIPs may be associated with breed differences.
Collapse
Affiliation(s)
- Zhanyu Du
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (Z.D.); (Y.Z.); (M.W.); (C.C.); (X.W.)
| | - Enrico D’Alessandro
- Department of Veterinary Sciences, University of Messina, Via Palatucci, 98168 Messina, Italy;
| | - Yao Zheng
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (Z.D.); (Y.Z.); (M.W.); (C.C.); (X.W.)
| | - Mengli Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (Z.D.); (Y.Z.); (M.W.); (C.C.); (X.W.)
| | - Cai Chen
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (Z.D.); (Y.Z.); (M.W.); (C.C.); (X.W.)
| | - Xiaoyan Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (Z.D.); (Y.Z.); (M.W.); (C.C.); (X.W.)
| | - Chengyi Song
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (Z.D.); (Y.Z.); (M.W.); (C.C.); (X.W.)
- Correspondence:
| |
Collapse
|
43
|
Derežanin L, Blažytė A, Dobrynin P, Duchêne DA, Grau JH, Jeon S, Kliver S, Koepfli KP, Meneghini D, Preick M, Tomarovsky A, Totikov A, Fickel J, Förster DW. Multiple types of genomic variation contribute to adaptive traits in the mustelid subfamily Guloninae. Mol Ecol 2022; 31:2898-2919. [PMID: 35334142 DOI: 10.1111/mec.16443] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 03/06/2022] [Accepted: 03/14/2022] [Indexed: 11/28/2022]
Abstract
Species of the mustelid subfamily Guloninae inhabit diverse habitats on multiple continents, and occupy a variety of ecological niches. They differ in feeding ecologies, reproductive strategies and morphological adaptations. To identify candidate loci associated with adaptations to their respective environments, we generated a de novo assembly of the tayra (Eira barbara), the earliest diverging species in the subfamily, and compared this with the genomes available for the wolverine (Gulo gulo) and the sable (Martes zibellina). Our comparative genomic analyses included searching for signs of positive selection, examining changes in gene family sizes, as well as searching for species-specific structural variants (SVs). Among candidate loci associated with phenotypic traits, we observed many related to diet, body condition and reproduction. For example, for the tayra, which has an atypical gulonine reproductive strategy of aseasonal breeding, we observe species-specific changes in many pregnancy-related genes. For the wolverine, a circumpolar hypercarnivore that must cope with seasonal food scarcity, we observed many changes in genes associated with diet and body condition. All types of genomic variation examined (single nucleotide polymorphisms, gene family expansions, structural variants) contributed substantially to the identification of candidate loci. This strongly argues for consideration of variation other than single nucleotide polymorphisms in comparative genomics studies aiming to identify loci of adaptive significance.
Collapse
Affiliation(s)
- Lorena Derežanin
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany
| | - Asta Blažytė
- Department of Biomedical Engineering, College of Information and Biotechnology, Ulsan National Institute of Science and Technology (UNIST, Ulsan, 44919, Republic of Korea
| | - Pavel Dobrynin
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia
| | - David A Duchêne
- Center for Evolutionary Hologenomics, The GLOBE Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Øster Farimagsgade 5, 1353, Copenhagen, Denmark
| | - José Horacio Grau
- amedes Genetics, amedes Medizinische Dienstleistungen GmbH, Jägerstr. 61, 10117, Berlin, Germany
| | - Sungwon Jeon
- Department of Biomedical Engineering, College of Information and Biotechnology, Ulsan National Institute of Science and Technology (UNIST, Ulsan, 44919, Republic of Korea.,Clinomics Inc, Ulsan, 44919, Republic of Korea
| | - Sergei Kliver
- Institute of Molecular and Cellular Biology, SB RAS, 8/2 Acad. Lavrentiev Ave, Novosibirsk, 630090, Russia
| | - Klaus-Peter Koepfli
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia.,Smithsonian-Mason School of Conservation, 1500 Remount Road, Front Royal, VA, 22630, USA.,Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, 1500 Remount Road, Front Royal, VA, 22630, USA
| | - Dorina Meneghini
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany
| | - Michaela Preick
- Institute for Biochemistry and Biology, Faculty of Mathematics and Natural Sciences, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476, OT, Germany
| | - Andrey Tomarovsky
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia.,Institute of Molecular and Cellular Biology, SB RAS, 8/2 Acad. Lavrentiev Ave, Novosibirsk, 630090, Russia.,Novosibirsk State University, 1 Pirogova str, Novosibirsk, 630090, Russia
| | - Azamat Totikov
- Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr, 197101, Saint Petersburg, Russia.,Institute of Molecular and Cellular Biology, SB RAS, 8/2 Acad. Lavrentiev Ave, Novosibirsk, 630090, Russia.,Novosibirsk State University, 1 Pirogova str, Novosibirsk, 630090, Russia
| | - Jörns Fickel
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany.,Institute for Biochemistry and Biology, Faculty of Mathematics and Natural Sciences, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476, OT, Germany
| | - Daniel W Förster
- Leibniz Institute for Zoo and Wildlife Research (IZW, Alfred Kowalke Straße 17, 10315, Berlin, Germany
| |
Collapse
|
44
|
Li A, Liu A, Wu S, Qu K, Hu H, Yang J, Shrestha N, Liu J, Ren G. Comparison of structural variants in the whole genome sequences of two Medicago truncatula ecotypes: Jemalong A17 and R108. BMC PLANT BIOLOGY 2022; 22:77. [PMID: 35193491 PMCID: PMC8862580 DOI: 10.1186/s12870-022-03469-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 02/14/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND Structural variants (SVs) constitute a large proportion of the genomic variation that results in phenotypic variation in plants. However, they are still a largely unexplored feature in most plant genomes. Here, we present the whole-genome landscape of SVs between two model legume Medicago truncatula ecotypes-Jemalong A17 and R108- that have been extensively used in various legume biology studies. RESULTS To catalogue SVs, we first resolved the previously published R108 genome assembly (R108 v1.0) to chromosome-scale using 124 × Hi-C data, resulting in a high-quality genome assembly. The inter-chromosomal reciprocal translocations between chromosomes 4 and 8 were confirmed by performing syntenic analysis between the two genomes. Combined with the Hi-C data, it appears that these translocation events had a significant effect on chromatin organization. Using both whole-genome and short-read alignments, we identified the genomic landscape of SVs between the two genomes, some of which may account for several phenotypic differences, including their differential responses to aluminum toxicity and iron deficiency, and the development of different anthocyanin leaf markings. We also found extensive SVs within the nodule-specific cysteine-rich gene family which encodes antimicrobial peptides essential for terminal bacteroid differentiation during nitrogen-fixing symbiosis. CONCLUSIONS Our results provide a near-complete R108 genome assembly and the first genomic landscape of SVs obtained by comparing two M. truncatula ecotypes. This may provide valuable genomic resources for the functional and molecular research of legume biology in the future.
Collapse
Affiliation(s)
- Ao Li
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Ai Liu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Shuang Wu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Kunjing Qu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Hongyin Hu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Jinli Yang
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Nawal Shrestha
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Jianquan Liu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Lab of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, China
| | - Guangpeng Ren
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China.
| |
Collapse
|
45
|
Warmuth VM, Weissensteiner MH, Wolf J. Ineffective silencing of transposable elements on an avian W Chromosome. Genome Res 2022; 32:671-681. [PMID: 35149543 PMCID: PMC8997356 DOI: 10.1101/gr.275465.121] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 02/08/2022] [Indexed: 11/24/2022]
Abstract
One of the defining features of transposable elements (TEs) is their ability to move to new locations in the host genome. To minimise the potentially deleterious effects of de novo TE insertions, hosts have evolved several mechanisms to control TE activity, including recombination-mediated removal and epigenetic silencing; however, increasing evidence suggests that silencing of TEs is often incomplete. The crow family experienced a recent radiation of LTR retrotransposons (LTRs), offering an opportunity to gain insight into the regulatory control of young, potentially still active TEs. We quantified the abundance of TE-derived transcripts across several tissues in 15 Eurasian crows (Corvus (corone) spp.) raised under common garden conditions and find evidence for ineffective TE suppression on the female-specific W Chromosome. Using RNA-seq data, we show that ~ 9.5% of all transcribed TEs had considerably greater (average: 16-fold) transcript abundance in female crows, and that more than 85% of these female-biased TEs originated on the W Chromosome. After accounting for differences in TE density among chromosomal classes, W-linked TEs were significantly more highly expressed than TEs residing on other chromosomes, consistent with ineffective silencing on the former. Together, our results suggest that the crow W Chromosome acts as a source of transcriptionally active TEs, with possible negative fitness consequences for female birds analogous to Drosophila (an X/Y system), where overexpression of Y-linked TEs is associated with male-specific aging and fitness loss ('toxic Y').
Collapse
|
46
|
Widanagama SD, Freeland JR, Xu X, Shafer ABA. Genome assembly, annotation, and comparative analysis of the cattail Typha latifolia. G3 GENES|GENOMES|GENETICS 2022; 12:6433155. [PMID: 34871392 PMCID: PMC9210280 DOI: 10.1093/g3journal/jkab401] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 11/13/2021] [Indexed: 11/19/2022]
Abstract
Cattails (Typha species) comprise a genus of emergent wetland plants with a global distribution. Typha latifolia and Typha angustifolia are two of the most widespread species, and in areas of sympatry can interbreed to produce the hybrid Typha × glauca. In some regions, the relatively high fitness of Typha × glauca allows it to outcompete and displace both parent species, while simultaneously reducing plant and invertebrate biodiversity, and modifying nutrient and water cycling. We generated a high-quality whole-genome assembly of T. latifolia using PacBio long-read and high coverage Illumina sequences that will facilitate evolutionary and ecological studies in this hybrid zone. Genome size was 287 Mb and consisted of 1158 scaffolds, with an N50 of 8.71 Mb; 43.84% of the genome were identified as repetitive elements. The assembly has a BUSCO score of 96.03%, and 27,432 genes and 2700 RNA sequences were putatively identified. Comparative analysis detected over 9000 shared orthologs with related taxa and phylogenomic analysis supporting T. latifolia as a divergent lineage within Poales. This high-quality scaffold-level reference genome will provide a useful resource for future population genomic analyses and improve our understanding of Typha hybrid dynamics.
Collapse
Affiliation(s)
- Shane D Widanagama
- Department of Computer Science, Trent University, Peterborough, ON K9L 0G2, Canada
| | - Joanna R Freeland
- Department of Biology, Trent University, Peterborough, ON K9L 0G2, Canada
| | - Xinwei Xu
- Department of Ecology, College of Life Sciences, Wuhan University, Wuhan 430072, China
| | - Aaron B A Shafer
- Department of Forensic Sciences, Trent University, Peterborough, ON K9L 0G2, Canada
- Corresponding author: Department of Forensic Sciences, Trent University, DNA Building, 2140 East Bank Drive, Peterborough, ON, K9L 0G2, Canada.
| |
Collapse
|
47
|
Shakya SB, Wang-Claypool CY, Cicero C, Bowie RCK, Mason NA. Neo-sex chromosome evolution and phenotypic differentiation across an elevational gradient in horned larks (Eremophila Alpestris). Mol Ecol 2022; 31:1783-1799. [PMID: 35048444 DOI: 10.1111/mec.16357] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 12/16/2021] [Accepted: 01/07/2022] [Indexed: 11/29/2022]
Abstract
Genetic structure and phenotypic variation among populations is affected by both geographic distance and environmental variation across species' distributions. Understanding the relative contributions of isolation by distance (IBD) and isolation by environment (IBE) is important for elucidating population dynamics across habitats and ecological gradients. In this study, we compared phenotypic and genetic variation among Horned Lark (Eremophila alpestris) populations from 10 sites encompassing an elevational gradient from low-elevation desert scrub in Death Valley (285 a.s.l.) to high-elevation meadows in the White Mountains of the Sierra Nevada of California (greater than 3000 m a.s.l.). Using a ddRAD dataset of 28,474 SNPs aligned to a high-quality reference genome, we compared genetic structure with elevational, environmental, and spatial distance to quantify how different aspects of the landscape drive genomic and phenotypic differentiation in Horned Larks. We found larger-bodied birds were associated with sites that had less seasonality and higher annual precipitation, and longer spurs occurred in soils with more clay and silt content, less sand, and finer fragments. Larks have large neo-sex chromosomes, and we found that associations with elevation and environmental variation were much stronger among neo-sex chromosomes compared to autosomes. Furthermore, we found that putative chromosomal translocations, fusions, and inversions were associated with elevation and may underlie local adaptation across an elevational gradient in Horned Larks. Our results suggest that genetic variation in Horned Larks is affected more by IBD than IBE, but specific phenotypes and genomic regions-particually on neo-sex chromosomes-bear stronger associations with the environment.
Collapse
Affiliation(s)
- Subir B Shakya
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | - Cynthia Y Wang-Claypool
- Museum of Vertebrate Zoology, University of California, Berkeley, California, USA.,Department of Integrative Biology, University of California, Berkeley, California, USA
| | - Carla Cicero
- Museum of Vertebrate Zoology, University of California, Berkeley, California, USA
| | - Rauri C K Bowie
- Museum of Vertebrate Zoology, University of California, Berkeley, California, USA.,Department of Integrative Biology, University of California, Berkeley, California, USA
| | - Nicholas A Mason
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA.,Museum of Vertebrate Zoology, University of California, Berkeley, California, USA
| |
Collapse
|
48
|
Wu L, Jiao X, Zhang D, Cheng Y, Song G, Qu Y, Lei F. Comparative Genomics and Evolution of Avian Specialized Traits. Curr Genomics 2021; 22:496-511. [PMID: 35386431 PMCID: PMC8905638 DOI: 10.2174/1389202923666211227143952] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Revised: 06/30/2021] [Accepted: 07/02/2021] [Indexed: 11/25/2022] Open
Abstract
Genomic data are important for understanding the origin and evolution of traits. Under the context of rapidly developing of sequencing technologies and more widely available genome sequences, researchers are able to study evolutionary mechanisms of traits via comparative genomic methods. Compared with other vertebrates, bird genomes are relatively small and exhibit conserved synteny with few repetitive elements, which makes them suitable for evolutionary studies. Increasing genomic progress has been reported on the evolution of powered flight, body size variation, beak morphology, plumage colouration, high-elevation colonization, migration, and vocalization. By summarizing previous studies, we demonstrate the genetic bases of trait evolution, highlighting the roles of small-scale sequence variation, genomic structural variation, and changes in gene interaction networks. We suggest that future studies should focus on improving the quality of reference genomes, exploring the evolution of regulatory elements and networks, and combining genomic data with morphological, ecological, behavioural, and developmental biology data.
Collapse
Affiliation(s)
- Lei Wu
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Xiaolu Jiao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Dezhi Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yalin Cheng
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Gang Song
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yanhua Qu
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Fumin Lei
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| |
Collapse
|
49
|
Vargas-Chavez C, Longo Pendy NM, Nsango SE, Aguilera L, Ayala D, González J. Transposable element variants and their potential adaptive impact in urban populations of the malaria vector Anopheles coluzzii. Genome Res 2021; 32:189-202. [PMID: 34965939 PMCID: PMC8744685 DOI: 10.1101/gr.275761.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 11/24/2021] [Indexed: 11/28/2022]
Abstract
Anopheles coluzzii is one of the primary vectors of human malaria in sub-Saharan Africa. Recently, it has spread into the main cities of Central Africa threatening vector control programs. The adaptation of An. coluzzii to urban environments partly results from an increased tolerance to organic pollution and insecticides. Some of the molecular mechanisms for ecological adaptation are known, but the role of transposable elements (TEs) in the adaptive processes of this species has not been studied yet. As a first step toward assessing the role of TEs in rapid urban adaptation, we sequenced using long reads six An. coluzzii genomes from natural breeding sites in two major Central Africa cities. We de novo annotated TEs in these genomes and in an additional high-quality An. coluzzii genome, and we identified 64 new TE families. TEs were nonrandomly distributed throughout the genome with significant differences in the number of insertions of several superfamilies across the studied genomes. We identified seven putatively active families with insertions near genes with functions related to vectorial capacity, and several TEs that may provide promoter and transcription factor binding sites to insecticide resistance and immune-related genes. Overall, the analysis of multiple high-quality genomes allowed us to generate the most comprehensive TE annotation in this species to date and identify several TE insertions that could potentially impact both genome architecture and the regulation of functionally relevant genes. These results provide a basis for future studies of the impact of TEs on the biology of An. coluzzii.
Collapse
Affiliation(s)
- Carlos Vargas-Chavez
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Neil Michel Longo Pendy
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon.,École Doctorale Régional (EDR) en Infectiologie Tropicale d'Afrique Centrale, BP 876, Franceville, Gabon
| | - Sandrine E Nsango
- Faculté de Médecine et des Sciences Pharmaceutiques, Université de Douala, BP 2701, Douala, Cameroun
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Diego Ayala
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon.,Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution et Contrôle (MIVEGEC), Université Montpellier, CNRS, IRD, 64501 Montpellier, France
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| |
Collapse
|
50
|
Poignet M, Johnson Pokorná M, Altmanová M, Majtánová Z, Dedukh D, Albrecht T, Reif J, Osiejuk TS, Reifová R. Comparison of Karyotypes in Two Hybridizing Passerine Species: Conserved Chromosomal Structure but Divergence in Centromeric Repeats. Front Genet 2021; 12:768987. [PMID: 34938317 PMCID: PMC8687609 DOI: 10.3389/fgene.2021.768987] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 11/10/2021] [Indexed: 11/23/2022] Open
Abstract
Changes in chromosomal structure involving chromosomal rearrangements or copy number variation of specific sequences can play an important role in speciation. Here, we explored the chromosomal structure of two hybridizing passerine species; the common nightingale (Luscinia megarhynchos) and the thrush nightingale (Luscinia luscinia), using conventional cytogenetic approaches, immunostaining of meiotic chromosomes, fluorescence in situ hybridization as well as comparative genomic hybridization (CGH). We found that the two nightingale species show conserved karyotypes with the same diploid chromosome number of 2n = 84. In addition to standard chromosomes, both species possessed a small germline restricted chromosome of similar size as a microchromosome. Just a few subtle changes in chromosome morphology were observed between the species, suggesting that only a limited number of chromosomal rearrangements occurred after the species divergence. The interspecific CGH experiment suggested that the two nightingale species might have diverged in centromeric repetitive sequences in most macro- and microchromosomes. In addition, some chromosomes showed changes in copy number of centromeric repeats between the species. The observation of very similar karyotypes in the two nightingale species is consistent with a generally slow rate of karyotype evolution in birds. The divergence of centromeric sequences between the two species could theoretically cause meiotic drive or reduced fertility in interspecific hybrids. Nevertheless, further studies are needed to evaluate the potential role of chromosomal structural variations in nightingale speciation.
Collapse
Affiliation(s)
- Manon Poignet
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Martina Johnson Pokorná
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
- Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
| | - Marie Altmanová
- Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
| | - Zuzana Majtánová
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
| | - Dmitry Dedukh
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
| | - Tomáš Albrecht
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
- Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic
| | - Jiří Reif
- Institute for Environmental Studies, Faculty of Science, Charles University, Prague, Czech Republic
- Department of Zoology and Laboratory of Ornithology, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - Tomasz S. Osiejuk
- Department of Behavioural Ecology, Institute of Environmental Biology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland
| | - Radka Reifová
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| |
Collapse
|