1
Merondun J, Marques CI, Andrade P, Meshcheryagina S, Galván I, Afonso S, Alves JM, Araújo PM, Bachurin G, Balacco J, Bán M, Fedrigo O, Formenti G, Fossøy F, Fülöp A, Golovatin M, Granja S, Hewson C, Honza M, Howe K, Larson G, Marton A, Moskát C, Mountcastle J, Procházka P, Red’kin Y, Sims Y, Šulc M, Tracey A, Wood JMD, Jarvis ED, Hauber ME, Carneiro M, Wolf JBW. Evolution and genetic architecture of sex-limited polymorphism in cuckoos. Sci Adv 2024; 10:eadl5255. [PMID: 38657058 PMCID: PMC11042743 DOI: 10.1126/sciadv.adl5255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/23/2023] [Accepted: 03/20/2024] [Indexed: 04/26/2024]
Abstract
Sex-limited polymorphism has evolved in many species including our own. Yet, we lack a detailed understanding of the underlying genetic variation and evolutionary processes at work. The brood parasitic common cuckoo (Cuculus canorus) is a prime example of female-limited color polymorphism, where adult males are monochromatic gray and females exhibit either gray or rufous plumage. This polymorphism has been hypothesized to be governed by negative frequency-dependent selection whereby the rarer female morph is protected against harassment by males or from mobbing by parasitized host species. Here, we show that female plumage dichromatism maps to the female-restricted genome. We further demonstrate that, consistent with balancing selection, ancestry of the rufous phenotype is shared with the likewise female dichromatic sister species, the oriental cuckoo (Cuculus optatus). This study shows that sex-specific polymorphism in trait variation can be resolved by genetic variation residing on a sex-limited chromosome and be maintained across species boundaries.
Affiliation(s)
- Justin Merondun
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
- Department of Ornithology, Max Planck Institute for Biological Intelligence, Seewiesen, Germany
- Cristiana I. Marques
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Vairão, Portugal
- Departamento de Biologia, Faculdade de Ciências da Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Pedro Andrade
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Swetlana Meshcheryagina
- Institute of Plant and Animal Ecology, Ural Branch, Russian Academy of Sciences, Yekaterinburg, Russia
- Ismael Galván
- Departamento de Ecología Evolutiva, Museo Nacional de Ciencias Naturales, CSIC, Madrid, Spain
- Sandra Afonso
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Joel M. Alves
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
- Palaeogenomics and Bio-Archaeology Research Network, School of Archaeology, University of Oxford, Oxford, OX1 3QY, UK
- Pedro M. Araújo
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Department of Life Sciences, MARE–Marine and Environmental Sciences Centre/ARNET–Aquatic Research Network, University of Coimbra, Coimbra, Portugal
- Jennifer Balacco
- The Vertebrate Genome Lab, Rockefeller University, New York, NY 10065, USA
- Miklós Bán
- HUN-REN-UD Behavioral Ecology Research Group, Department of Evolutionary Zoology and Human Biology, University of Debrecen, Debrecen, Hungary
- Olivier Fedrigo
- The Vertebrate Genome Lab, Rockefeller University, New York, NY 10065, USA
- Giulio Formenti
- The Vertebrate Genome Lab, Rockefeller University, New York, NY 10065, USA
- Frode Fossøy
- Centre for Biodiversity Genetics, Norwegian Institute for Nature Research, Trondheim, Norway
- Attila Fülöp
- HUN-REN-UD Behavioral Ecology Research Group, Department of Evolutionary Zoology and Human Biology, University of Debrecen, Debrecen, Hungary
- Evolutionary Ecology Group, Hungarian Department of Biology and Ecology, Babeş-Bolyai University, Cluj-Napoca, Romania
- STAR-UBB Institute of Advanced Studies in Science and Technology, Babeş-Bolyai University, Cluj-Napoca, Romania
- Mikhail Golovatin
- Institute of Plant and Animal Ecology, Ural Branch, Russian Academy of Sciences, Yekaterinburg, Russia
- Sofia Granja
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Palaeogenomics and Bio-Archaeology Research Network, School of Archaeology, University of Oxford, Oxford, OX1 3QY, UK
- Marcel Honza
- Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic
- Kerstin Howe
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Greger Larson
- Palaeogenomics and Bio-Archaeology Research Network, School of Archaeology, University of Oxford, Oxford, OX1 3QY, UK
- Attila Marton
- Evolutionary Ecology Group, Faculty of Biology and Geology, Babeș-Bolyai University, Cluj-Napoca, Romania
- Department of Evolutionary Zoology and Human Biology, University of Debrecen, Debrecen, Hungary
- Csaba Moskát
- Hungarian Natural History Museum, Budapest, Hungary
- Petr Procházka
- Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic
- Ying Sims
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Michal Šulc
- Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic
- Alan Tracey
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Erich D. Jarvis
- The Vertebrate Genome Lab, Rockefeller University, New York, NY 10065, USA
- Mark E. Hauber
- Advanced Science Research Center and Program in Psychology, Graduate Center of the City University of New York, New York, NY 10031, USA
- Miguel Carneiro
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Jochen B. W. Wolf
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
2
Sebastianelli M, Lukhele SM, Secomandi S, de Souza SG, Haase B, Moysi M, Nikiforou C, Hutfluss A, Mountcastle J, Balacco J, Pelan S, Chow W, Fedrigo O, Downs CT, Monadjem A, Dingemanse NJ, Jarvis ED, Brelsford A, vonHoldt BM, Kirschel ANG. A genomic basis of vocal rhythm in birds. Nat Commun 2024; 15:3095. [PMID: 38653976 DOI: 10.1038/s41467-024-47305-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2023] [Accepted: 03/22/2024] [Indexed: 04/25/2024] Open
Abstract
Vocal rhythm plays a fundamental role in sexual selection and species recognition in birds, but little is known of its genetic basis due to the confounding effect of vocal learning in model systems. Uncovering its genetic basis could facilitate identifying genes potentially important in speciation. Here we investigate the genomic underpinnings of rhythm in vocal non-learning Pogoniulus tinkerbirds using 135 individual whole genomes distributed across a southern African hybrid zone. We find rhythm speed is associated with two genes that are also known to affect human speech, Neurexin-1 and Coenzyme Q8A. Models leveraging ancestry reveal these candidate loci also impact rhythmic stability, a trait linked with motor performance which is an indicator of quality. Character displacement in rhythmic stability suggests possible reinforcement against hybridization, supported by evidence of asymmetric assortative mating in the species producing faster, more stable rhythms. Because rhythm is omnipresent in animal communication, candidate genes identified here may shape vocal rhythm across birds and other vertebrates.
Affiliation(s)
- Matteo Sebastianelli
- Department of Biological Sciences, University of Cyprus, PO Box 20537, Nicosia, 1678, Cyprus.
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, 751 23, Uppsala, Sweden.
- Sifiso M Lukhele
- Department of Biological Sciences, University of Cyprus, PO Box 20537, Nicosia, 1678, Cyprus
- Simona Secomandi
- Department of Biological Sciences, University of Cyprus, PO Box 20537, Nicosia, 1678, Cyprus
- Stacey G de Souza
- Department of Biological Sciences, University of Cyprus, PO Box 20537, Nicosia, 1678, Cyprus
- Bettina Haase
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Michaella Moysi
- Department of Biological Sciences, University of Cyprus, PO Box 20537, Nicosia, 1678, Cyprus
- Christos Nikiforou
- Department of Biological Sciences, University of Cyprus, PO Box 20537, Nicosia, 1678, Cyprus
- Alexander Hutfluss
- Behavioural Ecology, Faculty of Biology, LMU Munich (LMU), 82152, Planegg-Martinsried, Germany
- Jennifer Balacco
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Olivier Fedrigo
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Colleen T Downs
- Centre for Functional Biodiversity, School of Life Sciences, University of KwaZulu-Natal, Pietermaritzburg, 3209, South Africa
- Ara Monadjem
- Department of Biological Sciences, University of Eswatini, Kwaluseni, Eswatini
- Mammal Research Institute, Department of Zoology & Entomology, University of Pretoria, Private Bag 20, Hatfield, 0028, Pretoria, South Africa
- Niels J Dingemanse
- Behavioural Ecology, Faculty of Biology, LMU Munich (LMU), 82152, Planegg-Martinsried, Germany
- Erich D Jarvis
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Alan Brelsford
- Department of Evolution, Ecology and Organismal Biology, University of California Riverside, Riverside, CA, 92521, USA
- Bridgett M vonHoldt
- Department of Ecology & Evolutionary Biology, Princeton University, Princeton, NJ, 08544, USA
- Alexander N G Kirschel
- Department of Biological Sciences, University of Cyprus, PO Box 20537, Nicosia, 1678, Cyprus.
3
Hickey G, Monlong J, Ebler J, Novak AM, Eizenga JM, Gao Y, Marschall T, Li H, Paten B, Abel HJ, Antonacci-Fulton LL, Asri M, Baid G, Baker CA, Belyaeva A, Billis K, Bourque G, Buonaiuto S, Carroll A, Chaisson MJP, Chang PC, Chang XH, Cheng H, Chu J, Cody S, Colonna V, Cook DE, Cook-Deegan RM, Cornejo OE, Diekhans M, Doerr D, Ebert P, Ebler J, Eichler EE, Eizenga JM, Fairley S, Fedrigo O, Felsenfeld AL, Feng X, Fischer C, Flicek P, Formenti G, Frankish A, Fulton RS, Gao Y, Garg S, Garrison E, Garrison NA, Giron CG, Green RE, Groza C, Guarracino A, Haggerty L, Hall IM, Harvey WT, Haukness M, Haussler D, Heumos S, Hickey G, Hoekzema K, Hourlier T, Howe K, Jain M, Jarvis ED, Ji HP, Kenny EE, Koenig BA, Kolesnikov A, Korbel JO, Kordosky J, Koren S, Lee H, Lewis AP, Li H, Liao WW, Lu S, Lu TY, Lucas JK, Magalhães H, Marco-Sola S, Marijon P, Markello C, Marschall T, Martin FJ, McCartney A, McDaniel J, Miga KH, Mitchell MW, Monlong J, Mountcastle J, Munson KM, Mwaniki MN, Nattestad M, Novak AM, Nurk S, Olsen HE, Olson ND, Paten B, Pesout T, Phillippy AM, Popejoy AB, Porubsky D, Prins P, Puiu D, Rautiainen M, Regier AA, Rhie A, Sacco S, Sanders AD, Schneider VA, Schultz BI, Shafin K, Sibbesen JA, Sirén J, Smith MW, Sofia HJ, Tayoun ANA, Thibaud-Nissen F, Tomlinson C, Tricomi FF, Villani F, Vollger MR, Wagner J, Walenz B, Wang T, Wood JMD, Zimin AV, Zook JM. Pangenome graph construction from genome alignments with Minigraph-Cactus. Nat Biotechnol 2024; 42:663-673. [PMID: 37165083 PMCID: PMC10638906 DOI: 10.1038/s41587-023-01793-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Received: 10/06/2022] [Accepted: 04/18/2023] [Indexed: 05/12/2023]
Abstract
Pangenome references address biases of reference genomes by storing a representative set of diverse haplotypes and their alignment, usually as a graph. Alternate alleles determined by variant callers can be used to construct pangenome graphs, but advances in long-read sequencing are leading to widely available, high-quality phased assemblies. Constructing a pangenome graph directly from assemblies, as opposed to variant calls, leverages the graph's ability to represent variation at different scales. Here we present the Minigraph-Cactus pangenome pipeline, which creates pangenomes directly from whole-genome alignments, and demonstrate its ability to scale to 90 human haplotypes from the Human Pangenome Reference Consortium. The method builds graphs containing all forms of genetic variation while allowing use of current mapping and genotyping tools. We measure the effect of the quality and completeness of reference genomes used for analysis within the pangenomes and show that using the CHM13 reference from the Telomere-to-Telomere Consortium improves the accuracy of our methods. We also demonstrate construction of a Drosophila melanogaster pangenome.
Affiliation(s)
- Glenn Hickey
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
- Jean Monlong
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
- Jana Ebler
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Adam M. Novak
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Jordan M. Eizenga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Yan Gao
- Center for Computational and Genomic Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Haley J. Abel
- Division of Oncology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
- Mobin Asri
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Carl A. Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Guillaume Bourque
- Department of Human Genetics, McGill University, Montreal, QC, Canada
- Canadian Center for Computational Genomics, McGill University, Montreal, QC, Canada
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan
- Silvia Buonaiuto
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
- Mark J. P. Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
- Xian H. Chang
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Justin Chu
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Sarah Cody
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Vincenza Colonna
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Robert M. Cook-Deegan
- Arizona State University, Barrett and O’Connor Washington Center, Washington, DC, USA
- Omar E. Cornejo
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, USA
- Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Daniel Doerr
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Peter Ebert
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Core Unit Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Jana Ebler
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Jordan M. Eizenga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Adam L. Felsenfeld
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
- Xiaowen Feng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Christian Fischer
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Robert S. Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- Yan Gao
- Center for Computational and Genomic Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Shilpa Garg
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
- Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Nanibaa’ A. Garrison
- Institute for Society and Genetics, College of Letters and Science, University of California, Los Angeles, Los Angeles, CA, USA
- Institute for Precision Health, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Division of General Internal Medicine and Health Services Research, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Carlos Garcia Giron
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Richard E. Green
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA, USA
- Dovetail Genomics, Scotts Valley, CA, USA
- Cristian Groza
- Quantitative Life Sciences, McGill University, Montreal, QC, Canada
- Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Genomics Research Centre, Human Technopole, Milan, Italy
- Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Ira M. Hall
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
- William T. Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Marina Haukness
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- David Haussler
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Simon Heumos
- Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
- Biomedical Data Science, Department of Computer Science, University of Tübingen, Tübingen, Germany
- Glenn Hickey
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
- Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
- Miten Jain
- Northeastern University, Boston, MA, USA
- Erich D. Jarvis
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Hanlee P. Ji
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
- Eimear E. Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Barbara A. Koenig
- Program in Bioethics and Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, USA
- Jan O. Korbel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
- Jennifer Kordosky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- HoJoon Lee
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
- Alexandra P. Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Wen-Wei Liao
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
- Division of Biology and Biomedical Sciences, Washington University School of Medicine, St. Louis, MO, USA
- Shuangjia Lu
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Tsung-Yu Lu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
- Julian K. Lucas
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Hugo Magalhães
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Santiago Marco-Sola
- Computer Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
- Departament d’Arquitectura de Computadors i Sistemes Operatius, Universitat Autònoma de Barcelona, Barcelona, Spain
- Pierre Marijon
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Charles Markello
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Tobias Marschall
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Fergal J. Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Ann McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Jennifer McDaniel
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
- Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Jean Monlong
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
- Katherine M. Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Adam M. Novak
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Hugh E. Olsen
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Nathan D. Olson
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
- Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Trevor Pesout
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Alice B. Popejoy
- Department of Public Health Sciences, University of California, Davis, Davis, CA, USA
- David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Pjotr Prins
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Daniela Puiu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Allison A. Regier
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Samuel Sacco
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, USA
- Ashley D. Sanders
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- Valerie A. Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Baergen I. Schultz
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
- Jonas A. Sibbesen
- Center for Health Data Science, University of Copenhagen, Copenhagen, Denmark
- Jouni Sirén
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Michael W. Smith
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
- Heidi J. Sofia
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
- Ahmad N. Abou Tayoun
- Al Jalila Genomics Center of Excellence, Al Jalila Children’s Specialty Hospital, Dubai, UAE
- Center for Genomic Discovery, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, UAE
- Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Chad Tomlinson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Francesca Floriana Tricomi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Flavia Villani
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
- Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
- Brian Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Ting Wang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- Aleksey V. Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Justin M. Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
|
4
|
Bukhman YV, Meyer S, Chu LF, Abueg L, Antosiewicz-Bourget J, Balacco J, Brecht M, Dinatale E, Fedrigo O, Formenti G, Fungtammasan A, Giri SJ, Hiller M, Howe K, Kihara D, Mamott D, Mountcastle J, Pelan S, Rabbani K, Sims Y, Tracey A, Wood JMD, Jarvis ED, Thomson JA, Chaisson MJP, Stewart R. Chromosome level genome assembly of the Etruscan shrew Suncus etruscus. Sci Data 2024; 11:176. [PMID: 38326333 PMCID: PMC10850158 DOI: 10.1038/s41597-024-03011-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/28/2023] [Accepted: 01/26/2024] [Indexed: 02/09/2024] Open
Abstract
Suncus etruscus is one of the world's smallest mammals, with an average body mass of about 2 grams. The Etruscan shrew's small body is accompanied by a very high energy demand and numerous metabolic adaptations. Here we report a chromosome-level genome assembly using PacBio long read sequencing, 10X Genomics linked short reads, optical mapping, and Hi-C linked reads. The assembly is partially phased, with the 2.472 Gbp primary pseudohaplotype and 1.515 Gbp alternate. We manually curated the primary assembly and identified 22 chromosomes, including X and Y sex chromosomes. The NCBI genome annotation pipeline identified 39,091 genes, 19,819 of them protein-coding. We also identified segmental duplications, inferred GO term annotations, and computed orthologs of human and mouse genes. This reference-quality genome will be an important resource for research on mammalian development, metabolism, and body size control.
Affiliation(s)
- Yury V Bukhman
- Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI, 53715, USA.
| | - Susanne Meyer
- Neuroscience Research Institute, University of California - Santa Barbara, 494 UCEN Rd, Isla Vista, CA, 93117, USA
| | - Li-Fang Chu
- Department of Comparative Biology and Experimental Medicine, University of Calgary, 2500 University Drive NW, Calgary, Alberta, T2N 1N4, Canada
| | - Linelle Abueg
- Vertebrate Genome Lab, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
| | | | - Jennifer Balacco
- Vertebrate Genome Lab, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
| | - Michael Brecht
- BCCN/Humboldt University Berlin, Philippstr, 13 House 6, 10115, Berlin, Germany
| | - Erica Dinatale
- Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076, Tübingen, Germany
| | - Olivier Fedrigo
- Vertebrate Genome Lab, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
| | - Giulio Formenti
- Laboratory of Neurogenetics of Language, The Rockefeller University/HHMI, 1230 York Avenue, New York, NY, 10065, USA
| | | | - Swagarika Jaharlal Giri
- Department of Computer Science, Purdue University, 249 S. Martin Jischke Dr, West Lafayette, IN, 47907, USA
| | - Michael Hiller
- LOEWE Centre for Translational Biodiversity Genomics, Senckenberganlage 25, 60325, Frankfurt, Germany
- Senckenberg Research Institute, Senckenberganlage 25, 60325, Frankfurt, Germany
- Institute of Cell Biology and Neuroscience, Faculty of Biosciences, Goethe University Frankfurt, Max-von-Laue-Str. 9, 60438, Frankfurt, Germany
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, 249 S. Martin Jischke Dr, West Lafayette, IN, 47907, USA
- Department of Biological Sciences, Purdue University, 249 S. Martin Jischke Dr., West Lafayette, IN, 47907, USA
| | - Daniel Mamott
- Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI, 53715, USA
| | - Jacquelyn Mountcastle
- Vertebrate Genome Lab, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
| | - Sarah Pelan
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Keon Rabbani
- Department of Quantitative and Computational Biology, University of Southern California, 1050 Childs Way RRI 408, Los Angeles, CA, 90089, USA
| | - Ying Sims
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Alan Tracey
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | | | - Erich D Jarvis
- Vertebrate Genome Lab, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University/HHMI, 1230 York Avenue, New York, NY, 10065, USA
| | - James A Thomson
- Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI, 53715, USA
- Department of Molecular, Cellular and Developmental Biology, University of California Santa Barbara, Santa Barbara, CA, 93106, USA
- Department of Cell and Regenerative Biology, University of Wisconsin School of Medicine and Public Health, Madison, WI, 53726, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, 1050 Childs Way RRI 408, Los Angeles, CA, 90089, USA
| | - Ron Stewart
- Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI, 53715, USA
|
5
|
Sendell-Price AT, Tulenko FJ, Pettersson M, Kang D, Montandon M, Winkler S, Kulb K, Naylor GP, Phillippy A, Fedrigo O, Mountcastle J, Balacco JR, Dutra A, Dale RE, Haase B, Jarvis ED, Myers G, Burgess SM, Currie PD, Andersson L, Schartl M. Low mutation rate in epaulette sharks is consistent with a slow rate of evolution in sharks. Nat Commun 2023; 14:6628. [PMID: 37857613 PMCID: PMC10587355 DOI: 10.1038/s41467-023-42238-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/31/2023] [Accepted: 10/03/2023] [Indexed: 10/21/2023] Open
Abstract
Sharks occupy diverse ecological niches and play critical roles in marine ecosystems, often acting as apex predators. They are considered a slow-evolving lineage and have been suggested to exhibit exceptionally low cancer rates. These two features could be explained by a low nuclear mutation rate. Here, we provide a direct estimate of the nuclear mutation rate in the epaulette shark (Hemiscyllium ocellatum). We generate a high-quality reference genome, and resequence the whole genomes of parents and nine offspring to detect de novo mutations. Using stringent criteria, we estimate a mutation rate of 7 × 10⁻¹⁰ per base pair, per generation. This represents one of the lowest directly estimated mutation rates for any vertebrate clade, indicating that this basal vertebrate group is indeed a slowly evolving lineage whose ability to restore genetic diversity following a sustained population bottleneck may be hampered by a low mutation rate.
Affiliation(s)
- Ashley T Sendell-Price
- Department of Medical Biochemistry and Microbiology, Uppsala University, SE75123, Uppsala, Sweden
- Bioinformatics Research Technology Platform, University of Warwick, Coventry, UK
| | - Frank J Tulenko
- Australian Regenerative Medicine Institute, Monash University, Victoria, 3800, Australia
| | - Mats Pettersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, SE75123, Uppsala, Sweden
| | - Du Kang
- The Xiphophorus Genetic Stock Center, Department of Chemistry and Biochemistry, Texas State University, San Marcos, TX, 78666, USA
| | - Margo Montandon
- Australian Regenerative Medicine Institute, Monash University, Victoria, 3800, Australia
| | - Sylke Winkler
- Max-Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
| | - Kathleen Kulb
- Max-Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
| | - Gavin P Naylor
- Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
| | - Adam Phillippy
- Translational and Functional Genomics Branch, National Human Genome Research Institute, National Institutes of Health Bethesda, Bethesda, MD, 20892, USA
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, Rockefeller University, New York, NY, 10065, USA
| | - Jacquelyn Mountcastle
- Research Center for Genomic and Computational Biology, Duke University, Durham, NC, 27708, USA
| | - Jennifer R Balacco
- Research Center for Genomic and Computational Biology, Duke University, Durham, NC, 27708, USA
| | - Amalia Dutra
- Cytogenetics and Microscopy Core, National Human Genome Research Institute, National Institutes of Health Bethesda, Bethesda, MD, 20892, USA
| | - Rebecca E Dale
- Australian Regenerative Medicine Institute, Monash University, Victoria, 3800, Australia
| | - Bettina Haase
- Vertebrate Genome Laboratory, Rockefeller University, New York, NY, 10065, USA
| | - Erich D Jarvis
- Vertebrate Genome Laboratory, Rockefeller University, New York, NY, 10065, USA
| | - Gene Myers
- Max-Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
- Center of Systems Biology Dresden, 01307, Dresden, Germany
| | - Shawn M Burgess
- Translational and Functional Genomics Branch, National Human Genome Research Institute, National Institutes of Health Bethesda, Bethesda, MD, 20892, USA.
| | - Peter D Currie
- Australian Regenerative Medicine Institute, Monash University, Victoria, 3800, Australia.
- EMBL Australia, Victorian Node, Monash University, Clayton, Victoria, 3800, Australia.
| | - Leif Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, SE75123, Uppsala, Sweden.
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77483, USA.
| | - Manfred Schartl
- Developmental Biochemistry, Theodor-Boveri Institute, Biocenter, University of Würzburg, 97074, Würzburg, Germany.
|
6
|
Bond DM, Ortega-Recalde O, Laird MK, Hayakawa T, Richardson KS, Reese FCB, Kyle B, McIsaac-Williams BE, Robertson BC, van Heezik Y, Adams AL, Chang WS, Haase B, Mountcastle J, Driller M, Collins J, Howe K, Go Y, Thibaud-Nissen F, Lister NC, Waters PD, Fedrigo O, Jarvis ED, Gemmell NJ, Alexander A, Hore TA. The admixed brushtail possum genome reveals invasion history in New Zealand and novel imprinted genes. Nat Commun 2023; 14:6364. [PMID: 37848431 PMCID: PMC10582058 DOI: 10.1038/s41467-023-41784-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2022] [Accepted: 09/13/2023] [Indexed: 10/19/2023] Open
Abstract
Combining genome assembly with population and functional genomics can provide valuable insights to development and evolution, as well as tools for species management. Here, we present a chromosome-level genome assembly of the common brushtail possum (Trichosurus vulpecula), a model marsupial threatened in parts of their native range in Australia, but also a major introduced pest in New Zealand. Functional genomics reveals post-natal activation of chemosensory and metabolic genes, reflecting unique adaptations to altricial birth and delayed weaning, a hallmark of marsupial development. Nuclear and mitochondrial analyses trace New Zealand possums to distinct Australian subspecies, which have subsequently hybridised. This admixture allowed phasing of parental alleles genome-wide, ultimately revealing at least four genes with imprinted, parent-specific expression not yet detected in other species (MLH1, EPM2AIP1, UBP1 and GPX7). We find that reprogramming of possum germline imprints, and the wider epigenome, is similar to eutherian mammals except onset occurs after birth. Together, this work is useful for genetic-based control and conservation of possums, and contributes to understanding of the evolution of novel mammalian epigenetic traits.
Affiliation(s)
- Donna M Bond
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| | | | - Melanie K Laird
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| | - Takashi Hayakawa
- Faculty of Environmental Earth Science, Hokkaido University, Sapporo, Hokkaido, 060-0808, Japan
| | - Kyle S Richardson
- Department of Anatomy, University of Otago, Dunedin, New Zealand
- Biology Department, University of Montana Western, Dillon, MT, 59725, USA
| | - Finlay C B Reese
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| | - Bruce Kyle
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| | | | | | | | - Amy L Adams
- Department of Zoology, University of Otago, Dunedin, New Zealand
| | - Wei-Shan Chang
- School of Life and Environmental Science, Faculty of Science, The University of Sydney, Sydney, NSW, Australia
- Health and Biosecurity, CSIRO, Canberra, ACT, Australia
| | - Bettina Haase
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | | | | | - Joanna Collins
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
| | - Yasuhiro Go
- Graduate School of Information Science, Hyogo University, Hyogo, Japan
- Cognitive Genomics Research Group, Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of Natural Sciences, Aichi, Japan
- Department of System Neuroscience, National Institute for Physiological Sciences, Aichi, Japan
| | - Francoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Nicholas C Lister
- School of Biotechnology and Biomolecular Science, Faculty of Science, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - Paul D Waters
- School of Biotechnology and Biomolecular Science, Faculty of Science, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Erich D Jarvis
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, 10065, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA
| | - Neil J Gemmell
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| | - Alana Alexander
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| | - Timothy A Hore
- Department of Anatomy, University of Otago, Dunedin, New Zealand.
|
7
|
Liao WW, Asri M, Ebler J, Doerr D, Haukness M, Hickey G, Lu S, Lucas JK, Monlong J, Abel HJ, Buonaiuto S, Chang XH, Cheng H, Chu J, Colonna V, Eizenga JM, Feng X, Fischer C, Fulton RS, Garg S, Groza C, Guarracino A, Harvey WT, Heumos S, Howe K, Jain M, Lu TY, Markello C, Martin FJ, Mitchell MW, Munson KM, Mwaniki MN, Novak AM, Olsen HE, Pesout T, Porubsky D, Prins P, Sibbesen JA, Sirén J, Tomlinson C, Villani F, Vollger MR, Antonacci-Fulton LL, Baid G, Baker CA, Belyaeva A, Billis K, Carroll A, Chang PC, Cody S, Cook DE, Cook-Deegan RM, Cornejo OE, Diekhans M, Ebert P, Fairley S, Fedrigo O, Felsenfeld AL, Formenti G, Frankish A, Gao Y, Garrison NA, Giron CG, Green RE, Haggerty L, Hoekzema K, Hourlier T, Ji HP, Kenny EE, Koenig BA, Kolesnikov A, Korbel JO, Kordosky J, Koren S, Lee H, Lewis AP, Magalhães H, Marco-Sola S, Marijon P, McCartney A, McDaniel J, Mountcastle J, Nattestad M, Nurk S, Olson ND, Popejoy AB, Puiu D, Rautiainen M, Regier AA, Rhie A, Sacco S, Sanders AD, Schneider VA, Schultz BI, Shafin K, Smith MW, Sofia HJ, Abou Tayoun AN, Thibaud-Nissen F, Tricomi FF, Wagner J, Walenz B, Wood JMD, Zimin AV, Bourque G, Chaisson MJP, Flicek P, Phillippy AM, Zook JM, Eichler EE, Haussler D, Wang T, Jarvis ED, Miga KH, Garrison E, Marschall T, Hall IM, Li H, Paten B. A draft human pangenome reference. Nature 2023; 617:312-324. [PMID: 37165242 PMCID: PMC10172123 DOI: 10.1038/s41586-023-05896-x] [Citation(s) in RCA: 170] [Impact Index Per Article: 170.0] [Received: 07/09/2022] [Accepted: 02/28/2023] [Indexed: 05/12/2023]
Abstract
Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.
Affiliation(s)
- Wen-Wei Liao
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
- Division of Biology and Biomedical Sciences, Washington University School of Medicine, St. Louis, MO, USA
| | - Mobin Asri
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Jana Ebler
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Daniel Doerr
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Marina Haukness
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Glenn Hickey
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Shuangjia Lu
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
| | - Julian K Lucas
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Jean Monlong
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Haley J Abel
- Division of Oncology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Silvia Buonaiuto
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
| | - Xian H Chang
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Justin Chu
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Vincenza Colonna
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Jordan M Eizenga
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Xiaowen Feng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Christian Fischer
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Robert S Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Shilpa Garg
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
| | - Cristian Groza
- Quantitative Life Sciences, McGill University, Montréal, Québec, Canada
| | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Genomics Research Centre, Human Technopole, Milan, Italy
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Simon Heumos
- Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
- Biomedical Data Science, Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
| | - Miten Jain
- Northeastern University, Boston, MA, USA
| | - Tsung-Yu Lu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Charles Markello
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | | | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Adam M Novak
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Hugh E Olsen
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Trevor Pesout
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Pjotr Prins
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Jonas A Sibbesen
- Center for Health Data Science, University of Copenhagen, Copenhagen, Denmark
| | - Jouni Sirén
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Chad Tomlinson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Flavia Villani
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
| | | | | | - Carl A Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | | | | | - Sarah Cody
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | | | - Robert M Cook-Deegan
- Barrett and O'Connor Washington Center, Arizona State University, Washington, DC, USA
| | - Omar E Cornejo
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA
| | - Mark Diekhans
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Peter Ebert
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
- Core Unit Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
| | - Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Adam L Felsenfeld
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Yan Gao
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Nanibaa' A Garrison
- Institute for Society and Genetics, College of Letters and Science, University of California, Los Angeles, CA, USA
- Institute for Precision Health, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
- Division of General Internal Medicine and Health Services Research, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
| | - Carlos Garcia Giron
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Richard E Green
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
- Dovetail Genomics, Scotts Valley, CA, USA
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Hanlee P Ji
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Barbara A Koenig
- Program in Bioethics and Institute for Human Genetics, University of California, San Francisco, CA, USA
| | | | - Jan O Korbel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Jennifer Kordosky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - HoJoon Lee
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Hugo Magalhães
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Santiago Marco-Sola
- Computer Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
- Departament d'Arquitectura de Computadors i Sistemes Operatius, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Pierre Marijon
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Ann McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jennifer McDaniel
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | | | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Nathan D Olson
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Alice B Popejoy
- Department of Public Health Sciences, University of California, Davis, CA, USA
| | - Daniela Puiu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Allison A Regier
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Samuel Sacco
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA
| | - Ashley D Sanders
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Baergen I Schultz
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | | | - Michael W Smith
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | - Heidi J Sofia
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | - Ahmad N Abou Tayoun
- Al Jalila Genomics Center of Excellence, Al Jalila Children's Specialty Hospital, Dubai, UAE
- Center for Genomic Discovery, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, UAE
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Francesca Floriana Tricomi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Brian Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Aleksey V Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Guillaume Bourque
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
- Canadian Center for Computational Genomics, McGill University, Montréal, Québec, Canada
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Justin M Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - David Haussler
- Genomics Institute, University of California, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Ting Wang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Erich D Jarvis
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Karen H Miga
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA.
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany.
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany.
| | - Ira M Hall
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA.
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA.
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| | - Benedict Paten
- Genomics Institute, University of California, Santa Cruz, CA, USA.
|
8
|
Timoshevskaya N, Eşkut KI, Timoshevskiy VA, Robb SMC, Holt C, Hess JE, Parker HJ, Baker CF, Miller AK, Saraceno C, Yandell M, Krumlauf R, Narum SR, Lampman RT, Gemmell NJ, Mountcastle J, Haase B, Balacco JR, Formenti G, Pelan S, Sims Y, Howe K, Fedrigo O, Jarvis ED, Smith JJ. An improved germline genome assembly for the sea lamprey Petromyzon marinus illuminates the evolution of germline-specific chromosomes. Cell Rep 2023; 42:112263. [PMID: 36930644 PMCID: PMC10166183 DOI: 10.1016/j.celrep.2023.112263] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Received: 06/07/2022] [Revised: 10/17/2022] [Accepted: 02/28/2023] [Indexed: 03/17/2023]
Abstract
Programmed DNA loss is a gene silencing mechanism that is employed by several vertebrate and nonvertebrate lineages, including all living jawless vertebrates and songbirds. Reconstructing the evolution of somatically eliminated (germline-specific) sequences in these species has proven challenging due to a high content of repeats and gene duplications in eliminated sequences and a corresponding lack of highly accurate and contiguous assemblies for these regions. Here, we present an improved assembly of the sea lamprey (Petromyzon marinus) genome that was generated using recently standardized methods that increase the contiguity and accuracy of vertebrate genome assemblies. This assembly resolves highly contiguous, somatically retained chromosomes and at least one germline-specific chromosome, permitting new analyses that reconstruct the timing, mode, and repercussions of recruitment of genes to the germline-specific fraction. These analyses reveal major roles of interchromosomal segmental duplication, intrachromosomal duplication, and positive selection for germline functions in the long-term evolution of germline-specific chromosomes.
Affiliation(s)
| | - Kaan I Eşkut
- Department of Biology, University of Kentucky, Lexington, KY 40506, USA
| | | | - Sofia M C Robb
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Carson Holt
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Jon E Hess
- Columbia River Inter-Tribal Fish Commission, Portland, OR 97232, USA
| | - Hugo J Parker
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Cindy F Baker
- National Institute of Water and Atmospheric Research Limited (NIWA), Hamilton, Waikato 3261, New Zealand
| | - Allison K Miller
- Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, Otago 9054, New Zealand
| | - Cody Saraceno
- Department of Biology, University of Kentucky, Lexington, KY 40506, USA
| | - Mark Yandell
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Robb Krumlauf
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA; Department of Anatomy & Cell Biology, The University of Kansas School of Medicine, Kansas City, KS 66160, USA
| | - Shawn R Narum
- Columbia River Inter-Tribal Fish Commission, Hagerman, ID 83332, USA
| | - Ralph T Lampman
- Yakama Nation Fisheries Resource Management Program, Pacific Lamprey Project, Toppenish, WA 98948, USA
| | - Neil J Gemmell
- Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, Otago 9054, New Zealand
| | | | - Bettina Haase
- Vertebrate Genome Lab, The Rockefeller University, New York, NY 10065, USA
| | - Jennifer R Balacco
- Vertebrate Genome Lab, The Rockefeller University, New York, NY 10065, USA
| | - Giulio Formenti
- Vertebrate Genome Lab, The Rockefeller University, New York, NY 10065, USA; Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY 10065, USA
| | - Sarah Pelan
- Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK
| | - Ying Sims
- Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK
| | - Olivier Fedrigo
- Vertebrate Genome Lab, The Rockefeller University, New York, NY 10065, USA
| | - Erich D Jarvis
- Vertebrate Genome Lab, The Rockefeller University, New York, NY 10065, USA; Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY 10065, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Jeramiah J Smith
- Department of Biology, University of Kentucky, Lexington, KY 40506, USA.
|
9
|
Secomandi S, Gallo GR, Sozzoni M, Iannucci A, Galati E, Abueg L, Balacco J, Caprioli M, Chow W, Ciofi C, Collins J, Fedrigo O, Ferretti L, Fungtammasan A, Haase B, Howe K, Kwak W, Lombardo G, Masterson P, Messina G, Møller AP, Mountcastle J, Mousseau TA, Ferrer Obiol J, Olivieri A, Rhie A, Rubolini D, Saclier M, Stanyon R, Stucki D, Thibaud-Nissen F, Torrance J, Torroni A, Weber K, Ambrosini R, Bonisoli-Alquati A, Jarvis ED, Gianfranceschi L, Formenti G. A chromosome-level reference genome and pangenome for barn swallow population genomics. Cell Rep 2023; 42:111992. [PMID: 36662619 PMCID: PMC10044405 DOI: 10.1016/j.celrep.2023.111992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/09/2022] [Revised: 07/20/2022] [Accepted: 01/04/2023] [Indexed: 01/20/2023]
Abstract
Insights into the evolution of non-model organisms are limited by the lack of reference genomes of high accuracy, completeness, and contiguity. Here, we present a chromosome-level, karyotype-validated reference genome and pangenome for the barn swallow (Hirundo rustica). We complement these resources with a reference-free multialignment of the reference genome with other bird genomes and with the most comprehensive catalog of genetic markers for the barn swallow. We identify potentially conserved and accelerated genes using the multialignment and estimate genome-wide linkage disequilibrium using the catalog. We use the pangenome to infer core and accessory genes and to detect variants using it as a reference. Overall, these resources will foster population genomics studies in the barn swallow, enable detection of candidate genes in comparative genomics studies, and help reduce bias toward a single reference genome.
Affiliation(s)
- Simona Secomandi
- Department of Biosciences, University of Milan, Milan, Italy; Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
| | - Guido R Gallo
- Department of Biosciences, University of Milan, Milan, Italy
| | | | - Alessio Iannucci
- Department of Biology, University of Florence, Sesto Fiorentino (FI), Italy
| | - Elena Galati
- Department of Biosciences, University of Milan, Milan, Italy
| | - Linelle Abueg
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Jennifer Balacco
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Manuela Caprioli
- Department of Environmental Sciences and Policy, University of Milan, Milan, Italy
| | | | - Claudio Ciofi
- Department of Biology, University of Florence, Sesto Fiorentino (FI), Italy
| | | | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Luca Ferretti
- Department of Biology and Biotechnology "L. Spallanzani", University of Pavia, Pavia, Italy
| | | | - Bettina Haase
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | | | - Woori Kwak
- Department of Medical and Biological Sciences, The Catholic University of Korea, Bucheon 14662, Korea
| | - Gianluca Lombardo
- Department of Biology and Biotechnology "L. Spallanzani", University of Pavia, Pavia, Italy
| | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | - Anders P Møller
- Ecologie Systématique Evolution, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Orsay Cedex, France
| | | | - Timothy A Mousseau
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA
| | - Joan Ferrer Obiol
- Department of Environmental Sciences and Policy, University of Milan, Milan, Italy
| | - Anna Olivieri
- Department of Biology and Biotechnology "L. Spallanzani", University of Pavia, Pavia, Italy
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Diego Rubolini
- Department of Environmental Sciences and Policy, University of Milan, Milan, Italy
| | | | - Roscoe Stanyon
- Department of Biology, University of Florence, Sesto Fiorentino (FI), Italy
| | | | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | - Antonio Torroni
- Department of Biology and Biotechnology "L. Spallanzani", University of Pavia, Pavia, Italy
| | | | - Roberto Ambrosini
- Department of Environmental Sciences and Policy, University of Milan, Milan, Italy
| | - Andrea Bonisoli-Alquati
- Department of Biological Sciences, California State Polytechnic University - Pomona, Pomona, CA, USA
| | - Erich D Jarvis
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA; The Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | | | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA.
|
10
|
Smith J, Alfieri JM, Anthony N, Arensburger P, Athrey GN, Balacco J, Balic A, Bardou P, Barela P, Bigot Y, Blackmon H, Borodin PM, Carroll R, Casono MC, Charles M, Cheng H, Chiodi M, Cigan L, Coghill LM, Crooijmans R, Das N, Davey S, Davidian A, Degalez F, Dekkers JM, Derks M, Diack AB, Djikeng A, Drechsler Y, Dyomin A, Fedrigo O, Fiddaman SR, Formenti G, Frantz LAF, Fulton JE, Gaginskaya E, Galkina S, Gallardo RA, Geibel J, Gheyas AA, Godinez CJP, Goodell A, Graves JAM, Griffin DK, Haase B, Han JL, Hanotte O, Henderson LJ, Hou ZC, Howe K, Huynh L, Ilatsia E, Jarvis ED, Johnson SM, Kaufman J, Kelly T, Kemp S, Kern C, Keroack JH, Klopp C, Lagarrigue S, Lamont SJ, Lange M, Lanke A, Larkin DM, Larson G, Layos JKN, Lebrasseur O, Malinovskaya LP, Martin RJ, Martin Cerezo ML, Mason AS, McCarthy FM, McGrew MJ, Mountcastle J, Muhonja CK, Muir W, Muret K, Murphy TD, Ng'ang'a I, Nishibori M, O'Connor RE, Ogugo M, Okimoto R, Ouko O, Patel HR, Perini F, Pigozzi MI, Potter KC, Price PD, Reimer C, Rice ES, Rocos N, Rogers TF, Saelao P, Schauer J, Schnabel RD, Schneider VA, Simianer H, Smith A, Stevens MP, Stiers K, Tiambo CK, Tixier-Boichard M, Torgasheva AA, Tracey A, Tregaskes CA, Vervelde L, Wang Y, Warren WC, Waters PD, Webb D, Weigend S, Wolc A, Wright AE, Wright D, Wu Z, Yamagata M, Yang C, Yin ZT, Young MC, Zhang G, Zhao B, Zhou H. Fourth Report on Chicken Genes and Chromosomes 2022. Cytogenet Genome Res 2023; 162:405-528. [PMID: 36716736 DOI: 10.1159/000529376] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 01/16/2023] [Accepted: 01/22/2023] [Indexed: 02/01/2023]
Affiliation(s)
- Jacqueline Smith
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - James M Alfieri
- Interdisciplinary Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, Texas, USA
- Department of Biology, Texas A&M University, College Station, Texas, USA
- Department of Poultry Science, Texas A&M University, College Station, Texas, USA
| | | | - Peter Arensburger
- Biological Sciences Department, California State Polytechnic University, Pomona, California, USA
| | - Giridhar N Athrey
- Interdisciplinary Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, Texas, USA
- Department of Poultry Science, Texas A&M University, College Station, Texas, USA
| | | | - Adam Balic
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - Philippe Bardou
- Université de Toulouse, INRAE, ENVT, GenPhySE, Sigenae, Castanet Tolosan, France
| | | | - Yves Bigot
- PRC, UMR INRAE 0085, CNRS 7247, Centre INRAE Val de Loire, Nouzilly, France
| | - Heath Blackmon
- Interdisciplinary Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, Texas, USA
- Department of Biology, Texas A&M University, College Station, Texas, USA
| | - Pavel M Borodin
- Department of Molecular Genetics, Cell Biology and Bioinformatics, Institute of Cytology and Genetics of Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russian Federation
| | - Rachel Carroll
- Department of Animal Sciences, Data Science and Informatics Institute, University of Missouri, Columbia, Missouri, USA
| | | | - Mathieu Charles
- University Paris-Saclay, INRAE, AgroParisTech, GABI, Sigenae, Jouy-en-Josas, France
| | - Hans Cheng
- USDA, ARS, USNPRC, Avian Disease and Oncology Laboratory, East Lansing, Michigan, USA
| | | | | | - Lyndon M Coghill
- Department of Veterinary Pathology, University of Missouri, Columbia, Missouri, USA
| | - Richard Crooijmans
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, The Netherlands
| | | | - Sean Davey
- University of Arizona, Tucson, Arizona, USA
| | - Asya Davidian
- Saint Petersburg State University, Saint Petersburg, Russian Federation
| | - Fabien Degalez
- INRAE, INSTITUT AGRO, PEGASE UMR 1348, Saint-Gilles, France
| | - Jack M Dekkers
- Feed the Future Innovation Lab for Genomics to Improve Poultry, University of California, Davis, California, USA
- Department of Animal Science, Iowa State University, Ames, Iowa, USA
| | - Martijn Derks
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, The Netherlands
| | - Abigail B Diack
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - Appolinaire Djikeng
- Centre for Tropical Livestock Genetics and Health (CTLGH) - The Roslin Institute, Edinburgh, UK
| | - Yvonne Drechsler
- College of Veterinary Medicine, Western University of Health Sciences, Pomona, California, USA
| | - Alexander Dyomin
- Saint Petersburg State University, Saint Petersburg, Russian Federation
| | | | | | | | - Laurent A F Frantz
- Queen Mary University of London, Bethnal Green, London, UK
- Palaeogenomics Group, Department of Veterinary Sciences, LMU Munich, Munich, Germany
| | - Janet E Fulton
- Hy-Line International, Research and Development, Dallas Center, Iowa, USA
| | - Elena Gaginskaya
- Saint Petersburg State University, Saint Petersburg, Russian Federation
| | - Svetlana Galkina
- Saint Petersburg State University, Saint Petersburg, Russian Federation
| | - Rodrigo A Gallardo
- Feed the Future Innovation Lab for Genomics to Improve Poultry, University of California, Davis, California, USA
- School of Veterinary Medicine, University of California, Davis, California, USA
| | - Johannes Geibel
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, Neustadt, Germany
- Center for Integrated Breeding Research, University of Göttingen, Göttingen, Germany
| | - Almas A Gheyas
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - Cyrill John P Godinez
- Department of Animal Science, College of Agriculture and Food Science, Visayas State University, Baybay City, Philippines
| | | | - Jennifer A M Graves
- Department of Environment and Genetics, La Trobe University, Melbourne, Victoria, Australia
- Institute for Applied Ecology, University of Canberra, Canberra, Australian Capital Territory, Australia
| | | | | | - Jian-Lin Han
- CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
- International Livestock Research Institute (ILRI), Addis Ababa, Ethiopia
| | - Olivier Hanotte
- International Livestock Research Institute (ILRI), Addis Ababa, Ethiopia
- Cells, Organisms and Molecular Genetics, School of Life Sciences, University of Nottingham, Nottingham, UK
- Centre for Tropical Livestock Genetics and Health, The Roslin Institute, Edinburgh, UK
| | - Lindsay J Henderson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - Zhuo-Cheng Hou
- National Engineering Laboratory for Animal Breeding and Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | | | - Lan Huynh
- Institute for Immunology and Infection Research, University of Edinburgh, Edinburgh, UK
| | - Evans Ilatsia
- Dairy Research Institute, Kenya Agricultural and Livestock Organization, Naivasha, Kenya
| | | | | | - Jim Kaufman
- Institute for Immunology and Infection Research, University of Edinburgh, Edinburgh, UK
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
- Department of Pathology, University of Cambridge, Cambridge, UK
| | - Terra Kelly
- Feed the Future Innovation Lab for Genomics to Improve Poultry, University of California, Davis, California, USA
- School of Veterinary Medicine, University of California, Davis, California, USA
| | - Steve Kemp
- Centre for Tropical Livestock Genetics and Health (CTLGH) - ILRI, Nairobi, Kenya
| | - Colin Kern
- Department of Animal Science, University of California, Davis, California, USA
| | | | | | | | - Susan J Lamont
- Feed the Future Innovation Lab for Genomics to Improve Poultry, University of California, Davis, California, USA
- Department of Animal Science, Iowa State University, Ames, Iowa, USA
| | - Margaret Lange
- Department of Molecular Microbiology and Immunology, University of Missouri, Columbia, Missouri, USA
| | - Anika Lanke
- BASIS Chandler High School, Chandler, Arizona, USA
| | - Denis M Larkin
- Department of Comparative Biomedical Sciences, Royal Veterinary College, University of London, London, UK
| | - Greger Larson
- The Palaeogenomics and Bio-Archaeology Research Network, Research Laboratory for Archaeology and History of Art, The University of Oxford, Oxford, UK
| | - John King N Layos
- College of Agriculture and Forestry, Capiz State University, Mambusao, Philippines
| | - Ophélie Lebrasseur
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Toulouse III Paul Sabatier, Toulouse, France
- Instituto Nacional de Antropología y Pensamiento Latinoamericano, Ciudad Autónoma de Buenos Aires, Argentina
| | - Lyubov P Malinovskaya
- Department of Cytology and Genetics, Novosibirsk State University, Novosibirsk, Russian Federation
| | | | | | | | | | - Michael J McGrew
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
- Centre for Tropical Livestock Genetics and Health (CTLGH) - The Roslin Institute, Edinburgh, UK
| | | | - Christine Kamidi Muhonja
- Dairy Research Institute, Kenya Agricultural and Livestock Organization, Naivasha, Kenya
- Centre for Tropical Livestock Genetics and Health (CTLGH) - ILRI, Nairobi, Kenya
| | - William Muir
- Department of Animal Sciences, Purdue University, West Lafayette, Indiana, USA
| | - Kévin Muret
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, France
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | | | - Masahide Nishibori
- Laboratory of Animal Genetics, Graduate School of Integrated Sciences for Life, Hiroshima University, Higashi-Hiroshima, Japan
| | | | - Moses Ogugo
- Centre for Tropical Livestock Genetics and Health (CTLGH) - ILRI, Nairobi, Kenya
| | - Ron Okimoto
- Cobb-Vantress, Siloam Springs, Arkansas, USA
| | - Ochieng Ouko
- Dairy Research Institute, Kenya Agricultural and Livestock Organization, Naivasha, Kenya
| | - Hardip R Patel
- The John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Francesco Perini
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
- Department of Agricultural, Food and Environmental Sciences, University of Perugia, Perugia, Italy
| | - María Ines Pigozzi
- INBIOMED (CONICET-UBA), Facultad de Medicina, Universidad de Buenos Aires, Buenos Aires, Argentina
| | | | - Peter D Price
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield, UK
| | - Christian Reimer
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, Neustadt, Germany
| | - Edward S Rice
- Department of Animal Sciences, Bond Life Sciences Center, University of Missouri, Columbia, Missouri, USA
| | - Nicolas Rocos
- Institute for Immunology and Infection Research, University of Edinburgh, Edinburgh, UK
| | - Thea F Rogers
- Department of Molecular Evolution and Development, University of Vienna, Vienna, Austria
| | - Perot Saelao
- Feed the Future Innovation Lab for Genomics to Improve Poultry, University of California, Davis, California, USA
- Department of Animal Science, University of California, Davis, California, USA
- Veterinary Pest Genetics Research Unit, USDA, Kerrville, Texas, USA
| | - Jens Schauer
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, Neustadt, Germany
| | - Robert D Schnabel
- Department of Animal Sciences, University of Missouri, Columbia, Missouri, USA
| | - Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Henner Simianer
- Center for Integrated Breeding Research, University of Göttingen, Göttingen, Germany
| | - Adrian Smith
- Department of Zoology, University of Oxford, Oxford, UK
| | - Mark P Stevens
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - Kyle Stiers
- Department of Veterinary Pathology, University of Missouri, Columbia, Missouri, USA
| | | | | | - Anna A Torgasheva
- Department of Molecular Genetics, Cell Biology and Bioinformatics, Institute of Cytology and Genetics of Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russian Federation
| | - Alan Tracey
- Wellcome Trust Sanger Institute, Hinxton, UK
| | - Clive A Tregaskes
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
- Department of Pathology, University of Cambridge, Cambridge, UK
| | - Lonneke Vervelde
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - Ying Wang
- Feed the Future Innovation Lab for Genomics to Improve Poultry, University of California, Davis, California, USA
- Department of Animal Science, University of California, Davis, California, USA
| | - Wesley C Warren
- Department of Animal Sciences, Bond Life Sciences Center, University of Missouri, Columbia, Missouri, USA
- Department of Animal Sciences, University of Missouri, Columbia, Missouri, USA
| | - Paul D Waters
- School of Biotechnology and Biomolecular Science, Faculty of Science, UNSW Sydney, Sydney, New South Wales, Australia
| | - David Webb
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Steffen Weigend
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, Neustadt, Germany
- Center for Integrated Breeding Research, University of Göttingen, Göttingen, Germany
| | - Anna Wolc
- Department of Animal Science, Iowa State University, Ames, Iowa, USA
- Hy-Line International, Research and Development, Dallas Center, Iowa, USA
| | - Alison E Wright
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield, UK
| | - Dominic Wright
- AVIAN Behavioural Genomics and Physiology, IFM Biology, Linköping University, Linköping, Sweden
| | - Zhou Wu
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Edinburgh, UK
| | - Masahito Yamagata
- Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, USA
| | | | - Zhong-Tao Yin
- National Engineering Laboratory for Animal Breeding and Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | | | - Guojie Zhang
- Center for Evolutionary and Organismal Biology, Zhejiang University School of Medicine, Hangzhou, China
| | - Bingru Zhao
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, China
| | - Huaijun Zhou
- Feed the Future Innovation Lab for Genomics to Improve Poultry, University of California, Davis, California, USA
- Department of Animal Science, University of California, Davis, California, USA
|
11
|
Meyer BS, Moiron M, Caswara C, Chow W, Fedrigo O, Formenti G, Haase B, Howe K, Mountcastle J, Uliano-Silva M, Wood J, Jarvis ED, Liedvogel M, Bouwhuis S. Sex-specific changes in autosomal methylation rate in ageing common terns. Front Ecol Evol 2023. [DOI: 10.3389/fevo.2023.982443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/31/2023]
Abstract
Senescence, an age-related decline in survival and/or reproductive performance, occurs in species across the tree of life. Molecular mechanisms underlying this within-individual phenomenon are still largely unknown, but DNA methylation changes with age are among the candidates. Using a longitudinal approach, we investigated age-specific changes in autosomal methylation of common terns, relatively long-lived migratory seabirds known to show senescence. We collected blood at 1-, 3- and/or 4-year intervals, extracted DNA from the erythrocytes and estimated autosomal DNA methylation by mapping Reduced Representative Bisulfite Sequencing reads to a de novo assembled reference genome. We found autosomal methylation levels to decrease with age within females, but not males, and no evidence for selective (dis)appearance of birds of either sex in relation to their methylation level. Moreover, although we found positions in the genome to consistently vary in their methylation levels, individuals did not show such strong consistent variance. These results pave the way for studies at the level of genome features or specific positions, which should elucidate the functional consequences of the patterns observed, and how they translate to the ageing phenotype.
|
12
|
Karawita AC, Cheng Y, Chew KY, Challagulla A, Kraus R, Mueller RC, Tong MZW, Hulme KD, Bielefeldt-Ohmann H, Steele LE, Wu M, Sng J, Noye E, Bruxner TJ, Au GG, Lowther S, Blommaert J, Suh A, McCauley AJ, Kaur P, Dudchenko O, Aiden E, Fedrigo O, Formenti G, Mountcastle J, Chow W, Martin FJ, Ogeh DN, Thiaud-Nissen F, Howe K, Tracey A, Smith J, Kuo RI, Renfree MB, Kimura T, Sakoda Y, McDougall M, Spencer HG, Pyne M, Tolf C, Waldenström J, Jarvis ED, Baker ML, Burt DW, Short KR. The swan genome and transcriptome, it is not all black and white. Genome Biol 2023; 24:13. [PMID: 36683094 PMCID: PMC9867998 DOI: 10.1186/s13059-022-02838-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 05/27/2022] [Accepted: 12/12/2022] [Indexed: 01/24/2023]
Abstract
BACKGROUND: The Australian black swan (Cygnus atratus) is an iconic species with contrasting plumage to that of the closely related northern hemisphere white swans. The relative geographic isolation of the black swan may have resulted in a limited immune repertoire and increased susceptibility to infectious diseases, notably infectious diseases from which Australia has been largely shielded. Unlike mallard ducks and the mute swan (Cygnus olor), the black swan is extremely sensitive to highly pathogenic avian influenza. Understanding this susceptibility has been impaired by the absence of any available swan genome and transcriptome information. RESULTS: Here, we generate the first chromosome-length black and mute swan genomes annotated with transcriptome data, all using long-read based pipelines generated for vertebrate species. We use these genomes and transcriptomes to show that unlike other wild waterfowl, black swans lack an expanded immune gene repertoire, lack a key viral pattern-recognition receptor in endothelial cells and mount a poorly controlled inflammatory response to highly pathogenic avian influenza. We also implicate genetic differences in SLC45A2 gene in the iconic plumage of the black swan. CONCLUSION: Together, these data suggest that the immune system of the black swan is such that should any avian viral infection become established in its native habitat, the black swan would be in a significant peril.
Affiliation(s)
- Anjana C. Karawita
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia; Commonwealth Scientific and Industrial Research Organisation, Australian Centre for Disease Preparedness, 5 Portarlington Road, Geelong, VIC 3220, Australia
| | - Yuanyuan Cheng
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia
| | - Keng Yih Chew
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Arjun Challagulla
- Commonwealth Scientific and Industrial Research Organisation, Australian Centre for Disease Preparedness, 5 Portarlington Road, Geelong, VIC 3220, Australia
| | - Robert Kraus
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, 78315, Germany; Department of Biology, University of Konstanz, Konstanz, 78457, Germany
| | - Ralf C. Mueller
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, 78315, Germany; Department of Biology, University of Konstanz, Konstanz, 78457, Germany
| | - Marcus Z. W. Tong
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Katina D. Hulme
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Helle Bielefeldt-Ohmann
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Lauren E. Steele
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Melanie Wu
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Julian Sng
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Ellesandra Noye
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
| | - Timothy J. Bruxner
- grid.1003.20000 0000 9320 7537Institute for Molecular Bioscience, The University of Queensland, St Lucia, QLD 4072 Australia
| | - Gough G. Au
- grid.413322.50000 0001 2188 8254Commonwealth Scientific and Industrial Research Organisation, Australian Centre for Disease Preparedness, 5 Portarlington Road, Geelong, VIC 3220 Australia
| | - Suzanne Lowther
- grid.413322.50000 0001 2188 8254Commonwealth Scientific and Industrial Research Organisation, Australian Centre for Disease Preparedness, 5 Portarlington Road, Geelong, VIC 3220 Australia
| | - Julie Blommaert
- grid.8993.b0000 0004 1936 9457Department of Organismal Biology – Systematic Biology, Evolutionary Biology Centre, Uppsala University, Science for Life Laboratory, Uppsala, 752 36 Sweden ,The New Zealand Institute for Plant & Food Research Ltd, Nelson, 7010 New Zealand
| | - Alexander Suh
- grid.8993.b0000 0004 1936 9457Department of Organismal Biology – Systematic Biology, Evolutionary Biology Centre, Uppsala University, Science for Life Laboratory, Uppsala, 752 36 Sweden ,grid.8273.e0000 0001 1092 7967School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TU UK
| | - Alexander J. McCauley
- grid.413322.50000 0001 2188 8254Commonwealth Scientific and Industrial Research Organisation, Australian Centre for Disease Preparedness, 5 Portarlington Road, Geelong, VIC 3220 Australia
| | - Parwinder Kaur
- grid.1012.20000 0004 1936 7910School of Agriculture and Environment, The University of Western Australia, Perth, WA 6009 Australia
| | - Olga Dudchenko
- grid.39382.330000 0001 2160 926XThe Centre for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA ,grid.21940.3e0000 0004 1936 8278Centre for Theoretical Biological Physics and Department of Computer Science, Rice University, Houston, TX 77030 USA
| | - Erez Aiden
- grid.1012.20000 0004 1936 7910School of Agriculture and Environment, The University of Western Australia, Perth, WA 6009 Australia ,grid.39382.330000 0001 2160 926XThe Centre for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA ,grid.21940.3e0000 0004 1936 8278Centre for Theoretical Biological Physics and Department of Computer Science, Rice University, Houston, TX 77030 USA ,grid.66859.340000 0004 0546 1623Broad Institute of MIT and Harvard, Cambridge, MA 02139 USA ,Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech, Pudong, 201210 China
| | - Olivier Fedrigo
- grid.134907.80000 0001 2166 1519The Vertebrate Genome Laboratory, The Rockefeller University, NY, 10065 USA
| | - Giulio Formenti
- grid.134907.80000 0001 2166 1519The Vertebrate Genome Laboratory, The Rockefeller University, NY, 10065 USA
| | - Jacquelyn Mountcastle
- grid.134907.80000 0001 2166 1519The Vertebrate Genome Laboratory, The Rockefeller University, NY, 10065 USA
| | - William Chow
- grid.10306.340000 0004 0606 5382Tree of Life, Welcome Sanger Institute, Cambridge, CB10 1SA UK
| | - Fergal J. Martin
- grid.225360.00000 0000 9709 7726European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Denye N. Ogeh
- grid.225360.00000 0000 9709 7726European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Françoise Thiaud-Nissen
- grid.94365.3d0000 0001 2297 5165National Centre for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD USA
| | - Kerstin Howe
- grid.10306.340000 0004 0606 5382Tree of Life, Welcome Sanger Institute, Cambridge, CB10 1SA UK
| | - Alan Tracey
- grid.10306.340000 0004 0606 5382Tree of Life, Welcome Sanger Institute, Cambridge, CB10 1SA UK
| | - Jacqueline Smith
- grid.4305.20000 0004 1936 7988The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG UK
| | - Richard I. Kuo
- grid.4305.20000 0004 1936 7988The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG UK
| | - Marilyn B. Renfree
- grid.1008.90000 0001 2179 088XSchool of Biosciences, The University of Melbourne, Melbourne, VIC 3052 Australia
| | - Takashi Kimura
- grid.39158.360000 0001 2173 7691Faculty of Veterinary Medicine, Hokkaido University, Sapporo, Hokkaido 060-0818 Japan
| | - Yoshihiro Sakoda
- grid.39158.360000 0001 2173 7691Faculty of Veterinary Medicine, Hokkaido University, Sapporo, Hokkaido 060-0818 Japan
| | - Mathew McDougall
- New Zealand Fish & Game – Eastern Region, Rotorua, 3046 New Zealand
| | - Hamish G. Spencer
- grid.29980.3a0000 0004 1936 7830Department of Zoology, University of Otago, Dunedin, 9054 New Zealand
| | - Michael Pyne
- Currumbin Wildlife Sanctuary, Currumbin, QLD 4223 Australia
| | - Conny Tolf
- grid.8148.50000 0001 2174 3522Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, SE-391 82 Sweden
| | - Jonas Waldenström
- grid.8148.50000 0001 2174 3522Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, SE-391 82 Sweden
| | - Erich D. Jarvis
- grid.134907.80000 0001 2166 1519The Vertebrate Genome Laboratory, The Rockefeller University, NY, 10065 USA
| | - Michelle L. Baker
- grid.413322.50000 0001 2188 8254Commonwealth Scientific and Industrial Research Organisation, Australian Centre for Disease Preparedness, 5 Portarlington Road, Geelong, VIC 3220 Australia
| | - David W. Burt
- grid.1003.20000 0000 9320 7537School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072 Australia
| | - Kirsty R. Short
- grid.1003.20000 0000 9320 7537School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072 Australia
|
13
|
Toh H, Yang C, Formenti G, Raja K, Yan L, Tracey A, Chow W, Howe K, Bergeron LA, Zhang G, Haase B, Mountcastle J, Fedrigo O, Fogg J, Kirilenko B, Munegowda C, Hiller M, Jain A, Kihara D, Rhie A, Phillippy AM, Swanson SA, Jiang P, Clegg DO, Jarvis ED, Thomson JA, Stewart R, Chaisson MJP, Bukhman YV. A haplotype-resolved genome assembly of the Nile rat facilitates exploration of the genetic basis of diabetes. BMC Biol 2022; 20:245. [DOI: 10.1186/s12915-022-01427-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2022] [Accepted: 09/29/2022] [Indexed: 11/09/2022]
Abstract
Background
The Nile rat (Arvicanthis niloticus) is an important animal model because of its robust diurnal rhythm, a cone-rich retina, and a propensity to develop diet-induced diabetes without chemical or genetic modifications. A closer similarity to humans in these aspects, compared to the widely used Mus musculus and Rattus norvegicus models, holds the promise of better translation of research findings to the clinic.
Results
We report a 2.5 Gb, chromosome-level reference genome assembly with fully resolved parental haplotypes, generated with the Vertebrate Genomes Project (VGP). The assembly is highly contiguous, with contig N50 of 11.1 Mb, scaffold N50 of 83 Mb, and 95.2% of the sequence assigned to chromosomes. We used a novel workflow to identify 3613 segmental duplications and quantify duplicated genes. Comparative analyses revealed unique genomic features of the Nile rat, including some that affect genes associated with type 2 diabetes and metabolic dysfunctions. We discuss 14 genes that are heterozygous in the Nile rat or highly diverged from the house mouse.
Conclusions
Our findings reflect the exceptional level of genomic resolution present in this assembly, which will greatly expand the potential of the Nile rat as a model organism.
|
14
|
Dahn HA, Mountcastle J, Balacco J, Winkler S, Bista I, Schmitt AD, Pettersson OV, Formenti G, Oliver K, Smith M, Tan W, Kraus A, Mac S, Komoroske LM, Lama T, Crawford AJ, Murphy RW, Brown S, Scott AF, Morin PA, Jarvis ED, Fedrigo O. Benchmarking ultra-high molecular weight DNA preservation methods for long-read and long-range sequencing. Gigascience 2022; 11:6659719. [PMID: 35946988 PMCID: PMC9364683 DOI: 10.1093/gigascience/giac068] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 09/06/2021] [Revised: 01/26/2022] [Accepted: 06/16/2022] [Indexed: 11/14/2022]
Abstract
BACKGROUND Studies in vertebrate genomics require sampling from a broad range of tissue types, taxa, and localities. Recent advancements in long-read and long-range genome sequencing have made it possible to produce high-quality chromosome-level genome assemblies for almost any organism. However, adequate tissue preservation for the requisite ultra-high molecular weight DNA (uHMW DNA) remains a major challenge. Here we present a comparative study of preservation methods for field and laboratory tissue sampling, across vertebrate classes and different tissue types. RESULTS We find that storage temperature was the strongest predictor of uHMW fragment lengths. While immediate flash-freezing remains the sample preservation gold standard, samples preserved in 95% EtOH or 20-25% DMSO-EDTA showed little fragment length degradation when stored at 4°C for 6 hours. Samples in 95% EtOH or 20-25% DMSO-EDTA kept at 4°C for 1 week after dissection still yielded adequate amounts of uHMW DNA for most applications. Tissue type was a significant predictor of total DNA yield but not fragment length. Preservation solution had a smaller but significant influence on both fragment length and DNA yield. CONCLUSION We provide sample preservation guidelines that ensure sufficient DNA integrity and amount required for use with long-read and long-range sequencing technologies across vertebrates. Our best practices generated the uHMW DNA needed for the high-quality reference genomes for phase 1 of the Vertebrate Genomes Project, whose ultimate mission is to generate chromosome-level reference genome assemblies of all ∼70,000 extant vertebrate species.
Affiliation(s)
| | | | | | - Sylke Winkler
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony 01307, Germany
| | - Iliana Bista
- Tree of Life Program, Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK
- Department of Genetics, University of Cambridge, Cambridge, Cambridgeshire CB2 3EH, UK
| | | | | | | | - Karen Oliver
- Tree of Life Program, Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK
| | - Michelle Smith
- Tree of Life Program, Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK
| | - Wenhua Tan
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony 01307, Germany
| | - Anne Kraus
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony 01307, Germany
| | - Stephen Mac
- Arima Genomics, Inc., San Diego, CA 92121, USA
| | - Lisa M Komoroske
- Department of Environmental Conservation, University of Massachusetts Amherst, Amherst, MA 01003-9285, USA
| | - Tanya Lama
- Department of Environmental Conservation, University of Massachusetts Amherst, Amherst, MA 01003-9285, USA
| | - Andrew J Crawford
- Department of Biological Sciences, Universidad de los Andes, Bogotá 111711, Colombia
| | - Robert W Murphy
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario M5S 3B2, Canada
| | - Samara Brown
- The Rockefeller University, New York, NY 10065, USA
| | - Alan F Scott
- Department of Medicine, Johns Hopkins University, Baltimore, MD 21287, USA
| | - Phillip A Morin
- Southwest Fisheries Science Center, National Marine Fisheries Service, NOAA, La Jolla, CA 92037, USA
| | - Erich D Jarvis
- The Rockefeller University, New York, NY 10065, USA
- Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Olivier Fedrigo
- Correspondence address. Olivier Fedrigo, Vertebrate Genome Laboratory, The Rockefeller University, 1230 York Avenue, Box 366, New York, NY 10065, USA.
|
15
|
Palmada-Flores M, Orkin JD, Haase B, Mountcastle J, Bertelsen MF, Fedrigo O, Kuderna LFK, Jarvis ED, Marques-Bonet T. A high-quality, long-read genome assembly of the endangered ring-tailed lemur (Lemur catta). Gigascience 2022; 11:6562532. [PMID: 35365833 PMCID: PMC8975718 DOI: 10.1093/gigascience/giac026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/18/2021] [Revised: 01/14/2022] [Accepted: 02/19/2022] [Indexed: 01/31/2023]
Abstract
BACKGROUND The ring-tailed lemur (Lemur catta) is a charismatic strepsirrhine primate endemic to Madagascar. These lemurs are of particular interest, given their status as a flagship species and widespread publicity in the popular media. Unfortunately, a recent population decline has resulted in the census population decreasing to <2,500 individuals in the wild, and the species's classification as an endangered species by the IUCN. As is the case for most strepsirrhine primates, only a limited amount of genomic research has been conducted on L. catta, in part owing to the lack of genomic resources. RESULTS We generated a new high-quality reference genome assembly for L. catta (mLemCat1) that conforms to the standards of the Vertebrate Genomes Project. This new long-read assembly is composed of Pacific Biosciences continuous long reads (CLR data), Optical Mapping Bionano reads, Arima HiC data, and 10X linked reads. The contiguity and completeness of the assembly are extremely high, with scaffold and contig N50 values of 90.982 and 10.570 Mb, respectively. Additionally, when compared to other high-quality primate assemblies, L. catta has the lowest reported number of Alu elements, which results predominantly from a lack of AluS and AluY elements. CONCLUSIONS mLemCat1 is an excellent genomic resource not only for the ring-tailed lemur community, but also for other members of the Lemuridae family, and is the first very long read assembly for a strepsirrhine.
Affiliation(s)
- Marc Palmada-Flores
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain
| | - Joseph D Orkin
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain.,Département d'anthropologie, Université de Montréal, Montréal, QC H3T 1N8, Canada
| | - Bettina Haase
- The Vertebrate Genomes Lab, The Rockefeller University, New York, NY 10065, USA
| | | | - Mads F Bertelsen
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg C 1870, Denmark.,Center for Zoo and Wild Animal Health, Copenhagen Zoo, Frederiksberg 1870, Denmark
| | - Olivier Fedrigo
- The Vertebrate Genomes Lab, The Rockefeller University, New York, NY 10065, USA
| | - Lukas F K Kuderna
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain
| | - Erich D Jarvis
- The Vertebrate Genomes Lab, The Rockefeller University, New York, NY 10065, USA.,Center for Zoo and Wild Animal Health, Copenhagen Zoo, Frederiksberg 1870, Denmark.,Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA.,Laboratory of Neurogenetics of Language, The Rockefeller University, NY 10065, USA
| | - Tomas Marques-Bonet
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain.,Catalan Institution of Research and Advanced Studies (ICREA), Barcelona 08010, Spain.,CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona 08028, Spain.,Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Cerdanyola del Vallès 08193, Spain
|
16
|
Biegler MT, Fedrigo O, Collier P, Mountcastle J, Haase B, Tilgner HU, Jarvis ED. Induction of an immortalized songbird cell line allows for gene characterization and knockout by CRISPR-Cas9. Sci Rep 2022; 12:4369. [PMID: 35288582 PMCID: PMC8921232 DOI: 10.1038/s41598-022-07434-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/24/2021] [Accepted: 02/14/2022] [Indexed: 12/20/2022]
Abstract
The zebra finch is one of the most commonly studied songbirds in biology, particularly in genomics, neuroscience and vocal communication. However, this species lacks a robust cell line for molecular biology research and reagent optimization. We generated a cell line, designated CFS414, from zebra finch embryonic fibroblasts using the SV40 large and small T antigens. This cell line demonstrates an improvement over previous songbird cell lines through continuous and density-independent growth, allowing for indefinite culture and monoclonal line derivation. Cytogenetic, genomic, and transcriptomic profiling established the provenance of this cell line and identified the expression of genes relevant to ongoing songbird research. Using this cell line, we disrupted endogenous gene sequences using S. aureus Cas9 and confirmed a stress-dependent localization response of a song system specialized gene, SAP30L. The utility of CFS414 cells enhances the comprehensive molecular potential of the zebra finch and validates cell immortalization strategies in a songbird species.
Affiliation(s)
- Matthew T Biegler
- Laboratory of Neurogenetics of Language, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, 10065, USA
| | - Paul Collier
- Center for Neurogenetics, Graduate School of Medical Sciences, Weill Cornell Medical Center, New York, NY, 10065, USA
| | | | - Bettina Haase
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, 10065, USA
| | - Hagen U Tilgner
- Center for Neurogenetics, Graduate School of Medical Sciences, Weill Cornell Medical Center, New York, NY, 10065, USA
| | - Erich D Jarvis
- Laboratory of Neurogenetics of Language, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
|
17
|
Mueller RC, Ellström P, Howe K, Uliano-Silva M, Kuo RI, Miedzinska K, Warr A, Fedrigo O, Haase B, Mountcastle J, Chow W, Torrance J, Wood JMD, Järhult JD, Naguib MM, Olsen B, Jarvis ED, Smith J, Eöry L, Kraus RHS. A high-quality genome and comparison of short- versus long-read transcriptome of the palaearctic duck Aythya fuligula (tufted duck). Gigascience 2021; 10:giab081. [PMID: 34927191 PMCID: PMC8685854 DOI: 10.1093/gigascience/giab081] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/03/2021] [Revised: 07/15/2021] [Accepted: 11/22/2021] [Indexed: 11/13/2022]
Abstract
BACKGROUND The tufted duck is a non-model organism that experiences high mortality in highly pathogenic avian influenza outbreaks. It belongs to the same bird family (Anatidae) as the mallard, one of the best-studied natural hosts of low-pathogenic avian influenza viruses. Studies in non-model bird species are crucial to disentangle the role of the host response in avian influenza virus infection in the natural reservoir. Such endeavour requires a high-quality genome assembly and transcriptome. FINDINGS This study presents the first high-quality, chromosome-level reference genome assembly of the tufted duck using the Vertebrate Genomes Project pipeline. We sequenced RNA (complementary DNA) from brain, ileum, lung, ovary, spleen, and testis using Illumina short-read and Pacific Biosciences long-read sequencing platforms, which were used for annotation. We found 34 autosomes plus Z and W sex chromosomes in the curated genome assembly, with 99.6% of the sequence assigned to chromosomes. Functional annotation revealed 14,099 protein-coding genes that generate 111,934 transcripts, which implies a mean of 7.9 isoforms per gene. We also identified 246 small RNA families. CONCLUSIONS This annotated genome contributes to continuing research into the host response in avian influenza virus infections in a natural reservoir. Our findings from a comparison between short-read and long-read reference transcriptomics contribute to a deeper understanding of these competing options. In this study, both technologies complemented each other. We expect this annotation to be a foundation for further comparative and evolutionary genomic studies, including many waterfowl relatives with differing susceptibilities to avian influenza viruses.
Affiliation(s)
- Ralf C Mueller
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, 78315, Germany
- Department of Biology, University of Konstanz, Konstanz, 78457, Germany
| | - Patrik Ellström
- Department of Medical Sciences, Zoonosis Science Center, Uppsala University, Uppsala, SE-75185, Sweden
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK
| | | | - Richard I Kuo
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK
| | - Katarzyna Miedzinska
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK
| | - Amanda Warr
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, 10065, NY
| | - Bettina Haase
- Vertebrate Genome Laboratory, The Rockefeller University, New York, 10065, NY
| | | | - William Chow
- Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK
| | - James Torrance
- Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK
| | | | - Josef D Järhult
- Department of Medical Sciences, Zoonosis Science Center, Uppsala University, Uppsala, SE-75185, Sweden
| | - Mahmoud M Naguib
- Department of Medical Biochemistry and Microbiology, Zoonosis Science Center, Uppsala University, Uppsala, 75237, Sweden
| | - Björn Olsen
- Department of Medical Sciences, Zoonosis Science Center, Uppsala University, Uppsala, SE-75185, Sweden
| | - Erich D Jarvis
- Vertebrate Genome Laboratory and HHMI, The Rockefeller University, New York, 10065, NY
| | - Jacqueline Smith
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK
| | - Lél Eöry
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK
| | - Robert H S Kraus
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, 78315, Germany
- Department of Biology, University of Konstanz, Konstanz, 78457, Germany
|
18
|
Hansen T, Fjelldal PG, Lien S, Smith M, Corton C, Oliver K, Skelton J, Betteridge E, Doulcan J, Fedrigo O, Mountcastle J, Jarvis E, McCarthy SA, Chow W, Howe K, Torrance J, Wood J, Sims Y, Haggerty L, Challis R, Threlfall J, Mead D, Durbin R, Blaxter M. The genome sequence of the brown trout, Salmo trutta Linnaeus 1758. Wellcome Open Res 2021; 6:108. [PMID: 34632087 PMCID: PMC8488904 DOI: 10.12688/wellcomeopenres.16838.1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Accepted: 05/05/2021] [Indexed: 11/20/2022]
Abstract
We present a genome assembly from an individual female Salmo trutta (the brown trout; Chordata; Actinopteri; Salmoniformes; Salmonidae). The genome sequence is 2.37 gigabases in span. The majority of the assembly is scaffolded into 40 chromosomal pseudomolecules. Gene annotation of this assembly on Ensembl has identified 43,935 protein coding genes.
Affiliation(s)
- Tom Hansen
- Institute of Marine Research (IMR), Matredal, Norway
| | | | - Sigbjørn Lien
- Norwegian University of Life Sciences, Ås, 1432, Norway
| | - Michelle Smith
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Craig Corton
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Karen Oliver
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Jason Skelton
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Emma Betteridge
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Jale Doulcan
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Achilles Therapeutics plc, London, W6 8PW, UK
| | | | | | - Erich Jarvis
- The Rockefeller University, New York, New York, 10065, USA.,Howard Hughes Medical Institute, Chevy Chase, Maryland, 20815, USA
| | - Shane A McCarthy
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - William Chow
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Kerstin Howe
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - James Torrance
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Jonathan Wood
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Ying Sims
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Leanne Haggerty
- EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Richard Challis
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Jonathan Threlfall
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Daniel Mead
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Owlstone Medical, Cambridge Science Park, Cambridge, CB4 0GJ, UK
| | - Richard Durbin
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Mark Blaxter
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
|
19
|
Peart CR, Williams C, Pophaly SD, Neely BA, Gulland FMD, Adams DJ, Ng BL, Cheng W, Goebel ME, Fedrigo O, Haase B, Mountcastle J, Fungtammasan A, Formenti G, Collins J, Wood J, Sims Y, Torrance J, Tracey A, Howe K, Rhie A, Hoffman JI, Johnson J, Jarvis ED, Breen M, Wolf JBW. Hi-C scaffolded short- and long-read genome assemblies of the California sea lion are broadly consistent for syntenic inference across 45 million years of evolution. Mol Ecol Resour 2021; 21:2455-2470. [PMID: 34097816 PMCID: PMC9732816 DOI: 10.1111/1755-0998.13443] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 02/03/2021] [Revised: 05/06/2021] [Accepted: 05/26/2021] [Indexed: 12/13/2022]
Abstract
With the advent of chromatin-interaction maps, chromosome-level genome assemblies have become a reality for a wide range of organisms. Scaffolding quality is, however, difficult to judge. To explore this gap, we generated multiple chromosome-scale genome assemblies of an emerging wild animal model for carcinogenesis, the California sea lion (Zalophus californianus). Short-read assemblies were scaffolded with two independent chromatin interaction mapping data sets (Hi-C and Chicago), and long-read assemblies with three data types (Hi-C, optical maps and 10X linked reads) following the "Vertebrate Genomes Project (VGP)" pipeline. In both approaches, 18 major scaffolds recovered the karyotype (2n = 36), with scaffold N50s of 138 and 147 Mb, respectively. Synteny relationships at the chromosome level with other pinniped genomes (2n = 32-36), ferret (2n = 34), red panda (2n = 36) and domestic dog (2n = 78) were consistent across approaches and recovered known fissions and fusions. Comparative chromosome painting and multicolour chromosome tiling with a panel of 264 genome-integrated single-locus canine bacterial artificial chromosome probes provided independent evaluation of genome organization. Broad-scale discrepancies between the approaches were observed within chromosomes, most commonly in translocations centred around centromeres and telomeres, which were better resolved in the VGP assembly. Genomic and cytological approaches agreed on near-perfect synteny of the X chromosome, and in combination allowed detailed investigation of autosomal rearrangements between dog and sea lion. This study presents high-quality genomes of an emerging cancer model and highlights that even highly fragmented short-read assemblies scaffolded with Hi-C can yield reliable chromosome-level scaffolds suitable for comparative genomic analyses.
Affiliation(s)
- Claire R. Peart
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Munchen, Germany
- Christina Williams
- Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, North Carolina, USA
- Saurabh D. Pophaly
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Munchen, Germany; Max Planck Institute for Plant Breeding Research, Cologne, Germany
- Benjamin A. Neely
- National Institute of Standards and Technology, NIST Charleston, Charleston, South Carolina, USA
- Frances M. D. Gulland
- Karen Dryer Wildlife Health Center, University of California Davis, Davis, California, USA
- David J. Adams
- Cytometry Core Facility, Wellcome Sanger Institute, Cambridge, UK
- Bee Ling Ng
- Cytometry Core Facility, Wellcome Sanger Institute, Cambridge, UK
- William Cheng
- Cytometry Core Facility, Wellcome Sanger Institute, Cambridge, UK
- Michael E. Goebel
- Institute of Marine Science, University of California Santa Cruz, Santa Cruz, California, USA
- Olivier Fedrigo
- Vertebrate Genome Lab, The Rockefeller University, New York City, New York, USA
- Bettina Haase
- Vertebrate Genome Lab, The Rockefeller University, New York City, New York, USA
- Giulio Formenti
- Vertebrate Genome Lab, The Rockefeller University, New York City, New York, USA; Laboratory of Neurogenetics of Language, The Rockefeller University, New York City, New York, USA
- Joanna Collins
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Jonathan Wood
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Ying Sims
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- James Torrance
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Alan Tracey
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Kerstin Howe
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, NIH, Bethesda, Maryland, USA
- Joseph I. Hoffman
- Department of Animal Behaviour, Bielefeld University, Bielefeld, Germany; British Antarctic Survey, Cambridge, UK
- Jeremy Johnson
- Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, USA
- Erich D. Jarvis
- Vertebrate Genome Lab, The Rockefeller University, New York City, New York, USA; Howard Hughes Medical Institute, Chevy Chase, Maryland, USA
- Matthew Breen
- Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, North Carolina, USA; Comparative Medicine Institute, North Carolina State University, Raleigh, North Carolina, USA
- Jochen B. W. Wolf
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Munchen, Germany
20
Formenti G, Rhie A, Balacco J, Haase B, Mountcastle J, Fedrigo O, Brown S, Capodiferro MR, Al-Ajli FO, Ambrosini R, Houde P, Koren S, Oliver K, Smith M, Skelton J, Betteridge E, Dolucan J, Corton C, Bista I, Torrance J, Tracey A, Wood J, Uliano-Silva M, Howe K, McCarthy S, Winkler S, Kwak W, Korlach J, Fungtammasan A, Fordham D, Costa V, Mayes S, Chiara M, Horner DS, Myers E, Durbin R, Achilli A, Braun EL, Phillippy AM, Jarvis ED. Complete vertebrate mitogenomes reveal widespread repeats and gene duplications. Genome Biol 2021; 22:120. [PMID: 33910595 PMCID: PMC8082918 DOI: 10.1186/s13059-021-02336-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Received: 08/17/2020] [Accepted: 03/31/2021] [Indexed: 01/22/2023] [Open Access]
Abstract
BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100-300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization. CONCLUSIONS: Our results indicate that even in the "simple" case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone.
Affiliation(s)
- Giulio Formenti
- The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, Rockefeller University, New York, NY, USA
- The Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Jennifer Balacco
- The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA
- Bettina Haase
- The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA
- Olivier Fedrigo
- The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA
- Samara Brown
- Laboratory of Neurogenetics of Language, Rockefeller University, New York, NY, USA
- The Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Farooq O Al-Ajli
- Monash University Malaysia Genomics Facility, School of Science, Bandar Sunway, Selangor Darul Ehsan, Malaysia
- Tropical Medicine and Biology Multidisciplinary Platform, Monash University Malaysia, Bandar Sunway, Selangor Darul Ehsan, Malaysia
- Qatar Falcon Genome Project, Doha, State of Qatar
- Roberto Ambrosini
- Department of Environmental Science and Policy, University of Milan, Milan, Italy
- Peter Houde
- Department of Biology, New Mexico State University, Las Cruces, NM, USA
- Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Iliana Bista
- Wellcome Sanger Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
- Shane McCarthy
- Wellcome Sanger Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
- Sylke Winkler
- Max Planck Institute of Molecular Cell Biology & Genetics, Dresden, Germany
- Daniel Fordham
- Oxford Nanopore Technologies Ltd, Oxford Science Park, Oxford, UK
- Vania Costa
- Oxford Nanopore Technologies Ltd, Oxford Science Park, Oxford, UK
- Simon Mayes
- Oxford Nanopore Technologies Ltd, Oxford Science Park, Oxford, UK
- Matteo Chiara
- Department of Biosciences, University of Milan, Milan, Italy
- David S Horner
- Department of Biosciences, University of Milan, Milan, Italy
- Eugene Myers
- Max Planck Institute of Molecular Cell Biology & Genetics, Dresden, Germany
- Richard Durbin
- Wellcome Sanger Institute, Cambridge, UK
- Department of Genetics, University of Cambridge, Cambridge, UK
- Alessandro Achilli
- Department of Biology and Biotechnology "L. Spallanzani", University of Pavia, Pavia, Italy
- Edward L Braun
- Department of Biology, University of Florida, Gainesville, FL, USA
- Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Erich D Jarvis
- The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, Rockefeller University, New York, NY, USA
- The Howard Hughes Medical Institute, Chevy Chase, MD, USA
21
Yang C, Zhou Y, Marcus S, Formenti G, Bergeron LA, Song Z, Bi X, Bergman J, Rousselle MMC, Zhou C, Zhou L, Deng Y, Fang M, Xie D, Zhu Y, Tan S, Mountcastle J, Haase B, Balacco J, Wood J, Chow W, Rhie A, Pippel M, Fabiszak MM, Koren S, Fedrigo O, Freiwald WA, Howe K, Yang H, Phillippy AM, Schierup MH, Jarvis ED, Zhang G. Evolutionary and biomedical insights from a marmoset diploid genome assembly. Nature 2021; 594:227-233. [PMID: 33910227 PMCID: PMC8189906 DOI: 10.1038/s41586-021-03535-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Received: 09/20/2020] [Accepted: 04/12/2021] [Indexed: 01/23/2023]
Abstract
The accurate and complete assembly of both haplotype sequences of a diploid organism is essential to understanding the role of variation in genome functions, phenotypes and diseases [1]. Here, using a trio-binning approach, we present a high-quality, diploid reference genome, with both haplotypes assembled independently at the chromosome level, for the common marmoset (Callithrix jacchus), a primate model system that is widely used in biomedical research [2,3]. The full spectrum of heterozygosity between the two haplotypes involves 1.36% of the genome, much higher than the 0.13% indicated by the standard estimation based on single-nucleotide heterozygosity alone. The de novo mutation rate is 0.43 × 10⁻⁸ per site per generation, and the paternally inherited genome acquired twice as many mutations as the maternal. Our diploid assembly enabled us to discover a recent expansion of the sex-differentiation region and unique evolutionary changes in the marmoset Y chromosome. In addition, we identified many genes with signatures of positive selection that might have contributed to the evolution of Callithrix biological features. Brain-related genes were highly conserved between marmosets and humans, although several genes experienced lineage-specific copy number variations or diversifying selection, with implications for the use of marmosets as a model system.
Affiliation(s)
- Chentao Yang
- BGI-Shenzhen, Shenzhen, China; Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Stephanie Marcus
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Giulio Formenti
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA; Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Lucie A Bergeron
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Zhenzhen Song
- University of the Chinese Academy of Sciences, Beijing, China
- Juraj Bergman
- Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
- Yuan Deng
- BGI-Shenzhen, Shenzhen, China; Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Duo Xie
- BGI-Shenzhen, Shenzhen, China
- Bettina Haase
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Jennifer Balacco
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Martin Pippel
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany; Center for Systems Biology, Dresden, Germany
- Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Winrich A Freiwald
- Laboratory of Neural Systems, The Rockefeller University, New York, NY, USA; Center for Brains, Minds and Machines (CBMM), The Rockefeller University, New York, NY, USA
- Huanming Yang
- BGI-Shenzhen, Shenzhen, China; University of the Chinese Academy of Sciences, Beijing, China; James D. Watson Institute of Genome Sciences, Hangzhou, China; Guangdong Provincial Academician Workstation of BGI Synthetic Genomics, BGI-Shenzhen, Shenzhen, China
- Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Erich D Jarvis
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA; Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA; Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Guojie Zhang
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark; State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; China National GeneBank, BGI-Shenzhen, Shenzhen, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
22
Zhou Y, Shearwin-Whyatt L, Li J, Song Z, Hayakawa T, Stevens D, Fenelon JC, Peel E, Cheng Y, Pajpach F, Bradley N, Suzuki H, Nikaido M, Damas J, Daish T, Perry T, Zhu Z, Geng Y, Rhie A, Sims Y, Wood J, Haase B, Mountcastle J, Fedrigo O, Li Q, Yang H, Wang J, Johnston SD, Phillippy AM, Howe K, Jarvis ED, Ryder OA, Kaessmann H, Donnelly P, Korlach J, Lewin HA, Graves J, Belov K, Renfree MB, Grutzner F, Zhou Q, Zhang G. Platypus and echidna genomes reveal mammalian biology and evolution. Nature 2021; 592:756-762. [PMID: 33408411 PMCID: PMC8081666 DOI: 10.1038/s41586-020-03039-0] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Received: 12/04/2019] [Accepted: 07/30/2020] [Indexed: 12/13/2022]
Abstract
Egg-laying mammals (monotremes) are the only extant mammalian outgroup to therians (marsupial and eutherian animals) and provide key insights into mammalian evolution [1,2]. Here we generate and analyse reference genomes of the platypus (Ornithorhynchus anatinus) and echidna (Tachyglossus aculeatus), which represent the only two extant monotreme lineages. The nearly complete platypus genome assembly has anchored almost the entire genome onto chromosomes, markedly improving the genome continuity and gene annotation. Together with our echidna sequence, the genomes of the two species allow us to detect the ancestral and lineage-specific genomic changes that shape both monotreme and mammalian evolution. We provide evidence that the monotreme sex chromosome complex originated from an ancestral chromosome ring configuration. The formation of such a unique chromosome complex may have been facilitated by the unusually extensive interactions between the multi-X and multi-Y chromosomes that are shared by the autosomal homologues in humans. Further comparative genomic analyses unravel marked differences between monotremes and therians in haptoglobin genes, lactation genes and chemosensory receptor genes for smell and taste that underlie the ecological adaptation of monotremes.
Affiliation(s)
- Yang Zhou
- BGI-Shenzhen, Shenzhen, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Linda Shearwin-Whyatt
- School of Biological Sciences, The Environment Institute, The University of Adelaide, Adelaide, South Australia, Australia
- Jing Li
- MOE Laboratory of Biosystems Homeostasis and Protection and Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, China
- Zhenzhen Song
- BGI-Shenzhen, Shenzhen, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, China
- Takashi Hayakawa
- Faculty of Environmental Earth Science, Hokkaido University, Sapporo, Japan
- Japan Monkey Centre, Inuyama, Japan
- David Stevens
- School of Biological Sciences, The Environment Institute, The University of Adelaide, Adelaide, South Australia, Australia
- Jane C Fenelon
- School of BioSciences, The University of Melbourne, Melbourne, Victoria, Australia
- Emma Peel
- School of Life and Environmental Sciences, The University of Sydney, Sydney, New South Wales, Australia
- Yuanyuan Cheng
- School of Life and Environmental Sciences, The University of Sydney, Sydney, New South Wales, Australia
- Filip Pajpach
- School of Biological Sciences, The Environment Institute, The University of Adelaide, Adelaide, South Australia, Australia
- Natasha Bradley
- School of Biological Sciences, The Environment Institute, The University of Adelaide, Adelaide, South Australia, Australia
- Masato Nikaido
- School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan
- Joana Damas
- The Genome Center, University of California, Davis, CA, USA
- Tasman Daish
- School of Biological Sciences, The Environment Institute, The University of Adelaide, Adelaide, South Australia, Australia
- Tahlia Perry
- School of Biological Sciences, The Environment Institute, The University of Adelaide, Adelaide, South Australia, Australia
- Zexian Zhu
- MOE Laboratory of Biosystems Homeostasis and Protection and Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, China
- Yuncong Geng
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Ying Sims
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Jonathan Wood
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Bettina Haase
- The Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Olivier Fedrigo
- The Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Qiye Li
- BGI-Shenzhen, Shenzhen, China
- Huanming Yang
- BGI-Shenzhen, Shenzhen, China
- James D. Watson Institute of Genome Sciences, Hangzhou, China
- University of the Chinese Academy of Sciences, Beijing, China
- Guangdong Provincial Academician Workstation of BGI Synthetic Genomics, BGI-Shenzhen, Shenzhen, China
- Jian Wang
- BGI-Shenzhen, Shenzhen, China
- James D. Watson Institute of Genome Sciences, Hangzhou, China
- Stephen D Johnston
- School of Agriculture and Food Sciences, The University of Queensland, Gatton, Queensland, Australia
- Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Kerstin Howe
- Tree of Life Programme, Wellcome Sanger Institute, Cambridge, UK
- Erich D Jarvis
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Henrik Kaessmann
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany
- Peter Donnelly
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Harris A Lewin
- The Genome Center, University of California, Davis, CA, USA
- Department of Evolution and Ecology, College of Biological Sciences, University of California, Davis, CA, USA
- Department of Reproduction and Population Health, School of Veterinary Medicine, University of California, Davis, CA, USA
- Jennifer Graves
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
- Institute for Applied Ecology, University of Canberra, Canberra, Australian Capital Territory, Australia
- School of Life Sciences, La Trobe University, Melbourne, Victoria, Australia
- Katherine Belov
- School of Life and Environmental Sciences, The University of Sydney, Sydney, New South Wales, Australia
- Marilyn B Renfree
- School of BioSciences, The University of Melbourne, Melbourne, Victoria, Australia
- Frank Grutzner
- School of Biological Sciences, The Environment Institute, The University of Adelaide, Adelaide, South Australia, Australia
- Qi Zhou
- MOE Laboratory of Biosystems Homeostasis and Protection and Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, China
- Department of Neuroscience and Developmental Biology, University of Vienna, Vienna, Austria
- Center for Reproductive Medicine, The 2nd Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, China
- Guojie Zhang
- BGI-Shenzhen, Shenzhen, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
23
Rhie A, McCarthy SA, Fedrigo O, Damas J, Formenti G, Koren S, Uliano-Silva M, Chow W, Fungtammasan A, Kim J, Lee C, Ko BJ, Chaisson M, Gedman GL, Cantin LJ, Thibaud-Nissen F, Haggerty L, Bista I, Smith M, Haase B, Mountcastle J, Winkler S, Paez S, Howard J, Vernes SC, Lama TM, Grutzner F, Warren WC, Balakrishnan CN, Burt D, George JM, Biegler MT, Iorns D, Digby A, Eason D, Robertson B, Edwards T, Wilkinson M, Turner G, Meyer A, Kautt AF, Franchini P, Detrich HW, Svardal H, Wagner M, Naylor GJP, Pippel M, Malinsky M, Mooney M, Simbirsky M, Hannigan BT, Pesout T, Houck M, Misuraca A, Kingan SB, Hall R, Kronenberg Z, Sović I, Dunn C, Ning Z, Hastie A, Lee J, Selvaraj S, Green RE, Putnam NH, Gut I, Ghurye J, Garrison E, Sims Y, Collins J, Pelan S, Torrance J, Tracey A, Wood J, Dagnew RE, Guan D, London SE, Clayton DF, Mello CV, Friedrich SR, Lovell PV, Osipova E, Al-Ajli FO, Secomandi S, Kim H, Theofanopoulou C, Hiller M, Zhou Y, Harris RS, Makova KD, Medvedev P, Hoffman J, Masterson P, Clark K, Martin F, Howe K, Flicek P, Walenz BP, Kwak W, Clawson H, Diekhans M, Nassar L, Paten B, Kraus RHS, Crawford AJ, Gilbert MTP, Zhang G, Venkatesh B, Murphy RW, Koepfli KP, Shapiro B, Johnson WE, Di Palma F, Marques-Bonet T, Teeling EC, Warnow T, Graves JM, Ryder OA, Haussler D, O'Brien SJ, Korlach J, Lewin HA, Howe K, Myers EW, Durbin R, Phillippy AM, Jarvis ED. Towards complete and error-free genome assemblies of all vertebrate species. Nature 2021; 592:737-746. [PMID: 33911273 PMCID: PMC8081667 DOI: 10.1038/s41586-021-03451-0] [Citation(s) in RCA: 591] [Impact Index Per Article: 197.0] [Received: 05/22/2020] [Accepted: 03/12/2021] [Indexed: 02/02/2023]
Abstract
High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species [1-4]. To address this issue, the international Genome 10K (G10K) consortium [5,6] has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.
Affiliation(s)
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Shane A McCarthy
- Department of Genetics, University of Cambridge, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
- Olivier Fedrigo
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Joana Damas
- The Genome Center, University of California Davis, Davis, CA, USA
- Giulio Formenti
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Marcela Uliano-Silva
- Leibniz Institute for Zoo and Wildlife Research, Department of Evolutionary Genetics, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
- Juwan Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Chul Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Byung June Ko
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
- Mark Chaisson
- University of Southern California, Los Angeles, CA, USA
- Gregory L Gedman
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Lindsey J Cantin
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Francoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
- Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
- Iliana Bista
- Department of Genetics, University of Cambridge, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
- Bettina Haase
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Sylke Winkler
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- DRESDEN-concept Genome Center, Dresden, Germany
- Sadye Paez
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Sonja C Vernes
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
- School of Biology, University of St Andrews, St Andrews, UK
- Tanya M Lama
- University of Massachusetts Cooperative Fish and Wildlife Research Unit, Amherst, MA, USA
- Frank Grutzner
- School of Biological Science, The Environment Institute, University of Adelaide, Adelaide, South Australia, Australia
- Wesley C Warren
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Dave Burt
- UQ Genomics, University of Queensland, Brisbane, Queensland, Australia
- Julia M George
- Department of Biological Sciences, Clemson University, Clemson, SC, USA
- Matthew T Biegler
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- David Iorns
- The Genetic Rescue Foundation, Wellington, New Zealand
- Andrew Digby
- Kākāpō Recovery, Department of Conservation, Invercargill, New Zealand
- Daryl Eason
- Kākāpō Recovery, Department of Conservation, Invercargill, New Zealand
- Bruce Robertson
- Department of Zoology, University of Otago, Dunedin, New Zealand
- Mark Wilkinson
- Department of Life Sciences, Natural History Museum, London, UK
- George Turner
- School of Natural Sciences, Bangor University, Gwynedd, UK
- Axel Meyer
- Department of Biology, University of Konstanz, Konstanz, Germany
- Andreas F Kautt
- Department of Biology, University of Konstanz, Konstanz, Germany
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Paolo Franchini
- Department of Biology, University of Konstanz, Konstanz, Germany
- H William Detrich
- Department of Marine and Environmental Sciences, Northeastern University Marine Science Center, Nahant, MA, USA
- Hannes Svardal
- Department of Biology, University of Antwerp, Antwerp, Belgium
- Naturalis Biodiversity Center, Leiden, The Netherlands
- Maximilian Wagner
- Institute of Biology, Karl-Franzens University of Graz, Graz, Austria
- Gavin J P Naylor
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Martin Pippel
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology, Dresden, Germany
- Milan Malinsky
- Wellcome Sanger Institute, Cambridge, UK
- Zoological Institute, University of Basel, Basel, Switzerland
- Trevor Pesout
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
- Ivan Sović
- Pacific Biosciences, Menlo Park, CA, USA
- Digital BioLogic, Ivanić-Grad, Croatia
- Zemin Ning
- Wellcome Sanger Institute, Cambridge, UK
- Joyce Lee
- Bionano Genomics, San Diego, CA, USA
- Richard E Green
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
- Dovetail Genomics, Santa Cruz, CA, USA
- Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
- Jay Ghurye
- Dovetail Genomics, Santa Cruz, CA, USA
- Department of Computer Science, University of Maryland College Park, College Park, MD, USA
- Erik Garrison
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
- Ying Sims
- Wellcome Sanger Institute, Cambridge, UK
- Dengfeng Guan
- Department of Genetics, University of Cambridge, Cambridge, UK
- School of Computer Science and Technology, Center for Bioinformatics, Harbin Institute of Technology, Harbin, China
- Sarah E London
- Department of Psychology, Institute for Mind and Biology, University of Chicago, Chicago, IL, USA
- David F Clayton
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
- Claudio V Mello
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA
- Samantha R Friedrich
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA
- Peter V Lovell
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR, USA
- Ekaterina Osipova
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Farooq O Al-Ajli
- Monash University Malaysia Genomics Facility, School of Science, Selangor Darul Ehsan, Malaysia
- Tropical Medicine and Biology Multidisciplinary Platform, Monash University Malaysia, Selangor Darul Ehsan, Malaysia
- Qatar Falcon Genome Project, Doha, Qatar
- Heebal Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
- eGnome, Inc., Seoul, Republic of Korea
- Michael Hiller
- LOEWE Centre for Translational Biodiversity Genomics, Frankfurt, Germany
- Senckenberg Research Institute, Frankfurt, Germany
- Goethe-University, Faculty of Biosciences, Frankfurt, Germany
| | | | - Robert S Harris
- Department of Biology, Pennsylvania State University, University Park, PA, USA
| | - Kateryna D Makova
- Department of Biology, Pennsylvania State University, University Park, PA, USA
- Center for Medical Genomics, Pennsylvania State University, University Park, PA, USA
- Center for Computational Biology and Bioinformatics, Pennsylvania State University, University Park, PA, USA
| | - Paul Medvedev
- Center for Medical Genomics, Pennsylvania State University, University Park, PA, USA
- Center for Computational Biology and Bioinformatics, Pennsylvania State University, University Park, PA, USA
- Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, USA
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA
| | - Jinna Hoffman
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
| | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
| | - Karen Clark
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, USA
| | - Fergal Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Kevin Howe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Brian P Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Woori Kwak
- eGnome, Inc., Seoul, Republic of Korea
- Hoonygen, Seoul, Korea
| | - Hiram Clawson
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Luis Nassar
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Robert H S Kraus
- Department of Biology, University of Konstanz, Konstanz, Germany
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, Germany
| | - Andrew J Crawford
- Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - M Thomas P Gilbert
- Center for Evolutionary Hologenomics, The GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
- University Museum, NTNU, Trondheim, Norway
| | - Guojie Zhang
- China National Genebank, BGI-Shenzhen, Shenzhen, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Byrappa Venkatesh
- Institute of Molecular and Cell Biology, A*STAR, Biopolis, Singapore, Singapore
| | - Robert W Murphy
- Centre for Biodiversity, Royal Ontario Museum, Toronto, Ontario, Canada
| | - Klaus-Peter Koepfli
- Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, DC, USA
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Warren E Johnson
- Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, DC, USA
- The Walter Reed Biosystematics Unit, Museum Support Center MRC-534, Smithsonian Institution, Suitland, MD, USA
- Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Federica Di Palma
- Department of Biological Sciences, Earlham Institute, University of East Anglia, Norwich, UK
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Emma C Teeling
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Tandy Warnow
- Department of Computer Science, The University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | | | - Oliver A Ryder
- San Diego Zoo Global, Escondido, CA, USA
- Department of Evolution, Behavior, and Ecology, University of California San Diego, La Jolla, CA, USA
| | - David Haussler
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Stephen J O'Brien
- Laboratory of Genomics Diversity-Center for Computer Technologies, ITMO University, St. Petersburg, Russian Federation
- Guy Harvey Oceanographic Center, Halmos College of Natural Sciences and Oceanography, Nova Southeastern University, Fort Lauderdale, FL, USA
| | | | - Harris A Lewin
- The Genome Center, University of California Davis, Davis, CA, USA
- Department of Evolution and Ecology, University of California Davis, Davis, CA, USA
- John Muir Institute for the Environment, University of California Davis, Davis, CA, USA
| | | | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology, Dresden, Germany
- Faculty of Computer Science, Technical University Dresden, Dresden, Germany
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Erich D Jarvis
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
24
Morin PA, Archer FI, Avila CD, Balacco JR, Bukhman YV, Chow W, Fedrigo O, Formenti G, Fronczek JA, Fungtammasan A, Gulland FMD, Haase B, Peter Heide-Jorgensen M, Houck ML, Howe K, Misuraca AC, Mountcastle J, Musser W, Paez S, Pelan S, Phillippy A, Rhie A, Robinson J, Rojas-Bracho L, Rowles TK, Ryder OA, Smith CR, Stevenson S, Taylor BL, Teilmann J, Torrance J, Wells RS, Westgate AJ, Jarvis ED. Reference genome and demographic history of the most endangered marine mammal, the vaquita. Mol Ecol Resour 2020; 21:1008-1020. [PMID: 33089966 PMCID: PMC8247363 DOI: 10.1111/1755-0998.13284] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Received: 06/01/2020] [Revised: 09/08/2020] [Accepted: 10/08/2020] [Indexed: 12/12/2022]
Abstract
The vaquita is the most critically endangered marine mammal, with fewer than 19 remaining in the wild. First described in 1958, the vaquita has been in rapid decline for more than 20 years resulting from inadvertent deaths due to the increasing use of large-mesh gillnets. To understand the evolutionary and demographic history of the vaquita, we used combined long-read sequencing and long-range scaffolding methods with long- and short-read RNA sequencing to generate a near error-free annotated reference genome assembly from cell lines derived from a female individual. The genome assembly consists of 99.92% of the assembled sequence contained in 21 nearly gapless chromosome-length autosome scaffolds and the X-chromosome scaffold, with a scaffold N50 of 115 Mb. Genome-wide heterozygosity is the lowest (0.01%) of any mammalian species analysed to date, but heterozygosity is evenly distributed across the chromosomes, consistent with long-term small population size at genetic equilibrium, rather than low diversity resulting from a recent population bottleneck or inbreeding. Historical demography of the vaquita indicates long-term population stability at less than 5,000 (Ne) for over 200,000 years. Together, these analyses indicate that the vaquita genome has had ample opportunity to purge highly deleterious alleles and potentially maintain diversity necessary for population health.
Affiliation(s)
- Phillip A Morin
- Southwest Fisheries Science Center, National Marine Fisheries Service, NOAA, La Jolla, CA, USA
| | - Frederick I Archer
- Southwest Fisheries Science Center, National Marine Fisheries Service, NOAA, La Jolla, CA, USA
| | - Catherine D Avila
- San Diego Zoo Institute for Conservation Research, Escondido, CA, USA
| | - Jennifer R Balacco
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Yury V Bukhman
- Regenerative Biology, Morgridge Institute for Research, Madison, WI, USA
| | | | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Julie A Fronczek
- San Diego Zoo Institute for Conservation Research, Escondido, CA, USA
| | | | | | - Bettina Haase
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | | | - Marlys L Houck
- San Diego Zoo Institute for Conservation Research, Escondido, CA, USA
| | | | - Ann C Misuraca
- San Diego Zoo Institute for Conservation Research, Escondido, CA, USA
| | | | | | - Sadye Paez
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | | | - Adam Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
| | - Jacqueline Robinson
- Institute for Human Genetics, University of California, San Francisco, CA, USA
| | | | - Teri K Rowles
- Office of Protected Resources, National Marine Fisheries Service, NOAA, Silver Spring, MD, USA
| | - Oliver A Ryder
- San Diego Zoo Institute for Conservation Research, Escondido, CA, USA
| | | | | | - Barbara L Taylor
- Southwest Fisheries Science Center, National Marine Fisheries Service, NOAA, La Jolla, CA, USA
| | - Jonas Teilmann
- Marine Mammal Research, Department of Bioscience, Aarhus University, Roskilde, Denmark
| | | | - Randall S Wells
- Chicago Zoological Society's Sarasota Dolphin Research Program, c/o Mote Marine Laboratory, Sarasota, FL, USA
| | | | - Erich D Jarvis
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
25
Mead D, Fingland K, Cripps R, Portela Miguez R, Smith M, Corton C, Oliver K, Skelton J, Betteridge E, Dolucan J, Dudchenko O, Omer AD, Weisz D, Lieberman Aiden E, Fedrigo O, Mountcastle J, Jarvis E, McCarthy SA, Sims Y, Torrance J, Tracey A, Howe K, Challis R, Durbin R, Blaxter M. The genome sequence of the Eurasian red squirrel, Sciurus vulgaris Linnaeus 1758. Wellcome Open Res 2020; 5:18. [PMID: 32587897 PMCID: PMC7309416 DOI: 10.12688/wellcomeopenres.15679.1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Accepted: 01/22/2020] [Indexed: 01/27/2023]
Abstract
We present a genome assembly from an individual male Sciurus vulgaris (the Eurasian red squirrel; Vertebrata; Mammalia; Eutheria; Rodentia; Sciuridae). The genome sequence is 2.88 gigabases in span. The majority of the assembly is scaffolded into 21 chromosomal-level scaffolds, with both X and Y sex chromosomes assembled.
Affiliation(s)
- Daniel Mead
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Kathryn Fingland
- School of Animal, Rural and Environmental Sciences, Nottingham Trent University, Nottingham, NG25 0QF, UK
| | - Rachel Cripps
- The Wildlife Trust for Lancashire, Manchester and North Merseyside, Preston, PR5 6BY, UK
| | | | - Michelle Smith
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Craig Corton
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Karen Oliver
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Jason Skelton
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Emma Betteridge
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Jale Dolucan
- Baylor College of Medicine, Houston, TX, 77030, USA
| | | | | | - David Weisz
- Baylor College of Medicine, Houston, TX, 77030, USA
| | | | - Olivier Fedrigo
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, 10065, USA
| | - Jacquelyn Mountcastle
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, 10065, USA
| | - Erich Jarvis
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, 10065, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA
| | - Shane A. McCarthy
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Ying Sims
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - James Torrance
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Alan Tracey
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Richard Challis
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Richard Durbin
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Mark Blaxter
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| |
Collapse
|