1. Ho DV, Tormey D, Odell A, Newton AA, Schnittker RR, Baumann DP, Neaves WB, Schroeder MR, Sigauke RF, Barley AJ, Baumann P. Post-meiotic mechanism of facultative parthenogenesis in gonochoristic whiptail lizard species. eLife 2024; 13:e97035. [PMID: 38847388 PMCID: PMC11161175 DOI: 10.7554/elife.97035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/18/2024] [Accepted: 05/17/2024] [Indexed: 06/09/2024] [Open access]
Abstract
Facultative parthenogenesis (FP) has historically been regarded as rare in vertebrates, but in recent years incidences have been reported in a growing list of fish, reptile, and bird species. Despite the increasing interest in the phenomenon, the underlying mechanism and evolutionary implications have remained unclear. A common finding across many incidences of FP is either a high degree of homozygosity at microsatellite loci or low levels of heterozygosity detected in next-generation sequencing data. This has led to the proposal that second polar body fusion following the meiotic divisions restores diploidy and thereby mimics fertilization. Here, we show that FP occurring in the gonochoristic Aspidoscelis species A. marmoratus and A. arizonae results in genome-wide homozygosity, an observation inconsistent with polar body fusion as the underlying mechanism of restoration. Instead, a high-quality reference genome for A. marmoratus and analysis of whole-genome sequencing from multiple FP and control animals reveals that a post-meiotic mechanism gives rise to homozygous animals from haploid, unfertilized oocytes. Contrary to the widely held belief that females need to be isolated from males to undergo FP, females housed with conspecific and heterospecific males produced unfertilized eggs that underwent spontaneous development. In addition, offspring arising from both fertilized eggs and parthenogenetic development were observed to arise from a single clutch. Strikingly, our data support a mechanism for facultative parthenogenesis that removes all heterozygosity in a single generation. Complete homozygosity exposes the genetic load and explains the high rate of congenital malformations and embryonic mortality associated with FP in many species. Conversely, for animals that develop normally, FP could potentially exert strong purifying selection as all lethal recessive alleles are purged in a single generation.
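The argument in this abstract rests on one measurable quantity: the share of the mother's heterozygous sites that are still heterozygous in her offspring. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name, the dictionary-based genotype representation, and the toy genotypes are all assumptions made for illustration. A value of zero across the genome corresponds to the genome-wide homozygosity that the abstract attributes to a post-meiotic mechanism and describes as inconsistent with polar body fusion.

```python
from typing import Dict, Tuple

Genotype = Tuple[str, str]  # two alleles at one site, e.g. ("A", "G")


def retained_heterozygosity(mother: Dict[str, Genotype],
                            offspring: Dict[str, Genotype]) -> float:
    """Fraction of the mother's heterozygous sites still heterozygous in the offspring."""
    # Only sites where the mother carries two different alleles are informative.
    informative = [
        site for site, (a, b) in mother.items()
        if a != b and site in offspring
    ]
    if not informative:
        raise ValueError("no maternally heterozygous sites shared with offspring")
    still_het = sum(
        1 for site in informative
        if offspring[site][0] != offspring[site][1]
    )
    return still_het / len(informative)


# Toy data: site names and alleles are hypothetical.
mother = {"s1": ("A", "G"), "s2": ("C", "T"), "s3": ("G", "G"), "s4": ("T", "A")}
fp_offspring = {"s1": ("A", "A"), "s2": ("T", "T"), "s3": ("G", "G"), "s4": ("A", "A")}
sexual_offspring = {"s1": ("A", "G"), "s2": ("C", "C"), "s3": ("G", "C"), "s4": ("T", "T")}

fp_fraction = retained_heterozygosity(mother, fp_offspring)
sexual_fraction = retained_heterozygosity(mother, sexual_offspring)
```

In the toy data the mother is heterozygous at three sites; the hypothetical FP offspring retains none of them, while the hypothetical sexually produced offspring retains one.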
Affiliation(s)
- David V Ho
  - Department of Biology, Johannes Gutenberg University, Mainz, Germany
  - Institute of Quantitative and Computational Biosciences, Johannes Gutenberg University, Mainz, Germany
- Duncan Tormey
  - Stowers Institute for Medical Research, Kansas City, United States
- Aaron Odell
  - Department of Biology, Johannes Gutenberg University, Mainz, Germany
- Diana P Baumann
  - Stowers Institute for Medical Research, Kansas City, United States
- Anthony J Barley
  - School of Mathematical and Natural Sciences, Arizona State University–West Valley Campus, Glendale, United States
- Peter Baumann
  - Department of Biology, Johannes Gutenberg University, Mainz, Germany
  - Institute of Quantitative and Computational Biosciences, Johannes Gutenberg University, Mainz, Germany
  - Institute of Molecular Biology, Mainz, Germany
2. Schultz DT, Heath-Heckman EA, Winchell CJ, Kuo DH, Yu YS, Oberauer F, Kocot KM, Cho SJ, Simakov O, Weisblat DA. Acceleration of genome rearrangement in clitellate annelids. bioRxiv: the preprint server for biology 2024:2024.05.12.593736. [PMID: 38798472 PMCID: PMC11118384 DOI: 10.1101/2024.05.12.593736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]
Abstract
Comparisons of multiple metazoan genomes have revealed the existence of ancestral linkage groups (ALGs), genomic scaffolds sharing sets of orthologous genes that have been inherited from ancestral animals for hundreds of millions of years (Simakov et al. 2022; Schultz et al. 2023). These ALGs have persisted across major animal taxa including Cnidaria, Deuterostomia, Ecdysozoa and Spiralia. Notwithstanding this general trend of chromosome-scale conservation, ALGs have been obliterated by extensive genome rearrangements in certain groups, most notably including Clitellata (oligochaetes and leeches), a group of easily overlooked invertebrates that is of tremendous ecological, agricultural and economic importance (Charles 2019; Barrett 2016). To further investigate these rearrangements, we have undertaken a comparison of 12 clitellate genomes (including four newly sequenced species) and 11 outgroup representatives. We show that these rearrangements began at the base of the Clitellata (rather than progressing gradually throughout polychaete annelids), that the inter-chromosomal rearrangements continue in several clitellate lineages and that these events have substantially shaped the evolution of the otherwise highly conserved Hox cluster.
Affiliation(s)
- Darrin T. Schultz
  - Department of Neuroscience and Developmental Biology, University of Vienna, Vienna 1010, Austria
- Elizabeth A.C. Heath-Heckman
  - Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
  - Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA
- Christopher J. Winchell
  - Department of Molecular and Cell Biology, University of California, 385 Weill Hall, Berkeley, CA 94720-3200, USA
- Dian-Han Kuo
  - Department of Life Science & Museum of Zoology, National Taiwan University, No. 1 Section 4 Roosevelt Rd., Taipei 10617, Taiwan
- Yun-sang Yu
  - Department of Biological Sciences and Biotechnology, Chungbuk National University, Cheongju, 28644, Republic of Korea
- Fabian Oberauer
  - Department of Neuroscience and Developmental Biology, University of Vienna, Vienna 1010, Austria
- Kevin M. Kocot
  - Department of Biological Sciences, University of Alabama, Tuscaloosa, AL 35487, USA
  - Alabama Museum of Natural History, University of Alabama, Tuscaloosa, AL 35487, USA
- Sung-Jin Cho
  - Department of Biological Sciences and Biotechnology, Chungbuk National University, Cheongju, 28644, Republic of Korea
- Oleg Simakov
  - Department of Neuroscience and Developmental Biology, University of Vienna, Vienna 1010, Austria
- David A. Weisblat
  - Department of Molecular and Cell Biology, University of California, 385 Weill Hall, Berkeley, CA 94720-3200, USA
3. Espinosa E, Bautista R, Larrosa R, Plata O. Advancements in long-read genome sequencing technologies and algorithms. Genomics 2024; 116:110842. [PMID: 38608738 DOI: 10.1016/j.ygeno.2024.110842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2023] [Revised: 04/01/2024] [Accepted: 04/06/2024] [Indexed: 04/14/2024]
Abstract
The recent advent of long-read sequencing technologies, such as Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), has led to substantial improvements in accuracy and computational cost in sequencing genomes. However, de novo whole-genome assembly remains a formidable challenge, with difficult trade-offs between computational demands and the quality of the results. As sequencing accuracy and throughput steadily advance, a continuous stream of innovative assembly tools floods the field. Navigating this dynamic landscape necessitates a reasonable choice of sequencing platform, depth, and assembly tools to orchestrate high-quality genome reconstructions. This comprehensive review delves into the intricate interplay between cutting-edge long-read sequencing technologies, assembly methodologies, and the ever-evolving field of genomics. With a focus on addressing the pivotal challenges and harnessing the opportunities presented by these advancements, we provide an in-depth exploration of the crucial factors influencing the selection of optimal strategies for achieving robust and insightful genome assemblies.
Affiliation(s)
- Elena Espinosa
  - Department of Computer Architecture, University of Malaga, Louis Pasteur, 35, Campus de Teatinos, Malaga 29071, Spain.
- Rocio Bautista
  - Supercomputing and Bioinnovation Center, University of Malaga, C. Severo Ochoa, 34, Malaga 29590, Spain.
- Rafael Larrosa
  - Department of Computer Architecture, University of Malaga, Louis Pasteur, 35, Campus de Teatinos, Malaga 29071, Spain; Supercomputing and Bioinnovation Center, University of Malaga, C. Severo Ochoa, 34, Malaga 29590, Spain.
- Oscar Plata
  - Department of Computer Architecture, University of Malaga, Louis Pasteur, 35, Campus de Teatinos, Malaga 29071, Spain.
4. Session AM. Allopolyploid subgenome identification and implications for evolutionary analysis. Trends Genet 2024:S0168-9525(24)00070-2. [PMID: 38637269 DOI: 10.1016/j.tig.2024.03.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2023] [Revised: 03/21/2024] [Accepted: 03/21/2024] [Indexed: 04/20/2024]
Abstract
Whole-genome duplications (WGDs) are widespread genomic events in eukaryotes that are hypothesized to contribute to the evolutionary success of many lineages, including flowering plants, Saccharomyces yeast, and vertebrates. WGDs generally can be classified into autopolyploids (ploidy increase descended from one species) or allopolyploids (ploidy increase descended from multiple species). Assignment of allopolyploid progenitor species (called subgenomes in the polyploid) is important to understanding the biology and evolution of polyploids, including the asymmetric subgenome evolution following hybridization (biased fractionation). Here, I review the different methodologies used to identify the ancestors of allopolyploid subgenomes, discuss the advantages and disadvantages of these methods, and outline the implications of how these methods affect the subsequent evolutionary analysis of these genomes.
Affiliation(s)
- Adam M Session
  - Department of Biological Sciences, Binghamton University, Binghamton, NY 13902, USA.
5. Marlétaz F, Timoshevskaya N, Timoshevskiy VA, Parey E, Simakov O, Gavriouchkina D, Suzuki M, Kubokawa K, Brenner S, Smith JJ, Rokhsar DS. The hagfish genome and the evolution of vertebrates. Nature 2024; 627:811-820. [PMID: 38262590 PMCID: PMC10972751 DOI: 10.1038/s41586-024-07070-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/17/2023] [Accepted: 01/15/2024] [Indexed: 01/25/2024]
Abstract
As the only surviving lineages of jawless fishes, hagfishes and lampreys provide a crucial window into early vertebrate evolution [1-3]. Here we investigate the complex history, timing and functional role of genome-wide duplications [4-7] and programmed DNA elimination [8,9] in vertebrates in the light of a chromosome-scale genome sequence for the brown hagfish Eptatretus atami. Combining evidence from syntenic and phylogenetic analyses, we establish a comprehensive picture of vertebrate genome evolution, including an auto-tetraploidization (1RV) that predates the early Cambrian cyclostome-gnathostome split, followed by a mid-late Cambrian allo-tetraploidization (2RJV) in gnathostomes and a prolonged Cambrian-Ordovician hexaploidization (2RCY) in cyclostomes. Subsequently, hagfishes underwent extensive genomic changes, with chromosomal fusions accompanied by the loss of genes that are essential for organ systems (for example, genes involved in the development of eyes and in the proliferation of osteoclasts); these changes account, in part, for the simplification of the hagfish body plan [1,2]. Finally, we characterize programmed DNA elimination in hagfish, identifying protein-coding genes and repetitive elements that are deleted from somatic cell lineages during early development. The elimination of these germline-specific genes provides a mechanism for resolving genetic conflict between soma and germline by repressing germline and pluripotency functions, paralleling findings in lampreys [10,11]. Reconstruction of the early genomic history of vertebrates provides a framework for further investigations of the evolution of cyclostomes and jawed vertebrates.
Affiliation(s)
- Ferdinand Marlétaz
  - Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK.
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan.
- Elise Parey
  - Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
- Oleg Simakov
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
  - Department for Neurosciences and Developmental Biology, University of Vienna, Vienna, Austria
- Daria Gavriouchkina
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
  - UK Dementia Research Institute, University College London, London, UK
- Masakazu Suzuki
  - Department of Science, Graduate School of Integrated Science and Technology, Shizuoka University, Shizuoka, Japan
- Kaoru Kubokawa
  - Ocean Research Institute, The University of Tokyo, Tokyo, Japan
- Sydney Brenner
  - Comparative and Medical Genomics Laboratory, Institute of Molecular and Cell Biology, A*STAR, Biopolis, Singapore, Singapore
- Jeramiah J Smith
  - Department of Biology, University of Kentucky, Lexington, KY, USA.
- Daniel S Rokhsar
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan.
  - Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA.
  - Chan Zuckerberg Biohub, San Francisco, CA, USA.
6. Liang Q, Muñoz-Amatriaín M, Shu S, Lo S, Wu X, Carlson JW, Davidson P, Goodstein DM, Phillips J, Janis NM, Lee EJ, Liang C, Morrell PL, Farmer AD, Xu P, Close TJ, Lonardi S. A view of the pan-genome of domesticated Cowpea (Vigna unguiculata [L.] Walp.). The Plant Genome 2024; 17:e20319. [PMID: 36946261 DOI: 10.1002/tpg2.20319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/19/2022] [Revised: 01/19/2023] [Accepted: 02/04/2023] [Indexed: 06/18/2023]
Abstract
Cowpea, Vigna unguiculata L. Walp., is a diploid warm-season legume of critical importance as both food and fodder in sub-Saharan Africa. This species is also grown in Northern Africa, Europe, Latin America, North America, and East to Southeast Asia. To capture the genomic diversity of domesticates of this important legume, de novo genome assemblies were produced for representatives of six subpopulations of cultivated cowpea identified previously from genotyping of several hundred diverse accessions. In the most complete assembly (IT97K-499-35), 26,026 core and 4963 noncore genes were identified, with 35,436 pan genes when considering all seven accessions. GO terms associated with response to stress and defense response were highly enriched among the noncore genes, while core genes were enriched in terms related to transcription factor activity, and transport and metabolic processes. Over 5 million single nucleotide polymorphisms (SNPs) relative to each assembly and over 40 structural variants >1 Mb in size were identified by comparing genomes. Vu10 was the chromosome with the highest frequency of SNPs, and Vu04 had the most structural variants. Noncore genes harbor a larger proportion of potentially disruptive variants than core genes, including missense, stop gain, and frameshift mutations; this suggests that noncore genes substantially contribute to diversity within domesticated cowpea.
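The core, noncore, and pan gene counts in this abstract follow from simple set operations over per-accession gene content. The sketch below assumes the common pan-genome convention (core genes are present in every accession, noncore genes are absent from at least one, and the pan set is the union); the abstract itself does not spell out its definition. The function name, accession labels, and gene identifiers are hypothetical.

```python
from typing import Dict, Set, Tuple


def classify_pan_genome(
    presence: Dict[str, Set[str]],
) -> Tuple[Set[str], Set[str], Set[str]]:
    """Return (core, noncore, pan) gene sets from per-accession gene content."""
    gene_sets = list(presence.values())
    if not gene_sets:
        raise ValueError("at least one accession is required")
    pan = set().union(*gene_sets)           # genes seen in any accession
    core = set.intersection(*gene_sets)     # genes seen in every accession
    noncore = pan - core                    # genes missing from at least one accession
    return core, noncore, pan


# Toy input: three made-up accessions with made-up gene identifiers.
toy = {
    "acc_A": {"g1", "g2", "g3"},
    "acc_B": {"g1", "g2", "g4"},
    "acc_C": {"g1", "g2", "g3", "g5"},
}
core, noncore, pan = classify_pan_genome(toy)

# Arithmetic on the figures reported in the abstract for IT97K-499-35:
# 26,026 core genes plus 4,963 noncore genes.
reported_total = 26_026 + 4_963  # 30,989 genes across the two categories
```

By construction the core and noncore sets are disjoint and together make up the pan set, which is why the two reported categories for a single assembly can be added directly.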
Affiliation(s)
- Qihua Liang
  - Department of Computer Science and Engineering, University of California Riverside, Riverside, CA, USA
- María Muñoz-Amatriaín
  - Department of Botany and Plant Sciences, University of California Riverside, Riverside, CA, USA
  - Departamento de Biología Molecular, Universidad de León, León, Spain
- Shengqiang Shu
  - US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Sassoum Lo
  - Department of Botany and Plant Sciences, University of California Riverside, Riverside, CA, USA
  - Department of Plant Sciences, University of California Davis, Davis, CA, USA
- Xinyi Wu
  - State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-products, Institute of Vegetables, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
- Joseph W Carlson
  - US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Patrick Davidson
  - US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- David M Goodstein
  - US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Jeremy Phillips
  - US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Nadia M Janis
  - Department of Agronomy and Plant Genetics, University of Minnesota Twin Cities, Saint Paul, MN, USA
- Elaine J Lee
  - Department of Agronomy and Plant Genetics, University of Minnesota Twin Cities, Saint Paul, MN, USA
- Chenxi Liang
  - Department of Agronomy and Plant Genetics, University of Minnesota Twin Cities, Saint Paul, MN, USA
- Peter L Morrell
  - Department of Agronomy and Plant Genetics, University of Minnesota Twin Cities, Saint Paul, MN, USA
- Pei Xu
  - Key Lab of Specialty Agri-Product Quality and Hazard Controlling Technology of Zhejiang Province, China Jiliang University, Hangzhou, China
- Timothy J Close
  - Department of Botany and Plant Sciences, University of California Riverside, Riverside, CA, USA
- Stefano Lonardi
  - Department of Computer Science and Engineering, University of California Riverside, Riverside, CA, USA
7. Bredeson JV, Mudd AB, Medina-Ruiz S, Mitros T, Smith OK, Miller KE, Lyons JB, Batra SS, Park J, Berkoff KC, Plott C, Grimwood J, Schmutz J, Aguirre-Figueroa G, Khokha MK, Lane M, Philipp I, Laslo M, Hanken J, Kerdivel G, Buisine N, Sachs LM, Buchholz DR, Kwon T, Smith-Parker H, Gridi-Papp M, Ryan MJ, Denton RD, Malone JH, Wallingford JB, Straight AF, Heald R, Hockemeyer D, Harland RM, Rokhsar DS. Conserved chromatin and repetitive patterns reveal slow genome evolution in frogs. Nat Commun 2024; 15:579. [PMID: 38233380 PMCID: PMC10794172 DOI: 10.1038/s41467-023-43012-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/28/2021] [Accepted: 10/27/2023] [Indexed: 01/19/2024] [Open access]
Abstract
Frogs are an ecologically diverse and phylogenetically ancient group of anuran amphibians that include important vertebrate cell and developmental model systems, notably the genus Xenopus. Here we report a high-quality reference genome sequence for the western clawed frog, Xenopus tropicalis, along with draft chromosome-scale sequences of three distantly related emerging model frog species, Eleutherodactylus coqui, Engystomops pustulosus, and Hymenochirus boettgeri. Frog chromosomes have remained remarkably stable since the Mesozoic Era, with limited Robertsonian (i.e., arm-preserving) translocations and end-to-end fusions found among the smaller chromosomes. Conservation of synteny includes conservation of centromere locations, marked by centromeric tandem repeats associated with Cenp-a binding surrounded by pericentromeric LINE/L1 elements. This work explores the structure of chromosomes across frogs, using a dense meiotic linkage map for X. tropicalis and chromatin conformation capture (Hi-C) data for all species. Abundant satellite repeats occupy the unusually long (~20 megabase) terminal regions of each chromosome that coincide with high rates of recombination. Both embryonic and differentiated cells show reproducible associations of centromeric chromatin and of telomeres, reflecting a Rabl-like configuration. Our comparative analyses reveal 13 conserved ancestral anuran chromosomes from which contemporary frog genomes were constructed.
Affiliation(s)
- Jessen V Bredeson
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
  - DOE-Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA
- Austin B Mudd
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Sofia Medina-Ruiz
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Therese Mitros
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Owen Kabnick Smith
  - Department of Biochemistry, Stanford University School of Medicine, 279 Campus Drive, Beckman Center 409, Stanford, CA, 94305-5307, USA
- Kelly E Miller
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Jessica B Lyons
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Sanjit S Batra
  - Computer Science Division, University of California Berkeley, 2626 Hearst Avenue, Berkeley, CA, 94720, USA
- Joseph Park
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Kodiak C Berkoff
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Christopher Plott
  - HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Jane Grimwood
  - HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Jeremy Schmutz
  - HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Guadalupe Aguirre-Figueroa
  - Department of Biochemistry, Stanford University School of Medicine, 279 Campus Drive, Beckman Center 409, Stanford, CA, 94305-5307, USA
- Mustafa K Khokha
  - Pediatric Genomics Discovery Program, Departments of Pediatrics and Genetics, Yale University School of Medicine, 333 Cedar Street, New Haven, CT, 06510, USA
- Maura Lane
  - Pediatric Genomics Discovery Program, Departments of Pediatrics and Genetics, Yale University School of Medicine, 333 Cedar Street, New Haven, CT, 06510, USA
- Isabelle Philipp
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Mara Laslo
  - Department of Organismic and Evolutionary Biology, and Museum of Comparative Zoology, Harvard University, Cambridge, MA, 02138, USA
- James Hanken
  - Department of Organismic and Evolutionary Biology, and Museum of Comparative Zoology, Harvard University, Cambridge, MA, 02138, USA
- Gwenneg Kerdivel
  - Département Adaptation du Vivant, UMR 7221 CNRS, Muséum National d'Histoire Naturelle, Paris, France
- Nicolas Buisine
  - Département Adaptation du Vivant, UMR 7221 CNRS, Muséum National d'Histoire Naturelle, Paris, France
- Laurent M Sachs
  - Département Adaptation du Vivant, UMR 7221 CNRS, Muséum National d'Histoire Naturelle, Paris, France
- Daniel R Buchholz
  - Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
- Taejoon Kwon
  - Department of Biomedical Engineering, Ulsan National Institute of Science and Technology, Ulsan, 44919, Republic of Korea
  - Center for Genomic Integrity, Institute for Basic Science (IBS), Ulsan, 44919, Republic of Korea
- Heidi Smith-Parker
  - Department of Integrative Biology, Patterson Labs, 2401 Speedway, University of Texas, Austin, TX, 78712, USA
- Marcos Gridi-Papp
  - Department of Biological Sciences, University of the Pacific, 3601 Pacific Avenue, Stockton, CA, 95211, USA
- Michael J Ryan
  - Department of Integrative Biology, Patterson Labs, 2401 Speedway, University of Texas, Austin, TX, 78712, USA
- Robert D Denton
  - Department of Molecular and Cell Biology and Institute of Systems Genomics, University of Connecticut, 181 Auditorium Road, Unit 3197, Storrs, CT, 06269, USA
- John H Malone
  - Department of Molecular and Cell Biology and Institute of Systems Genomics, University of Connecticut, 181 Auditorium Road, Unit 3197, Storrs, CT, 06269, USA
- John B Wallingford
  - Department of Molecular Biosciences, Patterson Labs, 2401 Speedway, The University of Texas at Austin, Austin, TX, 78712, USA
- Aaron F Straight
  - Department of Biochemistry, Stanford University School of Medicine, 279 Campus Drive, Beckman Center 409, Stanford, CA, 94305-5307, USA
- Rebecca Heald
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Dirk Hockemeyer
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
  - Innovative Genomics Institute, University of California, Berkeley, CA, 94720, USA
  - Chan-Zuckerberg BioHub, 499 Illinois Street, San Francisco, CA, 94158, USA
- Richard M Harland
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA
- Daniel S Rokhsar
  - Department of Molecular and Cell Biology, Weill Hall, University of California, Berkeley, CA, 94720, USA.
  - DOE-Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.
  - Innovative Genomics Institute, University of California, Berkeley, CA, 94720, USA.
  - Chan-Zuckerberg BioHub, 499 Illinois Street, San Francisco, CA, 94158, USA.
  - Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, 9040495, Japan.
8. Gupta P, Geniza M, Elser J, Al-Bader N, Baschieri R, Phillips JL, Haq E, Preece J, Naithani S, Jaiswal P. Reference genome of the nutrition-rich orphan crop chia (Salvia hispanica) and its implications for future breeding. Frontiers in Plant Science 2023; 14:1272966. [PMID: 38162307 PMCID: PMC10757625 DOI: 10.3389/fpls.2023.1272966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/04/2023] [Accepted: 10/23/2023] [Indexed: 01/03/2024]
Abstract
Chia (Salvia hispanica L.) is one of the most popular nutrition-rich foods and pseudocereal crops of the family Lamiaceae. Chia seeds are a rich source of proteins, polyunsaturated fatty acids (PUFAs), dietary fibers, and antioxidants. In this study, we present the assembly of the chia reference genome, which spans 303.6 Mb and encodes 48,090 annotated protein-coding genes. Our analysis revealed that ~42% of the chia genome harbors repetitive content, and identified ~3 million single nucleotide polymorphisms (SNPs) and 15,380 simple sequence repeat (SSR) marker sites. By investigating the chia transcriptome, we discovered that ~44% of the genes undergo alternative splicing with a higher frequency of intron retention events. Additionally, we identified chia genes associated with important nutrient content and quality traits, such as the biosynthesis of PUFAs and seed mucilage fiber (dietary fiber) polysaccharides. Notably, this is the first report of in-silico annotation of a plant genome for protein-derived small bioactive peptides (biopeptides) associated with improving human health. To facilitate further research and translational applications of this valuable orphan crop, we have developed the Salvia genomics database (SalviaGDB), accessible at https://salviagdb.org.
Affiliation(s)
- Parul Gupta
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Matthew Geniza
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
  - Molecular and Cellular Biology Graduate Program, Oregon State University, Corvallis, OR, United States
- Justin Elser
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Noor Al-Bader
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
  - Molecular and Cellular Biology Graduate Program, Oregon State University, Corvallis, OR, United States
- Rachel Baschieri
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Jeremy Levi Phillips
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Ebaad Haq
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Justin Preece
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Sushma Naithani
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Pankaj Jaiswal
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
9. Destanović D, Schultz DT, Styfhals R, Cruz F, Gómez-Garrido J, Gut M, Gut I, Fiorito G, Simakov O, Alioto TS, Ponte G, Seuntjens E. A chromosome-level reference genome for the common octopus, Octopus vulgaris (Cuvier, 1797). G3 (Bethesda, Md.) 2023; 13:jkad220. [PMID: 37850903 PMCID: PMC10700109 DOI: 10.1093/g3journal/jkad220] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 05/15/2023] [Accepted: 08/18/2023] [Indexed: 10/19/2023]
Abstract
Cephalopods are emerging animal models and include iconic species for studying the link between genomic innovations and physiological and behavioral complexities. Coleoid cephalopods possess the largest nervous system among invertebrates, both for cell counts and brain-to-body ratio. Octopus vulgaris has been at the center of a long-standing tradition of research into diverse aspects of cephalopod biology, including behavioral and neural plasticity, learning and memory recall, regeneration, and sophisticated cognition. However, no chromosome-scale genome assembly was available for O. vulgaris to aid in functional studies. To fill this gap, we sequenced and assembled a chromosome-scale genome of the common octopus, O. vulgaris. The final assembly spans 2.8 billion basepairs, 99.34% of which are in 30 chromosome-scale scaffolds. Hi-C heatmaps support a karyotype of 1n = 30 chromosomes. Comparisons with other octopus species' genomes show a conserved octopus karyotype and a pattern of local genome rearrangements between species. This new chromosome-scale genome of O. vulgaris will further facilitate research in all aspects of cephalopod biology, including various forms of plasticity and the neural machinery underlying sophisticated cognition, as well as an understanding of cephalopod evolution.
Affiliation(s)
- Dalila Destanović
  - Department of Neurosciences and Developmental Biology, University of Vienna, Vienna 1030, Austria
- Darrin T Schultz
  - Department of Neurosciences and Developmental Biology, University of Vienna, Vienna 1030, Austria
- Ruth Styfhals
  - Department of Biology, Lab of Developmental Neurobiology, Animal Physiology and Neurobiology Division, KU Leuven, Leuven 3000, Belgium
  - Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Naples 80121, Italy
- Fernando Cruz
  - Centro Nacional de Análisis Genómico (CNAG), Barcelona 08028, Spain
- Marta Gut
  - Centro Nacional de Análisis Genómico (CNAG), Barcelona 08028, Spain
- Ivo Gut
  - Centro Nacional de Análisis Genómico (CNAG), Barcelona 08028, Spain
- Graziano Fiorito
  - Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Naples 80121, Italy
- Oleg Simakov
  - Department of Neurosciences and Developmental Biology, University of Vienna, Vienna 1030, Austria
- Tyler S Alioto
  - Centro Nacional de Análisis Genómico (CNAG), Barcelona 08028, Spain
- Giovanna Ponte
  - Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Naples 80121, Italy
- Eve Seuntjens
  - Department of Biology, Lab of Developmental Neurobiology, Animal Physiology and Neurobiology Division, KU Leuven, Leuven 3000, Belgium
  - KU Leuven Institute for Single Cell Omics (LISCO), KU Leuven, Leuven 3000, Belgium
  - Leuven Brain Institute, KU Leuven, Leuven 3000, Belgium
10
Lavretsky P, Hernández F, Swale T, Mohl JE. Chromosomal-level reference genome of a wild North American mallard (Anas platyrhynchos). G3 (Bethesda, Md.) 2023; 13:jkad171. [PMID: 37523777 PMCID: PMC10542157 DOI: 10.1093/g3journal/jkad171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Revised: 07/07/2023] [Accepted: 07/10/2023] [Indexed: 08/02/2023]
Abstract
The mallard (Anas platyrhynchos) is one of the most common and most economically and socially important birds in the world. Mallards were not only an important food source for early humans but eventually became intimately linked with people as they were domesticated over the last 2,000 years. To date, mallard genomes are largely reconstructed from samples of domestic or unknown genetic heritage. Here, we report the first high-quality genome assembly and annotation of a genetically vetted wild mallard from North America (NAwild_v1.0). The genome was assembled using a combination of shotgun libraries, proximity ligation Chicago, and Dovetail Hi-C libraries. The final assembly is ∼1.04 Gb in size, with 98.3% of the sequence located in 30 full or nearly full chromosome-level scaffolds, and with an N50/L50 of 79.1 Mb/4 scaffolds. We used a combination of gene prediction and similarity approaches to annotate a total of 23,584 functional genes, of which 19,242 were associated with GO terms. The genome assembly and the set of annotated genes yielded a 95.4% completeness score when compared with the BUSCO aves_odb10 dataset. Next, we aligned 3 previously published mallard genomes to ours and demonstrate that their runs of homozygosity and nucleotide diversity are substantially higher and lower, respectively, than in ours, and that these artificially changed genomes resulted in profoundly different and biased demographic histories. Our wild mallard assembly not only provides a valuable resource to shed light on genome evolution, speciation, and other adaptive processes, but also helps identify functional genes that have been significantly altered during the domestication process.
Affiliation(s)
- Philip Lavretsky
  - Department of Biological Sciences, University of Texas at El Paso, El Paso, TX 79968, USA
- Flor Hernández
  - Department of Biological Sciences, University of Texas at El Paso, El Paso, TX 79968, USA
- Thomas Swale
  - Cantata Bio, 100 Enterprise Way Suite A101, Scotts Valley, CA 95066
- Jonathon E Mohl
  - Department of Mathematical Sciences, University of Texas at El Paso, El Paso, TX 79968, USA
11
Elizondo EC, Faircloth BC, Brumfield RT, Shakya SB, Ellis VA, Schmidt CJ, Kovach AI, Gregory Shriver W. A high-quality de novo genome assembly for clapper rail (Rallus crepitans). G3 (Bethesda, Md.) 2023; 13:jkad097. [PMID: 37130071 PMCID: PMC10484055 DOI: 10.1093/g3journal/jkad097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/26/2022] [Revised: 03/26/2022] [Accepted: 03/10/2023] [Indexed: 05/03/2023]
Abstract
The clapper rail (Rallus crepitans), of the family Rallidae, is a secretive marsh bird species that is adapted for high salinity habitats. They are very similar in appearance to the closely related king rail (R. elegans), but while king rails are limited primarily to freshwater marshes, clapper rails are highly adapted to tolerate salt marshes. Both species can be found in brackish marshes where they freely hybridize, but the distribution of their respective habitats precludes the formation of a continuous hybrid zone and secondary contact can occur repeatedly. This system, thus, provides unique opportunities to investigate the underlying mechanisms driving their differential salinity tolerance as well as the maintenance of the species boundary between the 2 species. To facilitate these studies, we generated a de novo reference genome assembly for a female clapper rail. Chicago and HiC libraries were prepared as input for the Dovetail HiRise pipeline to scaffold the genome. The pipeline, however, did not recover the Z chromosome, so a custom script was used to assemble it. We generated a near chromosome-level assembly with a total length of 994.8 Mb comprising 13,226 scaffolds. The assembly had a scaffold N50 of 82.7 Mb, an L50 of four, and a BUSCO completeness score of 92%. This assembly is among the most contiguous genome assemblies for species in the family Rallidae. It will serve as an important tool in future studies on avian salinity tolerance, interspecific hybridization, and speciation.
Affiliation(s)
- Elisa C Elizondo
  - Department of Entomology and Wildlife Ecology, University of Delaware, Newark, DE 19716, USA
- Brant C Faircloth
  - Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
- Robb T Brumfield
  - Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
- Subir B Shakya
  - Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
- Vincenzo A Ellis
  - Department of Entomology and Wildlife Ecology, University of Delaware, Newark, DE 19716, USA
- Carl J Schmidt
  - Department of Animal and Food Sciences, University of Delaware, Newark, DE 19716, USA
- Adrienne I Kovach
  - Department of Natural Resources, University of New Hampshire, Durham, NH 03824, USA
- W Gregory Shriver
  - Department of Entomology and Wildlife Ecology, University of Delaware, Newark, DE 19716, USA
12
Inwood SN, Skelly J, Guhlin JG, Harrop TWR, Goldson SL, Dearden PK. Chromosome-level genome assemblies of two parasitoid biocontrol wasps reveal the parthenogenesis mechanism and an associated novel virus. BMC Genomics 2023; 24:440. [PMID: 37543591 PMCID: PMC10403939 DOI: 10.1186/s12864-023-09538-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/22/2023] [Accepted: 07/27/2023] [Indexed: 08/07/2023] [Open access]
Abstract
BACKGROUND: Biocontrol is a key technology for the control of pest species. Microctonus parasitoid wasps (Hymenoptera: Braconidae) have been released in Aotearoa New Zealand as biocontrol agents, targeting three different pest weevil species. Despite their value as biocontrol agents, no genome assemblies are currently available for these Microctonus wasps, limiting investigations into key biological differences between the different species and strains. METHODS AND FINDINGS: Here we present high-quality genomes for Microctonus hyperodae and Microctonus aethiopoides, assembled with short read sequencing and Hi-C scaffolding. These assemblies have total lengths of 106.7 Mb for M. hyperodae and 129.2 Mb for M. aethiopoides, with scaffold N50 values of 9 Mb and 23 Mb respectively. With these assemblies we investigated differences in reproductive mechanisms, and association with viruses between Microctonus wasps. Meiosis-specific genes are conserved in asexual Microctonus, with in-situ hybridisation validating expression of one of these genes in the ovaries of asexual Microctonus aethiopoides. This implies asexual reproduction in these Microctonus wasps involves meiosis, with the potential for sexual reproduction maintained. Investigation of viral gene content revealed candidate genes that may be involved in virus-like particle production in M. aethiopoides, as well as a novel virus infecting M. hyperodae, for which a complete genome was assembled. CONCLUSION AND SIGNIFICANCE: These are the first published genomes for Microctonus wasps that have been deployed as biocontrol agents in Aotearoa New Zealand. These assemblies will be valuable resources for continued investigation and monitoring of these biocontrol systems. Understanding the biology underpinning Microctonus biocontrol is crucial if we are to maintain its efficacy, or in the case of M. hyperodae to understand what may have influenced the significant decline of biocontrol efficacy.
The potential for sexual reproduction in asexual Microctonus is significant given that empirical modelling suggests this asexual reproduction is likely to have contributed to biocontrol decline. Furthermore, the identification of a novel virus in M. hyperodae highlights a previously unknown aspect of this biocontrol system, which may contribute to premature mortality of the host pest. These findings have the potential to be exploited in the future in an attempt to increase the effectiveness of M. hyperodae biocontrol.
Affiliation(s)
- Sarah N Inwood
  - Bioprotection Aotearoa and Biochemistry Department, University of Otago, Dunedin, Aotearoa, New Zealand
- John Skelly
  - Bioprotection Aotearoa and Biochemistry Department, University of Otago, Dunedin, Aotearoa, New Zealand
  - Humble Bee Bio, Wellington, Aotearoa, New Zealand
- Joseph G Guhlin
  - Genomics Aotearoa, University of Otago, Dunedin, Aotearoa, New Zealand
- Thomas W R Harrop
  - Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia
- Stephen L Goldson
  - Biocontrol and Biosecurity Group, AgResearch Limited, Lincoln, Aotearoa, New Zealand
- Peter K Dearden
  - Bioprotection Aotearoa and Biochemistry Department, University of Otago, Dunedin, Aotearoa, New Zealand
  - Genomics Aotearoa, University of Otago, Dunedin, Aotearoa, New Zealand
13
Perera OP, Saha S, Glover J, Parys KA, Allen KC, Grozeva S, Kurtz R, Reddy GVP, Johnston JS, Daly M, Swale T. A chromosome scale assembly of the tarnished plant bug, Lygus lineolaris (Palisot de Beauvois), genome. BMC Res Notes 2023; 16:125. [PMID: 37370172 DOI: 10.1186/s13104-023-06408-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Accepted: 06/19/2023] [Indexed: 06/29/2023] [Open access]
Abstract
OBJECTIVE: The tarnished plant bug (TPB), Lygus lineolaris (Palisot de Beauvois) (Hemiptera: Miridae), is a pest damaging many cultivated crops in North America. Although partial transcriptome data are available for this pest, a genome assembly was not available for this species. This assembly of a high-quality chromosome-length genome of TPB aims to develop the genetic resources that can provide the foundation required for advancing research on this species. RESULTS: The initial genome of TPB, assembled from paired-end nucleotide sequences generated with Illumina technology, was scaffolded with Illumina HiSeq X reads generated from a proximity-ligated (HiC) library to obtain a high-quality genome assembly. The final assembly contained 3963 scaffolds longer than 1 kbp, yielding a genome of 599.96 Mbp. The N50 of the TPB genome assembly was 35.64 Mbp, and 98.68% of the genome was assembled into 17 scaffolds larger than 1 Mbp. This megabase scaffold number is the same as the number of chromosomes observed in karyotyping of this insect. The TPB genome is known to have high repetitive DNA content, and the reduced assembled genome size compared to flow cytometric estimates of approximately 860 Mbp may be due to the collapsed assembly of highly similar regions.
Affiliation(s)
- O P Perera
  - Southern Insect Management Research Unit, USDA ARS, 141 Experiment Station Road, Stoneville, MS, 38776, USA
- Surya Saha
  - Boyce Thompson Institute, 533 Tower Rd, Ithaca, NY, 14853, USA
- James Glover
  - Southern Insect Management Research Unit, USDA ARS, 141 Experiment Station Road, Stoneville, MS, 38776, USA
- Katherine A Parys
  - Pollinator Health in Southern Crop Ecosystems Research Unit, USDA ARS, 141 Experiment Station Road, Stoneville, MS, 38776, USA
- K Clint Allen
  - Southern Insect Management Research Unit, USDA ARS, 141 Experiment Station Road, Stoneville, MS, 38776, USA
- Snejana Grozeva
  - Institute of Zoology, Bulgarian Academy of Sciences, 1 Tsar Osvoboditel, Sofia, 1000, Bulgaria
- Ryan Kurtz
  - Cotton, Incorporated, Cary, NC, 27513, USA
- Gadi V P Reddy
  - Southern Insect Management Research Unit, USDA ARS, 141 Experiment Station Road, Stoneville, MS, 38776, USA
- J Spencer Johnston
  - Department of Entomology, Texas A&M University, College Station, TX, 77843, USA
- Mark Daly
  - Dovetail Genomics, LLC, 100 Enterprise Way, Suite A101, Scotts Valley, CA, 95066, USA
- Thomas Swale
  - Dovetail Genomics, LLC, 100 Enterprise Way, Suite A101, Scotts Valley, CA, 95066, USA
14
Marlétaz F, Timoshevskaya N, Timoshevskiy V, Simakov O, Parey E, Gavriouchkina D, Suzuki M, Kubokawa K, Brenner S, Smith J, Rokhsar DS. The hagfish genome and the evolution of vertebrates. bioRxiv 2023:2023.04.17.537254. [PMID: 37131617 PMCID: PMC10153176 DOI: 10.1101/2023.04.17.537254] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Indexed: 05/04/2023]
Abstract
As the only surviving lineages of jawless fishes, hagfishes and lampreys provide a critical window into early vertebrate evolution. Here, we investigate the complex history, timing, and functional role of genome-wide duplications in vertebrates in the light of a chromosome-scale genome of the brown hagfish Eptatretus atami. Using robust chromosome-scale (paralogon-based) phylogenetic methods, we confirm the monophyly of cyclostomes, document an auto-tetraploidization (1RV) that predated the origin of crown group vertebrates ~517 Mya, and establish the timing of subsequent independent duplications in the gnathostome and cyclostome lineages. Some 1RV gene duplications can be linked to key vertebrate innovations, suggesting that this early genomewide event contributed to the emergence of pan-vertebrate features such as neural crest. The hagfish karyotype is derived by numerous fusions relative to the ancestral cyclostome arrangement preserved by lampreys. These genomic changes were accompanied by the loss of genes essential for organ systems (eyes, osteoclast) that are absent in hagfish, accounting in part for the simplification of the hagfish body plan; other gene family expansions account for hagfishes' capacity to produce slime. Finally, we characterise programmed DNA elimination in somatic cells of hagfish, identifying protein-coding and repetitive elements that are deleted during development. As in lampreys, the elimination of these genes provides a mechanism for resolving genetic conflict between soma and germline by repressing germline/pluripotency functions. Reconstruction of the early genomic history of vertebrates provides a framework for further exploration of vertebrate novelties.
Affiliation(s)
- Ferdinand Marlétaz
  - Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- Oleg Simakov
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
  - Department of Molecular Evolution and Development, University of Vienna, Vienna, Austria
- Elise Parey
  - Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
- Daria Gavriouchkina
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
  - Present address: UK Dementia Research Institute, University College London, London, UK
- Masakazu Suzuki
  - Department of Science, Graduate School of Integrated Science and Technology, Shizuoka University, Shizuoka, Japan
- Kaoru Kubokawa
  - Ocean Research Institute, The University of Tokyo, Tokyo, Japan
- Sydney Brenner
  - Comparative and Medical Genomics Laboratory, Institute of Molecular and Cell Biology, A*STAR, Biopolis, Singapore 138673, Singapore
  - Deceased
- Jeramiah Smith
  - Department of Biology, University of Kentucky, Lexington, KY, USA
- Daniel S Rokhsar
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
  - Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
  - Chan Zuckerberg Biohub, San Francisco, CA, USA
15
Marlétaz F, Couloux A, Poulain J, Labadie K, Da Silva C, Mangenot S, Noel B, Poustka AJ, Dru P, Pegueroles C, Borra M, Lowe EK, Lhomond G, Besnardeau L, Le Gras S, Ye T, Gavriouchkina D, Russo R, Costa C, Zito F, Anello L, Nicosia A, Ragusa MA, Pascual M, Molina MD, Chessel A, Di Carlo M, Turon X, Copley RR, Exposito JY, Martinez P, Cavalieri V, Ben Tabou de Leon S, Croce J, Oliveri P, Matranga V, Di Bernardo M, Morales J, Cormier P, Geneviève AM, Aury JM, Barbe V, Wincker P, Arnone MI, Gache C, Lepage T. Analysis of the P. lividus sea urchin genome highlights contrasting trends of genomic and regulatory evolution in deuterostomes. Cell Genomics 2023; 3:100295. [PMID: 37082140 PMCID: PMC10112332 DOI: 10.1016/j.xgen.2023.100295] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Received: 05/08/2022] [Revised: 12/24/2022] [Accepted: 03/06/2023] [Indexed: 04/22/2023]
Abstract
Sea urchins are emblematic models in developmental biology and display several characteristics that set them apart from other deuterostomes. To uncover the genomic cues that may underlie these specificities, we generated a chromosome-scale genome assembly for the sea urchin Paracentrotus lividus and extensive gene expression and epigenetic profiles of its embryonic development. We found that, unlike vertebrates, sea urchins retained ancestral chromosomal linkages but underwent very fast intrachromosomal gene order mixing. We identified a burst of gene duplication in the echinoid lineage and showed that some of these expanded genes have been recruited in novel structures (water vascular system, Aristotle's lantern, and skeletogenic micromere lineage). Finally, we identified gene-regulatory modules conserved between sea urchins and chordates. Our results suggest that gene-regulatory networks controlling development can be conserved despite extensive gene order rearrangement.
Affiliation(s)
- Ferdinand Marlétaz
  - Center for Life’s Origin & Evolution, Department of Genetics, Evolution, & Environment, University College London, WC1 6BT London, UK
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
  - Genoscope, Institut de Biologie François-Jacob, Commissariat à l’Énergie Atomique (CEA), Université Paris-Saclay, Évry, France
- Arnaud Couloux
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Julie Poulain
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Karine Labadie
  - Genoscope, Institut de Biologie François-Jacob, Commissariat à l’Énergie Atomique (CEA), Université Paris-Saclay, Évry, France
- Corinne Da Silva
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Sophie Mangenot
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Benjamin Noel
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Albert J. Poustka
  - Evolution and Development Group, Max-Planck-Institut für Molekulare Genetik, 14195 Berlin, Germany
  - Dahlem Center for Genome Research and Medical Systems Biology (Environmental and Phylogenomics Group), 12489 Berlin, Germany
- Philippe Dru
  - Laboratoire de Biologie du Développement de Villefranche-sur-Mer (LBDV), Sorbonne Université, CNRS, 06230 Villefranche-sur-Mer, France
- Cinta Pegueroles
  - Institute for Research on Biodiversity (IRBio), Department of Genetics, Microbiology, and Statistics, University of Barcelona, 08028 Barcelona, Spain
- Marco Borra
  - Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy
- Elijah K. Lowe
  - Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy
- Guy Lhomond
  - Laboratoire de Biologie du Développement de Villefranche-sur-Mer (LBDV), Sorbonne Université, CNRS, 06230 Villefranche-sur-Mer, France
- Lydia Besnardeau
  - Laboratoire de Biologie du Développement de Villefranche-sur-Mer (LBDV), Sorbonne Université, CNRS, 06230 Villefranche-sur-Mer, France
- Stéphanie Le Gras
  - Plateforme GenomEast, IGBMC, CNRS UMR7104, INSERM U1258, Université de Strasbourg, 67404 Illkirch Cedex, France
- Tao Ye
  - Plateforme GenomEast, IGBMC, CNRS UMR7104, INSERM U1258, Université de Strasbourg, 67404 Illkirch Cedex, France
- Daria Gavriouchkina
  - Molecular Genetics Unit, Okinawa Institute of Science and Technology, 904-0495 Onna-son, Japan
- Roberta Russo
  - Consiglio Nazionale delle Ricerche, Istituto per la Ricerca e l’Innovazione Biomedica (IRIB), 90146 Palermo, Italy
- Caterina Costa
  - Consiglio Nazionale delle Ricerche, Istituto per la Ricerca e l’Innovazione Biomedica (IRIB), 90146 Palermo, Italy
- Francesca Zito
  - Consiglio Nazionale delle Ricerche, Istituto per la Ricerca e l’Innovazione Biomedica (IRIB), 90146 Palermo, Italy
- Letizia Anello
  - Consiglio Nazionale delle Ricerche, Istituto per la Ricerca e l’Innovazione Biomedica (IRIB), 90146 Palermo, Italy
- Aldo Nicosia
  - Consiglio Nazionale delle Ricerche, Istituto per la Ricerca e l’Innovazione Biomedica (IRIB), 90146 Palermo, Italy
- Maria Antonietta Ragusa
  - Department of Biological, Chemical and Pharmaceutical Sciences and Technologies, University of Palermo, 90128 Palermo, Italy
- Marta Pascual
  - Institute for Research on Biodiversity (IRBio), Department of Genetics, Microbiology, and Statistics, University of Barcelona, 08028 Barcelona, Spain
- M. Dolores Molina
  - Departament de Genètica, Microbiologia, i Estadística, Universitat de Barcelona, 08028 Barcelona, Spain
  - Institut Biology Valrose, Université Côte d’Azur, 06108 Nice Cedex 2, France
- Aline Chessel
  - Institut Biology Valrose, Université Côte d’Azur, 06108 Nice Cedex 2, France
- Marta Di Carlo
  - Institute for Biomedical Research and Innovation (CNR), 90146 Palermo, Italy
- Xavier Turon
  - Department of Marine Ecology, Centre d’Estudis Avançats de Blanes (CEAB, CSIC), 17300 Blanes, Spain
- Richard R. Copley
  - Laboratoire de Biologie du Développement de Villefranche-sur-Mer (LBDV), Sorbonne Université, CNRS, 06230 Villefranche-sur-Mer, France
- Jean-Yves Exposito
  - Laboratoire de Biologie Tissulaire et d’Ingénierie Thérapeutique (LBTI), UMR CNRS 5305, Institut de Biologie et Chimie des Protéines, Université Lyon 1, 69367 Lyon, France
- Pedro Martinez
  - Departament de Genètica, Microbiologia, i Estadística, Universitat de Barcelona, 08028 Barcelona, Spain
  - Institut Català de Recerca i Estudis Avançats (ICREA), 08028 Barcelona, Spain
- Vincenzo Cavalieri
  - Department of Biological, Chemical and Pharmaceutical Sciences and Technologies, University of Palermo, 90128 Palermo, Italy
- Smadar Ben Tabou de Leon
  - Department of Marine Biology, Charney School of Marine Sciences, University of Haifa, 31095 Haifa, Israel
- Jenifer Croce
  - Laboratoire de Biologie du Développement de Villefranche-sur-Mer (LBDV), Sorbonne Université, CNRS, 06230 Villefranche-sur-Mer, France
- Paola Oliveri
  - Center for Life’s Origin & Evolution, Department of Genetics, Evolution, & Environment, University College London, WC1 6BT London, UK
- Valeria Matranga
  - Consiglio Nazionale delle Ricerche, Istituto per la Ricerca e l’Innovazione Biomedica (IRIB), 90146 Palermo, Italy
- Maria Di Bernardo
  - Consiglio Nazionale delle Ricerche, Istituto di Farmacologia Traslazionale, 90146 Palermo, Italy
- Julia Morales
  - Integrative Biology of Marine Models (LBI2M), Station Biologique de Roscoff, CNRS, Sorbonne Université, 29680 Roscoff, France
- Patrick Cormier
  - Integrative Biology of Marine Models (LBI2M), Station Biologique de Roscoff, CNRS, Sorbonne Université, 29680 Roscoff, France
- Anne-Marie Geneviève
  - Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins, BIOM, 66650 Banyuls/Mer, France
- Jean Marc Aury
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Valérie Barbe
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Patrick Wincker
  - Génomique Métabolique, Genoscope, Institut de Biologie François Jacob, Commissariat à l’Énergie Atomique, CNRS, Université Évry, Université Paris-Saclay, 91057 Évry, France
- Maria Ina Arnone
  - Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy
- Christian Gache
  - Laboratoire de Biologie du Développement de Villefranche-sur-Mer (LBDV), Sorbonne Université, CNRS, 06230 Villefranche-sur-Mer, France
- Thierry Lepage
  - Institut Biology Valrose, Université Côte d’Azur, 06108 Nice Cedex 2, France
16
Shirasawa K, Moraga R, Ghelfi A, Hirakawa H, Nagasaki H, Ghamkhar K, Barrett BA, Griffiths AG, Isobe SN. An improved reference genome for Trifolium subterraneum L. provides insight into molecular diversity and intra-specific phylogeny. Frontiers in Plant Science 2023; 14:1103857. [PMID: 36875612 PMCID: PMC9975737 DOI: 10.3389/fpls.2023.1103857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/21/2022] [Accepted: 01/30/2023] [Indexed: 06/18/2023]
Abstract
Subterranean clover (Trifolium subterraneum L., Ts) is a geocarpic, self-fertile annual forage legume with a compact diploid genome (n = x = 8, 544 Mb/1C). Its resilience and climate adaptivity have made it an economically important species in Mediterranean and temperate zones. Using the cultivar Daliak, we generated higher resolution sequence data, created a new genome assembly TSUd_3.0, and conducted molecular diversity analysis for copy number variant (CNV) and single-nucleotide polymorphism (SNP) among 36 cultivars. TSUd_3.0 substantively improves prior genome assemblies with new Hi-C and long-read sequence data, covering 531 Mb, containing 41,979 annotated genes and generating a 94.4% BUSCO score. Comparative genomic analysis among select members of the tribe Trifolieae indicated TSUd_3.0 corrects six assembly-error inversion/duplications and confirmed phylogenetic relationships. Its synteny with T. pratense, T. repens, Medicago truncatula and Lotus japonicus genomes was assessed, with the more distantly related T. repens and M. truncatula showing higher levels of co-linearity with Ts than between Ts and its close relative T. pratense. Resequencing of 36 cultivars discovered 7,789,537 SNPs subsequently used for genomic diversity assessment and sequence-based clustering. Heterozygosity estimates ranged from 1% to 21% within the 36 cultivars and may be influenced by admixture. Phylogenetic analysis supported subspecific genetic structure, although it indicates four or five groups, rather than the three recognized subspecies. Furthermore, there were instances where cultivars characterized as belonging to a particular subspecies clustered with another subspecies when using genomic data. These outcomes suggest that further investigation of Ts sub-specific classification using molecular and morpho-physiological data is needed to clarify these relationships.
This upgraded reference genome, complemented with comprehensive sequence diversity analysis of 36 cultivars, provides a platform for future gene functional analysis of key traits, and genome-based breeding strategies for climate adaptation and agronomic performance. Pangenome analysis, more in-depth intra-specific phylogenomic analysis using the Ts core collection, and functional genetic and genomic studies are needed to further augment knowledge of Trifolium genomes.
Affiliation(s)
- Kenta Shirasawa
  - Department of Frontier Research and Development, Kazusa DNA Research Institute, Kisarazu, Japan
- Roger Moraga
  - AgResearch, Grasslands Research Centre, Palmerston North, New Zealand
  - Tea Break Bioinformatics Limited, Palmerston North, New Zealand
- Andrea Ghelfi
  - Department of Frontier Research and Development, Kazusa DNA Research Institute, Kisarazu, Japan
  - Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Japan
- Hideki Hirakawa
  - Department of Frontier Research and Development, Kazusa DNA Research Institute, Kisarazu, Japan
- Hideki Nagasaki
  - Department of Frontier Research and Development, Kazusa DNA Research Institute, Kisarazu, Japan
- Kioumars Ghamkhar
  - AgResearch, Grasslands Research Centre, Palmerston North, New Zealand
- Brent A. Barrett
  - AgResearch, Grasslands Research Centre, Palmerston North, New Zealand
- Sachiko N. Isobe
  - Department of Frontier Research and Development, Kazusa DNA Research Institute, Kisarazu, Japan
17
Cossette ML, Stewart DT, Haghani A, Zoller JA, Shafer ABA, Horvath S. Epigenetics and island-mainland divergence in an insectivorous small mammal. Mol Ecol 2023; 32:152-166. [PMID: 36226847 DOI: 10.1111/mec.16735] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 04/15/2022] [Revised: 09/20/2022] [Accepted: 09/28/2022] [Indexed: 12/29/2022]
Abstract
Geographically isolated populations, specifically island-mainland counterparts, tend to exhibit phenotypic variation in many species. The so-called island syndrome occurs when different environmental pressures lead to insular divergence from mainland populations. This phenomenon can be seen in an island population of Nova Scotia masked shrews (Sorex cinereus), which have developed a specialized feeding habit and digestive enzyme compared to their mainland counterparts. Epigenetic modifications, such as DNA methylation (DNAm), can impact phenotypes by altering gene expression without changing the DNA sequence. Here, we used a de novo masked shrew genome assembly and a mammalian methylation array profiling 37 thousand conserved CpGs to investigate morphological and DNA methylation patterns between island and mainland populations. Island shrews were morphologically and epigenetically different from their mainland counterparts, exhibiting a smaller body size. Gene ontology enrichment analyses of differentially methylated CpGs implicated developmental and digestive-system-related pathways. Based on our shrew epigenetic clock, island shrews might also be aging faster than their mainland counterparts. This study provides novel insight into phenotypic and epigenetic divergence in island-mainland mammal populations and suggests an underlying role of methylation in island-mainland divergence.
Affiliation(s)
- Marie-Laurence Cossette
  - Department of Environmental Life Sciences Graduate Program, Trent University, Peterborough, Ontario, Canada
- Donald T Stewart
  - Department of Biology, Acadia University, Wolfville, Nova Scotia, Canada
- Amin Haghani
  - Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California, USA
- Joseph A Zoller
  - Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, California, USA
- Aaron B A Shafer
  - Department of Environmental Life Sciences Graduate Program, Trent University, Peterborough, Ontario, Canada
  - Department of Forensic Science, Trent University, Peterborough, Ontario, Canada
- Steve Horvath
  - Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California, USA
  - Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, California, USA
  - Altos Labs, San Diego, California, USA
|
18
|
Evans BJ, Mudd AB, Bredeson JV, Furman BLS, Wasonga DV, Lyons JB, Harland RM, Rokhsar DS. New insights into Xenopus sex chromosome genomics from the Marsabit clawed frog X. borealis. J Evol Biol 2022; 35:1777-1790. [PMID: 36054077 PMCID: PMC9722552 DOI: 10.1111/jeb.14078] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 02/08/2022] [Revised: 06/23/2022] [Accepted: 07/14/2022] [Indexed: 11/26/2022]
Abstract
In many groups, sex chromosomes change frequently but the drivers of their rapid evolution are varied and often poorly characterized. With an aim of further understanding sex chromosome turnover, we investigated the polymorphic sex chromosomes of the Marsabit clawed frog, Xenopus borealis, using genomic data and a new chromosome-scale genome assembly. We confirmed previous findings that 54.1 Mb of chromosome 8L is sex-linked in animals from east Kenya and a laboratory strain, but most (or all) of this region is not sex-linked in natural populations from west Kenya. Previous work suggests possible degeneration of the Z chromosomes in the east population because many sex-linked transcripts of this female heterogametic population have female-biased expression, and we therefore expected this chromosome to not be present in the west population. In contrast, our simulations support a model where most or all of the sex-linked portion of the Z chromosome from the east acquired autosomal segregation in the west, and where much genetic variation specific to the large sex-linked portion of the W chromosome from the east is not present in the west. These recent changes are consistent with the hot-potato model, wherein sex chromosome turnover is favoured by natural selection if it purges a (minimally) degenerate sex-specific sex chromosome, but counterintuitively suggest natural selection failed to purge a Z chromosome that has signs of more advanced and possibly more ancient regulatory degeneration. These findings highlight complex evolutionary dynamics of young, rapidly evolving Xenopus sex chromosomes and set the stage for mechanistic work aimed at pinpointing additional sex-determining genes in this group.
Affiliation(s)
- Ben J Evans
  - Biology Department, Life Sciences Building Room 328, McMaster University, Hamilton, Ontario, Canada
- Austin B Mudd
  - Department of Molecular and Cell Biology, University of California, Berkeley, California, USA
- Jessen V Bredeson
  - Department of Molecular and Cell Biology, University of California, Berkeley, California, USA
- Benjamin L S Furman
  - Biology Department, Life Sciences Building Room 328, McMaster University, Hamilton, Ontario, Canada
  - Canexia Health, Vancouver, British Columbia, Canada
- Jessica B Lyons
  - Department of Molecular and Cell Biology, University of California, Berkeley, California, USA
- Richard M Harland
  - Department of Molecular and Cell Biology, University of California, Berkeley, California, USA
- Dan S Rokhsar
  - Department of Molecular and Cell Biology, University of California, Berkeley, California, USA
  - Okinawa Institute of Science and Technology Graduate University, Onna, Japan
  - Chan-Zuckerberg BioHub, San Francisco, California, USA
|
19
|
Zee A, Deng DZQ, Adams M, Schimke KD, Corbett-Detig R, Russell SL, Zhang X, Schmitz RJ, Vollmers C. Sequencing Illumina libraries at high accuracy on the ONT MinION using R2C2. Genome Res 2022; 32:2092-2106. [PMID: 36351772 PMCID: PMC9808628 DOI: 10.1101/gr.277031.122] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 06/15/2022] [Accepted: 10/21/2022] [Indexed: 11/11/2022]
Abstract
High-throughput short-read sequencing has taken on a central role in research and diagnostics. Hundreds of different assays take advantage of Illumina short-read sequencers, the predominant short-read sequencing technology available today. Although other short-read sequencing technologies exist, the ubiquity of Illumina sequencers in sequencing core facilities and the high capital costs of these technologies have limited their adoption. Among a new generation of sequencing technologies, Oxford Nanopore Technologies (ONT) holds a unique position because the ONT MinION, an error-prone long-read sequencer, is associated with little to no capital cost. Here we show that we can make short-read Illumina libraries compatible with the ONT MinION by using the rolling circle to concatemeric consensus (R2C2) method to circularize and amplify the short library molecules. This results in longer DNA molecules containing tandem repeats of the original short library molecules. This longer DNA is ideally suited for the ONT MinION, and after sequencing, the tandem repeats in the resulting raw reads can be converted into high-accuracy consensus reads with similar error rates to that of the Illumina MiSeq. We highlight this capability by producing and benchmarking RNA-seq, ChIP-seq, and regular and target-enriched Tn5 libraries. We also explore the use of this approach for rapid evaluation of sequencing library metrics by implementing a real-time analysis workflow.
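The consensus idea in this abstract (several error-containing copies of one short molecule collapsed into a single accurate read) can be illustrated with a toy sketch. It assumes the raw read has already been split into equal-length, pre-aligned copies of the insert, and it uses a plain per-position majority vote; this is a simplification for illustration, not the published R2C2 pipeline, which relies on alignment-based consensus calling. The function name `majority_consensus` and the example sequences are hypothetical.

```python
from collections import Counter


def majority_consensus(repeats):
    """Return the per-position majority base across pre-aligned repeat copies.

    Assumption (not from the paper): all copies have equal length and are
    already aligned column by column.
    """
    if not repeats:
        raise ValueError("need at least one repeat copy")
    length = len(repeats[0])
    if any(len(r) != length for r in repeats):
        raise ValueError("this toy sketch assumes equal-length, pre-aligned copies")
    consensus = []
    for column in zip(*repeats):
        # most_common(1) yields the base observed most often in this column
        base, _count = Counter(column).most_common(1)[0]
        consensus.append(base)
    return "".join(consensus)


# Three hypothetical copies of the same insert, each with at most one error
copies = ["ACGTACGT", "ACGAACGT", "ACGTACCT"]
consensus = majority_consensus(copies)
```

With more copies per raw read, isolated errors are outvoted more reliably, which is the intuition behind converting tandem repeats into higher-accuracy consensus reads.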
Affiliation(s)
- Alexander Zee
  - Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
- Dori Z Q Deng
  - Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
- Matthew Adams
  - Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
- Kayla D Schimke
  - Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
- Russell Corbett-Detig
  - Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
- Shelbi L Russell
  - Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
- Xuan Zhang
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Robert J Schmitz
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Christopher Vollmers
  - Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
|
20
|
Chromosome-scale genome assembly of the brown anole (Anolis sagrei), an emerging model species. Commun Biol 2022; 5:1126. [PMID: 36284162 PMCID: PMC9596491 DOI: 10.1038/s42003-022-04074-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 11/27/2021] [Accepted: 10/06/2022] [Indexed: 12/12/2022] Open
Abstract
Rapid technological improvements are democratizing access to high quality, chromosome-scale genome assemblies. No longer the domain of only the most highly studied model organisms, now non-traditional and emerging model species can be genome-enabled using a combination of sequencing technologies and assembly software. Consequently, old ideas built on sparse sampling across the tree of life have recently been amended in the face of genomic data drawn from a growing number of high-quality reference genomes. Arguably the most valuable are those long-studied species for which much is already known about their biology; what many term emerging model species. Here, we report a highly complete chromosome-scale genome assembly for the brown anole, Anolis sagrei – a lizard species widely studied across a variety of disciplines and for which a high-quality reference genome was long overdue. This assembly exceeds the vast majority of existing reptile and snake genomes in contiguity (N50 = 253.6 Mb) and annotation completeness. Through the analysis of this genome and population resequence data, we examine the history of repetitive element accumulation, identify the X chromosome, and propose a hypothesis for the evolutionary history of fusions between autosomes and the X that led to the sex chromosomes of A. sagrei. A highly-complete chromosome-scale genome assembly of the brown anole, Anolis sagrei, provides insight into the evolution of sex chromosomes and is a crucial resource for this model lizard species.
|
21
|
Abstract
The platyrrhine family Cebidae (capuchin and squirrel monkeys) exhibits among the largest primate encephalization quotients. Each cebid lineage is also characterized by notable lineage-specific traits, with capuchins showing striking similarities to Hominidae such as high sensorimotor intelligence with tool use, advanced cognitive abilities, and behavioral flexibility. Here, we take a comparative genomics approach, performing genome-wide tests for positive selection across five cebid branches, to gain insight into major periods of cebid adaptive evolution. We uncover candidate targets of selection across cebid evolutionary history that may underlie the emergence of lineage-specific traits. Our analyses highlight shifting and sustained selective pressures on genes related to brain development, longevity, reproduction, and morphology, including evidence for cumulative and diversifying neurobiological adaptations across cebid evolution. In addition to generating a high-quality reference genome assembly for robust capuchins, our results contribute to a better understanding of the adaptive diversification of this distinctive primate clade.
|
22
|
Giorgashvili E, Reichel K, Caswara C, Kerimov V, Borsch T, Gruenstaeudl M. Software Choice and Sequencing Coverage Can Impact Plastid Genome Assembly-A Case Study in the Narrow Endemic Calligonum bakuense. Front Plant Sci 2022; 13:779830. [PMID: 35874012 PMCID: PMC9296850 DOI: 10.3389/fpls.2022.779830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/19/2021] [Accepted: 06/13/2022] [Indexed: 06/15/2023]
Abstract
Most plastid genome sequences are assembled from short-read whole-genome sequencing data, yet the impact that sequencing coverage and the choice of assembly software can have on the accuracy of the resulting assemblies is poorly understood. In this study, we test the impact of both factors on plastid genome assembly in the threatened and rare endemic shrub Calligonum bakuense. We aim to characterize the differences across plastid genome assemblies generated by different assembly software tools and levels of sequencing coverage and to determine if these differences are large enough to affect the phylogenetic position inferred for C. bakuense compared to congeners. Four assembly software tools (FastPlast, GetOrganelle, IOGA, and NOVOPlasty) and seven levels of sequencing coverage across the plastid genome (original sequencing depth, 2,000x, 1,000x, 500x, 250x, 100x, and 50x) are compared in our analyses. The resulting assemblies are evaluated with regard to reproducibility, contig number, gene complement, inverted repeat length, and computation time; the impact of sequence differences on phylogenetic reconstruction is assessed. Our results show that software choice can have a considerable impact on the accuracy and reproducibility of plastid genome assembly and that GetOrganelle produces the most consistent assemblies for C. bakuense. Moreover, we demonstrate that a sequencing coverage between 500x and 100x can reduce both the sequence variability across assembly contigs and computation time. When comparing the most reliable plastid genome assemblies of C. bakuense, a sequence difference in only three nucleotide positions is detected, which is less than the difference potentially introduced through software choice.
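The coverage levels compared in this study can be related to read counts with simple arithmetic. The following is a minimal sketch, assuming mean coverage is approximated as total sequenced bases divided by genome length and that downsampling keeps a uniform random fraction of reads. The function names and all numbers in the example (read count, read length, plastid genome length) are hypothetical and not taken from the paper.

```python
def mean_coverage(n_reads, read_length, genome_length):
    """Approximate mean coverage as total sequenced bases / genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return n_reads * read_length / genome_length


def subsample_fraction(current_coverage, target_coverage):
    """Fraction of reads to keep to reach the target coverage (capped at 1.0)."""
    if current_coverage <= 0:
        raise ValueError("current_coverage must be positive")
    return min(1.0, target_coverage / current_coverage)


# Hypothetical data set: 2 million plastid reads of 150 bp, 150 kb plastid genome
current = mean_coverage(n_reads=2_000_000, read_length=150, genome_length=150_000)

# Fractions of reads to retain for a series of lower target coverage levels
fractions = {t: subsample_fraction(current, t) for t in (1000, 500, 250, 100, 50)}
```

A fraction computed this way could then be passed to whichever read-subsampling tool is in use; the cap at 1.0 reflects that downsampling cannot raise coverage above what was sequenced.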
Affiliation(s)
- Eka Giorgashvili
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
- Katja Reichel
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
- Calvinna Caswara
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
- Vuqar Kerimov
  - Institute of Botany, Azerbaijan National Academy of Sciences (ANAS), Baku, Azerbaijan
- Thomas Borsch
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
  - Botanischer Garten und Botanisches Museum Berlin, Freie Universität Berlin, Berlin, Germany
- Michael Gruenstaeudl
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
|
23
|
Tarafder S, Islam M, Shatabda S, Rahman A. Figbird: A probabilistic method for filling gaps in genome assemblies. Bioinformatics 2022; 38:3717-3724. [PMID: 35731219 DOI: 10.1093/bioinformatics/btac404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/30/2021] [Revised: 06/12/2022] [Accepted: 06/17/2022] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION: Advances in sequencing technologies have led to the sequencing of genomes of a multitude of organisms. However, draft genomes of many of these organisms contain a large number of gaps due to the repeats in genomes, low sequencing coverage and limitations in sequencing technologies. Although there exist several tools for filling gaps, many of these do not utilize all information relevant to gap filling. RESULTS: Here, we present a probabilistic method for filling gaps in draft genome assemblies using second-generation reads based on a generative model for sequencing that takes into account information on insert sizes and sequencing errors. Our method is based on the expectation-maximization (EM) algorithm, unlike the graph-based methods adopted in the literature. Experiments on real biological datasets show that this novel approach can fill up large portions of gaps with a small number of errors and misassemblies compared to other state-of-the-art gap filling tools. AVAILABILITY AND IMPLEMENTATION: The method is implemented using C++ in a software named "Filling Gaps by Iterative Read Distribution (Figbird)", which is available at: https://github.com/SumitTarafder/Figbird. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Sumit Tarafder
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, 1205, Bangladesh
  - Department of Computer Science and Engineering, United International University, Dhaka, 1212, Bangladesh
- Mazharul Islam
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, 1205, Bangladesh
  - Department of Computer Science and Engineering, United International University, Dhaka, 1212, Bangladesh
- Swakkhar Shatabda
  - Department of Computer Science and Engineering, United International University, Dhaka, 1212, Bangladesh
- Atif Rahman
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, 1205, Bangladesh
|
24
|
Moore EC, Thomas GWC, Mortimer S, Kopania EEK, Hunnicutt KE, Clare-Salzler ZJ, Larson EL, Good JM. The evolution of widespread recombination suppression on the dwarf hamster (Phodopus) X chromosome. Genome Biol Evol 2022; 14:6596369. [PMID: 35642315 PMCID: PMC9185382 DOI: 10.1093/gbe/evac080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 05/25/2022] [Indexed: 11/24/2022] Open
Abstract
The X chromosome of therian mammals shows strong conservation among distantly related species, limiting insights into the distinct selective processes that have shaped sex chromosome evolution. We constructed a chromosome-scale de novo genome assembly for the Siberian dwarf hamster (Phodopus sungorus), a species reported to show extensive recombination suppression across an entire arm of the X chromosome. Combining a physical genome assembly based on shotgun and long-range proximity ligation sequencing with a dense genetic map, we detected widespread suppression of female recombination across ∼65% of the Phodopus X chromosome. This region of suppressed recombination likely corresponds to the Xp arm, which has previously been shown to be highly heterochromatic. Using additional sequencing data from two closely related species (P. campbelli and P. roborovskii), we show that recombination suppression on Xp appears to be independent of major structural rearrangements. The suppressed Xp arm was enriched for several transposable element families and de-enriched for genes primarily expressed in placenta, but otherwise showed similar gene densities, expression patterns, and rates of molecular evolution when compared to the recombinant Xq arm. Phodopus Xp gene content and order were also broadly conserved relative to the more distantly related rat X chromosome. These data suggest that widespread suppression of recombination has likely evolved through the transient induction of facultative heterochromatin on the Phodopus Xp arm without major changes in chromosome structure or genetic content. Thus, substantial changes in the recombination landscape have so far had relatively subtle influences on patterns of X-linked molecular evolution in these species.
Affiliation(s)
- Emily C Moore
  - Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
- Gregg W C Thomas
  - Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
- Sebastian Mortimer
  - Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
- Emily E K Kopania
  - Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
- Kelsie E Hunnicutt
  - Department of Biological Sciences, The University of Denver, Denver, Colorado, 80208, USA
- Erica L Larson
  - Department of Biological Sciences, The University of Denver, Denver, Colorado, 80208, USA
- Jeffrey M Good
  - Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
|
25
|
Friis G, Vizueta J, Ketterson ED, Milá B. A high-quality genome assembly and annotation of the dark-eyed junco Junco hyemalis, a recently diversified songbird. G3 (Bethesda) 2022; 12:jkac083. [PMID: 35404451 PMCID: PMC9157146 DOI: 10.1093/g3journal/jkac083] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/30/2022] [Accepted: 03/31/2022] [Indexed: 11/26/2022]
Abstract
The dark-eyed junco (Junco hyemalis) is one of the most common passerines of North America, and has served as a model organism in studies related to ecophysiology, behavior, and evolutionary biology for over a century. It is composed of at least 6 distinct, geographically structured forms of recent evolutionary origin, presenting remarkable variation in phenotypic traits, migratory behavior, and habitat. Here, we report a high-quality genome assembly and annotation of the dark-eyed junco generated using a combination of shotgun libraries and proximity ligation Chicago and Dovetail Hi-C libraries. The final assembly is ∼1.03 Gb in size, with 98.3% of the sequence located in 30 full or nearly full chromosome scaffolds, and with an N50/L50 of 71.3 Mb/5 scaffolds. We identified 19,026 functional genes combining gene prediction and similarity approaches, of which 15,967 were associated with GO terms. The genome assembly and the set of annotated genes yielded 95.4% and 96.2% completeness scores, respectively, when compared with the BUSCO avian dataset. This new assembly for J. hyemalis provides a valuable resource for genome evolution analysis, and for identifying functional genes involved in adaptive processes and speciation.
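The N50/L50 statistics quoted in this abstract follow a standard definition: sort scaffolds from longest to shortest, accumulate lengths until at least half of the total assembly size is covered, then report the length of the last scaffold added (N50) and the number of scaffolds used (L50). A minimal sketch is given below; the function name `n50_l50` and the toy scaffold lengths are illustrative assumptions, not data from the junco assembly.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of scaffold or contig lengths."""
    if not lengths:
        raise ValueError("need at least one scaffold length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first scaffold that pushes the running sum to half the
        # assembly size defines N50; its rank is L50.
        if running >= half_total:
            return length, count


# Toy assembly (arbitrary units): total = 30, half = 15
toy_lengths = [10, 8, 6, 4, 2]
n50, l50 = n50_l50(toy_lengths)
```

In the toy case the two longest scaffolds (10 + 8 = 18) already cover half of the assembly, so N50 is 8 and L50 is 2. Read the same way, the reported 71.3 Mb/5 scaffolds means the five longest scaffolds cover at least half of the assembly.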
Affiliation(s)
- Guillermo Friis
  - Department of Biodiversity and Evolutionary Biology, National Museum of Natural Sciences, Spanish National Research Council (CSIC), Madrid 28006, Spain
- Joel Vizueta
  - Centre for Social Evolution, University of Copenhagen, Copenhagen 1165, Denmark
- Ellen D Ketterson
  - Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Borja Milá
  - Department of Biodiversity and Evolutionary Biology, National Museum of Natural Sciences, Spanish National Research Council (CSIC), Madrid 28006, Spain
|
26
|
Korchanová Z, Švec M, Janáková E, Lampar A, Majka M, Holušová K, Bonchev G, Juračka J, Cápal P, Valárik M. Identification, High-Density Mapping, and Characterization of New Major Powdery Mildew Resistance Loci From the Emmer Wheat Landrace GZ1. Front Plant Sci 2022; 13:897697. [PMID: 35646009 PMCID: PMC9141293 DOI: 10.3389/fpls.2022.897697] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 03/16/2022] [Accepted: 04/13/2022] [Indexed: 06/15/2023]
Abstract
Powdery mildew is one of the most devastating diseases of wheat which significantly decreases yield and quality. Identification of new sources of resistance and their implementation in breeding programs is the most effective way of disease control. Two major powdery mildew resistance loci conferring resistance to all races in seedling and adult plant stages were identified in the emmer wheat landrace GZ1. Their positions, effects, and transferability were verified using two linkage maps (1,510 codominant SNP markers) constructed from two mapping populations (276 lines in total) based on the resistant GZ1 line. The dominant resistance locus QPm.GZ1-7A was located in a 90 cM interval of chromosome 7AL and explains up to 20% of the trait variation. The recessive locus QPm.GZ1-2A, which provides total resistance, explains up to 40% of the trait variation and was located in the distal part of chromosome 2AL. The locus was saturated with 14 PCR-based markers and delimited to a 0.99 cM region which corresponds to 4.3 Mb of the cv. Zavitan reference genome and comprises 55 predicted genes with no apparent candidate for the QPm.GZ1-2A resistance gene. No recessive resistance gene or allele was located at the locus before, suggesting the presence of a new powdery mildew resistance gene in the GZ1. The mapping data and markers could be used for the implementation of the locus in breeding. Moreover, they are an ideal base for cloning and study of host-pathogen interaction pathways determined by the resistance genes.
Affiliation(s)
- Zuzana Korchanová
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
  - Department of Cell Biology and Genetics, Faculty of Science, Palacký University Olomouc, Olomouc, Czechia
- Miroslav Švec
  - Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
- Eva Janáková
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
- Adam Lampar
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
  - Department of Cell Biology and Genetics, Faculty of Science, Palacký University Olomouc, Olomouc, Czechia
- Maciej Majka
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
  - Institute of Plant Genetics, Polish Academy of Sciences, Poznań, Poland
- Kateřina Holušová
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
- Georgi Bonchev
  - Faculty of Natural Sciences, Comenius University in Bratislava, Bratislava, Slovakia
  - Institute of Plant Physiology and Genetics, Bulgarian Academy of Sciences, Sofia, Bulgaria
- Jakub Juračka
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
  - Department of Computer Science, Faculty of Science, Palacký University Olomouc, Olomouc, Czechia
- Petr Cápal
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
- Miroslav Valárik
  - Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czechia
|
27
|
Palaeogenomic analysis of black rat (Rattus rattus) reveals multiple European introductions associated with human economic history. Nat Commun 2022; 13:2399. [PMID: 35504912 PMCID: PMC9064997 DOI: 10.1038/s41467-022-30009-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 04/27/2021] [Accepted: 03/18/2022] [Indexed: 11/29/2022] Open
Abstract
The distribution of the black rat (Rattus rattus) has been heavily influenced by its association with humans. The dispersal history of this non-native commensal rodent across Europe, however, remains poorly understood, and different introductions may have occurred during the Roman and medieval periods. Here, in order to reconstruct the population history of European black rats, we first generate a de novo genome assembly of the black rat. We then sequence 67 ancient and three modern black rat mitogenomes, and 36 ancient and three modern nuclear genomes from archaeological sites spanning the 1st-17th centuries CE in Europe and North Africa. Analyses of our newly reported sequences, together with published mitochondrial DNA sequences, confirm that black rats were introduced into the Mediterranean and Europe from Southwest Asia. Genomic analyses of the ancient rats reveal a population turnover in temperate Europe between the 6th and 10th centuries CE, coincident with an archaeologically attested decline in the black rat population. The near disappearance and re-emergence of black rats in Europe may have been the result of the breakdown of the Roman Empire, the First Plague Pandemic, and/or post-Roman climatic cooling. ‘Archaeogenetic analysis of black rat remains reveals that this species was introduced into temperate Europe twice, in the Roman and medieval periods. This population turnover was likely associated with multiple historical and environmental factors.’
|
28
|
Zhou Y, Liu M, Yang J. Recovering metagenome-assembled genomes from shotgun metagenomic sequencing data: methods, applications, challenges, and opportunities. Microbiol Res 2022; 260:127023. [DOI: 10.1016/j.micres.2022.127023] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 11/18/2021] [Revised: 03/07/2022] [Accepted: 04/05/2022] [Indexed: 12/12/2022]
|
29
|
Ventolero MF, Wang S, Hu H, Li X. Computational analyses of bacterial strains from shotgun reads. Brief Bioinform 2022; 23:6524011. [PMID: 35136954 DOI: 10.1093/bib/bbac013] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 11/04/2021] [Revised: 01/10/2022] [Accepted: 01/11/2022] [Indexed: 12/21/2022] Open
Abstract
Shotgun sequencing is routinely employed to study bacteria in microbial communities. With the vast amount of shotgun sequencing reads generated in a metagenomic project, it is crucial to determine the microbial composition at the strain level. This study investigated 20 computational tools that attempt to infer bacterial strain genomes from shotgun reads. For the first time, we discussed the methodology behind these tools. We also systematically evaluated six novel-strain-targeting tools on the same datasets and found that BHap, mixtureS and StrainFinder performed better than other tools. Because the performance of the best tools is still suboptimal, we discussed future directions that may address the limitations.
Affiliation(s)
- Saidi Wang
  - Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
- Haiyan Hu
  - Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
  - Genomics and Bioinformatics Cluster, University of Central Florida, Orlando, FL 32816, USA
- Xiaoman Li
  - Burnett School of Biomedical Science, University of Central Florida, Orlando, FL 32816, USA
|
30
|
Koochekian N, Ascanio A, Farleigh K, Card DC, Schield DR, Castoe TA, Jezkova T. A chromosome-level genome assembly and annotation of the desert horned lizard, Phrynosoma platyrhinos, provides insight into chromosomal rearrangements among reptiles. Gigascience 2022; 11:6521878. [PMID: 35134927 PMCID: PMC8848323 DOI: 10.1093/gigascience/giab098] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 02/05/2021] [Revised: 09/27/2021] [Accepted: 12/15/2021] [Indexed: 11/13/2022]
Abstract
BACKGROUND The increasing number of chromosome-level genome assemblies has advanced our knowledge and understanding of macroevolutionary processes. Here, we introduce the genome of the desert horned lizard, Phrynosoma platyrhinos, an iguanid lizard occupying extreme desert conditions of the American southwest. We conduct analysis of the chromosomal structure and composition of this species and compare these features across genomes of 12 other reptiles (5 species of lizards, 3 snakes, 3 turtles, and 1 bird). FINDINGS The desert horned lizard genome was sequenced using Illumina paired-end reads and assembled and scaffolded using Dovetail Genomics Hi-C and Chicago long-range contact data. The resulting genome assembly has a total length of 1,901.85 Mb, scaffold N50 length of 273.213 Mb, and includes 5,294 scaffolds. The chromosome-level assembly is composed of 6 macrochromosomes and 11 microchromosomes. A total of 20,764 genes were annotated in the assembly. GC content and gene density are higher for microchromosomes than macrochromosomes, while repeat element distributions show the opposite trend. Pathway analyses provide preliminary evidence that microchromosome and macrochromosome gene content are functionally distinct. Synteny analysis indicates that large microchromosome blocks are conserved among closely related species, whereas macrochromosomes show evidence of frequent fusion and fission events among reptiles, even between closely related species. CONCLUSIONS Our results demonstrate dynamic karyotypic evolution across Reptilia, with frequent inferred splits, fusions, and rearrangements that have resulted in shuffling of chromosomal blocks between macrochromosomes and microchromosomes. Our analyses also provide new evidence for distinct gene content and chromosomal structure between microchromosomes and macrochromosomes within reptiles.
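For readers unfamiliar with the scaffold N50 statistic quoted in the abstract above, the following is a minimal Python sketch of how it is conventionally computed: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the total assembly size. The function name `n50` and the toy lengths are illustrative assumptions; they are not taken from the cited paper or its pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the summed length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical toy assembly (arbitrary units), not real data.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # prints 70: 80 + 70 = 150, half of the 300 total
```

With a real assembly, the input would be the list of scaffold (or contig) lengths, which is why scaffold N50 and contig N50 can differ widely for the same genome.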
Affiliation(s)
- Alfredo Ascanio
  - Department of Biology, Miami University, Oxford, OH 45056, USA
- Keaka Farleigh
  - Department of Biology, Miami University, Oxford, OH 45056, USA
- Daren C Card
  - Department of Organismic & Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
  - Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
- Drew R Schield
  - Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO 80309, USA
- Todd A Castoe
  - Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
- Tereza Jezkova
  - Department of Biology, Miami University, Oxford, OH 45056, USA
|
31
|
A comparative genomics examination of desiccation tolerance and sensitivity in two sister grass species. Proc Natl Acad Sci U S A 2022; 119:2118886119. [PMID: 35082155 PMCID: PMC8812550 DOI: 10.1073/pnas.2118886119] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Accepted: 12/14/2021] [Indexed: 12/13/2022]
Abstract
This is a significant sister group contrast comparative study of the underpinning genomics and evolution of desiccation tolerance (DT), a critical trait in the evolution of land plants. Our results revealed that the DT grass Sporobolus stapfianus is transcriptionally primed to tolerate a dehydration/desiccation event and that the desiccation response in the DT S. stapfianus is distinct from the water stress response of the desiccation-sensitive Sporobolus pyramidalis. Our results also show that the desiccation response is largely unique, indicating a recent evolution of this trait within the angiosperms, and that inhibition of senescence during dehydration is likely critical in rendering a plant desiccation tolerant. Desiccation tolerance is an ancient and complex trait that spans all major lineages of life on earth. Although important in the evolution of land plants, the mechanisms that underlay this complex trait are poorly understood, especially for vegetative desiccation tolerance (VDT). The lack of suitable closely related plant models that offer a direct contrast between desiccation tolerance and sensitivity has hampered progress. We have assembled high-quality genomes for two closely related grasses, the desiccation-tolerant Sporobolus stapfianus and the desiccation-sensitive Sporobolus pyramidalis. Both species are complex polyploids; S. stapfianus is primarily tetraploid, and S. pyramidalis is primarily hexaploid. S. pyramidalis undergoes a major transcriptome remodeling event during initial exposure to dehydration, while S. stapfianus has a muted early response, with peak remodeling during the transition between 1.5 and 1.0 grams of water (gH2O) g−1 dry weight (dw). Functionally, the dehydration transcriptome of S. stapfianus is unrelated to that for S. pyramidalis. A comparative analysis of the transcriptomes of the hydrated controls for each species indicated that S. stapfianus is transcriptionally primed for desiccation. 
Cross-species comparative analyses indicated that VDT likely evolved from reprogramming of desiccation tolerance mechanisms that evolved in seeds and that the tolerance mechanism of S. stapfianus represents a recent evolution for VDT within the Chloridoideae. Orthogroup analyses of the significantly differentially abundant transcripts reconfirmed our present understanding of the response to dehydration, including the lack of an induction of senescence in resurrection angiosperms. The data also suggest that failure to maintain protein structure during dehydration is likely critical in rendering a plant desiccation sensitive.
|
32
|
Valenza‐Troubat N, Davy M, Storey R, Wylie MJ, Hilario E, Ritchie P, Wellenreuther M. Differential expression analyses reveal extensive transcriptional plasticity induced by temperature in New Zealand silver trevally (Pseudocaranx georgianus). Evol Appl 2022; 15:237-248. [PMID: 35233245 PMCID: PMC8867707 DOI: 10.1111/eva.13332] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 07/07/2021] [Revised: 10/26/2021] [Accepted: 10/29/2021] [Indexed: 12/01/2022]
Abstract
Ectotherm species, such as marine fishes, depend on environmental temperature to regulate their vital functions. In finfish aquaculture production, being able to predict physiological responses in growth and other economic traits to temperature is crucial to address challenges inherent in the selection of grow-out locations. This will become an even more significant issue under the various predicted future climate change scenarios. In this study, we used the marine teleost silver trevally (Pseudocaranx georgianus), a species currently being explored as a candidate for aquaculture in New Zealand, as a model to study plasticity in gene expression patterns and growth in response to different temperatures. Using a captive study population, temperature conditions were experimentally manipulated for 1 month to mimic seasonal extremes. Phenotypic differences in growth were measured in 400 individuals, and gene expression patterns of pituitary gland and liver were determined in a subset of 100 individuals. Results showed that growth increased 50% in the warmer compared with the colder condition, suggesting that temperature has a large impact on metabolic activities associated with growth. A total of 265,116,678 single-end RNA sequence reads were aligned to the trevally genome, and 28,416 transcript models were developed (27,887 of these had GenBank accessions, and 17,980 unique gene symbols). Further filtering reduced this set to 8597 gene models. 39 and 238 differentially expressed genes (DEGs) were found in the pituitary gland and the liver, respectively (|log2FC| > 0.26, p-value < 0.05). Of these, 6 DEGs showed a common expression pattern between both tissues, all involved in housekeeping functions. Temperature-modulated growth responses were linked to major pathways affecting metabolism, cell regulation and signalling, previously shown to be important for temperature tolerance in other fish species. 
An interesting finding of this study was that genes linked to the reproductive system were up-regulated in both tissues in the high treatment, indicating the onset of sexual maturation. Few studies have investigated the thermal plasticity of the gene expression in the main organs of the somatotropic axis simultaneously. Our findings indicate that trevally exhibit substantial growth differences and predictable plastic regulatory responses to different temperature conditions. We identified a set of genes that provide a list of candidates for further investigations for selective breeding objectives and how populations may adapt to increasing temperatures.
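The differential-expression cutoff stated in the abstract above (|log2FC| > 0.26 and p-value < 0.05) can be illustrated with a short Python sketch. The record type, function names, and gene entries below are hypothetical and exist only for illustration; the thresholds are the only values taken from the abstract, and this is not the authors' analysis code.

```python
from dataclasses import dataclass


@dataclass
class GeneResult:
    gene: str
    log2_fold_change: float
    p_value: float


def is_differentially_expressed(result, min_abs_log2fc=0.26, max_p=0.05):
    """Apply both cutoffs: effect size (either direction) and significance."""
    return abs(result.log2_fold_change) > min_abs_log2fc and result.p_value < max_p


def filter_degs(results, min_abs_log2fc=0.26, max_p=0.05):
    """Keep only the records that pass both thresholds."""
    return [r for r in results
            if is_differentially_expressed(r, min_abs_log2fc, max_p)]


# Hypothetical records; gene names and values are made up.
toy_results = [
    GeneResult("geneA", 0.90, 0.001),   # passes both cutoffs
    GeneResult("geneB", -0.40, 0.030),  # passes (down-regulated)
    GeneResult("geneC", 0.10, 0.001),   # fold change too small
    GeneResult("geneD", 1.50, 0.200),   # not significant
]
print([r.gene for r in filter_degs(toy_results)])  # prints ['geneA', 'geneB']
```

A |log2FC| of 0.26 corresponds to roughly a 1.2-fold change in either direction, so this cutoff is permissive on effect size and relies on the p-value to limit false positives.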
Affiliation(s)
- Marcus Davy
  - The New Zealand Institute for Plant and Food Research Limited, Te Puke, New Zealand
- Roy Storey
  - The New Zealand Institute for Plant and Food Research Limited, Te Puke, New Zealand
- Matthew J. Wylie
  - The New Zealand Institute for Plant and Food Research Limited, Nelson, New Zealand
- Elena Hilario
  - The New Zealand Institute for Plant and Food Research Limited, Te Puke, New Zealand
- Peter Ritchie
  - School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
- Maren Wellenreuther
  - The New Zealand Institute for Plant and Food Research Limited, Nelson, New Zealand
  - School of Biological Sciences, University of Auckland, Auckland, New Zealand
|
33
|
Rahman A, Pachter L. SWALO: scaffolding with assembly likelihood optimization. Nucleic Acids Res 2021; 49:e117. [PMID: 34417615 PMCID: PMC8599790 DOI: 10.1093/nar/gkab717] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/22/2020] [Revised: 06/16/2021] [Accepted: 08/16/2021] [Indexed: 01/01/2023]
Abstract
Scaffolding, i.e. ordering and orienting contigs, is an important step in genome assembly. We present a method for scaffolding using second generation sequencing reads based on likelihoods of genome assemblies. A generative model for sequencing is used to obtain maximum likelihood estimates of gaps between contigs and to estimate whether linking contigs into scaffolds would lead to an increase in the likelihood of the assembly. We then link contigs if they can be unambiguously joined or if the corresponding increase in likelihood is substantially greater than that of other possible joins of those contigs. The method is implemented in a tool called Swalo with approximations to make it efficient and applicable to large datasets. Analysis on real and simulated datasets reveals that it consistently makes more or a similar number of correct joins compared with other scaffolders while linking very few contigs incorrectly, thus outperforming other scaffolders and demonstrating that substantial improvement in genome assembly may be achieved through the use of statistical models. Swalo is freely available for download at https://atifrahman.github.io/SWALO/.
Affiliation(s)
- Atif Rahman
  - Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh
- Lior Pachter
  - Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA
  - Departments of Mathematics and Molecular & Cell Biology, University of California, Berkeley, CA 94720, USA
  - Departments of Biology and Computing & Mathematical Sciences, California Institute of Technology, Pasadena, CA 91103, USA
|
34
|
Catanach A, Ruigrok M, Bowatte D, Davy M, Storey R, Valenza-Troubat N, López-Girona E, Hilario E, Wylie MJ, Chagné D, Wellenreuther M. The genome of New Zealand trevally (Carangidae: Pseudocaranx georgianus) uncovers a XY sex determination locus. BMC Genomics 2021; 22:785. [PMID: 34727894 PMCID: PMC8561880 DOI: 10.1186/s12864-021-08102-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/10/2021] [Accepted: 10/14/2021] [Indexed: 11/10/2022]
Abstract
BACKGROUND The genetic control of sex determination in teleost species is poorly understood. This is partly because of the diversity of mechanisms that determine sex in this large group of vertebrates, including constitutive genes linked to sex chromosomes, polygenic constitutive mechanisms, environmental factors, hermaphroditism, and unisexuality. Here we use a de novo genome assembly of New Zealand silver trevally (Pseudocaranx georgianus) together with sex-specific whole genome sequencing data to detect sexually divergent genomic regions, identify candidate genes and develop molecular markers. RESULTS The de novo assembly of an unsexed trevally (Trevally_v1) resulted in a final assembly of 579.4 Mb in length, with an N50 of 25.2 Mb. Of the assembled scaffolds, 24 were of chromosome scale, ranging from 11 to 31 Mb in length. A total of 28,416 genes were annotated after 12.8 % of the assembly was masked with repetitive elements. Whole genome re-sequencing of 13 wild sexed trevally (seven males and six females) identified two sexually divergent regions located on two scaffolds, including a 6 kb region at the proximal end of chromosome 21. Blast analyses revealed similarity between one region and the aromatase genes cyp19 (a1a/b) (E-value < 1.00E-25, identity > 78.8 %). Males contained higher numbers of heterozygous variants in both regions, while females showed regions of very low read-depth, indicative of male-specificity of this genomic region. Molecular markers were developed and subsequently tested on 96 histologically-sexed fish (42 males and 54 females). Three markers amplified in absolute correspondence with sex (positive in males, negative in females). CONCLUSIONS The higher number of heterozygous variants in males combined with the absence of these regions in females support an XY sex-determination model, indicating that the trevally_v1 genome assembly was developed from a male specimen. 
This sex system contrasts with the ZW sex-determination model documented in closely related carangid species. Our results indicate a sex-determining function of a cyp19a1a-like gene, suggesting the molecular pathway of sex determination is somewhat conserved in this family. The genomic resources developed here will facilitate future comparative work, and enable improved insights into the varied sex determination pathways in teleosts. The sex marker developed in this study will be a valuable resource for aquaculture selective breeding programmes, and for determining sex ratios in wild populations.
Affiliation(s)
- Andrew Catanach
  - The New Zealand Institute for Plant & Food Research Ltd, Christchurch, New Zealand
- Mike Ruigrok
  - Department of Bioinformatics, University of Applied Sciences Leiden, Leiden, The Netherlands
  - The New Zealand Institute for Plant & Food Research Ltd, Nelson, New Zealand
- Deepa Bowatte
  - The New Zealand Institute for Plant & Food Research Ltd, Palmerston North, New Zealand
- Marcus Davy
  - The New Zealand Institute for Plant & Food Research Ltd, Te Puke, New Zealand
- Roy Storey
  - The New Zealand Institute for Plant & Food Research Ltd, Te Puke, New Zealand
- Elena López-Girona
  - The New Zealand Institute for Plant & Food Research Ltd, Palmerston North, New Zealand
- Elena Hilario
  - The New Zealand Institute for Plant & Food Research Ltd, Auckland, New Zealand
- Matthew J Wylie
  - The New Zealand Institute for Plant & Food Research Ltd, Nelson, New Zealand
- David Chagné
  - The New Zealand Institute for Plant & Food Research Ltd, Palmerston North, New Zealand
- Maren Wellenreuther
  - The New Zealand Institute for Plant & Food Research Ltd, Nelson, New Zealand
  - School of Biological Sciences, The University of Auckland, Auckland, New Zealand
|
|
35
|
Schultz DT, Francis WR, McBroome JD, Christianson LM, Haddock SHD, Green RE. A chromosome-scale genome assembly and karyotype of the ctenophore Hormiphora californensis. G3 (BETHESDA, MD.) 2021; 11:jkab302. [PMID: 34545398 PMCID: PMC8527503 DOI: 10.1093/g3journal/jkab302] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 06/08/2021] [Accepted: 08/18/2021] [Indexed: 11/12/2022]
Abstract
Here, we present a karyotype, a chromosome-scale genome assembly, and a genome annotation from the ctenophore Hormiphora californensis (Ctenophora: Cydippida: Pleurobrachiidae). The assembly spans 110 Mb in 44 scaffolds and 99.47% of the bases are contained in 13 scaffolds. Chromosome micrographs and Hi-C heatmaps support a karyotype of 13 diploid chromosomes. Hi-C data reveal three large heterozygous inversions on chromosome 1, and one heterozygous inversion shares the same gene order found in the genome of the ctenophore Pleurobrachia bachei. We find evidence that H. californensis and P. bachei share thirteen homologous chromosomes, and the same karyotype of 1n = 13. The manually curated PacBio Iso-Seq-based genome annotation reveals complex gene structures, including nested genes and trans-spliced leader sequences. This chromosome-scale assembly is a useful resource for ctenophore biology and will aid future studies of metazoan evolution and phylogenetics.
Affiliation(s)
- Darrin T Schultz
  - Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA 95064, USA
  - Monterey Bay Aquarium Research Institute, Moss Landing, CA 95039, USA
- Warren R Francis
  - Department of Biology, University of Southern Denmark, Odense 5230, Denmark
- Jakob D McBroome
  - Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA 95064, USA
- Steven H D Haddock
  - Monterey Bay Aquarium Research Institute, Moss Landing, CA 95039, USA
  - Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA 95064, USA
- Richard E Green
  - Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA 95064, USA
|
36
|
Kayani MUR, Huang W, Feng R, Chen L. Genome-resolved metagenomics using environmental and clinical samples. Brief Bioinform 2021; 22:bbab030. [PMID: 33758906 PMCID: PMC8425419 DOI: 10.1093/bib/bbab030] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 09/21/2020] [Revised: 11/29/2020] [Accepted: 01/20/2021] [Indexed: 12/25/2022]
Abstract
Recent advances in high-throughput sequencing technologies and computational methods have added a new dimension to metagenomic data analysis i.e. genome-resolved metagenomics. In general terms, it refers to the recovery of draft or high-quality microbial genomes and their taxonomic classification and functional annotation. In recent years, several studies have utilized the genome-resolved metagenome analysis approach and identified previously unknown microbial species from human and environmental metagenomes. In this review, we describe genome-resolved metagenome analysis as a series of four necessary steps: (i) preprocessing of the sequencing reads, (ii) de novo metagenome assembly, (iii) genome binning and (iv) taxonomic and functional analysis of the recovered genomes. For each of these four steps, we discuss the most commonly used tools and the currently available pipelines to guide the scientific community in the recovery and subsequent analyses of genomes from any metagenome sample. Furthermore, we also discuss the tools required for validation of assembly quality as well as for improving quality of the recovered genomes. We also highlight the currently available pipelines that can be used to automate the whole analysis without having advanced bioinformatics knowledge. Finally, we will highlight the most widely adapted and actively maintained tools and pipelines that can be helpful to the scientific community in decision making before they commence the analysis.
Affiliation(s)
- Masood ur Rehman Kayani
  - Center for Microbiota and Immunological Diseases, Shanghai General Hospital, Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 2,000,025, China
- Wanqiu Huang
  - Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 200,000, China
- Ru Feng
  - Center for Microbiota and Immunological Diseases, Shanghai General Hospital, Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 2,000,025, China
- Lei Chen
  - Center for Microbiota and Immunological Diseases, Shanghai General Hospital, Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 2,000,025, China
|
37
|
Thompson AW, Hawkins MB, Parey E, Wcisel DJ, Ota T, Kawasaki K, Funk E, Losilla M, Fitch OE, Pan Q, Feron R, Louis A, Montfort J, Milhes M, Racicot BL, Childs KL, Fontenot Q, Ferrara A, David SR, McCune AR, Dornburg A, Yoder JA, Guiguen Y, Roest Crollius H, Berthelot C, Harris MP, Braasch I. The bowfin genome illuminates the developmental evolution of ray-finned fishes. Nat Genet 2021; 53:1373-1384. [PMID: 34462605 PMCID: PMC8423624 DOI: 10.1038/s41588-021-00914-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Received: 10/13/2020] [Accepted: 07/13/2021] [Indexed: 02/07/2023]
Abstract
The bowfin (Amia calva) is a ray-finned fish that possesses a unique suite of ancestral and derived phenotypes, which are key to understanding vertebrate evolution. The phylogenetic position of bowfin as a representative of neopterygian fishes, its archetypical body plan and its unduplicated and slowly evolving genome make bowfin a central species for the genomic exploration of ray-finned fishes. Here we present a chromosome-level genome assembly for bowfin that enables gene-order analyses, settling long-debated neopterygian phylogenetic relationships. We examine chromatin accessibility and gene expression through bowfin development to investigate the evolution of immune, scale, respiratory and fin skeletal systems and identify hundreds of gene-regulatory loci conserved across vertebrates. These resources connect developmental evolution among bony fishes, further highlighting the bowfin's importance for illuminating vertebrate biology and diversity in the genomic era.
Affiliation(s)
- Andrew W Thompson
  - Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
  - Ecology, Evolution & Behavior Program, Michigan State University, East Lansing, MI, USA
- M Brent Hawkins
  - Department of Genetics, Harvard Medical School, Boston, MA, USA
  - Department of Orthopedic Research, Boston Children's Hospital, Boston, MA, USA
  - Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
  - Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Elise Parey
  - Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- Dustin J Wcisel
  - Department of Molecular Biomedical Sciences, NC State University, Raleigh, NC, USA
- Tatsuya Ota
  - Department of Evolutionary Studies of Biosystems, SOKENDAI (the Graduate University for Advanced Studies), Hayama, Japan
- Kazuhiko Kawasaki
  - Department of Anthropology, Pennsylvania State University, University Park, PA, USA
- Emily Funk
  - Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, USA
  - Animal Science Department, University of California Davis, Davis, CA, USA
- Mauricio Losilla
  - Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
  - Ecology, Evolution & Behavior Program, Michigan State University, East Lansing, MI, USA
- Olivia E Fitch
  - Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
  - Ecology, Evolution & Behavior Program, Michigan State University, East Lansing, MI, USA
- Qiaowei Pan
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Romain Feron
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
  - Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Alexandra Louis
  - Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- Marine Milhes
  - GeT-PlaGe, INRAE, Genotoul, Castanet-Tolosan, France
- Brett L Racicot
  - Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
- Kevin L Childs
  - Department of Plant Biology, Michigan State University, East Lansing, MI, USA
- Quenton Fontenot
  - Department of Biological Sciences, Nicholls State University, Thibodaux, LA, USA
- Allyse Ferrara
  - Department of Biological Sciences, Nicholls State University, Thibodaux, LA, USA
- Solomon R David
  - Department of Biological Sciences, Nicholls State University, Thibodaux, LA, USA
- Amy R McCune
  - Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, USA
- Alex Dornburg
  - Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, USA
- Jeffrey A Yoder
  - Department of Molecular Biomedical Sciences, NC State University, Raleigh, NC, USA
  - Comparative Medicine Institute, NC State University, Raleigh, NC, USA
  - Center for Human Health and the Environment, NC State University, Raleigh, NC, USA
- Hugues Roest Crollius
  - Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- Camille Berthelot
  - Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- Matthew P Harris
  - Department of Genetics, Harvard Medical School, Boston, MA, USA
  - Department of Orthopedic Research, Boston Children's Hospital, Boston, MA, USA
- Ingo Braasch
  - Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
  - Ecology, Evolution & Behavior Program, Michigan State University, East Lansing, MI, USA
|
38
|
Ayling M, Clark MD, Leggett RM. New approaches for metagenome assembly with short reads. Brief Bioinform 2021; 21:584-594. [PMID: 30815668 PMCID: PMC7299287 DOI: 10.1093/bib/bbz020] [Citation(s) in RCA: 100] [Impact Index Per Article: 33.3] [Received: 11/12/2018] [Revised: 01/31/2019] [Accepted: 02/01/2019] [Indexed: 02/07/2023]
Abstract
In recent years, the use of longer range read data combined with advances in assembly algorithms has stimulated big improvements in the contiguity and quality of genome assemblies. However, these advances have not directly transferred to metagenomic data sets, as assumptions made by the single genome assembly algorithms do not apply when assembling multiple genomes at varying levels of abundance. The development of dedicated assemblers for metagenomic data was a relatively late innovation and for many years, researchers had to make do using tools designed for single genomes. This has changed in the last few years and we have seen the emergence of a new type of tool built using different principles. In this review, we describe the challenges inherent in metagenomic assemblies and compare the different approaches taken by these novel assembly tools.
Affiliation(s)
- Martin Ayling
  - Earlham Institute, Norwich Research Park, Norwich, UK
|
39
|
Undin M, Lockhart PJ, Hills SFK, Armstrong DP, Castro I. Mixed Mating in a Multi-Origin Population Suggests High Potential for Genetic Rescue in North Island Brown Kiwi, Apteryx mantelli. FRONTIERS IN CONSERVATION SCIENCE 2021. [DOI: 10.3389/fcosc.2021.702128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/13/2022]
Abstract
Reinforcement translocations are increasingly utilised in conservation with the goal of achieving genetic rescue. However, concerns regarding undesirable results, such as genetic homogenisation or replacement, are widespread. One factor influencing translocation outcomes is the rate at which the resident and the introduced individuals interbreed. Consequently, post-release mate choice is a key behaviour to consider in conservation planning. Here we studied mating, and its consequences for genomic admixture, in the North Island brown kiwi Apteryx mantelli population on Ponui Island which was founded by two translocation events over 50 years ago. The two source populations used are now recognised as belonging to two separate management units between which birds differ in size and are genetically differentiated. We examined the correlation between male and female morphometrics for 17 known pairs and quantified the relatedness of 20 pairs from this admixed population. In addition, we compared the genetic similarity and makeup of 106 Ponui Island birds, including 23 known pairs, to birds representing the source populations for the original translocations. We found no evidence for size-assortative mating. On the contrary, genomic SNP data suggested that kiwi of one feather did not flock together, meaning that mate choice resulted in pairing between individuals that were less related than expected by random chance. Furthermore, the birds in the current Ponui Island population were found to fall along a gradient of genomic composition consistent with non-clustered representation of the two parental genomes. These findings indicate potential for successful genetic rescue in future Apteryx reinforcement translocations, a potential that is currently underutilised due to restrictive translocation policies. In light of our findings, we suggest that reconsideration of these policies could render great benefits for the future diversity of this iconic genus in New Zealand.
|
40
|
Saremi NF, Oppenheimer J, Vollmers C, O'Connell B, Milne SA, Byrne A, Yu L, Ryder OA, Green RE, Shapiro B. An Annotated Draft Genome for the Andean Bear, Tremarctos ornatus. J Hered 2021; 112:377-384. [PMID: 33882130 PMCID: PMC8280923 DOI: 10.1093/jhered/esab021] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 02/02/2021] [Accepted: 04/20/2021] [Indexed: 12/18/2022]
Abstract
The Andean bear is the only extant member of the Tremarctine subfamily and the only extant ursid species to inhabit South America. Here, we present an annotated de novo assembly of a nuclear genome from a captive-born female Andean bear, Mischief, generated using a combination of short and long DNA and RNA reads. Our final assembly has a length of 2.23 Gb, and a scaffold N50 of 21.12 Mb, contig N50 of 23.5 kb, and BUSCO score of 88%. The Andean bear genome will be a useful resource for exploring the complex phylogenetic history of extinct and extant bear species and for future population genetics studies of Andean bears.
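The scaffold and contig N50 values quoted in the abstract above summarise assembly contiguity: N50 is the length L such that sequences of length L or longer together hold at least half of the assembled bases. A minimal Python sketch of that standard definition follows; the function name `n50` and the toy lengths are illustrative assumptions, not data or code from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:  # integer test avoids float rounding
            return length


# Hypothetical toy scaffold lengths (arbitrary units), not from the paper.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # prints 70
```

The same function applies to contigs or scaffolds; only the input list of lengths changes.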
Affiliation(s)
- Nedda F Saremi
- Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA
| | - Jonas Oppenheimer
- Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA
| | - Christopher Vollmers
- Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA
| | - Brendan O'Connell
- Department of Medical and Molecular Genetics, Oregon Health & Science University, Portland, OR
| | - Shard A Milne
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA
| | - Ashley Byrne
- Department of Molecular, Cellular, Developmental Biology, University of California Santa Cruz, Santa Cruz, CA
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, School of Life Sciences, Yunnan University, Kunming, China
| | | | - Richard E Green
- Department of Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA.,Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, CA
|
41
|
Said M, Holušová K, Farkas A, Ivanizs L, Gaál E, Cápal P, Abrouk M, Martis-Thiele MM, Kalapos B, Bartoš J, Friebe B, Doležel J, Molnár I. Development of DNA Markers From Physically Mapped Loci in Aegilops comosa and Aegilops umbellulata Using Single-Gene FISH and Chromosome Sequences. FRONTIERS IN PLANT SCIENCE 2021; 12:689031. [PMID: 34211490 PMCID: PMC8240756 DOI: 10.3389/fpls.2021.689031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/31/2021] [Accepted: 05/19/2021] [Indexed: 05/31/2023]
Abstract
Breeding of agricultural crops adapted to climate change and resistant to diseases and pests is hindered by a limited gene pool because of domestication and thousands of years of human selection. One way to increase genetic variation is chromosome-mediated gene transfer from wild relatives by cross hybridization. In the case of wheat (Triticum aestivum), the species of genus Aegilops are a particularly attractive source of new genes and alleles. However, during the evolution of the Aegilops and Triticum genera, diversification of the D-genome lineage resulted in the formation of diploid C, M, and U genomes of Aegilops. The extent of structural genome alterations, which accompanied their evolution and speciation, and the shortage of molecular tools to detect Aegilops chromatin hamper gene transfer into wheat. To investigate the chromosome structure and help develop molecular markers with a known physical position that could improve the efficiency of the selection of desired introgressions, we developed single-gene fluorescence in situ hybridization (FISH) maps for M- and U-genome progenitors, Aegilops comosa and Aegilops umbellulata, respectively. Forty-three ortholog genes were located on 47 loci in Ae. comosa and on 52 loci in Ae. umbellulata using wheat cDNA probes. The results obtained showed that M-genome chromosomes preserved collinearity with those of wheat, excluding 2 and 6M containing an intrachromosomal rearrangement and paracentric inversion of 6ML, respectively. While Ae. umbellulata chromosomes 1, 3, and 5U maintained collinearity with wheat, structural reorganizations in 2, 4, 6, and 7U suggested a similarity with the C genome of Aegilops markgrafii. To develop molecular markers with exact physical positions on chromosomes of Aegilops, the single-gene FISH data were validated in silico using DNA sequence assemblies from flow-sorted M- and U-genome chromosomes. 
The sequence similarity search of cDNA sequences confirmed 44 out of the 47 single-gene loci in Ae. comosa and 40 of the 52 map positions in Ae. umbellulata. The polymorphic regions thus identified enabled the development of molecular markers, which were PCR validated using wheat-Aegilops disomic chromosome addition lines. The single-gene FISH-based approach allowed the development of PCR markers specific for cytogenetically mapped positions on Aegilops chromosomes, substituting for an as yet unavailable segregating map. The new knowledge and resources will support the efforts for the introgression of Aegilops genes into wheat and their cloning.
Affiliation(s)
- Mahmoud Said
- Institute of Experimental Botany of the Czech Academy of Sciences, Center of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
- Agricultural Research Centre, Field Crops Research Institute, Cairo, Egypt
| | - Katerina Holušová
- Institute of Experimental Botany of the Czech Academy of Sciences, Center of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
| | - András Farkas
- ELKH Centre for Agricultural Research, Agricultural Institute, Martonvásár, Hungary
| | - László Ivanizs
- ELKH Centre for Agricultural Research, Agricultural Institute, Martonvásár, Hungary
| | - Eszter Gaál
- ELKH Centre for Agricultural Research, Agricultural Institute, Martonvásár, Hungary
| | - Petr Cápal
- Institute of Experimental Botany of the Czech Academy of Sciences, Center of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
| | - Michael Abrouk
- Biological and Environmental Science and Engineering Division, Center for Desert Agriculture, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Mihaela M. Martis-Thiele
- NBIS (National Bioinformatics Infrastructure Sweden, Science for Life Laboratory), Division of Cell Biology, Department of Clinical and Experimental Medicine, Faculty of Medicine and Health Sciences, Linköping University, Linköping, Sweden
| | - Balázs Kalapos
- ELKH Centre for Agricultural Research, Agricultural Institute, Martonvásár, Hungary
| | - Jan Bartoš
- Institute of Experimental Botany of the Czech Academy of Sciences, Center of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
| | - Bernd Friebe
- Wheat Genetics Resource Center, Kansas State University, Manhattan, KS, United States
| | - Jaroslav Doležel
- Institute of Experimental Botany of the Czech Academy of Sciences, Center of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
| | - István Molnár
- Institute of Experimental Botany of the Czech Academy of Sciences, Center of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czechia
- ELKH Centre for Agricultural Research, Agricultural Institute, Martonvásár, Hungary
|
42
|
Polinski JM, Zimin AV, Clark KF, Kohn AB, Sadowski N, Timp W, Ptitsyn A, Khanna P, Romanova DY, Williams P, Greenwood SJ, Moroz LL, Walt DR, Bodnar AG. The American lobster genome reveals insights on longevity, neural, and immune adaptations. SCIENCE ADVANCES 2021; 7:7/26/eabe8290. [PMID: 34162536 PMCID: PMC8221624 DOI: 10.1126/sciadv.abe8290] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Received: 09/22/2020] [Accepted: 05/07/2021] [Indexed: 05/30/2023]
Abstract
The American lobster, Homarus americanus, is integral to marine ecosystems and supports an important commercial fishery. This iconic species also serves as a valuable model for deciphering neural networks controlling rhythmic motor patterns and olfaction. Here, we report a high-quality draft assembly of the H. americanus genome with 25,284 predicted gene models. Analysis of the neural gene complement revealed extraordinary development of the chemosensory machinery, including a profound diversification of ligand-gated ion channels and secretory molecules. The discovery of a novel class of chimeric receptors coupling pattern recognition and neurotransmitter binding suggests a deep integration between the neural and immune systems. A robust repertoire of genes involved in innate immunity, genome stability, cell survival, chemical defense, and cuticle formation represents a diversity of defense mechanisms essential to thrive in the benthic marine environment. Together, these unique evolutionary adaptations contribute to the longevity and ecological success of this long-lived benthic predator.
Affiliation(s)
| | - Aleksey V Zimin
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21205, USA
| | - K Fraser Clark
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, Nova Scotia B2N 5E3, Canada
| | - Andrea B Kohn
- The Whitney Laboratory for Marine Bioscience and Department of Neuroscience, University of Florida, Gainesville and St. Augustine, FL 32080-8623, USA
| | - Norah Sadowski
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Winston Timp
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21205, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Andrey Ptitsyn
- Gloucester Marine Genomics Institute, Gloucester, MA 01930, USA
| | - Prarthana Khanna
- Genetics Program, Tufts University School of Medicine, Boston, MA 02111, USA
| | - Daria Y Romanova
- Institute of Higher Nervous Activity and Neurophysiology of RAS, Moscow 117485, Russia
| | - Peter Williams
- The Whitney Laboratory for Marine Bioscience and Department of Neuroscience, University of Florida, Gainesville and St. Augustine, FL 32080-8623, USA
| | - Spencer J Greenwood
- Department of Biomedical Sciences, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, Prince Edward Island C1A 4P3, Canada
| | - Leonid L Moroz
- The Whitney Laboratory for Marine Bioscience and Department of Neuroscience, University of Florida, Gainesville and St. Augustine, FL 32080-8623, USA
| | - David R Walt
- Gloucester Marine Genomics Institute, Gloucester, MA 01930, USA
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Wyss Institute for Biologically Inspired Engineering at Harvard University, Boston, MA 02115, USA
| | - Andrea G Bodnar
- Gloucester Marine Genomics Institute, Gloucester, MA 01930, USA.
|
43
|
Gomes-Dos-Santos A, Lopes-Lima M, Machado AM, Marcos Ramos A, Usié A, Bolotov IN, Vikhrev IV, Breton S, Castro LFC, da Fonseca RR, Geist J, Österling ME, Prié V, Teixeira A, Gan HM, Simakov O, Froufe E. The Crown Pearl: a draft genome assembly of the European freshwater pearl mussel Margaritifera margaritifera (Linnaeus, 1758). DNA Res 2021; 28:6182681. [PMID: 33755103 PMCID: PMC8088596 DOI: 10.1093/dnares/dsab002] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 12/07/2020] [Accepted: 03/22/2021] [Indexed: 11/17/2022] Open
Abstract
Since historical times, the inherent human fascination with pearls turned the freshwater pearl mussel Margaritifera margaritifera (Linnaeus, 1758) into a highly valuable cultural and economic resource. Although pearl harvesting in M. margaritifera is nowadays residual, other human threats have aggravated the species conservation status, especially in Europe. This mussel presents a myriad of rare biological features, e.g. high longevity coupled with low senescence and Doubly Uniparental Inheritance of mitochondrial DNA, for which the underlying molecular mechanisms are poorly known. Here, the first draft genome assembly of M. margaritifera was produced using a combination of Illumina Paired-end and Mate-pair approaches. The genome assembly was 2.4 Gb long, possessing 105,185 scaffolds and a scaffold N50 length of 288,726 bp. The ab initio gene prediction allowed the identification of 35,119 protein-coding genes. This genome represents an essential resource for studying this species’ unique biological and evolutionary features and ultimately will help to develop new tools to promote its conservation.
Affiliation(s)
- André Gomes-Dos-Santos
- CIIMAR/CIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208 Matosinhos, Portugal.,Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
| | - Manuel Lopes-Lima
- CIIMAR/CIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208 Matosinhos, Portugal.,CIBIO/InBIO-Research Center in Biodiversity and Genetic Resources, Universidade do Porto, Campus Agrário de Vairão, Rua Padre Armando Quintas, 4485-661 Vairão, Portugal.,IUCN SSC Mollusc Specialist Group, c/o IUCN, Cambridge, England
| | - André M Machado
- CIIMAR/CIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208 Matosinhos, Portugal
| | - António Marcos Ramos
- Centro de Biotecnologia Agrícola e Agro-alimentar do Alentejo (CEBAL), Instituto Politécnico de Beja (IPBeja), 7801-908 Beja, Portugal.,MED-Mediterranean Institute for Agriculture, Environment and Development, CEBAL-Centro de Biotecnologia Agrícola e Agro-Alimentar do Alentejo, 7801-908 Beja, Portugal
| | - Ana Usié
- Centro de Biotecnologia Agrícola e Agro-alimentar do Alentejo (CEBAL), Instituto Politécnico de Beja (IPBeja), 7801-908 Beja, Portugal.,MED-Mediterranean Institute for Agriculture, Environment and Development, CEBAL-Centro de Biotecnologia Agrícola e Agro-Alimentar do Alentejo, 7801-908 Beja, Portugal
| | - Ivan N Bolotov
- Federal Center for Integrated Arctic Research, Russian Academy of Sciences, Arkhangelsk 163000, Russia
| | - Ilya V Vikhrev
- Federal Center for Integrated Arctic Research, Russian Academy of Sciences, Arkhangelsk 163000, Russia
| | - Sophie Breton
- Department of Biological Sciences, University of Montreal, Montreal, Canada
| | - L Filipe C Castro
- CIIMAR/CIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208 Matosinhos, Portugal.,Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
| | - Rute R da Fonseca
- Center for Macroecology, Evolution and Climate, GLOBE Institute, University of Copenhagen, 2100 Copenhagen, Denmark
| | - Juergen Geist
- Aquatic Systems Biology Unit, Technical University of Munich, TUM School of Life Sciences, D-85354 Freising, Germany
| | - Martin E Österling
- Department of Environmental and Life Sciences-Biology, Karlstad University, 651 88 Karlstad, Sweden
| | - Vincent Prié
- Research Associate, Institute of Systematics, Evolution, Biodiversity (ISYEB), National Museum of Natural History (MNHN), CNRS, SU, EPHE, 75005 Paris, France
| | - Amílcar Teixeira
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Bragança, Portugal
| | - Han Ming Gan
- GeneSEQ Sdn Bhd, Bandar Bukit Beruntung, Rawang 48300, Selangor, Malaysia
| | - Oleg Simakov
- Department of Neurosciences and Developmental Biology, University of Vienna, 1010 Vienna, Austria
| | - Elsa Froufe
- CIIMAR/CIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Avenida General Norton de Matos, S/N, P 4450-208 Matosinhos, Portugal
|
44
|
Zhan S, Griswold C, Lukens L. Zea mays RNA-seq estimated transcript abundances are strongly affected by read mapping bias. BMC Genomics 2021; 22:285. [PMID: 33874908 PMCID: PMC8056621 DOI: 10.1186/s12864-021-07577-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 12/10/2020] [Accepted: 03/30/2021] [Indexed: 11/27/2022] Open
Abstract
Background: Genetic variation for gene expression is a source of phenotypic variation for natural and agricultural species. The common approach to map and to quantify gene expression from genetically distinct individuals is to assign their RNA-seq reads to a single reference genome. However, RNA-seq reads from alleles dissimilar to this reference genome may fail to map correctly, causing transcript levels to be underestimated. Presently, the extent of this mapping problem is not clear, particularly in highly diverse species. We investigated whether mapping bias occurred and whether chromosomal features were associated with mapping bias. Zea mays presents a model species to assess these questions, given it has genotypically distinct and well-studied genetic lines.
Results: In Zea mays, the inbred B73 genome is the standard reference genome and template for RNA-seq read assignments. In the absence of mapping bias, B73 and a second inbred line, Mo17, would each have an approximately equal number of regulatory alleles that increase gene expression. Remarkably, Mo17 had 2–4 times fewer such positively acting alleles than did B73 when RNA-seq reads were aligned to the B73 reference genome. Reciprocally, over one-half of the B73 alleles that increased gene expression were not detected when reads were aligned to the Mo17 genome template. Genes at dissimilar chromosomal ends were strongly affected by mapping bias, and genes at more similar pericentromeric regions were less affected. Biased transcript estimates were higher in untranslated regions and lower in splice junctions. Bias occurred across software and alignment parameters.
Conclusions: Mapping bias very strongly affects gene transcript abundance estimates in maize, and bias varies across chromosomal features. Individual genome or transcriptome templates are likely necessary for accurate transcript estimation across genetically variable individuals in maize and other species.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-021-07577-3.
Affiliation(s)
- Shuhua Zhan
- Department of Plant Agriculture, University of Guelph, Guelph, Ontario, Canada
| | - Cortland Griswold
- Department of Integrative Biology, University of Guelph, Guelph, Ontario, Canada
| | - Lewis Lukens
- Department of Plant Agriculture, University of Guelph, Guelph, Ontario, Canada.
|
45
|
Flanagan BA, Krueger-Hadfield SA, Murren CJ, Nice CC, Strand AE, Sotka EE. Founder effects shape linkage disequilibrium and genomic diversity of a partially clonal invader. Mol Ecol 2021; 30:1962-1978. [PMID: 33604965 DOI: 10.1111/mec.15854] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 10/09/2020] [Revised: 01/18/2021] [Accepted: 02/01/2021] [Indexed: 12/20/2022]
Abstract
The genomic variation of an invasive species may be affected by complex demographic histories and evolutionary changes during the invasion. Here, we describe the relative influence of bottlenecks, clonality, and population expansion in determining genomic variability of the widespread red macroalga Agarophyton vermiculophyllum. Its introduction from mainland Japan to the estuaries of North America and Europe coincided with shifts from predominantly sexual to partially clonal reproduction and rapid adaptive evolution. A survey of 62,285 SNPs for 351 individuals from 35 populations, aligned to 24 chromosome-length scaffolds, indicates that linkage disequilibrium (LD), observed heterozygosity (Ho), Tajima's D, and nucleotide diversity (Pi) were greater among non-native than native populations. Evolutionary simulations indicate LD and Tajima's D were consistent with a severe population bottleneck. Also, the increased rate of clonal reproduction in the non-native range could not have produced the observed patterns by itself but may have magnified the bottleneck effect on LD. Elevated marker diversity in the genetic source populations could have contributed to the increased Ho and Pi observed in the non-native range. We refined the previous invasion source region to a ~50 km section of northeastern Honshu Island. Outlier detection methods failed to reveal any consistently differentiated loci shared among invaded regions, probably because of the complex A. vermiculophyllum demographic history. Our results reinforce the importance of demographic history, specifically founder effects, in driving genomic variation of invasive populations, even when localized adaptive evolution and reproductive system shifts are observed.
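Nucleotide diversity (Pi), one of the summary statistics named in the abstract above, is conventionally defined as the average number of pairwise differences per site among sampled sequences. The sketch below is a minimal Python illustration of that textbook definition, assuming equal-length aligned sequences with no missing data; the function name and the toy haplotypes are hypothetical and are not the authors' pipeline or data.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site across aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and of equal length")
    pairs = list(combinations(seqs, 2))
    # Count mismatching positions for every unordered pair of sequences.
    total_diffs = sum(
        sum(a != b for a, b in zip(s, t)) for s, t in pairs
    )
    return total_diffs / (len(pairs) * length)


# Hypothetical toy haplotypes, for illustration only.
toy = ["ACGT", "ACGA", "ACGT", "TCGT"]
print(nucleotide_diversity(toy))  # prints 0.25
```

Real SNP surveys usually compute this from allele frequencies and account for missing genotypes, which this sketch deliberately omits.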
Affiliation(s)
- Ben A Flanagan
- Department of Biology, College of Charleston, Charleston, SC, USA.,Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA
| | - Stacy A Krueger-Hadfield
- Department of Biology, College of Charleston, Charleston, SC, USA.,Department of Biology, University of Alabama at Birmingham, Birmingham, AL, USA
| | | | - Chris C Nice
- Department of Biology, Population and Conservation Biology Program, Texas State University, San Marcos, TX, USA
| | - Allan E Strand
- Department of Biology, College of Charleston, Charleston, SC, USA
| | - Erik E Sotka
- Department of Biology, College of Charleston, Charleston, SC, USA
|
46
|
Guo G, Chen H, Yan D, Cheng J, Chen JY, Chong Z. Scalable De Novo Genome Assembly Using a Pregel-Like Graph-Parallel System. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:731-744. [PMID: 31180898 DOI: 10.1109/tcbb.2019.2920912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/09/2023]
Abstract
De novo genome assembly is the process of stitching short DNA sequences together to generate longer DNA sequences, without using any reference sequence for alignment. It enables high-throughput genome sequencing and thus accelerates the discovery of new genomes. In this paper, we present a toolkit, called PPA-assembler, for de novo genome assembly in a distributed setting. The operations in our toolkit provide strong performance guarantees, and can be assembled to implement various sequencing strategies. PPA-assembler adopts the popular de Bruijn graph based approach for sequencing, and each operation is implemented as a program in Google's Pregel framework which can be easily deployed in a generic cluster. Experiments on large real and simulated datasets demonstrate that PPA-assembler is much more efficient than the state of the art while providing comparable sequencing quality. PPA-assembler has been open-sourced at https://github.com/yaobaiwei/PPA-Assembler.
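The de Bruijn graph approach mentioned in the abstract above can be illustrated on a single machine: every k-mer in the reads becomes an edge from its (k-1)-base prefix to its (k-1)-base suffix, and contigs are read off along unbranched paths. The Python sketch below is a toy under stated assumptions (error-free reads, one strand, no repeats); it is not PPA-assembler's code and does not use Pregel, and all function names and reads are hypothetical.

```python
from collections import defaultdict


def de_bruijn_graph(reads, k):
    """Map each (k-1)-mer prefix to the (k-1)-mer suffixes that follow it."""
    graph = defaultdict(list)
    seen = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if kmer in seen:  # count each distinct k-mer once
                continue
            seen.add(kmer)
            graph[kmer[:-1]].append(kmer[1:])
    return graph


def extend_contig(graph, start):
    """Follow successors from `start` while exactly one successor exists.

    Simplification: only out-degree is checked, which suffices for this
    repeat-free toy but not for real genomes.
    """
    contig, node, visited = start, start, {start}
    while len(graph.get(node, [])) == 1:
        nxt = graph[node][0]
        if nxt in visited:  # stop on a cycle
            break
        contig += nxt[-1]
        visited.add(nxt)
        node = nxt
    return contig


# Hypothetical overlapping reads drawn from the toy sequence ATGCGTAC.
reads = ["ATGCG", "GCGTA", "CGTAC"]
graph = de_bruijn_graph(reads, k=3)
print(extend_contig(graph, "AT"))  # prints ATGCGTAC
```

A distributed assembler would partition the graph's vertices across workers and perform the same path compaction through message passing.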
|
47
|
Silva AT, Gao B, Fisher KM, Mishler BD, Ekwealor JTB, Stark LR, Li X, Zhang D, Bowker MA, Brinda JC, Coe KK, Oliver MJ. To dry perchance to live: Insights from the genome of the desiccation-tolerant biocrust moss Syntrichia caninervis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 105:1339-1356. [PMID: 33277766 DOI: 10.1111/tpj.15116] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Received: 07/08/2020] [Accepted: 11/30/2020] [Indexed: 05/24/2023]
Abstract
With global climate change, water scarcity threatens whole agro/ecosystems. The desert moss Syntrichia caninervis, an extremophile, offers novel insights into surviving desiccation and heat. The sequenced S. caninervis genome consists of 13 chromosomes containing 16 545 protein-coding genes and 2666 unplaced scaffolds. Syntenic relationships within the S. caninervis and Physcomitrella patens genomes indicate the S. caninervis genome has undergone a single whole genome duplication event (compared to two for P. patens) and evidence suggests chromosomal or segmental losses in the evolutionary history of S. caninervis. The genome contains a large sex chromosome composed primarily of repetitive sequences with a large number of Copia and Gypsy elements. Orthogroup analyses revealed an expansion of ELIP genes encoding proteins important in photoprotection. The transcriptomic response to desiccation identified four structural clusters of novel genes. The genomic resources established for this extremophile offer new perspectives for understanding the evolution of desiccation tolerance in plants.
Affiliation(s)
- Anderson T Silva
- Division of Plant Sciences and Interdisciplinary Plant Group, University of Missouri, Columbia, Missouri, 65211, USA
| | - Bei Gao
- State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Science, Urumqi, 830011, China
| | - Kirsten M Fisher
- Department of Biological Sciences, California State University, Los Angeles, California, 90032, USA
| | - Brent D Mishler
- Department of Integrative Biology, University and Jepson Herbaria, University of California, Berkeley, California, 94720-2465, USA
| | - Jenna T B Ekwealor
- Department of Integrative Biology, University and Jepson Herbaria, University of California, Berkeley, California, 94720-2465, USA
| | - Lloyd R Stark
- School of Life Sciences, University of Nevada, Las Vegas, Nevada, 89154-4004, USA
| | - Xiaoshuang Li
- State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Science, Urumqi, 830011, China
| | - Daoyuan Zhang
- State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Science, Urumqi, 830011, China
| | - Matthew A Bowker
- School of Forestry, Northern Arizona University, Flagstaff, Arizona, 86011, USA
| | - John C Brinda
- Missouri Botanical Garden, St. Louis, Missouri, 63110-0299, USA
| | - Kirsten K Coe
- Department of Biology, Middlebury College, Middlebury, Vermont, 40506-0225, USA
| | - Melvin J Oliver
- Division of Plant Sciences and Interdisciplinary Plant Group, University of Missouri, Columbia, Missouri, 65211, USA
- USDA-ARS-MWA, Plant Genetics Research Unit, Columbia, Missouri, 65211, USA
|
48
|
Burley JT, Kellner JR, Hubbell SP, Faircloth BC. Genome assemblies for two Neotropical trees: Jacaranda copaia and Handroanthus guayacan. G3 (BETHESDA, MD.) 2021; 11:jkab010. [PMID: 33693604 PMCID: PMC8034707 DOI: 10.1093/g3journal/jkab010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/16/2020] [Accepted: 12/22/2020] [Indexed: 12/01/2022]
Abstract
The lack of genomic resources for tropical canopy trees is impeding several research avenues in tropical forest biology. We present genome assemblies for two Neotropical hardwood species, Jacaranda copaia and Handroanthus (formerly Tabebuia) guayacan, that are model systems for research on tropical tree demography and flowering phenology. For each species, we combined Illumina short-read data with in vitro proximity-ligation (Chicago) libraries to generate an assembly. For Jacaranda copaia, we obtained 104X physical coverage and produced an assembly with N50/N90 scaffold lengths of 1.020/0.277 Mbp. For H. guayacan, we obtained 129X coverage and produced an assembly with N50/N90 scaffold lengths of 0.795/0.165 Mbp. J. copaia and H. guayacan assemblies contained 95.8% and 87.9% of benchmarking orthologs, although they constituted only 77.1% and 66.7% of the estimated genome sizes of 799 and 512 Mbp, respectively. These differences were potentially due to high repetitive sequence content (>59.31% and 45.59%) and high heterozygosity (0.5% and 0.8%) in each species. Finally, we compared each new assembly to a previously sequenced genome for Handroanthus impetiginosus using whole-genome alignment. This analysis indicated extensive gene duplication in H. impetiginosus since its divergence from H. guayacan.
Affiliation(s)
- John T Burley
- Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912, USA
- Institute at Brown for Environment and Society, Brown University, Providence, RI 02912, USA
| | - James R Kellner
- Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912, USA
- Institute at Brown for Environment and Society, Brown University, Providence, RI 02912, USA
| | - Stephen P Hubbell
- Department of Ecology and Evolutionary Biology, University of California—Los Angeles, Los Angeles, CA 90095, USA
| | - Brant C Faircloth
- Department of Biological Sciences and Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA
|
49
|
Warren WC, Harris RA, Haukness M, Fiddes IT, Murali SC, Fernandes J, Dishuck PC, Storer JM, Raveendran M, Hillier LW, Porubsky D, Mao Y, Gordon D, Vollger MR, Lewis AP, Munson KM, DeVogelaere E, Armstrong J, Diekhans M, Walker JA, Tomlinson C, Graves-Lindsay TA, Kremitzki M, Salama SR, Audano PA, Escalona M, Maurer NW, Antonacci F, Mercuri L, Maggiolini FAM, Catacchio CR, Underwood JG, O'Connor DH, Sanders AD, Korbel JO, Ferguson B, Kubisch HM, Picker L, Kalin NH, Rosene D, Levine J, Abbott DH, Gray SB, Sanchez MM, Kovacs-Balint ZA, Kemnitz JW, Thomasy SM, Roberts JA, Kinnally EL, Capitanio JP, Skene JHP, Platt M, Cole SA, Green RE, Ventura M, Wiseman RW, Paten B, Batzer MA, Rogers J, Eichler EE. Sequence diversity analyses of an improved rhesus macaque genome enhance its biomedical utility. Science 2021; 370:370/6523/eabc6617. [PMID: 33335035 DOI: 10.1126/science.abc6617] [Citation(s) in RCA: 73] [Impact Index Per Article: 24.3] [Received: 05/07/2020] [Accepted: 10/29/2020] [Indexed: 12/15/2022]
Abstract
The rhesus macaque (Macaca mulatta) is the most widely studied nonhuman primate (NHP) in biomedical research. We present an updated reference genome assembly (Mmul_10, contig N50 = 46 Mbp) that increases the sequence contiguity 120-fold and annotate it using 6.5 million full-length transcripts, thus improving our understanding of gene content, isoform diversity, and repeat organization. With the improved assembly of segmental duplications, we discovered new lineage-specific genes and expanded gene families that are potentially informative in studies of evolution and disease susceptibility. Whole-genome sequencing (WGS) data from 853 rhesus macaques identified 85.7 million single-nucleotide variants (SNVs) and 10.5 million indel variants, including potentially damaging variants in genes associated with human autism and developmental delay, providing a framework for developing noninvasive NHP models of human disease.
Affiliation(s)
- Wesley C Warren
- Department of Animal Sciences, Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA.,Department of Surgery, School of Medicine, University of Missouri, Columbia, MO 65211, USA.,Institute of Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - R Alan Harris
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Marina Haukness
- Computational Genomics Laboratory, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | | | - Shwetha C Murali
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Jason Fernandes
- Department of Biomolecular Engineering, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Jessica M Storer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA.,Institue for Systems Biology, Seattle, WA 98109, USA
| | - Muthuswamy Raveendran
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - LaDeana W Hillier
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Yafei Mao
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - David Gordon
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Mitchell R Vollger
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Elizabeth DeVogelaere
- Computational Genomics Laboratory, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Joel Armstrong
- Computational Genomics Laboratory, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Mark Diekhans
- Computational Genomics Laboratory, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Jerilyn A Walker
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Chad Tomlinson
- McDonnell Genome Institute, Washington University, St. Louis, MO 63108, USA
| | | | - Milinn Kremitzki
- McDonnell Genome Institute, Washington University, St. Louis, MO 63108, USA
| | - Sofie R Salama
- Department of Biomolecular Engineering, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Peter A Audano
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Merly Escalona
- Department of Biomolecular Engineering, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Nicholas W Maurer
- Department of Biomolecular Engineering, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | | | - Ludovica Mercuri
- Department of Biology, University of Bari 'Aldo Moro', 70125 Bari, Italy
| | | | | | | | - David H O'Connor
- Department of Pathology and Laboratory Medicine, Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, WI 53711, USA
| | - Ashley D Sanders
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Jan O Korbel
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Betsy Ferguson
- Division of Genetics, Oregon National Primate Research Center, Oregon Health and Science University, Beaverton, OR 97006, USA
| | | | - Louis Picker
- Oregon National Primate Research Center and Vaccine and Gene Therapy Institute, Oregon Health Sciences University, Beaverton, OR 97006, USA
| | - Ned H Kalin
- Department of Psychiatry, University of Wisconsin School of Medicine and Public Health, Madison, WI 53719, USA
| | - Douglas Rosene
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
| | - Jon Levine
- Department of Neuroscience, University of Wisconsin, Madison, WI 53175, USA.,Wisconsin National Primate Research Center, University of Wisconsin, Madison, WI 53171, USA
| | - David H Abbott
- Wisconsin National Primate Research Center, University of Wisconsin, Madison, WI 53171, USA.,Department of Obstetrics and Gynecology, Wisconsin National Primate Research Center, University of Wisconsin, Madison, WI 53715, USA
| | - Stanton B Gray
- The University of Texas MD Anderson Cancer Center, Michale E. Keeling Center for Comparative Medicine and Research, Bastrop, TX 78602, USA
| | - Mar M Sanchez
- Yerkes National Primate Research Center, Atlanta, GA 30329, USA.,Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, GA 30329, USA
| | | | - Joseph W Kemnitz
- Wisconsin National Primate Research Center, University of Wisconsin, Madison, WI 53171, USA.,Department of Cell and Regenerative Biology, University of Wisconsin, Madison, WI 53706, USA
| | - Sara M Thomasy
- Department of Surgical and Radiological Sciences, School of Veterinary Medicine, University of California-Davis, Davis, CA 95616, USA.,Department of Ophthalmology and Vision Science, School of Medicine, University of California-Davis, Davis, CA 95817, USA
| | | | - Erin L Kinnally
- California National Primate Research Center, Davis, CA 95616, USA.,Department of Psychology, University of California, Davis, CA 95616, USA
| | - John P Capitanio
- California National Primate Research Center, Davis, CA 95616, USA.,Department of Psychology, University of California, Davis, CA 95616, USA
| | - J H Pate Skene
- Department of Neurobiology, Duke University School of Medicine, Durham, NC 27710, USA
| | - Michael Platt
- Department of Neuroscience, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Shelley A Cole
- Population Health Program, Texas Biomedical Research Institute and Southwest National Primate Research Center, San Antonio, TX 78227, USA
| | - Richard E Green
- Department of Biomolecular Engineering, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Mario Ventura
- Department of Biology, University of Bari 'Aldo Moro', 70125 Bari, Italy
| | - Roger W Wiseman
- Department of Pathology and Laboratory Medicine, Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, WI 53711, USA
| | - Benedict Paten
- Computational Genomics Laboratory, University of California-Santa Cruz, Santa Cruz, CA 95064, USA
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. .,Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
|
50
|
Island songbirds as windows into evolution in small populations. Curr Biol 2021; 31:1303-1310.e4. [PMID: 33476557 DOI: 10.1016/j.cub.2020.12.040] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Received: 04/14/2020] [Revised: 10/12/2020] [Accepted: 12/23/2020] [Indexed: 11/20/2022]
Abstract
Due to their limited ranges and inherent isolation, island species have long been recognized as crucial systems for tackling a range of evolutionary questions, including in the early study of speciation [1,2]. Such species have been less studied in the understanding of the evolutionary forces driving DNA sequence evolution. Island species usually have lower census population sizes (N) than continental species and, supposedly, lower effective population sizes (Ne). Given that both the rates of change caused by genetic drift and by selection are dependent upon Ne, island species are theoretically expected to exhibit (1) lower genetic diversity, (2) less effective natural selection against slightly deleterious mutations [3,4], and (3) a lower rate of adaptive evolution [5-8]. Here, we have used a large set of newly sequenced and published whole-genome sequences of Passerida species (14 insular and 11 continental) to test these predictions. We confirm that island species exhibit lower census size and Ne, supporting the hypothesis that the smaller area available on islands constrains the upper bound of Ne. In the insular species, we find lower nucleotide diversity in coding regions, higher ratios of non-synonymous to synonymous polymorphisms, and lower adaptive substitution rates. Our results provide robust evidence that the lower Ne experienced by island species has affected both the ability of natural selection to efficiently remove weakly deleterious mutations and also the adaptive potential of island species, therefore providing considerable empirical support for the nearly neutral theory. We discuss the implications for both evolutionary and conservation biology.
|