1
|
Mayeur H, Leyhr J, Mulley J, Leurs N, Michel L, Sharma K, Lagadec R, Aury JM, Osborne OG, Mulhair P, Poulain J, Mangenot S, Mead D, Smith M, Corton C, Oliver K, Skelton J, Betteridge E, Dolucan J, Dudchenko O, Omer AD, Weisz D, Aiden EL, McCarthy S, Sims Y, Torrance J, Tracey A, Howe K, Baril T, Hayward A, Martinand-Mari C, Sanchez S, Haitina T, Martin K, Korsching SI, Mazan S, Debiais-Thibaud M. The sensory shark: high-quality morphological, genomic and transcriptomic data for the small-spotted catshark Scyliorhinus canicula reveal the molecular bases of sensory organ evolution in jawed vertebrates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.23.595469. [PMID: 39005470 PMCID: PMC11244906 DOI: 10.1101/2024.05.23.595469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Cartilaginous fishes (chimaeras and elasmobranchs -sharks, skates and rays) hold a key phylogenetic position to explore the origin and diversifications of jawed vertebrates. Here, we report and integrate reference genomic, transcriptomic and morphological data in the small-spotted catshark Scyliorhinus canicula to shed light on the evolution of sensory organs. We first characterise general aspects of the catshark genome, confirming the high conservation of genome organisation across cartilaginous fishes, and investigate population genomic signatures. Taking advantage of a dense sampling of transcriptomic data, we also identify gene signatures for all major organs, including chondrichthyan specializations, and evaluate expression diversifications between paralogs within major gene families involved in sensory functions. Finally, we combine these data with 3D synchrotron imaging and in situ gene expression analyses to explore chondrichthyan-specific traits and more general evolutionary trends of sensory systems. This approach brings to light, among others, novel markers of the ampullae of Lorenzini electro-sensory cells, a duplication hotspot for crystallin genes conserved in jawed vertebrates, and a new metazoan clade of the Transient-receptor potential (TRP) family. These resources and results, obtained in an experimentally tractable chondrichthyan model, open new avenues to integrate multiomics analyses for the study of elasmobranchs and jawed vertebrates.
Collapse
|
2
|
Law STS, Nong W, Li C, Chong TK, Yip HY, Swale T, Chiu SW, Chung RYN, Lam HM, Wong SYS, Wong H, Hui JHL. Genome of tropical bed bug Cimex hemipterus (Cimicidae, Hemiptera) reveals tetraspanin expanded in bed bug ancestor. INSECT SCIENCE 2024. [PMID: 38830803 DOI: 10.1111/1744-7917.13388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 04/27/2024] [Accepted: 05/02/2024] [Indexed: 06/05/2024]
Abstract
Cimex species are ectoparasites that exclusively feed on warm-blooded animals such as birds and mammals. Three cimicid species are known to be persistent pests for humans, including the tropical bed bug Cimex hemipterus, common bed bug Cimex lectularius, and Eastern bat bug Leptocimex boueti. To date, genomic information is restricted to the common bed bug C. lectularius, which limits understanding their biology and to provide controls of bed bug infestations. Here, a chromosomal-level genome assembly of C. hemipterus (495 Mb [megabase pairs]) contained on 16 pseudochromosomes (scaffold N50 = 34 Mb), together with 9 messenger RNA and small RNA transcriptomes were obtained. In comparison between hemipteran genomes, we found that the tetraspanin superfamily was expanded in the Cimex ancestor. This study provides the first genome assembly for the tropical bed bug C. hemipterus, and offers an unprecedented opportunity to address questions relating to bed bug infestations, as well as genomic evolution to hemipterans more widely.
Collapse
Affiliation(s)
- Sean Tsz Sum Law
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | - Wenyan Nong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | - Chade Li
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | - Tze Kiu Chong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | - Ho Yin Yip
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | | | - Siu Wai Chiu
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Roger Yat-Nork Chung
- School of Public Health and Primary Care, CUHK Institute of Health Equity, The Chinese University of Hong Kong, Hong Kong, China
| | - Hon-Ming Lam
- School of Life Sciences, State Key Laboratory of Agrobiotechnology, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | - Samuel Y S Wong
- School of Public Health and Primary Care, CUHK Institute of Health Equity, The Chinese University of Hong Kong, Hong Kong, China
| | - Hung Wong
- Department of Social Work, CUHK Institute of Health Equity, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | - Jerome H L Hui
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
3
|
Peona V, Martelossi J, Almojil D, Bocharkina J, Brännström I, Brown M, Cang A, Carrasco-Valenzuela T, DeVries J, Doellman M, Elsner D, Espíndola-Hernández P, Montoya GF, Gaspar B, Zagorski D, Hałakuc P, Ivanovska B, Laumer C, Lehmann R, Boštjančić LL, Mashoodh R, Mazzoleni S, Mouton A, Nilsson MA, Pei Y, Potente G, Provataris P, Pardos-Blas JR, Raut R, Sbaffi T, Schwarz F, Stapley J, Stevens L, Sultana N, Symonova R, Tahami MS, Urzì A, Yang H, Yusuf A, Pecoraro C, Suh A. Teaching transposon classification as a means to crowd source the curation of repeat annotation - a tardigrade perspective. Mob DNA 2024; 15:10. [PMID: 38711146 DOI: 10.1186/s13100-024-00319-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 04/09/2024] [Indexed: 05/08/2024] Open
Abstract
BACKGROUND The advancement of sequencing technologies results in the rapid release of hundreds of new genome assemblies a year providing unprecedented resources for the study of genome evolution. Within this context, the significance of in-depth analyses of repetitive elements, transposable elements (TEs) in particular, is increasingly recognized in understanding genome evolution. Despite the plethora of available bioinformatic tools for identifying and annotating TEs, the phylogenetic distance of the target species from a curated and classified database of repetitive element sequences constrains any automated annotation effort. Moreover, manual curation of raw repeat libraries is deemed essential due to the frequent incompleteness of automatically generated consensus sequences. RESULTS Here, we present an example of a crowd-sourcing effort aimed at curating and annotating TE libraries of two non-model species built around a collaborative, peer-reviewed teaching process. Manual curation and classification are time-consuming processes that offer limited short-term academic rewards and are typically confined to a few research groups where methods are taught through hands-on experience. Crowd-sourcing efforts could therefore offer a significant opportunity to bridge the gap between learning the methods of curation effectively and empowering the scientific community with high-quality, reusable repeat libraries. CONCLUSIONS The collaborative manual curation of TEs from two tardigrade species, for which there were no TE libraries available, resulted in the successful characterization of hundreds of new and diverse TEs in a reasonable time frame. Our crowd-sourcing setting can be used as a teaching reference guide for similar projects: A hidden treasure awaits discovery within non-model organisms.
Collapse
Affiliation(s)
- Valentina Peona
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden.
- Swiss Ornithological Institute Vogelwarte, Sempach, CH-6204, Switzerland.
- Department of Bioinformatics and Genetics, Swedish Natural History Museum, Stockholm, Sweden.
| | - Jacopo Martelossi
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, Bologna, 40126, Italy.
| | - Dareen Almojil
- New York University Abu Dhabi, Saadiyat Island, United Arab Emirates
| | | | - Ioana Brännström
- Natural History Museum, Oslo University, Oslo, Norway
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| | - Max Brown
- Anglia Ruskin University, East Rd, Cambridge, CB1 1PT, UK
| | | | - Tomàs Carrasco-Valenzuela
- Evolutionary Genetics Department, Leibniz Institute for Zoo and Wildlife Research, 10315, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, 14195, Berlin, Germany
| | - Jon DeVries
- Reed College, Portland, OR, United States of America
| | - Meredith Doellman
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL, 60637, USA
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Daniel Elsner
- Evolutionary Biology & Ecology, University of Freiburg, Freiburg, Germany
| | - Pamela Espíndola-Hernández
- Research Unit Comparative Microbiome Analysis (COMI), Helmholtz Zentrum München, Ingolstädter Landstraße 1, D-85764, Neuherberg, Germany
| | | | - Bence Gaspar
- Institute of Evolution and Ecology, University of Tuebingen, Tuebingen, Germany
| | - Danijela Zagorski
- Institute of Botany, Czech Academy of Sciences, Průhonice, Czech Republic
| | - Paweł Hałakuc
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Beti Ivanovska
- Institute of Genetics and Biotechnology, Hungarian University of Agriculture and Life Sciences, Budapest, Hungary
| | | | - Robert Lehmann
- Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Ljudevit Luka Boštjančić
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt, Germany
| | - Rahia Mashoodh
- Department of Genetics, Environment & Evolution, Centre for Biodiversity & Environment Research, University College London, London, UK
| | - Sofia Mazzoleni
- Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Alice Mouton
- INBIOS-Conservation Genetic Lab, University of Liege, Liege, Belgium
| | - Maria Anna Nilsson
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt, Germany
| | - Yifan Pei
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden
- Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Adenauerallee 127, 53113, Bonn, Germany
| | - Giacomo Potente
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Panagiotis Provataris
- German Cancer Research Center, NGS Core Facility, DKFZ-ZMBH Alliance, 69120, Heidelberg, Germany
| | - José Ramón Pardos-Blas
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales (MNCN-CSIC), José Gutiérrez Abascal 2, Madrid, 28006, Spain
| | - Ravindra Raut
- Department of Biotechnology, National Institute of Technology Durgapur, Durgapur, India
| | - Tomasa Sbaffi
- Molecular Ecology Group (MEG), National Research Council of Italy - Water Research Institute (CNR-IRSA), Verbania, Italy
| | - Florian Schwarz
- Eurofins Genomics Europe Pharma and Diagnostics Products & Services Sales GmbH, Ebersberg, Germany
| | - Jessica Stapley
- Plant Pathology Group, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
| | - Lewis Stevens
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
| | - Nusrat Sultana
- Department of Botany, Jagannath Univerity, Dhaka, 1100, Bangladesh
| | - Radka Symonova
- Institute of Hydrobiology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Mohadeseh S Tahami
- Department of Biological and Environmental Science, University of Jyväskylä, P.O. Box 35, Jyväskylä, 40014, Finland
| | - Alice Urzì
- Centogene GmbH, Am Strande 7, 18055, Rostock, Germany
| | - Heidi Yang
- Department of Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
| | - Abdullah Yusuf
- Zell- und Molekularbiologie der Pflanzen, Technische Universität Dresden, Dresden, Germany
| | | | - Alexander Suh
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden.
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TU, UK.
- Present address: Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Adenauerallee 160, 53113, Bonn, Germany.
| |
Collapse
|
4
|
David G, Bertolotti A, Layer R, Scofield D, Hayward A, Baril T, Burnett HA, Gudmunds E, Jensen H, Husby A. Calling Structural Variants with Confidence from Short-Read Data in Wild Bird Populations. Genome Biol Evol 2024; 16:evae049. [PMID: 38489588 PMCID: PMC11018544 DOI: 10.1093/gbe/evae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/28/2024] [Accepted: 03/07/2024] [Indexed: 03/17/2024] Open
Abstract
Comprehensive characterization of structural variation in natural populations has only become feasible in the last decade. To investigate the population genomic nature of structural variation, reproducible and high-confidence structural variation callsets are first required. We created a population-scale reference of the genome-wide landscape of structural variation across 33 Nordic house sparrows (Passer domesticus). To produce a consensus callset across all samples using short-read data, we compare heuristic-based quality filtering and visual curation (Samplot/PlotCritic and Samplot-ML) approaches. We demonstrate that curation of structural variants is important for reducing putative false positives and that the time invested in this step outweighs the potential costs of analyzing short-read-discovered structural variation data sets that include many potential false positives. We find that even a lenient manual curation strategy (e.g. applied by a single curator) can reduce the proportion of putative false positives by up to 80%, thus enriching the proportion of high-confidence variants. Crucially, in applying a lenient manual curation strategy with a single curator, nearly all (>99%) variants rejected as putative false positives were also classified as such by a more stringent curation strategy using three additional curators. Furthermore, variants rejected by manual curation failed to reflect the expected population structure from SNPs, whereas variants passing curation did. Combining heuristic-based quality filtering with rapid manual curation of structural variants in short-read data can therefore become a time- and cost-effective first step for functional and population genomic studies requiring high-confidence structural variation callsets.
Collapse
Affiliation(s)
- Gabriel David
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | | | - Ryan Layer
- BioFrontiers Institute, University of Colorado, Boulder, CO, USA
- Department of Computer Science, University of Colorado, Boulder, CO, USA
| | - Douglas Scofield
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, UK
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, UK
| | - Hamish A Burnett
- Centre for Biodiversity Dynamics, Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
| | - Erik Gudmunds
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Henrik Jensen
- Centre for Biodiversity Dynamics, Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
| | - Arild Husby
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| |
Collapse
|
5
|
Baril T, Galbraith J, Hayward A. Earl Grey: A Fully Automated User-Friendly Transposable Element Annotation and Analysis Pipeline. Mol Biol Evol 2024; 41:msae068. [PMID: 38577785 PMCID: PMC11003543 DOI: 10.1093/molbev/msae068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Revised: 02/20/2024] [Accepted: 03/22/2024] [Indexed: 04/06/2024] Open
Abstract
Transposable elements (TEs) are major components of eukaryotic genomes and are implicated in a range of evolutionary processes. Yet, TE annotation and characterization remain challenging, particularly for nonspecialists, since existing pipelines are typically complicated to install, run, and extract data from. Current methods of automated TE annotation are also subject to issues that reduce overall quality, particularly (i) fragmented and overlapping TE annotations, leading to erroneous estimates of TE count and coverage, and (ii) repeat models represented by short sections of total TE length, with poor capture of 5' and 3' ends. To address these issues, we present Earl Grey, a fully automated TE annotation pipeline designed for user-friendly curation and annotation of TEs in eukaryotic genome assemblies. Using nine simulated genomes and an annotation of Drosophila melanogaster, we show that Earl Grey outperforms current widely used TE annotation methodologies in ameliorating the issues mentioned above while scoring highly in benchmarking for TE annotation and classification and being robust across genomic contexts. Earl Grey provides a comprehensive and fully automated TE annotation toolkit that provides researchers with paper-ready summary figures and outputs in standard formats compatible with other bioinformatics tools. Earl Grey has a modular format, with great scope for the inclusion of additional modules focused on further quality control and tailored analyses in future releases.
Collapse
Affiliation(s)
- Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, 2000 Neuchâtel, Switzerland
| | - James Galbraith
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Alex Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
| |
Collapse
|
6
|
Zheng C, Chen Q, Huang S, Song W, Chen G, Lin H, Xu C, Qian X, Cheng Y, Jiang A, Fan Z, Liu Y. Chromosome-scale Genome assembly of the critically endangered White-eared Night-Heron (Gorsachius magnificus). Sci Data 2024; 11:73. [PMID: 38228677 DOI: 10.1038/s41597-023-02894-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 12/27/2023] [Indexed: 01/18/2024] Open
Abstract
The White-eared Night-Heron (Gorsachius magnificus, G. magnificus) is a critically endangered heron that is very poorly known and only found in southern China and northern Vietnam, with an estimated population of 250 to 999 mature individuals. However, the lack of a reference genome has hindered the implementation of conservation management efforts. In this study, we present the first high-quality chromosome-scale reference genome, which was assembled by integrating PacBio long-reads sequencing, Illumina paired-end sequencing, and Hi-C technology. The genome has a total length of 1.176 Gb, with a scaffold N50 of 84.77 Mb and a contig N50 of 18.46 Mb. Utilizing Hi-C data, we anchored 99.89% of the scaffold sequences onto 29 pairs of chromosomes. Additionally, we identified 18,062 protein-coding genes in the genome, with 95.00% of which were functionally annotated. Notably, BUSCO assessment confirmed the presence of 97.2% of highly conserved Aves genes within the genome. This chromosome-level genome assembly and annotation will be valuable for future investigating the G. magnificus's evolutionary adaptation and conservation.
Collapse
Affiliation(s)
- Chenqing Zheng
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, 510275, Guangzhou, China
- School of Ecology, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Qing Chen
- School of Ecology, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Shiguo Huang
- The Forestry Bureau of Chun'an County, Chun'an, 510275, Zheijang, China
| | - Weizhen Song
- The Forestry Bureau of Chun'an County, Chun'an, 510275, Zheijang, China
| | - Guoling Chen
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, 510275, Guangzhou, China
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
| | - Hongzhou Lin
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, 510275, Guangzhou, China
| | - Chunsheng Xu
- The Forestry Bureau of Chun'an County, Chun'an, 510275, Zheijang, China
| | - Xiran Qian
- The Forestry Bureau of Chun'an County, Chun'an, 510275, Zheijang, China
| | - Yachang Cheng
- School of Ecology, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Aiwu Jiang
- Guangxi Key Laboratory of Forest Ecology and Conservation, College of Forestry, Guangxi University, Nanning, 530004, China
| | - Zhongyong Fan
- Zhejiang Museum of Natural History, Zhejiang Biodiversity Research Center, Hangzhou, 310014, China.
| | - Yang Liu
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, 510275, Guangzhou, China.
- School of Ecology, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
| |
Collapse
|
7
|
Gozashti L, Hoekstra HE. Accounting for diverse transposable element landscapes is key to developing and evaluating accurate de novo annotation strategies. Genome Biol 2024; 25:4. [PMID: 38166955 PMCID: PMC10763064 DOI: 10.1186/s13059-023-03118-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 11/22/2023] [Indexed: 01/05/2024] Open
Abstract
Transposable elements (TEs) are important drivers of genome evolution. Nonetheless, TE annotation remains a complex and challenging task. As more genomes from phylogenetically diverse species are published, a comprehensive pipeline for accurate annotation of diverse TEs is increasingly important. Recently, (Ou et al. Genome Biol. 20:275, 2019) developed a new comprehensive pipeline, Extensive De novo Transposable element Annotator (EDTA), and benchmarked its performance on the genomes of three species: maize, wheat, and fruit fly. Because TE landscapes can vary tremendously across species, we tested EDTA's performance on four additional genomes with different TE landscapes: mouse, zebrafish, zebra finch, and chicken. Our analysis reveals that EDTA faces challenges with repeat classification in these genomes and underperforms overall relative to its benchmark dataset. Notably, EDTA consistently misclassifies nonLTR retrotransposons as DNA transposons, resulting in erroneous TE annotations for species with considerable repertoires of nonLTR retrotransposons. Overall, we set expectations for EDTA's performance on genomes spanning additional diversity, urge caution when using EDTA on genomes with divergent TE repertoires from the species on which it was initially benchmarked, and hope to motivate the development of methods that are robust to both the diversity of TEs and TE landscapes observed across species.
Collapse
Affiliation(s)
- Landen Gozashti
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA, 02138, USA
| | - Hopi E Hoekstra
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA, 02138, USA.
| |
Collapse
|
8
|
Pirani RM, Arias CF, Charles K, Chung AK, Curlis JD, Nicholson DJ, Vargas M, Cox CL, McMillan WO, Logan ML. A high-quality genome for the slender anole (Anolis apletophallus): an emerging model for field studies of tropical ecology and evolution. G3 (BETHESDA, MD.) 2023; 14:jkad248. [PMID: 37875105 PMCID: PMC10755174 DOI: 10.1093/g3journal/jkad248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2023] [Revised: 01/22/2023] [Accepted: 10/05/2023] [Indexed: 10/26/2023]
Abstract
The slender anole, Anolis apletophallus, is a small arboreal lizard of the rainforest understory of central and eastern Panama. This species has been the subject of numerous ecological and evolutionary studies over the past 60 years as a result of attributes that make it especially amenable to field and laboratory science. Slender anoles are highly abundant, short-lived (nearly 100% annual turnover), easy to manipulate in both the lab and field, and are ubiquitous in the forests surrounding the Smithsonian Tropical Research Institute in Panama, where researchers have access to high-quality laboratory facilities. Here, we present a high-quality genome for the slender anole, which is an important new resource for studying this model species. We assembled and annotated the slender anole genome by combining 3 technologies: Oxford Nanopore, 10× Genomics Linked-Reads, and Dovetail Omni-C. We compared this genome with the recently published brown anole (Anolis sagrei) and the canonical green anole (Anolis carolinensis) genomes. Our genome is the first assembled for an Anolis lizard from mainland Central or South America, the regions that host the majority of diversity in the genus. This new reference genome is one of the most complete genomes of any anole assembled to date and should facilitate deeper studies of slender anole evolution, as well as broader scale comparative genomic studies of both mainland and island species. In turn, such studies will further our understanding of the well-known adaptive radiation of Anolis lizards.
Collapse
Affiliation(s)
- Renata M Pirani
- Department of Biology, University of Nevada Reno, Reno, NV 89557, USA
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
| | - Carlos F Arias
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20013, USA
| | - Kristin Charles
- Department of Biology, University of Nevada Reno, Reno, NV 89557, USA
| | - Albert K Chung
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544-2016, USA
| | - John David Curlis
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
- Department of Ecology and Evolution, University of Michigan, Ann Arbor, MI 48109-1085, USA
| | - Daniel J Nicholson
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
- University of Texas, Arlington, TX 76019, USA
| | - Marta Vargas
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
| | - Christian L Cox
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
- Department of Biological Sciences and Institute of Environment, Florida International University, Miami, FL 33199, USA
| | - W Owen McMillan
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
| | - Michael L Logan
- Department of Biology, University of Nevada Reno, Reno, NV 89557, USA
- Smithsonian Tropical Research Institute, Panama City 0843-03092, Panama
| |
Collapse
|
9
|
Baril T, Pym A, Bass C, Hayward A. Transposon accumulation at xenobiotic gene family loci in aphids. Genome Res 2023; 33:1718-1733. [PMID: 37852781 PMCID: PMC10691553 DOI: 10.1101/gr.277820.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 08/29/2023] [Indexed: 10/20/2023]
Abstract
The evolution of resistance is a major challenge for the sustainable control of pests and pathogens. Thus, a deeper understanding of the evolutionary and genomic mechanisms underpinning resistance evolution is required to safeguard health and food production. Several studies have implicated transposable elements (TEs) in xenobiotic-resistance evolution in insects. However, analyses are generally restricted to one insect species and/or one or a few xenobiotic gene families (XGFs). We examine evidence for TE accumulation at XGFs by performing a comparative genomic analysis across 20 aphid genomes, considering major subsets of XGFs involved in metabolic resistance to insecticides: cytochrome P450s, glutathione S-transferases, esterases, UDP-glucuronosyltransferases, and ABC transporters. We find that TEs are significantly enriched at XGFs compared with other genes. XGFs show similar levels of TE enrichment to those of housekeeping genes. But unlike housekeeping genes, XGFs are not constitutively expressed in germline cells, supporting the selective enrichment of TEs at XGFs rather than enrichment owing to chromatin availability. Hotspots of extreme TE enrichment occur around certain XGFs. We find, in aphids of agricultural importance, particular enrichment of TEs around cytochrome P450 genes with known functions in the detoxification of synthetic insecticides. Our results provide evidence supporting a general role for TEs as a source of genomic variation at host XGFs and highlight the existence of considerable variability in TE content across XGFs and host species. These findings show the need for detailed functional verification analyses to clarify the significance of individual TE insertions and elucidate underlying mechanisms at TE-XGF hotspots.
Collapse
Affiliation(s)
- Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
| | - Adam Pym
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
| | - Chris Bass
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
| | - Alex Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
| |
Collapse
|
10
|
Zhao P, Gu L, Gao Y, Pan Z, Liu L, Li X, Zhou H, Yu D, Han X, Qian L, Liu GE, Fang L, Wang Z. Young SINEs in pig genomes impact gene regulation, genetic diversity, and complex traits. Commun Biol 2023; 6:894. [PMID: 37652983 PMCID: PMC10471783 DOI: 10.1038/s42003-023-05234-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 08/09/2023] [Indexed: 09/02/2023] Open
Abstract
Transposable elements (TEs) are a major source of genetic polymorphisms and play a role in chromatin architecture, gene regulatory networks, and genomic evolution. However, their functional role in pigs and contributions to complex traits are largely unknown. We created a catalog of TEs (n = 3,087,929) in pigs and found that young SINEs were predominantly silenced by histone modifications, DNA methylation, and decreased accessibility. However, some transcripts from active young SINEs showed high tissue-specificity, as confirmed by analyzing 3570 RNA-seq samples. We also detected 211,067 dimorphic SINEs in 374 individuals, including 340 population-specific ones associated with local adaptation. Mapping these dimorphic SINEs to genome-wide associations of 97 complex traits in pigs, we found 54 candidate genes (e.g., ANK2 and VRTN) that might be mediated by TEs. Our findings highlight the important roles of young SINEs and provide a supplement for genotype-to-phenotype associations and modern breeding in pigs.
Collapse
Affiliation(s)
- Pengju Zhao
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Lihong Gu
- Institute of Animal Science & Veterinary Medicine, Hainan Academy of Agricultural Sciences, No. 14 Xingdan Road, Haikou, 571100, China
| | - Yahui Gao
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Zhangyuan Pan
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Lei Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
| | - Xingzheng Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Dongyou Yu
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Xinyan Han
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Lichun Qian
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, 8000, Denmark.
| | - Zhengguang Wang
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China.
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China.
| |
Collapse
|
11
|
Magdy Mohamed Abdelaziz Barakat S, Sallehuddin R, Yuhaniz SS, R. Khairuddin RF, Mahmood Y. Genome assembly composition of the String "ACGT" array: a review of data structure accuracy and performance challenges. PeerJ Comput Sci 2023; 9:e1180. [PMID: 37547391 PMCID: PMC10403225 DOI: 10.7717/peerj-cs.1180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 04/27/2023] [Indexed: 08/08/2023]
Abstract
Background The development of sequencing technology increases the number of genomes being sequenced. However, obtaining a quality genome sequence remains a challenge in genome assembly by assembling a massive number of short strings (reads) with the presence of repetitive sequences (repeats). Computer algorithms for genome assembly construct the entire genome from reads in two approaches. The de novo approach concatenates the reads based on the exact match between their suffix-prefix (overlapping). Reference-guided approach orders the reads based on their offsets in a well-known reference genome (reads alignment). The presence of repeats extends the technical ambiguity, making the algorithm unable to distinguish the reads resulting in misassembly and affecting the assembly approach accuracy. On the other hand, the massive number of reads causes a big assembly performance challenge. Method The repeat identification method was introduced for misassembly by prior identification of repetitive sequences, creating a repeat knowledge base to reduce ambiguity during the assembly process, thus enhancing the accuracy of the assembled genome. Also, hybridization between assembly approaches resulted in a lower misassembly degree with the aid of the reference genome. The assembly performance is optimized through data structure indexing and parallelization. This article's primary aim and contribution are to support the researchers through an extensive review to ease other researchers' search for genome assembly studies. The study also, highlighted the most recent developments and limitations in genome assembly accuracy and performance optimization. Results Our findings show the limitations of the repeat identification methods available, which only allow to detect of specific lengths of the repeat, and may not perform well when various types of repeats are present in a genome. We also found that most of the hybrid assembly approaches, either starting with de novo or reference-guided, have some limitations in handling repetitive sequences as it is more computationally costly and time intensive. Although the hybrid approach was found to outperform individual assembly approaches, optimizing its performance remains a challenge. Also, the usage of parallelization in overlapping and reads alignment for genome assembly is yet to be fully implemented in the hybrid assembly approach. Conclusion We suggest combining multiple repeat identification methods to enhance the accuracy of identifying the repeats as an initial step to the hybrid assembly approach and combining genome indexing with parallelization for better optimization of its performance.
Collapse
Affiliation(s)
| | - Roselina Sallehuddin
- Computer Science, School of Computing, Faculty of Engineering, Universiti Teknologi Malaysia, Skudai, Johor, Malaysia
| | - Siti Sophiayati Yuhaniz
- Advanced Informatics Department, Razak Faculty of Technology and Informatics, Universiti Teknologi Malaysia, Kuala Lumpur, Kuala Lumpur, Malaysia
| | | | - Yasir Mahmood
- Faculty of Information Technology, The University of Lahore, Lahore, Lahore, Pakistan
| |
Collapse
|
12
|
Martelossi J, Nicolini F, Subacchi S, Pasquale D, Ghiselli F, Luchetti A. Multiple and diversified transposon lineages contribute to early and recent bivalve genome evolution. BMC Biol 2023; 21:145. [PMID: 37365567 DOI: 10.1186/s12915-023-01632-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 05/25/2023] [Indexed: 06/28/2023] Open
Abstract
BACKGROUND Transposable elements (TEs) can represent one of the major sources of genomic variation across eukaryotes, providing novel raw materials for species diversification and innovation. While considerable effort has been made to study their evolutionary dynamics across multiple animal clades, molluscs represent a substantially understudied phylum. Here, we take advantage of the recent increase in mollusc genomic resources and adopt an automated TE annotation pipeline combined with a phylogenetic tree-based classification, as well as extensive manual curation efforts, to characterize TE repertories across 27 bivalve genomes with a particular emphasis on DDE/D class II elements, long interspersed nuclear elements (LINEs), and their evolutionary dynamics. RESULTS We found class I elements as highly dominant in bivalve genomes, with LINE elements, despite less represented in terms of copy number per genome, being the most common retroposon group covering up to 10% of their genome. We mined 86,488 reverse transcriptases (RVT) containing LINE coming from 12 clades distributed across all known superfamilies and 14,275 class II DDE/D-containing transposons coming from 16 distinct superfamilies. We uncovered a previously underestimated rich and diverse bivalve ancestral transposon complement that could be traced back to their most recent common ancestor that lived ~ 500 Mya. Moreover, we identified multiple instances of lineage-specific emergence and loss of different LINEs and DDE/D lineages with the interesting cases of CR1- Zenon, Proto2, RTE-X, and Academ elements that underwent a bivalve-specific amplification likely associated with their diversification. Finally, we found that this LINE diversity is maintained in extant species by an equally diverse set of long-living and potentially active elements, as suggested by their evolutionary history and transcription profiles in both male and female gonads. CONCLUSIONS We found that bivalves host an exceptional diversity of transposons compared to other molluscs. Their LINE complement could mainly follow a "stealth drivers" model of evolution where multiple and diversified families are able to survive and co-exist for a long period of time in the host genome, potentially shaping both recent and early phases of bivalve genome evolution and diversification. Overall, we provide not only the first comparative study of TE evolutionary dynamics in a large but understudied phylum such as Mollusca, but also a reference library for ORF-containing class II DDE/D and LINE elements, which represents an important genomic resource for their identification and characterization in novel genomes.
Collapse
Affiliation(s)
- Jacopo Martelossi
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, 40126, Bologna, Italy
| | - Filippo Nicolini
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, 40126, Bologna, Italy
- Fano Marine Center, Department of Biological, Geological and Environmental Sciences, University of Bologna, Viale Adriatico 1/N, 61032, Fano, Italy
| | - Simone Subacchi
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, 40126, Bologna, Italy
| | - Daniela Pasquale
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, 40126, Bologna, Italy
| | - Fabrizio Ghiselli
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, 40126, Bologna, Italy.
| | - Andrea Luchetti
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, 40126, Bologna, Italy
| |
Collapse
|
13
|
Bista I, Wood JMD, Desvignes T, McCarthy SA, Matschiner M, Ning Z, Tracey A, Torrance J, Sims Y, Chow W, Smith M, Oliver K, Haggerty L, Salzburger W, Postlethwait JH, Howe K, Clark MS, William Detrich H, Christina Cheng CH, Miska EA, Durbin R. Genomics of cold adaptations in the Antarctic notothenioid fish radiation. Nat Commun 2023; 14:3412. [PMID: 37296119 PMCID: PMC10256766 DOI: 10.1038/s41467-023-38567-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 05/05/2023] [Indexed: 06/12/2023] Open
Abstract
Numerous novel adaptations characterise the radiation of notothenioids, the dominant fish group in the freezing seas of the Southern Ocean. To improve understanding of the evolution of this iconic fish group, here we generate and analyse new genome assemblies for 24 species covering all major subgroups of the radiation, including five long-read assemblies. We present a new estimate for the onset of the radiation at 10.7 million years ago, based on a time-calibrated phylogeny derived from genome-wide sequence data. We identify a two-fold variation in genome size, driven by expansion of multiple transposable element families, and use the long-read data to reconstruct two evolutionarily important, highly repetitive gene family loci. First, we present the most complete reconstruction to date of the antifreeze glycoprotein gene family, whose emergence enabled survival in sub-zero temperatures, showing the expansion of the antifreeze gene locus from the ancestral to the derived state. Second, we trace the loss of haemoglobin genes in icefishes, the only vertebrates lacking functional haemoglobins, through complete reconstruction of the two haemoglobin gene clusters across notothenioid families. Both the haemoglobin and antifreeze genomic loci are characterised by multiple transposon expansions that may have driven the evolutionary history of these genes.
Collapse
Affiliation(s)
- Iliana Bista
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK.
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK.
- Wellcome/CRUK Gurdon Institute, University of Cambridge, Tennis Court Rd, Cambridge, CB2 1QN, UK.
- Naturalis Biodiversity Center, Leiden, 2333 CR, the Netherlands.
| | - Jonathan M D Wood
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Thomas Desvignes
- University of Oregon, Institute of Neuroscience, 1254 University of Oregon, 13th Avenue, Eugene, OR, 97403, USA
| | - Shane A McCarthy
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK
| | - Michael Matschiner
- University of Oslo, Natural History Museum, University of Oslo, Sars' gate 1, 0562, Oslo, Norway
- University of Zurich, Department of Palaeontology and Museum, University of Zurich, Karl-Schmid-Strasse 4, 8006, Zurich, Switzerland
| | - Zemin Ning
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Alan Tracey
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - James Torrance
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Ying Sims
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - William Chow
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Michelle Smith
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Karen Oliver
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Walter Salzburger
- University of Basel, Zoological Institute, Department of Environmental Sciences, Vesalgasse 1, 4051, Basel, Switzerland
| | - John H Postlethwait
- University of Oregon, Institute of Neuroscience, 1254 University of Oregon, 13th Avenue, Eugene, OR, 97403, USA
| | - Kerstin Howe
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Melody S Clark
- British Antarctic Survey, High Cross, Madingley Road, Cambridge, CB3 0ET, UK
| | - H William Detrich
- Northeastern University, Department of Marine and Environmental Sciences, Marine Science Centre, 430 Nahant Rd., Nahant, MA, 01908, USA
| | - C-H Christina Cheng
- Department of Evolution, Ecology, and Behaviour, University of Illinois, Urbana-Champaign, IL, 61801, USA
| | - Eric A Miska
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
- Wellcome/CRUK Gurdon Institute, University of Cambridge, Tennis Court Rd, Cambridge, CB2 1QN, UK
| | - Richard Durbin
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK.
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK.
| |
Collapse
|
14
|
Osmanski AB, Paulat NS, Korstian J, Grimshaw JR, Halsey M, Sullivan KAM, Moreno-Santillán DD, Crookshanks C, Roberts J, Garcia C, Johnson MG, Densmore LD, Stevens RD, Rosen J, Storer JM, Hubley R, Smit AFA, Dávalos LM, Karlsson EK, Lindblad-Toh K, Ray DA. Insights into mammalian TE diversity through the curation of 248 genome assemblies. Science 2023; 380:eabn1430. [PMID: 37104570 PMCID: PMC11103246 DOI: 10.1126/science.abn1430] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 10/28/2022] [Indexed: 04/29/2023]
Abstract
We examined transposable element (TE) content of 248 placental mammal genome assemblies, the largest de novo TE curation effort in eukaryotes to date. We found that although mammals resemble one another in total TE content and diversity, they show substantial differences with regard to recent TE accumulation. This includes multiple recent expansion and quiescence events across the mammalian tree. Young TEs, particularly long interspersed elements, drive increases in genome size, whereas DNA transposons are associated with smaller genomes. Mammals tend to accumulate only a few types of TEs at any given time, with one TE type dominating. We also found association between dietary habit and the presence of DNA transposon invasions. These detailed annotations will serve as a benchmark for future comparative TE analyses among placental mammals.
Collapse
Affiliation(s)
- Austin B. Osmanski
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Nicole S. Paulat
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Jenny Korstian
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Jenna R. Grimshaw
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Michaela Halsey
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | | | | | | | - Jacquelyn Roberts
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Carlos Garcia
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Matthew G. Johnson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | | | - Richard D. Stevens
- Department of Natural Resources Management and Natural Science Research Laboratory, Museum of Texas Tech University, Lubbock, TX, USA
| | | | - Jeb Rosen
- Institute for Systems Biology, Seattle, WA, USA
| | | | | | | | - Liliana M. Dávalos
- Department of Ecology & Evolution, Stony Brook University, Stony Brook, NY, USA
- Consortium for Inter-Disciplinary Environmental Research, Stony Brook University, Stony Brook, NY, USA
| | - Elinor K. Karlsson
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kerstin Lindblad-Toh
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Bioinformatics and Integrative Biology, UMass Chan Medical School, Worcester, MA, USA
- Program in Molecular Medicine, UMass Chan Medical School, Worcester, MA, USA
| | - David A. Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| |
Collapse
|
15
|
Mackintosh A, Laetsch DR, Baril T, Ebdon S, Jay P, Vila R, Hayward A, Lohse K. The genome sequence of the scarce swallowtail, Iphiclides podalirius. G3 GENES|GENOMES|GENETICS 2022; 12:6656352. [PMID: 35929795 PMCID: PMC9434224 DOI: 10.1093/g3journal/jkac193] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 07/20/2022] [Indexed: 12/04/2022]
Abstract
The scarce swallowtail, Iphiclides podalirius (Linnaeus, 1758), is a species of butterfly in the family Papilionidae. Here, we present a chromosome-level genome assembly for Iphiclides podalirius as well as gene and transposable element annotations. We investigate how the density of genomic features differs between the 30 Iphiclides podalirius chromosomes. We find that shorter chromosomes have higher heterozygosity at four-fold-degenerate sites and a greater density of transposable elements. While the first result is an expected consequence of differences in recombination rate, the second suggests a counter-intuitive relationship between recombination and transposable element evolution. This high-quality genome assembly, the first for any species in the tribe Leptocircini, will be a valuable resource for population genomics in the genus Iphiclides and comparative genomics more generally.
Collapse
Affiliation(s)
- Alexander Mackintosh
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Dominik R Laetsch
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus , Cornwall TR10 9FE, UK
| | - Sam Ebdon
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Paul Jay
- Ecologie Systématique Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay , 91400 Orsay, France
| | - Roger Vila
- Institut de Biologia Evolutiva (CSIC—Universitat Pompeu Fabra) , Barcelona 08003, Spain
| | - Alex Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus , Cornwall TR10 9FE, UK
| | - Konrad Lohse
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| |
Collapse
|
16
|
Law STS, Nong W, So WL, Baril T, Swale T, Chan CB, Tobe SS, Kai ZP, Bendena WG, Hayward A, Hui JHL. Chromosomal-level reference genome of the moth Heortia vitessoides (Lepidoptera: Crambidae), a major pest of agarwood-producing trees. Genomics 2022; 114:110440. [PMID: 35905835 DOI: 10.1016/j.ygeno.2022.110440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 07/13/2022] [Accepted: 07/24/2022] [Indexed: 11/04/2022]
Abstract
The moth Heortia vitessoides Moore (Lepidoptera: Crambidae) is a major pest of ecologically, commercially and culturally important agarwood-producing trees in the genus Aquilaria. In particular, H. vitessoides is one of the most destructive defoliating pests of the incense tree Aquilaria sinesis, which produces a valuable fragrant wood used as incense and in traditional Chinese medicine [33]. Nevertheless, a genomic resource for H. vitessoides is lacking. Here, we present a chromosomal-level assembly for H. vitessoides, consisting of a 517 megabase (Mb) genome assembly with high physical contiguity (scaffold N50 of 18.2 Mb) and high completeness (97.9% complete BUSCO score). To aid gene annotation, 8 messenger RNA transcriptomes from different developmental stages were generated, and a total of 16,421 gene models were predicted. Expansion of gene families involved in xenobiotic metabolism and development were detected, including duplications of cytosolic sulfotransferase (SULT) genes shared among lepidopterans. In addition, small RNA sequencing of 5 developmental stages of H. vitessoides facilitated the identification of 85 lepidopteran conserved microRNAs, 94 lineage-specific microRNAs, as well as several microRNA clusters. A large proportion of the H. vitessoides genome consists of repeats, with a 29.12% total genomic contribution from transposable elements, of which long interspersed nuclear elements (LINEs) are the dominant component (17.41%). A sharp decrease in the genome-wide percentage of LINEs with lower levels of genetic distance to family consensus sequences suggests that LINE activity has peaked in H. vitessoides. In contrast, opposing patterns suggest a substantial recent increase in DNA and LTR element activity. Together with annotations of essential sesquiterpenoid hormonal pathways, neuropeptides, microRNAs and transposable elements, the high-quality genomic and transcriptomic resources we provide for the economically important moth H. vitessoides provide a platform for the development of genomic approaches to pest management, and contribute to addressing fundamental research questions in Lepidoptera.
Collapse
Affiliation(s)
- Sean T S Law
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China.
| | - Wenyan Nong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Wai Lok So
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | | | | | - Chi Bun Chan
- School of Biological Science, The University of Hong Kong, Hong Kong, China
| | - Stephen S Tobe
- Department of Cell and Systems Biology, University of Toronto, Canada
| | - Zhen-Peng Kai
- School of Chemical and Environmental Engineering, Shanghai Institute of Technology, Shanghai, China
| | | | | | - Jerome H L Hui
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China.
| |
Collapse
|
17
|
Mackintosh A, Laetsch DR, Baril T, Foster RG, Dincă V, Vila R, Hayward A, Lohse K. The genome sequence of the lesser marbled fritillary, Brenthis ino, and evidence for a segregating neo-Z chromosome. G3 GENES|GENOMES|GENETICS 2022; 12:6554998. [PMID: 35348678 PMCID: PMC9157119 DOI: 10.1093/g3journal/jkac069] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Accepted: 03/23/2022] [Indexed: 01/06/2023]
Abstract
The lesser marbled fritillary, Brenthis ino (Rottemburg, 1775), is a species of Palearctic butterfly. Male Brenthis ino individuals have been reported to have between 12 and 14 pairs of chromosomes, a much-reduced chromosome number than is typical in butterflies. Here, we present a chromosome-level genome assembly for Brenthis ino, as well as gene and transposable element annotations. The assembly is 411.8 Mb in length with a contig N50 of 9.6 Mb and a scaffold N50 of 29.5 Mb. We also show evidence that the male individual from which we generated HiC data was heterozygous for a neo-Z chromosome, consistent with inheriting 14 chromosomes from one parent and 13 from the other. This genome assembly will be a valuable resource for studying chromosome evolution in Lepidoptera, as well as for comparative and population genomics more generally.
Collapse
Affiliation(s)
- Alexander Mackintosh
- Institute of Evolutionary Biology, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Dominik R Laetsch
- Institute of Evolutionary Biology, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter , Cornwall TR10 9FE, UK
| | - Robert G Foster
- Edinburgh Genomics, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Vlad Dincă
- Ecology and Genetics Research Unit, University of Oulu , Oulu 90014, Finland
| | - Roger Vila
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra) , Barcelona 08003, Spain
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter , Cornwall TR10 9FE, UK
| | - Konrad Lohse
- Institute of Evolutionary Biology, University of Edinburgh , Edinburgh EH9 3FL, UK
| |
Collapse
|
18
|
Myriapod genomes reveal ancestral horizontal gene transfer and hormonal gene loss in millipedes. Nat Commun 2022; 13:3010. [PMID: 35637228 PMCID: PMC9151784 DOI: 10.1038/s41467-022-30690-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 05/12/2022] [Indexed: 01/08/2023] Open
Abstract
Animals display a fascinating diversity of body plans. Correspondingly, genomic analyses have revealed dynamic evolution of gene gains and losses among animal lineages. Here we sequence six new myriapod genomes (three millipedes, three centipedes) at key phylogenetic positions within this major but understudied arthropod lineage. We combine these with existing genomic resources to conduct a comparative analysis across all available myriapod genomes. We find that millipedes generally have considerably smaller genomes than centipedes, with the repeatome being a major contributor to genome size, driven by independent large gains of transposons in three centipede species. In contrast to millipedes, centipedes gained a large number of gene families after the subphyla diverged, with gains contributing to sensory and locomotory adaptations that facilitated their ecological shift to predation. We identify distinct horizontal gene transfer (HGT) events from bacteria to millipedes and centipedes, with no identifiable HGTs shared among all myriapods. Loss of juvenile hormone O-methyltransferase, a key enzyme in catalysing sesquiterpenoid hormone production in arthropods, was also revealed in all millipede lineages. Our findings suggest that the rapid evolution of distinct genomic pathways in centipede and millipede lineages following their divergence from the myriapod ancestor, was shaped by differing ecological pressures. Myriapods play an important ecological role in soil and forest ecosystems. Here the authors analyse nine myriapod genomes, showing rapid evolution of distinct genomic pathways in centipede and millipede lineages, shaped by differing ecological pressures.
Collapse
|
19
|
Goubert C, Craig RJ, Bilat AF, Peona V, Vogan AA, Protasio AV. A beginner's guide to manual curation of transposable elements. Mob DNA 2022; 13:7. [PMID: 35354491 PMCID: PMC8969392 DOI: 10.1186/s13100-021-00259-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 12/17/2021] [Indexed: 12/11/2022] Open
Abstract
Background In the study of transposable elements (TEs), the generation of a high confidence set of consensus sequences that represent the diversity of TEs found in a given genome is a key step in the path to investigate these fascinating genomic elements. Many algorithms and pipelines are available to automatically identify putative TE families present in a genome. Despite the availability of these valuable resources, producing a library of high-quality full-length TE consensus sequences largely remains a process of manual curation. This know-how is often passed on from mentor-to-mentee within research groups, making it difficult for those outside the field to access this highly specialised skill. Results Our manuscript attempts to fill this gap by providing a set of detailed computer protocols, software recommendations and video tutorials for those aiming to manually curate TEs. Detailed step-by-step protocols, aimed at the complete beginner, are presented in the Supplementary Methods. Conclusions The proposed set of programs and tools presented here will make the process of manual curation achievable and amenable to all researchers and in special to those new to the field of TEs. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-021-00259-7.
Collapse
Affiliation(s)
- Clement Goubert
- Canadian Center for Computational Genomics, McGill University, Montreal, Québec, Canada.,Department of Human Genetics, McGill University, Montreal, Québec, Canada
| | - Rory J Craig
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - Agustin F Bilat
- Departamento de Genética, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay
| | - Valentina Peona
- Department of Organismal Biology, Uppsala University, Norbyvägen 18D, 752 36, Uppsala, Sweden
| | - Aaron A Vogan
- Department of Organismal Biology, Uppsala University, Norbyvägen 18D, 752 36, Uppsala, Sweden
| | - Anna V Protasio
- Department of Pathology, Tennis Court Road, Cambridge, CB1 2PQ, UK. .,Christ's College, St Andrews Street, Cambridge, CB2 3BU, UK.
| |
Collapse
|
20
|
Baril T, Hayward A. Migrators within migrators: exploring transposable element dynamics in the monarch butterfly, Danaus plexippus. Mob DNA 2022; 13:5. [PMID: 35172896 PMCID: PMC8848866 DOI: 10.1186/s13100-022-00263-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 02/06/2022] [Indexed: 01/10/2023] Open
Abstract
Background Lepidoptera (butterflies and moths) are an important model system in ecology and evolution. A high-quality chromosomal genome assembly is available for the monarch butterfly (Danaus plexippus), but it lacks an in-depth transposable element (TE) annotation, presenting an opportunity to explore monarch TE dynamics and the impact of TEs on shaping the monarch genome. Results We find 6.21% of the monarch genome is comprised of TEs, a reduction of 6.85% compared to the original TE annotation performed on the draft genome assembly. Monarch TE content is low compared to two closely related species with available genomes, Danaus chrysippus (33.97% TE) and Danaus melanippus (11.87% TE). The biggest TE contributions to genome size in the monarch are LINEs and Penelope-like elements, and three newly identified families, r2-hero_dPle (LINE), penelope-1_dPle (Penelope-like), and hase2-1_dPle (SINE), collectively contribute 34.92% of total TE content. We find evidence of recent TE activity, with two novel Tc1 families rapidly expanding over recent timescales (tc1-1_dPle, tc1-2_dPle). LINE fragments show signatures of genomic deletions indicating a high rate of TE turnover. We investigate associations between TEs and wing colouration and immune genes and identify a three-fold increase in TE content around immune genes compared to other host genes. Conclusions We provide a detailed TE annotation and analysis for the monarch genome, revealing a considerably smaller TE contribution to genome content compared to two closely related Danaus species with available genome assemblies. We identify highly successful novel DNA TE families rapidly expanding over recent timescales, and ongoing signatures of both TE expansion and removal highlight the dynamic nature of repeat content in the monarch genome. Our findings also suggest that insect immune genes are promising candidates for future interrogation of TE-mediated host adaptation. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-022-00263-5.
Collapse
Affiliation(s)
- Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE, UK.
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE, UK.
| |
Collapse
|
21
|
Nong W, Yu Y, Aase-Remedios ME, Xie Y, So WL, Li Y, Wong CF, Baril T, Law STS, Lai SY, Haimovitz J, Swale T, Chen SS, Kai ZP, Sun X, Wu Z, Hayward A, Ferrier DEK, Hui JHL. Genome of the ramshorn snail Biomphalaria straminea-an obligate intermediate host of schistosomiasis. Gigascience 2022; 11:6528774. [PMID: 35166339 PMCID: PMC8848322 DOI: 10.1093/gigascience/giac012] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Revised: 01/02/2022] [Accepted: 01/25/2022] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Schistosomiasis, or bilharzia, is a parasitic disease caused by trematode flatworms of the genus Schistosoma. Infection by Schistosoma mansoni in humans results when cercariae emerge into water from freshwater snails in the genus Biomphalaria and seek out and penetrate human skin. The snail Biomphalaria straminea is native to South America and is now also present in Central America and China, and represents a potential vector host for spreading schistosomiasis. To date, genomic information for the genus is restricted to the neotropical species Biomphalaria glabrata. This limits understanding of the biology and management of other schistosomiasis vectors, such as B. straminea. FINDINGS Using a combination of Illumina short-read, 10X Genomics linked-read, and Hi-C sequencing data, our 1.005 Gb B. straminea genome assembly is of high contiguity, with a scaffold N50 of 25.3 Mb. Transcriptomes from adults were also obtained. Developmental homeobox genes, hormonal genes, and stress-response genes were identified, and repeat content was annotated (40.68% of genomic content). Comparisons with other mollusc genomes (including Gastropoda, Bivalvia, and Cephalopoda) revealed syntenic conservation, patterns of homeobox gene linkage indicative of evolutionary changes to gene clusters, expansion of heat shock protein genes, and the presence of sesquiterpenoid and cholesterol metabolic pathway genes in Gastropoda. In addition, hormone treatment together with RT-qPCR assay reveal a sesquiterpenoid hormone responsive system in B. straminea, illustrating that this renowned insect hormonal system is also present in the lophotrochozoan lineage. CONCLUSION This study provides the first genome assembly for the snail B. straminea and offers an unprecedented opportunity to address a variety of phenomena related to snail vectors of schistosomiasis, as well as evolutionary and genomics questions related to molluscs more widely.
Collapse
Affiliation(s)
- Wenyan Nong
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Yifei Yu
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Madeleine E Aase-Remedios
- The Scottish Oceans Institute, Gatty Marine Laboratory, School of Biology, University of St. Andrews, St. Andrews, UK
| | - Yichun Xie
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Wai Lok So
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Yiqian Li
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Cheuk Fung Wong
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | | | - Sean T S Law
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Sheung Yee Lai
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | | | | | - Shan-Shan Chen
- Institute of Agro-food Standard and Testing Technology, Shanghai Academy of Agricultural Sciences, Shanghai, China
| | - Zhen-Peng Kai
- School of Chemical and Environmental Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Xi Sun
- Sun Yat-sen University, Guangdong, China
| | | | | | - David E K Ferrier
- The Scottish Oceans Institute, Gatty Marine Laboratory, School of Biology, University of St. Andrews, St. Andrews, UK
| | - Jerome H L Hui
- School of Life Science, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
22
|
Vargas-Chavez C, Longo Pendy NM, Nsango SE, Aguilera L, Ayala D, González J. Transposable element variants and their potential adaptive impact in urban populations of the malaria vector Anopheles coluzzii. Genome Res 2021; 32:189-202. [PMID: 34965939 PMCID: PMC8744685 DOI: 10.1101/gr.275761.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 11/24/2021] [Indexed: 11/28/2022]
Abstract
Anopheles coluzzii is one of the primary vectors of human malaria in sub-Saharan Africa. Recently, it has spread into the main cities of Central Africa threatening vector control programs. The adaptation of An. coluzzii to urban environments partly results from an increased tolerance to organic pollution and insecticides. Some of the molecular mechanisms for ecological adaptation are known, but the role of transposable elements (TEs) in the adaptive processes of this species has not been studied yet. As a first step toward assessing the role of TEs in rapid urban adaptation, we sequenced using long reads six An. coluzzii genomes from natural breeding sites in two major Central Africa cities. We de novo annotated TEs in these genomes and in an additional high-quality An. coluzzii genome, and we identified 64 new TE families. TEs were nonrandomly distributed throughout the genome with significant differences in the number of insertions of several superfamilies across the studied genomes. We identified seven putatively active families with insertions near genes with functions related to vectorial capacity, and several TEs that may provide promoter and transcription factor binding sites to insecticide resistance and immune-related genes. Overall, the analysis of multiple high-quality genomes allowed us to generate the most comprehensive TE annotation in this species to date and identify several TE insertions that could potentially impact both genome architecture and the regulation of functionally relevant genes. These results provide a basis for future studies of the impact of TEs on the biology of An. coluzzii.
Collapse
Affiliation(s)
- Carlos Vargas-Chavez
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Neil Michel Longo Pendy
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon.,École Doctorale Régional (EDR) en Infectiologie Tropicale d'Afrique Centrale, BP 876, Franceville, Gabon
| | - Sandrine E Nsango
- Faculté de Médecine et des Sciences Pharmaceutiques, Université de Douala, BP 2701, Douala, Cameroun
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Diego Ayala
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon.,Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution et Contrôle (MIVEGEC), Université Montpellier, CNRS, IRD, 64501 Montpellier, France
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| |
Collapse
|
23
|
Galbraith JD, Kortschak RD, Suh A, Adelson DL. Genome Stability Is in the Eye of the Beholder: CR1 Retrotransposon Activity Varies Significantly across Avian Diversity. Genome Biol Evol 2021; 13:6433158. [PMID: 34894225 PMCID: PMC8665684 DOI: 10.1093/gbe/evab259] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/12/2021] [Indexed: 12/20/2022] Open
Abstract
Since the sequencing of the zebra finch genome it has become clear that avian genomes, while largely stable in terms of chromosome number and gene synteny, are more dynamic at an intrachromosomal level. A multitude of intrachromosomal rearrangements and significant variation in transposable element (TE) content have been noted across the avian tree. TEs are a source of genome plasticity, because their high similarity enables chromosomal rearrangements through nonallelic homologous recombination, and they have potential for exaptation as regulatory and coding sequences. Previous studies have investigated the activity of the dominant TE in birds, chicken repeat 1 (CR1) retrotransposons, either focusing on their expansion within single orders, or comparing passerines with nonpasserines. Here, we comprehensively investigate and compare the activity of CR1 expansion across orders of birds, finding levels of CR1 activity vary significantly both between and within orders. We describe high levels of TE expansion in genera which have speciated in the last 10 Myr including kiwis, geese, and Amazon parrots; low levels of TE expansion in songbirds across their diversification, and near inactivity of TEs in the cassowary and emu for millions of years. CR1s have remained active over long periods of time across most orders of neognaths, with activity at any one time dominated by one or two families of CR1s. Our findings of higher TE activity in species-rich clades and dominant families of TEs within lineages mirror past findings in mammals and indicate that genome evolution in amniotes relies on universal TE-driven processes.
Collapse
Affiliation(s)
- James D Galbraith
- School of Biological Sciences, The University of Adelaide, South Australia, Australia
| | | | - Alexander Suh
- School of Biological Sciences, University of East Anglia, Norwich, United Kingdom.,Department of Organismal Biology, Evolutionary Biology Centre (EBC), Science for Life Laboratory, Uppsala University, Sweden
| | - David L Adelson
- School of Biological Sciences, The University of Adelaide, South Australia, Australia
| |
Collapse
|
24
|
Ibrahim MA, Al-Shomrani BM, Simenc M, Alharbi SN, Alqahtani FH, Al-Fageeh MB, Manee MM. Comparative analysis of transposable elements provides insights into genome evolution in the genus Camelus. BMC Genomics 2021; 22:842. [PMID: 34800971 PMCID: PMC8605555 DOI: 10.1186/s12864-021-08117-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 10/23/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Transposable elements (TEs) are common features in eukaryotic genomes that are known to affect genome evolution critically and to play roles in gene regulation. Vertebrate genomes are dominated by TEs, which can reach copy numbers in the hundreds of thousands. To date, details regarding the presence and characteristics of TEs in camelid genomes have not been made available. RESULTS We conducted a genome-wide comparative analysis of camelid TEs, focusing on the identification of TEs and elucidation of transposition histories in four species: Camelus dromedarius, C. bactrianus, C. ferus, and Vicugna pacos. Our TE library was created using both de novo structure-based and homology-based searching strategies ( https://github.com/kacst-bioinfo-lab/TE_ideintification_pipeline ). Annotation results indicated a similar proportion of each genomes comprising TEs (35-36%). Class I LTR retrotransposons comprised 16-20% of genomes, and mostly consisted of the endogenous retroviruses (ERVs) groups ERVL, ERVL-MaLR, ERV_classI, and ERV_classII. Non-LTR elements comprised about 12% of genomes and consisted of SINEs (MIRs) and the LINE superfamilies LINE1, LINE2, L3/CR1, and RTE clades. Least represented were the Class II DNA transposons (2%), consisting of hAT-Charlie, TcMar-Tigger, and Helitron elements and comprising about 1-2% of each genome. CONCLUSIONS The findings of the present study revealed that the distribution of transposable elements across camelid genomes is approximately similar. This investigation presents a characterization of TE content in four camelid to contribute to developing a better understanding of camelid genome architecture and evolution.
Collapse
Affiliation(s)
- Mohanad A Ibrahim
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Badr M Al-Shomrani
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Mathew Simenc
- Department of Biological Sciences, California State University, Fullerton, USA
| | - Sultan N Alharbi
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Fahad H Alqahtani
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Mohamed B Al-Fageeh
- Life Sciences and Environment Research Institute, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Manee M Manee
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia.
| |
Collapse
|
25
|
Parisot N, Vargas-Chávez C, Goubert C, Baa-Puyoulet P, Balmand S, Beranger L, Blanc C, Bonnamour A, Boulesteix M, Burlet N, Calevro F, Callaerts P, Chancy T, Charles H, Colella S, Da Silva Barbosa A, Dell'Aglio E, Di Genova A, Febvay G, Gabaldón T, Galvão Ferrarini M, Gerber A, Gillet B, Hubley R, Hughes S, Jacquin-Joly E, Maire J, Marcet-Houben M, Masson F, Meslin C, Montagné N, Moya A, Ribeiro de Vasconcelos AT, Richard G, Rosen J, Sagot MF, Smit AFA, Storer JM, Vincent-Monegat C, Vallier A, Vigneron A, Zaidman-Rémy A, Zamoum W, Vieira C, Rebollo R, Latorre A, Heddi A. The transposable element-rich genome of the cereal pest Sitophilus oryzae. BMC Biol 2021; 19:241. [PMID: 34749730 PMCID: PMC8576890 DOI: 10.1186/s12915-021-01158-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Accepted: 09/27/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND The rice weevil Sitophilus oryzae is one of the most important agricultural pests, causing extensive damage to cereal in fields and to stored grains. S. oryzae has an intracellular symbiotic relationship (endosymbiosis) with the Gram-negative bacterium Sodalis pierantonius and is a valuable model to decipher host-symbiont molecular interactions. RESULTS We sequenced the Sitophilus oryzae genome using a combination of short and long reads to produce the best assembly for a Curculionidae species to date. We show that S. oryzae has undergone successive bursts of transposable element (TE) amplification, representing 72% of the genome. In addition, we show that many TE families are transcriptionally active, and changes in their expression are associated with insect endosymbiotic state. S. oryzae has undergone a high gene expansion rate, when compared to other beetles. Reconstruction of host-symbiont metabolic networks revealed that, despite its recent association with cereal weevils (30 kyear), S. pierantonius relies on the host for several amino acids and nucleotides to survive and to produce vitamins and essential amino acids required for insect development and cuticle biosynthesis. CONCLUSIONS Here we present the genome of an agricultural pest beetle, which may act as a foundation for pest control. In addition, S. oryzae may be a useful model for endosymbiosis, and studying TE evolution and regulation, along with the impact of TEs on eukaryotic genomes.
Collapse
Affiliation(s)
- Nicolas Parisot
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Carlos Vargas-Chávez
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
- Institute for Integrative Systems Biology (I2SySBio), Universitat de València and Spanish Research Council (CSIC), València, Spain
- Present Address: Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Clément Goubert
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, Université Lyon 1, Université Lyon, Villeurbanne, France
- Department of Molecular Biology and Genetics, Cornell University, 526 Campus Rd, Ithaca, New York, 14853, USA
- Present Address: Human Genetics, McGill University, Montreal, QC, Canada
| | | | - Séverine Balmand
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Louis Beranger
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Caroline Blanc
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Aymeric Bonnamour
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Matthieu Boulesteix
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, Université Lyon 1, Université Lyon, Villeurbanne, France
| | - Nelly Burlet
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, Université Lyon 1, Université Lyon, Villeurbanne, France
| | - Federica Calevro
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Patrick Callaerts
- Department of Human Genetics, Laboratory of Behavioral and Developmental Genetics, KU Leuven, University of Leuven, B-3000, Leuven, Belgium
| | - Théo Chancy
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Hubert Charles
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
- ERABLE European Team, INRIA, Rhône-Alpes, France
| | - Stefano Colella
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
- Present Address: LSTM, Laboratoire des Symbioses Tropicales et Méditerranéennes, IRD, CIRAD, INRAE, SupAgro, Univ Montpellier, Montpellier, France
| | - André Da Silva Barbosa
- INRAE, Sorbonne Université, CNRS, IRD, UPEC, Université de Paris, Institute of Ecology and Environmental Sciences of Paris, Versailles, France
| | - Elisa Dell'Aglio
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Alex Di Genova
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, Université Lyon 1, Université Lyon, Villeurbanne, France
- ERABLE European Team, INRIA, Rhône-Alpes, France
- Instituto de Ciencias de la Ingeniería, Universidad de O'Higgins, Rancagua, Chile
| | - Gérard Febvay
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Toni Gabaldón
- Life Sciences, Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
- Mechanisms of Disease, Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Institut Catalan de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | | | - Alexandra Gerber
- Laboratório de Bioinformática, Laboratório Nacional de Computação Científica, Petrópolis, Brazil
| | - Benjamin Gillet
- Institut de Génomique Fonctionnelle de Lyon (IGFL), Université de Lyon, Ecole Normale Supérieure de Lyon, CNRS UMR 5242, Lyon, France
| | | | - Sandrine Hughes
- Institut de Génomique Fonctionnelle de Lyon (IGFL), Université de Lyon, Ecole Normale Supérieure de Lyon, CNRS UMR 5242, Lyon, France
| | - Emmanuelle Jacquin-Joly
- INRAE, Sorbonne Université, CNRS, IRD, UPEC, Université de Paris, Institute of Ecology and Environmental Sciences of Paris, Versailles, France
| | - Justin Maire
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
- Present Address: School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | | | - Florent Masson
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
- Present Address: Global Health Institute, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015, Lausanne, Switzerland
| | - Camille Meslin
- INRAE, Sorbonne Université, CNRS, IRD, UPEC, Université de Paris, Institute of Ecology and Environmental Sciences of Paris, Versailles, France
| | - Nicolas Montagné
- INRAE, Sorbonne Université, CNRS, IRD, UPEC, Université de Paris, Institute of Ecology and Environmental Sciences of Paris, Versailles, France
| | - Andrés Moya
- Institute for Integrative Systems Biology (I2SySBio), Universitat de València and Spanish Research Council (CSIC), València, Spain
- Foundation for the Promotion of Sanitary and Biomedical Research of Valencian Community (FISABIO), València, Spain
| | | | - Gautier Richard
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653, Le Rheu, France
| | - Jeb Rosen
- Institute for Systems Biology, Seattle, WA, USA
| | - Marie-France Sagot
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, Université Lyon 1, Université Lyon, Villeurbanne, France
- ERABLE European Team, INRIA, Rhône-Alpes, France
| | | | | | | | - Agnès Vallier
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Aurélien Vigneron
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
- Present Address: Department of Evolutionary Ecology, Institute for Organismic and Molecular Evolution, Johannes Gutenberg University, 55128, Mainz, Germany
| | - Anna Zaidman-Rémy
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Waël Zamoum
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, Université Lyon 1, Université Lyon, Villeurbanne, France.
- ERABLE European Team, INRIA, Rhône-Alpes, France.
| | - Rita Rebollo
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France.
| | - Amparo Latorre
- Institute for Integrative Systems Biology (I2SySBio), Universitat de València and Spanish Research Council (CSIC), València, Spain.
- Foundation for the Promotion of Sanitary and Biomedical Research of Valencian Community (FISABIO), València, Spain.
| | - Abdelaziz Heddi
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR 203, 69621 Villeurbanne, France.
| |
Collapse
|
26
|
Stelzer CP, Blommaert J, Waldvogel AM, Pichler M, Hecox-Lea B, Mark Welch DB. Comparative analysis reveals within-population genome size variation in a rotifer is driven by large genomic elements with highly abundant satellite DNA repeat elements. BMC Biol 2021; 19:206. [PMID: 34530817 PMCID: PMC8447722 DOI: 10.1186/s12915-021-01134-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 08/27/2021] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Eukaryotic genomes are known to display an enormous variation in size, but the evolutionary causes of this phenomenon are still poorly understood. To obtain mechanistic insights into such variation, previous studies have often employed comparative genomics approaches involving closely related species or geographically isolated populations within a species. Genome comparisons among individuals of the same population remained so far understudied-despite their great potential in providing a microevolutionary perspective to genome size evolution. The rotifer Brachionus asplanchnoidis represents one of the most extreme cases of within-population genome size variation among eukaryotes, displaying almost twofold variation within a geographic population. RESULTS Here, we used a whole-genome sequencing approach to identify the underlying DNA sequence differences by assembling a high-quality reference genome draft for one individual of the population and aligning short reads of 15 individuals from the same geographic population including the reference individual. We identified several large, contiguous copy number variable regions (CNVs), up to megabases in size, which exhibited striking coverage differences among individuals, and whose coverage overall scaled with genome size. CNVs were of remarkably low complexity, being mainly composed of tandemly repeated satellite DNA with only a few interspersed genes or other sequences, and were characterized by a significantly elevated GC-content. CNV patterns in offspring of two parents with divergent genome size and CNV patterns in several individuals from an inbred line differing in genome size demonstrated inheritance and accumulation of CNVs across generations. CONCLUSIONS By identifying the exact genomic elements that cause within-population genome size variation, our study paves the way for studying genome size evolution in contemporary populations rather than inferring patterns and processes a posteriori from species comparisons.
Collapse
Affiliation(s)
- C P Stelzer
- Research Department for Limnology, University of Innsbruck, Mondsee, Austria.
| | - J Blommaert
- Research Department for Limnology, University of Innsbruck, Mondsee, Austria
- Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - A M Waldvogel
- Institute of Zoology, University of Cologne, Cologne, Germany
| | - M Pichler
- Research Department for Limnology, University of Innsbruck, Mondsee, Austria
| | - B Hecox-Lea
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| | - D B Mark Welch
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| |
Collapse
|
27
|
Bell EA, Butler CL, Oliveira C, Marburger S, Yant L, Taylor MI. Transposable element annotation in non-model species: The benefits of species-specific repeat libraries using semi-automated EDTA and DeepTE de novo pipelines. Mol Ecol Resour 2021; 22:823-833. [PMID: 34407282 DOI: 10.1111/1755-0998.13489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 08/12/2021] [Indexed: 11/30/2022]
Abstract
Transposable elements (TEs) are significant genomic components which can be detected either through sequence homology against existing databases or de novo, with the latter potentially reducing the risk of underestimating TE abundance. Here, we describe the semi-automated generation of a de novo TE library using the newly developed EDTA pipeline and DeepTE classifier in a non-model teleost (Corydoras fulleri). Using both genomic and transcriptomic data, we assess this de novo pipeline's performance across four TE based metrics: (i) abundance, (ii) composition, (iii) fragmentation, and (iv) age distributions. We then compare the results to those found when using a curated teleost library (Danio rerio). We identify quantitative differences in these metrics and highlight how TE library choice can have major impacts on TE-based estimates in non-model species.
Collapse
Affiliation(s)
- Ellen A Bell
- School of Biological Sciences, University of East Anglia, Norwich, Norfolk, UK
| | | | - Claudio Oliveira
- Department of Structural and Functional Biology, Institute of Biosciences/UNESP, Rua Professor Doutor Antonio Celso Wagner Zanin, Botucatu, São Paulo, Brazil
| | - Sarah Marburger
- Department of Cell and Developmental Biology, John Innes Centre, Norwich, Norfolk, UK
| | - Levi Yant
- Future Food Beacon of Excellence and the School of Life Sciences, University of Nottingham, Nottingham, UK
| | - Martin I Taylor
- School of Biological Sciences, University of East Anglia, Norwich, Norfolk, UK
| |
Collapse
|
28
|
Sellinger TPP, Abu-Awad D, Tellier A. Limits and convergence properties of the sequentially Markovian coalescent. Mol Ecol Resour 2021; 21:2231-2248. [PMID: 33978324 DOI: 10.1111/1755-0998.13416] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Revised: 04/19/2021] [Accepted: 04/29/2021] [Indexed: 02/07/2023]
Abstract
Several methods based on the sequentially Markovian coalescent (SMC) make use of full genome sequence data from samples to infer population demographic history including past changes in population size, admixture, migration events and population structure. More recently, the original theoretical framework has been extended to allow the simultaneous estimation of population size changes along with other life history traits such as selfing or seed banking. The latter developments enhance the applicability of SMC methods to nonmodel species. Although convergence proofs have been given using simulated data in a few specific cases, an in-depth investigation of the limitations of SMC methods is lacking. In order to explore such limits, we first develop a tool inferring the best case convergence of SMC methods assuming the true underlying coalescent genealogies are known. This tool can be used to quantify the amount and type of information that can be confidently retrieved from given data sets prior to the analysis of the real data. Second, we assess the inference accuracy when the assumptions of SMC approaches are violated due to departures from the model, namely the presence of transposable elements, variable recombination and mutation rates along the sequence, and SNP calling errors. Third, we deliver a new interpretation of SMC methods by highlighting the importance of the transition matrix, which we argue can be used as a set of summary statistics in other statistical inference methods, uncoupling the SMC from hidden Markov models (HMMs). We finally offer recommendations to better apply SMC methods and build adequate data sets under budget constraints.
Collapse
Affiliation(s)
| | - Diala Abu-Awad
- Department of Life Science Systems, Technical University of Munich, Munchen, Germany
| | - Aurélien Tellier
- Department of Life Science Systems, Technical University of Munich, Munchen, Germany
| |
Collapse
|
29
|
Tiedeman Z, Signor S. The transposable elements of the Drosophila serrata reference panel. Genome Biol Evol 2021; 13:6265467. [PMID: 33950180 PMCID: PMC8434751 DOI: 10.1093/gbe/evab100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/30/2021] [Indexed: 11/13/2022] Open
Abstract
Transposable elements (TEs) are an important component of the complex genomic ecosystem. Understanding the tempo and mode of TE proliferation, that is whether it is in maintained in transposition selection balance, or is induced periodically by environmental stress or other factors, is important for understanding the evolution of organismal genomes through time. Although TEs have been characterized in individuals or limited samples, a true understanding of the population genetics of TEs, and therefore the tempo and mode of transposition, is still lacking. Here, we characterize the TE landscape in an important model Drosophila, Drosophila serrata using the D. serrata reference panel, which is comprised of 102 sequenced inbred genotypes. We annotate the families of TEs in the D. serrata genome and investigate variation in TE copy number between genotypes. We find that many TEs have low copy number in the population, but this varies by family and includes a single TE making up to 50% of the genome content of TEs. We find that some TEs proliferate in particular genotypes compared with population levels. In addition, we characterize variation in each TE family allowing copy number to vary in each genotype and find that some TEs have diversified very little between individuals suggesting recent spread. TEs are important sources of spontaneous mutations in Drosophila, making up a large fraction of the total number of mutations in particular genotypes. Understanding the dynamics of TEs within populations will be an important step toward characterizing the origin of variation within and between species.
Collapse
Affiliation(s)
- Zachery Tiedeman
- Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, U.S.A
| | - Sarah Signor
- Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, U.S.A
| |
Collapse
|
30
|
Sielemann K, Weisshaar B, Pucker B. Reference-based QUantification Of gene Dispensability (QUOD). PLANT METHODS 2021; 17:18. [PMID: 33563309 PMCID: PMC7871624 DOI: 10.1186/s13007-021-00718-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 02/03/2021] [Indexed: 05/03/2023]
Abstract
BACKGROUND Dispensability of genes in a phylogenetic lineage, e.g. a species, genus, or higher-level clade, is gaining relevance as most genome sequencing projects move to a pangenome level. Most analyses classify genes as core genes, which are present in all investigated individual genomes, and dispensable genes, which only occur in a single or a few investigated genomes. The binary classification as 'core' or 'dispensable' is often based on arbitrary cutoffs of presence/absence in the analysed genomes. Even when extended to 'conditionally dispensable', this concept still requires the assignment of genes to distinct groups. RESULTS Here, we present a new method which overcomes this distinct classification by quantifying gene dispensability and present a dedicated tool for reference-based QUantification Of gene Dispensability (QUOD). As a proof of concept, sequence data of 966 Arabidopsis thaliana accessions (Ath-966) were processed to calculate a gene-specific dispensability score for each gene based on normalised coverage in read mappings. We validated this score by comparison of highly conserved Benchmarking Universal Single Copy Orthologs (BUSCOs) to all other genes. The average scores of BUSCOs were significantly lower than the scores of non-BUSCOs. Analysis of variation demonstrated lower variation values between replicates of a single accession than between iteratively, randomly selected accessions from the whole dataset Ath-966. Functional investigations revealed defense and antimicrobial response genes among the genes with high-dispensability scores. CONCLUSIONS Instead of classifying a gene as core or dispensable, QUOD assigns a dispensability score to each gene. Hence, QUOD facilitates the identification of candidate dispensable genes, associated with high dispensability scores, which often underlie lineage-specific adaptation to varying environmental conditions.
Collapse
Affiliation(s)
- Katharina Sielemann
- Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec) & Faculty of Biology, Bielefeld University, 33615 Bielefeld, Germany
- Graduate School DILS, Bielefeld Institute for Bioinformatics Infrastructure (BIBI), Bielefeld University, 33615 Bielefeld, Germany
| | - Bernd Weisshaar
- Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec) & Faculty of Biology, Bielefeld University, 33615 Bielefeld, Germany
| | - Boas Pucker
- Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec) & Faculty of Biology, Bielefeld University, 33615 Bielefeld, Germany
- Evolution and Diversity, Department of Plant Sciences, University of Cambridge, Cambridge, UK
| |
Collapse
|
31
|
de Oliveira DS, Rosa MT, Vieira C, Loreto ELS. Oxidative and radiation stress induces transposable element transcription in Drosophila melanogaster. J Evol Biol 2021; 34:628-638. [PMID: 33484011 DOI: 10.1111/jeb.13762] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Revised: 10/27/2020] [Accepted: 12/06/2020] [Indexed: 12/20/2022]
Abstract
It has been shown that stressors are capable of activating transposable elements (TEs). Currently, there is a hypothesis that stress activation of TEs may be involved in adaptive evolution, favouring the increase in genetic variability when the population is under adverse conditions. However, TE activation under stress is still poorly understood. In the present study, we estimated the fraction of differentially expressed TEs (DETEs) under ionizing radiation (144, 360 and 864 Gy) and oxidative stress (dioxin, formaldehyde and toluene) treatments. The stress intensity of each treatment was estimated by measuring the number of differentially expressed genes, and we show that several TEs families are activated by stress whereas others are repressed. The proportion of DETEs was positively related to stress intensity. However, even under the strongest stress, only a small fraction of TE families were activated (9.28%) and 17.72% were repressed. Considering all treatments together, the activated proportion was 19.83%. Nevertheless, as several TEs are incomplete or degenerated, only 10.55% of D. melanogaster mobilome is, at same time, activated by the stressors and able to transpose or at least code a protein. Thus, our study points out that although stress activates TEs, it is not a generalized activation process, and for some families, the stress induces repression.
Collapse
Affiliation(s)
- Daniel Siqueira de Oliveira
- Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, 1- Université de Lyon, Université Lyon 1, CNRS, Villeurbanne, France
| | - Marcos Trindade Rosa
- PPG Biodiversidade Animal, Universidade Federal de Santa Maria, Santa Maria, RS, Brazil
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, 1- Université de Lyon, Université Lyon 1, CNRS, Villeurbanne, France
| | - Elgion L S Loreto
- Dep de Bioquímica e Biologia Molecular, Universidade Federal de Santa Maria, Santa Maria, RS, Brazil
| |
Collapse
|
32
|
Nong W, Qu Z, Li Y, Barton-Owen T, Wong AYP, Yip HY, Lee HT, Narayana S, Baril T, Swale T, Cao J, Chan TF, Kwan HS, Ngai SM, Panagiotou G, Qian PY, Qiu JW, Yip KY, Ismail N, Pati S, John A, Tobe SS, Bendena WG, Cheung SG, Hayward A, Hui JHL. Horseshoe crab genomes reveal the evolution of genes and microRNAs after three rounds of whole genome duplication. Commun Biol 2021; 4:83. [PMID: 33469163 PMCID: PMC7815833 DOI: 10.1038/s42003-020-01637-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 12/21/2020] [Indexed: 11/08/2022] Open
Abstract
Whole genome duplication (WGD) has occurred in relatively few sexually reproducing invertebrates. Consequently, the WGD that occurred in the common ancestor of horseshoe crabs ~135 million years ago provides a rare opportunity to decipher the evolutionary consequences of a duplicated invertebrate genome. Here, we present a high-quality genome assembly for the mangrove horseshoe crab Carcinoscorpius rotundicauda (1.7 Gb, N50 = 90.2 Mb, with 89.8% sequences anchored to 16 pseudomolecules, 2n = 32), and a resequenced genome of the tri-spine horseshoe crab Tachypleus tridentatus (1.7 Gb, N50 = 109.7 Mb). Analyses of gene families, microRNAs, and synteny show that horseshoe crabs have undergone three rounds (3R) of WGD. Comparison of C. rotundicauda and T. tridentatus genomes from populations from several geographic locations further elucidates the diverse fates of both coding and noncoding genes. Together, the present study represents a cornerstone for improving our understanding of invertebrate WGD events on the evolutionary fates of genes and microRNAs, at both the individual and population level. We also provide improved genomic resources for horseshoe crabs, of applied value for breeding programs and conservation of this fascinating and unusual invertebrate lineage.
Collapse
Affiliation(s)
- Wenyan Nong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Zhe Qu
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Yiqian Li
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Tom Barton-Owen
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Annette Y P Wong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Ho Yin Yip
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Hoi Ting Lee
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Satya Narayana
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn, UK
| | | | - Jianquan Cao
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Ting Fung Chan
- State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Hoi Shan Kwan
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Sai Ming Ngai
- State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Gianni Panagiotou
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
- Leibniz Institute of Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany
| | - Pei-Yuan Qian
- Department of Ocean Science and Hong Kong Branch of Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou), Hong Kong University of Science and Technology, Hong Kong, China
| | - Jian-Wen Qiu
- Department of Biology, Hong Kong Baptist University, Hong Kong, China
| | - Kevin Y Yip
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Noraznawati Ismail
- Institute of Marine Biotechnology, Universiti Malaysia Terengganu, Terengganu, Malaysia
| | - Siddhartha Pati
- Department of Bioscience and Biotechnology, Fakir Mohan University, Balasore, India
- Institute of Tropical Biodiversity and Sustainable Development, University Malaysia Terengganu, 20130, Kuala Nerus, Terengganu, Malaysia
- Research Division, Association for Biodiversity Conservation and Research (ABC), Odisha, 756003, India
| | - Akbar John
- Institute of Oceanography and Maritime Studies (INOCEM), Kulliyyah of Science, International Islamic University, Kuantan, Malaysia
| | - Stephen S Tobe
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
| | | | - Siu Gin Cheung
- Department of Chemistry, City University of Hong Kong, Hong Kong, China
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn, UK
| | - Jerome H L Hui
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China.
| |
Collapse
|
33
|
Hruska JP, Manthey JD. De novo assembly of a chromosome-scale reference genome for the northern flicker Colaptes auratus. G3 (BETHESDA, MD.) 2021; 11:jkaa026. [PMID: 33561233 PMCID: PMC8022726 DOI: 10.1093/g3journal/jkaa026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 11/12/2020] [Indexed: 11/15/2022]
Abstract
The northern flicker, Colaptes auratus, is a widely distributed North American woodpecker and a long-standing focal species for the study of ecology, behavior, phenotypic differentiation, and hybridization. We present here a highly contiguous de novo genome assembly of C. auratus, the first such assembly for the species and the first published chromosome-level assembly for woodpeckers (Picidae). The assembly was generated using a combination of short-read Chromium 10× and long-read PacBio sequencing, and further scaffolded with chromatin conformation capture (Hi-C) reads. The resulting genome assembly is 1.378 Gb in size, with a scaffold N50 of 11 and a scaffold L50 of 43.948 Mb. This assembly contains 87.4-91.7% of genes present across four sets of universal single-copy orthologs found in tetrapods and birds. We annotated the assembly both for genes and repetitive content, identifying 18,745 genes and a prevalence of ∼28.0% repetitive elements. Lastly, we used fourfold degenerate sites from neutrally evolving genes to estimate a mutation rate for C. auratus, which we estimated to be 4.007 × 10-9 substitutions/site/year, about 1.5× times faster than an earlier mutation rate estimate of the family. The highly contiguous assembly and annotations we report will serve as a resource for future studies on the genomics of C. auratus and comparative evolution of woodpeckers.
Collapse
Affiliation(s)
- Jack P Hruska
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409-43131, USA
| | - Joseph D Manthey
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409-43131, USA
| |
Collapse
|
34
|
Peona V, Blom MPK, Xu L, Burri R, Sullivan S, Bunikis I, Liachko I, Haryoko T, Jønsson KA, Zhou Q, Irestedt M, Suh A. Identifying the causes and consequences of assembly gaps using a multiplatform genome assembly of a bird-of-paradise. Mol Ecol Resour 2021; 21:263-286. [PMID: 32937018 PMCID: PMC7757076 DOI: 10.1111/1755-0998.13252] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 08/21/2020] [Accepted: 08/26/2020] [Indexed: 01/09/2023]
Abstract
Genome assemblies are currently being produced at an impressive rate by consortia and individual laboratories. The low costs and increasing efficiency of sequencing technologies now enable assembling genomes at unprecedented quality and contiguity. However, the difficulty in assembling repeat-rich and GC-rich regions (genomic "dark matter") limits insights into the evolution of genome structure and regulatory networks. Here, we compare the efficiency of currently available sequencing technologies (short/linked/long reads and proximity ligation maps) and combinations thereof in assembling genomic dark matter. By adopting different de novo assembly strategies, we compare individual draft assemblies to a curated multiplatform reference assembly and identify the genomic features that cause gaps within each assembly. We show that a multiplatform assembly implementing long-read, linked-read and proximity sequencing technologies performs best at recovering transposable elements, multicopy MHC genes, GC-rich microchromosomes and the repeat-rich W chromosome. Telomere-to-telomere assemblies are not a reality yet for most organisms, but by leveraging technology choice it is now possible to minimize genome assembly gaps for downstream analysis. We provide a roadmap to tailor sequencing projects for optimized completeness of both the coding and noncoding parts of nonmodel genomes.
Collapse
Affiliation(s)
- Valentina Peona
- Department of Ecology and Genetics—Evolutionary BiologyScience for Life LaboratoriesUppsala UniversityUppsalaSweden
- Department of Organismal Biology—Systematic BiologyScience for Life LaboratoriesUppsala UniversityUppsalaSweden
| | - Mozes P. K. Blom
- Department of Bioinformatics and GeneticsSwedish Museum of Natural HistoryStockholmSweden
- Museum für NaturkundeLeibniz Institut für Evolutions‐ und BiodiversitätsforschungBerlinGermany
| | - Luohao Xu
- Department of Neurosciences and Developmental BiologyUniversity of ViennaViennaAustria
| | - Reto Burri
- Department of Population EcologyInstitute of Ecology and EvolutionFriedrich‐Schiller‐University JenaJenaGermany
| | | | - Ignas Bunikis
- Department of Immunology, Genetics and PathologyScience for Life LaboratoryUppsala Genome CenterUppsala UniversityUppsalaSweden
| | | | - Tri Haryoko
- Research Centre for BiologyMuseum Zoologicum BogorienseIndonesian Institute of Sciences (LIPI)CibinongIndonesia
| | - Knud A. Jønsson
- Natural History Museum of DenmarkUniversity of CopenhagenCopenhagenDenmark
| | - Qi Zhou
- Department of Neurosciences and Developmental BiologyUniversity of ViennaViennaAustria
- MOE Laboratory of Biosystems Homeostasis & ProtectionLife Sciences InstituteZhejiang UniversityHangzhouChina
- Center for Reproductive MedicineThe 2nd Affiliated HospitalSchool of MedicineZhejiang UniversityHangzhouChina
| | - Martin Irestedt
- Department of Bioinformatics and GeneticsSwedish Museum of Natural HistoryStockholmSweden
| | - Alexander Suh
- Department of Ecology and Genetics—Evolutionary BiologyScience for Life LaboratoriesUppsala UniversityUppsalaSweden
- Department of Organismal Biology—Systematic BiologyScience for Life LaboratoriesUppsala UniversityUppsalaSweden
- School of Biological Sciences—Organisms and the EnvironmentUniversity of East AngliaNorwichUK
| |
Collapse
|
35
|
Li Y, Nong W, Baril T, Yip HY, Swale T, Hayward A, Ferrier DEK, Hui JHL. Reconstruction of ancient homeobox gene linkages inferred from a new high-quality assembly of the Hong Kong oyster (Magallana hongkongensis) genome. BMC Genomics 2020; 21:713. [PMID: 33059600 PMCID: PMC7566022 DOI: 10.1186/s12864-020-07027-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 08/25/2020] [Indexed: 12/31/2022] Open
Abstract
Background Homeobox-containing genes encode crucial transcription factors involved in animal, plant and fungal development, and changes to homeobox genes have been linked to the evolution of novel body plans and morphologies. In animals, some homeobox genes are clustered together in the genome, either as remnants from ancestral genomic arrangements, or due to coordinated gene regulation. Consequently, analyses of homeobox gene organization across animal phylogeny provide important insights into the evolution of genome organization and developmental gene control, and their interaction. However, homeobox gene organization remains to be fully elucidated in several key animal ancestors, including those of molluscs, lophotrochozoans and bilaterians. Results Here, we present a high-quality chromosome-level genome assembly of the Hong Kong oyster, Magallana hongkongensis (2n = 20), for which 93.2% of the genomic sequences are contained on 10 pseudomolecules (~ 758 Mb, scaffold N50 = 72.3 Mb). Our genome assembly was scaffolded using Hi-C reads, facilitating a larger scaffold size compared to the recently published M. hongkongensis genome of Peng et al. (Mol Ecol Resources, 2020), which was scaffolded using the Crassostrea gigas assembly. A total of 46,963 predicted gene models (45,308 protein coding genes) were incorporated in our genome, and genome completeness estimated by BUSCO was 94.6%. Homeobox gene linkages were analysed in detail relative to available data for other mollusc lineages. Conclusions The analyses performed in this study and the accompanying genome sequence provide important genetic resources for this economically and culturally valuable oyster species, and offer a platform to improve understanding of animal biology and evolution more generally. Transposable element content is comparable to that found in other mollusc species, contrary to the conclusion of another recent analysis. Also, our chromosome-level assembly allows the inference of ancient gene linkages (synteny) for the homeobox-containing genes, even though a number of the homeobox gene clusters, like the Hox/ParaHox clusters, are undergoing dispersal in molluscs such as this oyster.
Collapse
Affiliation(s)
- Yiqian Li
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Wenyan Nong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Tobias Baril
- Department of Conservation and Ecology, Penryn Campus, University of Exeter, Exeter, UK
| | - Ho Yin Yip
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | | | - Alexander Hayward
- Department of Conservation and Ecology, Penryn Campus, University of Exeter, Exeter, UK.
| | - David E K Ferrier
- The Scottish Oceans Institute, Gatty Martine Laboratory, University of St. Andrews, St Andrews, UK.
| | - Jerome H L Hui
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong.
| |
Collapse
|
36
|
Galbraith JD, Ludington AJ, Suh A, Sanders KL, Adelson DL. New Environment, New Invaders-Repeated Horizontal Transfer of LINEs to Sea Snakes. Genome Biol Evol 2020; 12:2370-2383. [PMID: 33022046 PMCID: PMC7846101 DOI: 10.1093/gbe/evaa208] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/29/2020] [Indexed: 12/12/2022] Open
Abstract
Although numerous studies have found horizontal transposon transfer (HTT) to be widespread across metazoans, few have focused on HTT in marine ecosystems. To investigate potential recent HTTs into marine species, we searched for novel repetitive elements in sea snakes, a group of elapids which transitioned to a marine habitat at most 18 Ma. Our analysis uncovered repeated HTTs into sea snakes following their marine transition. The seven subfamilies of horizontally transferred LINE retrotransposons we identified in the olive sea snake (Aipysurus laevis) are transcribed, and hence are likely still active and expanding across the genome. A search of 600 metazoan genomes found all seven were absent from other amniotes, including terrestrial elapids, with the most similar LINEs present in fish and marine invertebrates. The one exception was a similar LINE found in sea kraits, a lineage of amphibious elapids which independently transitioned to a marine environment 25 Ma. Our finding of repeated horizontal transfer events into marine snakes greatly expands past findings that the marine environment promotes the transfer of transposons. Transposons are drivers of evolution as sources of genomic sequence and hence genomic novelty. We identified 13 candidate genes for HTT-induced adaptive change based on internal or neighboring HTT LINE insertions. One of these, ADCY4, is of particular interest as a part of the KEGG adaptation pathway “Circadian Entrainment.” This provides evidence of the ecological interactions between species influencing evolution of metazoans not only through specific selection pressures, but also by contributing novel genomic material.
Collapse
Affiliation(s)
| | | | - Alexander Suh
- Department of Ecology and Genetics-Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Sweden.,Department of Organismal Biology-Systematic Biology, Evolutionary Biology Centre, Uppsala University, Sweden.,School of Biological Sciences, University of East Anglia, Norwich, United Kingdom
| | - Kate L Sanders
- School of Biological Sciences, University of Adelaide, Australia
| | - David L Adelson
- School of Biological Sciences, University of Adelaide, Australia
| |
Collapse
|
37
|
Qu Z, Nong W, So WL, Barton-Owen T, Li Y, Leung TCN, Li C, Baril T, Wong AYP, Swale T, Chan TF, Hayward A, Ngai SM, Hui JHL. Millipede genomes reveal unique adaptations during myriapod evolution. PLoS Biol 2020; 18:e3000636. [PMID: 32991578 PMCID: PMC7523956 DOI: 10.1371/journal.pbio.3000636] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Accepted: 08/24/2020] [Indexed: 01/27/2023] Open
Abstract
The Myriapoda, composed of millipedes and centipedes, is a fascinating but poorly understood branch of life, including species with a highly unusual body plan and a range of unique adaptations to their environment. Here, we sequenced and assembled 2 chromosomal-level genomes of the millipedes Helicorthomorpha holstii (assembly size = 182 Mb; shortest scaffold/contig length needed to cover 50% of the genome [N50] = 18.11 Mb mainly on 8 pseudomolecules) and Trigoniulus corallinus (assembly size = 449 Mb, N50 = 26.78 Mb mainly on 17 pseudomolecules). Unique genomic features, patterns of gene regulation, and defence systems in millipedes, not observed in other arthropods, are revealed. Both repeat content and intron size are major contributors to the observed differences in millipede genome size. Tight Hox and the first loose ecdysozoan ParaHox homeobox clusters are identified, and a myriapod-specific genomic rearrangement including Hox3 is also observed. The Argonaute (AGO) proteins for loading small RNAs are duplicated in both millipedes, but unlike in insects, an AGO duplicate has become a pseudogene. Evidence of post-transcriptional modification in small RNAs—including species-specific microRNA arm switching—providing differential gene regulation is also obtained. Millipedes possesses a unique ozadene defensive gland unlike the venomous forcipules found in centipedes. We identify sets of genes associated with the ozadene that play roles in chemical defence as well as antimicrobial activity. Macro-synteny analyses revealed highly conserved genomic blocks between the 2 millipedes and deuterostomes. Collectively, our analyses of millipede genomes reveal that a series of unique adaptations have occurred in this major lineage of arthropod diversity. The 2 high-quality millipede genomes provided here shed new light on the conserved and lineage-specific features of millipedes and centipedes. These findings demonstrate the importance of the consideration of both centipede and millipede genomes—and in particular the reconstruction of the myriapod ancestral situation—for future research to improve understanding of arthropod evolution, and animal evolutionary genomics more widely. Myriapods were among the first arthropods to invade the land over 400 million years ago, and survive today as the herbivorous millipedes and venomous centipedes. This study describes the genome sequences of two millipedes, Helicorthomorpha holstii and Trigoniulus corallinus, revealing unique adaptations not found in other arthropods.
Collapse
Affiliation(s)
- Zhe Qu
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Wenyan Nong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Wai Lok So
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Tom Barton-Owen
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Yiqian Li
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Thomas C. N. Leung
- School of Life Sciences, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Chade Li
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Tobias Baril
- Department of Conservation and Ecology, Penryn Campus, University of Exeter, Exeter, United Kingdom
| | - Annette Y. P. Wong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Thomas Swale
- Dovetail Genomics, Scotts Valley, California, United States of America
| | - Ting-Fung Chan
- School of Life Sciences, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Alexander Hayward
- Department of Conservation and Ecology, Penryn Campus, University of Exeter, Exeter, United Kingdom
| | - Sai-Ming Ngai
- School of Life Sciences, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
| | - Jerome H. L. Hui
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong
- * E-mail:
| |
Collapse
|
38
|
Rodriguez-Caro L, Fenner J, Benson C, Van Belleghem SM, Counterman BA. Genome Assembly of the Dogface Butterfly Zerene cesonia. Genome Biol Evol 2020; 12:3580-3585. [PMID: 31755926 PMCID: PMC6944212 DOI: 10.1093/gbe/evz254] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/14/2019] [Indexed: 02/04/2023] Open
Abstract
Comparisons of high-quality, reference butterfly, and moth genomes have been instrumental to advancing our understanding of how hybridization, and natural selection drive genomic change during the origin of new species and novel traits. Here, we present a genome assembly of the Southern Dogface butterfly, Zerene cesonia (Pieridae) whose brilliant wing colorations have been implicated in developmental plasticity, hybridization, sexual selection, and speciation. We assembled 266,407,278 bp of the Z. cesonia genome, which accounts for 98.3% of the estimated 271 Mb genome size. Using a hybrid approach involving Chicago libraries with Hi-Rise assembly and a diploid Meraculous assembly, the final haploid genome was assembled. In the final assembly, nearly all autosomes and the Z chromosome were assembled into single scaffolds. The largest 29 scaffolds accounted for 91.4% of the genome assembly, with the remaining ∼8% distributed among another 247 scaffolds and overall N50 of 9.2 Mb. Tissue-specific RNA-seq informed annotations identified 16,442 protein-coding genes, which included 93.2% of the arthropod Benchmarking Universal Single-Copy Orthologs (BUSCO). The Z. cesonia genome assembly had ∼9% identified as repetitive elements, with a transposable element landscape rich in helitrons. Similar to other Lepidoptera genomes, Z. cesonia showed a high conservation of chromosomal synteny. The Z. cesonia assembly provides a high-quality reference for studies of chromosomal arrangements in the Pierid family, as well as for population, phylo, and functional genomic studies of adaptation and speciation.
Collapse
Affiliation(s)
- Luis Rodriguez-Caro
- Department of Biological Sciences, Mississippi State University.,Division of Biological Sciences, University of Montana, Missoula, MT
| | - Jennifer Fenner
- Department of Biological Sciences, Mississippi State University
| | - Caleb Benson
- Department of Biological Sciences, Mississippi State University
| | | | | |
Collapse
|
39
|
Zavallo D, Crescente JM, Gantuz M, Leone M, Vanzetti LS, Masuelli RW, Asurmendi S. Genomic re-assessment of the transposable element landscape of the potato genome. PLANT CELL REPORTS 2020; 39:1161-1174. [PMID: 32435866 DOI: 10.1007/s00299-020-02554-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 05/07/2020] [Indexed: 05/14/2023]
Abstract
We provide a comprehensive and reliable potato TE landscape, based on a wide variety of identification tools and integrative approaches, producing clear and ready-to-use outputs for the scientific community. Transposable elements (TEs) are DNA sequences with the ability to autoreplicate and move throughout the host genome. TEs are major drivers in stress response and genome evolution. Given their significance, the development of clear and efficient TE annotation pipelines has become essential for many species. The latest de novo TE discovery tools, along with available TEs from Repbase and sRNA-seq data, allowed us to perform a reliable potato TEs detection, classification and annotation through an open-source and freely available pipeline ( https://github.com/DiegoZavallo/TE_Discovery ). Using a variety of tools, approaches and rules, we were able to provide a clearly annotated of characterized TEs landscape. Additionally, we described the distribution of the different types of TEs across the genome, where LTRs and MITEs present a clear clustering pattern in pericentromeric and subtelomeric/telomeric regions respectively. Finally, we analyzed the insertion age and distribution of LTR retrotransposon families which display a distinct pattern between the two major superfamilies. While older Gypsy elements concentrated around heterochromatic regions, younger Copia elements located predominantly on euchromatic regions. Overall, we delivered not only a reliable, ready-to-use potato TE annotation files, but also all the necessary steps to perform de novo detection for other species.
Collapse
Affiliation(s)
- Diego Zavallo
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina.
| | - Juan Manuel Crescente
- Grupo Biotecnologia y Recursos Genéticos, EEA INTA Marcos Juárez, Ruta 12 Km 3, 2580, Marcos Juárez, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Magdalena Gantuz
- Instituto de Biología Agrícola de Mendoza (IBAM), Facultad de Ciencias Agrarias (FCA), CONICET-UNCuyo, Almirante Brown 500, M5528AHB, Chacras de Coria, Mendoza, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Melisa Leone
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina
- Agencia Nacional de Promocion Científica y Tecnológica (ANPCyT), Buenos Aires, Argentina
| | - Leonardo Sebastian Vanzetti
- Grupo Biotecnologia y Recursos Genéticos, EEA INTA Marcos Juárez, Ruta 12 Km 3, 2580, Marcos Juárez, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Ricardo Williams Masuelli
- Instituto de Biología Agrícola de Mendoza (IBAM), Facultad de Ciencias Agrarias (FCA), CONICET-UNCuyo, Almirante Brown 500, M5528AHB, Chacras de Coria, Mendoza, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Sebastian Asurmendi
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina.
| |
Collapse
|
40
|
Blommaert J. Genome size evolution: towards new model systems for old questions. Proc Biol Sci 2020; 287:20201441. [PMID: 32842932 PMCID: PMC7482279 DOI: 10.1098/rspb.2020.1441] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 07/29/2020] [Indexed: 12/20/2022] Open
Abstract
Genome size (GS) variation is a fundamental biological characteristic; however, its evolutionary causes and consequences are the topic of ongoing debate. Whether GS is a neutral trait or one subject to selective pressures, and how strong these selective pressures are, may remain open questions. Fundamentally, the genomic sequences responsible for this variation directly impact the potential evolutionary outcomes and, equally, are the targets of different evolutionary pressures. For example, duplications and deletions of genic regions (large or small) can have immediate and drastic phenotypic effects, while an expansion or contraction of non-coding DNA is less likely to cause catastrophic phenotypic effects. However, in the long term, the accumulation or deletion of ncDNA is likely to have larger effects. Modern sequencing technologies are allowing for the dissection of these proximate causes, but a combination of these new technologies with more traditional evolutionary experiments and approaches could revolutionize this debate and potentially resolve many of these arguments. Here, I discuss an ambitious way forward for GS research, putting it in context of historical debates, theories and sometimes contradictory evidence, and highlighting the promise of combining new sequencing technologies and analytical developments with more traditional experimental evolution approaches.
Collapse
Affiliation(s)
- Julie Blommaert
- Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
41
|
Mikkelsen EK, Weir JT. The genome of the Xingu scale-backed antbird (Willisornis vidua nigrigula) reveals lineage-specific adaptations. Genomics 2020; 112:4552-4560. [PMID: 32771623 DOI: 10.1016/j.ygeno.2020.07.047] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 07/08/2020] [Accepted: 07/30/2020] [Indexed: 12/23/2022]
Abstract
Antbirds (Thamnophilidae) are a large neotropical family of passerine bird renowned for the ant-following foraging strategies of several members of this clade. The high diversity of antbirds provides ample opportunity for speciation studies, however these studies can be hindered by the lack of an annotated antbird reference genome. In this study, we produced a high-quality annotated reference genome for the Xingu Scale-backed Antbird (Willisornis vidua nigrigula) using 10X Genomics Chromium linked-reads technology. The assembly is 1.09 Gb, with a scaffold N50 of 12.1 Mb and 17,475 annotated protein coding genes. We compare the proteome of W. v. nigrigula to several other passerines, and produce annotations for two additional antbird genomes in order to identify genes under lineage-specific positive selection and gene families with evidence for significant expansions in antbirds. Several of these genes have functions potentially related to the lineage-specific traits of antbirds, including adaptations for thermoregulation in a humid tropical environment.
Collapse
Affiliation(s)
- Else K Mikkelsen
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto M5S 3B2, ON, Canada.
| | - Jason T Weir
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto M5S 3B2, ON, Canada; Department of Biological Sciences, University of Toronto Scarborough, Toronto M1C 1A4, ON, Canada; Department of Ornithology, Royal Ontario Museum, Toronto, Canada
| |
Collapse
|
42
|
Jebb D, Huang Z, Pippel M, Hughes GM, Lavrichenko K, Devanna P, Winkler S, Jermiin LS, Skirmuntt EC, Katzourakis A, Burkitt-Gray L, Ray DA, Sullivan KAM, Roscito JG, Kirilenko BM, Dávalos LM, Corthals AP, Power ML, Jones G, Ransome RD, Dechmann DKN, Locatelli AG, Puechmaille SJ, Fedrigo O, Jarvis ED, Hiller M, Vernes SC, Myers EW, Teeling EC. Six reference-quality genomes reveal evolution of bat adaptations. Nature 2020; 583:578-584. [PMID: 32699395 PMCID: PMC8075899 DOI: 10.1038/s41586-020-2486-3] [Citation(s) in RCA: 158] [Impact Index Per Article: 39.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Accepted: 06/09/2020] [Indexed: 11/08/2022]
Abstract
Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols1 to generate, to our knowledge, the first reference-quality genomes of six bat species (Rhinolophus ferrumequinum, Rousettus aegyptiacus, Phyllostomus discolor, Myotis myotis, Pipistrellus kuhlii and Molossus molossus). We integrated gene projections from our 'Tool to infer Orthologs from Genome Alignments' (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease1.
Collapse
Affiliation(s)
- David Jebb
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Zixia Huang
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Martin Pippel
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Graham M Hughes
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Ksenia Lavrichenko
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Paolo Devanna
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Sylke Winkler
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Lars S Jermiin
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
- Earth Institute, University College Dublin, Dublin, Ireland
| | - Emilia C Skirmuntt
- Peter Medawar Building for Pathogen Research, Department of Zoology, University of Oxford, Oxford, UK
| | - Aris Katzourakis
- Peter Medawar Building for Pathogen Research, Department of Zoology, University of Oxford, Oxford, UK
| | - Lucy Burkitt-Gray
- Conway Institute of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland
| | - David A Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Kevin A M Sullivan
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Juliana G Roscito
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Bogdan M Kirilenko
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Liliana M Dávalos
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
- Consortium for Inter-Disciplinary Environmental Research, Stony Brook University, Stony Brook, NY, USA
| | | | - Megan L Power
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Gareth Jones
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Roger D Ransome
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Dina K N Dechmann
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, Germany
- Department of Biology, University of Konstanz, Konstanz, Germany
- Smithsonian Tropical Research Institute, Panama City, Panama
| | - Andrea G Locatelli
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Sébastien J Puechmaille
- ISEM, University of Montpellier, Montpellier, France
- Zoological Institute and Museum, University of Greifswald, Greifswald, Germany
| | - Olivier Fedrigo
- Vertebrate Genomes Laboratory, The Rockefeller University, New York, NY, USA
| | - Erich D Jarvis
- Vertebrate Genomes Laboratory, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Michael Hiller
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.
- Center for Systems Biology Dresden, Dresden, Germany.
| | - Sonja C Vernes
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands.
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands.
| | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Center for Systems Biology Dresden, Dresden, Germany.
- Faculty of Computer Science, Technical University Dresden, Dresden, Germany.
| | - Emma C Teeling
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland.
| |
Collapse
|
43
|
Amorim IC, Melo ES, Moura RC, Wallau GL. Diverse mobilome of Dichotomius (Luederwaldtinia) schiffleri (Coleoptera: Scarabaeidae) reveals long-range horizontal transfer events of DNA transposons. Mol Genet Genomics 2020; 295:1339-1353. [PMID: 32601732 DOI: 10.1007/s00438-020-01703-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 06/17/2020] [Indexed: 10/24/2022]
Abstract
Transposable elements (TEs) are mobile DNA sequences that are able to move from one genomic location to another. These selfish elements are known as genomic parasites, since they hijack the host molecular machinery to generate new copies of themselves. The mobilization of TEs can be seen as a natural mutagen because new TE copies can insert into different loci and impact host genomic structure through different mechanisms. Although our knowledge about TEs is improving with new genomes available, there is still very limited data about the mobilome of species from the Coleoptera order, the most diverse order of insects, including species from the Scarabaeidae family. Therefore, the main goal of this study was to characterize the mobilome of D. (Luederwaldtinia) schiffleri, based on low-coverage genome sequencing, and reconstruct their evolutionary history. We used a combination of four different approaches for TE characterization and maximum likelihood phylogenetic analysis to study their evolution. We found a large and diverse mobilome composed of 38 TE superfamilies, 20 DNA transposon and 18 retrotransposons, accounting for 21% of the genome. Moreover, we found a number of incongruences between the TE and host phylogenetic trees in three DNA transposon TE superfamilies, which represents five TE families, suggesting possible horizontal transfer events between highly divergent taxa. In summary, we found an abundant and diverse mobilome and a number of horizontal transfer events that have shaped the evolutionary history of this species.
Collapse
Affiliation(s)
- I C Amorim
- Laboratório de Biodiversidade E Genética de Insetos, Instituto de Ciências Biológicas, Universidade de Pernambuco, Rua Arnóbio Marques, 310- Santo Amaro, Recife, PE, CEP: 50100-130, Brasil
| | - E S Melo
- Departamento de Entomologia, Instituto Aggeu Magalhães, FIOCRUZ, Recife, PE, Brasil
| | - R C Moura
- Laboratório de Biodiversidade E Genética de Insetos, Instituto de Ciências Biológicas, Universidade de Pernambuco, Rua Arnóbio Marques, 310- Santo Amaro, Recife, PE, CEP: 50100-130, Brasil.
| | - G L Wallau
- Departamento de Entomologia, Instituto Aggeu Magalhães, FIOCRUZ, Recife, PE, Brasil.
| |
Collapse
|
44
|
Yan H, Bombarely A, Li S. DeepTE: a computational method for de novo classification of transposons with convolutional neural network. Bioinformatics 2020; 36:4269-4275. [DOI: 10.1093/bioinformatics/btaa519] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 04/12/2020] [Accepted: 05/12/2020] [Indexed: 01/23/2023] Open
Abstract
Abstract
Motivation
Transposable elements (TEs) classification is an essential step to decode their roles in genome evolution. With a large number of genomes from non-model species becoming available, accurate and efficient TE classification has emerged as a new challenge in genomic sequence analysis.
Results
We developed a novel tool, DeepTE, which classifies unknown TEs using convolutional neural networks (CNNs). DeepTE transferred sequences into input vectors based on k-mer counts. A tree structured classification process was used where eight models were trained to classify TEs into super families and orders. DeepTE also detected domains inside TEs to correct false classification. An additional model was trained to distinguish between non-TEs and TEs in plants. Given unclassified TEs of different species, DeepTE can classify TEs into seven orders, which include 15, 24 and 16 super families in plants, metazoans and fungi, respectively. In several benchmarking tests, DeepTE outperformed other existing tools for TE classification. In conclusion, DeepTE successfully leverages CNN for TE classification, and can be used to precisely classify TEs in newly sequenced eukaryotic genomes.
Availability and implementation
DeepTE is accessible at https://github.com/LiLabAtVT/DeepTE.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Haidong Yan
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
| | - Aureliano Bombarely
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
- Department of Life Sciences, University of Milan, Milan 20122, Italy
| | - Song Li
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
- Graduate Program in Genetics, Bioinformatics and Computational Biology (GBCB), Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
45
|
Louha S, Ray DA, Winker K, Glenn TC. A High-Quality Genome Assembly of the North American Song Sparrow, Melospiza melodia. G3 (BETHESDA, MD.) 2020; 10:1159-1166. [PMID: 32075855 PMCID: PMC7144075 DOI: 10.1534/g3.119.400929] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Accepted: 02/13/2020] [Indexed: 01/25/2023]
Abstract
The song sparrow, Melospiza melodia, is one of the most widely distributed species of songbirds found in North America. It has been used in a wide range of behavioral and ecological studies. This species' pronounced morphological and behavioral diversity across populations makes it a favorable candidate in several areas of biomedical research. We have generated a high-quality de novo genome assembly of M. melodia using Illumina short read sequences from genomic and in vitro proximity-ligation libraries. The assembled genome is 978.3 Mb, with a physical coverage of 24.9×, N50 scaffold size of 5.6 Mb and N50 contig size of 31.7 Kb. Our genome assembly is highly complete, with 87.5% full-length genes present out of a set of 4,915 universal single-copy orthologs present in most avian genomes. We annotated our genome assembly and constructed 15,086 gene models, a majority of which have high homology to related birds, Taeniopygia guttata and Junco hyemalis In total, 83% of the annotated genes are assigned with putative functions. Furthermore, only ∼7% of the genome is found to be repetitive; these regions and other non-coding functional regions are also identified. The high-quality M. melodia genome assembly and annotations we report will serve as a valuable resource for facilitating studies on genome structure and evolution that can contribute to biomedical research and serve as a reference in population genomic and comparative genomic studies of closely related species.
Collapse
Affiliation(s)
- Swarnali Louha
- Institute of Bioinformatics, University of Georgia, Athens, GA
| | - David A Ray
- Department of Biological Science, Texas Tech University, Lubbock, TX
| | | | - Travis C Glenn
- Institute of Bioinformatics, University of Georgia, Athens, GA
- Department of Environmental Health Science, University of Georgia, Athens, GA
| |
Collapse
|
46
|
Nong W, Law STS, Wong AYP, Baril T, Swale T, Chu LM, Hayward A, Lau DTW, Hui JHL. Chromosomal-level reference genome of the incense tree Aquilaria sinensis. Mol Ecol Resour 2020; 20:971-979. [PMID: 32157789 PMCID: PMC7496549 DOI: 10.1111/1755-0998.13154] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 01/01/2020] [Accepted: 03/04/2020] [Indexed: 11/29/2022]
Abstract
Trees in the genus Aquilaria (Thymelaeaceae) are known as lign aloes, and are native to the forests of southeast Asia. Lign aloes produce agarwood as an antimicrobial defence. Agarwood has a long history of cultural and medicinal use, and is of considerable commercial value. However, due to habitat destruction and over collection, lign aloes are threatened in the wild. We present a chromosomal‐level assembly for Aquilaria sinensis, a lign aloe endemic to China known as the incense tree, based on Illumina short‐read, 10X Genomics linked‐read, and Hi‐C sequencing data. Our 783.8 Mbp A. sinensis genome assembly is of high physical contiguity, with a scaffold N50 of 87.6 Mbp, and high completeness, with a 95.8% BUSCO score for eudicotyledon genes. We include 17 transcriptomes from various plant tissues, providing a total of 35,965 gene models. We reveal the first complete set of genes involved in sesquiterpenoid production, plant defence, and agarwood production for the genus Aquilaria, including genes involved in the biosynthesis of sesquiterpenoids via the mevalonic acid (MVA), 1‐deoxy‐D‐xylulose‐5‐phosphate (DXP), and methylerythritol phosphate (MEP) pathways. We perform a detailed repeat content analysis, revealing that transposable elements account for ~61% of the genome, with major contributions from gypsy‐like and copia‐like LTR retroelements. We also provide a comparative analysis of repeat content across sequenced species in the order Malvales. Our study reveals the first chromosomal‐level genome assembly for a tree in the genus Aquilaria and provides an unprecedented opportunity to address a variety of applied, genomic and evolutionary questions in the Thymelaeaceae more widely.
Collapse
Affiliation(s)
- Wenyan Nong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Sean T S Law
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Annette Y P Wong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Tobias Baril
- Department of Conservation and Ecology, University of Exeter, Exeter, UK
| | | | - Lee Man Chu
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Alexander Hayward
- Department of Conservation and Ecology, University of Exeter, Exeter, UK
| | - David T W Lau
- Shiu-Ying Hu Herbarium, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Jerome H L Hui
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
47
|
Homology-Free Detection of Transposable Elements Unveils Their Dynamics in Three Ecologically Distinct Rhodnius Species. Genes (Basel) 2020; 11:genes11020170. [PMID: 32041215 PMCID: PMC7073582 DOI: 10.3390/genes11020170] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 01/30/2020] [Indexed: 01/09/2023] Open
Abstract
Transposable elements (TEs) are widely distributed repetitive sequences in the genomes across the tree of life, and represent an important source of genetic variability. Their distribution among genomes is specific to each lineage. A phenomenon associated with this feature is the sudden expansion of one or several TE families, called bursts of transposition. We previously proposed that bursts of the Mariner family (DNA transposons) contributed to the speciation of Rhodnius prolixus Stål, 1859. This hypothesis motivated us to study two additional species of the R. prolixus complex: Rhodnius montenegrensis da Rosa et al., 2012 and Rhodnius marabaensis Souza et al., 2016, together with a new, de novo annotation of the R. prolixus repeatome using unassembled short reads. Our analysis reveals that the total amount of TEs present in Rhodnius genomes (19% to 23.5%) is three to four times higher than that expected based on the original quantifications performed for the original genome description of R. prolixus. We confirm here that the repeatome of the three species is dominated by Class II elements of the superfamily Tc1-Mariner, as well as members of the LINE order (Class I). In addition to R. prolixus, we also identified a recent burst of transposition of the Mariner family in R. montenegrensis and R. marabaensis, suggesting that this phenomenon may not be exclusive to R. prolixus. Rather, we hypothesize that whilst the expansion of Mariner elements may have contributed to the diversification of the R. prolixus-R. robustus species complex, the distinct ecological characteristics of these new species did not drive the general evolutionary trajectories of these TEs.
Collapse
|
48
|
da Fonseca RR, Couto A, Machado AM, Brejova B, Albertin CB, Silva F, Gardner P, Baril T, Hayward A, Campos A, Ribeiro ÂM, Barrio-Hernandez I, Hoving HJ, Tafur-Jimenez R, Chu C, Frazão B, Petersen B, Peñaloza F, Musacchia F, Alexander GC, Osório H, Winkelmann I, Simakov O, Rasmussen S, Rahman MZ, Pisani D, Vinther J, Jarvis E, Zhang G, Strugnell JM, Castro LFC, Fedrigo O, Patricio M, Li Q, Rocha S, Antunes A, Wu Y, Ma B, Sanges R, Vinar T, Blagoev B, Sicheritz-Ponten T, Nielsen R, Gilbert MTP. A draft genome sequence of the elusive giant squid, Architeuthis dux. Gigascience 2020; 9:giz152. [PMID: 31942620 PMCID: PMC6962438 DOI: 10.1093/gigascience/giz152] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 10/27/2019] [Accepted: 12/05/2019] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked. FINDINGS We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome. CONCLUSIONS This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.
Collapse
Affiliation(s)
- Rute R da Fonseca
- Center for Macroecology, Evolution and Climate (CMEC), GLOBE Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Alvarina Couto
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain
| | - Andre M Machado
- CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal
| | - Brona Brejova
- Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Mlynská dolina, 842 48 Bratislava, Slovak Republic
| | - Carolin B Albertin
- Eugene Bell Center for Regenerative Biology and Tissue Engineering, Marine Biological Laboratory, Woods Hole, MA 02543, USA
| | - Filipe Silva
- CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal
| | - Paul Gardner
- Department of Biochemistry, University of Otago, 710 Cumberland Street, North Dunedin, Dunedin 9016, New Zealand
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE, UK
| | - Alex Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE, UK
| | - Alexandre Campos
- CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal
| | - Ângela M Ribeiro
- CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal
| | - Inigo Barrio-Hernandez
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
| | - Henk-Jan Hoving
- GEOMAR Helmholtz Centre for Ocean Research Kiel,Wischhofstraße 1-3, 24148 Kiel, Germany
| | - Ricardo Tafur-Jimenez
- Instituto del Mar del Perú, Esq. Gamarra y Gral. Valle, Chucuito Apartado 22, Callao, Peru
| | - Chong Chu
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Barbara Frazão
- CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal
- IPMA, Fitoplâncton Lab, Rua C do Aeroporto, 1749-077, Lisboa, Portugal
| | - Bent Petersen
- Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Batu 3 1/2, Butik Air Nasi, 08100 Bedong, Kedah, Malaysia
- Evolutionary Genomics Section, Globe Institute, University of Copenhagen,Øster Farimagsgade 5, 1353 Copenhagen, Denmark
| | - Fernando Peñaloza
- Unidad de Investigación Biomédica en Cáncer, Instituto Nacional de Cancerología-Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Mexico City, México
| | - Francesco Musacchia
- Genomic Medicine, Telethon Institute of Genetics and Medicine, Via Campi Flegrei, 34, 80078 Pozzuoli, Naples, Italy
| | - Graham C Alexander
- GCB Sequencing and Genomic Technologies Shared Resource, Duke University CIEMAS, 101 Science Drive, Durham, NC 27708, USA
| | - Hugo Osório
- i3S-Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Rua Alfredo Allen 208, 4200-135 Porto, Portugal
- IPATIMUP-Institute of Molecular Pathology and Immunology, University of Porto, Rua Júlio Amaral de Carvalho 45, 4200-135 Porto, Portugal
- Faculty of Medicine of the University of Porto, Alameda Prof. Hernani Monteiro, 4200-319 Porto, Portugal
| | - Inger Winkelmann
- Section for GeoGenetics, GLOBE Institute, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen, Denmark
| | - Oleg Simakov
- Department of Molecular Evolution and Development, University of Vienna, Althanstrasse 14 (UZA1), A-1090 Vienna, Austria
| | - Simon Rasmussen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen, Denmark
| | - M Ziaur Rahman
- Bioinformatics Solutions Inc, 470 Weber St N Suite 204, Waterloo, ON N2L 6J2, Canada
| | - Davide Pisani
- School of Biological Sciences and School of Earth Sciences, University of Bristol, 24 Tyndall Avenue, Bristol, BS8 1TG, UK
| | - Jakob Vinther
- School of Biological Sciences and School of Earth Sciences, University of Bristol, 24 Tyndall Avenue, Bristol, BS8 1TG, UK
| | - Erich Jarvis
- Howard Hughes Medical Institute, 4000 Jones Bridge Rd, Chevy Chase, MD 20815, USA
- The Rockefeller University, 1230 York Ave, New York, NY 10065, USA
| | - Guojie Zhang
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
- China National Genebank, BGI-Shenzhen, Shenzhen 518083, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, 32 Jiaochang Donglu Kunming, Yunnan 650223, China
- CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, 32 Jiaochang Donglu Kunming, Yunnan 650223, China
| | - Jan M Strugnell
- Centre for Sustainable Tropical Fisheries & Aquaculture, James Cook University, Townsville, Douglas QLD 4814, Australia
- Department of Ecology, Environment and Evolution, School of Life Sciences, La Trobe University, Melbourne Victoria 3086, Australia
| | - L Filipe C Castro
- CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal
| | - Olivier Fedrigo
- The Rockefeller University, 1230 York Ave, New York, NY 10065, USA
| | - Mateus Patricio
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Qiye Li
- BGI-Shenzhen, Shenzhen, China
| | - Sara Rocha
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain
- Biomedical Research Center (CINBIO), University of Vigo, Campus Universitario Lagoas-Marcosende, 36310 Vigo, Spain
| | - Agostinho Antunes
- CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal
| | - Yufeng Wu
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT 06269, USA
| | - Bin Ma
- School of Computer Science, University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada
| | - Remo Sanges
- Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), Via Bonomea 265, 34136 Trieste, Italy
- Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy
| | - Tomas Vinar
- Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Mlynská dolina, 842 48 Bratislava, Slovak Republic
| | - Blagoy Blagoev
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
| | - Thomas Sicheritz-Ponten
- Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Batu 3 1/2, Butik Air Nasi, 08100 Bedong, Kedah, Malaysia
- Evolutionary Genomics Section, Globe Institute, University of Copenhagen,Øster Farimagsgade 5, 1353 Copenhagen, Denmark
| | - Rasmus Nielsen
- Section for GeoGenetics, GLOBE Institute, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen, Denmark
- Departments of Integrative Biology and Statistics, University of California, 3040 Valley Life Sciences, Berkeley, CA 94720-3200, USA
| | - M Thomas P Gilbert
- Section for GeoGenetics, GLOBE Institute, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen, Denmark
- Norwegian University of Science and Technology, University Museum, Høgskolering 1, 7491 Trondheim, Norway
| |
Collapse
|
49
|
Böhrer M, Rymen B, Himber C, Gerbaud A, Pflieger D, Laudencia-Chingcuanco D, Cartwright A, Vogel J, Sibout R, Blevins T. Integrated Genome-Scale Analysis and Northern Blot Detection of Retrotransposon siRNAs Across Plant Species. Methods Mol Biol 2020; 2166:387-411. [PMID: 32710422 DOI: 10.1007/978-1-0716-0712-1_23] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Cells have sophisticated RNA-directed mechanisms to regulate genes, destroy viruses, or silence transposable elements (TEs). In terrestrial plants, a specialized non-coding RNA machinery involving RNA polymerase IV (Pol IV) and small interfering RNAs (siRNAs) targets DNA methylation and silencing to TEs. Here, we present a bioinformatics protocol for annotating and quantifying siRNAs that derive from long terminal repeat (LTR) retrotransposons. The approach was validated using small RNA northern blot analyses, comparing the species Arabidopsis thaliana and Brachypodium distachyon. To assist hybridization probe design, we configured a genome browser to show small RNA-seq mappings in distinct colors and shades according to their nucleotide lengths and abundances, respectively. Samples from wild-type and pol IV mutant plants, cross-species negative controls, and a conserved microRNA control validated the detected siRNA signals, confirming their origin from specific TEs and their Pol IV-dependent biogenesis. Moreover, an optimized labeling method yielded probes that could detect low-abundance siRNAs from B. distachyon TEs. The integration of de novo TE annotation, small RNA-seq profiling, and northern blotting, as outlined here, will facilitate the comparative genomic analysis of RNA silencing in crop plants and non-model species.
Collapse
Affiliation(s)
- Marcel Böhrer
- Institut de Biologie Moléculaire des Plantes, CNRS UPR2357, Université de Strasbourg, Strasbourg, France
| | - Bart Rymen
- Institut de Biologie Moléculaire des Plantes, CNRS UPR2357, Université de Strasbourg, Strasbourg, France
| | - Christophe Himber
- Institut de Biologie Moléculaire des Plantes, CNRS UPR2357, Université de Strasbourg, Strasbourg, France
| | - Aude Gerbaud
- Institut de Biologie Moléculaire des Plantes, CNRS UPR2357, Université de Strasbourg, Strasbourg, France
| | - David Pflieger
- Institut de Biologie Moléculaire des Plantes, CNRS UPR2357, Université de Strasbourg, Strasbourg, France
| | | | - Amy Cartwright
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - John Vogel
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Richard Sibout
- INRAE, UR BIA, Nantes, France.,Institut Jean-Pierre Bourgin, UMR 1318, INRAE, AgroParisTech, CNRS, Université Paris-Saclay, Versailles, France
| | - Todd Blevins
- Institut de Biologie Moléculaire des Plantes, CNRS UPR2357, Université de Strasbourg, Strasbourg, France.
| |
Collapse
|
50
|
Lu S, Yang J, Dai X, Xie F, He J, Dong Z, Mao J, Liu G, Chang Z, Zhao R, Wan W, Zhang R, Li Y, Wang W, Li X. Chromosomal-level reference genome of Chinese peacock butterfly (Papilio bianor) based on third-generation DNA sequencing and Hi-C analysis. Gigascience 2019; 8:giz128. [PMID: 31682256 PMCID: PMC6827417 DOI: 10.1093/gigascience/giz128] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Revised: 08/18/2019] [Accepted: 10/04/2019] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND Papilio bianor Cramer, 1777 (commonly known as the Chinese peacock butterfly) (Insecta, Lepidoptera, Papilionidae) is a widely distributed swallowtail butterfly with a wide number of geographic populations ranging from the southeast of Russia to China, Japan, India, Vietnam, Myanmar, and Thailand. Its wing color consists of both pigmentary colored scales (black, reddish) and structural colored scales (iridescent blue or green dust). A high-quality reference genome of P. bianor is an important foundation for investigating iridescent color evolution, phylogeography, and the evolution of swallowtail butterflies. FINDINGS We obtained a chromosome-level de novo genome assembly of the highly heterozygous P. bianor using long Pacific Biosciences sequencing reads and high-throughput chromosome conformation capture technology. The final assembly is 421.52 Mb on 30 chromosomes (29 autosomes and 1 Z sex chromosome) with 13.12 Mb scaffold N50. In total, 15,375 protein-coding genes and 233.09 Mb of repetitive sequences were identified. Phylogenetic analyses indicated that P. bianor separated from a common ancestor of swallowtails ∼23.69-36.04 million years ago. Demographic history suggested that the population expansion of this species from the last interglacial period to the last glacial maximum possibly resulted from its decreased natural enemies and its adaptation to climate change during the glacial period. CONCLUSIONS We present a high-quality chromosome-level reference genome of P. bianor using long-read single-molecule sequencing and Hi-C-based chromatin interaction maps. Our results lay the foundation for exploring the genetic basis of special biological features of P. bianor and also provide a useful data source for comparative genomics and phylogenomics among butterflies and moths.
Collapse
Affiliation(s)
- Sihan Lu
- Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, No.1 Dongxiang Road, Chang'an District, Xi'an, Shaanxi 710129, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
| | - Jie Yang
- Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, No.1 Dongxiang Road, Chang'an District, Xi'an, Shaanxi 710129, China
| | - Xuelei Dai
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, No.22 Xinong Road,Yangling, Shaanxi 712100, China
| | - Feiang Xie
- School of Marine Science and Technology, Zhejiang Ocean University, No.1 Haida South Road, Lincheng Changzhi Island, Zhoushan, Zhejiang 316022, China
| | - Jinwu He
- Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, No.1 Dongxiang Road, Chang'an District, Xi'an, Shaanxi 710129, China
| | - Zhiwei Dong
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
| | - Junlai Mao
- School of Marine Science and Technology, Zhejiang Ocean University, No.1 Haida South Road, Lincheng Changzhi Island, Zhoushan, Zhejiang 316022, China
| | - Guichun Liu
- Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, No.1 Dongxiang Road, Chang'an District, Xi'an, Shaanxi 710129, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
| | - Zhou Chang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
| | - Ruoping Zhao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
| | - Wenting Wan
- Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, No.1 Dongxiang Road, Chang'an District, Xi'an, Shaanxi 710129, China
| | - Ru Zhang
- Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, No.1 Dongxiang Road, Chang'an District, Xi'an, Shaanxi 710129, China
| | - Yuan Li
- Nextomics Biosciences Institute, No.666 Gaoxin Road, Wuhan, Hubei 430000, China
| | - Wen Wang
- Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, No.1 Dongxiang Road, Chang'an District, Xi'an, Shaanxi 710129, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
| | - Xueyan Li
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32 Jiaochang Raod, Kunming, Yunnan 650223, China
| |
Collapse
|