1
|
Haese-Hill W, Crouch K, Otto TD. Annotation and visualization of parasite, fungi and arthropod genomes with Companion. Nucleic Acids Res 2024; 52:W39-W44. [PMID: 38752499 PMCID: PMC11223846 DOI: 10.1093/nar/gkae378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 04/06/2024] [Accepted: 04/30/2024] [Indexed: 07/06/2024] Open
Abstract
As sequencing genomes has become increasingly popular, the need for annotation of the resulting assemblies is growing. Structural and functional annotation is still challenging as it includes finding the correct gene sequences, annotating other elements such as RNA and being able to submit those data to databases to share it with the community. Compared to de novo assembly where contiguous chromosomes are a sign of high quality, it is difficult to visualize and assess the quality of annotation. We developed the Companion web server to allow non-experts to annotate their genome using a reference-based method, enabling them to assess the output before submitting to public databases. In this update paper, we describe how we have included novel methods for gene finding and made the Companion server more efficient for annotation of genomes of up to 1 Gb in size. The reference set was increased to include genomes of interest for human and animal health from the fungi and arthropod kingdoms. We show that Companion outperforms existing comparable tools where closely related references are available.
Collapse
Affiliation(s)
| | - Kathryn Crouch
- School of Infection & Immunity, University of Glasgow, UK
| | - Thomas D Otto
- School of Infection & Immunity, University of Glasgow, UK
- LPHI, CNRS, INSERM, Université de Montpellier, France
| |
Collapse
|
2
|
Liang Y, Carrillo-Baltodano AM, Martín-Durán JM. Emerging trends in the study of spiralian larvae. Evol Dev 2024; 26:e12459. [PMID: 37787615 DOI: 10.1111/ede.12459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 09/11/2023] [Accepted: 09/13/2023] [Indexed: 10/04/2023]
Abstract
Many animals undergo indirect development, where their embryogenesis produces an intermediate life stage, or larva, that is often free-living and later metamorphoses into an adult. As their adult counterparts, larvae can have unique and diverse morphologies and occupy various ecological niches. Given their broad phylogenetic distribution, larvae have been central to hypotheses about animal evolution. However, the evolution of these intermediate forms and the developmental mechanisms diversifying animal life cycles are still debated. This review focuses on Spiralia, a large and diverse clade of bilaterally symmetrical animals with a fascinating array of larval forms, most notably the archetypical trochophore larva. We explore how classic research and modern advances have improved our understanding of spiralian larvae, their development, and evolution. Specifically, we examine three morphological features of spiralian larvae: the anterior neural system, the ciliary bands, and the posterior hyposphere. The combination of molecular and developmental evidence with modern high-throughput techniques, such as comparative genomics, single-cell transcriptomics, and epigenomics, is a promising strategy that will lead to new testable hypotheses about the mechanisms behind the evolution of larvae and life cycles in Spiralia and animals in general. We predict that the increasing number of available genomes for Spiralia and the optimization of genome-wide and single-cell approaches will unlock the study of many emerging spiralian taxa, transforming our views of the evolution of this animal group and their larvae.
Collapse
Affiliation(s)
- Yan Liang
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| | | | - José M Martín-Durán
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| |
Collapse
|
3
|
Sierra P, Durbin R. Identification of transposable element families from pangenome polymorphisms. Mob DNA 2024; 15:13. [PMID: 38926873 PMCID: PMC11202377 DOI: 10.1186/s13100-024-00323-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Accepted: 06/13/2024] [Indexed: 06/28/2024] Open
Abstract
BACKGROUND Transposable Elements (TEs) are segments of DNA, typically a few hundred base pairs up to several tens of thousands bases long, that have the ability to generate new copies of themselves in the genome. Most existing methods used to identify TEs in a newly sequenced genome are based on their repetitive character, together with detection based on homology and structural features. As new high quality assemblies become more common, including the availability of multiple independent assemblies from the same species, an alternative strategy for identification of TE families becomes possible in which we focus on the polymorphism at insertion sites caused by TE mobility. RESULTS We develop the idea of using the structural polymorphisms found in pangenomes to create a library of the TE families recently active in a species, or in a closely related group of species. We present a tool, pantera, that achieves this task, and illustrate its use both on species with well-curated libraries, and on new assemblies. CONCLUSIONS Our results show that pantera is sensitive and accurate, tending to correctly identify complete elements with precise boundaries, and is particularly well suited to detect larger, low copy number TEs that are often undetected with existing de novo methods.
Collapse
Affiliation(s)
- Pío Sierra
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK.
| |
Collapse
|
4
|
Sidharthan VK, Reddy V, Kiran G, Rajeswari V, Baranwal VK, Kumar MK, Kumar KS. Probing of plant transcriptomes reveals the hidden genetic diversity of the family Secoviridae. Arch Virol 2024; 169:150. [PMID: 38898334 DOI: 10.1007/s00705-024-06076-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Accepted: 05/07/2024] [Indexed: 06/21/2024]
Abstract
Secoviruses are single-stranded RNA viruses that infect plants. In the present study, we identified 61 putative novel secoviral genomes in various plant species by mining publicly available plant transcriptome data. These viral sequences represent the genomes of 13 monopartite and 48 bipartite secovirids. The genome sequences of 52 secovirids were coding-complete, and nine were partial. Except for small open reading frames (ORFs) determined in waikaviral genomes and RNA2 of torradoviruses, all of the recovered genomes/genome segments contained a large ORF encoding a polyprotein. Based on genome organization and phylogeny, all but three of the novel secoviruses were assigned to different genera. The genome organization of two identified waika-like viruses resembled that of the recently identified waika-like virus Triticum aestivum secovirus. Phylogenetic analysis revealed a pattern of host-virus co-evolution in a few waika- and waika-like viruses and increased phylogenetic diversity of nepoviruses. The study provides a basis for further investigation of the biological properties of these novel secoviruses.
Collapse
Affiliation(s)
- V Kavi Sidharthan
- Division of Genetics and Tree Improvement, ICFRE-Institute of Forest Biodiversity, Hyderabad, India.
| | - Vijayprakash Reddy
- Division of Genetics and Tree Improvement, ICFRE-Institute of Forest Biodiversity, Hyderabad, India
| | - G Kiran
- Division of Genetics and Tree Improvement, ICFRE-Institute of Forest Biodiversity, Hyderabad, India
| | - V Rajeswari
- School of Agricultural Sciences, Malla Reddy University, Hyderabad, India
| | - V K Baranwal
- Division of Plant Pathology, ICAR-Indian Agricultural Research Institute, New Delhi, India
| | - M Kiran Kumar
- Division of Genetics and Tree Improvement, ICFRE-Institute of Forest Biodiversity, Hyderabad, India
| | - K Sudheer Kumar
- Division of Genetics and Tree Improvement, ICFRE-Institute of Forest Biodiversity, Hyderabad, India
| |
Collapse
|
5
|
Jackson DJ, Cerveau N, Posnien N. De novo assembly of transcriptomes and differential gene expression analysis using short-read data from emerging model organisms - a brief guide. Front Zool 2024; 21:17. [PMID: 38902827 PMCID: PMC11188175 DOI: 10.1186/s12983-024-00538-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 06/12/2024] [Indexed: 06/22/2024] Open
Abstract
Many questions in biology benefit greatly from the use of a variety of model systems. High-throughput sequencing methods have been a triumph in the democratization of diverse model systems. They allow for the economical sequencing of an entire genome or transcriptome of interest, and with technical variations can even provide insight into genome organization and the expression and regulation of genes. The analysis and biological interpretation of such large datasets can present significant challenges that depend on the 'scientific status' of the model system. While high-quality genome and transcriptome references are readily available for well-established model systems, the establishment of such references for an emerging model system often requires extensive resources such as finances, expertise and computation capabilities. The de novo assembly of a transcriptome represents an excellent entry point for genetic and molecular studies in emerging model systems as it can efficiently assess gene content while also serving as a reference for differential gene expression studies. However, the process of de novo transcriptome assembly is non-trivial, and as a rule must be empirically optimized for every dataset. For the researcher working with an emerging model system, and with little to no experience with assembling and quantifying short-read data from the Illumina platform, these processes can be daunting. In this guide we outline the major challenges faced when establishing a reference transcriptome de novo and we provide advice on how to approach such an endeavor. We describe the major experimental and bioinformatic steps, provide some broad recommendations and cautions for the newcomer to de novo transcriptome assembly and differential gene expression analyses. Moreover, we provide an initial selection of tools that can assist in the journey from raw short-read data to assembled transcriptome and lists of differentially expressed genes.
Collapse
Affiliation(s)
- Daniel J Jackson
- University of Göttingen, Department of Geobiology, Goldschmidtstr.3, Göttingen, 37077, Germany.
| | - Nicolas Cerveau
- University of Göttingen, Department of Geobiology, Goldschmidtstr.3, Göttingen, 37077, Germany
| | - Nico Posnien
- University of Göttingen, Department of Developmental Biology, GZMB, Justus-Von-Liebig-Weg 11, Göttingen, 37077, Germany.
| |
Collapse
|
6
|
Sanita Lima M, Silva Domingues D, Rossi Paschoal A, Smith DR. Long-read RNA sequencing can probe organelle genome pervasive transcription. Brief Funct Genomics 2024:elae026. [PMID: 38880995 DOI: 10.1093/bfgp/elae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/20/2024] [Accepted: 05/30/2024] [Indexed: 06/18/2024] Open
Abstract
40 years ago, organelle genomes were assumed to be streamlined and, perhaps, unexciting remnants of their prokaryotic past. However, the field of organelle genomics has exposed an unparallel diversity in genome architecture (i.e. genome size, structure, and content). The transcription of these eccentric genomes can be just as elaborate - organelle genomes are pervasively transcribed into a plethora of RNA types. However, while organelle protein-coding genes are known to produce polycistronic transcripts that undergo heavy posttranscriptional processing, the nature of organelle noncoding transcriptomes is still poorly resolved. Here, we review how wet-lab experiments and second-generation sequencing data (i.e. short reads) have been useful to determine certain types of organelle RNAs, particularly noncoding RNAs. We then explain how third-generation (long-read) RNA-Seq data represent the new frontier in organelle transcriptomics. We show that public repositories (e.g. NCBI SRA) already contain enough data for inter-phyla comparative studies and argue that organelle biologists can benefit from such data. We discuss the prospects of using publicly available sequencing data for organelle-focused studies and examine the challenges of such an approach. We highlight that the lack of a comprehensive database dedicated to organelle genomics/transcriptomics is a major impediment to the development of a field with implications in basic and applied science.
Collapse
Affiliation(s)
- Matheus Sanita Lima
- Department of Biology, Western University, 1151 Richmond Street, London, Ontario N6A 5B7, Canada
| | - Douglas Silva Domingues
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Avenida Padua Dias 11, Piracicaba, SP 13418-900, Brazil
| | - Alexandre Rossi Paschoal
- Department of Computer Science, Bioinformatics and Pattern Recognition Group (BIOINFO-CP), Federal University of Technology - Paraná - UTFPR, Avenida Alberto Carazzai 1640, Cornélio Procópio, PR 86300000, Brazil
| | - David Roy Smith
- Department of Biology, Western University, 1151 Richmond Street, London, Ontario N6A 5B7, Canada
| |
Collapse
|
7
|
Caporale LH. Evolutionary feedback from the environment shapes mechanisms that generate genome variation. J Physiol 2024; 602:2601-2614. [PMID: 38194279 DOI: 10.1113/jp284411] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 12/14/2023] [Indexed: 01/10/2024] Open
Abstract
Darwin recognized that 'a grand and almost untrodden field of inquiry will be opened, on the causes and laws of variation.' However, because the Modern Synthesis assumes that the intrinsic probability of any individual mutation is unrelated to that mutation's potential adaptive value, attention has been focused on selection rather than on the intrinsic generation of variation. Yet many examples illustrate that the term 'random' mutation, as widely understood, is inaccurate. The probabilities of distinct classes of variation are neither evenly distributed across a genome nor invariant over time, nor unrelated to their potential adaptive value. Because selection acts upon variation, multiple biochemical mechanisms can and have evolved that increase the relative probability of adaptive mutations. In effect, the generation of heritable variation is in a feedback loop with selection, such that those mechanisms that tend to generate variants that survive recurring challenges in the environment would be captured by this survival and thus inherited and accumulated within lineages of genomes. Moreover, because genome variation is affected by a wide range of biochemical processes, genome variation can be regulated. Biochemical mechanisms that sense stress, from lack of nutrients to DNA damage, can increase the probability of specific classes of variation. A deeper understanding of evolution involves attention to the evolution of, and environmental influences upon, the intrinsic variation generated in gametes, in other words upon the biochemical mechanisms that generate variation across generations. These concepts have profound implications for the types of questions that can and should be asked, as omics databases become more comprehensive, detection methods more sensitive, and computation and experimental analyses even more high throughput and thus capable of revealing the intrinsic generation of variation in individual gametes. These concepts also have profound implications for evolutionary theory, which, upon reflection it will be argued, predicts that selection would increase the probability of generating adaptive mutations, in other words, predicts that the ability to evolve itself evolves.
Collapse
|
8
|
Price DRG, Steele P, Frew D, McLean K, Androscuk D, Geldhof P, Borloo J, Albaladejo JP, Nisbet AJ, McNeilly TN. Characterisation of protective vaccine antigens from the thiol-containing components of excretory/secretory material of Ostertagia ostertagi. Vet Parasitol 2024; 328:110154. [PMID: 38490160 DOI: 10.1016/j.vetpar.2024.110154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 02/21/2024] [Accepted: 02/28/2024] [Indexed: 03/17/2024]
Abstract
Previous vaccination trials have demonstrated that thiol proteins affinity purified from Ostertagia ostertagi excretory-secretory products (O. ostertagi ES-thiol) are protective against homologous challenge. Here we have shown that protection induced by this vaccine was consistent across four independent vaccine-challenge experiments. Protection is associated with reduced cumulative faecal egg counts across the duration of the trials, relative to control animals. To better understand the diversity of antigens in O. ostertagi ES-thiol we used high-resolution shotgun proteomics to identify 490 unique proteins in the vaccine preparation. The most numerous ES-thiol proteins, with 91 proteins identified, belong to the sperm-coating protein/Tpx/antigen 5/pathogenesis-related protein 1 (SCP/TAPS) family. This family includes previously identified O. ostertagi vaccine antigens O. ostertagi ASP-1 and ASP-2. The ES-thiol fraction also has numerous proteinases, representing three distinct classes, including: metallo-; aspartyl- and cysteine proteinases. In terms of number of family members, the M12 astacin-like metalloproteinases, with 33 proteins, are the most abundant proteinase family in O. ostertagi ES-thiol. The O. ostertagi ES-thiol proteome provides a comprehensive database of proteins present in this vaccine preparation and will guide future vaccine antigen discovery projects.
Collapse
Affiliation(s)
- Daniel R G Price
- Moredun Research Institute, Pentlands Science Park, Midlothian EH26 0PZ, UK.
| | - Philip Steele
- Moredun Research Institute, Pentlands Science Park, Midlothian EH26 0PZ, UK
| | - David Frew
- Moredun Research Institute, Pentlands Science Park, Midlothian EH26 0PZ, UK
| | - Kevin McLean
- Moredun Research Institute, Pentlands Science Park, Midlothian EH26 0PZ, UK
| | - Dorota Androscuk
- Moredun Research Institute, Pentlands Science Park, Midlothian EH26 0PZ, UK
| | - Peter Geldhof
- Laboratory of Parasitology, Department of Virology, Parasitology and Immunology, Faculty of Veterinary Sciences, Ghent University, Belgium
| | - Jimmy Borloo
- Laboratory of Parasitology, Department of Virology, Parasitology and Immunology, Faculty of Veterinary Sciences, Ghent University, Belgium
| | - Javier Palarea Albaladejo
- Biomathematics and Statistics Scotland, JCMB, The King's Buildings, Peter Guthrie Tait Road, Edinburgh, Scotland, UK; Department of Computer Science, Applied Mathematics and Statistics, University of Girona, Girona, Spain
| | - Alasdair J Nisbet
- Moredun Research Institute, Pentlands Science Park, Midlothian EH26 0PZ, UK
| | - Tom N McNeilly
- Moredun Research Institute, Pentlands Science Park, Midlothian EH26 0PZ, UK
| |
Collapse
|
9
|
Blackman R, Couton M, Keck F, Kirschner D, Carraro L, Cereghetti E, Perrelet K, Bossart R, Brantschen J, Zhang Y, Altermatt F. Environmental DNA: The next chapter. Mol Ecol 2024; 33:e17355. [PMID: 38624076 DOI: 10.1111/mec.17355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 03/28/2024] [Accepted: 04/03/2024] [Indexed: 04/17/2024]
Abstract
Molecular tools are an indispensable part of ecology and biodiversity sciences and implemented across all biomes. About a decade ago, the use and implementation of environmental DNA (eDNA) to detect biodiversity signals extracted from environmental samples opened new avenues of research. Initial eDNA research focused on understanding population dynamics of target species. Its scope thereafter broadened, uncovering previously unrecorded biodiversity via metabarcoding in both well-studied and understudied ecosystems across all taxonomic groups. The application of eDNA rapidly became an established part of biodiversity research, and a research field by its own. Here, we revisit key expectations made in a land-mark special issue on eDNA in Molecular Ecology in 2012 to frame the development in six key areas: (1) sample collection, (2) primer development, (3) biomonitoring, (4) quantification, (5) behaviour of DNA in the environment and (6) reference database development. We pinpoint the success of eDNA, yet also discuss shortfalls and expectations not met, highlighting areas of research priority and identify the unexpected developments. In parallel, our retrospective couples a screening of the peer-reviewed literature with a survey of eDNA users including academics, end-users and commercial providers, in which we address the priority areas to focus research efforts to advance the field of eDNA. With the rapid and ever-increasing pace of new technical advances, the future of eDNA looks bright, yet successful applications and best practices must become more interdisciplinary to reach its full potential. Our retrospect gives the tools and expectations towards concretely moving the field forward.
Collapse
Affiliation(s)
- Rosetta Blackman
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Marjorie Couton
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - François Keck
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Dominik Kirschner
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Environmental Systems Science, Institute of Terrestrial Ecosystems, Ecosystems and Landscape Evolution, ETH Zürich, Zürich, Switzerland
- Department of Landscape Dynamics & Ecology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
| | - Luca Carraro
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Eva Cereghetti
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Kilian Perrelet
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
- Department of Biodiversity and Conservation Biology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
- Department of Urban Water Management, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
| | - Raphael Bossart
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
| | - Jeanine Brantschen
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Yan Zhang
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
- State Key Laboratory of Pollution Control & Resource Reuse, School of the Environment, Nanjing University, Nanjing, China
| | - Florian Altermatt
- Department of Aquatic Ecology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| |
Collapse
|
10
|
Yu H, Li Y, Han W, Bao L, Liu F, Ma Y, Pu Z, Zeng Q, Zhang L, Bao Z, Wang S. Pan-evolutionary and regulatory genome architecture delineated by an integrated macro- and microsynteny approach. Nat Protoc 2024; 19:1623-1678. [PMID: 38514839 DOI: 10.1038/s41596-024-00966-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 12/20/2023] [Indexed: 03/23/2024]
Abstract
The forthcoming massive genome data generated by the Earth BioGenome Project will open up a new era of comparative genomics, for which genome synteny analysis provides an important framework. Profiling genome synteny represents an essential step in elucidating genome architecture, regulatory blocks/elements and their evolutionary history. Here we describe PanSyn, ( https://github.com/yhw320/PanSyn ), the most comprehensive and up-to-date genome synteny pipeline, providing step-by-step instructions and application examples to demonstrate its usage. PanSyn inherits both basic and advanced functions from existing popular tools, offering a user-friendly, highly customized approach for genome macrosynteny analysis and integrated pan-evolutionary and regulatory analysis of genome architecture, which are not yet available in public synteny software or tools. The advantages of PanSyn include: (i) advanced microsynteny analysis by functional profiling of microsynteny genes and associated regulatory elements; (ii) comprehensive macrosynteny analysis, including the inference of karyotype evolution from ancestors to extant species; and (iii) functional integration of microsynteny and macrosynteny for pan-evolutionary profiling of genome architecture and regulatory blocks, as well as integration with external functional genomics datasets from three- or four-dimensional genome and ENCODE projects. PanSyn requires basic knowledge of the Linux environment and Perl programming language and the ability to access a computer cluster, especially for large-scale genomic comparisons. Our protocol can be easily implemented by a competent graduate student or postdoc and takes several days to weeks to execute for dozens to hundreds of genomes. PanSyn provides yet the most comprehensive and powerful tool for integrated evolutionary and functional genomics.
Collapse
Affiliation(s)
- Hongwei Yu
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Yuli Li
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China.
- Laboratory for Marine Biology and Biotechnology, Laoshan Laboratory, Qingdao, China.
| | - Wentao Han
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Lisui Bao
- Institute of Evolution & Marine Biodiversity, Ocean University of China, Qingdao, China
| | - Fuyun Liu
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Yuanting Ma
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Zhongqi Pu
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Qifan Zeng
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Lingling Zhang
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Laoshan Laboratory, Qingdao, China
| | - Zhenmin Bao
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Southern Marine Science and Engineering Guangdong Laboratory, Guangzhou, China
- Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institution, Ocean University of China, Sanya, China
- Laboratory for Marine Fisheries and Aquaculture, Laoshan Laboratory, Qingdao, China
| | - Shi Wang
- Fang Zongxi Center for Marine Evo-Devo & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China.
- Laboratory for Marine Biology and Biotechnology, Laoshan Laboratory, Qingdao, China.
- Southern Marine Science and Engineering Guangdong Laboratory, Guangzhou, China.
- Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institution, Ocean University of China, Sanya, China.
| |
Collapse
|
11
|
Pastusiak A, Reddy MR, Chen X, Hoyer I, Dorman J, Gebhardt ME, Carpi G, Norris DE, Pipas JM, Jackson EK. A metagenomic analysis of the phase 2 Anopheles gambiae 1000 genomes dataset reveals a wide diversity of cobionts associated with field collected mosquitoes. Commun Biol 2024; 7:667. [PMID: 38816486 PMCID: PMC11139907 DOI: 10.1038/s42003-024-06337-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 05/15/2024] [Indexed: 06/01/2024] Open
Abstract
The Anopheles gambiae 1000 Genomes (Ag1000G) Consortium previously utilized deep sequencing methods to catalogue genetic diversity across African An. gambiae populations. We analyzed the complete datasets of 1142 individually sequenced mosquitoes through Microsoft Premonition's Bayesian mixture model based (BMM) metagenomics pipeline. All specimens were confirmed as either An. gambiae sensu stricto (s.s.) or An. coluzzii with a high degree of confidence ( > 98% identity to reference). Homo sapiens DNA was identified in all specimens indicating contamination may have occurred either at the time of specimen collection, preparation and/or sequencing. We found evidence of vertebrate hosts in 162 specimens. 59 specimens contained validated Plasmodium falciparum reads. Human hepatitis B and primate erythroparvovirus-1 viral sequences were identified in fifteen and three mosquito specimens, respectively. 478 of the 1,142 specimens were found to contain bacterial reads and bacteriophage-related contigs were detected in 27 specimens. This analysis demonstrates the capacity of metagenomic approaches to elucidate important vector-host-pathogen interactions of epidemiological significance.
Collapse
Affiliation(s)
| | - Michael R Reddy
- Microsoft Premonition, Microsoft Research, Redmond, WA, 98052, USA.
| | - Xiaoji Chen
- Microsoft Premonition, Microsoft Research, Redmond, WA, 98052, USA
| | - Isaiah Hoyer
- Microsoft Premonition, Microsoft Research, Redmond, WA, 98052, USA
| | - Jack Dorman
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Mary E Gebhardt
- The W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Malaria Research Institute, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
| | - Giovanna Carpi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Douglas E Norris
- The W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Malaria Research Institute, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
| | - James M Pipas
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Ethan K Jackson
- Microsoft Premonition, Microsoft Research, Redmond, WA, 98052, USA
| |
Collapse
|
12
|
Wos G, Palomar G, Golab MJ, Marszałek M, Sniegula S. Effects of overwintering on the transcriptome and fitness traits in a damselfly with variable voltinism across two latitudes. Sci Rep 2024; 14:12192. [PMID: 38806592 PMCID: PMC11133422 DOI: 10.1038/s41598-024-63066-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 05/24/2024] [Indexed: 05/30/2024] Open
Abstract
Winter diapause consists of cessation of development that allows individuals to survive unfavourable conditions. Winter diapause may bear various costs and questions have been raised about the evolutionary mechanisms maintaining facultative diapause. Here, we explored to what extent a facultative winter diapause affects life-history traits and the transcriptome in the damselfly Ischnura elegans, and whether these effects were latitude-specific. We collected adult females at central and high latitudes and raised their larvae in growth chambers. Larvae were split into a non-diapausing and post-winter (diapausing) cohort, were phenotyped and collected for a gene expression analysis. At the phenotypic level, we found no difference in survival between the two cohorts, and the post-winter cohort was larger and heavier than the non-winter cohort. These effects were mostly independent of the latitude of origin. At the transcriptomic level, wintering affected gene expression with a small fraction of genes significantly overlapping across latitudes, especially those related to morphogenesis. In conclusion, we found clear effects of diapause on the phenotype but little evidence for latitudinal-specific effects of diapause. Our results showed a shared transcriptomic basis underpinning diapause demonstrated, here, at the intraspecific level and supported the idea of evolutionary convergence of the response to diapause across organisms.
Collapse
Affiliation(s)
- Guillaume Wos
- Institute of Nature Conservation Polish Academy of Sciences, Al. Adama Mickiewicza 33, 31-120, Kraków, Poland.
| | - Gemma Palomar
- Institute of Nature Conservation Polish Academy of Sciences, Al. Adama Mickiewicza 33, 31-120, Kraków, Poland
- Department of Genetics, Physiology and Microbiology, Faculty of Biological Sciences, Complutense University of Madrid, José Antonio Novais, 12, 28040, Madrid, Spain
| | - Maria J Golab
- Institute of Nature Conservation Polish Academy of Sciences, Al. Adama Mickiewicza 33, 31-120, Kraków, Poland
| | - Marzena Marszałek
- Institute of Environmental Sciences, Jagiellonian University, Gronostajowa 7, 30-387, Kraków, Poland
| | - Szymon Sniegula
- Institute of Nature Conservation Polish Academy of Sciences, Al. Adama Mickiewicza 33, 31-120, Kraków, Poland.
| |
Collapse
|
13
|
Khalaf A, Francis O, Blaxter ML. Genome evolution in intracellular parasites: Microsporidia and Apicomplexa. J Eukaryot Microbiol 2024:e13033. [PMID: 38785208 DOI: 10.1111/jeu.13033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 03/29/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024]
Abstract
Microsporidia and Apicomplexa are eukaryotic, single-celled, intracellular parasites with huge public health and economic importance. Typically, these parasites are studied separately, emphasizing their uniqueness and diversity. In this review, we explore the huge amount of genomic data that has recently become available for the two groups. We compare and contrast their genome evolution and discuss how their transitions to intracellular life may have shaped it. In particular, we explore genome reduction and compaction, genome expansion and ploidy, gene shuffling and rearrangements, and the evolution of centromeres and telomeres.
Collapse
Affiliation(s)
- Amjad Khalaf
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | - Ore Francis
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | | |
Collapse
|
14
|
Schultz DT, Heath-Heckman EA, Winchell CJ, Kuo DH, Yu YS, Oberauer F, Kocot KM, Cho SJ, Simakov O, Weisblat DA. Acceleration of genome rearrangement in clitellate annelids. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.12.593736. [PMID: 38798472 PMCID: PMC11118384 DOI: 10.1101/2024.05.12.593736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Comparisons of multiple metazoan genomes have revealed the existence of ancestral linkage groups (ALGs), genomic scaffolds sharing sets of orthologous genes that have been inherited from ancestral animals for hundreds of millions of years (Simakov et al. 2022; Schultz et al. 2023) These ALGs have persisted across major animal taxa including Cnidaria, Deuterostomia, Ecdysozoa and Spiralia. Notwithstanding this general trend of chromosome-scale conservation, ALGs have been obliterated by extensive genome rearrangements in certain groups, most notably including Clitellata (oligochaetes and leeches), a group of easily overlooked invertebrates that is of tremendous ecological, agricultural and economic importance (Charles 2019; Barrett 2016). To further investigate these rearrangements, we have undertaken a comparison of 12 clitellate genomes (including four newly sequenced species) and 11 outgroup representatives. We show that these rearrangements began at the base of the Clitellata (rather than progressing gradually throughout polychaete annelids), that the inter-chromosomal rearrangements continue in several clitellate lineages and that these events have substantially shaped the evolution of the otherwise highly conserved Hox cluster.
Collapse
Affiliation(s)
- Darrin T. Schultz
- Department of Neuroscience and Developmental Biology, University of Vienna, Vienna 1010, Austria
| | - Elizabeth A.C. Heath-Heckman
- Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA
| | - Christopher J. Winchell
- Department of Molecular and Cell Biology, University of California, 385 Weill Hall, Berkeley, CA 94720-3200, USA
| | - Dian-Han Kuo
- Department of Life Science & Museum of Zoology, National Taiwan University, No. 1 Section 4 Roosevelt Rd., Taipei 10617, Taiwan
| | - Yun-sang Yu
- Department of Biological Sciences and Biotechnology, Chungbuk National University, Cheongju, 28644, Republic of Korea
| | - Fabian Oberauer
- Department of Neuroscience and Developmental Biology, University of Vienna, Vienna 1010, Austria
| | - Kevin M. Kocot
- Department of Biological Sciences, University of Alabama, Tuscaloosa, AL 35487, USA
- Alabama Museum of Natural History, University of Alabama, Tuscaloosa, AL 35487, USA
| | - Sung-Jin Cho
- Department of Biological Sciences and Biotechnology, Chungbuk National University, Cheongju, 28644, Republic of Korea
| | - Oleg Simakov
- Department of Neuroscience and Developmental Biology, University of Vienna, Vienna 1010, Austria
| | - David A. Weisblat
- Department of Molecular and Cell Biology, University of California, 385 Weill Hall, Berkeley, CA 94720-3200, USA
| |
Collapse
|
15
|
Mascher M, Marone MP, Schreiber M, Stein N. Are cereal grasses a single genetic system? NATURE PLANTS 2024; 10:719-731. [PMID: 38605239 DOI: 10.1038/s41477-024-01674-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 03/17/2024] [Indexed: 04/13/2024]
Abstract
In 1993, a passionate and provocative call to arms urged cereal researchers to consider the taxon they study as a single genetic system and collaborate with each other. Since then, that group of scientists has seen their discipline blossom. In an attempt to understand what unity of genetic systems means and how the notion was borne out by later research, we survey the progress and prospects of cereal genomics: sequence assemblies, population-scale sequencing, resistance gene cloning and domestication genetics. Gene order may not be as extraordinarily well conserved in the grasses as once thought. Still, several recurring themes have emerged. The same ancestral molecular pathways defining plant architecture have been co-opted in the evolution of different cereal crops. Such genetic convergence as much as cross-fertilization of ideas between cereal geneticists has led to a rich harvest of genes that, it is hoped, will lead to improved varieties.
Collapse
Affiliation(s)
- Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research, Gatersleben, Germany.
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany.
| | - Marina Püpke Marone
- Leibniz Institute of Plant Genetics and Crop Plant Research, Gatersleben, Germany
| | - Mona Schreiber
- University of Marburg, Department of Biology, Marburg, Germany
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research, Gatersleben, Germany.
- Martin Luther University Halle-Wittenberg, Halle (Saale), Germany.
| |
Collapse
|
16
|
Wilson CG, Pieszko T, Nowell RW, Barraclough TG. Recombination in bdelloid rotifer genomes: asexuality, transfer and stress. Trends Genet 2024; 40:422-436. [PMID: 38458877 DOI: 10.1016/j.tig.2024.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/31/2024] [Accepted: 02/01/2024] [Indexed: 03/10/2024]
Abstract
Bdelloid rotifers constitute a class of microscopic animals living in freshwater habitats worldwide. Several strange features of bdelloids have drawn attention: their ability to tolerate desiccation and other stresses, a lack of reported males across the clade despite centuries of study, and unusually high numbers of horizontally acquired, non-metazoan genes. Genome sequencing is transforming our understanding of their lifestyle and its consequences, while in turn providing wider insights about recombination and genome organisation in animals. Many questions remain, not least how to reconcile apparent genomic signatures of sex with the continued absence of reported males, why bdelloids have so many horizontally acquired genes, and how their remarkable ability to survive stress interacts with recombination and other genomic processes.
Collapse
Affiliation(s)
- Christopher G Wilson
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford OX1 3SZ, UK.
| | - Tymoteusz Pieszko
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford OX1 3SZ, UK
| | - Reuben W Nowell
- Institute of Ecology and Evolution, Ashworth Laboratories, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK; Biological and Environmental Sciences, School of Natural Sciences, University of Stirling, Stirling FK9 4LA, UK
| | | |
Collapse
|
17
|
Wong Y, Ignatieva A, Koskela J, Gorjanc G, Wohns AW, Kelleher J. A general and efficient representation of ancestral recombination graphs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.03.565466. [PMID: 37961279 PMCID: PMC10635123 DOI: 10.1101/2023.11.03.565466] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
As a result of recombination, adjacent nucleotides can have different paths of genetic inheritance and therefore the genealogical trees for a sample of DNA sequences vary along the genome. The structure capturing the details of these intricately interwoven paths of inheritance is referred to as an ancestral recombination graph (ARG). Classical formalisms have focused on mapping coalescence and recombination events to the nodes in an ARG. This approach is out of step with modern developments, which do not represent genetic inheritance in terms of these events or explicitly infer them. We present a simple formalism that defines an ARG in terms of specific genomes and their intervals of genetic inheritance, and show how it generalises these classical treatments and encompasses the outputs of recent methods. We discuss nuances arising from this more general structure, and argue that it forms an appropriate basis for a software standard in this rapidly growing field.
Collapse
Affiliation(s)
- Yan Wong
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, UK
| | - Anastasia Ignatieva
- School of Mathematics and Statistics, University of Glasgow, UK
- Department of Statistics, University of Oxford, UK
| | - Jere Koskela
- School of Mathematics, Statistics and Physics, Newcastle University, UK
- Department of Statistics, University of Warwick, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, UK
| | - Anthony W. Wohns
- Broad Institute of MIT and Harvard, Cambridge, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, USA
| | - Jerome Kelleher
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, UK
| |
Collapse
|
18
|
Li H, Durbin R. Genome assembly in the telomere-to-telomere era. Nat Rev Genet 2024:10.1038/s41576-024-00718-w. [PMID: 38649458 DOI: 10.1038/s41576-024-00718-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/27/2024] [Indexed: 04/25/2024]
Abstract
Genome sequences largely determine the biology and encode the history of an organism, and de novo assembly - the process of reconstructing the genome sequence of an organism from sequencing reads - has been a central problem in bioinformatics for four decades. Until recently, genomes were typically assembled into fragments of a few megabases at best, but now technological advances in long-read sequencing enable the near-complete assembly of each chromosome - also known as telomere-to-telomere assembly - for many organisms. Here, we review recent progress on assembly algorithms and protocols, with a focus on how to derive near-telomere-to-telomere assemblies. We also discuss the additional developments that will be required to resolve remaining assembly gaps and to assemble non-diploid genomes.
Collapse
Affiliation(s)
- Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| | - Richard Durbin
- Department of Genetics, Cambridge University, Cambridge, UK.
| |
Collapse
|
19
|
Lindeboom TA, Sanchez Olmos MDC, Schulz K, Brinkmann CK, Ramírez Rojas AA, Hochrein L, Schindler D. An Optimized Genotyping Workflow for Identifying Highly SCRaMbLEd Synthetic Yeasts. ACS Synth Biol 2024; 13:1116-1127. [PMID: 38597458 PMCID: PMC11036488 DOI: 10.1021/acssynbio.3c00476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 03/01/2024] [Accepted: 03/25/2024] [Indexed: 04/11/2024]
Abstract
Synthetic Sc2.0 yeast strains contain hundreds to thousands of loxPsym recombination sites that allow restructuring of the Saccharomyces cerevisiae genome by SCRaMbLE. Thus, a highly diverse yeast population can arise from a single genotype. The selection of genetically diverse candidates with rearranged synthetic chromosomes for downstream analysis requires an efficient and straightforward workflow. Here we present loxTags, a set of qPCR primers for genotyping across loxPsym sites to detect not only deletions but also inversions and translocations after SCRaMbLE. To cope with the large number of amplicons, we generated qTagGer, a qPCR genotyping primer prediction tool. Using loxTag-based genotyping and long-read sequencing, we show that light-inducible Cre recombinase L-SCRaMbLE can efficiently generate diverse recombination events when applied to Sc2.0 strains containing a linear or a circular version of synthetic chromosome III.
Collapse
Affiliation(s)
- Timon A Lindeboom
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| | | | - Karina Schulz
- Department of Molecular Biology, University of Potsdam, Karl-Liebknecht-Str. 24/25, 14476 Potsdam, Germany
| | - Cedric K Brinkmann
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| | - Adán A Ramírez Rojas
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| | - Lena Hochrein
- Department of Molecular Biology, University of Potsdam, Karl-Liebknecht-Str. 24/25, 14476 Potsdam, Germany
| | - Daniel Schindler
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
- Center for Synthetic Microbiology, Philipps-University Marburg, Karl-von-Frisch-Str. 14, 35032Marburg, Germany
| |
Collapse
|
20
|
Bernal-Gallardo JJ, de Folter S. Plant genome information facilitates plant functional genomics. PLANTA 2024; 259:117. [PMID: 38592421 PMCID: PMC11004055 DOI: 10.1007/s00425-024-04397-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 03/20/2024] [Indexed: 04/10/2024]
Abstract
MAIN CONCLUSION In this review, we give an overview of plant sequencing efforts and how this impacts plant functional genomics research. Plant genome sequence information greatly facilitates the studies of plant biology, functional genomics, evolution of genomes and genes, domestication processes, phylogenetic relationships, among many others. More than two decades of sequencing efforts have boosted the number of available sequenced plant genomes. The first plant genome, of Arabidopsis, was published in the year 2000 and currently, 4604 plant genomes from 1482 plant species have been published. Various large sequence initiatives are running, which are planning to produce tens of thousands of sequenced plant genomes in the near future. In this review, we give an overview on the status of sequenced plant genomes and on the use of genome information in different research areas.
Collapse
Affiliation(s)
- Judith Jazmin Bernal-Gallardo
- Unidad de Genómica Avanzada (UGA-Langebio), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Irapuato, Mexico
| | - Stefan de Folter
- Unidad de Genómica Avanzada (UGA-Langebio), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Irapuato, Mexico.
| |
Collapse
|
21
|
Comparative genomics uncover the evolutionary history of butterfly and moth chromosomes. Nat Ecol Evol 2024; 8:612-613. [PMID: 38383854 DOI: 10.1038/s41559-024-02341-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
|
22
|
Kyriacou RG, Mulhair PO, Holland PWH. GC Content Across Insect Genomes: Phylogenetic Patterns, Causes and Consequences. J Mol Evol 2024; 92:138-152. [PMID: 38491221 PMCID: PMC10978632 DOI: 10.1007/s00239-024-10160-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 02/06/2024] [Indexed: 03/18/2024]
Abstract
The proportions of A:T and G:C nucleotide pairs are often unequal and can vary greatly between animal species and along chromosomes. The causes and consequences of this variation are incompletely understood. The recent release of high-quality genome sequences from the Darwin Tree of Life and other large-scale genome projects provides an opportunity for GC heterogeneity to be compared across a large number of insect species. Here we analyse GC content along chromosomes, and within protein-coding genes and codons, of 150 insect species from four holometabolous orders: Coleoptera, Diptera, Hymenoptera, and Lepidoptera. We find that protein-coding sequences have higher GC content than the genome average, and that Lepidoptera generally have higher GC content than the other three insect orders examined. GC content is higher in small chromosomes in most Lepidoptera species, but this pattern is less consistent in other orders. GC content also increases towards subtelomeric regions within protein-coding genes in Diptera, Coleoptera and Lepidoptera. Two species of Diptera, Bombylius major and B. discolor, have very atypical genomes with ubiquitous increase in AT content, especially at third codon positions. Despite dramatic AT-biased codon usage, we find no evidence that this has driven divergent protein evolution. We argue that the GC landscape of Lepidoptera, Diptera and Coleoptera genomes is influenced by GC-biased gene conversion, strongest in Lepidoptera, with some outlier taxa affected drastically by counteracting processes.
Collapse
Affiliation(s)
- Riccardo G Kyriacou
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK
| | - Peter O Mulhair
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK
| | - Peter W H Holland
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK.
| |
Collapse
|
23
|
Wright CJ, Stevens L, Mackintosh A, Lawniczak M, Blaxter M. Comparative genomics reveals the dynamics of chromosome evolution in Lepidoptera. Nat Ecol Evol 2024; 8:777-790. [PMID: 38383850 PMCID: PMC11009112 DOI: 10.1038/s41559-024-02329-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 01/12/2024] [Indexed: 02/23/2024]
Abstract
Chromosomes are a central unit of genome organization. One-tenth of all described species on Earth are butterflies and moths, the Lepidoptera, which generally possess 31 chromosomes. However, some species display dramatic variation in chromosome number. Here we analyse 210 chromosomally complete lepidopteran genomes and show that the chromosomes of extant lepidopterans are derived from 32 ancestral linkage groups, which we term Merian elements. Merian elements have remained largely intact through 250 million years of evolution and diversification. Against this stable background, eight lineages have undergone extensive reorganization either through numerous fissions or a combination of fusion and fission events. Outside these lineages, fusions are rare and fissions are rarer still. Fusions often involve small, repeat-rich Merian elements and the sex-linked element. Our results reveal the constraints on genome architecture in Lepidoptera and provide a deeper understanding of chromosomal rearrangements in eukaryotic genome evolution.
Collapse
Affiliation(s)
| | - Lewis Stevens
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | | | | | - Mark Blaxter
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK.
| |
Collapse
|
24
|
Olbrich M, Bartels L, Wohlers I. Sequencing technologies and hardware-accelerated parallel computing transform computational genomics research. FRONTIERS IN BIOINFORMATICS 2024; 4:1384497. [PMID: 38567256 PMCID: PMC10985184 DOI: 10.3389/fbinf.2024.1384497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 03/07/2024] [Indexed: 04/04/2024] Open
Affiliation(s)
- Michael Olbrich
- Center for Biotechnology, Khalifa University for Science and Technology, Abu Dhabi, United Arab Emirates
| | - Lennart Bartels
- Biomolecular Data Science in Pneumology, Research Center Borstel, Borstel, Germany
| | - Inken Wohlers
- Biomolecular Data Science in Pneumology, Research Center Borstel, Borstel, Germany
- University of Lübeck, Lübeck, Germany
| |
Collapse
|
25
|
Adkins P, Harley J, Brittain R, Scott-Somme K, Azzopardi F. The genome sequence of the John Dory, Zeus faber Linnaeus, 1758. Wellcome Open Res 2024; 9:150. [PMID: 38881949 PMCID: PMC11179049 DOI: 10.12688/wellcomeopenres.21140.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/04/2024] [Indexed: 06/18/2024] Open
Abstract
We present a genome assembly from an individual Zeus faber (the John Dory; Chordata; Actinopteri; Zeiformes; Zeidae). The genome sequence is 804.7 megabases in span. Most of the assembly is scaffolded into 22 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 16.72 kilobases in length.
Collapse
Affiliation(s)
- Patrick Adkins
- The Marine Biological Association, Plymouth, England, UK
| | - Joanna Harley
- The Marine Biological Association, Plymouth, England, UK
| | | | | | | |
Collapse
|
26
|
Skojec C, Godfrey RK, Kawahara AY. Long read genome assembly of Automeris io (Lepidoptera: Saturniidae) an emerging model for the evolution of deimatic displays. G3 (BETHESDA, MD.) 2024; 14:jkad292. [PMID: 38324397 PMCID: PMC10917498 DOI: 10.1093/g3journal/jkad292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 12/11/2023] [Indexed: 02/09/2024]
Abstract
Automeris moths are a morphologically diverse group with 135 described species that have a geographic range that spans from the New World temperate zone to the Neotropics. Many Automeris have elaborate hindwing eyespots that are thought to deter or disrupt the attack of potential predators, allowing the moth time to escape. The Io moth (Automeris io), known for its striking eyespots, is a well-studied species within the genus and is an emerging model system to study the evolution of deimatism. Existing research on the eyespot pattern development will be augmented by genomic resources that allow experimental manipulation of this emerging model. Here, we present a high-quality, PacBio HiFi genome assembly for Io moth to aid existing research on the molecular development of eyespots and future research on other deimatic traits. This 490 Mb assembly is highly contiguous (N50 = 15.78 mbs) and complete (benchmarking universal single-copy orthologs = 98.4%). Additionally, we were able to recover orthologs of genes previously identified as being involved in wing pattern formation and movement.
Collapse
Affiliation(s)
- Chelsea Skojec
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural
History, University of Florida, Gainesville, FL
32611, USA
- Department of Biology, University of Florida, 220 Bartram
Hall, Gainesville, FL 32611, USA
| | - R Keating Godfrey
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural
History, University of Florida, Gainesville, FL
32611, USA
| | - Akito Y Kawahara
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural
History, University of Florida, Gainesville, FL
32611, USA
- Department of Biology, University of Florida, 220 Bartram
Hall, Gainesville, FL 32611, USA
| |
Collapse
|
27
|
Schreiber M, Jayakodi M, Stein N, Mascher M. Plant pangenomes for crop improvement, biodiversity and evolution. Nat Rev Genet 2024:10.1038/s41576-024-00691-4. [PMID: 38378816 DOI: 10.1038/s41576-024-00691-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/14/2023] [Indexed: 02/22/2024]
Abstract
Plant genome sequences catalogue genes and the genetic elements that regulate their expression. Such inventories further research aims as diverse as mapping the molecular basis of trait diversity in domesticated plants or inquiries into the origin of evolutionary innovations in flowering plants millions of years ago. The transformative technological progress of DNA sequencing in the past two decades has enabled researchers to sequence ever more genomes with greater ease. Pangenomes - complete sequences of multiple individuals of a species or higher taxonomic unit - have now entered the geneticists' toolkit. The genomes of crop plants and their wild relatives are being studied with translational applications in breeding in mind. But pangenomes are applicable also in ecological and evolutionary studies, as they help classify and monitor biodiversity across the tree of life, deepen our understanding of how plant species diverged and show how plants adapt to changing environments or new selection pressures exerted by human beings.
Collapse
Affiliation(s)
- Mona Schreiber
- Department of Biology, University of Marburg, Marburg, Germany
| | - Murukarthick Jayakodi
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
- Martin Luther University Halle-Wittenberg, Halle (Saale), Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany.
| |
Collapse
|
28
|
Vancaester E, Blaxter ML. MarkerScan: Separation and assembly of cobionts sequenced alongside target species in biodiversity genomics projects. Wellcome Open Res 2024; 9:33. [PMID: 38617467 PMCID: PMC11016177 DOI: 10.12688/wellcomeopenres.20730.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/18/2023] [Indexed: 04/16/2024] Open
Abstract
Contamination of public databases by mislabelled sequences has been highlighted for many years and the avalanche of novel sequencing data now being deposited has the potential to make databases difficult to use effectively. It is therefore crucial that sequencing projects and database curators perform pre-submission checks to remove obvious contamination and avoid propagating erroneous taxonomic relationships. However, it is important also to recognise that biological contamination of a target sample with unexpected species' DNA can also lead to the discovery of fascinating biological phenomena through the identification of environmental organisms or endosymbionts. Here, we present a novel, integrated method for detection and generation of high-quality genomes of all non-target genomes co-sequenced in eukaryotic genome sequencing projects. After performing taxonomic profiling of an assembly from the raw data, and leveraging the identity of small rRNA sequences discovered therein as markers, a targeted classification approach retrieves and assembles high-quality genomes. The genomes of these cobionts are then not only removed from the target species' genome but also available for further interrogation. Source code is available from https://github.com/CobiontID/MarkerScan. MarkerScan is written in Python and is deployed as a Docker container.
Collapse
Affiliation(s)
| | - Mark L. Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| |
Collapse
|
29
|
Khalaf A, Lawniczak MKN, Blaxter ML, Jaron KS. Polyploidy is widespread in Microsporidia. Microbiol Spectr 2024; 12:e0366923. [PMID: 38214524 PMCID: PMC10845963 DOI: 10.1128/spectrum.03669-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 12/15/2023] [Indexed: 01/13/2024] Open
Abstract
Microsporidia are obligate intracellular eukaryotic parasites with an extremely broad host range. They have both economic and public health importance. Ploidy in microsporidia is variable, with a few species formally identified as diploid and one as polyploid. Given the increase in the number of studies sequencing microsporidian genomes, it is now possible to assess ploidy levels across all currently explored microsporidian diversity. We estimate ploidy for all microsporidian data sets available on the Sequence Read Archive using k-mer-based analyses, indicating that polyploidy is widespread in Microsporidia and that ploidy change is dynamic in the group. Using genome-wide heterozygosity estimates, we also show that polyploid microsporidian genomes are relatively homozygous, and we discuss the implications of these findings on the timing of polyploidization events and their origin.IMPORTANCEMicrosporidia are single-celled intracellular parasites, distantly related to fungi, that can infect a broad range of hosts, from humans all the way to protozoans. Exploiting the wealth of microsporidian genomic data available, we use k-mer-based analyses to assess ploidy status across the group. Understanding a genome's ploidy is crucial in order to assemble it effectively and may also be relevant for better understanding a parasite's behavior and life cycle. We show that tetraploidy is present in at least six species in Microsporidia and that these polyploidization events are likely to have occurred independently. We discuss why these findings may be paradoxical, given that Microsporidia, like other intracellular parasites, have extremely small, reduced genomes.
Collapse
Affiliation(s)
- Amjad Khalaf
- Tree of Life, Wellcome Sanger Institute, Cambridge, United Kingdom
| | | | - Mark L. Blaxter
- Tree of Life, Wellcome Sanger Institute, Cambridge, United Kingdom
| | - Kamil S. Jaron
- Tree of Life, Wellcome Sanger Institute, Cambridge, United Kingdom
| |
Collapse
|
30
|
Höglund J, Dias G, Olsen RA, Soares A, Bunikis I, Talla V, Backström N. A Chromosome-Level Genome Assembly and Annotation for the Clouded Apollo Butterfly (Parnassius mnemosyne): A Species of Global Conservation Concern. Genome Biol Evol 2024; 16:evae031. [PMID: 38368625 PMCID: PMC10901555 DOI: 10.1093/gbe/evae031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 02/06/2024] [Accepted: 02/10/2024] [Indexed: 02/20/2024] Open
Abstract
The clouded apollo (Parnassius mnemosyne) is a palearctic butterfly distributed over a large part of western Eurasia, but population declines and fragmentation have been observed in many parts of the range. The development of genomic tools can help to shed light on the genetic consequences of the decline and to make informed decisions about direct conservation actions. Here, we present a high-contiguity, chromosome-level genome assembly of a female clouded apollo butterfly and provide detailed annotations of genes and transposable elements. We find that the large genome (1.5 Gb) of the clouded apollo is extraordinarily repeat rich (73%). Despite that, the combination of sequencing techniques allowed us to assemble all chromosomes (nc = 29) to a high degree of completeness. The annotation resulted in a relatively high number of protein-coding genes (22,854) compared with other Lepidoptera, of which a large proportion (21,635) could be assigned functions based on homology with other species. A comparative analysis indicates that overall genome structure has been largely conserved, both within the genus and compared with the ancestral lepidopteran karyotype. The high-quality genome assembly and detailed annotation presented here will constitute an important tool for forthcoming efforts aimed at understanding the genetic consequences of fragmentation and decline, as well as for assessments of genetic diversity, population structure, inbreeding, and genetic load in the clouded apollo butterfly.
Collapse
Affiliation(s)
- Jacob Höglund
- Animal Ecology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala SE-752 36, Sweden
| | - Guilherme Dias
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Uppsala 752 37, Sweden
| | - Remi-André Olsen
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna 17165, Sweden
| | - André Soares
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Uppsala 752 37, Sweden
| | - Ignas Bunikis
- Uppsala Genome Center, Department of Immunology, Genetics and Pathology, Uppsala University, National Genomics Infrastructure hosted by SciLifeLab, Uppsala, Sweden
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala 752 37, Sweden
| | - Venkat Talla
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala SE-752 36, Sweden
| | - Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala SE-752 36, Sweden
| |
Collapse
|
31
|
Schiebelhut LM, Guillaume AS, Kuhn A, Schweizer RM, Armstrong EE, Beaumont MA, Byrne M, Cosart T, Hand BK, Howard L, Mussmann SM, Narum SR, Rasteiro R, Rivera-Colón AG, Saarman N, Sethuraman A, Taylor HR, Thomas GWC, Wellenreuther M, Luikart G. Genomics and conservation: Guidance from training to analyses and applications. Mol Ecol Resour 2024; 24:e13893. [PMID: 37966259 DOI: 10.1111/1755-0998.13893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 10/25/2023] [Accepted: 10/30/2023] [Indexed: 11/16/2023]
Abstract
Environmental change is intensifying the biodiversity crisis and threatening species across the tree of life. Conservation genomics can help inform conservation actions and slow biodiversity loss. However, more training, appropriate use of novel genomic methods and communication with managers are needed. Here, we review practical guidance to improve applied conservation genomics. We share insights aimed at ensuring effectiveness of conservation actions around three themes: (1) improving pedagogy and training in conservation genomics including for online global audiences, (2) conducting rigorous population genomic analyses properly considering theory, marker types and data interpretation and (3) facilitating communication and collaboration between managers and researchers. We aim to update students and professionals and expand their conservation toolkit with genomic principles and recent approaches for conserving and managing biodiversity. The biodiversity crisis is a global problem and, as such, requires international involvement, training, collaboration and frequent reviews of the literature and workshops as we do here.
Collapse
Affiliation(s)
- Lauren M Schiebelhut
- Life and Environmental Sciences, University of California, Merced, California, USA
| | - Annie S Guillaume
- Geospatial Molecular Epidemiology group (GEOME), Laboratory for Biological Geochemistry (LGB), École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Arianna Kuhn
- Department of Biological Sciences, University of Lethbridge, Lethbridge, Alberta, Canada
- Virginia Museum of Natural History, Martinsville, Virginia, USA
| | - Rena M Schweizer
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | | | - Mark A Beaumont
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Margaret Byrne
- Department of Biodiversity, Conservation and Attractions, Biodiversity and Conservation Science, Perth, Western Australia, Australia
| | - Ted Cosart
- Flathead Lake Biology Station, University of Montana, Missoula, Montana, USA
| | - Brian K Hand
- Flathead Lake Biological Station, University of Montana, Polson, Montana, USA
| | - Leif Howard
- Flathead Lake Biology Station, University of Montana, Missoula, Montana, USA
| | - Steven M Mussmann
- Southwestern Native Aquatic Resources and Recovery Center, U.S. Fish & Wildlife Service, Dexter, New Mexico, USA
| | - Shawn R Narum
- Hagerman Genetics Lab, University of Idaho, Hagerman, Idaho, USA
| | - Rita Rasteiro
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
| | - Angel G Rivera-Colón
- Department of Evolution, Ecology, and Behavior, University of Illinois at Urbana-Champaign, Champaign, Illinois, USA
| | - Norah Saarman
- Department of Biology and Ecology Center, Utah State University, Logan, Utah, USA
| | - Arun Sethuraman
- Department of Biology, San Diego State University, San Diego, California, USA
| | - Helen R Taylor
- Royal Zoological Society of Scotland, Edinburgh, Scotland
| | - Gregg W C Thomas
- Informatics Group, Harvard University, Cambridge, Massachusetts, USA
| | - Maren Wellenreuther
- Plant and Food Research, Nelson, New Zealand
- University of Auckland, Auckland, New Zealand
| | - Gordon Luikart
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Flathead Lake Biology Station, University of Montana, Missoula, Montana, USA
| |
Collapse
|
32
|
Roell MS, Ott MC, Mair MM, Pamminger T. Missing Genomic Resources for the Next Generation of Environmental Risk Assessment. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:1877-1881. [PMID: 38245867 PMCID: PMC10832041 DOI: 10.1021/acs.est.3c08701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 12/13/2023] [Accepted: 12/13/2023] [Indexed: 01/22/2024]
Abstract
Environmental risk assessment traditionally relies on a wide range of in vivo testing to assess the potential hazards of chemicals in the environment. These tests are often time-consuming and costly and can cause test organisms' suffering. Recent developments of reliable low-cost alternatives, both in vivo- and in silico-based, opened the door to reconsider current toxicity assessment. However, many of these new approach methodologies (NAMs) rely on high-quality annotated genomes for surrogate species of regulatory risk assessment. Currently, a lack of genomic information slows the process of NAM development. Here, we present a phylogenetically resolved overview of missing genomic resources for surrogate species within a regulatory ecotoxicological risk assessment. We call for an organized and systematic effort within the (regulatory) ecotoxicological community to provide these missing genomic resources. Further, we discuss the potential of a standardized genomic surrogate species landscape to enable a robust and nonanimal-reliant ecotoxicological risk assessment in the systems ecotoxicology era.
Collapse
Affiliation(s)
- Marc-Sven Roell
- R&D
Bayer AG, Crop Science Division, Monheim am Rhein 40789, Germany
| | | | - Magdalena M. Mair
- Bayreuth
Center for Ecology and Environmental Research (BayCEER), Bayreuth 95447, Germany
- Statistical
Ecotoxicology, University of Bayreuth, Bayreuth 95447, Germany
| | - Tobias Pamminger
- R&D
Bayer AG, Crop Science Division, Monheim am Rhein 40789, Germany
| |
Collapse
|
33
|
Harrison PW, Amode MR, Austine-Orimoloye O, Azov A, Barba M, Barnes I, Becker A, Bennett R, Berry A, Bhai J, Bhurji SK, Boddu S, Branco Lins PR, Brooks L, Ramaraju S, Campbell L, Martinez MC, Charkhchi M, Chougule K, Cockburn A, Davidson C, De Silva N, Dodiya K, Donaldson S, El Houdaigui B, Naboulsi T, Fatima R, Giron CG, Genez T, Grigoriadis D, Ghattaoraya G, Martinez JG, Gurbich T, Hardy M, Hollis Z, Hourlier T, Hunt T, Kay M, Kaykala V, Le T, Lemos D, Lodha D, Marques-Coelho D, Maslen G, Merino G, Mirabueno L, Mushtaq A, Hossain S, Ogeh D, Sakthivel MP, Parker A, Perry M, Piližota I, Poppleton D, Prosovetskaia I, Raj S, Pérez-Silva J, Salam A, Saraf S, Saraiva-Agostinho N, Sheppard D, Sinha S, Sipos B, Sitnik V, Stark W, Steed E, Suner MM, Surapaneni L, Sutinen K, Tricomi FF, Urbina-Gómez D, Veidenberg A, Walsh TA, Ware D, Wass E, Willhoft N, Allen J, Alvarez-Jarreta J, Chakiachvili M, Flint B, Giorgetti S, Haggerty L, Ilsley G, Keatley J, Loveland J, Moore B, Mudge J, Naamati G, Tate J, Trevanion S, Winterbottom A, Frankish A, Hunt SE, Cunningham F, Dyer S, Finn R, Martin F, Yates A. Ensembl 2024. Nucleic Acids Res 2024; 52:D891-D899. [PMID: 37953337 PMCID: PMC10767893 DOI: 10.1093/nar/gkad1049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/20/2023] [Accepted: 10/24/2023] [Indexed: 11/14/2023] Open
Abstract
Ensembl (https://www.ensembl.org) is a freely available genomic resource that has produced high-quality annotations, tools, and services for vertebrates and model organisms for more than two decades. In recent years, there has been a dramatic shift in the genomic landscape, with a large increase in the number and phylogenetic breadth of high-quality reference genomes, alongside major advances in the pan-genome representations of higher species. In order to support these efforts and accelerate downstream research, Ensembl continues to focus on scaling for the rapid annotation of new genome assemblies, developing new methods for comparative analysis, and expanding the depth and quality of our genome annotations. This year we have continued our expansion to support global biodiversity research, doubling the number of annotated genomes we support on our Rapid Release site to over 1700, driven by our close collaboration with biodiversity projects such as Darwin Tree of Life. We have also strengthened support for key agricultural species, including the first regulatory builds for farmed animals, and have updated key tools and resources that support the global scientific community, notably the Ensembl Variant Effect Predictor. Ensembl data, software, and tools are freely available.
Collapse
Affiliation(s)
- Peter W Harrison
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - M Ridwan Amode
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Olanrewaju Austine-Orimoloye
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Andrey G Azov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Matthieu Barba
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - If Barnes
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Arne Becker
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Ruth Bennett
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Andrew Berry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Jyothish Bhai
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Simarpreet Kaur Bhurji
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Sanjay Boddu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Paulo R Branco Lins
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Lucy Brooks
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Shashank Budhanuru Ramaraju
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Lahcen I Campbell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Manuel Carbajo Martinez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Mehrnaz Charkhchi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
| | - Alexander Cockburn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Claire Davidson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Nishadi H De Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Kamalkumar Dodiya
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Sarah Donaldson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Bilal El Houdaigui
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Tamara El Naboulsi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Reham Fatima
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Carlos Garcia Giron
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Thiago Genez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Dionysios Grigoriadis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Gurpreet S Ghattaoraya
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Jose Gonzalez Martinez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Tatiana A Gurbich
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Matthew Hardy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Zoe Hollis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Mike Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Vinay Kaykala
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Tuan Le
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Diana Lemos
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Disha Lodha
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Diego Marques-Coelho
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Gareth Maslen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Gabriela Alejandra Merino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Louisse Paola Mirabueno
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Aleena Mushtaq
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Syed Nakib Hossain
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Denye N Ogeh
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Manoj Pandian Sakthivel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Anne Parker
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Malcolm Perry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Ivana Piližota
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Daniel Poppleton
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Irina Prosovetskaia
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Shriya Raj
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - José G Pérez-Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Ahamed Imran Abdul Salam
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Shradha Saraf
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Nuno Saraiva-Agostinho
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Dan Sheppard
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Swati Sinha
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Botond Sipos
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Vasily Sitnik
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - William Stark
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Emily Steed
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Marie-Marthe Suner
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Likhitha Surapaneni
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Kyösti Sutinen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Francesca Floriana Tricomi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - David Urbina-Gómez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Andres Veidenberg
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Thomas A Walsh
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Doreen Ware
- Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
- USDA ARS NAA Robert W. Holley Center for Agriculture and Health, Agricultural Research Service, Ithaca, NY 14853, USA
| | - Elizabeth Wass
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Natalie L Willhoft
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Jamie Allen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Jorge Alvarez-Jarreta
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Marc Chakiachvili
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Bethany Flint
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Stefano Giorgetti
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Garth R Ilsley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Jon Keatley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Jane E Loveland
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Benjamin Moore
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Guy Naamati
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - John Tate
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Stephen J Trevanion
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Andrea Winterbottom
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Sarah Dyer
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| | - Andrew D Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, Cambridgeshire CB10 1SD, UK
| |
Collapse
|
34
|
Boyes D, Blaxter ML. The genome sequence of the Tipped Oak Case-bearer, Coleophora flavipennella (Duponchel 1843). Wellcome Open Res 2024; 9:3. [PMID: 38686023 PMCID: PMC11056683 DOI: 10.12688/wellcomeopenres.19917.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/17/2023] [Indexed: 05/02/2024] Open
Abstract
We present a genome assembly from an individual female Coleophora flavipennella (the Tipped Oak Case-bearer; Arthropoda; Insecta; Lepidoptera; Coleophoridae). The genome sequence is 989.3 megabases in span. Most of the assembly is scaffolded into 57 chromosomal pseudomolecules, including the W and Z sex chromosomes. The mitochondrial genome has also been assembled and is 15.77 kilobases in length.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
35
|
Curd EE, Gal L, Gallego R, Silliman K, Nielsen S, Gold Z. rCRUX: A Rapid and Versatile Tool for Generating Metabarcoding Reference libraries in R. ENVIRONMENTAL DNA (HOBOKEN, N.J.) 2024; 6:e489. [PMID: 38370872 PMCID: PMC10871694 DOI: 10.1002/edn3.489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 10/19/2023] [Indexed: 02/20/2024]
Abstract
The sequencing revolution requires accurate taxonomic classification of DNA sequences. Key to making accurate taxonomic assignments are curated, comprehensive reference barcode databases. However, the generation and curation of such databases has remained challenging given the large and continuously growing volumes of both DNA sequence data and novel reference barcode targets. Monitoring and research applications require a greater diversity of specialized gene regions and targeted taxa then are currently curated by professional staff. Thus there is a growing need for an easy to implement computational tool that can generate comprehensive metabarcoding reference libraries for any bespoke locus. We address this need by reimagining CRUX from the Anacapa Toolkit and present the rCRUX package in R which, like it's predecessor, relies on sequence homology and PCR primer compatibility instead of keyword-searches to avoid limitations of user-defined metadata. The typical workflow involves searching for plausible seed amplicons (get_seeds_local() or get_seeds_remote()) by simulating in silico PCR to acquire a set of sequences analogous to PCR products containing a user-defined set of primer sequences. Next, these seeds are used to iteratively blast search seed sequences against a local copy of the National Center for Biotechnology Information (NCBI) formatted nt database using a taxonomic-rank based stratified random sampling approach ( blast_seeds() ). This results in a comprehensive set of sequence matches. This database is dereplicated and cleaned (derep_and_clean_db()) by identifying identical reference sequences and collapsing the taxonomic path to the lowest taxonomic agreement across all matching reads. This results in a curated, comprehensive database of primer-specific reference barcode sequences from NCBI. Databases can then be compared (compare_db()) to determine read and taxonomic overlap. We demonstrate that rCRUX provides more comprehensive reference databases for the MiFish Universal Teleost 12S, Taberlet trnl, fungal ITS, and Leray CO1 loci than CRABS, MetaCurator, RESCRIPt, and ecoPCR reference databases. We then further demonstrate the utility of rCRUX by generating 24 reference databases for 20 metabarcoding loci, many of which lack dedicated reference database curation efforts. The rCRUX package provides a simple to use tool for the generation of curated, comprehensive reference databases for user-defined loci, facilitating accurate and effective taxonomic classification of metabarcoding and DNA sequence efforts broadly.
Collapse
Affiliation(s)
- Emily E. Curd
- Vermont Biomedical Research Network, University of Vermont, VT, USA
| | - Luna Gal
- Landmark College, VT, USA
- California Cooperative Oceanic Fisheries Investigations (CalCOFI), Scripps Institution of Oceanography, University of California San Diego (UCSD), La Jolla, CA, USA
| | - Ramon Gallego
- Departamento de Biología, Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain
| | - Katherine Silliman
- Northern Gulf Institute, Mississippi State University, Starkville, MS, USA
- NOAA Atlantic Oceanographic and Meteorological Laboratory, Miami, FL, USA
| | | | - Zachary Gold
- California Cooperative Oceanic Fisheries Investigations (CalCOFI), Scripps Institution of Oceanography, University of California San Diego (UCSD), La Jolla, CA, USA
- NOAA Pacific Marine Environmental Laboratory, Seattle, WA, USA
| |
Collapse
|
36
|
Valerio F, Twort VG, Duplouy A. Screening Host Genomic Data for Wolbachia Infections. Methods Mol Biol 2024; 2739:251-274. [PMID: 38006557 DOI: 10.1007/978-1-0716-3553-7_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2023]
Abstract
Less than a decade ago, the production of Wolbachia genomic assemblies was tedious, time-consuming, and expensive. The production of Wolbachia genomic DNA free of contamination from host DNA, as required for Wolbachia-targeted sequencing, was then only possible after the amplification and extraction of a large amount of clonal Wolbachia DNA. However, as an endosymbiotic bacterium, Wolbachia does not grow outside the host cell environment, and large-scale recovery of the bacteria required mass rearing of their host, preferably clones of a single individual to avoid strain genetic diversity, or amplification of cell cultures infected with a single Wolbachia strain. Bacterial DNA could be separated from host DNA based on genomic size. Nowadays, the production of full Wolbachia genomes does not require the physical isolation of the bacterial strains from their respective hosts, and the bacterium is often sequenced as a by-catch of host genomic projects. Here, we provide a step-by-step protocol to (1) identify whether host genome projects contain reads from associated Wolbachia and (2) isolate/retrieve the Wolbachia reads from the rest of the sequenced material. We hope this simple protocol will support many projects aiming at studying diverse Wolbachia genome assemblies.
Collapse
Affiliation(s)
- Federica Valerio
- Insect Symbiosis Ecology and Evolution, Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki, Finland
- Research Centre for Ecological Changes, University of Helsinki, Helsinki, Finland
| | - Victoria G Twort
- The Finnish Museum of Natural History, Luomus, University of Helsinki, Helsinki, Finland
| | - Anne Duplouy
- Insect Symbiosis Ecology and Evolution, Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki, Finland.
- Research Centre for Ecological Changes, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
37
|
Mabbitt PD. Structural biology in the Anthropocene epoch. Nat Struct Mol Biol 2024; 31:2. [PMID: 38253664 DOI: 10.1038/s41594-024-01222-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
|
38
|
Villegas LI, Ferretti L, Wiehe T, Waldvogel A, Schiffer PH. Parthenogenomics: Insights on mutation rates and nucleotide diversity in parthenogenetic Panagrolaimus nematodes. Ecol Evol 2024; 14:e10831. [PMID: 38192904 PMCID: PMC10771965 DOI: 10.1002/ece3.10831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 12/11/2023] [Accepted: 12/13/2023] [Indexed: 01/10/2024] Open
Abstract
Asexual reproduction is assumed to lead to the accumulation of deleterious mutations, and reduced heterozygosity due to the absence of recombination. Panagrolaimid nematode species display different modes of reproduction. Sexual reproduction with distinct males and females, asexual reproduction through parthenogenesis in the genus Panagrolaimus, and hermaphroditism in Propanagrolaimus. Here, we compared genomic features of free-living nematodes in populations and species isolated from geographically distant regions to study diversity, and genome-wide differentiation under different modes of reproduction. We firstly estimated genome-wide spontaneous mutation rates in a triploid parthenogenetic Panagrolaimus, and a diploid hermaphroditic Propanagrolaimus via long-term mutation accumulation lines. Secondly, we calculated population genetic parameters including nucleotide diversity, and fixation index (F ST) between populations of asexually and sexually reproducing nematodes. Thirdly, we used phylogenetic network methods on sexually and asexually reproducing Panagrolaimus populations to understand evolutionary relationships between them. The estimated mutation rate was slightly lower for the asexual population, as expected for taxa with this reproductive mode. Natural polyploid asexual populations revealed higher nucleotide diversity. Despite their common ancestor, a gene network revealed a high level of genetic differentiation among asexual populations. The elevated heterozygosity found in the triploid parthenogens could be explained by the third genome copy. Given their tendentially lower mutation rates it can be hypothesized that this is part of the mechanism to evade Muller's ratchet. Our findings in parthenogenetic triploid nematode populations seem to challenge common expectations of evolution under asexuality.
Collapse
Affiliation(s)
| | | | - Thomas Wiehe
- Institute for GeneticsUniversity of CologneKölnGermany
| | | | | |
Collapse
|
39
|
Minich JJ, Moore ML, Allsing NA, Aylward A, Murray ER, Tran L, Michael TP. Generating high-quality plant and fish reference genomes from field-collected specimens by optimizing preservation. Commun Biol 2023; 6:1246. [PMID: 38071270 PMCID: PMC10710401 DOI: 10.1038/s42003-023-05615-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 11/20/2023] [Indexed: 12/18/2023] Open
Abstract
Sample preservation often impedes efforts to generate high-quality reference genomes or pangenomes for Earth's more than 2 million plant and animal species due to nucleotide degradation. Here we compare the impacts of storage methods including solution type, temperature, and time on DNA quality and Oxford Nanopore long-read sequencing quality in 9 fish and 4 plant species. We show 95% ethanol largely protects against degradation for fish blood (22 °C, ≤6 weeks) and plant tissue (4 °C, ≤3 weeks). From this furthest storage timepoint, we assemble high-quality reference genomes of 3 fish and 2 plant species with contiguity (contig N50) and completeness (BUSCO) that achieve the Vertebrate Genome Project benchmarking standards. For epigenetic applications, we also report methylation frequency compared to liquid nitrogen control. The results presented here remove the necessity for cryogenic storage in many long read applications and provide a framework for future studies focused on sampling in remote locations, which may represent a large portion of the future sequencing of novel organisms.
Collapse
Affiliation(s)
- Jeremiah J Minich
- The Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA, 92037, USA
| | - Malia L Moore
- The Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA, 92037, USA
- Scripps Institution of Oceanography, University of California San Diego, 8622 Kennel Way, La Jolla, CA, 92093, USA
| | - Nicholas A Allsing
- The Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA, 92037, USA
| | - Anthony Aylward
- The Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA, 92037, USA
| | - Emily R Murray
- The Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA, 92037, USA
| | - Loi Tran
- The Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA, 92037, USA
| | - Todd P Michael
- The Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA, 92037, USA.
| |
Collapse
|
40
|
Stevens L, Martínez-Ugalde I, King E, Wagah M, Absolon D, Bancroft R, Gonzalez de la Rosa P, Hall JL, Kieninger M, Kloch A, Pelan S, Robertson E, Pedersen AB, Abreu-Goodger C, Buck AH, Blaxter M. Ancient diversity in host-parasite interaction genes in a model parasitic nematode. Nat Commun 2023; 14:7776. [PMID: 38012132 PMCID: PMC10682056 DOI: 10.1038/s41467-023-43556-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 11/13/2023] [Indexed: 11/29/2023] Open
Abstract
Host-parasite interactions exert strong selection pressures on the genomes of both host and parasite. These interactions can lead to negative frequency-dependent selection, a form of balancing selection that is hypothesised to explain the high levels of polymorphism seen in many host immune and parasite antigen loci. Here, we sequence the genomes of several individuals of Heligmosomoides bakeri, a model parasite of house mice, and Heligmosomoides polygyrus, a closely related parasite of wood mice. Although H. bakeri is commonly referred to as H. polygyrus in the literature, their genomes show levels of divergence that are consistent with at least a million years of independent evolution. The genomes of both species contain hyper-divergent haplotypes that are enriched for proteins that interact with the host immune response. Many of these haplotypes originated prior to the divergence between H. bakeri and H. polygyrus, suggesting that they have been maintained by long-term balancing selection. Together, our results suggest that the selection pressures exerted by the host immune response have played a key role in shaping patterns of genetic diversity in the genomes of parasitic nematodes.
Collapse
Affiliation(s)
- Lewis Stevens
- Tree of Life, Wellcome Sanger Institute, Hinxton, UK.
| | - Isaac Martínez-Ugalde
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Erna King
- Tree of Life, Wellcome Sanger Institute, Hinxton, UK
| | - Martin Wagah
- Tree of Life, Wellcome Sanger Institute, Hinxton, UK
| | | | - Rowan Bancroft
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | | | - Jessica L Hall
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | | | | | - Sarah Pelan
- Tree of Life, Wellcome Sanger Institute, Hinxton, UK
| | - Elaine Robertson
- Institute of Immunology & Infection Research, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Amy B Pedersen
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Cei Abreu-Goodger
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Amy H Buck
- Institute of Immunology & Infection Research, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Mark Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, UK.
| |
Collapse
|
41
|
Rodriguez Ruiz A, Van Dam AR. Metagenomic binning of PacBio HiFi data prior to assembly reveals a complete genome of Cosmopolites sordidus (Germar) (Coleopterea: Curculionidae, Dryophthorinae) the most damaging arthropod pest of bananas and plantains. PeerJ 2023; 11:e16276. [PMID: 38025758 PMCID: PMC10676084 DOI: 10.7717/peerj.16276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 09/20/2023] [Indexed: 12/01/2023] Open
Abstract
PacBio HiFi sequencing was employed in combination with metagenomic binning to produce a high-quality reference genome of Cosmopolites sordidus. We compared k-mer and alignment reference based pre-binning and post-binning approaches to remove contamination. We were also interested to know if the post-binning approach had interspersed bacterial contamination within intragenic regions of Arthropoda binned contigs. Our analyses identified 3,433 genes that were composed with reads identified as of putative bacterial origins. The pre-binning approach yielded a C. sordidus genome of 1.07 Gb genome composed of 3,089 contigs with 98.6% and 97.1% complete and single copy genome and protein BUSCO scores respectively. In this article we demonstrate that in this case the pre-binning approach does not sacrifice assembly quality for more stringent metagenomic filtering. We also determine post-binning allows for increased intragenic contamination increased with increasing coverage, but the frequency of gene contamination increased with lower coverage. Future work should focus on developing reference free pre-binning approaches for HiFi reads produced from eukaryotic based metagenomic samples.
Collapse
Affiliation(s)
- Alfredo Rodriguez Ruiz
- Departamento de Biología, Universidad de Puerto Rico Recinto Universitario de Mayagüez, Mayagüez, Puerto Rico, United States of America
| | - Alex R. Van Dam
- Departamento de Biología, Universidad de Puerto Rico Recinto Universitario de Mayagüez, Mayagüez, Puerto Rico, United States of America
| |
Collapse
|
42
|
Liao IJY, Lu TM, Chen ME, Luo YJ. Spiralian genomics and the evolution of animal genome architecture. Brief Funct Genomics 2023; 22:498-508. [PMID: 37507111 DOI: 10.1093/bfgp/elad029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 06/27/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023] Open
Abstract
Recent developments in sequencing technologies have greatly improved our knowledge of phylogenetic relationships and genomic architectures throughout the tree of life. Spiralia, a diverse clade within Protostomia, is essential for understanding the evolutionary history of parasitism, gene conversion, nervous systems and animal body plans. In this review, we focus on the current hypotheses of spiralian phylogeny and investigate the impact of long-read sequencing on the quality of genome assemblies. We examine chromosome-level assemblies to highlight key genomic features that have driven spiralian evolution, including karyotype, synteny and the Hox gene organization. In addition, we show how chromosome rearrangement has influenced spiralian genomic structures. Although spiralian genomes have undergone substantial changes, they exhibit both conserved and lineage-specific features. We recommend increasing sequencing efforts and expanding functional genomics research to deepen insights into spiralian biology.
Collapse
|
43
|
Mulhair PO, Crowley L, Boyes DH, Lewis OT, Holland PWH. Opsin Gene Duplication in Lepidoptera: Retrotransposition, Sex Linkage, and Gene Expression. Mol Biol Evol 2023; 40:msad241. [PMID: 37935057 PMCID: PMC10642689 DOI: 10.1093/molbev/msad241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 10/20/2023] [Accepted: 10/26/2023] [Indexed: 11/09/2023] Open
Abstract
Color vision in insects is determined by signaling cascades, central to which are opsin proteins, resulting in sensitivity to light at different wavelengths. In certain insect groups, lineage-specific evolution of opsin genes, in terms of copy number, shifts in expression patterns, and functional amino acid substitutions, has resulted in changes in color vision with subsequent behavioral and niche adaptations. Lepidoptera are a fascinating model to address whether evolutionary change in opsin content and sequence evolution are associated with changes in vision phenotype. Until recently, the lack of high-quality genome data representing broad sampling across the lepidopteran phylogeny has greatly limited our ability to accurately address this question. Here, we annotate opsin genes in 219 lepidopteran genomes representing 33 families, reconstruct their evolutionary history, and analyze shifts in selective pressures and expression between genes and species. We discover 44 duplication events in opsin genes across ∼300 million years of lepidopteran evolution. While many duplication events are species or family specific, we find retention of an ancient long-wavelength-sensitive (LW) opsin duplication derived by retrotransposition within the speciose superfamily Noctuoidea (in the families Nolidae, Erebidae, and Noctuidae). This conserved LW retrogene shows life stage-specific expression suggesting visual sensitivities or other sensory functions specific to the early larval stage. This study provides a comprehensive order-wide view of opsin evolution across Lepidoptera, showcasing high rates of opsin duplications and changes in expression patterns.
Collapse
Affiliation(s)
- Peter O Mulhair
- Department of Biology, University of Oxford, Oxford OX1 3SZ, UK
| | - Liam Crowley
- Department of Biology, University of Oxford, Oxford OX1 3SZ, UK
| | | | - Owen T Lewis
- Department of Biology, University of Oxford, Oxford OX1 3SZ, UK
| | | |
Collapse
|
44
|
Lim J, Kim W, Kim J, Lee J. Telomeric repeat evolution in the phylum Nematoda revealed by high-quality genome assemblies and subtelomere structures. Genome Res 2023; 33:gr.278124.123. [PMID: 37918961 PMCID: PMC10760449 DOI: 10.1101/gr.278124.123] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 10/16/2023] [Indexed: 11/04/2023]
Abstract
Telomeres are composed of tandem arrays of telomeric-repeat motifs (TRMs) and telomere-binding proteins (TBPs), which are responsible for ensuring end-protection and end-replication of chromosomes. TRMs are highly conserved owing to the sequence specificity of TBPs, although significant alterations in TRM have been observed in several taxa, except Nematoda. We used public whole-genome sequencing data sets to analyze putative TRMs of 100 nematode species and determined that three distinct branches included specific novel TRMs, suggesting that evolutionary alterations in TRMs occurred in Nematoda. We focused on one of the three branches, the Panagrolaimidae family, and performed a de novo assembly of four high-quality draft genomes of the canonical (TTAGGC) and novel TRM (TTAGAC) isolates; the latter genomes revealed densely clustered arrays of the novel TRM. We then comprehensively analyzed the subtelomeric regions of the genomes to infer how the novel TRM evolved. We identified DNA damage-repair signatures in subtelomeric sequences that were representative of consequences of telomere maintenance mechanisms by alternative lengthening of telomeres. We propose a hypothetical scenario in which TTAGAC-containing units are clustered in subtelomeric regions and pre-existing TBPs capable of binding both canonical and novel TRMs aided the evolution of the novel TRM in the Panagrolaimidae family.
Collapse
Affiliation(s)
- Jiseon Lim
- Department of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 08826, South Korea
| | - Wonjoo Kim
- Department of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 08826, South Korea
| | - Jun Kim
- Department of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea;
- Research Institute of Basic Sciences, Seoul National University, Seoul 08826, South Korea
- Department of Convergent Bioscience and Informatics, College of Bioscience and Biotechnology, Chungnam National University, Daejeon 34134, South Korea
| | - Junho Lee
- Department of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 08826, South Korea
- Research Institute of Basic Sciences, Seoul National University, Seoul 08826, South Korea
| |
Collapse
|
45
|
Mee L, Barribeau SM. Influence of social lifestyles on host-microbe symbioses in the bees. Ecol Evol 2023; 13:e10679. [PMID: 37928198 PMCID: PMC10620586 DOI: 10.1002/ece3.10679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 10/16/2023] [Accepted: 10/17/2023] [Indexed: 11/07/2023] Open
Abstract
Microbiomes are increasingly recognised as critical for the health of an organism. In eusocial insect societies, frequent social interactions allow for high-fidelity transmission of microbes across generations, leading to closer host-microbe coevolution. The microbial communities of bees with other social lifestyles are less studied, and few comparisons have been made between taxa that vary in social structure. To address this gap, we leveraged a cloud-computing resource and publicly available transcriptomic data to conduct a survey of microbial diversity in bee samples from a variety of social lifestyles and taxa. We consistently recover the core microbes of well-studied corbiculate bees, supporting this method's ability to accurately characterise microbial communities. We find that the bacterial communities of bees are influenced by host location, phylogeny and social lifestyle, although no clear effect was found for fungal or viral microbial communities. Bee genera with more complex societies tend to harbour more diverse microbes, with Wolbachia detected more commonly in solitary tribes. We present a description of the microbiota of Euglossine bees and find that they do not share the "corbiculate core" microbiome. Notably, we find that bacteria with known anti-pathogenic properties are present across social bee genera, suggesting that symbioses that enhance host immunity are important with higher sociality. Our approach provides an inexpensive means of exploring microbiomes of a given taxa and identifying avenues for further research. These findings contribute to our understanding of the relationships between bees and their associated microbial communities, highlighting the importance of considering microbiome dynamics in investigations of bee health.
Collapse
Affiliation(s)
- Lauren Mee
- Institute of Infection, Veterinary and Ecological Sciences, Department of Evolution, Ecology and BehaviourUniversity of LiverpoolLiverpoolUK
| | - Seth M. Barribeau
- Institute of Infection, Veterinary and Ecological Sciences, Department of Evolution, Ecology and BehaviourUniversity of LiverpoolLiverpoolUK
| |
Collapse
|
46
|
Lucek K, Giménez MD, Joron M, Rafajlović M, Searle JB, Walden N, Westram AM, Faria R. The Impact of Chromosomal Rearrangements in Speciation: From Micro- to Macroevolution. Cold Spring Harb Perspect Biol 2023; 15:a041447. [PMID: 37604585 PMCID: PMC10626258 DOI: 10.1101/cshperspect.a041447] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/23/2023]
Abstract
Chromosomal rearrangements (CRs) have been known since almost the beginning of genetics. While an important role for CRs in speciation has been suggested, evidence primarily stems from theoretical and empirical studies focusing on the microevolutionary level (i.e., on taxon pairs where speciation is often incomplete). Although the role of CRs in eukaryotic speciation at a macroevolutionary level has been supported by associations between species diversity and rates of evolution of CRs across phylogenies, these findings are limited to a restricted range of CRs and taxa. Now that more broadly applicable and precise CR detection approaches have become available, we address the challenges in filling some of the conceptual and empirical gaps between micro- and macroevolutionary studies on the role of CRs in speciation. We synthesize what is known about the macroevolutionary impact of CRs and suggest new research avenues to overcome the pitfalls of previous studies to gain a more comprehensive understanding of the evolutionary significance of CRs in speciation across the tree of life.
Collapse
Affiliation(s)
- Kay Lucek
- Biodiversity Genomics Laboratory, Institute of Biology, University of Neuchâtel, 2000 Neuchâtel, Switzerland
| | - Mabel D Giménez
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Instituto de Genética Humana de Misiones (IGeHM), Parque de la Salud de la Provincia de Misiones "Dr. Ramón Madariaga," N3300KAZ Posadas, Misiones, Argentina
- Facultad de Ciencias Exactas Químicas y Naturales, Universidad Nacional de Misiones, N3300LQH Posadas, Misiones, Argentina
| | - Mathieu Joron
- Centre d'Ecologie Fonctionnelle et Evolutive, Université de Montpellier, CNRS, EPHE, IRD, 34293 Montpellier, France
| | - Marina Rafajlović
- Department of Marine Sciences, University of Gothenburg, 405 30 Gothenburg, Sweden
- Centre for Marine Evolutionary Biology, University of Gothenburg, 405 30 Gothenburg, Sweden
| | - Jeremy B Searle
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York 14853, USA
| | - Nora Walden
- Centre for Organismal Studies, University of Heidelberg, 69117 Heidelberg, Germany
| | - Anja Marie Westram
- Institute of Science and Technology Austria (ISTA), 3400 Klosterneuburg, Austria
- Faculty of Biosciences and Aquaculture, Nord University, 8026 Bodø, Norway
| | - Rui Faria
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado;
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal
| |
Collapse
|
47
|
Blaxter ML, Spurgeon D, Kille P. The genome sequence of the common earthworm, Lumbricus terrestris (Linnaeus, 1758). Wellcome Open Res 2023; 8:500. [PMID: 38249959 PMCID: PMC10799228 DOI: 10.12688/wellcomeopenres.20178.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/12/2023] [Indexed: 01/23/2024] Open
Abstract
We present a genome assembly from an individual Lumbricus terrestris (the common earthworm; Annelida; Clitellata; Haplotaxida; Lumbricidae). The genome sequence is 1,056.5 megabases in span. Most of the assembly is scaffolded into 18 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 15.93 kilobases in length.
Collapse
Affiliation(s)
- Mark L. Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - David Spurgeon
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | - Peter Kille
- Cardiff School of Biosciences, Cardiff University, Cardiff, Wales, UK
| | - Wellcome Sanger Institute Tree of Life programme
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
- Cardiff School of Biosciences, Cardiff University, Cardiff, Wales, UK
| | - Wellcome Sanger Institute Scientific Operations: DNA Pipelines collective
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
- Cardiff School of Biosciences, Cardiff University, Cardiff, Wales, UK
| | - Tree of Life Core Informatics collective
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
- Cardiff School of Biosciences, Cardiff University, Cardiff, Wales, UK
| | | |
Collapse
|
48
|
Ferrari G, Esselens L, Hart ML, Janssens S, Kidner C, Mascarello M, Peñalba JV, Pezzini F, von Rintelen T, Sonet G, Vangestel C, Virgilio M, Hollingsworth PM. Developing the Protocol Infrastructure for DNA Sequencing Natural History Collections. Biodivers Data J 2023; 11:e102317. [PMID: 38327316 PMCID: PMC10848826 DOI: 10.3897/bdj.11.e102317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 08/04/2023] [Indexed: 02/09/2024] Open
Abstract
Intentionally preserved biological material in natural history collections represents a vast repository of biodiversity. Advances in laboratory and sequencing technologies have made these specimens increasingly accessible for genomic analyses, offering a window into the genetic past of species and often permitting access to information that can no longer be sampled in the wild. Due to their age, preparation and storage conditions, DNA retrieved from museum and herbarium specimens is often poor in yield, heavily fragmented and biochemically modified. This not only poses methodological challenges in recovering nucleotide sequences, but also makes such investigations susceptible to environmental and laboratory contamination. In this paper, we review the practical challenges associated with making the recovery of DNA sequence data from museum collections more routine. We first review key operational principles and issues to address, to guide the decision-making process and dialogue between researchers and curators about when and how to sample museum specimens for genomic analyses. We then outline the range of steps that can be taken to reduce the likelihood of contamination including laboratory set-ups, workflows and working practices. We finish by presenting a series of case studies, each focusing on protocol practicalities for the application of different mainstream methodologies to museum specimens including: (i) shotgun sequencing of insect mitogenomes, (ii) whole genome sequencing of insects, (iii) genome skimming to recover plant plastid genomes from herbarium specimens, (iv) target capture of multi-locus nuclear sequences from herbarium specimens, (v) RAD-sequencing of bird specimens and (vi) shotgun sequencing of ancient bovid bone samples.
Collapse
Affiliation(s)
- Giada Ferrari
- Royal Botanic Garden Edinburgh, Edinburgh, United KingdomRoyal Botanic Garden EdinburghEdinburghUnited Kingdom
| | - Lore Esselens
- Royal Museum for Central Africa, Tervuren, BelgiumRoyal Museum for Central AfricaTervurenBelgium
- Royal Belgian Institute of Natural Sciences, Brussels, BelgiumRoyal Belgian Institute of Natural SciencesBrusselsBelgium
| | - Michelle L Hart
- Royal Botanic Garden Edinburgh, Edinburgh, United KingdomRoyal Botanic Garden EdinburghEdinburghUnited Kingdom
| | - Steven Janssens
- Meise Botanic Garden, Meise, BelgiumMeise Botanic GardenMeiseBelgium
- Leuven Plant Institute, Department of Biology, Leuven, BelgiumLeuven Plant Institute, Department of BiologyLeuvenBelgium
| | - Catherine Kidner
- Royal Botanic Garden Edinburgh, Edinburgh, United KingdomRoyal Botanic Garden EdinburghEdinburghUnited Kingdom
| | | | - Joshua V Peñalba
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Berlin, GermanyMuseum für Naturkunde, Leibniz Institute for Evolution and Biodiversity ScienceBerlinGermany
| | - Flávia Pezzini
- Royal Botanic Garden Edinburgh, Edinburgh, United KingdomRoyal Botanic Garden EdinburghEdinburghUnited Kingdom
| | - Thomas von Rintelen
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Berlin, GermanyMuseum für Naturkunde, Leibniz Institute for Evolution and Biodiversity ScienceBerlinGermany
| | - Gontran Sonet
- Royal Belgian Institute of Natural Sciences, Brussels, BelgiumRoyal Belgian Institute of Natural SciencesBrusselsBelgium
| | - Carl Vangestel
- Royal Belgian Institute of Natural Sciences, Brussels, BelgiumRoyal Belgian Institute of Natural SciencesBrusselsBelgium
| | - Massimiliano Virgilio
- Royal Museum for Central Africa, Department of African Zoology, Tervuren, BelgiumRoyal Museum for Central Africa, Department of African ZoologyTervurenBelgium
| | - Peter M Hollingsworth
- Royal Botanic Garden Edinburgh, Edinburgh, United KingdomRoyal Botanic Garden EdinburghEdinburghUnited Kingdom
| |
Collapse
|
49
|
Schneider B, Sweeney BA, Bateman A, Cerny J, Zok T, Szachniuk M. When will RNA get its AlphaFold moment? Nucleic Acids Res 2023; 51:9522-9532. [PMID: 37702120 PMCID: PMC10570031 DOI: 10.1093/nar/gkad726] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 08/13/2023] [Accepted: 08/22/2023] [Indexed: 09/14/2023] Open
Abstract
The protein structure prediction problem has been solved for many types of proteins by AlphaFold. Recently, there has been considerable excitement to build off the success of AlphaFold and predict the 3D structures of RNAs. RNA prediction methods use a variety of techniques, from physics-based to machine learning approaches. We believe that there are challenges preventing the successful development of deep learning-based methods like AlphaFold for RNA in the short term. Broadly speaking, the challenges are the limited number of structures and alignments making data-hungry deep learning methods unlikely to succeed. Additionally, there are several issues with the existing structure and sequence data, as they are often of insufficient quality, highly biased and missing key information. Here, we discuss these challenges in detail and suggest some steps to remedy the situation. We believe that it is possible to create an accurate RNA structure prediction method, but it will require solving several data quality and volume issues, usage of data beyond simple sequence alignments, or the development of new less data-hungry machine learning methods.
Collapse
Affiliation(s)
- Bohdan Schneider
- Institute of Biotechnology of the Czech Academy of Sciences, Prumyslova 595, CZ-252 50 Vestec, Czech Republic
| | - Blake Alexander Sweeney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK
| | - Jiri Cerny
- Institute of Biotechnology of the Czech Academy of Sciences, Prumyslova 595, CZ-252 50 Vestec, Czech Republic
| | - Tomasz Zok
- Institute of Computing Science and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| |
Collapse
|
50
|
Parker MT, Fica SM, Barton GJ, Simpson GG. Inter-species association mapping links splice site evolution to METTL16 and SNRNP27K. eLife 2023; 12:e91997. [PMID: 37787376 PMCID: PMC10581693 DOI: 10.7554/elife.91997] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 09/18/2023] [Indexed: 10/04/2023] Open
Abstract
Eukaryotic genes are interrupted by introns that are removed from transcribed RNAs by splicing. Patterns of splicing complexity differ between species, but it is unclear how these differences arise. We used inter-species association mapping with Saccharomycotina species to correlate splicing signal phenotypes with the presence or absence of splicing factors. Here, we show that variation in 5' splice site sequence preferences correlate with the presence of the U6 snRNA N6-methyladenosine methyltransferase METTL16 and the splicing factor SNRNP27K. The greatest variation in 5' splice site sequence occurred at the +4 position and involved a preference switch between adenosine and uridine. Loss of METTL16 and SNRNP27K orthologs, or a single SNRNP27K methionine residue, was associated with a preference for +4 U. These findings are consistent with splicing analyses of mutants defective in either METTL16 or SNRNP27K orthologs and models derived from spliceosome structures, demonstrating that inter-species association mapping is a powerful orthogonal approach to molecular studies. We identified variation between species in the occurrence of two major classes of 5' splice sites, defined by distinct interaction potentials with U5 and U6 snRNAs, that correlates with intron number. We conclude that variation in concerted processes of 5' splice site selection by U6 snRNA is associated with evolutionary changes in splicing signal phenotypes.
Collapse
Affiliation(s)
- Matthew T Parker
- School of Life Sciences, University of DundeeDundeeUnited Kingdom
| | - Sebastian M Fica
- Department of Biochemistry, University of OxfordOxfordUnited Kingdom
| | | | - Gordon G Simpson
- School of Life Sciences, University of DundeeDundeeUnited Kingdom
- Cell & Molecular Sciences, James Hutton InstituteInvergowrieUnited Kingdom
| |
Collapse
|