1
|
Li Z, Xu Z, Zhu L, Qin T, Ma J, Feng Z, Yue H, Guan Q, Zhou B, Han G, Zhang G, Li C, Jia S, Qiu Q, Hao D, Wang Y, Wang W. High-quality sika deer omics data and integrative analysis reveal genic and cellular regulation of antler regeneration. Genome Res 2025; 35:188-201. [PMID: 39542648 DOI: 10.1101/gr.279448.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Accepted: 10/28/2024] [Indexed: 11/17/2024]
Abstract
The antler is the only organ that can fully regenerate annually in mammals. However, the regulatory pattern and mechanism of gene expression and cell differentiation during this process remain largely unknown. Here, we obtain comprehensive assembly and gene annotation of the sika deer (Cervus nippon) genome. We construct, together with large-scale chromatin accessibility and gene expression data, gene regulatory networks involved in antler regeneration, identifying four transcription factors, MYC, KLF4, NFE2L2, and JDP2, with high regulatory activity across the whole regeneration process. Comparative studies and luciferase reporter assay suggest the MYC expression driven by a cervid-specific regulatory element might be important for antler regenerative ability. We further develop a model called combinatorial TF Oriented Program (cTOP), which integrates single-cell data with bulk regulatory networks and find PRDM1, FOSL1, BACH1, and NFATC1 as potential pivotal factors in antler stem cell activation and osteogenic differentiation. Additionally, we uncover interactions within and between cell programs and pathways during the regeneration process. These findings provide insights into the gene and cell regulatory mechanisms of antler regeneration, particularly in stem cell activation and differentiation.
Collapse
Affiliation(s)
- Zihe Li
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Ziyu Xu
- CEMS, NCMIS, HCMS, MADIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- School of Mathematics, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing 100049, China
| | - Lei Zhu
- Department of Spine Surgery, Honghui Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710054, China
- Shaanxi Key Laboratory of Spine Bionic Treatment, Xi'an, Shaanxi 710054, China
| | - Tao Qin
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Jinrui Ma
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Zhanying Feng
- CEMS, NCMIS, HCMS, MADIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- Department of Statistics, Department of Biomedical Data Science, Bio-X Program, Stanford University, Stanford, California 94305, USA
| | - Huishan Yue
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Qing Guan
- Key Laboratory of Genetic Evolution & Animal Models, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Botong Zhou
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Ge Han
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Guokun Zhang
- Institute of Antler Science and Product Technology, Changchun Sci-Tech University, 130600 Changchun, China
| | - Chunyi Li
- Institute of Antler Science and Product Technology, Changchun Sci-Tech University, 130600 Changchun, China
| | - Shuaijun Jia
- Department of Spine Surgery, Honghui Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710054, China
- Shaanxi Key Laboratory of Spine Bionic Treatment, Xi'an, Shaanxi 710054, China
| | - Qiang Qiu
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China;
| | - Dingjun Hao
- Department of Spine Surgery, Honghui Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710054, China;
- Shaanxi Key Laboratory of Spine Bionic Treatment, Xi'an, Shaanxi 710054, China
| | - Yong Wang
- CEMS, NCMIS, HCMS, MADIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China;
- School of Mathematics, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing 100049, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| | - Wen Wang
- New Cornerstone Science Laboratory, Shaanxi Key Laboratory of Qinling Ecological Intelligent Monitoring and Protection, School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China;
- Key Laboratory of Genetic Evolution & Animal Models, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| |
Collapse
|
2
|
Lewin TD, Liao IJY, Chen ME, Bishop JDD, Holland PWH, Luo YJ. Fusion, fission, and scrambling of the bilaterian genome in Bryozoa. Genome Res 2025; 35:78-92. [PMID: 39762050 DOI: 10.1101/gr.279636.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 10/31/2024] [Indexed: 01/24/2025]
Abstract
Groups of orthologous genes are commonly found together on the same chromosome over vast evolutionary distances. This extensive physical gene linkage, known as macrosynteny, is seen between bilaterian phyla as divergent as Chordata, Echinodermata, Mollusca, and Nemertea. Here, we report a unique pattern of genome evolution in Bryozoa, an understudied phylum of colonial invertebrates. Using comparative genomics, we reconstruct the chromosomal evolutionary history of five bryozoans. Multiple ancient chromosome fusions followed by gene mixing led to the near-complete loss of bilaterian linkage groups in the ancestor of extant bryozoans. A second wave of rearrangements, including chromosome fission, then occurred independently in two bryozoan classes, further scrambling bryozoan genomes. We also discover at least five derived chromosomal fusion events shared between bryozoans and brachiopods, supporting the traditional but highly debated Lophophorata hypothesis and suggesting macrosynteny to be a potentially powerful source of phylogenetic information. Finally, we show that genome rearrangements led to the dispersion of genes from bryozoan Hox clusters onto multiple chromosomes. Our findings demonstrate that the canonical bilaterian genome structure has been lost across all studied representatives of an entire phylum, and reveal that linkage group fission can occur very frequently in specific lineages.
Collapse
Affiliation(s)
- Thomas D Lewin
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
| | | | - Mu-En Chen
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
| | - John D D Bishop
- Marine Biological Association, Plymouth PL1 2PB, United Kingdom
| | - Peter W H Holland
- Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom
| | - Yi-Jyun Luo
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan;
| |
Collapse
|
3
|
Hunter T. The genome sequence of the tawny cockroach, Ectobius (Ectobius) pallidus (Olivier, 1789). Wellcome Open Res 2025; 10:22. [PMID: 39866809 PMCID: PMC11754957 DOI: 10.12688/wellcomeopenres.23463.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/27/2024] [Indexed: 01/28/2025] Open
Abstract
We present a genome assembly from a specimen of Ectobius pallidus (tawny cockroach; Arthropoda; Insecta; Blattodea; Ectobiidae). The assembly contains two haplotypes with total lengths of 2,087.55 megabases and 2,124.67 megabases, respectively. Most of haplotype 1 (98.55%) is scaffolded into 11 chromosomal pseudomolecules, while haplotype 2 is assembled to scaffold level. The mitochondrial genome has also been assembled and is 15.75 kilobases in length.
Collapse
Affiliation(s)
- Tony Hunter
- Entomology Section, World Museum, Liverpool, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
4
|
Fowler K. The genome sequence of a leafhopper, Allygus modestus Scott, 1876. Wellcome Open Res 2025; 10:9. [PMID: 39881685 PMCID: PMC11775446 DOI: 10.12688/wellcomeopenres.23451.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/27/2024] [Indexed: 01/31/2025] Open
Abstract
We present a genome assembly from an individual male specimen of Allygus modestus (leafhopper; Arthropoda; Insecta; Hemiptera; Cicadellidae). The genome sequence has a total length of 1,819.90 megabases. Most of the assembly (99.86%) is scaffolded into 7 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 16.69 kilobases in length.
Collapse
|
5
|
Dewar AE, Belcher LJ, West SA. A phylogenetic approach to comparative genomics. Nat Rev Genet 2025:10.1038/s41576-024-00803-0. [PMID: 39779997 PMCID: PMC7617348 DOI: 10.1038/s41576-024-00803-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/05/2024] [Indexed: 01/11/2025]
Abstract
Comparative genomics, whereby the genomes of different species are compared, has the potential to address broad and fundamental questions at the intersection of genetics and evolution. However, species, genomes and genes cannot be considered as independent data points within statistical tests. Closely related species tend to be similar because they share genes by common descent, which must be accounted for in analyses. This problem of non-independence may be exacerbated when examining genomes or genes but can be addressed by applying phylogeny-based methods to comparative genomic analyses. Here, we review how controlling for phylogeny can change the conclusions of comparative genomics studies. We address common questions on how to apply these methods and illustrate how they can be used to test causal hypotheses. The combination of rapidly expanding genomic datasets and phylogenetic comparative methods is set to revolutionize the biological insights possible from comparative genomic studies.
Collapse
Affiliation(s)
- Anna E Dewar
- Department of Biology, University of Oxford, Oxford, UK.
- St John's College, Oxford, UK.
| | | | - Stuart A West
- Department of Biology, University of Oxford, Oxford, UK
| |
Collapse
|
6
|
Xian W, Bezrukov I, Bao Z, Vorbrugg S, Gautam A, Weigel D. TIPPo: A User-Friendly Tool for De Novo Assembly of Organellar Genomes with High-Fidelity Data. Mol Biol Evol 2025; 42:msae247. [PMID: 39800935 PMCID: PMC11725521 DOI: 10.1093/molbev/msae247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 11/15/2024] [Accepted: 11/18/2024] [Indexed: 01/16/2025] Open
Abstract
Plant cells have two major organelles with their own genomes: chloroplasts and mitochondria. While chloroplast genomes tend to be structurally conserved, the mitochondrial genomes of plants, which are much larger than those of animals, are characterized by complex structural variation. We introduce TIPPo, a user-friendly, reference-free assembly tool that uses PacBio high-fidelity long-read data and that does not rely on genomes from related species or nuclear genome information for the assembly of organellar genomes. TIPPo employs a deep learning model for initial read classification and leverages k-mer counting for further refinement, significantly reducing the impact of nuclear insertions of organellar DNA on the assembly process. We used TIPPo to completely assemble a set of 54 complete chloroplast genomes. No other tool was able to completely assemble this set. TIPPo is comparable with PMAT in assembling mitochondrial genomes from most species but does achieve even higher completeness for several species. We also used the assembled organelle genomes to identify instances of nuclear plastid DNA (NUPTs) and nuclear mitochondrial DNA (NUMTs) insertions. The cumulative length of NUPTs/NUMTs positively correlates with the size of the nuclear genome, suggesting that insertions occur stochastically. NUPTs/NUMTs show predominantly C:G to T:A changes, with the mutated cytosines typically found in CG and CHG contexts, suggesting that degradation of NUPT and NUMT sequences is driven by the known elevated mutation rate of methylated cytosines. Small interfering RNA loci are enriched in NUPTs and NUMTs, consistent with the RdDM pathway mediating DNA methylation in these sequences.
Collapse
Affiliation(s)
- Wenfei Xian
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Ilja Bezrukov
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Zhigui Bao
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Sebastian Vorbrugg
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Anupam Gautam
- Algorithms in Bioinformatics, Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
- International Max Planck Research School “From Molecules to Organisms”, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
7
|
Rigden DJ, Fernández XM. The 2025 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res 2025; 53:D1-D9. [PMID: 39658041 PMCID: PMC11701706 DOI: 10.1093/nar/gkae1220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Accepted: 11/26/2024] [Indexed: 12/12/2024] Open
Abstract
The 2025 Nucleic Acids Research database issue contains 185 papers spanning biology and related areas. Seventy three new databases are covered, while resources previously described in the issue account for 101 update articles. Databases most recently published elsewhere account for a further 11 papers. Nucleic acid databases include EXPRESSO for multi-omics of 3D genome structure (this issue's chosen Breakthrough Resource and Article) and NAIRDB for Fourier transform infrared data. New protein databases include structure predictions for human isoforms at ASpdb and for viral proteins at BFVD. UniProt, Pfam and InterPro have all provided updates: metabolism and signalling are covered by new descriptions of STRING, KEGG and CAZy, while updated microbe-oriented databases include Enterobase, VFDB and PHI-base. Biomedical research is supported, among others, by ClinVar, PubChem and DrugMAP. Genomics-related resources include Ensembl, UCSC Genome Browser and dbSNP. New plant databases cover the Solanaceae (SolR) and Asteraceae (AMIR) families while an update from NCBI Taxonomy also features. The Database Issue is freely available on the Nucleic Acids Research website (https://academic.oup.com/nar). At the NAR online Molecular Biology Database Collection (http://www.oxfordjournals.org/nar/database/c/), 932 entries have been reviewed in the last year, 74 new resources added and 226 discontinued URLs eliminated bringing the current total to 2236 databases.
Collapse
Affiliation(s)
- Daniel J Rigden
- Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | | |
Collapse
|
8
|
Bronner IF, Dawson E, Park N, Piepenburg O, Quail MA. Evaluation of controls, quality control assays, and protocol optimisations for PacBio HiFi sequencing on diverse and challenging samples. Front Genet 2025; 15:1505839. [PMID: 39845189 PMCID: PMC11752452 DOI: 10.3389/fgene.2024.1505839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2024] [Accepted: 11/19/2024] [Indexed: 01/24/2025] Open
Abstract
The Darwin Tree of Life (DToL) project aims to generate high-quality reference genomes for all eukaryotic organisms in Britain and Ireland. At the time of writing, PacBio HiFi reads are generated for all samples using the Sequel IIe systems by the Wellcome Sanger Institute's Scientific Operations teams, however we expect lessons from this work to apply directly to the Revio system too, as core principles of SMRT sequencing remain the same. We observed that HiFi yield is highly variable for DToL samples. We have investigated what drives this variation, and potential mitigations. To support these investigations a number of controls were evaluated to ensure that the library and sequencing preparation procedures, reagents, consumables, and Sequel IIe instruments, were performing as expected. Our findings support that a primary factor driving variability in HiFi yield is the quality of the DNA prior to library construction, e.g., purity, size, and damage. We investigated whether quality assessment assays could link measurable DNA damage or purity to sequencing yield. Some correlation could be established, however no assay was predictive of sequencing yield for all samples, indicating that the variability is driven by multiple factors that may interact. We demonstrate that contaminants present in some samples are the cause of very low HiFi yield, and show that these contaminants can negatively affect the PacBio internal sequencing control and samples multiplexed on the same SMRT Cell. We found that consistently high yields could be obtained if an amplification workflow was utilised, namely PacBio's ultra-low input library preparation protocol.
Collapse
Affiliation(s)
| | - Emma Dawson
- Wellcome Sanger Institute (WT), Hinxton, United Kingdom
| | | | | | | |
Collapse
|
9
|
Dyer SC, Austine-Orimoloye O, Azov AG, Barba M, Barnes I, Barrera-Enriquez VP, Becker A, Bennett R, Beracochea M, Berry A, Bhai J, Bhurji SK, Boddu S, Branco Lins PR, Brooks L, Ramaraju SB, Campbell LI, Martinez MC, Charkhchi M, Cortes LA, Davidson C, Denni S, Dodiya K, Donaldson S, El Houdaigui B, El Naboulsi T, Falola O, Fatima R, Genez T, Martinez JG, Gurbich T, Hardy M, Hollis Z, Hunt T, Kay M, Kaykala V, Lemos D, Lodha D, Mathlouthi N, Merino GA, Merritt R, Mirabueno LP, Mushtaq A, Hossain SN, Pérez-Silva JG, Perry M, Piližota I, Poppleton D, Prosovetskaia I, Raj S, Salam AI, Saraf S, Saraiva-Agostinho N, Sinha S, Sipos B, Sitnik V, Steed E, Suner MM, Surapaneni L, Sutinen K, Tricomi FF, Tsang I, Urbina-Gómez D, Veidenberg A, Walsh TA, Willhoft NL, Allen J, Alvarez-Jarreta J, Chakiachvili M, Cheema J, da Rocha JB, De Silva NH, Giorgetti S, Haggerty L, Ilsley GR, Keatley J, Loveland JE, Moore B, Mudge JM, Naamati G, Tate J, Trevanion SJ, Winterbottom A, Flint B, Frankish A, Hunt SE, Finn RD, Freeberg MA, Harrison PW, Martin FJ, Yates AD. Ensembl 2025. Nucleic Acids Res 2025; 53:D948-D957. [PMID: 39656687 PMCID: PMC11701638 DOI: 10.1093/nar/gkae1071] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 10/14/2024] [Accepted: 10/22/2024] [Indexed: 12/17/2024] Open
Abstract
Ensembl (www.ensembl.org) is an open platform integrating publicly available genomics data across the tree of life with a focus on eukaryotic species related to human health, agriculture and biodiversity. This year has seen a continued expansion in the number of species represented, with >4800 eukaryotic and >31 300 prokaryotic genomes available. The new Ensembl site, currently in beta, has continued to develop, currently holding >2700 eukaryotic genome assemblies. The new site provides genome, gene, transcript, homology and variation views, and will replace the current Rapid Release site; this represents a key step towards provision of a single integrated Ensembl site. Additional activities have included developing improved regulatory annotation for human, mouse and agricultural species, and expanding the Ensembl Variant Effect Predictor tool. To learn more about Ensembl, help and documentation are available along with an extensive training program that can be accessed via our training pages.
Collapse
Affiliation(s)
- Sarah C Dyer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Olanrewaju Austine-Orimoloye
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrey G Azov
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthieu Barba
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - If Barnes
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Vianey Paola Barrera-Enriquez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Arne Becker
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ruth Bennett
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Martin Beracochea
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrew Berry
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jyothish Bhai
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Simarpreet Kaur Bhurji
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sanjay Boddu
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paulo R Branco Lins
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Lucy Brooks
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Shashank Budhanuru Ramaraju
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Lahcen I Campbell
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Manuel Carbajo Martinez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mehrnaz Charkhchi
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Lucas A Cortes
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Claire Davidson
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sukanya Denni
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Université de Rouen Normandie, UFR Sciences et Techniques, 3 Av. Pasteur, 76000 Rouen, France
| | - Kamalkumar Dodiya
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sarah Donaldson
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Bilal El Houdaigui
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tamara El Naboulsi
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Oluwadamilare Falola
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Reham Fatima
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thiago Genez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jose Gonzalez Martinez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tatiana Gurbich
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthew Hardy
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Zoe Hollis
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mike Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Vinay Kaykala
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Diana Lemos
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Disha Lodha
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nourhen Mathlouthi
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gabriela Alejandra Merino
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ryan Merritt
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Louisse Paola Mirabueno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Aleena Mushtaq
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Syed Nakib Hossain
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - José G Pérez-Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Malcolm Perry
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ivana Piližota
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel Poppleton
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Irina Prosovetskaia
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Shriya Raj
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ahamed Imran Abdul Salam
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Shradha Saraf
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nuno Saraiva-Agostinho
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Swati Sinha
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Botond Sipos
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Vasily Sitnik
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Emily Steed
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Marie-Marthe Suner
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Likhitha Surapaneni
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kyösti Sutinen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Francesca Floriana Tricomi
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ian Tsang
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- NIAB, Lawrence Weaver Road, Cambridge CB3 0LE, UK
- University of Nottingham, Department of Plant Science, Plant Sciences Building, Sutton Bonnington Campus, Nottingham LE12 5RD, UK
| | - David Urbina-Gómez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andres Veidenberg
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thomas A Walsh
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Natalie L Willhoft
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jamie Allen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jorge Alvarez-Jarreta
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Marc Chakiachvili
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jitender Cheema
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jorge Batista da Rocha
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nishadi H De Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stefano Giorgetti
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Garth R Ilsley
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jon Keatley
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jane E Loveland
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Benjamin Moore
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Guy Naamati
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - John Tate
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen J Trevanion
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrea Winterbottom
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Bethany Flint
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mallory A Freeberg
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Peter W Harrison
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrew D Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
10
|
Blake DP. Eimeria of chickens: the changing face of an old foe. Avian Pathol 2025:1-12. [PMID: 39743984 DOI: 10.1080/03079457.2024.2441180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2024] [Revised: 11/22/2024] [Accepted: 12/03/2024] [Indexed: 01/04/2025]
Abstract
RESEARCH HIGHLIGHTS The cost of coccidiosis in chickens fluctuates considerably, peaking in 2022.Three new Eimeria species can infect chickens and escape current vaccines.Eimeria infection exerts wide-ranging effects on enteric microbiota.
Collapse
Affiliation(s)
- Damer P Blake
- Pathobiology and Population Sciences, Royal Veterinary College, North Mymms, UK
| |
Collapse
|
11
|
Secomandi S, Gallo GR, Rossi R, Rodríguez Fernandes C, Jarvis ED, Bonisoli-Alquati A, Gianfranceschi L, Formenti G. Pangenome graphs and their applications in biodiversity genomics. Nat Genet 2025; 57:13-26. [PMID: 39779953 DOI: 10.1038/s41588-024-02029-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 11/08/2024] [Indexed: 01/11/2025]
Abstract
Complete datasets of genetic variants are key to biodiversity genomic studies. Long-read sequencing technologies allow the routine assembly of highly contiguous, haplotype-resolved reference genomes. However, even when complete, reference genomes from a single individual may bias downstream analyses and fail to adequately represent genetic diversity within a population or species. Pangenome graphs assembled from aligned collections of high-quality genomes can overcome representation bias by integrating sequence information from multiple genomes from the same population, species or genus into a single reference. Here, we review the available tools and data structures to build, visualize and manipulate pangenome graphs while providing practical examples and discussing their applications in biodiversity and conservation genomics across the tree of life.
Collapse
Affiliation(s)
- Simona Secomandi
- Laboratory of Neurogenetics of Language, the Rockefeller University, New York, NY, USA
| | | | - Riccardo Rossi
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
| | - Carlos Rodríguez Fernandes
- Centre for Ecology, Evolution and Environmental Changes (CE3C) and CHANGE, Global Change and Sustainability Institute, Departamento de Biologia Animal, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
- Faculdade de Psicologia, Universidade de Lisboa, Lisboa, Portugal
| | - Erich D Jarvis
- Laboratory of Neurogenetics of Language, the Rockefeller University, New York, NY, USA
- The Vertebrate Genome Laboratory, New York, NY, USA
| | - Andrea Bonisoli-Alquati
- Department of Biological Sciences, California State Polytechnic University, Pomona, Pomona, CA, USA
| | | | | |
Collapse
|
12
|
Hauff L, Rasoanaivo NE, Razafindrakoto A, Ravelonjanahary H, Wright PC, Rakotoarivony R, Bergey CM. De Novo Genome Assembly for an Endangered Lemur Using Portable Nanopore Sequencing in Rural Madagascar. Ecol Evol 2025; 15:e70734. [PMID: 39777412 PMCID: PMC11705420 DOI: 10.1002/ece3.70734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 12/02/2024] [Accepted: 12/03/2024] [Indexed: 01/11/2025] Open
Abstract
As one of the most threatened mammalian taxa, lemurs of Madagascar are facing unprecedented anthropogenic pressures. To address conservation imperatives such as this, researchers have increasingly relied on conservation genomics to identify populations of particular concern. However, many of these genomic approaches necessitate high-quality genomes. While the advent of next-generation sequencing technologies and the resulting reduction in associated costs have led to the proliferation of genomic data and high-quality reference genomes, global discrepancies in genomic sequencing capabilities often result in biological samples from biodiverse host countries being exported to facilities in the Global North, creating inequalities in access and training within genomic research. Here, we present the first published reference genome for the endangered red-fronted brown lemur (Eulemur rufifrons) from sequencing efforts conducted entirely within the host country using portable Oxford Nanopore sequencing. Using an archived E. rufifrons specimen, we conducted long-read, nanopore sequencing at the Centre ValBio Research Station near Ranomafana National Park, in rural Madagascar, generating over 750 Gb of sequencing data from 10 MinION flow cells. Exclusively using this long-read data, we assembled 2.157 gigabase, 2980-contig nuclear assembly with an N50 of 101.6 Mb and a 17,108 bp mitogenome. The nuclear assembly had 30× average coverage and was comparable in completeness to other primate reference genomes, with a 96.1% BUSCO completeness score for primate-specific genes. As the first published reference genome for E. rufifrons and the only annotated genome available for the speciose Eulemur genus, this resource will prove vital for conservation genomic studies while our efforts exhibit the potential of this protocol to address research inequalities and build genomic capacity.
Collapse
Affiliation(s)
- Lindsey Hauff
- Department of Ecology, Evolution, and Natural ResourcesRutgers UniversityNew BrunswickNew JerseyUSA
- Center for Human Evolutionary StudiesRutgers UniversityNew BrunswickNew JerseyUSA
- Human Genetics Institute of New JerseyPiscatawayNew JerseyUSA
| | - Noa Elosmie Rasoanaivo
- Department of Zoology and Animal BiodiversityUniversity of AntananarivoAntananarivoMadagascar
| | | | | | - Patricia C. Wright
- Centre ValBio, Ranomafana National ParkIfanadianaMadagascar
- Department of AnthropologyStony Brook UniversityStony BrookNew YorkUSA
| | - Rindra Rakotoarivony
- Department of Biological Anthropology and Sustainable DevelopmentUniversity of AntananarivoAntananarivoMadagascar
| | - Christina M. Bergey
- Center for Human Evolutionary StudiesRutgers UniversityNew BrunswickNew JerseyUSA
- Human Genetics Institute of New JerseyPiscatawayNew JerseyUSA
- Department of GeneticsRutgers UniversityPiscatawayNew JerseyUSA
| |
Collapse
|
13
|
Krabberød AK, Stokke E, Thoen E, Skrede I, Kauserud H. The Ribosomal Operon Database: A Full-Length rDNA Operon Database Derived From Genome Assemblies. Mol Ecol Resour 2025; 25:e14031. [PMID: 39428982 DOI: 10.1111/1755-0998.14031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Revised: 06/27/2024] [Accepted: 09/27/2024] [Indexed: 10/22/2024]
Abstract
Current rDNA reference sequence databases are tailored towards shorter DNA markers, such as parts of the 16/18S marker or the internally transcribed spacer (ITS) region. However, due to advances in long-read DNA sequencing technologies, longer stretches of the rDNA operon are increasingly used in environmental sequencing studies to increase the phylogenetic resolution. There is, therefore, a growing need for longer rDNA reference sequences. Here, we present the ribosomal operon database (ROD), which includes eukaryotic full-length rDNA operons fished from publicly available genome assemblies. Full-length operons were detected in 34.1% of the 34,701 examined eukaryotic genome assemblies from NCBI. In most cases (53.1%), more than one operon variant was detected, which can be due to intragenomic operon copy variability, allelic variation in non-haploid genomes, or technical errors from the sequencing and assembly process. The highest copy number found was 5947 in Zea mays. In total, 453,697 unique operons were detected, with 69,480 operon variant clusters remaining after intragenomic clustering at 99% sequence identity. The operon length varied extensively across eukaryotes, ranging from 4136 to 16,463 bp, which will lead to considerable polymerase chain reaction (PCR) bias during amplification of the entire operon. Clustering the full-length operons revealed that the different parts (i.e., 18S, 28S, and the hypervariable regions V4 and V9 of 18S) provide divergent taxonomic resolution, with 18S, the V4 and V9 regions being the most conserved. The ROD will be updated regularly to provide an increasing number of full-length rDNA operons to the scientific community.
Collapse
Affiliation(s)
- Anders K Krabberød
- Department of Biosciences, Section for Genetics and Evolutionary Biology, University of Oslo, Oslo, Norway
| | - Embla Stokke
- Department of Biosciences, Section for Genetics and Evolutionary Biology, University of Oslo, Oslo, Norway
| | - Ella Thoen
- Department of Biosciences, Section for Genetics and Evolutionary Biology, University of Oslo, Oslo, Norway
| | - Inger Skrede
- Department of Biosciences, Section for Genetics and Evolutionary Biology, University of Oslo, Oslo, Norway
| | - Håvard Kauserud
- Department of Biosciences, Section for Genetics and Evolutionary Biology, University of Oslo, Oslo, Norway
| |
Collapse
|
14
|
Orozco-Arias S, Sierra P, Durbin R, González J. MCHelper automatically curates transposable element libraries across eukaryotic species. Genome Res 2024; 34:2256-2268. [PMID: 39653419 PMCID: PMC11694758 DOI: 10.1101/gr.278821.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 09/18/2024] [Indexed: 12/25/2024]
Abstract
The number of species with high-quality genome sequences continues to increase, in part due to the scaling up of multiple large-scale biodiversity sequencing projects. While the need to annotate genic sequences in these genomes is widely acknowledged, the parallel need to annotate transposable element (TE) sequences that have been shown to alter genome architecture, rewire gene regulatory networks, and contribute to the evolution of host traits is becoming ever more evident. However, accurate genome-wide annotation of TE sequences is still technically challenging. Several de novo TE identification tools are now available, but manual curation of the libraries produced by these tools is needed to generate high-quality genome annotations. Manual curation is time-consuming, and thus impractical for large-scale genomic studies, and lacks reproducibility. In this work, we present the Manual Curator Helper tool MCHelper, which automates the TE library curation process. By leveraging MCHelper's fully automated mode with the outputs from three de novo TE identification tools, RepeatModeler2, EDTA, and REPET, in the fruit fly, rice, hooded crow, zebrafish, maize, and human, we show a substantial improvement in the quality of the TE libraries and genome annotations. MCHelper libraries are less redundant, with up to 65% reduction in the number of consensus sequences, have up to 11.4% fewer false positive sequences, and up to ∼48% fewer "unclassified/unknown" TE consensus sequences. Genome-wide TE annotations are also improved, including larger unfragmented insertions. Moreover, MCHelper is an easy-to-install and easy-to-use tool.
Collapse
Affiliation(s)
| | - Pío Sierra
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Josefa González
- Institute of Evolutionary Biology, CSIC, UPF, 08003 Barcelona, Spain;
- Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Spain
| |
Collapse
|
15
|
Sanita Lima M, Silva Domingues D, Rossi Paschoal A, Smith DR. Long-read RNA sequencing can probe organelle genome pervasive transcription. Brief Funct Genomics 2024; 23:695-701. [PMID: 38880995 DOI: 10.1093/bfgp/elae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/20/2024] [Accepted: 05/30/2024] [Indexed: 06/18/2024] Open
Abstract
40 years ago, organelle genomes were assumed to be streamlined and, perhaps, unexciting remnants of their prokaryotic past. However, the field of organelle genomics has exposed an unparallel diversity in genome architecture (i.e. genome size, structure, and content). The transcription of these eccentric genomes can be just as elaborate - organelle genomes are pervasively transcribed into a plethora of RNA types. However, while organelle protein-coding genes are known to produce polycistronic transcripts that undergo heavy posttranscriptional processing, the nature of organelle noncoding transcriptomes is still poorly resolved. Here, we review how wet-lab experiments and second-generation sequencing data (i.e. short reads) have been useful to determine certain types of organelle RNAs, particularly noncoding RNAs. We then explain how third-generation (long-read) RNA-Seq data represent the new frontier in organelle transcriptomics. We show that public repositories (e.g. NCBI SRA) already contain enough data for inter-phyla comparative studies and argue that organelle biologists can benefit from such data. We discuss the prospects of using publicly available sequencing data for organelle-focused studies and examine the challenges of such an approach. We highlight that the lack of a comprehensive database dedicated to organelle genomics/transcriptomics is a major impediment to the development of a field with implications in basic and applied science.
Collapse
Affiliation(s)
- Matheus Sanita Lima
- Department of Biology, Western University, 1151 Richmond Street, London, Ontario N6A 5B7, Canada
| | - Douglas Silva Domingues
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Avenida Padua Dias 11, Piracicaba, SP 13418-900, Brazil
| | - Alexandre Rossi Paschoal
- Department of Computer Science, Bioinformatics and Pattern Recognition Group (BIOINFO-CP), Federal University of Technology - Paraná - UTFPR, Avenida Alberto Carazzai 1640, Cornélio Procópio, PR 86300000, Brazil
| | - David Roy Smith
- Department of Biology, Western University, 1151 Richmond Street, London, Ontario N6A 5B7, Canada
| |
Collapse
|
16
|
Mikalsen SO, Í Hjøllum J, Salter I, Djurhuus A, Í Kongsstovu S. A Faroese perspective on decoding life for sustainable use of nature and protection of biodiversity. NPJ BIODIVERSITY 2024; 3:37. [PMID: 39632982 PMCID: PMC11618374 DOI: 10.1038/s44185-024-00068-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 11/11/2024] [Indexed: 12/07/2024]
Abstract
Biodiversity is under pressure, mainly due to human activities and climate change. At the international policy level, it is now recognised that genetic diversity is an important part of biodiversity. The availability of high-quality reference genomes gives the best basis for using genetics and genetic diversity towards the global aims of (1) the protection of species, biodiversity, and nature, and (2) the management of biodiversity for achieving sustainable harvesting of nature. Protecting biodiversity is a global responsibility, also resting on small nations, like the Faroe Islands. Being in the middle of the North Atlantic Ocean and having large fisheries activity, the nation has a particular responsibility towards maritime matters. We here provide the reasoning behind the Genome Atlas of Faroese Ecology (Gen@FarE), a project based on our participation in the European Reference Genome Atlas consortium (ERGA). Gen@FarE has three major aims: (1) To acquire high-quality genomes of all eukaryotic species in the Faroe Islands and Faroese waters. (2) To establish population genetics for species of commercial or ecological interest. (3) To establish an information databank for all Faroese species, combined with a citizen science registration database, making it possible for the public to participate in acquiring and maintaining the overview of Faroese species in both terrestrial and marine environments. Altogether, we believe that this will enhance the society's interest in and awareness of biodiversity, thereby protecting the foundations of our lives. Furthermore, the combination of a wide and highly competent ERGA umbrella and more targeted national projects will help fulfil the formal and moral responsibilities that all nations, also those with limited resources, have in protecting biodiversity and achieving sustainability in harvesting from nature.
Collapse
Affiliation(s)
- Svein-Ole Mikalsen
- Faculty of Science and Technology, University of the Faroe Islands, Tórshavn, Faroe Islands.
| | - Jari Í Hjøllum
- Faculty of Science and Technology, University of the Faroe Islands, Tórshavn, Faroe Islands
| | - Ian Salter
- Faroe Marine Research Institute, Tórshavn, Faroe Islands
| | - Anni Djurhuus
- Faculty of Science and Technology, University of the Faroe Islands, Tórshavn, Faroe Islands
| | - Sunnvør Í Kongsstovu
- Faculty of Science and Technology, University of the Faroe Islands, Tórshavn, Faroe Islands
| |
Collapse
|
17
|
Augustijnen H, Arias-Sardá C, Farré M, Lucek K. A Genomic Update on the Evolutionary Impact of Chromosomal Rearrangements. Mol Ecol 2024; 33:e17602. [PMID: 39585199 DOI: 10.1111/mec.17602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2024] [Revised: 10/29/2024] [Accepted: 11/13/2024] [Indexed: 11/26/2024]
Affiliation(s)
- Hannah Augustijnen
- Unit of Ecology and Evolution, Department of Biology, University of Fribourg, Fribourg, Switzerland
| | | | - Marta Farré
- School of Biosciences, University of Kent, Kent, UK
| | - Kay Lucek
- Biodiversity Genomics Laboratory, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
18
|
Provatas K, Chantzi N, Patsakis M, Nayak A, Mouratidis I, Georgakopoulos-Soares I. Microsatellites explorer: A database of short tandem repeats across genomes. Comput Struct Biotechnol J 2024; 23:3817-3826. [PMID: 39525087 PMCID: PMC11550718 DOI: 10.1016/j.csbj.2024.10.041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 10/24/2024] [Accepted: 10/24/2024] [Indexed: 11/16/2024] Open
Abstract
Short tandem repeats (STRs) are widespread, repetitive elements, with a number of biological functions and are among the most rapidly mutating regions in the genome. Their distribution varies significantly between taxonomic groups in the tree of life and are highly polymorphic within the human population. Advances in sequencing technologies coupled with decreasing costs have enabled the generation of an ever-growing number of complete genomes. Additionally, the arrival of accurate long reads has facilitated the generation of Telomere-to-Telomere (T2T) assemblies of complete genomes. Nevertheless, there is no comprehensive database that encompasses the STRs found per genome across different organisms and for different human genomes across diverse ancestries. Here we introduce Microsatellites Explorer, a database of STRs found in the genomes of 117,253 organisms across all major taxonomic groups, 15 T2T genome assemblies of different organisms, and 94 human haplotypes from the human pangenome. The database currently hosts 406,758,798 STR sequences, serving as a centralized user-friendly repository to perform searches, interactive visualizations, and download existing STR data for independent analysis. Microsatellites Explorer is implemented as a web-portal for browsing, analyzing and downloading STR data. Microsatellites Explorer is publicly available at https://www.microsatellitesexplorer.com.
Collapse
Affiliation(s)
- Kimonas Provatas
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Michail Patsakis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Akshatha Nayak
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
19
|
Kalbfleisch TS, Smith ML, Ciosek JL, Li K, Doris PA. Three decades of rat genomics: approaching the finish(ed) line. Physiol Genomics 2024; 56:807-818. [PMID: 39348459 PMCID: PMC11573253 DOI: 10.1152/physiolgenomics.00110.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 09/11/2024] [Accepted: 09/26/2024] [Indexed: 10/02/2024] Open
Abstract
The rat, Rattus norvegicus, has provided an important model for investigation of a range of characteristics of biomedical importance. Here we survey the origins of this species, its introduction into laboratory research, and the emergence of genetic and genomic methods that utilize this model organism. Genomic studies have yielded important progress and provided new insight into several biologically important traits. However, some studies have been impeded by the lack of a complete and accurate reference genome for this species. New sequencing and genome assembly methods applied to the rat have resulted in a new reference genome assembly, GRCr8, which is a near telomere-to-telomere assembly of high base-level accuracy that incorporates several elements not captured in prior assemblies. As genome assembly methods continue to advance and production costs become a less significant obstacle, genome assemblies for multiple inbred rat strains are emerging. These assemblies will allow a rat pangenome assembly to be constructed that captures all the genetic variations in strains selected for their utility in research and will overcome reference bias, a limitation associated with reliance on a single reference assembly. By this means, the full utility of this model organism to genomic studies will begin to be revealed.
Collapse
Affiliation(s)
- Theodore S Kalbfleisch
- Gluck Equine Research Center, University of Kentucky, Lexington, Kentucky, United States
| | - Melissa L Smith
- Department of Biochemistry and Molecular Biology, University of Louisville School of Medicine, Louisville, Kentucky, United States
| | - Julia L Ciosek
- Gluck Equine Research Center, University of Kentucky, Lexington, Kentucky, United States
| | - Kai Li
- Gluck Equine Research Center, University of Kentucky, Lexington, Kentucky, United States
| | - Peter A Doris
- Center for Human Genetics, Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center, Houston, Texas, United States
| |
Collapse
|
20
|
Nguyen AK, Schall PZ, Kidd JM. A map of canine sequence variation relative to a Greenland wolf outgroup. Mamm Genome 2024; 35:565-576. [PMID: 39088040 DOI: 10.1007/s00335-024-10056-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 07/25/2024] [Indexed: 08/02/2024]
Abstract
For over 15 years, canine genetics research relied on a reference assembly from a Boxer breed dog named Tasha (i.e., canFam3.1). Recent advances in long-read sequencing and genome assembly have led to the development of numerous high-quality assemblies from diverse canines. These assemblies represent notable improvements in completeness, contiguity, and the representation of gene promoters and gene models. Although genome graph and pan-genome approaches have promise, most genetic analyses in canines rely upon the mapping of Illumina sequencing reads to a single reference. The Dog10K consortium, and others, have generated deep catalogs of genetic variation through an alignment of Illumina sequencing reads to a reference genome obtained from a German Shepherd Dog named Mischka (i.e., canFam4, UU_Cfam_GSD_1.0). However, alignment to a breed-derived genome may introduce bias in genotype calling across samples. Since the use of an outgroup reference genome may remove this effect, we have reprocessed 1929 samples analyzed by the Dog10K consortium using a Greenland wolf (mCanLor1.2) as the reference. We efficiently performed remapping and variant calling using a GPU-implementation of common analysis tools. The resulting call set removes the variability in genetic differences seen across samples and breed relationships revealed by principal component analysis are not affected by the choice of reference genome. Using this sequence data, we inferred the history of population sizes and found that village dog populations experienced a 9-13 fold reduction in historic effective population size relative to wolves.
Collapse
Affiliation(s)
- Anthony K Nguyen
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Peter Z Schall
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Jeffrey M Kidd
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
21
|
Patsakis M, Provatas K, Mouratidis I, Georgakopoulos-Soares I. MAFcounter: An efficient tool for counting the occurrences of k-mers in MAF files. ARXIV 2024:arXiv:2411.19427v1. [PMID: 39650609 PMCID: PMC11623707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Motivation With the rapid expansion of large-scale biological datasets, DNA and protein sequence alignments have become essential for comparative genomics and proteomics. These alignments facilitate the exploration of sequence similarity patterns, providing valuable insights into sequence conservation, evolutionary relationships and for functional analyses. Typically, sequence alignments are stored in formats such as the Multiple Alignment Format (MAF). Counting k-mer occurrences is a crucial task in many computational biology applications, but currently, there is no algorithm designed for k-mer counting in alignment files. Results We have developed MAFcounter, the first k-mer counter dedicated to alignment files. MAFcounter is multithreaded, fast, and memory efficient, enabling k-mer counting in DNA and protein sequence alignment files. Availability The MAFcounter package and its Python bindings are released under GPL license as a multi-platform application and are available at: https://github.com/Georgakopoulos-Soares-lab/MAFcounter.
Collapse
Affiliation(s)
- Michail Patsakis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Kimonas Provatas
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
22
|
Boyes D. The genome sequence of the Poplar Grey moth, Subacronicta megacephala (Denis & Schiffermüller, 1775). Wellcome Open Res 2024; 9:696. [PMID: 39822595 PMCID: PMC11736114 DOI: 10.12688/wellcomeopenres.23371.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/08/2024] [Indexed: 01/19/2025] Open
Abstract
We present a genome assembly from an individual male Subacronicta megacephala (Poplar Grey moth; Arthropoda; Insecta; Lepidoptera; Noctuidae). The genome sequence has a total length of 424.20 megabases. Most of the assembly (99.02%) is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.35 kilobases in length. Gene annotation of this assembly on Ensembl identified 18,189 protein-coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
23
|
Vazquez JM, Lauterbur ME, Mottaghinia S, Bucci M, Fraser D, Gray-Sandoval G, Gaucherand L, Haidar ZR, Han M, Kohler W, Lama TM, Le Corf A, Loyer C, Maesen S, McMillan D, Li S, Lo J, Rey C, Capel SLR, Singer M, Slocum K, Thomas W, Tyburec JD, Villa S, Miller R, Buchalski M, Vazquez-Medina JP, Pfeffer S, Etienne L, Enard D, Sudmant PH. Extensive longevity and DNA virus-driven adaptation in nearctic Myotis bats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.10.617725. [PMID: 39416019 PMCID: PMC11482938 DOI: 10.1101/2024.10.10.617725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
The genus Myotis is one of the largest clades of bats, and exhibits some of the most extreme variation in lifespans among mammals alongside unique adaptations to viral tolerance and immune defense. To study the evolution of longevity-associated traits and infectious disease, we generated near-complete genome assemblies and cell lines for 8 closely related species of Myotis. Using genome-wide screens of positive selection, analyses of structural variation, and functional experiments in primary cell lines, we identify new patterns of adaptation contributing to longevity, cancer resistance, and viral interactions in bats. We find that Myotis bats have some of the most significant variation in cancer risk across mammals and demonstrate a unique DNA damage response in primary cells of the long-lived M. lucifugus. We also find evidence of abundant adaptation in response to DNA viruses - but not RNA viruses - in Myotis and other bats in sharp contrast with other mammals, potentially contributing to the role of bats as reservoirs of zoonoses. Together, our results demonstrate how genomics and primary cells derived from diverse taxa uncover the molecular bases of extreme adaptations in non-model organisms.
Collapse
Affiliation(s)
- Juan M Vazquez
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- These authors contributed equally
| | - M. Elise Lauterbur
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ USA
- Current affiliation: Department of Biology, University of Vermont, Burlington, VT USA
- These authors contributed equally
| | - Saba Mottaghinia
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Melanie Bucci
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ USA
| | - Devaughn Fraser
- Wildlife Genetics Research Unit, Wildlife Health Laboratory, California Department of Fish and Wildlife, Sacramento, CA, United States
- Current affiliation: Wildlife Diversity Program, Wildlife Division, Connecticut Department of Energy and Environmental Protection, Burlington, CT, United States
| | | | - Léa Gaucherand
- Université de Strasbourg, Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Strasbourg, France
| | - Zeinab R Haidar
- Department of Biology, California State Polytechnic University, Humboldt, Arcata, CA USA
- Current affiliation: Western EcoSystems Technology Inc, Cheyenne, WY USA
| | - Melissa Han
- Department of Pathology and Clinical Laboratories, University of Michigan, Ann Arbor, MI USA
| | - William Kohler
- Department of Pathology and Clinical Laboratories, University of Michigan, Ann Arbor, MI USA
| | - Tanya M. Lama
- Department of Biological Sciences, Smith College, Northampton, MA USA
| | - Amandine Le Corf
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Clara Loyer
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Sarah Maesen
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Dakota McMillan
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- Department of Science and Biotechnology, Berkeley City College, Berkeley, CA USA
| | - Stacy Li
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA USA
| | - Johnathan Lo
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA USA
| | - Carine Rey
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Samantha LR Capel
- Current affiliation: Wildlife Diversity Program, Wildlife Division, Connecticut Department of Energy and Environmental Protection, Burlington, CT, United States
| | - Michael Singer
- Department of Molecular and Cellular Biology, University of California, Berkeley, Berkeley, CA USA
| | | | - William Thomas
- Department of Ecology and Evolution, Stony Brook University, Stony Brook NY USA
| | | | - Sarah Villa
- Department of Molecular and Cellular Biology, University of California, Berkeley, Berkeley, CA USA
| | - Richard Miller
- Department of Pathology and Clinical Laboratories, University of Michigan, Ann Arbor, MI USA
| | - Michael Buchalski
- Wildlife Genetics Research Unit, Wildlife Health Laboratory, California Department of Fish and Wildlife, Sacramento, CA, United States
| | | | - Sébastien Pfeffer
- Université de Strasbourg, Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Strasbourg, France
| | - Lucie Etienne
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
- Senior author
| | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ USA
- Senior author
- These authors contributed equally
| | - Peter H Sudmant
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA USA
- Senior author
- These authors contributed equally
- Lead contact
| |
Collapse
|
24
|
Ryan H, Vernes SC, Teeling EC, Mai M. The genome sequence of the whiskered bat, Myotis mystacinus (Kuhl, 1817). Wellcome Open Res 2024; 9:684. [PMID: 39635244 PMCID: PMC11615438 DOI: 10.12688/wellcomeopenres.23345.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/04/2024] [Indexed: 12/07/2024] Open
Abstract
We present a genome assembly from an individual male Myotis mystacinus (whiskered bat; Chordata; Mammalia; Chiroptera; Vespertilionidae). The genome sequence has a total length of 2,081.20 megabases. Most of the assembly (97.52%) is scaffolded into 23 chromosomal pseudomolecules, including the X and Y sex chromosomes. The mitochondrial genome has also been assembled and is 16.93 kilobases in length.
Collapse
Affiliation(s)
| | - Sonja C. Vernes
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
| | - Emma C Teeling
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | - Meike Mai
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
| | - Natural History Museum Genome Acquisition Lab
- Kent Bat Group, Whitstable, England, UK
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | - Darwin Tree of Life Barcoding collective
- Kent Bat Group, Whitstable, England, UK
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team
- Kent Bat Group, Whitstable, England, UK
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Scientific Operations: Sequencing Operations
- Kent Bat Group, Whitstable, England, UK
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Tree of Life Core Informatics team
- Kent Bat Group, Whitstable, England, UK
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | - Tree of Life Core Informatics collective
- Kent Bat Group, Whitstable, England, UK
- School of Biology, University of St Andrews, St Andrews, Scotland, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | | |
Collapse
|
25
|
Ye X, Yang Y, Fang Q, Ye G. Genomics of insect natural enemies in agroecosystems. CURRENT OPINION IN INSECT SCIENCE 2024; 68:101298. [PMID: 39547440 DOI: 10.1016/j.cois.2024.101298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 09/26/2024] [Accepted: 11/10/2024] [Indexed: 11/17/2024]
Abstract
Currently, a wealth of genomic data are now accessible for numerous insect natural enemies, serving as valuable resources that deepen our understanding of the genetic basis of biocontrol traits in these organisms. We summarize the current state of genome sequencing and highlight candidate genes related to biocontrol traits that hold promise for genetic improvement. We also review the recent population genomic studies in biological control and the discovery of potential insecticidal genes in parasitoid wasps. Collectively, current genomic works have shown the powerful ability to identify candidate genes responsible for desirable traits or promising effectors. However, further functional study is necessary to gain a mechanistic understanding of these genes, and future efforts are also needed to develop suitable approaches to translate genomic insights into field applications.
Collapse
Affiliation(s)
- Xinhai Ye
- College of Advanced Agriculture Science, Zhejiang A&F University, Hangzhou 311300, China; Zhejiang Key Laboratory of Biology and Ecological Regulation of Crop Pathogens and Insects, Zhejiang A&F University, Hangzhou 311300, China.
| | - Yi Yang
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou 310058, China
| | - Qi Fang
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou 310058, China
| | - Gongyin Ye
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
26
|
Calcino A, Cooke I, Cowman P, Higgie M, Massault C, Schmitz U, Whittaker M, Field MA. Harnessing genomic technologies for one health solutions in the tropics. Global Health 2024; 20:78. [PMID: 39543642 PMCID: PMC11566161 DOI: 10.1186/s12992-024-01083-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 11/01/2024] [Indexed: 11/17/2024] Open
Abstract
BACKGROUND The targeted application of cutting-edge high-throughput molecular data technologies provides an enormous opportunity to address key health, economic and environmental issues in the tropics within the One Health framework. The Earth's tropical regions are projected to contain > 50% of the world's population by 2050 coupled with 80% of its biodiversity however these regions are relatively less developed economically, with agricultural productivity substantially lower than temperate zones, a large percentage of its population having limited health care options and much of its biodiversity understudied and undescribed. The generation of high-throughput molecular data and bespoke bioinformatics capability to address these unique challenges offers an enormous opportunity for people living in the tropics. MAIN: In this review we discuss in depth solutions to challenges to populations living in tropical zones across three critical One Health areas: human health, biodiversity and food production. This review will examine how some of the challenges in the tropics can be addressed through the targeted application of advanced omics and bioinformatics and will discuss how local populations can embrace these technologies through strategic outreach and education ensuring the benefits of the One Health approach is fully realised through local engagement. CONCLUSION Within the context of the One Health framework, we will demonstrate how genomic technologies can be utilised to improve the overall quality of life for half the world's population.
Collapse
Affiliation(s)
- Andrew Calcino
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, Australia
- College of Public Health, Medical and Veterinary Sciences, James Cook University, Townsville, QLD, Australia
| | - Ira Cooke
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, Australia
- College of Public Health, Medical and Veterinary Sciences, James Cook University, Townsville, QLD, Australia
| | - Pete Cowman
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, Australia
- Queensland Museum, Townsville, QLD, Australia
| | - Megan Higgie
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, Australia
- College of Public Health, Medical and Veterinary Sciences, James Cook University, Townsville, QLD, Australia
| | - Cecile Massault
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, Australia
- Centre for Sustainable Tropical Fisheries and Aquaculture James Cook University, Townsville, QLD, Australia
| | - Ulf Schmitz
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, Australia
- College of Public Health, Medical and Veterinary Sciences, James Cook University, Townsville, QLD, Australia
- Sydney Medical School, University of Sydney, Sydney, NSW, Australia
| | - Maxine Whittaker
- College of Public Health, Medical and Veterinary Sciences, James Cook University, Townsville, QLD, Australia
| | - Matt A Field
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, Australia.
- Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW, Australia.
| |
Collapse
|
27
|
Griffiths A, Prescott T. The genome sequence of the Scalloped Hook-tip moth, Falcaria lacertinaria (Linnaeus, 1758). Wellcome Open Res 2024; 9:659. [PMID: 39649622 PMCID: PMC11624438 DOI: 10.12688/wellcomeopenres.23258.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/16/2024] [Indexed: 12/11/2024] Open
Abstract
We present a genome assembly from an individual female Falcaria lacertinaria (the Scalloped Hook-tip; Arthropoda; Insecta; Lepidoptera; Drepanidae). The genome sequence has a total length of 300.20 megabases. Most of the assembly is scaffolded into 32 chromosomal pseudomolecules, including the W and Z sex chromosomes. The mitochondrial genome has also been assembled and is 16.07 kilobases in length. Gene annotation of this assembly on Ensembl identified 11,709 protein-coding genes.
Collapse
Affiliation(s)
- Andy Griffiths
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
| | - Tom Prescott
- Butterfly Conservation Scotland, Stirling, Scotland, UK
| | | | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
- Butterfly Conservation Scotland, Stirling, Scotland, UK
| | | | | | | | | |
Collapse
|
28
|
Weber CC. Disentangling cobionts and contamination in long-read genomic data using sequence composition. G3 (BETHESDA, MD.) 2024; 14:jkae187. [PMID: 39148415 PMCID: PMC11540323 DOI: 10.1093/g3journal/jkae187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 08/02/2024] [Accepted: 08/02/2024] [Indexed: 08/17/2024]
Abstract
The recent acceleration in genome sequencing targeting previously unexplored parts of the tree of life presents computational challenges. Samples collected from the wild often contain sequences from several organisms, including the target, its cobionts, and contaminants. Effective methods are therefore needed to separate sequences. Though advances in sequencing technology make this task easier, it remains difficult to taxonomically assign sequences from eukaryotic taxa that are not well represented in databases. Therefore, reference-based methods alone are insufficient. Here, I examine how we can take advantage of differences in sequence composition between organisms to identify symbionts, parasites, and contaminants in samples, with minimal reliance on reference data. To this end, I explore data from the Darwin Tree of Life project, including hundreds of high-quality HiFi read sets from insects. Visualizing two-dimensional representations of read tetranucleotide composition learned by a variational autoencoder can reveal distinct components of a sample. Annotating the embeddings with additional information, such as coding density, estimated coverage, or taxonomic labels allows rapid assessment of the contents of a dataset. The approach scales to millions of sequences, making it possible to explore unassembled read sets, even for large genomes. Combined with interactive visualization tools, it allows a large fraction of cobionts reported by reference-based screening to be identified. Crucially, it also facilitates retrieving genomes for which suitable reference data are absent.
Collapse
Affiliation(s)
- Claudia C Weber
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| |
Collapse
|
29
|
Hartley M, Anita L, Babalola K, Russell C, Yoldaş AK, Zulueta-Coarasa T. Pictures at an exhibition: How to share your imaging data. J Microsc 2024; 296:145-149. [PMID: 37648214 DOI: 10.1111/jmi.13221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 08/22/2023] [Accepted: 08/28/2023] [Indexed: 09/01/2023]
Abstract
Open access to data underpinning published results is a key pillar of scientific reproducibility. Making data available at scale also provides opportunities for data reuse, encouraging the development of new analysis approaches. In this poster article, accompanying a recorded talk, we will explain the benefits of publicly archiving your image data alongside your published manuscripts, as well as highlight what resources are available to do this. This will include the BioImage Archive, EMBL-EBI's new resource for biological image data, https://www.ebi.ac.uk/bioimage-archive/. We will look at how image data submission works, how to prepare in advance for archiving your data and upcoming developments.
Collapse
Affiliation(s)
- Matthew Hartley
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Liviu Anita
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Kolawole Babalola
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Craig Russell
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Aybüke Küpcü Yoldaş
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Teresa Zulueta-Coarasa
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
30
|
Chung G, Piano F, Gunsalus KC. TeloSearchLR: an algorithm to detect novel telomere repeat motifs using long sequencing reads. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.29.617943. [PMID: 39554068 PMCID: PMC11565940 DOI: 10.1101/2024.10.29.617943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2024]
Abstract
Telomeres are eukaryotic chromosome end structures that guard against sequence loss and aberrant chromosome fusions. Telomeric repeat motifs (TRMs), the minimal repeating unit of a telomere, vary from species to species, with some evolutionary clades experiencing a rapid sequence divergence. To explore the full scope of this evolutionary divergence, many bioinformatic tools have been developed to infer novel TRMs using repetitive sequence search on short sequencing reads. However, novel telomeric motifs remain unidentified in up to half of the sequencing libraries assayed with these tools. A possible reason may be that short reads, derived from extensively sheared DNA, preserve little to no positional context of the repetitive sequences assayed. On the other hand, if a sequencing read is sufficiently long, telomeric sequences must appear at either end rather than in the middle. The TeloSearchLR algorithm relies on this to help identify novel TRMs on long reads, in many cases where short-read search tools have failed. In addition, we demonstrate that TeloSearchLR can reveal unusually long telomeric motifs not maintained by telomerase, and it can also be used to anchor terminal scaffolds in new genome assemblies.
Collapse
|
31
|
Kaur G, Perteghella T, Carbonell-Sala S, Gonzalez-Martinez J, Hunt T, Mądry T, Jungreis I, Arnan C, Lagarde J, Borsari B, Sisu C, Jiang Y, Bennett R, Berry A, Cerdán-Vélez D, Cochran K, Vara C, Davidson C, Donaldson S, Dursun C, González-López S, Gopal Das S, Hardy M, Hollis Z, Kay M, Montañés JC, Ni P, Nurtdinov R, Palumbo E, Pulido-Quetglas C, Suner MM, Yu X, Zhang D, Loveland JE, Albà MM, Diekhans M, Tanzer A, Mudge JM, Flicek P, Martin FJ, Gerstein M, Kellis M, Kundaje A, Paten B, Tress ML, Johnson R, Uszczynska-Ratajczak B, Frankish A, Guigó R. GENCODE: massively expanding the lncRNA catalog through capture long-read RNA sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.29.620654. [PMID: 39554180 PMCID: PMC11565817 DOI: 10.1101/2024.10.29.620654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2024]
Abstract
Accurate and complete gene annotations are indispensable for understanding how genome sequences encode biological functions. For twenty years, the GENCODE consortium has developed reference annotations for the human and mouse genomes, becoming a foundation for biomedical and genomics communities worldwide. Nevertheless, collections of important yet poorly-understood gene classes like long non-coding RNAs (lncRNAs) remain incomplete and scattered across multiple, uncoordinated catalogs, slowing down progress in the field. To address these issues, GENCODE has undertaken the most comprehensive lncRNAs annotation effort to date. This is founded on the manual annotation of full-length targeted long-read sequencing, on matched embryonic and adult tissues, of orthologous regions in human and mouse. Altogether 17,931 novel human genes (140,268 novel transcripts) and 22,784 novel mouse genes (136,169 novel transcripts) have been added to the GENCODE catalog representing a 2-fold and 6-fold increase in transcripts, respectively - the greatest increase since the sequencing of the human genome. Novel gene annotations display evolutionary constraints, have well-formed promoter regions, and link to phenotype-associated genetic variants. They greatly enhance the functional interpretability of the human genome, as they help explain millions of previously-mapped "orphan" omics measurements corresponding to transcription start sites, chromatin modifications and transcription factor binding sites. Crucially, our targeted design assigned human-mouse orthologs at a rate beyond previous studies, tripling the number of human disease-associated lncRNAs with mouse orthologs. The expanded and enhanced GENCODE lncRNA annotations mark a critical step towards deciphering the human and mouse genomes.
Collapse
Affiliation(s)
- Gazaldeep Kaur
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
| | - Tamara Perteghella
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
- Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra (UPF)
| | - Sílvia Carbonell-Sala
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
| | - Jose Gonzalez-Martinez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tomasz Mądry
- Department of Computational Biology of Noncoding RNA, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Irwin Jungreis
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, MA 02139, USA
- The Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - Carme Arnan
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
| | - Julien Lagarde
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
- Flomics Biotech, SL, Carrer de Roc Boronat 31, 08005 Barcelona, Catalonia, Spain
| | - Beatrice Borsari
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Cristina Sisu
- Department of Life Sciences, Brunel University London, Uxbridge, London, UB8 3PH, UK
| | - Yunzhe Jiang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Ruth Bennett
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrew Berry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel Cerdán-Vélez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Calle Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Kelly Cochran
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Covadonga Vara
- Hospital del Mar Research Institute, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Claire Davidson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sarah Donaldson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Cagatay Dursun
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Silvia González-López
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
- Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra (UPF)
| | - Sasti Gopal Das
- Department of Computational Biology of Noncoding RNA, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Matthew Hardy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Zoe Hollis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mike Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Pengyu Ni
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Ramil Nurtdinov
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
| | - Emilio Palumbo
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
| | - Carlos Pulido-Quetglas
- Department of Medical Oncology, Bern University Hospital, Murtenstrasse 35, 3008 Bern, Switzerland
- School of Biology and Environmental Science, University College Dublin, University College Dublin, Belfield, Dublin 4, D04 V1W8, Ireland
| | - Marie-Marthe Suner
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Xuezhu Yu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Dingyao Zhang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Jane E Loveland
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - M Mar Albà
- Hospital del Mar Research Institute, Dr. Aiguader 88, Barcelona 08003, Spain
- Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, Spain
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, 2300 Delaware Avenue, University of California, Santa Cruz, CA 95060, USA
| | - Andrea Tanzer
- University of Vienna, Research Network Data Science, Kolingasse 14-16, 1090 Vienna, Austria
- University of Vienna, Faculty of Computer Science, Research Group Visualization and Data Analysis, Waehringerstrasse 29, 1090 Vienna, Austria
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, MA 02139, USA
- The Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, 2300 Delaware Avenue, University of California, Santa Cruz, CA 95060, USA
| | - Michael L Tress
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Calle Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Rory Johnson
- Department of Medical Oncology, Bern University Hospital, Murtenstrasse 35, 3008 Bern, Switzerland
- School of Biology and Environmental Science, University College Dublin, University College Dublin, Belfield, Dublin 4, D04 V1W8, Ireland
| | - Barbara Uszczynska-Ratajczak
- Department of Computational Biology of Noncoding RNA, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain
- Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra (UPF)
| |
Collapse
|
32
|
Adkins P, Harley J, Brittain R, Scott-Somme K, Azzopardi F. The genome sequence of the John Dory, Zeus faber Linnaeus, 1758. Wellcome Open Res 2024; 9:150. [PMID: 38881949 PMCID: PMC11179049 DOI: 10.12688/wellcomeopenres.21140.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/11/2024] [Indexed: 06/18/2024] Open
Abstract
We present a genome assembly from an individual Zeus faber (the John Dory; Chordata; Actinopteri; Zeiformes; Zeidae). The genome sequence is 804.7 megabases in span. Most of the assembly is scaffolded into 22 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 16.72 kilobases in length.
Collapse
Affiliation(s)
- Patrick Adkins
- The Marine Biological Association, Plymouth, England, UK
| | - Joanna Harley
- The Marine Biological Association, Plymouth, England, UK
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Beaven R, Denholm B. The cryptonephridial/rectal complex: an evolutionary adaptation for water and ion conservation. Biol Rev Camb Philos Soc 2024. [PMID: 39438273 DOI: 10.1111/brv.13156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 10/08/2024] [Accepted: 10/10/2024] [Indexed: 10/25/2024]
Abstract
Arthropods have integrated digestive and renal systems, which function to acquire and maintain homeostatically the substances they require for survival. The cryptonephridial complex (CNC) is an evolutionary novelty in which the renal organs and gut have been dramatically reorganised. Parts of the renal or Malpighian tubules (MpTs) form a close association with the surface of the rectum, and are surrounded by a novel tissue, the perinephric membrane, which acts to insulate the system from the haemolymph and thus allows tight regulation of ions and water into and out of the CNC. The CNC can reclaim water and solutes from the rectal contents and recycle these back into the haemolymph. Fluid flow in the MpTs runs counter to flow within the rectum. It is this countercurrent arrangement that underpins its powerful recycling capabilities, and represents one of the most efficient water conservation mechanisms in nature. CNCs appear to have evolved multiple times, and are present in some of the largest and most evolutionarily successful insect groups including the larvae of most Lepidoptera and in a major beetle lineage (Cucujiformia + Bostrichoidea), suggesting that the CNC is an important adaptation. Here we review the knowledge of this remarkable organ system gained over the past 200 years. We first focus on the CNCs of tenebrionid beetles, for which we have an in-depth understanding from physiological, structural and ultrastructural studies (primarily in Tenebrio molitor), which are now being extended by studies in Tribolium castaneum enabled by advances in molecular and microscopy approaches established for this species. These recent studies are beginning to illuminate CNC development, physiology and endocrine control. We then take a broader view of arthropod CNCs, phylogenetically mapping their reported occurrence to assess their distribution and likely evolutionary origins. We explore CNCs from an ecological viewpoint, put forward evidence that CNCs may primarily be adaptations for facing the challenges of larval life, and argue that their loss in many aquatic species could point to a primary function in conserving water in terrestrial species. Finally, by considering the functions of renal and digestive epithelia in insects lacking CNCs, as well as the typical architecture of these organs in relation to one another, we propose that ancestral features of these organs predispose them for the evolution of CNCs.
Collapse
Affiliation(s)
- Robin Beaven
- Hugh Robson Building, George Square, Deanery of Biomedical Sciences, The University of Edinburgh, Edinburgh, EH8 9XD, UK
| | - Barry Denholm
- Hugh Robson Building, George Square, Deanery of Biomedical Sciences, The University of Edinburgh, Edinburgh, EH8 9XD, UK
| |
Collapse
|
34
|
Blaxter M, Pauperio J, Schoch C, Howe K. Taxonomy Identifiers (TaxId) for Biodiversity Genomics: a guide to getting TaxId for submission of data to public databases. Wellcome Open Res 2024; 9:591. [PMID: 39526195 PMCID: PMC11544195 DOI: 10.12688/wellcomeopenres.22949.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2024] [Indexed: 11/16/2024] Open
Abstract
Biodiversity genomics critically depends on correct taxonomic identification of the sample from which data are derived. Tracking of that taxonomic information through systems that archive data and report on genome sequencing efforts. For submission of data to the International Nucleotide Sequence Database Collaboration (INSDC) databases (DNA DataBank of Japan [DDBJ], European Nucleotide Archive [ENA] and National Center for Biotechnology Information [NCBI]), samples and data derived from them must be assigned a species-level NCBI Taxonomy taxonomic identifier (TaxId, sometimes referred to as taxId or txid). We thus need to be able to identify the TaxId for a target species efficiently. Because the NCBI Taxonomy does not include all known species and cannot preemptively represent unknown taxa, we also need an efficient process for generating new TaxIds for species not yet listed. This document provides workflows for different kinds of TaxId acquisition scenarios and was created to guide users in these processes. Although developed for European projects such as Darwin Tree of Life and the European Reference Genome Atlas, the workflows are universally applicable and describe the use of ENA in resolving taxonomic issues. Too Long: Didn't Read (TL;DR): Use the ENA REST API programmatically to retrieve TaxIds for target species and confirm that sequence data can be submitted to those TaxIds.Use the NCBI Web interface to NCBI Taxonomy to identify potential homotypic synonyms.Request a new TaxId from ENA for a species not yet in NCBI Taxonomy, and for species-like entries for which the full Linnaean binomen is not determined (see https://ena-docs.readthedocs.io/en/latest/faq/taxonomy_requests.html#creating-taxon-requests).Discuss directly with the NCBI Taxonomy curators or the curators at ENA and NCBI whenever you think there is an opportunity to improve their database.
Collapse
Affiliation(s)
- Mark Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Joana Pauperio
- European Nucleotide Archive, European Bioinformatics Institute, Hinxton, England, UK
| | - Conrad Schoch
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland, USA
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| |
Collapse
|
35
|
Patsakis M, Provatas K, Baltoumas FA, Chantzi N, Mouratidis I, Pavlopoulos GA, Georgakopoulos-Soares I. MAFin: Motif Detection in Multiple Alignment Files. ARXIV 2024:arXiv:2410.11021v1. [PMID: 39483349 PMCID: PMC11527099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
Motivation Genome and Proteome Alignments, represented by the Multiple Alignment File (MAF) format, have become a standard approach in the field of comparative genomics and proteomics. However, current approaches lack a direct method for motif detection within MAF files. To address this gap, we present MAFin, a novel tool that enables efficient motif detection and conservation analysis in MAF files, streamlining genomic and proteomic research. Results We developed MAFin, the first motif detection tool for Multiple Alignment Format files. MAFin enables the multithreaded search of conserved motifs using three approaches: 1) by using user-specified k-mers to search the sequences. 2) with regular expressions, in which case one or more patterns are searched, and 3) with predefined Position Weight Matrices. Once the motif has been found, MAFin detects the motif instances and calculates the conservation across the aligned sequences. MAFin also calculates a conservation percentage, which provides information about the conservation levels of each motif across the aligned sequences, based on the number of matches relative to the length of the motif. A set of statistics enable the interpretation of each motif's conservation level, and the detected motifs are exported in JSON and CSV files for downstream analyses. Availability MAFin is released as a Python package under the GPL license as a multi-platform application and is available at: https://github.com/Georgakopoulos-Soares-lab/MAFin.
Collapse
Affiliation(s)
- Michail Patsakis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Kimonas Provatas
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari 16672, Greece
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | | | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
36
|
Prescott T. The genome sequence of the Satin Lutestring moth, Tetheella fluctuosa (Hübner, 1803). Wellcome Open Res 2024; 9:576. [PMID: 39534536 PMCID: PMC11555359 DOI: 10.12688/wellcomeopenres.23076.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/16/2024] [Indexed: 11/16/2024] Open
Abstract
We present a genome assembly from an individual female Tetheella fluctuosa (the Satin Lutestring moth; Arthropoda; Insecta; Lepidoptera; Drepanidae). The genome sequence has a total length of 369.10 megabases. Most of the assembly is scaffolded into 32 chromosomal pseudomolecules, including the Z and W sex chromosomes. The mitochondrial genome has also been assembled and is 15.41 kilobases in length. Gene annotation of this assembly on Ensembl identified 18,318 protein-coding genes.
Collapse
Affiliation(s)
- Tom Prescott
- Butterfly Conservation Scotland, Stirling, Scotland, UK
| | | | | | | | | | | |
Collapse
|
37
|
Whitla R, Hens K, Hogan J, Martin G, Breuker C, Shreeve TG, Arif S. The last days of Aporia crataegi (L.) in Britain: Evaluating genomic erosion in an extirpated butterfly. Mol Ecol 2024; 33:e17518. [PMID: 39192591 DOI: 10.1111/mec.17518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Accepted: 07/10/2024] [Indexed: 08/29/2024]
Abstract
Current rates of habitat degradation and climate change are causing unprecedented declines in global biodiversity. Studies on vertebrates highlight how conservation genomics can be effective in identifying and managing threatened populations, but it is unclear how vertebrate-derived metrics of genomic erosion translate to invertebrates, with their markedly different population sizes and life histories. The Black-veined White butterfly (Aporia crataegi) was extirpated from Britain in the 1920s. Here, we sequenced historical DNA from 17 specimens collected between 1854 and 1924 to reconstruct demography and compare levels of genomic erosion between extirpated British and extant European mainland populations. We contrast these results using modern samples of the Common Blue butterfly (Polyommatus icarus); a species with relatively stable demographic trends in Great Britain. We provide evidence for bottlenecks in both these species around the period of post-glacial colonization of the British Isles. Our results reveal different demographic histories and Ne for both species, consistent with their fates in Britain, likely driven by differences in life history, ecology and genome size. Despite a difference, by an order of magnitude, in historical effective population sizes (Ne), reduction in genome-wide heterozygosity in A. crataegi was comparable to that in P. icarus. Symptomatic of A. crataegi's disappearance were marked increases in runs-of-homozygosity (RoH), potentially indicative of recent inbreeding, and accumulation of putatively mildly and weakly deleterious variants. Our results provide a rare glimpse of genomic erosion in a regionally extinct insect and support the potential use of genomic erosion metrics in identifying invertebrate populations or species in decline.
Collapse
Affiliation(s)
- Rebecca Whitla
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | - Korneel Hens
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Centre for Functional Genomics, Oxford Brookes University, Oxford, UK
| | - James Hogan
- Oxford University Museum of Natural History, Oxford, UK
| | - Geoff Martin
- Insects Division, Natural History Museum, London, UK
| | - Casper Breuker
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Centre for Functional Genomics, Oxford Brookes University, Oxford, UK
| | - Timothy G Shreeve
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | - Saad Arif
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Centre for Functional Genomics, Oxford Brookes University, Oxford, UK
| |
Collapse
|
38
|
Thomas Thorpe JA. Phylogenomics supports a single origin of terrestriality in isopods. Proc Biol Sci 2024; 291:20241042. [PMID: 39471855 PMCID: PMC11521608 DOI: 10.1098/rspb.2024.1042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 07/18/2024] [Accepted: 09/04/2024] [Indexed: 11/01/2024] Open
Abstract
Terrestriality, the adaptation to life on land, is one of the key evolutionary transitions, occurring numerous times across the tree of life. Within Arthropoda, there have been several independent transitions: in hexapods, myriapods, arachnids and isopods. Isopoda is a morphologically diverse order within Crustacea, with species adapted to almost every environment on Earth. The order is divided into 11 suborders with the most speciose, Oniscidea, including terrestrial isopods such as woodlice and sea-slaters. Recent molecular phylogenetic studies have challenged traditional isopod morphological taxonomy, suggesting that several well-accepted suborders, including Oniscidea, may be non-monophyletic. This implies that terrestriality may have evolved multiple times. Current molecular hypotheses, however, are based on limited sequence data. Here, I collate available genome and transcriptome datasets for 36 isopods and four peracarid crustaceans from public sources, generate assemblies and use 970 single-copy orthologues to estimate isopod relationships and divergence times with molecular dating. The resulting phylogenetic analyses support monophyly of terrestrial isopods and suggest conflicting relationships based on nuclear ribosomal RNA sequences may be caused by long-branch attraction. Dating analyses suggest a Permo-Carboniferous origin of isopod terrestriality, much more recently than other terrestrial arthropods.
Collapse
|
39
|
Ye X, Yang Y, Zhao X, Fang Q, Ye G. The state of parasitoid wasp genomics. Trends Parasitol 2024; 40:914-929. [PMID: 39227194 DOI: 10.1016/j.pt.2024.08.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 08/12/2024] [Accepted: 08/12/2024] [Indexed: 09/05/2024]
Abstract
Parasitoid wasps represent a group of parasitic insects with high species diversity that have played a pivotal role in biological control and evolutionary studies. Over the past 20 years, developments in genomics have greatly enhanced our understanding of the biology of these species. Technological leaps in sequencing have facilitated the improvement of genome quality and quantity, leading to the availability of hundreds of parasitoid wasp genomes. Here, we summarize recent progress in parasitoid wasp genomics, focusing on the evolution of genome size (GS) and the genomic basis of several key traits. We also discuss the contributions of genomics in studying venom evolution and endogenization of viruses. Finally, we advocate for increased sequencing and functional research to better understand parasitoid biology and enhance biological control.
Collapse
Affiliation(s)
- Xinhai Ye
- College of Advanced Agriculture Sciences, Zhejiang A&F University, Hangzhou, China.
| | - Yi Yang
- State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Xianxin Zhao
- State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Qi Fang
- State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Gongyin Ye
- State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China.
| |
Collapse
|
40
|
Griffiths A, Wawman DC, Crowley LM. The genome sequence of the Broom moth, Ceramica pisi Linnaeus, 1758. Wellcome Open Res 2024; 9:539. [PMID: 39554249 PMCID: PMC11568375 DOI: 10.12688/wellcomeopenres.23050.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/09/2024] [Indexed: 11/19/2024] Open
Abstract
We present a genome assembly from an individual male Ceramica pisi (the Broom moth; Arthropoda; Insecta; Lepidoptera; Noctuidae). The genome sequence spans 732.70 megabases. Most of the assembly is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.31 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,916 protein-coding genes.
Collapse
Affiliation(s)
- Andy Griffiths
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
| | - Denise C. Wawman
- Department of Biology, University of Oxford, Oxford, England, UK
| | - Liam M. Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | - University of Oxford and Wytham Woods Genome Acquisition Lab
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
- Department of Biology, University of Oxford, Oxford, England, UK
| | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
- Department of Biology, University of Oxford, Oxford, England, UK
| | - Wellcome Sanger Institute Scientific Operations: Sequencing Operations
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
- Department of Biology, University of Oxford, Oxford, England, UK
| | - Wellcome Sanger Institute Tree of Life Core Informatics team
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
- Department of Biology, University of Oxford, Oxford, England, UK
| | - Tree of Life Core Informatics collective
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
- Department of Biology, University of Oxford, Oxford, England, UK
| | | |
Collapse
|
41
|
Mc Cartney AM, Formenti G, Mouton A, De Panis D, Marins LS, Leitão HG, Diedericks G, Kirangwa J, Morselli M, Salces-Ortiz J, Escudero N, Iannucci A, Natali C, Svardal H, Fernández R, De Pooter T, Joris G, Strazisar M, Wood JMD, Herron KE, Seehausen O, Watts PC, Shaw F, Davey RP, Minotto A, Fernández JM, Böhne A, Alegria C, Alioto T, Alves PC, Amorim IR, Aury JM, Backstrom N, Baldrian P, Baltrunaite L, Barta E, BedHom B, Belser C, Bergsten J, Bertrand L, Bilandija H, Binzer-Panchal M, Bista I, Blaxter M, Borges PAV, Dias GB, Bosse M, Brown T, Bruggmann R, Buena-Atienza E, Burgin J, Buzan E, Cariani A, Casadei N, Chiara M, Chozas S, Čiampor F, Crottini A, Cruaud C, Cruz F, Dalen L, De Biase A, Del Campo J, Delic T, Dennis AB, Derks MFL, Diroma MA, Djan M, Duprat S, Eleftheriadi K, Feulner PGD, Flot JF, Forni G, Fosso B, Fournier P, Fournier-Chambrillon C, Gabaldon T, Garg S, Gissi C, Giupponi L, Gomez-Garrido J, González J, Grilo ML, Grüning B, Guerin T, Guiglielmoni N, Gut M, Haesler MP, Hahn C, Halpern B, Harrison PW, Heintz J, Hindrikson M, Höglund J, Howe K, Hughes GM, Istace B, Cock MJ, Janžekovič F, Jonsson ZO, Joye-Dind S, Koskimäki JJ, Krystufek B, Kubacka J, Kuhl H, Kusza S, Labadie K, Lähteenaro M, Lantz H, Lavrinienko A, Leclère L, Lopes RJ, Madsen O, Magdelenat G, Magoga G, Manousaki T, Mappes T, Marques JP, Redondo GIM, Maumus F, McCarthy SA, Megens HJ, Melo-Ferreira J, Mendes SL, Montagna M, Moreno J, Mosbech MB, Moura M, Musilova Z, Myers E, Nash WJ, Nater A, Nicholson P, Niell M, Nijland R, Noel B, Noren K, Oliveira PH, Olsen RA, Ometto L, Oomen RA, Ossowski S, Palinauskas V, Palsson S, Panibe JP, Pauperio J, Pavlek M, Payen E, Pawlowska J, Pellicer J, Pesole G, Pimenta J, Pippel M, Pirttilä AM, Poulakakis N, Rajan J, M C Rego R, Resendes R, Resl P, Riesgo A, Rodin-Morch P, Soares AER, Fernandes CR, Romeiras MM, Roxo G, Rüber L, Ruiz-Lopez MJ, Saarma U, da Silva LP, Sim-Sim M, Soler L, Sousa VC, Santos CS, Spada A, Stefanovic M, Steger V, Stiller J, Stöck M, Struck TH, Sudasinghe H, Tapanainen R, Tellgren-Roth C, Trindade H, Tukalenko Y, Urso I, Vacherie B, Van Belleghem SM, Van Oers K, Vargas-Chavez C, Velickovic N, Vella N, Vella A, Vernesi C, Vicente S, Villa S, Pettersson OV, Volckaert FAM, Voros J, Wincker P, Winkler S, Ciofi C, Waterhouse RM, Mazzoni CJ. The European Reference Genome Atlas: piloting a decentralised approach to equitable biodiversity genomics. NPJ BIODIVERSITY 2024; 3:28. [PMID: 39289538 PMCID: PMC11408602 DOI: 10.1038/s44185-024-00054-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Accepted: 07/19/2024] [Indexed: 09/19/2024]
Abstract
A genomic database of all Earth's eukaryotic species could contribute to many scientific discoveries; however, only a tiny fraction of species have genomic information available. In 2018, scientists across the world united under the Earth BioGenome Project (EBP), aiming to produce a database of high-quality reference genomes containing all ~1.5 million recognized eukaryotic species. As the European node of the EBP, the European Reference Genome Atlas (ERGA) sought to implement a new decentralised, equitable and inclusive model for producing reference genomes. For this, ERGA launched a Pilot Project establishing the first distributed reference genome production infrastructure and testing it on 98 eukaryotic species from 33 European countries. Here we outline the infrastructure and explore its effectiveness for scaling high-quality reference genome production, whilst considering equity and inclusion. The outcomes and lessons learned provide a solid foundation for ERGA while offering key learnings to other transnational, national genomic resource projects and the EBP.
Collapse
Affiliation(s)
- Ann M Mc Cartney
- Genomics Institute, University of California, Santa Cruz, CA, USA.
| | - Giulio Formenti
- The Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Department of Biology, University of Florence, Sesto Fiorentino, Italy
| | - Alice Mouton
- Department of Biology, University of Florence, Sesto Fiorentino, Italy
- InBios-Conservation Genetics Laboratory, University of Liege, Liege, Belgium
| | - Diego De Panis
- Leibniz Institut für Zoo und Wildtierforschung, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
| | - Luísa S Marins
- Leibniz Institut für Zoo und Wildtierforschung, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
| | | | | | - Joseph Kirangwa
- Institute of Zoology, University of Cologne, Cologne, Germany
| | - Marco Morselli
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parma, Italy
| | - Judit Salces-Ortiz
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Nuria Escudero
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Alessio Iannucci
- Department of Biology, University of Florence, Sesto Fiorentino, Italy
| | - Chiara Natali
- Department of Biology, University of Florence, Sesto Fiorentino, Italy
| | - Hannes Svardal
- Department of Biology, University of Antwerp, Antwerp, Belgium
- Naturalis Biodiversity Center, Leiden, The Netherlands
| | - Rosa Fernández
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Tim De Pooter
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Antwerp, Belgium
- Neuromics Support Facility, Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
| | - Geert Joris
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Antwerp, Belgium
- Neuromics Support Facility, Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
| | - Mojca Strazisar
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Antwerp, Belgium
- Neuromics Support Facility, Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
| | | | - Katie E Herron
- School of Biology and Environmental Science, University College Dublin, Belfield, Ireland
| | - Ole Seehausen
- Aquatic Ecology & Evolution, Institute of Ecology & Evolution, University of Bern, Bern, Switzerland
- Department of Fish Ecology & Evolution, Eawag, Kastanienbaum, Switzerland
| | - Phillip C Watts
- Department of Biological and Environmental Science, University of Jyvaskyla, Jyvaskyla, Finland
| | - Felix Shaw
- The Earlham Institute, Norwich Research Park, Norwich, UK
| | - Robert P Davey
- The Earlham Institute, Norwich Research Park, Norwich, UK
| | | | - José M Fernández
- Barcelona Supercomputing Center; Spanish National Bioinformatics Institute, ELIXIR Spain, Getafe, Spain
| | - Astrid Böhne
- Leibniz Institute for the Analysis of Biodiversity Change, Museum Koenig Bonn, Bonn, Germany
| | - Carla Alegria
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
| | - Tyler Alioto
- Centro Nacional de Análisis Genómico (CNAG), Barcelona, Spain
- Universitat de Barcelona (UB), Barcelona, Spain
| | - Paulo C Alves
- CIBIO, Centro de Investigacao em Biodiversidade e Recursos Geneticos, InBIO Laboratorio Associado, Universidade do Porto, Vairao, Portugal
- Departamento de Biologia, Faculdade de Ciencias, Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairao, Vairao, Portugal
| | - Isabel R Amorim
- University of the Azores, cE3c-Centre for Ecology, Evolution and Environmental Changes, Azorean Biodiversity Group, CHANGE-Global Change and Sustainability Institute, Rua Capitão João d´Ávila, Pico da Urze, Angra do Heroísmo, Portugal
| | - Jean-Marc Aury
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | - Niclas Backstrom
- Evolutionary Biology Program, Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| | - Petr Baldrian
- Institute of Microbiology of the Czech Academy of Sciences, Praha, Czech Republic
| | | | - Endre Barta
- Institute of Biochemistry and Molecular Biology, Faculty of Medicine, University of Debrecen, Debrecen, Hungary
| | - Bertrand BedHom
- Institut de Systematique, Evolution, Biodiversite, Museum National d Histoire Naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
| | - Caroline Belser
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | - Johannes Bergsten
- Department of Zoology, Swedish Museum of Natural History, Stockholm, Sweden
- Department of Zoology, Faculty of Science, Stockholm University, Stockholm, Sweden
| | - Laurie Bertrand
- Genoscope, Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
| | | | - Mahesh Binzer-Panchal
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Iliana Bista
- Senckenberg Research Institute, Frankfurt, Germany
- LOEWE Centre for Translational Biodiversity Genomics, Frankfurt, Germany
- Wellcome CRUK Gurdon Institute, University of Cambridge, Cambridge, UK
| | - Mark Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
| | - Paulo A V Borges
- University of the Azores, cE3c-Centre for Ecology, Evolution and Environmental Changes, Azorean Biodiversity Group, CHANGE-Global Change and Sustainability Institute, Rua Capitão João d´Ávila, Pico da Urze, Angra do Heroísmo, Portugal
| | - Guilherme Borges Dias
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Mirte Bosse
- VU University Amsterdam, Amsterdam, The Netherlands
- Animal Breeding & Genomics, Wageningen University & Research, Wageningen, The Netherlands
- Wageningen University & Research, Wageningen, The Netherlands
| | - Tom Brown
- Leibniz Institut für Zoo und Wildtierforschung, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- DRESDEN concept Genome Center, Dresden, Germany
| | - Rémy Bruggmann
- Interfaculty Bioinformatics Unit and Swiss Institute of Bioinformatics, University of Bern, Bern, Switzerland
| | - Elena Buena-Atienza
- Institute of Medical Genetics and Applied Genomics, University of Tubingen, Tubingen, Germany
- NGS Competence Center Tubingen, Tubingen, Germany
| | - Josephine Burgin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Elena Buzan
- University of Primorska, Faculty of Mathematics, Natural Sciences and Information Technologies, Koper, Slovenia
- Faculty of Environmental Protection, Velenje, Slovenia
| | - Alessia Cariani
- Department of Biological, Geological and Environmental Sciences, Alma Mater Studiorum Universitá di Bologna, Bologna, Italy
| | - Nicolas Casadei
- Institute of Medical Genetics and Applied Genomics, University of Tubingen, Tubingen, Germany
- NGS Competence Center Tubingen, Tubingen, Germany
| | - Matteo Chiara
- Department of Biosciences, Università degli Studi di Milano, Milan, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, Consiglio Nazionale delle Ricerche, Bari, Italy
| | - Sergio Chozas
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- Sociedade Portuguesa de Botânica, Lisbon, Portugal
| | - Fedor Čiampor
- Department of Biodiversity and Ecology, Plant Science and Biodiversity Centre Slovak Academy of Sciences, Bratislava, Slovakia
| | - Angelica Crottini
- CIBIO, Centro de Investigacao em Biodiversidade e Recursos Geneticos, InBIO Laboratorio Associado, Universidade do Porto, Vairao, Portugal
- Departamento de Biologia, Faculdade de Ciencias, Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairao, Vairao, Portugal
| | - Corinne Cruaud
- Genoscope, Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
| | - Fernando Cruz
- Centro Nacional de Análisis Genómico (CNAG), Barcelona, Spain
- Universitat de Barcelona (UB), Barcelona, Spain
| | - Love Dalen
- Department of Zoology, Stockholm University, Stockholm, Sweden
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
- Centre for Palaeogenetics, Stockholm, Sweden
| | - Alessio De Biase
- Department of Biology and Biotechnologies, Sapienza University of Rome, Rome, Italy
| | - Javier Del Campo
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Teo Delic
- University of Ljubljana, Biotechnical Faculty, Department of Biology, Ljubljana, Slovenia
| | - Alice B Dennis
- University of Namur, Department of Biology, URBE, ILEE, Namur, Belgium
| | - Martijn F L Derks
- Animal Breeding & Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | | | - Mihajla Djan
- Department of Biology and Ecology, University of Novi Sad, Novi Sad, Serbia
| | - Simone Duprat
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | - Klara Eleftheriadi
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Philine G D Feulner
- Eawag Swiss Federal Institute of Aquatic Science and Technology, Department of Fish Ecology & Evolution, Kastanienbaum, Switzerland
| | - Jean-François Flot
- Department of Organismal Biology, Universite libre de Bruxelles, Brussels, Belgium
| | - Giobbe Forni
- Department of Biological, Geological and Environmental Sciences, Alma Mater Studiorum Universitá di Bologna, Bologna, Italy
| | - Bruno Fosso
- Department of Biosciences, Biotechnology and Environment, University of Bari Aldo Moro, Bari, Italy
| | - Pascal Fournier
- Groupe de Recherche et d Etude pour la Gestion de l Environnement, Villandraut, France
| | | | - Toni Gabaldon
- Barcelona Supercomputing Centre (BSC), Barcelona, Spain
- Institute for Research in Biomedicine (IRB), Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
- CIBERINFEC, Instituto Carlos III, Barcelona, Spain
| | - Shilpa Garg
- NNF Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Carmela Gissi
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, Consiglio Nazionale delle Ricerche, Bari, Italy
- Department of Biosciences, Biotechnology and Environment, University of Bari Aldo Moro, Bari, Italy
- CoNISMa, Consorzio Nazionale Interuniversitario per le Scienze del Mare, Roma, Italy
| | - Luca Giupponi
- Centre of Applied Studies for the Sustainable Management and Protection of Mountain Areas CRC Ge.S.Di.Mont., University of Milan, Milan, Italy
- Department of Agricultural and Environmental Sciences-Production, Landscape and Agroenergy DiSAA, University of Milan, Milan, Italy
| | - Jessica Gomez-Garrido
- Centro Nacional de Análisis Genómico (CNAG), Barcelona, Spain
- Universitat de Barcelona (UB), Barcelona, Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Miguel L Grilo
- Marine and Environmental Sciences Centre, Aquatic Research Network, Instituto Universitário de Ciências Psicológicas, Sociais e da Vida, Lisboa, Portugal
- Egas Moniz Center for Interdisciplinary Research (CiiEM), Egas Moniz School of Health & Science, Caparica, Portugal
| | - Björn Grüning
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, Germany
| | - Thomas Guerin
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | | | - Marta Gut
- Centro Nacional de Análisis Genómico (CNAG), Barcelona, Spain
- Universitat de Barcelona (UB), Barcelona, Spain
| | - Marcel P Haesler
- Aquatic Ecology & Evolution, Institute of Ecology & Evolution, University of Bern, Bern, Switzerland
- Department of Fish Ecology & Evolution, Eawag, Kastanienbaum, Switzerland
| | - Christoph Hahn
- Department of Biology, University of Graz, Graz, Austria
| | - Balint Halpern
- MME BirdLife Hungary, Budapest, Hungary
- Doctoral School of Biology, Department of Systematic Zoology and Ecology, Institute of Biology, ELTE Eotvos Lorand University, Budapest, Hungary
- HUN-REN-ELTE-MTM Integrative Ecology Research Group, Budapest, Hungary
| | - Peter W Harrison
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Julia Heintz
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Maris Hindrikson
- Department of Zoology, Institute of Ecology and Earth Sciences, University of Tartu, Tartu, Estonia
| | - Jacob Höglund
- Institute of Life and Environmental Sciences, University of Iceland, Reykjavik, Iceland
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
| | - Graham M Hughes
- School of Biology and Environmental Science, University College Dublin, Belfield, Ireland
- UCD Conway Institute, University College Dublin, Belfield, Ireland
| | - Benjamin Istace
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | - Mark J Cock
- Algal Genetics Group, UMR 8227, CNRS, Sorbonne Universite, UPMC University Paris 06, Paris, France
- France Integrative Biology of Marine Models, Station Biologique de Roscoff, Roscoff, France
| | - Franc Janžekovič
- University of Maribor, Faculty of Natural Sciences and Mathematics, Maribor, Slovenia
| | - Zophonias O Jonsson
- Institute of Life and Environmental Sciences, University of Iceland, Reykjavik, Iceland
| | - Sagane Joye-Dind
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Janne J Koskimäki
- Ecology and Genetics Research Unit, University of Oulu, Oulu, Finland
| | - Boris Krystufek
- Slovenian Museum of Natural History, Ljubljana, Slovenia
- Science and Research Centre Koper, Koper, Slovenia
| | - Justyna Kubacka
- Museum and Institute of Zoology, Polish Academy of Sciences, Warsaw, Poland
| | - Heiner Kuhl
- Department IV Fish Biology, Fisheries and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries, Berlin, Germany
| | - Szilvia Kusza
- University of Debrecen, Centre for Agricultural Genomics and Biotechnology, Debrecen, Hungary
| | - Karine Labadie
- Genoscope, Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
| | - Meri Lähteenaro
- Department of Zoology, Swedish Museum of Natural History, Stockholm, Sweden
- Department of Zoology, Faculty of Science, Stockholm University, Stockholm, Sweden
| | - Henrik Lantz
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Anton Lavrinienko
- Laboratory of Food Systems Biotechnology, Institute of Food, Nutrition, and Health, ETH Zurich, Zurich, Switzerland
| | - Lucas Leclère
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins (BIOM), Banyuls-sur-Mer, France
| | - Ricardo Jorge Lopes
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- MHNC-UP, Natural History and Science Museum of the University of Porto, Porto, Portugal
| | - Ole Madsen
- Animal Breeding & Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | | | - Giulia Magoga
- Department of Agricultural Sciences, University of Naples Federico II, Portici, Italy
| | - Tereza Manousaki
- Hellenic Centre for Marine Research (HCMR), Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Heraklion, Crete, Greece
| | - Tapio Mappes
- Department of Biological and Environmental Science, University of Jyvaskyla, Jyvaskyla, Finland
| | - Joao Pedro Marques
- CIBIO, Centro de Investigacao em Biodiversidade e Recursos Geneticos, InBIO Laboratorio Associado, Universidade do Porto, Vairao, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairao, Vairao, Portugal
| | | | - Florian Maumus
- Universite Paris Saclay, INRAE, URGI, Versailles, France
| | - Shane A McCarthy
- Department of Genetics, University of Cambridge, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Hendrik-Jan Megens
- Animal Breeding & Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Jose Melo-Ferreira
- CIBIO, Centro de Investigacao em Biodiversidade e Recursos Geneticos, InBIO Laboratorio Associado, Universidade do Porto, Vairao, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairao, Vairao, Portugal
- Departamento de Biologia, Faculdade de Ciencias da Universidade do Porto, Porto, Portugal
| | - Sofia L Mendes
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
| | - Matteo Montagna
- Department of Agricultural Sciences, University of Naples Federico II, Portici, Italy
- Interuniversity Center for Studies on Bioinspired Agro Environmental Technology, University of Naples Federico II, Naples, Italy
| | - Joao Moreno
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- MARE Marine and Environmental Sciences Centre, ARNET Aquatic Research Network, Lisboa, Portugal
| | - Mai-Britt Mosbech
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Mónica Moura
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Pólo dos Açores; Faculdade de Ciências e Tecnologia, Universidade dos Açores, Ponta Delgada, Portugal
- UNESCO, Chair Land Within Sea Biodiversity & Sustainability in Atlantic Islands, Portugal
| | - Zuzana Musilova
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Eugene Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- DRESDEN concept Genome Center, Dresden, Germany
| | - Will J Nash
- The Earlham Institute, Norwich Research Park, Norwich, UK
| | - Alexander Nater
- Interfaculty Bioinformatics Unit and Swiss Institute of Bioinformatics, University of Bern, Bern, Switzerland
| | - Pamela Nicholson
- Next Generation Sequencing Platform, University of Bern, Bern, Switzerland
| | - Manuel Niell
- Andorra Research and Innovation, Sant Julià de Lòria, Andorra
| | - Reindert Nijland
- Marine Animal Ecology Group, Wageningen University and Research, Wageningen, The Netherlands
| | - Benjamin Noel
- University of the Azores, cE3c-Centre for Ecology, Evolution and Environmental Changes, Azorean Biodiversity Group, CHANGE-Global Change and Sustainability Institute, Rua Capitão João d´Ávila, Pico da Urze, Angra do Heroísmo, Portugal
| | - Karin Noren
- Department of Zoology, Stockholm University, Stockholm, Sweden
| | - Pedro H Oliveira
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | - Remi-Andre Olsen
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
| | - Lino Ometto
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
- National Biodiversity Future Center, Palermo, Italy
| | - Rebekah A Oomen
- Centre for Ecological and Evolutionary Synthesis, University of Oslo, Oslo, Norway
- University of New Brunswick Saint John, Saint John, New Brunswick, Canada
| | - Stephan Ossowski
- Institute for Medical Genetics and Applied Genomics, University of Tubingen, Tubingen, Germany
- NGS Competence Center Tubingen (NCCT), University of Tubingen, Tubingen, Germany
- Institute for Bioinformatics and Medical Informatics (IBMI), University of Tubingen, Tubingen, Germany
| | | | - Snaebjorn Palsson
- Institute of Life and Environmental Sciences, University of Iceland, Reykjavik, Iceland
| | - Jerome P Panibe
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Joana Pauperio
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | | | - Emilie Payen
- Genoscope, Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
| | | | - Jaume Pellicer
- Institut Botànic de Barcelona, IBB (CSIC-CMCNB), Passeig del Migdia s.n., Parc de Montjüic, Barcelona, Spain
| | - Graziano Pesole
- University of Bari Aldo Moro, Department of Biosciences, Biotechnology and Environment; Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, Consiglio Nazionale delle Ricerche, Bari, Italy
| | - Joao Pimenta
- CIBIO, Centro de Investigacao em Biodiversidade e Recursos Geneticos, InBIO Laboratorio Associado, Universidade do Porto, Vairao, Portugal
- Wellcome Sanger Institute, Cambridge, UK
| | - Martin Pippel
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | | | - Nikos Poulakakis
- Department of Biology, School of Sciences and Engineering, University of Crete, Voutes University Campus, Irakleio, Greece
- Natural History Museum of Crete, School of Sciences and Engineering, University of Crete, Irakleio, Greece
| | - Jeena Rajan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Rúben M C Rego
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Pólo dos Açores; Faculdade de Ciências e Tecnologia, Universidade dos Açores, Ponta Delgada, Portugal
- UNESCO, Chair Land Within Sea Biodiversity & Sustainability in Atlantic Islands, Portugal
| | - Roberto Resendes
- Universidade dos Acores, Departamento de Biologia, Ponta Delgada, Portugal
| | - Philipp Resl
- Department of Biology, University of Graz, Graz, Austria
| | - Ana Riesgo
- Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales, Madrid, Spain
| | | | - Andre E R Soares
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Carlos Rodriguez Fernandes
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- Faculdade de Psicologia, Universidade de Lisboa, Lisboa, Portugal
| | - Maria M Romeiras
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- Linking Landscape, Environment, Agriculture and Food, Associated Laboratory TERRA, Instituto Superior de Agronomia, Universidade de Lisboa, Lisboa, Portugal
- Portugal Centre for Ecology, Evolution and Environmental Changes, Lisbon, Portugal
| | - Guilherme Roxo
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Pólo dos Açores; Faculdade de Ciências e Tecnologia, Universidade dos Açores, Ponta Delgada, Portugal
- UNESCO, Chair Land Within Sea Biodiversity & Sustainability in Atlantic Islands, Portugal
| | - Lukas Rüber
- Aquatic Ecology & Evolution, Institute of Ecology & Evolution, University of Bern, Bern, Switzerland
- Naturhistorisches Museum Bern, Bern, Switzerland
| | - Maria Jose Ruiz-Lopez
- Departamento de Biología de la Conservación y Cambio Global, Estación Biológica de Doñana (EBD), CSIC, Sevilla, Spain
- CIBER of Epidemiology and Public Health, Granada, Spain
| | - Urmas Saarma
- Department of Zoology, Institute of Ecology and Earth Sciences, University of Tartu, Tartu, Estonia
| | - Luis P da Silva
- CIBIO, Centro de Investigacao em Biodiversidade e Recursos Geneticos, InBIO Laboratorio Associado, Universidade do Porto, Vairao, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairao, Vairao, Portugal
| | - Manuela Sim-Sim
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- Museu Nacional de História Natural e da Ciência, Lisboa, Portugal
- Departamento de Biologia Vegetal, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
| | - Lucile Soler
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Vitor C Sousa
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Lisboa, Portugal
| | - Carla Sousa Santos
- MARE Marine and Environmental Sciences Centre, ARNET Aquatic Research Network, Lisboa, Portugal
| | - Alberto Spada
- Department of Agricultural and Environmental Sciences Production, Landscape, Agroenergy, University of Milan, Milan, Italy
| | - Milomir Stefanovic
- Department of Biology and Ecology, University of Novi Sad, Novi Sad, Serbia
| | - Viktor Steger
- Department of Genetics and Genomics, Institute of Genetics and Biotechnology, Hungarian University of Agriculture and Life Sciences, Godollo, Hungary
| | - Josefin Stiller
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Matthias Stöck
- Department IV Fish Biology, Fisheries and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries, Berlin, Germany
| | - Torsten H Struck
- Natural History Museum, University of Oslo, Blindern, Oslo, Norway
| | - Hiranya Sudasinghe
- Naturhistorisches Museum Bern, Bern, Switzerland
- Division of Evolutionary Ecology, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | | | - Christian Tellgren-Roth
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Helena Trindade
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- Departamento de Biologia Vegetal, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
| | - Yevhen Tukalenko
- Institute for Nuclear Research of the NAS of Ukraine, Kyiv, Ukraine
| | - Ilenia Urso
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, Consiglio Nazionale delle Ricerche, Bari, Italy
| | - Benoit Vacherie
- Genoscope, Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
| | - Steven M Van Belleghem
- Ecology, Evolution and Conservation Biology, Department of Biology, KU Leuven, Leuven, Belgium
| | - Kees Van Oers
- Department of Animal Ecology, Netherlands Institute of Ecology, Wageningen, The Netherlands
| | - Carlos Vargas-Chavez
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Nevena Velickovic
- Department of Biology and Ecology, University of Novi Sad, Novi Sad, Serbia
| | - Noel Vella
- Conservation Biology Research Group, Department of Biology, University of Malta, Msida, Malta
| | - Adriana Vella
- Conservation Biology Research Group, Department of Biology, University of Malta, Msida, Malta
| | - Cristiano Vernesi
- Forest Ecology Unit, Research and Innovation Centre-Fondazione Edmund Mach, San Michele All'Adige, Italy
| | - Sara Vicente
- CE3C-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, Portugal
- ERISA Escola Superior de Saúde Ribeiro Sanches, IPLUSO, Lisboa, Portugal
| | - Sara Villa
- Institute for Sustainable Plant Protection, National Research Council, Sesto Fiorentino, Italy
- Department of Agricultural and Environmental Sciences, University of Milan via Giovanni Celoria 2, Milan, Italy
| | - Olga Vinnere Pettersson
- SciLifeLab, Solna, Sweden
- Uppsala University, Uppsala, Sweden
- National Bioinformatics Infrastructure Sweden, Uppsala, Sweden
| | - Filip A M Volckaert
- Laboratory of Biodiversity and Evolutionary Genomics, KU Leuven, Leuven, Belgium
| | - Judit Voros
- Department of Zoology, Hungarian Natural History Museum, Budapest, Hungary
| | - Patrick Wincker
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | - Sylke Winkler
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Claudio Ciofi
- Department of Biology, University of Florence, Sesto Fiorentino, Italy
| | - Robert M Waterhouse
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Camila J Mazzoni
- Leibniz Institut für Zoo und Wildtierforschung, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
| |
Collapse
|
42
|
Heckenhauer J, Plotkin D, Martinez JI, Bethin J, Pauls SU, Frandsen PB, Kawahara AY. Genomic resources of aquatic Lepidoptera, Elophila obliteralis and Hyposmocoma kahamanoa, reveal similarities with Trichoptera in amino acid composition of major silk genes. G3 (BETHESDA, MD.) 2024; 14:jkae093. [PMID: 38722626 PMCID: PMC11373647 DOI: 10.1093/g3journal/jkae093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 04/16/2024] [Indexed: 09/06/2024]
Abstract
While most species of butterflies and moths (Lepidoptera) have entirely terrestrial life histories, ∼0.5% of the described species are known to have an aquatic larval stage. Larvae of aquatic Lepidoptera are similar to caddisflies (Trichoptera) in that they use silk to anchor themselves to underwater substrates or to build protective cases. However, the physical properties and genetic elements of silks in aquatic Lepidoptera remain unstudied, as most research on lepidopteran silk has focused on the commercially important silkworm, Bombyx mori. Here, we provide high-quality PacBio HiFi genome assemblies of 2 distantly-related aquatic Lepidoptera species [Elophila obliteralis (Pyraloidea: Crambidae) and Hyposmocoma kahamanoa (Gelechioidea: Cosmopterigidae)]. As a step toward understanding the evolution of underwater silk in aquatic Lepidoptera, we used the genome assemblies and compared them to published genetic data of aquatic and terrestrial Lepidoptera. Sequences of the primary silk protein, h-fibroin, in aquatic moths have conserved termini and share a basic motif structure with terrestrial Lepidoptera. However, these sequences were similar to aquatic Trichoptera in that the percentage of positively and negatively charged amino acids was much higher than in terrestrial Lepidoptera, indicating a possible adaptation of silks to aquatic environments.
Collapse
Affiliation(s)
- Jacqueline Heckenhauer
- Senckenberg Research Institute and Natural History Museum Frankfurt, Terrestrial Zoology, 60325 Frankfurt am Main, Germany
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt am Main, Germany
| | - David Plotkin
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| | - Jose I Martinez
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| | - Jacob Bethin
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| | - Steffen U Pauls
- Senckenberg Research Institute and Natural History Museum Frankfurt, Terrestrial Zoology, 60325 Frankfurt am Main, Germany
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt am Main, Germany
- Institute for Insect Biotechnology, Justus-Liebig-University, 35392 Gießen, Germany
| | - Paul B Frandsen
- Department of Plant and Wildlife Science, Brigham Young University, Provo, UT 84602, USA
- Data Science Lab, Smithsonian Institution, Washington, DC 20560, USA
| | - Akito Y Kawahara
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| |
Collapse
|
43
|
Lewin TD, Liao IJY, Luo YJ. Annelid Comparative Genomics and the Evolution of Massive Lineage-Specific Genome Rearrangement in Bilaterians. Mol Biol Evol 2024; 41:msae172. [PMID: 39141777 PMCID: PMC11371463 DOI: 10.1093/molbev/msae172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 08/05/2024] [Accepted: 08/08/2024] [Indexed: 08/16/2024] Open
Abstract
The organization of genomes into chromosomes is critical for processes such as genetic recombination, environmental adaptation, and speciation. All animals with bilateral symmetry inherited a genome structure from their last common ancestor that has been highly conserved in some taxa but seemingly unconstrained in others. However, the evolutionary forces driving these differences and the processes by which they emerge have remained largely uncharacterized. Here, we analyze genome organization across the phylum Annelida using 23 chromosome-level annelid genomes. We find that while many annelid lineages have maintained the conserved bilaterian genome structure, the Clitellata, a group containing leeches and earthworms, possesses completely scrambled genomes. We develop a rearrangement index to quantify the extent of genome structure evolution and show that, compared to the last common ancestor of bilaterians, leeches and earthworms have among the most highly rearranged genomes of any currently sampled species. We further show that bilaterian genomes can be classified into two distinct categories-high and low rearrangement-largely influenced by the presence or absence, respectively, of chromosome fission events. Our findings demonstrate that animal genome structure can be highly variable within a phylum and reveal that genome rearrangement can occur both in a gradual, stepwise fashion, or rapid, all-encompassing changes over short evolutionary timescales.
Collapse
Affiliation(s)
- Thomas D Lewin
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | | | - Yi-Jyun Luo
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
44
|
Powell A, Heckenhauer J, Pauls SU, Ríos-Touma B, Kuranishi RB, Holzenthal RW, Razuri-Gonzales E, Bybee S, Frandsen PB. Evolution of Opsin Genes in Caddisflies (Insecta: Trichoptera). Genome Biol Evol 2024; 16:evae185. [PMID: 39176990 PMCID: PMC11381090 DOI: 10.1093/gbe/evae185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 08/12/2024] [Accepted: 08/19/2024] [Indexed: 08/24/2024] Open
Abstract
Insects have evolved complex and diverse visual systems in which light-sensing protein molecules called "opsins" couple with a chromophore to form photopigments. Insect photopigments group into three major gene families based on wavelength sensitivity: long wavelength (LW), short wavelength (SW), and ultraviolet wavelength (UV). In this study, we identified 123 opsin sequences from whole-genome assemblies across 25 caddisfly species (Insecta: Trichoptera). We discovered the LW opsins have the most diversity across species and form two separate clades in the opsin gene tree. Conversely, we observed a loss of the SW opsin in half of the trichopteran species in this study, which might be associated with the fact that caddisflies are active during low-light conditions. Lastly, we found a single copy of the UV opsin in all the species in this study, with one exception: Athripsodes cinereus has two copies of the UV opsin and resides within a clade of caddisflies with colorful wing patterns.
Collapse
Affiliation(s)
- Ashlyn Powell
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT, USA
| | - Jacqueline Heckenhauer
- LOEWE Centre for Translational Biodiversity Genomics, Frankfurt, Germany
- Senckenberg Research Institute and Natural History Museum Frankfurt, Frankfurt, Germany
| | - Steffen U Pauls
- LOEWE Centre for Translational Biodiversity Genomics, Frankfurt, Germany
- Senckenberg Research Institute and Natural History Museum Frankfurt, Frankfurt, Germany
| | - Blanca Ríos-Touma
- Facultad de Ingenierías y Ciencias Aplicadas, Ingeniería Ambiental, Grupo de Investigación en Biodiversidad, Medio Ambiente y Salud, Universidad de Las Américas, Quito, Ecuador
| | - Ryoichi B Kuranishi
- Graduate School of Science, Chiba University, Chiba, Japan
- Kanagawa Institute of Technology, Kanagawa, Japan
| | | | | | - Seth Bybee
- Department of Biology, Brigham Young University, Provo, UT, USA
| | - Paul B Frandsen
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT, USA
| |
Collapse
|
45
|
Augustijnen H, Lucek K. Beyond gene flow: (non)-parallelism of secondary contact in a pair of highly differentiated sibling species. Mol Ecol 2024; 33:e17488. [PMID: 39119885 DOI: 10.1111/mec.17488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 07/01/2024] [Accepted: 07/10/2024] [Indexed: 08/10/2024]
Abstract
Replicated secondary contact zones can provide insights into the barriers to gene flow that are important during speciation and can reveal to which degree secondary contact may result in similar evolutionary outcomes. Here, we studied two secondary contact zones between highly differentiated Alpine butterflies of the genus Erebia using whole-genome resequencing data. We assessed the genomic relationships between populations and species and found hybridization to be rare, with no to little current or historical introgression in either contact zone. There are large similarities between contact zones, consistent with an allopatric origin of interspecific differentiation, with no indications for ongoing reinforcing selection. Consistent with expected reduced effective population size, we further find that scaffolds related to the Z-chromosome show increased differentiation compared to the already high levels across the entire genome, which could also hint towards a contribution of the Z chromosome to species divergence in this system. Finally, we detected the presence of the endosymbiont Wolbachia, which can cause reproductive isolation between its hosts, in all E. cassioides, while it appears to be fully or largely absent in contact zone populations of E. tyndarus. We discuss how this rare pattern may have arisen and how it may have affected the dynamics of speciation upon secondary contact.
Collapse
Affiliation(s)
- Hannah Augustijnen
- Department of Environmental Sciences, University of Basel, Basel, Switzerland
| | - Kay Lucek
- Biodiversity Genomics Laboratory, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
46
|
Corominas M, Marquès-Bonet T, Arnedo M, Bayés M, Belmonte J, Escrivà H, Fernández R, Gabaldón T, Garnatje T, Germain J, Niell M, Palero F, Pons J, Puigdomènech P, Arroyo V, Cuevas-Caballé C, Obiol JF, Gut I, Gut M, Hidalgo O, Izquierdo-Arànega G, Pérez-Sorribes L, Righi E, Riutort M, Vallès J, Rozas J, Alioto T, Guigó R. The Catalan initiative for the Earth BioGenome Project: contributing local data to global biodiversity genomics. NAR Genom Bioinform 2024; 6:lqae075. [PMID: 39022326 PMCID: PMC11252852 DOI: 10.1093/nargab/lqae075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 05/10/2024] [Accepted: 06/19/2024] [Indexed: 07/20/2024] Open
Abstract
The Catalan Initiative for the Earth BioGenome Project (CBP) is an EBP-affiliated project network aimed at sequencing the genome of the >40 000 eukaryotic species estimated to live in the Catalan-speaking territories (Catalan Linguistic Area, CLA). These territories represent a biodiversity hotspot. While covering less than 1% of Europe, they are home to about one fourth of all known European eukaryotic species. These include a high proportion of endemisms, many of which are threatened. This trend is likely to get worse as the effects of global change are expected to be particularly severe across the Mediterranean Basin, particularly in freshwater ecosystems and mountain areas. Following the EBP model, the CBP is a networked organization that has been able to engage many scientific and non-scientific partners. In the pilot phase, the genomes of 52 species are being sequenced. As a case study in biodiversity conservation, we highlight the genome of the Balearic shearwater Puffinus mauretanicus, sequenced under the CBP umbrella.
Collapse
Affiliation(s)
- Montserrat Corominas
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Institut de Biomedicina (IBUB), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
| | - Tomàs Marquès-Bonet
- Institute of Evolutionary Biology (IBE, UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), 08010 Barcelona, Spain
- Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Barcelona, Spain
| | - Miquel A Arnedo
- Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
| | - Mònica Bayés
- Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
- Universitat de Barcelona (UB), 08028 Barcelona, Spain
| | - Jordina Belmonte
- Departament de Biologia Animal, Biologia Vegetal i Ecologia, Facultat de Biociències, Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Catalonia, Spain
- Institut de Ciència i Tecnologia Ambientals (ICTA-UAB), Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Catalonia, Spain
| | - Hector Escrivà
- Sorbonne Université, CNRS, Biologie Intégrative des Organismes Marins, BIOM, F-66650, Banyuls-sur-Mer, France
| | - Rosa Fernández
- Institute of Evolutionary Biology (IBE, UPF-CSIC), PRBB, 08003 Barcelona, Spain
| | - Toni Gabaldón
- Catalan Institution of Research and Advanced Studies (ICREA), 08010 Barcelona, Spain
- Barcelona Supercomputing Centre (BSC-CNS), 08034 Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
- CIBER de Enfermedades Infecciosas, Instituto de Salud Carlos III, Madrid, Spain
| | - Teresa Garnatje
- Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Catalonia, Spain
- Jardí Botànic Marimurtra - Fundació Carl Faust, 17300 Blanes, Catalonia, Spain
| | - Josep Germain
- Institució Catalana d’Història Natural, 08001 Barcelona, Catalonia, Spain
| | - Manel Niell
- Andorra Recerca + Innovació (ARI), AD600 Sant Julià de Lòria, Andorra
| | - Ferran Palero
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva (ICBIBE), Paterna, Valencia, Spain
| | - Joan Pons
- Departament de Biodiversitat Animal i Microbiana, Institut Mediterrani d’Estudis Avançats (CSIC-UIB), 07190 Esporles, Illes Balears, Spain
| | - Pere Puigdomènech
- Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
- Centre de Recerca en Agrigenòmica, CSIC/IRTA/UAB/UB, 08193 Bellaterra, Catalonia, Spain
| | - Vanesa Arroyo
- Andorra Recerca + Innovació (ARI), AD600 Sant Julià de Lòria, Andorra
| | - Cristian Cuevas-Caballé
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
| | - Joan Ferrer Obiol
- Department of Environmental Science and Policy, University of Milan, Milan, Italy
| | - Ivo Gut
- Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
- Universitat de Barcelona (UB), 08028 Barcelona, Spain
| | - Marta Gut
- Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
- Universitat de Barcelona (UB), 08028 Barcelona, Spain
| | - Oriane Hidalgo
- CIBER de Enfermedades Infecciosas, Instituto de Salud Carlos III, Madrid, Spain
- Royal Botanic Gardens, Kew, TW9 3DS Richmond, UK
| | - Guillem Izquierdo-Arànega
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
| | - Laia Pérez-Sorribes
- Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Catalonia, Spain
- Estación Biológica de Doñana, CSIC, 41092 Sevilla, Spain
| | - Emilio Righi
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Catalonia, Spain
| | - Marta Riutort
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
| | - Joan Vallès
- Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Laboratori de Botànica (UB), Unitat Associada al CSIC, Facultat de Farmàcia i Ciències de l’Alimentació, Universitat de Barcelona, 08028 Barcelona, Catalonia, Spain
| | - Julio Rozas
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona (UB), 08028 Barcelona, Catalonia, Spain
| | - Tyler Alioto
- Centre Nacional d’Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
- Universitat de Barcelona (UB), 08028 Barcelona, Spain
| | - Roderic Guigó
- Institut d’Estudis Catalans (IEC), 08001 Barcelona, Catalonia, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Catalonia, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Catalonia, Spain
| |
Collapse
|
47
|
Li H, Durbin R. Genome assembly in the telomere-to-telomere era. Nat Rev Genet 2024; 25:658-670. [PMID: 38649458 DOI: 10.1038/s41576-024-00718-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/27/2024] [Indexed: 04/25/2024]
Abstract
Genome sequences largely determine the biology and encode the history of an organism, and de novo assembly - the process of reconstructing the genome sequence of an organism from sequencing reads - has been a central problem in bioinformatics for four decades. Until recently, genomes were typically assembled into fragments of a few megabases at best, but now technological advances in long-read sequencing enable the near-complete assembly of each chromosome - also known as telomere-to-telomere assembly - for many organisms. Here, we review recent progress on assembly algorithms and protocols, with a focus on how to derive near-telomere-to-telomere assemblies. We also discuss the additional developments that will be required to resolve remaining assembly gaps and to assemble non-diploid genomes.
Collapse
Affiliation(s)
- Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| | - Richard Durbin
- Department of Genetics, Cambridge University, Cambridge, UK.
| |
Collapse
|
48
|
Blommaert J, Sandoval-Castillo J, Beheregaray LB, Wellenreuther M. Peering into the gaps: Long-read sequencing illuminates structural variants and genomic evolution in the Australasian snapper. Genomics 2024; 116:110929. [PMID: 39216708 DOI: 10.1016/j.ygeno.2024.110929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2024] [Revised: 08/25/2024] [Accepted: 08/26/2024] [Indexed: 09/04/2024]
Abstract
Even before genome sequencing, genetic resources have supported species management and breeding programs. Current technologies, such as long-read sequencing, resolve complex genomic regions, like those rich in repeats or high in GC content. Improved genome contiguity enhances accuracy in identifying structural variants (SVs) and transposable elements (TEs). We present an improved genome assembly and SV catalogue for the Australasian snapper (Chrysophrys auratus). The new assembly is more contiguous, allowing for putative identification of 14 centromeres and transfer of 26,115 gene annotations from yellowfin seabream. Compared to the previous assembly, 35,000 additional SVs, including larger and more complex rearrangements, were annotated. SVs and TEs exhibit a distribution pattern skewed towards chromosome ends, likely influenced by recombination. Some SVs overlap with growth-related genes, underscoring their significance. This upgraded genome serves as a foundation for studying natural and artificial selection, offers a reference for related species, and sheds light on genome dynamics shaped by evolution.
Collapse
Affiliation(s)
- Julie Blommaert
- The New Zealand Institute for Plant and Food Research, Nelson, New Zealand.
| | - Jonathan Sandoval-Castillo
- Molecular Ecology Laboratory, College of Science and Engineering, Flinders University, Bedford Park, South Australia, Australia
| | - Luciano B Beheregaray
- Molecular Ecology Laboratory, College of Science and Engineering, Flinders University, Bedford Park, South Australia, Australia
| | - Maren Wellenreuther
- The New Zealand Institute for Plant and Food Research, Nelson, New Zealand; School of Biological Sciences, The University of Auckland, Auckland, New Zealand
| |
Collapse
|
49
|
Jeiter J, Smets E. Comparative morphology at a crossroads. AMERICAN JOURNAL OF BOTANY 2024; 111:e16392. [PMID: 39148327 DOI: 10.1002/ajb2.16392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 07/09/2024] [Accepted: 07/09/2024] [Indexed: 08/17/2024]
Abstract
Morphology has been the fundamental and most important source of information in biology. We strongly believe that in the current molecular era of biology, comparative morphology still has an important role to play in understanding life on Earth and ecosystem functioning, bridging the knowledge gap between evolution, systematics, and ecology.
Collapse
Affiliation(s)
- Julius Jeiter
- Chair of Botany, Faculty of Biology, TUD Dresden University of Technology, Dresden, 01062, Germany
- Nees Institute for Biodiversity of Plants, University of Bonn, Meckenheimer Allee 170, Bonn, 53115, Germany
| | - Erik Smets
- Naturalis Biodiversity Center, Darwinweg 2, 2333 CR Leiden, Leiden, The Netherlands
- Institute of Biology Leiden, Leiden University, Sylviusweg 72, Leiden, 2333 BE, The Netherlands
- Ecology, Evolution and Biodiversity Conservation, KU Leuven, Kasteelpark Arenberg 31, Box 2435, Heverlee, 3001, Belgium
| |
Collapse
|
50
|
Khalaf A, Francis O, Blaxter ML. Genome evolution in intracellular parasites: Microsporidia and Apicomplexa. J Eukaryot Microbiol 2024; 71:e13033. [PMID: 38785208 DOI: 10.1111/jeu.13033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 03/29/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024]
Abstract
Microsporidia and Apicomplexa are eukaryotic, single-celled, intracellular parasites with huge public health and economic importance. Typically, these parasites are studied separately, emphasizing their uniqueness and diversity. In this review, we explore the huge amount of genomic data that has recently become available for the two groups. We compare and contrast their genome evolution and discuss how their transitions to intracellular life may have shaped it. In particular, we explore genome reduction and compaction, genome expansion and ploidy, gene shuffling and rearrangements, and the evolution of centromeres and telomeres.
Collapse
Affiliation(s)
- Amjad Khalaf
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | - Ore Francis
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | | |
Collapse
|