1
|
Donthu R, Marcelino JAP, Giordano R, Tao Y, Weber E, Avalos A, Band M, Akraiko T, Chen SC, Reyes MP, Hao H, Ortiz-Alvarado Y, Cuff CA, Claudio EP, Soto-Adames F, Smith-Pardo AH, Meikle WG, Evans JD, Giray T, Abdelkader FB, Allsopp M, Ball D, Morgado SB, Barjadze S, Correa-Benitez A, Chakir A, Báez DR, Chavez NHM, Dalmon A, Douglas AB, Fraccica C, Fernández-Marín H, Galindo-Cardona A, Guzman-Novoa E, Horsburgh R, Kence M, Kilonzo J, Kükrer M, Le Conte Y, Mazzeo G, Mota F, Muli E, Oskay D, Ruiz-Martínez JA, Oliveri E, Pichkhaia I, Romane A, Sanchez CG, Sikombwa E, Satta A, Scannapieco AA, Stanford B, Soroker V, Velarde RA, Vercelli M, Huang Z. HBeeID: a molecular tool that identifies honey bee subspecies from different geographic populations. BMC Bioinformatics 2024; 25:278. [PMID: 39192185 DOI: 10.1186/s12859-024-05776-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 04/10/2024] [Indexed: 08/29/2024] Open
Abstract
BACKGROUND Honey bees are the principal commercial pollinators. Along with other arthropods, they are increasingly under threat from anthropogenic factors such as the incursion of invasive honey bee subspecies, pathogens and parasites. Better tools are needed to identify bee subspecies. Genomic data for economic and ecologically important organisms is increasing, but in its basic form its practical application to address ecological problems is limited. RESULTS We introduce HBeeID a means to identify honey bees. The tool utilizes a knowledge-based network and diagnostic SNPs identified by discriminant analysis of principle components and hierarchical agglomerative clustering. Tests of HBeeID showed that it identifies African, Americas-Africanized, Asian, and European honey bees with a high degree of certainty even when samples lack the full 272 SNPs of HBeeID. Its prediction capacity decreases with highly admixed samples. CONCLUSION HBeeID is a high-resolution genomic, SNP based tool, that can be used to identify honey bees and screen species that are invasive. Its flexible design allows for future improvements via sample data additions from other localities.
Collapse
Affiliation(s)
- Ravikiran Donthu
- Puerto Rico Science, Technology and Research Trust, San Juan, PR, 00927, USA
- Centre for Life Sciences, Mahindra University, Bahadurpally, Hyderabad, 500043, India
| | - Jose A P Marcelino
- Puerto Rico Science, Technology and Research Trust, San Juan, PR, 00927, USA
- Florida Department of Agriculture and Consumer Services, Division of Plant Industry, Gainesville, FL, 32608, USA
| | - Rosanna Giordano
- Puerto Rico Science, Technology and Research Trust, San Juan, PR, 00927, USA.
- Institute of Environment, Florida International University, Miami, FL, 33199, USA.
| | - Yudong Tao
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, 33146, USA
| | - Everett Weber
- Office of Institutional Research, Dartmouth College, Hanover, NH, 03755, USA
| | - Arian Avalos
- USDA-ARS, Honey Bee Breeding, Genetics and Physiology Research, Baton Rouge, LA, 70820, USA
| | - Mark Band
- Roy J. Carver Biotechnology Center, University of Illinois, Urbana-Champaign, IL, 61801, USA
| | - Tatsiana Akraiko
- Roy J. Carver Biotechnology Center, University of Illinois, Urbana-Champaign, IL, 61801, USA
| | - Shu-Ching Chen
- Data Science and Analytics Innovation Center (dSAIC), University of Missouri-Kansas City, Kansas City, MO, 64110, USA
| | - Maria P Reyes
- Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, 33199, USA
| | - Haiping Hao
- Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | | | - Charles A Cuff
- Department of Biology, University of Puerto Rico, San Juan, PR, 00931, USA
| | - Eddie Pérez Claudio
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15206, USA
| | - Felipe Soto-Adames
- Florida Department of Agriculture and Consumer Services, Division of Plant Industry, Gainesville, FL, 32608, USA
| | | | - William G Meikle
- USDA-ARS, Carl Hayden Bee Research Center, Tucson, AZ, 85719, USA
| | - Jay D Evans
- USDA-ARS, Bee Research Laboratory, Beltsville, MD, 20705, USA.
| | - Tugrul Giray
- Department of Biology, University of Puerto Rico, San Juan, PR, 00931, USA.
| | - Faten B Abdelkader
- University of Carthage, National Agronomic Institute of Tunisia, 1082, Tunis, Tunisia
| | - Mike Allsopp
- Honey Bee Research Section, ARC-Plant Protection & Health, P/Bag X5017, Stellenbosch, 7599, South Africa
| | | | - Susana B Morgado
- Meltagus, Associação de Apicultores do Parque Natural do Tejo Internacional, 6000-790, Castelo Branco, Portugal
| | - Shalva Barjadze
- Institute of Zoology, Ilia State University, 3 Giorgi Tsereteli Street, 0162, Tbilisi, Georgia
| | - Adriana Correa-Benitez
- Facultad de MedicinaVeterinaria y Zootecnia, Departamento de Medicina y Zootecnia de Abejas, Conejos y Organismos Aquáticos (DMZ:ACyOA), Universidad Nacional Autónoma de México, 04510, Ciudad de Mexico, CP, Mexico
| | - Amina Chakir
- Applied Chemistry Laboratory, Semlalia Faculty of Sciences, University Cadi Ayyad, Marrakech, Morocco
| | | | - Nabor H M Chavez
- Cochabamba Beekeepers Federation (FEDAC), Aniceto Padilla, 493, Cochabamba, Bolivia
| | - Anne Dalmon
- INRAE, French National Research Institute for Agriculture, Food and Environment. UR Abeilles et Environment, 84914, Avignon, France
| | - Adrian B Douglas
- Institute of Earth Systems, Rural Sciences Farmhouse, University of Malta, Msida, 2080, MSD, Malta
| | - Carmen Fraccica
- Florida Department of Agriculture and Consumer Services, Division of Plant Industry, Gainesville, FL, 32608, USA
| | - Hermógenes Fernández-Marín
- Centro de Biodiversidad y Descubrimiento de Drogas, Instituto de Investigaciones Científicas y Servicios de Alta Tecnología (INDICASAT AIP), Clayton Panama, 0843-01103, Panama
| | - Alberto Galindo-Cardona
- Instituto de Ecología Regional (IER), Universidad Nacional de Tucumán (UNT) - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET). Yerba Buena, CC 34, CP 4107, Tucumán, Argentina
| | - Ernesto Guzman-Novoa
- School of Environmental Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| | - Robert Horsburgh
- Florida Department of Agriculture and Consumer Services, Division of Plant Industry, Gainesville, FL, 32608, USA
| | - Meral Kence
- Biology Department, Middle East Technical University, 06530, Ankara, Turkey
| | - Joseph Kilonzo
- International Centre of Insect Physiology and Ecology, Nairobi, Kenya
| | - Mert Kükrer
- Biology Department, Middle East Technical University, 06530, Ankara, Turkey
- Molecular Biology and Genetics Department, Kilis 7 Aralık University, Kilis, Turkey
| | - Yves Le Conte
- INRAE, French National Research Institute for Agriculture, Food and Environment. UR Abeilles et Environment, 84914, Avignon, France
| | - Gaetana Mazzeo
- Dipartimento di Agricoltura, Alimentazione e Ambiente (Di3A), Università Degli Studi Di Catania, Catania, Italy
| | - Fernando Mota
- Independent Beekeeper, 6000, Castelo Branco, Portugal
| | - Elliud Muli
- International Centre of Insect Physiology and Ecology, Nairobi, Kenya
- South Eastern Kenya University (SEKU), JXFW+X3C, Kitui, Kenya
| | - Devrim Oskay
- Department of Agricultural Biotechnology, Tekirdağ Namık Kemal University, 59030, Tekirdağ, Turkey
| | - José A Ruiz-Martínez
- Professional Training in Livestock and Animal Health, High School Lope de Vega, Fuente Obejuna, Córdoba, Spain
| | - Eugenia Oliveri
- Istituto Zooprofilattico Sperimentale della Sicilia, 90129, Palermo, Italy
| | - Igor Pichkhaia
- Chkhorotsku Local Historical Museum, David Aghmashenebeli St., 5000, Chkhorotsku, Georgia
| | - Abderrahmane Romane
- Applied Chemistry Laboratory, Semlalia Faculty of Sciences, University Cadi Ayyad, Marrakech, Morocco
| | - Cesar Guillen Sanchez
- Escuela de Agronomía, Sede del Atlántico, University of Costa Rica, Turrialba, 30501, Costa Rica
| | | | - Alberto Satta
- Department of Agricultural Sciences, University of Sassari, Viale Italia 39A, 07100, Sassari, Italy
| | | | - Brandi Stanford
- Florida Department of Agriculture and Consumer Services, Division of Plant Industry, Gainesville, FL, 32608, USA
| | - Victoria Soroker
- Agricultural Research Organization, The Volcani Center, Institute of Plant Protection, Department of Entomology, Bet-Dagan, Israel
| | - Rodrigo A Velarde
- Bolivian Apiculture Institute (IAB), PROMIEL-SEDEM, Jaimes Freyre No 2344, La Paz, Bolivia
| | | | - Zachary Huang
- Department of Entomology, MSU Apiculture Lab, Michigan State University, East Lansing, MI, 48824, USA
| |
Collapse
|
2
|
Bharti J, Verma R, Gupta I, Chakraborty P, Eashwaran M, Sony SK, Nehra M, Thangraj A, Kaul R, Fathy K, Kaul T. Functional characterization of novel mutations in the conserved region of EPSPS for herbicide resistance in pigeonpea: structure-based coherent design. J Biomol Struct Dyn 2024; 42:6065-6080. [PMID: 37652402 DOI: 10.1080/07391102.2023.2243522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 06/21/2023] [Indexed: 09/02/2023]
Abstract
Prospectively, agroecosystems for the growth of crops provide the potential fertile, productive, and tropical environment which attracts infestation by weedy plant species that compete with the primary crop plants. Infestation by weed is a major biotic stress factor faced by pigeonpea that hampers the productivity of the crop. In the modern era with the development of chemicals the problem of weed infestation is dealt with armours called herbicides. The most widely utilized, post-emergent, broad-spectrum herbicide has an essential active ingredient called glyphosate. Glyphosate mechanistically inhibits a chloroplastic enzyme 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) by competitively interacting with the PEP binding site which hinders the shikimate pathway and the production of essential aromatic amino acids (Phe, Tyr, Trp) and other secondary metabolites in plants. Moreover, herbicide spray for weed management is lethal to both the primary crop and the weeds. Therefore, it is critical to develop herbicide-resistant crops for field purposes to reduce the associated yield and economic losses. In this study, the in-silico analysis drove the selection and validation of the point mutations in the conserved region of the EPSPS gene, which confers efficient herbicide resistance to mutated-CcEPSPS enzyme along with the retention of the normal enzyme function. An optimized in-silico validation of the target mutation before the development of the genome-edited resistant plant lines is a prerequisite for testing their efficacy as a proof of concept. We validated the combination of GATIPS mutation for its no-cost effect at the enzyme level via molecular dynamic (MD) simulation.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Jyotsna Bharti
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Rachana Verma
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Isha Gupta
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Puja Chakraborty
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Murugesh Eashwaran
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Sonia Khan Sony
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Mamta Nehra
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Arulprakash Thangraj
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Rashmi Kaul
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Khaled Fathy
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| | - Tanushri Kaul
- Nutritional Improvement of Crops Group, Plant Biology & Biotechnology, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
| |
Collapse
|
3
|
Jia H, Tan S, Cai Y, Guo Y, Shen J, Zhang Y, Ma H, Zhang Q, Chen J, Qiao G, Ruan J, Zhang YE. Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes. Nat Commun 2024; 15:5644. [PMID: 38969648 PMCID: PMC11226609 DOI: 10.1038/s41467-024-49992-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 06/20/2024] [Indexed: 07/07/2024] Open
Abstract
Long-read sequencing, exemplified by PacBio, revolutionizes genomics, overcoming challenges like repetitive sequences. However, the high DNA requirement ( > 1 µg) is prohibitive for small organisms. We develop a low-input (100 ng), low-cost, and amplification-free library-generation method for PacBio sequencing (LILAP) using Tn5-based tagmentation and DNA circularization within one tube. We test LILAP with two Drosophila melanogaster individuals, and generate near-complete genomes, surpassing preexisting single-fly genomes. By analyzing variations in these two genomes, we characterize mutational processes: complex transpositions (transposon insertions together with extra duplications and/or deletions) prefer regions characterized by non-B DNA structures, and gene conversion of transposons occurs on both DNA and RNA levels. Concurrently, we generate two complete assemblies for the endosymbiotic bacterium Wolbachia in these flies and similarly detect transposon conversion. Thus, LILAP promises a broad PacBio sequencing adoption for not only mutational studies of flies and their symbionts but also explorations of other small organisms or precious samples.
Collapse
Affiliation(s)
- Hangxing Jia
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Yingao Cai
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yanyan Guo
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jieyu Shen
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Qingzhu Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinfeng Chen
- University of Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Gexia Qiao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
4
|
Oliveira HFM, Freire-Jr GB, Silva DC, Mata VA, Abra FD, Camargo NFD, Araujo Goebel LG, Longo GR, Silva JM, Colli GR, Domingos FMCB. Barcoding Brazilian mammals to monitor biological diversity and threats: Trends, perspectives, and knowledge gaps. ENVIRONMENTAL RESEARCH 2024; 258:119374. [PMID: 38885824 DOI: 10.1016/j.envres.2024.119374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 05/11/2024] [Accepted: 06/05/2024] [Indexed: 06/20/2024]
Abstract
DNA barcoding and environmental DNA (eDNA) represent significant advances for biomonitoring the world's biodiversity and its threats. However, these methods are highly dependent on the presence of species sequences on molecular databases. Brazil is one of the world's largest and most biologically diverse countries. However, many knowledge gaps still exist for describing, identifying, and monitoring of mammalian biodiversity using molecular methods. We aimed to unravel the patterns of the presence of Brazilian mammal species on molecular databases to improve our understanding of how effectively it would be to monitor them using DNA barcoding and environmental DNA, and contribute to mammalian conservation. We foundt many gaps in molecular databases, with many taxa being poorly represented, particularly from Amazonia, the order Lagomorpha, and arboreal, gomivorous, near extinct, and illegally traded species. Moreover, our analyses revealed that species description year was the most important factor determining the probability of a species to being sequenced. Primates are the group with the highest number of species considered a priority for sequencing due to their high level of combined threats. We highlight where investments are needed to fill knowledge gaps and increase the representativity of species on molecular databases to enable a better monitoring ability of Brazilian mammals encompassing different traits using DNA barcoding and environmental DNA.
Collapse
Affiliation(s)
- Hernani Fernandes Magalhães Oliveira
- Departamento de Zoologia, Universidade Federal do Paraná - UFPR, Curitiba, Brazil; Departamento de Zoologia, Universidade de Brasília - UnB, Brasília, Brazil; Knowledge Center for Biodiversity, Belo Horizonte, MG, 31270-901, Brazil.
| | - Geraldo Brito Freire-Jr
- Departamento de Ecologia, Universidade de Brasília - UnB, Brasília, Brazil; Department of Biology, University of Nevada - UNR, Reno, USA
| | - Daiana Cardoso Silva
- Programa de Pós-graduação em Ecologia e Conservação, Universidade do Estado de Mato Grosso - UNEMAT, Nova Xavantina, Brazil
| | - Vanessa Alves Mata
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Vairão, Vila do Conde, Portugal; BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Vila do Conde, Portugal
| | - Fernanda Delborgo Abra
- Smithsonian National Zoo and Conservation Biology Institute-Center for Conservation and Sustainability, Washington, DC, USA; ViaFAUNA Estudos Ambientais, São Paulo, SP, Brazil; Instituto Pró-Carnívoros, Atibaia, SP, Brazil
| | | | - L G Araujo Goebel
- Programa de Pós-graduação em Ciências Ambientais, Universidade do Estado de Mato Grosso - UNEMAT, Cáceres, Brazil
| | - Gabriela Rodrigues Longo
- Programa de Pós-graduação em Ensino de Ciências, Universidade Federal de Mato Grosso do Sul - UFMS, Campo Grande, Brazil
| | - Joaquim Manoel Silva
- Programa de Pós-graduação em Ecologia e Conservação, Universidade do Estado de Mato Grosso - UNEMAT, Nova Xavantina, Brazil
| | | | | |
Collapse
|
5
|
Ilík V, Schwarz EM, Nosková E, Pafčo B. Hookworm genomics: dusk or dawn? Trends Parasitol 2024; 40:452-465. [PMID: 38677925 DOI: 10.1016/j.pt.2024.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 03/28/2024] [Accepted: 04/04/2024] [Indexed: 04/29/2024]
Abstract
Hookworms are parasites, closely related to the model nematode Caenorhabditis elegans, that are a major economic and health burden worldwide. Primarily three hookworm species (Necator americanus, Ancylostoma duodenale, and Ancylostoma ceylanicum) infect humans. Another 100 hookworm species from 19 genera infect primates, ruminants, and carnivores. Genetic data exist for only seven of these species. Genome sequences are available from only four of these species in two genera, leaving 96 others (particularly those parasitizing wildlife) without any genomic data. The most recent hookworm genomes were published 5 years ago, leaving the field in a dusk. However, assembling genomes from single hookworms may bring a new dawn. Here we summarize advances, challenges, and opportunities for studying these neglected but important parasitic nematodes.
Collapse
Affiliation(s)
- Vladislav Ilík
- Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic; Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno, Czech Republic.
| | - Erich M Schwarz
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Eva Nosková
- Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic; Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Barbora Pafčo
- Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic.
| |
Collapse
|
6
|
Bernal-Gallardo JJ, de Folter S. Plant genome information facilitates plant functional genomics. PLANTA 2024; 259:117. [PMID: 38592421 PMCID: PMC11004055 DOI: 10.1007/s00425-024-04397-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 03/20/2024] [Indexed: 04/10/2024]
Abstract
MAIN CONCLUSION In this review, we give an overview of plant sequencing efforts and how this impacts plant functional genomics research. Plant genome sequence information greatly facilitates the studies of plant biology, functional genomics, evolution of genomes and genes, domestication processes, phylogenetic relationships, among many others. More than two decades of sequencing efforts have boosted the number of available sequenced plant genomes. The first plant genome, of Arabidopsis, was published in the year 2000 and currently, 4604 plant genomes from 1482 plant species have been published. Various large sequence initiatives are running, which are planning to produce tens of thousands of sequenced plant genomes in the near future. In this review, we give an overview on the status of sequenced plant genomes and on the use of genome information in different research areas.
Collapse
Affiliation(s)
- Judith Jazmin Bernal-Gallardo
- Unidad de Genómica Avanzada (UGA-Langebio), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Irapuato, Mexico
| | - Stefan de Folter
- Unidad de Genómica Avanzada (UGA-Langebio), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Irapuato, Mexico.
| |
Collapse
|
7
|
Karollus A, Hingerl J, Gankin D, Grosshauser M, Klemon K, Gagneur J. Species-aware DNA language models capture regulatory elements and their evolution. Genome Biol 2024; 25:83. [PMID: 38566111 PMCID: PMC10985990 DOI: 10.1186/s13059-024-03221-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 03/20/2024] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND The rise of large-scale multi-species genome sequencing projects promises to shed new light on how genomes encode gene regulatory instructions. To this end, new algorithms are needed that can leverage conservation to capture regulatory elements while accounting for their evolution. RESULTS Here, we introduce species-aware DNA language models, which we trained on more than 800 species spanning over 500 million years of evolution. Investigating their ability to predict masked nucleotides from context, we show that DNA language models distinguish transcription factor and RNA-binding protein motifs from background non-coding sequence. Owing to their flexibility, DNA language models capture conserved regulatory elements over much further evolutionary distances than sequence alignment would allow. Remarkably, DNA language models reconstruct motif instances bound in vivo better than unbound ones and account for the evolution of motif sequences and their positional constraints, showing that these models capture functional high-order sequence and evolutionary context. We further show that species-aware training yields improved sequence representations for endogenous and MPRA-based gene expression prediction, as well as motif discovery. CONCLUSIONS Collectively, these results demonstrate that species-aware DNA language models are a powerful, flexible, and scalable tool to integrate information from large compendia of highly diverged genomes.
Collapse
Affiliation(s)
- Alexander Karollus
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
- Munich Center for Machine Learning, Munich, Germany
| | - Johannes Hingerl
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Dennis Gankin
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Martin Grosshauser
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Kristian Klemon
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Julien Gagneur
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany.
- Munich Center for Machine Learning, Munich, Germany.
- Institute of Human Genetics, School of Medicine and Health, Technical University of Munich, Munich, Germany.
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany.
- Munich Data Science Institute, Technical University of Munich, Garching, Germany.
| |
Collapse
|
8
|
Wright CJ, Stevens L, Mackintosh A, Lawniczak M, Blaxter M. Comparative genomics reveals the dynamics of chromosome evolution in Lepidoptera. Nat Ecol Evol 2024; 8:777-790. [PMID: 38383850 PMCID: PMC11009112 DOI: 10.1038/s41559-024-02329-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 01/12/2024] [Indexed: 02/23/2024]
Abstract
Chromosomes are a central unit of genome organization. One-tenth of all described species on Earth are butterflies and moths, the Lepidoptera, which generally possess 31 chromosomes. However, some species display dramatic variation in chromosome number. Here we analyse 210 chromosomally complete lepidopteran genomes and show that the chromosomes of extant lepidopterans are derived from 32 ancestral linkage groups, which we term Merian elements. Merian elements have remained largely intact through 250 million years of evolution and diversification. Against this stable background, eight lineages have undergone extensive reorganization either through numerous fissions or a combination of fusion and fission events. Outside these lineages, fusions are rare and fissions are rarer still. Fusions often involve small, repeat-rich Merian elements and the sex-linked element. Our results reveal the constraints on genome architecture in Lepidoptera and provide a deeper understanding of chromosomal rearrangements in eukaryotic genome evolution.
Collapse
Affiliation(s)
| | - Lewis Stevens
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | | | | | - Mark Blaxter
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK.
| |
Collapse
|
9
|
Nagy NA, Tóth GE, Kurucz K, Kemenesi G, Laczkó L. The updated genome of the Hungarian population of Aedes koreicus. Sci Rep 2024; 14:7545. [PMID: 38555322 PMCID: PMC10981705 DOI: 10.1038/s41598-024-58096-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 03/25/2024] [Indexed: 04/02/2024] Open
Abstract
Vector-borne diseases pose a potential risk to human and animal welfare, and understanding their spread requires genomic resources. The mosquito Aedes koreicus is an emerging vector that has been introduced into Europe more than 15 years ago but only a low quality, fragmented genome was available. In this study, we carried out additional sequencing and assembled and characterized the genome of the species to provide a background for understanding its evolution and biology. The updated genome was 1.1 Gbp long and consisted of 6099 contigs with an N50 value of 329,610 bp and a BUSCO score of 84%. We identified 22,580 genes that could be functionally annotated and paid particular attention to the identification of potential insecticide resistance genes. The assessment of the orthology of the genes indicates a high turnover at the terminal branches of the species tree of mosquitoes with complete genomes, which could contribute to the adaptation and evolutionary success of the species. These results could form the basis for numerous downstream analyzes to develop targets for the control of mosquito populations.
Collapse
Affiliation(s)
- Nikoletta Andrea Nagy
- Department of Evolutionary Zoology and Human Biology, University of Debrecen, Debrecen, Hungary.
- HUN-REN-UD Behavioural Ecology Research Group, University of Debrecen, Debrecen, Hungary.
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary.
| | - Gábor Endre Tóth
- National Laboratory of Virology, Szentágothai Research Centre, University of Pécs, Pecs, Hungary
- Bernhard Nocht Institute for Tropical Medicine, WHO Collaborating Centre for Arbovirus and Hemorrhagic Fever Reference and Research, Hamburg, Germany
| | - Kornélia Kurucz
- National Laboratory of Virology, Szentágothai Research Centre, University of Pécs, Pecs, Hungary
- Institute of Biology, Faculty of Sciences, University of Pécs, Pecs, Hungary
| | - Gábor Kemenesi
- National Laboratory of Virology, Szentágothai Research Centre, University of Pécs, Pecs, Hungary
- Institute of Biology, Faculty of Sciences, University of Pécs, Pecs, Hungary
| | - Levente Laczkó
- HUN-REN-UD Conservation Biology Research Group, University of Debrecen, Debrecen, Hungary
- One Health Institute, University of Debrecen, Debrecen, Hungary
| |
Collapse
|
10
|
Zhuang J, Zhang Y, Zhou C, Fan D, Huang T, Feng Q, Lu Y, Zhao Y, Zhao Q, Han B, Lu T. Dynamics of extrachromosomal circular DNA in rice. Nat Commun 2024; 15:2413. [PMID: 38499575 PMCID: PMC10948907 DOI: 10.1038/s41467-024-46691-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 03/06/2024] [Indexed: 03/20/2024] Open
Abstract
The genome's dynamic nature, exemplified by elements like extrachromosomal circular DNA (eccDNA), is crucial for biodiversity and adaptation. Yet, the role of eccDNA in plants, particularly rice, remains underexplored. Here, we identify 25,598 eccDNAs, unveiling the widespread presence of eccDNA across six rice tissues and revealing its formation as a universal and random process. Interestingly, we discover that direct repeats play a pivotal role in eccDNA formation, pointing to a unique origin mechanism. Despite eccDNA's prevalence in coding sequences, its impact on gene expression is minimal, implying its roles beyond gene regulation. We also observe the association between eccDNA's formation and minor chromosomal deletions, providing insights of its possible function in regulating genome stability. Further, we discover eccDNA specifically accumulated in rice leaves, which may be associated with DNA damage caused by environmental stressors like intense light. In summary, our research advances understanding of eccDNA's role in the genomic architecture and offers valuable insights for rice cultivation and breeding.
Collapse
Affiliation(s)
- Jundong Zhuang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
- University of Chinese Academy of Sciences, Beijing, 100039, China
| | - Yaoxin Zhang
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
- University of Chinese Academy of Sciences, Beijing, 100039, China
| | - Congcong Zhou
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Danlin Fan
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Tao Huang
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Qi Feng
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Yiqi Lu
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Yan Zhao
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Qiang Zhao
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Bin Han
- National Center for Gene Research, National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China.
| | - Tingting Lu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| |
Collapse
|
11
|
Bass C, Hayward A, Troczka BJ, Haas J, Nauen R. The molecular determinants of pesticide sensitivity in bee pollinators. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 915:170174. [PMID: 38246392 DOI: 10.1016/j.scitotenv.2024.170174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/09/2024] [Accepted: 01/13/2024] [Indexed: 01/23/2024]
Abstract
Bees carry out vital ecosystem services by pollinating both wild and economically important crop plants. However, while performing this function, bee pollinators may encounter potentially harmful xenobiotics in the environment such as pesticides (fungicides, herbicides and insecticides). Understanding the key factors that influence the toxicological outcomes of bee exposure to these chemicals, in isolation or combination, is essential to safeguard their health and the ecosystem services they provide. In this regard, recent work using toxicogenomic and phylogenetic approaches has begun to identify, at the molecular level, key determinants of pesticide sensitivity in bee pollinators. These include detoxification systems that convert pesticides to less toxic forms and key residues in insecticide target-sites that underlie species-specific insecticide selectivity. Here we review this emerging body of research and summarise the state of knowledge of the molecular determinants of pesticide sensitivity in bee pollinators. We identify gaps in our knowledge for future research and examine how an understanding of the genetic basis of bee sensitivity to pesticides can be leveraged to, a) predict and avoid negative bee-pesticide interactions and facilitate the future development of pest-selective bee-safe insecticides, and b) inform traditional effect assessment approaches in bee pesticide risk assessment and address issues of ecotoxicological concern.
Collapse
Affiliation(s)
- Chris Bass
- Centre for Ecology and Conservation, University of Exeter, Penryn, Cornwall, United Kingdom.
| | - Angela Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn, Cornwall, United Kingdom
| | - Bartlomiej J Troczka
- Centre for Ecology and Conservation, University of Exeter, Penryn, Cornwall, United Kingdom
| | - Julian Haas
- Bayer AG, Crop Science Division, Alfred Nobel-Strasse 50, Monheim, Germany
| | - Ralf Nauen
- Bayer AG, Crop Science Division, Alfred Nobel-Strasse 50, Monheim, Germany.
| |
Collapse
|
12
|
OMArk, a tool for gene annotation quality control, reveals erroneous gene inference. Nat Biotechnol 2024:10.1038/s41587-024-02155-w. [PMID: 38383604 DOI: 10.1038/s41587-024-02155-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
|
13
|
Nevers Y, Warwick Vesztrocy A, Rossier V, Train CM, Altenhoff A, Dessimoz C, Glover NM. Quality assessment of gene repertoire annotations with OMArk. Nat Biotechnol 2024:10.1038/s41587-024-02147-w. [PMID: 38383603 DOI: 10.1038/s41587-024-02147-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 01/17/2024] [Indexed: 02/23/2024]
Abstract
In the era of biodiversity genomics, it is crucial to ensure that annotations of protein-coding gene repertoires are accurate. State-of-the-art tools to assess genome annotations measure the completeness of a gene repertoire but are blind to other errors, such as gene overprediction or contamination. We introduce OMArk, a software package that relies on fast, alignment-free sequence comparisons between a query proteome and precomputed gene families across the tree of life. OMArk assesses not only the completeness but also the consistency of the gene repertoire as a whole relative to closely related species and reports likely contamination events. Analysis of 1,805 UniProt Eukaryotic Reference Proteomes with OMArk demonstrated strong evidence of contamination in 73 proteomes and identified error propagation in avian gene annotation resulting from the use of a fragmented zebra finch proteome as a reference. This study illustrates the importance of comparing and prioritizing proteomes based on their quality measures.
Collapse
Affiliation(s)
- Yannis Nevers
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| | - Alex Warwick Vesztrocy
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Victor Rossier
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Clément-Marie Train
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Adrian Altenhoff
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| | - Christophe Dessimoz
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Natasha M Glover
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
14
|
Vancaester E, Blaxter ML. MarkerScan: Separation and assembly of cobionts sequenced alongside target species in biodiversity genomics projects. Wellcome Open Res 2024; 9:33. [PMID: 38617467 PMCID: PMC11016177 DOI: 10.12688/wellcomeopenres.20730.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/18/2023] [Indexed: 04/16/2024] Open
Abstract
Contamination of public databases by mislabelled sequences has been highlighted for many years and the avalanche of novel sequencing data now being deposited has the potential to make databases difficult to use effectively. It is therefore crucial that sequencing projects and database curators perform pre-submission checks to remove obvious contamination and avoid propagating erroneous taxonomic relationships. However, it is important also to recognise that biological contamination of a target sample with unexpected species' DNA can also lead to the discovery of fascinating biological phenomena through the identification of environmental organisms or endosymbionts. Here, we present a novel, integrated method for detection and generation of high-quality genomes of all non-target genomes co-sequenced in eukaryotic genome sequencing projects. After performing taxonomic profiling of an assembly from the raw data, and leveraging the identity of small rRNA sequences discovered therein as markers, a targeted classification approach retrieves and assembles high-quality genomes. The genomes of these cobionts are then not only removed from the target species' genome but also available for further interrogation. Source code is available from https://github.com/CobiontID/MarkerScan. MarkerScan is written in Python and is deployed as a Docker container.
Collapse
Affiliation(s)
| | - Mark L. Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| |
Collapse
|
15
|
Nachtweide S, Romoth L, Stanke M. Comparative Genome Annotation. Methods Mol Biol 2024; 2802:165-187. [PMID: 38819560 DOI: 10.1007/978-1-0716-3838-5_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Newly sequenced genomes are being added to the tree of life at an unprecedented fast pace. A large proportion of such new genomes are phylogenetically close to previously sequenced and annotated genomes. In other cases, whole clades of closely related species or strains ought to be annotated simultaneously. Often, in subsequent studies, differences between the closely related species or strains are in the focus of research when the shared gene structures prevail. We here review methods for comparative structural genome annotation. The reviewed methods include classical approaches such as the alignment of protein sequences or protein profiles against the genome and comparative gene prediction methods that exploit a genome alignment to annotate either a single target genome or all input genomes simultaneously. We discuss how the methods depend on the phylogenetic placement of genomes, give advice on the choice of methods, and examine the consistency between gene structure annotations in an example. Furthermore, we provide practical advice on genome annotation in general.
Collapse
Affiliation(s)
| | | | - Mario Stanke
- Institute for Mathematics and Computer Science, Greifswald, Germany.
| |
Collapse
|
16
|
Ghildiyal K, Nayak SS, Rajawat D, Sharma A, Chhotaray S, Bhushan B, Dutt T, Panigrahi M. Genomic insights into the conservation of wild and domestic animal diversity: A review. Gene 2023; 886:147719. [PMID: 37597708 DOI: 10.1016/j.gene.2023.147719] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 06/20/2023] [Accepted: 08/16/2023] [Indexed: 08/21/2023]
Abstract
Due to environmental change and anthropogenic activities, global biodiversity has suffered an unprecedented loss, and the world is now heading toward the sixth mass extinction event. This urges the need to step up our efforts to promote the sustainable use of animal genetic resources and plan effective strategies for their conservation. Although habitat preservation and restoration are the primary means of conserving biodiversity, genomic technologies offer a variety of novel tools for identifying biodiversity hotspots and thus, support conservation efforts. Conservation genomics is a broad area of science that encompasses the application of genomic data from thousands or tens of thousands of genome-wide markers to address important conservation biology concerns. Genomic approaches have revolutionized the way we understand and manage animal populations, providing tools to identify and preserve unique genetic variants and alleles responsible for adaptive genetic variation, reducing the deleterious consequences of inbreeding, and increasing the adaptive potential of threatened species. The advancement of genomic technologies, particularly comparative genomic approaches, and the increased accessibility of genomic resources in the form of genome-enabled taxa for non-model organisms, provides a distinct advantage in defining conservation units over traditional genetics approaches. The objective of this review is to provide an exhaustive overview of the concept of conservation genomics, discuss the rationale behind the transition from conservation genetics to genomic approaches, and emphasize the potential applications of genomic techniques for conservation purposes. We also highlight interesting case studies in both livestock and wildlife species where genomic techniques have been used to accomplish conservation goals. Finally, we address some challenges and future perspectives in this field.
Collapse
Affiliation(s)
- Kanika Ghildiyal
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Sonali Sonejita Nayak
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Divya Rajawat
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Anurodh Sharma
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Supriya Chhotaray
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Bharat Bhushan
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Triveni Dutt
- Livestock Production and Management Section, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Manjit Panigrahi
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India.
| |
Collapse
|
17
|
Taft JM, Tolley KA, Alexander GJ, Geneva AJ. De Novo Whole Genome Assemblies for Two Southern African Dwarf Chameleons (Bradypodion, Chamaeleonidae). Genome Biol Evol 2023; 15:evad182. [PMID: 37847614 PMCID: PMC10603767 DOI: 10.1093/gbe/evad182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Revised: 09/15/2023] [Accepted: 09/28/2023] [Indexed: 10/19/2023] Open
Abstract
A complete and high-quality reference genome has become a fundamental tool for the study of functional, comparative, and evolutionary genomics. However, efforts to produce high-quality genomes for African taxa are lagging given the limited access to sufficient resources and technologies. The southern African dwarf chameleons (Bradypodion) are a relatively young lineage, with a large body of evidence demonstrating the highly adaptive capacity of these lizards. Bradypodion are known for their habitat specialization, with evidence of convergent phenotypes across the phylogeny. However, the underlying genetic architecture of these phenotypes remains unknown for Bradypodion, and without adequate genomic resources, many evolutionary questions cannot be answered. We present de novo assembled whole genomes for Bradypodion pumilum and Bradypodion ventrale, using Pacific Biosciences long-read sequencing data. BUSCO analysis revealed that 96.36% of single copy orthologs were present in the B. pumilum genome and 94% in B. ventrale. Moreover, these genomes boast scaffold N50 of 389.6 and 374.9 Mb, respectively. Based on a whole genome alignment of both Bradypodion genomes, B. pumilum is highly syntenic with B. ventrale. Furthermore, Bradypodion is also syntenic with Anolis lizards, despite the divergence between these lineages estimated to be nearly 170 Ma. Coalescent analysis of the genomic data also suggests that historical changes in effective population size for these species correspond to notable shifts in the southern African environment. These high-quality Bradypodion genome assemblies will support future research on the evolutionary history, diversification, and genetic underpinnings of adaptation in Bradypodion.
Collapse
Affiliation(s)
- Jody M Taft
- School of Animal, Plant and Environmental Sciences, University of the Witwatersrand, Johannesburg, South Africa
- South African National Biodiversity Institute, Kirstenbosch Research Centre, Claremont, South Africa
| | - Krystal A Tolley
- South African National Biodiversity Institute, Kirstenbosch Research Centre, Claremont, South Africa
- Centre for Ecological Genomics and Wildlife Conservation, University of Johannesburg, Johannesburg, South Africa
| | - Graham J Alexander
- School of Animal, Plant and Environmental Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Anthony J Geneva
- Department of Biology, Center for Computational and Integrative Biology, Rutgers University–Camden, Camden, New Jersey, USA
| |
Collapse
|
18
|
Bovio E, Rancurel C, Seassau A, Magliano M, Gislard M, Loisier A, Kuchly C, Ponchet M, Danchin EGJ, Van Ghelder C. Genome sequence and annotation of Periconia digitata a hopeful biocontrol agent of phytopathogenic oomycetes. Sci Data 2023; 10:583. [PMID: 37673954 PMCID: PMC10483032 DOI: 10.1038/s41597-023-02440-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/02/2023] [Indexed: 09/08/2023] Open
Abstract
The Periconia fungal genus belongs to the phylum Ascomycota, order Pleosporales, family Periconiaceae. Periconia are found in many habitats, but little is known about their ecology. Several species from this genus produce bioactive molecules. Periconia digitata extracts were shown to be deadly active against the pine wilt nematode. Furthermore, P. digitata was shown to inhibit the plant pathogenic oomycete Phytophthora parasitica. Because P. digitata has great potential as a biocontrol agent and high quality genomic resources are still lacking in the Periconiaceae family, we generated long-read genomic data for P. digitata. Using PacBio Hifi sequencing technology, we obtained a highly-contiguous genome assembled in 13 chromosomes and totaling ca. 39 Mb. In addition, we produced a reference transcriptome, based on 12 different culture conditions, and proteomic data to support the genome annotation. Besides representing a new reference genome within the Periconiaceae, this work will contribute to our better understanding of the Eukaryotic tree of life and opens new possibilities in terms of biotechnological applications.
Collapse
Affiliation(s)
- Elena Bovio
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France.
| | - Corinne Rancurel
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France.
| | - Aurélie Seassau
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Marc Magliano
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Marie Gislard
- GeT-PlaGe (genomic platform), Campus INRAE, 24 chemin de borde rouge, Auzeville CS 52627, 31326, CASTANET-TOLOSAN Cedex, France
| | - Anaïs Loisier
- GeT-PlaGe (genomic platform), Campus INRAE, 24 chemin de borde rouge, Auzeville CS 52627, 31326, CASTANET-TOLOSAN Cedex, France
| | - Claire Kuchly
- GeT-PlaGe (genomic platform), Campus INRAE, 24 chemin de borde rouge, Auzeville CS 52627, 31326, CASTANET-TOLOSAN Cedex, France
| | - Michel Ponchet
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Etienne G J Danchin
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Cyril Van Ghelder
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| |
Collapse
|
19
|
Ruperao P, Rangan P, Shah T, Thakur V, Kalia S, Mayes S, Rathore A. The Progression in Developing Genomic Resources for Crop Improvement. Life (Basel) 2023; 13:1668. [PMID: 37629524 PMCID: PMC10455509 DOI: 10.3390/life13081668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 07/21/2023] [Accepted: 07/25/2023] [Indexed: 08/27/2023] Open
Abstract
Sequencing technologies have rapidly evolved over the past two decades, and new technologies are being continually developed and commercialized. The emerging sequencing technologies target generating more data with fewer inputs and at lower costs. This has also translated to an increase in the number and type of corresponding applications in genomics besides enhanced computational capacities (both hardware and software). Alongside the evolving DNA sequencing landscape, bioinformatics research teams have also evolved to accommodate the increasingly demanding techniques used to combine and interpret data, leading to many researchers moving from the lab to the computer. The rich history of DNA sequencing has paved the way for new insights and the development of new analysis methods. Understanding and learning from past technologies can help with the progress of future applications. This review focuses on the evolution of sequencing technologies, their significant enabling role in generating plant genome assemblies and downstream applications, and the parallel development of bioinformatics tools and skills, filling the gap in data analysis techniques.
Collapse
Affiliation(s)
- Pradeep Ruperao
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad 502324, India
| | - Parimalan Rangan
- ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi 110012, India;
| | - Trushar Shah
- International Institute of Tropical Agriculture (IITA), Nairobi 30709-00100, Kenya;
| | - Vivek Thakur
- Department of Systems & Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad 500046, India;
| | - Sanjay Kalia
- Department of Biotechnology, Ministry of Science and Technology, Government of India, New Delhi 110003, India;
| | - Sean Mayes
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad 502324, India
| | - Abhishek Rathore
- Excellence in Breeding, International Maize and Wheat Improvement Center (CIMMYT), Hyderabad 502324, India
| |
Collapse
|
20
|
Uliano-Silva M, Ferreira JGRN, Krasheninnikova K, Formenti G, Abueg L, Torrance J, Myers EW, Durbin R, Blaxter M, McCarthy SA. MitoHiFi: a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads. BMC Bioinformatics 2023; 24:288. [PMID: 37464285 PMCID: PMC10354987 DOI: 10.1186/s12859-023-05385-y] [Citation(s) in RCA: 266] [Impact Index Per Article: 266.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 06/13/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND PacBio high fidelity (HiFi) sequencing reads are both long (15-20 kb) and highly accurate (> Q20). Because of these properties, they have revolutionised genome assembly leading to more accurate and contiguous genomes. In eukaryotes the mitochondrial genome is sequenced alongside the nuclear genome often at very high coverage. A dedicated tool for mitochondrial genome assembly using HiFi reads is still missing. RESULTS MitoHiFi was developed within the Darwin Tree of Life Project to assemble mitochondrial genomes from the HiFi reads generated for target species. The input for MitoHiFi is either the raw reads or the assembled contigs, and the tool outputs a mitochondrial genome sequence fasta file along with annotation of protein and RNA genes. Variants arising from heteroplasmy are assembled independently, and nuclear insertions of mitochondrial sequences are identified and not used in organellar genome assembly. MitoHiFi has been used to assemble 374 mitochondrial genomes (368 Metazoa and 6 Fungi species) for the Darwin Tree of Life Project, the Vertebrate Genomes Project and the Aquatic Symbiosis Genome Project. Inspection of 60 mitochondrial genomes assembled with MitoHiFi for species that already have reference sequences in public databases showed the widespread presence of previously unreported repeats. CONCLUSIONS MitoHiFi is able to assemble mitochondrial genomes from a wide phylogenetic range of taxa from Pacbio HiFi data. MitoHiFi is written in python and is freely available on GitHub ( https://github.com/marcelauliano/MitoHiFi ). MitoHiFi is available with its dependencies as a Docker container on GitHub (ghcr.io/marcelauliano/mitohifi:master).
Collapse
Affiliation(s)
| | - João Gabriel R. N. Ferreira
- Bio Bureau Biotecnologia, Rio de Janeiro, Brazil
- Instituto de Biofísica Carlos Chagas Filho, UniversidadeFederal Do Rio de Janeiro, Rio de Janeiro, Brazil
| | | | | | | | - James Torrance
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA UK
| | - Eugene W. Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Okinawa Institute of Science and Technology, Okinawa, Japan
| | - Richard Durbin
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA UK
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH UK
| | - Mark Blaxter
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA UK
| | - Shane A. McCarthy
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA UK
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH UK
| |
Collapse
|
21
|
Leavitt SD, DeBolt A, McQuhae E, Allen JL. Genomic Resources for the First Federally Endangered Lichen: The Florida Perforate Cladonia ( Cladonia perforata). J Fungi (Basel) 2023; 9:698. [PMID: 37504687 PMCID: PMC10381751 DOI: 10.3390/jof9070698] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 06/18/2023] [Accepted: 06/19/2023] [Indexed: 07/29/2023] Open
Abstract
Thirty years after its designation as a federally endangered species, the Florida Perforate Cladonia (FPC) remains imperiled in isolated populations in the Florida scrub in the southeastern USA. For threatened and endangered species, such as FPC, reference genomes provide critical insight into genomic diversity, local adaptations, landscape-level genetics, and phylogenomics. Using high-throughput sequencing, we assemble the first draft nuclear and mitochondrial genomes for the FPC mycobiont-Cladonia perforata. We also assess genetic diversity within and among populations in southeastern Florida using genome-scale data and investigate diversity across the entire nuclear ribosomal cistron, including the standard DNA barcoding marker for fungi. The draft nuclear genome spanned 33.6 Mb, and the complete, circular mitochondrial genome was 59 Kb. We also generated the first chloroplast genome, to our knowledge, for the photobiont genus associated with FPC, an undescribed Asterochloris species. We inferred the presence of multiple, distinct mycobiont parental genotypes (genets) occurring at local scales in southeastern Florida, and strikingly, no genets were shared among even the closest sample sites. All sampled thalli shared identical mitochondrial genomes, while the nuclear ribosomal cistron showed limited variability-highlighting the genetic resolution provided by nuclear genome-scale datasets. The genomic resources generated here provide critical resources for informed conservation efforts for the FPC.
Collapse
Affiliation(s)
- Steven D Leavitt
- M.L. Bean Life Science Museum and Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Ann DeBolt
- Department of Biology, Boise State University, Boise, ID 83725, USA
| | - Ethan McQuhae
- Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602, USA
| | - Jessica L Allen
- Department of Biology, Eastern Washington University, Cheney, WA 99004, USA
| |
Collapse
|
22
|
Favreau E, Cini A, Taylor D, Câmara Ferreira F, Bentley MA, Cappa F, Cervo R, Privman E, Schneider J, Thiéry D, Mashoodh R, Wyatt CDR, Brown RL, Bodrug-Schepers A, Stralis-Pavese N, Dohm JC, Mead D, Himmelbauer H, Guigo R, Sumner S. Putting hornets on the genomic map. Sci Rep 2023; 13:6232. [PMID: 37085574 PMCID: PMC10121689 DOI: 10.1038/s41598-023-31932-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Accepted: 03/20/2023] [Indexed: 04/23/2023] Open
Abstract
Hornets are the largest of the social wasps, and are important regulators of insect populations in their native ranges. Hornets are also very successful as invasive species, with often devastating economic, ecological and societal effects. Understanding why these wasps are such successful invaders is critical to managing future introductions and minimising impact on native biodiversity. Critical to the management toolkit is a comprehensive genomic resource for these insects. Here we provide the annotated genomes for two hornets, Vespa crabro and Vespa velutina. We compare their genomes with those of other social Hymenoptera, including the northern giant hornet Vespa mandarinia. The three hornet genomes show evidence of selection pressure on genes associated with reproduction, which might facilitate the transition into invasive ranges. Vespa crabro has experienced positive selection on the highest number of genes, including those putatively associated with molecular binding and olfactory systems. Caste-specific brain transcriptomic analysis also revealed 133 differentially expressed genes, some of which are associated with olfactory functions. This report provides a spring-board for advancing our understanding of the evolution and ecology of hornets, and opens up opportunities for using molecular methods in the future management of both native and invasive populations of these over-looked insects.
Collapse
Affiliation(s)
- Emeline Favreau
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK.
| | - Alessandro Cini
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK
- Department of Biology, Università di Pisa, Via Volta 6, 56126, Pisa, Italy
| | - Daisy Taylor
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK
| | | | - Michael A Bentley
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK
| | - Federico Cappa
- Department of Biology, University of Florence, Via Madonna del Piano 6, 50019, Sesto Fiorentino, Florence, Italy
| | - Rita Cervo
- Department of Biology, University of Florence, Via Madonna del Piano 6, 50019, Sesto Fiorentino, Florence, Italy
| | - Eyal Privman
- Department of Evolutionary and Environmental Biology, Institute of Evolution, University of Haifa, Abba Hushi 199, 3498838, Haifa, Israel
| | - Jadesada Schneider
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK
| | - Denis Thiéry
- INRAe, UMR 1065 Santé et Agroécologie du Vignoble, Bordeaux Sciences Agro, ISVV, Université de Bordeaux, 33883, Villenave d'Ornon, France
| | - Rahia Mashoodh
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK
| | - Christopher D R Wyatt
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK
| | - Robert L Brown
- Manaaki Whenua - Landcare Research, 54 Gerald Street, Lincoln, 7608, New Zealand
| | - Alexandrina Bodrug-Schepers
- Department of Biotechnology, Institute of Computational Biology, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190, Vienna, Austria
| | - Nancy Stralis-Pavese
- Department of Biotechnology, Institute of Computational Biology, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190, Vienna, Austria
| | - Juliane C Dohm
- Department of Biotechnology, Institute of Computational Biology, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190, Vienna, Austria
| | - Daniel Mead
- Tree of Life Programme, Wellcome Sanger Institute, Hinxton, CB10 1SA, UK
| | - Heinz Himmelbauer
- Department of Biotechnology, Institute of Computational Biology, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190, Vienna, Austria
| | - Roderic Guigo
- Centre for Genomic Regulation, Dr. Aiguader 88, 08003, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Seirian Sumner
- Centre for Biodiversity and Environmental Research, Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK.
| |
Collapse
|
23
|
Chua PYS, Bourlat SJ, Ferguson C, Korlevic P, Zhao L, Ekrem T, Meier R, Lawniczak MKN. Future of DNA-based insect monitoring. Trends Genet 2023:S0168-9525(23)00038-0. [PMID: 36907721 DOI: 10.1016/j.tig.2023.02.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 02/13/2023] [Accepted: 02/15/2023] [Indexed: 03/12/2023]
Abstract
Insects are crucial for ecosystem health but climate change and pesticide use are driving massive insect decline. To mitigate this loss, we need new and effective monitoring techniques. Over the past decade there has been a shift to DNA-based techniques. We describe key emerging techniques for sample collection. We suggest that the selection of tools should be broadened, and that DNA-based insect monitoring data need to be integrated more rapidly into policymaking. We argue that there are four key areas for advancement, including the generation of more complete DNA barcode databases to interpret molecular data, standardisation of molecular methods, scaling up of monitoring efforts, and integrating molecular tools with other technologies that allow continuous, passive monitoring based on images and/or laser imaging, detection, and ranging (LIDAR).
Collapse
Affiliation(s)
- Physilia Y S Chua
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
| | - Sarah J Bourlat
- Leibniz Institute for the Analysis of Biodiversity Change, Museum Koenig, Adenauerallee 127, 53113 Bonn, Germany
| | - Cameron Ferguson
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Petra Korlevic
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Leia Zhao
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Torbjørn Ekrem
- Department of Natural History, NTNU University Museum, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
| | - Rudolf Meier
- Museum für Naturkunde, Center for Integrative Biodiversity Discovery, Leibniz-Institut für Evolutions- und Biodiversitätsforschung, Berlin, Germany
| | - Mara K N Lawniczak
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
24
|
Wang P, Wang F. A proposed metric set for evaluation of genome assembly quality. Trends Genet 2023; 39:175-186. [PMID: 36402623 DOI: 10.1016/j.tig.2022.10.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 10/24/2022] [Accepted: 10/26/2022] [Indexed: 11/18/2022]
Abstract
Quality control is essential for genome assemblies; however, a consensus has yet to be reached on what metrics should be adopted for the evaluation of assembly quality. N50 is widely used for contiguity measurement, but its effectiveness is constantly in question. Prevailing metrics for the completeness evaluation focus on gene space, yet challenging areas such as tandem repeats are commonly overlooked. Achieving correctness has become an indispensable dimension for quality control, while prevailing assembly releases lack scores reflecting this aspect. We propose a metric set with a set of statistic indexes for effective, comprehensive evaluation of assemblies and provide a score of a finished assembly for each metric, which can be utilized as a benchmark for achieving high-quality genome assemblies.
Collapse
Affiliation(s)
- Peng Wang
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement in Southern China, Ministry of Agriculture and Rural Affairs, Institute of Tropical Crop Genetic Resources, Chinese Academy of Tropical Agricultural Sciences, No. 4 Xueyuan Rd, Haikou City, Hainan 571101, China.
| | - Fei Wang
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, No. 100 Haiquan Rd, Shanghai 201416, China.
| |
Collapse
|
25
|
Dietz L, Eberle J, Mayer C, Kukowka S, Bohacz C, Baur H, Espeland M, Huber BA, Hutter C, Mengual X, Peters RS, Vences M, Wesener T, Willmott K, Misof B, Niehuis O, Ahrens D. Standardized nuclear markers improve and homogenize species delimitation in Metazoa. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.14041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Lars Dietz
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Jonas Eberle
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
- Paris‐Lodron‐University Salzburg Austria
| | - Christoph Mayer
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Sandra Kukowka
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Claudia Bohacz
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Hannes Baur
- Naturhistorisches Museum Bern/Institute of Ecology and Evolution University of Bern Bern Switzerland
| | - Marianne Espeland
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Bernhard A. Huber
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Carl Hutter
- Museum of Natural Sciences and Department of Biological Sciences Louisiana State University Baton Rouge Louisiana USA
| | - Ximo Mengual
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Ralph S. Peters
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Miguel Vences
- Technische Universität Braunschweig Braunschweig Germany
| | - Thomas Wesener
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| | - Keith Willmott
- Florida Museum of Natural History University of Florida Gainesville Florida USA
| | - Bernhard Misof
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
- Rheinische Friedrich‐Wilhelms‐Universität Bonn Bonn Germany
| | - Oliver Niehuis
- Abt. Evolutionsbiologie und Ökologie, Institut für Biologie I, Albert‐Ludwigs‐Universität Freiburg Freiburg im Breisgau Germany
| | - Dirk Ahrens
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB) Bonn Germany
| |
Collapse
|
26
|
Dockendorff TC, Estrem B, Reed J, Simmons JR, Zadegan SB, Zagoskin MV, Terta V, Villalobos E, Seaberry EM, Wang J. The nematode Oscheius tipulae as a genetic model for programmed DNA elimination. Curr Biol 2022; 32:5083-5098.e6. [PMID: 36379215 PMCID: PMC9729473 DOI: 10.1016/j.cub.2022.10.043] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 10/05/2022] [Accepted: 10/19/2022] [Indexed: 11/16/2022]
Abstract
Programmed DNA elimination (PDE) is a notable exception to the paradigm of genome integrity. In metazoa, PDE often occurs coincident with germline to somatic cell differentiation. During PDE, portions of genomic DNA are lost, resulting in reduced somatic genomes. Prior studies have described the sequences lost, as well as chromosome behavior, during metazoan PDE. However, a system for studying the mechanisms and consequences of PDE in metazoa is lacking. Here, we present a functional and genetic model for PDE in the free-living Rhabditidae nematode Oscheius tipulae, a family that also includes Caenorhabditis elegans. O. tipulae was recently suggested to eliminate DNA. Using staged embryos and DNA FISH, we showed that O. tipulae PDE occurs during embryogenesis at the 8-16 cell stages. We identified a conserved motif, named Sequence For Elimination (SFE), for all 12 break sites on the six chromosomes at the junctions of retained and eliminated DNA. SFE mutants exhibited a "fail-to-eliminate" phenotype only at the modified sites. END-seq revealed that breaks can occur at multiple positions within the SFE, with extensive end resection followed by telomere addition to both retained and eliminated ends. We identified many functional SFEs at the chromosome ends through END-seq in the wild-type embryos, genome sequencing of SFE mutants, and comparative genomics of 23 wild isolates. We suggest that these alternative SFEs provide flexibility in the sequences eliminated and a fail-safe mechanism for PDE. These studies establish O. tipulae as a new, attractive model for studying the mechanisms and consequences of PDE in a metazoan.
Collapse
Affiliation(s)
- Thomas C Dockendorff
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Brandon Estrem
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Jordan Reed
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - James R Simmons
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Sobhan Bahrami Zadegan
- UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37996, USA
| | - Maxim V Zagoskin
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Vincent Terta
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Eduardo Villalobos
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Erin M Seaberry
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Jianbin Wang
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA; UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37996, USA.
| |
Collapse
|
27
|
Xu Y, Shao F, Chen W, Ni L, Peng Z. A chromosome-level genome of the helmet catfish (Cranoglanis bouderius). Front Genet 2022; 13:962406. [PMID: 36035162 PMCID: PMC9400026 DOI: 10.3389/fgene.2022.962406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 07/15/2022] [Indexed: 11/29/2022] Open
Affiliation(s)
- Yuan Xu
- Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Southwest University School of Life Sciences, Chongqing, China
| | - Feng Shao
- Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Southwest University School of Life Sciences, Chongqing, China
| | - Weitao Chen
- Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, China
| | - Luyun Ni
- Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Southwest University School of Life Sciences, Chongqing, China
| | - Zuogang Peng
- Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Southwest University School of Life Sciences, Chongqing, China
- Academy of Plateau Science and Sustainability, Qinghai Normal University, Xining, China
- *Correspondence: Zuogang Peng,
| |
Collapse
|
28
|
Kwak Y, Argandona JA, Degnan PH, Hansen AK. Chromosomal-level assembly of Bactericera cockerelli reveals rampant gene family expansions impacting genome structure, function and insect-microbe-plant-interactions. Mol Ecol Resour 2022; 23:233-252. [PMID: 35925827 PMCID: PMC10087415 DOI: 10.1111/1755-0998.13693] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 06/24/2022] [Accepted: 08/01/2022] [Indexed: 11/27/2022]
Abstract
Lineage specific expansions and gene duplications are some of the most important sources of evolutionary novelty in eukaryotes. Although not as prevalent in eukaryotes compared to bacteria, horizontal gene transfer events can also result in key adaptations for insects, especially for those involved in insect-microbe interactions. In this study we assemble the first chromosomal assembly of the psyllid Bactericera cockerelli and reveal that the B. cockerelli genome has experienced significantly more gene expansion events compared to other Hemipteran representatives with fully sequenced genomes. We also reveal that B. cockerelli's genome is the largest psyllid genome (567 Mb) sequenced to date and is ~15% larger than the other two psyllid species genomes sequenced (Pachypsylla venusta and Diaphorina citri). Structurally, B. cockerelli appears to have an additional chromosome compared to the distantly related psyllid species P. venusta due to a previous chromosomal fission or fusion event. The increase in genome size and dynamic nature of the B. cockerelli genome may largely be contributed to the widespread expansion of type I and type II repeat elements that are rampant across all of B. cockerelli's. chromosomes. These repeat elements are distributed near equally in both euchromatic and heterochromatic regions. Furthermore, significant gene family expansions and gene duplications were uncovered for genes that are expected to be important in its adaptation to insect-plant and microbe interactions, which include transcription factors, proteases, odorant receptors, and horizontally transferred genes that are involved in the nutritional symbioses with their long-term nutritional endosymbiont Carsonella.
Collapse
Affiliation(s)
- Younghwan Kwak
- Department of Entomology, University of California, CA, USA
| | | | - Patrick H Degnan
- Department of Microbiology and Plant Pathology, University of California, CA, USA
| | | |
Collapse
|
29
|
Wilder AP, Dudchenko O, Curry C, Korody M, Turbek SP, Daly M, Misuraca A, Gaojianyong WANG, Khan R, Weisz D, Fronczek J, Aiden EL, Houck ML, Shier DM, Ryder OA, Steiner CC. A chromosome-length reference genome for the endangered Pacific pocket mouse reveals recent inbreeding in a historically large population. Genome Biol Evol 2022; 14:6650481. [PMID: 35894178 PMCID: PMC9348616 DOI: 10.1093/gbe/evac122] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/21/2022] [Indexed: 11/16/2022] Open
Abstract
High-quality reference genomes are fundamental tools for understanding population history, and can provide estimates of genetic and demographic parameters relevant to the conservation of biodiversity. The federally endangered Pacific pocket mouse (PPM), which persists in three small, isolated populations in southern California, is a promising model for studying how demographic history shapes genetic diversity, and how diversity in turn may influence extinction risk. To facilitate these studies in PPM, we combined PacBio HiFi long reads with Omni-C and Hi-C data to generate a de novo genome assembly, and annotated the genome using RNAseq. The assembly comprised 28 chromosome-length scaffolds (N50 = 72.6 MB) and the complete mitochondrial genome, and included a long heterochromatic region on chromosome 18 not represented in the previously available short-read assembly. Heterozygosity was highly variable across the genome of the reference individual, with 18% of windows falling in runs of homozygosity (ROH) >1 MB, and nearly 9% in tracts spanning >5 MB. Yet outside of ROH, heterozygosity was relatively high (0.0027), and historical Ne estimates were large. These patterns of genetic variation suggest recent inbreeding in a formerly large population. Currently the most contiguous assembly for a heteromyid rodent, this reference genome provides insight into the past and recent demographic history of the population, and will be a critical tool for management and future studies of outbreeding depression, inbreeding depression, and genetic load.
Collapse
Affiliation(s)
- Aryn P Wilder
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| | - Olga Dudchenko
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, USA.,Center for Theoretical Biological Physics and Department of Computer Science, Rice University, USA
| | - Caitlin Curry
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| | - Marisa Korody
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| | - Sheela P Turbek
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA.,Ecology and Evolutionary Biology, University of Colorado, Boulder, USA
| | | | - Ann Misuraca
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| | - W A N G Gaojianyong
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Ruqayya Khan
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, USA
| | - David Weisz
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, USA
| | - Julie Fronczek
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| | - Erez Lieberman Aiden
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, USA.,Center for Theoretical Biological Physics and Department of Computer Science, Rice University, USA.,UWA School of Agriculture and Environment, The University of Western Australia, Australia.,Broad Institute of MIT and Harvard, USA.,Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech, China
| | - Marlys L Houck
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| | - Debra M Shier
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA.,Department of Ecology & Evolutionary Biology, University of California Los Angeles, USA
| | - Oliver A Ryder
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| | - Cynthia C Steiner
- Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, USA
| |
Collapse
|
30
|
Renoz F, Ambroise J, Bearzatto B, Fakhour S, Parisot N, Ribeiro Lopes M, Gala JL, Calevro F, Hance T. The Di-Symbiotic Systems in the Aphids Sipha maydis and Periphyllus lyropictus Provide a Contrasting Picture of Recent Co-Obligate Nutritional Endosymbiosis in Aphids. Microorganisms 2022; 10:microorganisms10071360. [PMID: 35889078 PMCID: PMC9317480 DOI: 10.3390/microorganisms10071360] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 07/01/2022] [Accepted: 07/04/2022] [Indexed: 11/25/2022] Open
Abstract
Dependence on multiple nutritional bacterial symbionts forming a metabolic unit has repeatedly evolved in many insect species that feed on nutritionally unbalanced diets such as plant sap. This is the case for aphids of the subfamilies Lachninae and Chaitophorinae, which have evolved di-symbiotic systems in which the ancient obligate nutritional symbiont Buchnera aphidicola is metabolically complemented by an additional nutritional symbiont acquired more recently. Deciphering how different symbionts integrate both metabolically and anatomically in such systems is crucial to understanding how complex nutritional symbiotic systems function and evolve. In this study, we sequenced and analyzed the genomes of the symbionts B. aphidicola and Serratia symbiotica associated with the Chaitophorinae aphids Sipha maydis and Periphyllus lyropictus. Our results show that, in these two species, B. aphidicola and S. symbiotica complement each other metabolically (and their hosts) for the biosynthesis of essential amino acids and vitamins, but with distinct metabolic reactions supported by each symbiont depending on the host species. Furthermore, the S. symbiotica symbiont associated with S. maydis appears to be strictly compartmentalized into the specialized host cells housing symbionts in aphids, the bacteriocytes, whereas the S. symbiotica symbiont associated with P. lyropictus exhibits a highly invasive phenotype, presumably because it is capable of expressing a larger set of virulence factors, including a complete flagellum for bacterial motility. Such contrasting levels of metabolic and anatomical integration for two S. symbiotica symbionts that were recently acquired as nutritional co-obligate partners reflect distinct coevolutionary processes specific to each association.
Collapse
Affiliation(s)
- François Renoz
- Biodiversity Research Centre, Earth and Life Institute, Université Catholique de Louvain (UCLouvain), 1348 Louvain-la-Neuve, Belgium;
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR203, F-69621 Villeurbanne, France; (N.P.); (M.R.L.); (F.C.)
- Correspondence:
| | - Jérôme Ambroise
- Center for Applied Molecular Technologies, Institute of Experimental and Clinical Research, Université Catholique de Louvain (UCLouvain), 1200 Woluwe-Saint-Lambert, Belgium; (J.A.); (B.B.); (J.-L.G.)
| | - Bertrand Bearzatto
- Center for Applied Molecular Technologies, Institute of Experimental and Clinical Research, Université Catholique de Louvain (UCLouvain), 1200 Woluwe-Saint-Lambert, Belgium; (J.A.); (B.B.); (J.-L.G.)
| | - Samir Fakhour
- Department of Plant Protection, National Institute of Agricultural Research, Avenue Ennasr, BP 415 Rabat Principale, Rabat 10090, Morocco;
| | - Nicolas Parisot
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR203, F-69621 Villeurbanne, France; (N.P.); (M.R.L.); (F.C.)
| | - Mélanie Ribeiro Lopes
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR203, F-69621 Villeurbanne, France; (N.P.); (M.R.L.); (F.C.)
| | - Jean-Luc Gala
- Center for Applied Molecular Technologies, Institute of Experimental and Clinical Research, Université Catholique de Louvain (UCLouvain), 1200 Woluwe-Saint-Lambert, Belgium; (J.A.); (B.B.); (J.-L.G.)
| | - Federica Calevro
- Univ Lyon, INSA Lyon, INRAE, BF2I, UMR203, F-69621 Villeurbanne, France; (N.P.); (M.R.L.); (F.C.)
| | - Thierry Hance
- Biodiversity Research Centre, Earth and Life Institute, Université Catholique de Louvain (UCLouvain), 1348 Louvain-la-Neuve, Belgium;
| |
Collapse
|
31
|
Laczkó L, Jordán S, Sramkó G. The
RadOrgMiner
pipeline: Automated genotyping of organellar loci from
RADseq
data. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Levente Laczkó
- MTA‐DE “Lendület” Evolutionary Phylogenomics Research Group, Egyetem tér 1 H‐4032 Debrecen Hungary
- Department of Botany University of Debrecen Egyetem tér 1, Debrecen, H‐4032 Hungary
- ELKH‐ DE Conservation Biology Research Group, Egyetem tér 1, Debrecen, H‐4032 Hungary
- Department of Metagenomics University of Debrecen Nagyerdei körút 98., Debrecen, H‐4032 Hungary
| | - Sándor Jordán
- Department of Botany University of Debrecen Egyetem tér 1, Debrecen, H‐4032 Hungary
- Juhász‐Nagy Pál Doctoral School University of Debrecen Egyetem tér 1, Debrecen, H‐4032 Hungary
| | - Gábor Sramkó
- MTA‐DE “Lendület” Evolutionary Phylogenomics Research Group, Egyetem tér 1 H‐4032 Debrecen Hungary
- Department of Botany University of Debrecen Egyetem tér 1, Debrecen, H‐4032 Hungary
- ELKH‐ DE Conservation Biology Research Group, Egyetem tér 1, Debrecen, H‐4032 Hungary
| |
Collapse
|
32
|
Molik DC. An Outsider's Perspective on Why We Climb Mountains and Why Projects Like the i5k Matter. JOURNAL OF INSECT SCIENCE (ONLINE) 2022; 22:2. [PMID: 35780386 PMCID: PMC9250708 DOI: 10.1093/jisesa/ieac038] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Indexed: 06/15/2023]
Abstract
Initiatives like the i5k are creating evermore genome assemblies. These initiatives are resource heavy, and their justifications and economics deserve attention. Scientifically, these initiatives are important, paving the way for cross-species analysis, requiring the building of new computational analysis and tools, and creating other new resources. However, an open question remains of how we quantitively measure the impact of genomes, and by extension these initiatives. This forum article discusses one such method which is to look at the publications about a species over time, however, this method does not show any signal from a published genome, leaving an open question of how to measure impact.
Collapse
|
33
|
Zhang Y, Clancy J, Jensen J, McMullin RT, Wang L, Leavitt SD. Providing Scale to a Known Taxonomic Unknown—At Least a 70-Fold Increase in Species Diversity in a Cosmopolitan Nominal Taxon of Lichen-Forming Fungi. J Fungi (Basel) 2022; 8:jof8050490. [PMID: 35628746 PMCID: PMC9146994 DOI: 10.3390/jof8050490] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 05/03/2022] [Accepted: 05/04/2022] [Indexed: 02/06/2023] Open
Abstract
Robust species delimitations provide a foundation for investigating speciation, phylogeography, and conservation. Here we attempted to elucidate species boundaries in the cosmopolitan lichen-forming fungal taxon Lecanora polytropa. This nominal taxon is morphologically variable, with distinct populations occurring on all seven continents. To delimit candidate species, we compiled ITS sequence data from populations worldwide. For a subset of the samples, we also generated alignments for 1209 single-copy nuclear genes and an alignment spanning most of the mitochondrial genome to assess concordance among the ITS, nuclear, and mitochondrial inferences. Species partitions were empirically delimited from the ITS alignment using ASAP and bPTP. We also inferred a phylogeny for the L. polytropa clade using a four-marker dataset. ASAP species delimitations revealed up to 103 species in the L. polytropa clade, with 75 corresponding to the nominal taxon L. polytropa. Inferences from phylogenomic alignments generally supported that these represent evolutionarily independent lineages or species. Less than 10% of the candidate species were comprised of specimens from multiple continents. High levels of candidate species were recovered at local scales but generally with limited overlap across regions. Lecanora polytropa likely ranks as one of the largest species complexes of lichen-forming fungi known to date.
Collapse
Affiliation(s)
- Yanyun Zhang
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Heilongtan, Kunming 650201, China;
- College of Life Science, Anhui Normal University, Wuhu 241000, China
| | - Jeffrey Clancy
- Department of Biology, Brigham Young University, 4102 Life Science Building, Provo, UT 84602, USA; (J.C.); (J.J.)
| | - Jacob Jensen
- Department of Biology, Brigham Young University, 4102 Life Science Building, Provo, UT 84602, USA; (J.C.); (J.J.)
| | | | - Lisong Wang
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Heilongtan, Kunming 650201, China;
- Correspondence: (L.W.); (S.D.L.)
| | - Steven D. Leavitt
- Department of Biology, M. L. Bean Life Science Museum, Brigham Young University, 4102 Life Science Building, Provo, UT 84602, USA
- Correspondence: (L.W.); (S.D.L.)
| |
Collapse
|
34
|
Gupta PK. Earth Biogenome Project: present status and future plans. Trends Genet 2022; 38:811-820. [DOI: 10.1016/j.tig.2022.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 04/11/2022] [Accepted: 04/22/2022] [Indexed: 10/18/2022]
|
35
|
Abstract
A decade of progress in whole-genome sequencing techniques has imbued researchers with the confidence to sequence all eukaryotic life on earth. But what will be essential to their success, and what challenges await them?
Collapse
|
36
|
Kress WJ, Soltis DE, Kersey PJ, Wegrzyn JL, Leebens-Mack JH, Gostel MR, Liu X, Soltis PS. Green plant genomes: What we know in an era of rapidly expanding opportunities. Proc Natl Acad Sci U S A 2022; 119:e2115640118. [PMID: 35042803 PMCID: PMC8795535 DOI: 10.1073/pnas.2115640118] [Citation(s) in RCA: 52] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Green plants play a fundamental role in ecosystems, human health, and agriculture. As de novo genomes are being generated for all known eukaryotic species as advocated by the Earth BioGenome Project, increasing genomic information on green land plants is essential. However, setting standards for the generation and storage of the complex set of genomes that characterize the green lineage of life is a major challenge for plant scientists. Such standards will need to accommodate the immense variation in green plant genome size, transposable element content, and structural complexity while enabling research into the molecular and evolutionary processes that have resulted in this enormous genomic variation. Here we provide an overview and assessment of the current state of knowledge of green plant genomes. To date fewer than 300 complete chromosome-scale genome assemblies representing fewer than 900 species have been generated across the estimated 450,000 to 500,000 species in the green plant clade. These genomes range in size from 12 Mb to 27.6 Gb and are biased toward agricultural crops with large branches of the green tree of life untouched by genomic-scale sequencing. Locating suitable tissue samples of most species of plants, especially those taxa from extreme environments, remains one of the biggest hurdles to increasing our genomic inventory. Furthermore, the annotation of plant genomes is at present undergoing intensive improvement. It is our hope that this fresh overview will help in the development of genomic quality standards for a cohesive and meaningful synthesis of green plant genomes as we scale up for the future.
Collapse
Affiliation(s)
- W John Kress
- National Museum of Natural History, Smithsonian Institution, Department of Botany, Washington, DC 20013-7012;
- Department of Biological Sciences, Dartmouth College, Hanover, NH 03755
- Arnold Arboretum, Harvard University, Boston, MA 02130
| | - Douglas E Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611
- Biodiversity Institute, University of Florida, Gainesville, FL 32611
- Department of Biology, University of Florida, Gainesville, FL 32611
| | - Paul J Kersey
- Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, United Kingdom
| | - Jill L Wegrzyn
- Department of Ecology and Evolutionary Biology, Institute for Systems Genomics: Computational Biology Core, University of Connecticut, Storrs, CT 06269-3214
| | - James H Leebens-Mack
- Department of Plant Biology, 2101 Miller Plant Sciences, University of Georgia, Athens, GA 30602-7271
| | - Morgan R Gostel
- Botanical Research Institute of Texas, Fort Worth, TX 76107-3400
| | - Xin Liu
- China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China
| | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611
- Biodiversity Institute, University of Florida, Gainesville, FL 32611
| |
Collapse
|
37
|
Stephan T, Burgess SM, Cheng H, Danko CG, Gill CA, Jarvis ED, Koepfli KP, Koltes JE, Lyons E, Ronald P, Ryder OA, Schriml LM, Soltis P, VandeWoude S, Zhou H, Ostrander EA, Karlsson EK. Darwinian genomics and diversity in the tree of life. Proc Natl Acad Sci U S A 2022; 119:e2115644119. [PMID: 35042807 PMCID: PMC8795533 DOI: 10.1073/pnas.2115644119] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Genomics encompasses the entire tree of life, both extinct and extant, and the evolutionary processes that shape this diversity. To date, genomic research has focused on humans, a small number of agricultural species, and established laboratory models. Fewer than 18,000 of ∼2,000,000 eukaryotic species (<1%) have a representative genome sequence in GenBank, and only a fraction of these have ancillary information on genome structure, genetic variation, gene expression, epigenetic modifications, and population diversity. This imbalance reflects a perception that human studies are paramount in disease research. Yet understanding how genomes work, and how genetic variation shapes phenotypes, requires a broad view that embraces the vast diversity of life. We have the technology to collect massive and exquisitely detailed datasets about the world, but expertise is siloed into distinct fields. A new approach, integrating comparative genomics with cell and evolutionary biology, ecology, archaeology, anthropology, and conservation biology, is essential for understanding and protecting ourselves and our world. Here, we describe potential for scientific discovery when comparative genomics works in close collaboration with a broad range of fields as well as the technical, scientific, and social constraints that must be addressed.
Collapse
Affiliation(s)
- Taylorlyn Stephan
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20817
| | - Shawn M Burgess
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20817
| | - Hans Cheng
- Avian Disease and Oncology Laboratory, Agricultural Research Service, US Department of Agriculture, East Lansing, MI 48823
| | - Charles G Danko
- Department of Biomedical Sciences, Baker Institute for Animal Health, Cornell University, Ithaca, NY 14850
| | - Clare A Gill
- Department of Animal Science, Texas A&M University, College Station, TX 77843
| | - Erich D Jarvis
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY 10065
- HHMI, Chevy Chase, MD 20815
| | - Klaus-Peter Koepfli
- Smithsonian-Mason School of Conservation, George Mason University, Front Royal, VA 22630
- Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC 20008
| | - James E Koltes
- Department of Animal Science, Iowa State University, Ames, IA 50011
| | - Eric Lyons
- School of Plant Sciences, BIO5 Institute, University of Arizona, Tucson, AZ 85721
| | - Pamela Ronald
- Department of Plant Pathology, University of California, Davis, CA 95616
- The Genome Center, University of California, Davis, CA 95616
- The Innovative Genomics Institute, University of California, Berkeley, CA 94720
- Grass Genetics, Joint Bioenergy Institute, Emeryville, CA 94608
| | - Oliver A Ryder
- San Diego Zoo Wildlife Alliance, Escondido, CA 92027
- Department of Evolution, Behavior, and Ecology, University of California San Diego, La Jolla, CA 92093
| | - Lynn M Schriml
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201
| | - Pamela Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611
| | - Sue VandeWoude
- Department of Micro-, Immuno-, and Pathology, Colorado State University, Fort Collins, CO 80532
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, CA 95616
| | - Elaine A Ostrander
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20817
| | - Elinor K Karlsson
- Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01655;
- Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01655
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
| |
Collapse
|
38
|
Abstract
The goals of the Earth Biogenome Project-to sequence the genomes of all eukaryotic life on earth-are as daunting as they are ambitious. The Darwin Tree of Life Project was founded to demonstrate the credibility of these goals and to deliver at-scale genome sequences of unprecedented quality for a biogeographic region: the archipelago of islands that constitute Britain and Ireland. The Darwin Tree of Life Project is a collaboration between biodiversity organizations (museums, botanical gardens, and biodiversity institutes) and genomics institutes. Together, we have built a workflow that collects specimens from the field, robustly identifies them, performs sequencing, generates high-quality, curated assemblies, and releases these openly for the global community to use to build future science and conservation efforts.
Collapse
|